Advanced Configuration Machine Learning

Aniket_Pant · September 29, 2022, 10:42am

I want to use high count function for a field , for this i select Advanced Confiugration wizard
Machine Learning > Anomaly Detection > Create Job > Advanced configuration

Here i am using high_count detector for test field , i want to find anomaly in this field and i have another field called verb.keyword which contain service name , so basically verb.keyword (service name) based anomaly detection on high_count of test value (decline count)

While selecting detector it shown me two options by_field and over_field

I randomly selected By field and i don't know the difference between by field and over field

in influencer i choose verb.keyword

How can i know what i have to choose in by_field or in over_field is there any option where i can choose high_count becuase i cant see in multi-metric.

Aniket_Pant · September 29, 2022, 1:53pm

And what is this found for mean here in above image

dkyle · September 29, 2022, 2:36pm

How can i know what i have to choose in by_field or in over_field

Choose an over_field when you want to perform a population analysis, choose a by_field when you want to split your data by a value in another field.

You can learn more about population analysis at Configuring population analysis | Machine Learning in the Elastic Stack [master] | Elastic

is there any option where i can choose high_count becuase i cant see in multi-metric.

The high_count function is available in the multi metric wizard, first pick the option High count(Event rate) in the field picker

then pick a split field. From the description of your data I suggest using verb.keyword as the split field.

The count functions work with event rates, i.e. the number of occurrences of the event in a time period. high count split by verb.keyword will find anomalies where there is an unusually high number of events for any of the values of verb.keyword.

What is in the test field? Is it a numerical value? If test is a number then consider the high_sum function on the test field split by verb.keyword

Aniket_Pant · September 29, 2022, 3:37pm

Hi David,
Test is a number field and we want to apply high count.
Is there any difference between high_count and High Count(Event rate)

Aniket_Pant · September 29, 2022, 6:41pm

Thank you @dkyle , can you please tell me what this graph depict

In this job i applied high_count by test over "verb.keyword" and also choose verb.keyword as an inluencer

What this above graph shoowing, i know upper bound and lower bound but what does these(upper and lower bound) means in above graph

And one thing i learned that in single metric viewer we get slider to view anomaly and in Anomaly Explorer we can see as a table . So am i right about Single Metric Viewer and Anomaly Explorer

dkyle · September 29, 2022, 7:43pm

Is there any difference between high_count and High Count(Event rate)

No they are both the same function but the text description is slightly different

I recommend reviewing the documentation for the different anomaly detector functions. Count functions are described as "Count functions detect anomalies when the number of events in a bucket is anomalous.". If test is a numerical field and you are looking for anomalies in the value of test then try the metric function.

In the chart above you see a plot of the number of documents over time that contain the verb.keyword field with the value REPL_BACKLOG_REPORT. The count goes from 1 to 3, per bucket interval there is nothing particularly interesting here. I am guessing that analysing the value of test with a metric function will be more interesting.

Aniket_Pant · September 30, 2022, 3:37am

How can you see the bucket when you said

The count goes from 1 to 3, per bucket interval there is nothing particularly interesting here.

dkyle · September 30, 2022, 7:52am

The bucket span or interval is 15minutes. Bucket span is a configuration option when creating the job

Aniket_Pant · September 30, 2022, 7:58am

@dkyle thanks

system · October 28, 2022, 7:58am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
ML Anomaly Job with exclude_frequent option Elasticsearch	2	164	December 28, 2023
Retrieving script field in elasticsearch for ML Elasticsearch elastic-stack-machine-learning	3	334	October 25, 2022
Max Function vs High Count Kibana elastic-stack-machine-learning	2	304	November 17, 2022
ML Kibana: problem with an advanced job using partitionfield Kibana elastic-stack-machine-learning	18	1139	September 3, 2019
[ML] Custom function in anomaly detection job Kibana elastic-stack-machine-learning	7	476	March 25, 2023

Advanced Configuration Machine Learning

Related topics