Advanced Configuration Machine Learning

I want to use high count function for a field , for this i select Advanced Confiugration wizard
Machine Learning > Anomaly Detection > Create Job > Advanced configuration

Here i am using high_count detector for test field , i want to find anomaly in this field and i have another field called verb.keyword which contain service name , so basically verb.keyword (service name) based anomaly detection on high_count of test value (decline count)

While selecting detector it shown me two options by_field and over_field

I randomly selected By field and i don't know the difference between by field and over field

in influencer i choose verb.keyword

How can i know what i have to choose in by_field or in over_field is there any option where i can choose high_count becuase i cant see in multi-metric.


And what is this found for mean here in above image

How can i know what i have to choose in by_field or in over_field

Choose an over_field when you want to perform a population analysis, choose a by_field when you want to split your data by a value in another field.

You can learn more about population analysis at Configuring population analysis | Machine Learning in the Elastic Stack [master] | Elastic

is there any option where i can choose high_count becuase i cant see in multi-metric.

The high_count function is available in the multi metric wizard, first pick the option High count(Event rate) in the field picker

then pick a split field. From the description of your data I suggest using verb.keyword as the split field.

The count functions work with event rates, i.e. the number of occurrences of the event in a time period. high count split by verb.keyword will find anomalies where there is an unusually high number of events for any of the values of verb.keyword.

What is in the test field? Is it a numerical value? If test is a number then consider the high_sum function on the test field split by verb.keyword

Hi David,
Test is a number field and we want to apply high count.
Is there any difference between high_count and High Count(Event rate)

Thank you @dkyle , can you please tell me what this graph depict

In this job i applied high_count by test over "verb.keyword" and also choose verb.keyword as an inluencer

What this above graph shoowing, i know upper bound and lower bound but what does these(upper and lower bound) means in above graph

And one thing i learned that in single metric viewer we get slider to view anomaly and in Anomaly Explorer we can see as a table . So am i right about Single Metric Viewer and Anomaly Explorer

Is there any difference between high_count and High Count(Event rate)

No they are both the same function but the text description is slightly different

I recommend reviewing the documentation for the different anomaly detector functions. Count functions are described as "Count functions detect anomalies when the number of events in a bucket is anomalous.". If test is a numerical field and you are looking for anomalies in the value of test then try the metric function.

In the chart above you see a plot of the number of documents over time that contain the verb.keyword field with the value REPL_BACKLOG_REPORT. The count goes from 1 to 3, per bucket interval there is nothing particularly interesting here. I am guessing that analysing the value of test with a metric function will be more interesting.

1 Like

How can you see the bucket when you said

The count goes from 1 to 3, per bucket interval there is nothing particularly interesting here.

The bucket span or interval is 15minutes. Bucket span is a configuration option when creating the job

@dkyle thanks

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.