Anomaly detection on ratio of two counts

nandrik · January 18, 2019, 6:17pm

Hi, I'm indexing a log file where the majority of the lines contain actions but a small percentage (1-2%) contains error lines.

I have managed to successfully plot the hourly error rate using either Timelion or the Visual Builder.

However, what I'd like to do is to perform anomaly detection on the hourly error rate.

What's the easiest way of doing this?

richcollier · January 22, 2019, 2:53pm

Create a bucket script aggregation: https://www.elastic.co/guide/en/elasticsearch/reference/current/search-aggregations-pipeline-bucket-script-aggregation.html

ML jobs can use aggregations as input. See: https://www.elastic.co/guide/en/elastic-stack-overview/current/ml-configuring-aggregation.html

nandrik · January 22, 2019, 3:35pm

@richcollier, thanks for the reply.
Is there a way to store the bucket script aggregation as a new index?
Or is this just for the purposes of the ML job?
In other words, wonder if this script can be stored generically and then used in plots, ML, watcher, etc.?

richcollier · January 22, 2019, 3:39pm

If you want to use that info for things other than ML, you could possibly build a summarization job in Watcher that will routinely query, aggregate and index the output.

Otherwise, if the information is only for the purposes of ML, then just do the aggregations in-line within the ML job

system · February 19, 2019, 3:39pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

richcollier · March 26, 2020, 1:26pm

Prior advice of using Watcher for a summarization job is now irrelevant given the addition of the Transform capability.

Topic		Replies	Views
ML datafeed bucket aggregation/script challenge Elasticsearch elastic-stack-machine-learning	7	662	April 4, 2022
Analyzing a ratio of documents over time with Anomaly Detection Elasticsearch elastic-stack-machine-learning	1	955	April 23, 2020
Bucket_script location Elasticsearch	3	2319	October 11, 2018
How to craft watches for delta change anomalies? Elasticsearch elastic-stack-alerting	12	2981	July 19, 2017
Realtime anomaly score analysis Elasticsearch elastic-stack-machine-learning	3	796	April 16, 2018

Anomaly detection on ratio of two counts

Related topics