Aletring based on anomaly duration

richcollier · June 3, 2022, 12:55pm

Well, there's probably many ways this could be solved, but here's one approach - see example

Run once per day, look over the last 24 hours (the range in the example needs to be modified here because the example uses old data, not live data)
Filter your query by job_id and result_type:record
Do a terms aggregation on the partition field
Do a date_histogram sub-aggregation with an interval that matches the bucket_span of the job (the example shown had a 1m bucket span due to the data set being used in order to have consecutive anomalous buckets, so you would need to change to 15m)
Use the moving_fn aggregation to invoke a 3 bucket sliding window sum of the record_score
Use a bucket_selector aggregation to eliminate any individual values where the record_score is below some arbitrary value (I chose 40).
The condition script loops through and finds if any 3 bucket sliding window sum of the record_score is greater than some arbitrary value (I chose 120)
In the actions section, gather up all of the partitions that violated the threshold and print them with the latest timestamp at which they violated (obviously use your preferred action method)

An example output is:

          Anomalies:
          ==========
          AAL had 3 anomalies in a row at 2021-02-10T12:32:00.000Z
          AWE had 3 anomalies in a row at 2021-02-10T19:19:00.000Z
          AMX had 3 anomalies in a row at 2021-02-10T22:10:00.000Z

Topic		Replies	Views
Watcher Alerting on multi-bucket anomaly? Kibana elastic-stack-machine-learning , elastic-stack-alerting	2	518	December 28, 2020
Watcher not triggering alerts Elasticsearch elastic-stack-machine-learning	2	525	May 16, 2020
Send an email alert after 3 major/critical anomalies for a given time range Elasticsearch elastic-stack-machine-learning	3	473	August 12, 2019
Machine Learning module is triggering alerts when there is no anomaly Elasticsearch elastic-stack-machine-learning	27	2941	July 1, 2019
ML watcher configs Elasticsearch elastic-stack-machine-learning	4	639	November 23, 2020

Aletring based on anomaly duration

Related topics