Machine Learning 1 Day Bucket Span and Alerts

edang · October 31, 2022, 1:45pm

Hi All,

I have unique data coming in once a day for a field and set up an advanced ML job with a bucket span of one day accordingly. The configured detector is a summation of a unique number field partitioned by another field.

Alerts are also set up so any anomalies above a threshold is emailed to me but I would also like to know when data comes in late (compared to its historic timing).

How could I configure my ML to satisfy my needs?

richcollier · November 1, 2022, 1:45pm

Sounds like a second ML job is required using time_of_day function: Appendix P: Time functions | Machine Learning in the Elastic Stack [8.4] | Elastic

edang · November 2, 2022, 1:54pm

Okay, thank you. That is very helpful.

A follow up question I have about this is, would 1 ML job with multiple detectors be more efficient?

richcollier · November 3, 2022, 5:52pm

In general, yes. Because in each bucket_span, the data only needs to be queried once, then applied to both detectors. However, the viewing/interpreting of the results is easier in our UI (I find) if a job only has one detector.

edang · November 9, 2022, 4:28pm

Hi richcollier, I have set up the ML according to the time_of_day function with a bucket span of 1 day. However, it is not exactly how I would prefer it to behave.

In your experience, is it possible to get real time alerts for late data with unique data coming in once a day?

richcollier · November 9, 2022, 7:46pm

Note in : Appendix P: Time functions | Machine Learning in the Elastic Stack [8.11] | Elastic

Shorter bucket spans (for example, 10 minutes) are recommended when performing a time_of_day or time_of_week analysis. The time of the events being modeled are not affected by the bucket span, but a shorter bucket span enables quicker alerting on unusual events.

edang · November 10, 2022, 2:23pm

The separate ML with the time of day detector worked well for late data congestion!

My current configuration for the first ML is a summation of a number field by a field that is consumed once per day. Therefore, I set a bucket span of 1 day. However, when setting up my alerts, I will only get 1 alert at the end of day (after the ML has run).

Are there any configurations for the ML to run real time for the alerts to be real time as well (with the constraint of how my data is coming in)?

system · December 8, 2022, 2:23pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Machine Learning Bucket Span Millisecond Precision Kibana elastic-stack-machine-learning	11	830	October 3, 2019
Aletring based on anomaly duration Kibana elastic-stack-machine-learning	6	394	July 1, 2022
How does Anomaly Detection work? Elasticsearch elastic-stack-machine-learning	2	590	March 17, 2023
Weakly repetitive pattern and real-time alert Elasticsearch elastic-stack-machine-learning	7	578	October 30, 2018
Aggregation interval: 1m, bucket span: 1m Kibana	4	302	November 3, 2020

Machine Learning 1 Day Bucket Span and Alerts

Related topics