Are these values right for Query delay, Frequency and Bucket Span?

pk.241011 · July 1, 2020, 1:46pm

I am trying out the anomaly detection job.
The data is coming from Logstash at an interval of 5 mins. But sometimes there will be no data in those 5 mins. The data itself will be randomly distributed within the 5 min slot. Sometimes there will be 5 data points at the start of the time slot, sometimes in the middle, sometimes at the end.

Like all these are possible scenarios:

No data:
00000

Data at start:
D0000

Data in middle:
00D00

Data at end:
0000D

I have set both Query delay and Frequency to 5m. My idea is to not miss any data.

The suggested Bucket Span was 30m.
Should it not have been 5 mins?

richcollier · July 1, 2020, 2:47pm

Think of bucket_span as the analysis aggregation interval. This is different than the frequency and delay of getting the data from the source index.
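To make the distinction concrete, here is a minimal sketch of what "analysis aggregation interval" means: events are grouped into fixed-width buckets and the analysis looks at one value per bucket. It is an illustration only, not the Elastic ML implementation; the function name `bucket_counts`, the sample timestamps, and the assumption that buckets are aligned to multiples of the span since the Unix epoch are all hypothetical.

```python
from collections import Counter
from datetime import datetime, timedelta, timezone


def bucket_counts(timestamps, bucket_span):
    """Count events per bucket, flooring each timestamp to its bucket start.

    Assumption: buckets are aligned to multiples of bucket_span since the epoch.
    """
    span = int(bucket_span.total_seconds())
    counts = Counter()
    for ts in timestamps:
        epoch = int(ts.timestamp())
        bucket_start = epoch - epoch % span  # floor to the bucket boundary
        counts[datetime.fromtimestamp(bucket_start, tz=timezone.utc)] += 1
    return dict(sorted(counts.items()))


def utc(hour, minute, second):
    # Hypothetical sample day; only the time of day matters here.
    return datetime(2020, 7, 1, hour, minute, second, tzinfo=timezone.utc)


# Events landing at the start, middle, and end of different 5 min slots.
events = [utc(13, 0, 10), utc(13, 7, 30), utc(13, 14, 50), utc(13, 31, 0)]

five_min = bucket_counts(events, timedelta(minutes=5))
thirty_min = bucket_counts(events, timedelta(minutes=30))

for label, result in (("5m", five_min), ("30m", thirty_min)):
    for start, count in result.items():
        print(f"bucket_span={label} bucket={start:%H:%M} count={count}")
```

With a 5m span each event sits alone in its bucket and the 5 min slots in between are empty; with a 30m span the first three events fall into a single bucket. Where an event lands inside its slot does not change which bucket it is counted in.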

pk.241011 · July 2, 2020, 1:38am

Thanks for the response @richcollier.
Is it fine if I set Query delay to 1d? I assume the datafeed keeps track of the point up to which it has already taken in data, to avoid duplicates. And the only cost for me will be a more expensive query, since the time range is bigger.

I am asking this since the data is actually coming from a production line, and they run the line when needed. There is no schedule. There may be days during which they do not make anything, and a few days when they run the line 24 hrs non-stop.

richcollier · July 3, 2020, 12:39pm

There are 3 important parameters: bucket_span, query_delay, and frequency

bucket_span is the analytics aggregation interval
frequency is how often the data is queried via the datafeed
query_delay is the total offset (from "now"), i.e. how far behind real time the data is queried

In other words, having a query_delay of 1d doesn't make the query more expensive or the time range bigger. It is purely a lag behind real-time.
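A rough way to see this is to model each scheduled run as querying from where the previous run stopped up to "now" minus query_delay. The sketch below is a simplified model under that assumption, not the actual datafeed code; `realtime_windows` and its parameters are hypothetical names, and the first run is assumed to cover one `frequency` worth of data.

```python
from datetime import datetime, timedelta


def realtime_windows(first_run, frequency, query_delay, runs):
    """Return (run_time, window_start, window_end) for each scheduled run."""
    windows = []
    # Assumption: the first run picks up one `frequency` worth of data.
    previous_end = first_run - query_delay - frequency
    for i in range(runs):
        run_time = first_run + i * frequency
        window_end = run_time - query_delay  # lag behind real time
        windows.append((run_time, previous_end, window_end))
        previous_end = window_end  # next run continues where this one stopped
    return windows


first_run = datetime(2020, 7, 3, 12, 0)
frequency = timedelta(minutes=5)

for delay in (timedelta(minutes=5), timedelta(days=1)):
    for run_time, start, end in realtime_windows(first_run, frequency, delay, 3):
        print(
            f"query_delay={delay} run={run_time:%H:%M} "
            f"width={end - start} lag={run_time - end}"
        )
```

In this model the width of each queried range equals the frequency (5 minutes) for both delays; only the lag between the run time and the end of the range changes from 5 minutes to 1 day.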

system · July 31, 2020, 12:39pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
