I am creating machine learning jobs that run in real time. My questions are:
What kind of information does Latest Timestamp provide?
and
Why does the Latest Timestamp remain the same as it was on the first day I created the job, even though the latest timestamp has changed inside the graph?
When you say that the latest timestamp is changing in the Single Metric Viewer, are you referring to the time picker in the top-right or are you seeing results in the chart with a newer timestamp than what you see in the job list?
Here it says March 12th 2021, but the Latest Timestamp in the overview shows a very old date.
I increased the query_delay and it seems to be better; however, now I get (random) warnings saying "Datafeed has missed a number of documents due to ingest latency".
We ingest data once a day at a random time, and I set the delay to 1d, but I still got the warning.
I don't really understand how this works.
How much bigger should the query_delay be in order to avoid those warnings?
In short, query_delay is what lags the entire job behind real time. If you only ingest data once per day then indeed, you will need to lag your job with a query_delay of at least 1 day. The bucket_span of your job also matters.
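As a sketch, the query_delay can be changed on an existing datafeed with the datafeed update API. The datafeed ID `my-datafeed` and the 26h value below are illustrative assumptions, not values from this thread:

```
POST _ml/datafeeds/my-datafeed/_update
{
  "query_delay": "26h"
}
```

Since ingestion happens at a random time each day, padding the delay beyond exactly 1d (e.g. a few extra hours) gives the late-arriving documents time to land before the datafeed searches that time range.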
Keep in mind that the anomaly detection jobs can either be running in real-time (with a delay, of course) or they could be invoked periodically (with a script that hits the datafeed API with a start and end time, for example) to process previously ingested documents.
What you DON'T want is for the Anomaly Detection job to search for data in the ES index for a certain time range, but have no documents in the index because they are not ingested yet.
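For the periodic approach mentioned above, a hedged sketch of starting the datafeed over a fixed, already-ingested window via the start datafeed API (the datafeed ID and timestamps are illustrative):

```
POST _ml/datafeeds/my-datafeed/_start
{
  "start": "2021-03-11T00:00:00Z",
  "end": "2021-03-12T00:00:00Z"
}
```

The datafeed processes only that window and then stops, so ingest latency is no longer a concern as long as the script runs after the daily ingest has finished.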