I have a monthly index with 350k+ documents/events per month, or 118.3 MB of data per month. I have data from March to the present (July), i.e. 4+ months of data. In total, that is about 1400k events and 473.2 MB of data, yet I'm still getting the following error when I try to run a Forecast for a day.
The general rule of thumb is that you should not ask for a forecast whose duration exceeds your historical data. In other words, if I had a week's worth of data, I should not expect to be able to forecast out a month.
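For reference, a forecast is kicked off via the _forecast endpoint, and the duration parameter is what gets checked against the history the job has seen. A minimal sketch in Dev Tools, assuming a hypothetical job id of lb_requests:

```
// Request a 1-day forecast for the (hypothetical) job "lb_requests".
// Keep "duration" well inside the span of data the job has analyzed.
POST _ml/anomaly_detectors/lb_requests/_forecast
{
  "duration": "1d"
}
```

With 4+ months of history, a 1-day duration is well within the rule of thumb, which is why the questions below matter.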
So, I need to ask a few questions:
1. What does the general trend of the data look like? A screenshot of the data in the Single Metric Viewer would help.

2. Since you are asking it to predict for every lbname, I wonder if there are some instances of lbname that don't have very much data (although it looks like this one does). Even a single by_field out of all of them could potentially cause this error to be thrown. How many lbnames are there, and is there a reasonable way to check whether every one of them has sufficient historical data? (See the sketch below for one way to check.)
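One way to check is a terms aggregation on lbname with sub-aggregations for the earliest and latest event per load balancer. This is a sketch only; the index pattern (lb-logs-*) and field names (lbname as a keyword field, @timestamp) are assumptions to swap for your own:

```
// Per-lbname doc counts plus first/last event timestamps.
GET lb-logs-*/_search
{
  "size": 0,
  "aggs": {
    "per_lbname": {
      "terms": { "field": "lbname", "size": 1000 },
      "aggs": {
        "first_event": { "min": { "field": "@timestamp" } },
        "last_event":  { "max": { "field": "@timestamp" } }
      }
    }
  }
}
```

Any lbname whose doc_count is tiny, or whose first_event is very recent, is a candidate for starving its partition of the history the forecast needs.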
I see that your data doesn't really have any discernible trends to it (i.e. no cyclicality, no general upward slope, etc.). Even if you get this to work, you might be underwhelmed by the forecast. You'll likely get a horizontal line around the middle of the range of the data (somewhere around 30-ish in this case). Maybe it is not obvious, but the forecast capability extrapolates trends in the data - it doesn't "predict" anomalies. Anomalies are, by definition, surprise events.
One thing you could try is to isolate the data for a single lbname (filter in the Discover tab and save it as a saved search). Then create an ML job from that saved search on that single time series and run the forecast on that.
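The saved search just needs a filter on one lbname value. In Discover that is a one-line KQL filter such as `lbname : "lb-prod-01"`, and the equivalent query DSL the job's datafeed would run looks roughly like this (again, lb-logs-* and the value lb-prod-01 are placeholders):

```
// Restrict the job's input to a single load balancer's time series.
GET lb-logs-*/_search
{
  "query": {
    "term": { "lbname": "lb-prod-01" }
  }
}
```

If the single-series job forecasts fine, that points the finger at one or more sparse lbname partitions in the multi-series job.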