I just started to do some testing with X-Pack machine learning and especially the forecasting feature. I have very simple data with (time, value) on a monthly basis with 60 datapoints (5 years) in history. Until now I did not achieve a very good result. I choose different buckets (between 30d and 365d) as well as different aggregations.
Does somebody have a similar usecase and made some experience?
In general, you probably simply don't have enough data. Only having 60 points is not enough to create a robust statistical model of data - having on the order of hundreds of data points should be sufficient, however. Do you have access to daily (or hourly) data instead?
thank's for the fast feedback. Sadly I only have the monthly data and it's true that 60 points are not very much. We allready experienced a bit with ARIMA and LLP models (programing it manually) and it worked not pwerfect but quite ok although there are only 60 points. Do you know what kind of model is used within the ML Library?
I also recognized that the forecast is only possible for the next 8 weeks. Is it possible to change it to the next x points in future?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.