Hi all. I have an Anomaly Job running with hourly buckets. The values rise and fall once per day. It seems to work great in tracking that pattern, and learning about weekends.
But I have a newbie question. When ML is looking at, say, the 3 p.m. bucket today, is it primarily comparing against the OTHER 3 p.m. buckets it saw in the past? Assuming yes, do more recent 3 p.m. buckets count more than, say, the 3 p.m. bucket from a month ago?
Anomaly detection jobs do learn the trends in your data (e.g. busier during the day than at night, busier on weekdays than on weekends), and those trends are "factored out" of the probabilistic model that the observations in your data help form.
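To make "factored out" a bit more concrete, here is a rough toy sketch in Python. It is not how Elastic ML is actually implemented; it only illustrates the general idea of removing a daily pattern first and then scoring what is left over. The data, the function names, the per-hour mean baseline, and the normal-distribution assumption are all made up for illustration.

```python
import math
from collections import defaultdict
from statistics import mean, pstdev


def fit_hourly_baseline(values):
    """Average the readings for each hour of the day (index 0 = midnight on day 0)."""
    by_hour = defaultdict(list)
    for i, v in enumerate(values):
        by_hour[i % 24].append(v)
    return {hour: mean(vs) for hour, vs in by_hour.items()}


def residuals(values, baseline):
    """Subtract the daily pattern so only the unexplained part remains."""
    return [v - baseline[i % 24] for i, v in enumerate(values)]


def tail_probability(value, hour, baseline, sigma):
    """Two-sided normal tail probability of the residual (toy assumption)."""
    z = abs(value - baseline[hour]) / sigma
    return math.erfc(z / math.sqrt(2))


# Hypothetical history: 14 days of hourly data with a daily cycle and small noise.
history = [
    100 + 50 * math.sin(2 * math.pi * (i % 24) / 24) + ((i * 7) % 5 - 2)
    for i in range(24 * 14)
]

baseline = fit_hourly_baseline(history)
sigma = pstdev(residuals(history, baseline))

# The same raw value can be normal at one hour and unusual at another,
# because it is judged after the daily pattern has been removed.
typical_3pm = baseline[15]
p_typical = tail_probability(typical_3pm, 15, baseline, sigma)
p_spike = tail_probability(typical_3pm + 20, 15, baseline, sigma)
p_wrong_hour = tail_probability(baseline[6], 15, baseline, sigma)
```

In this sketch every past day counts equally toward the baseline, which is just the simplest choice for a toy and says nothing about how the real job weights older data.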