Exclude data from certain interval from Machine Learning

heric · September 22, 2020, 9:53pm

Hi All,

I have been collecting data for machine learning for more than 2 weeks, and I accidentally ruined the data collection for one interval: the value shows a huge jump during that interval.

Anomaly detection is now being affected by this highly abnormal value, and if I run a forecast it will take the faulty data into account as well.

Is there any way to exclude this particular interval from machine learning?
I tried creating a calendar covering this interval, but it doesn't have any effect, maybe because the interval is in the past?

Below are the screenshots from the Single Metric Viewer.

[Screenshot: data crossing the faulty interval]

[Screenshot: data from the beginning until just before the faulty interval]

Thanks,
Heri

sophie_chang · September 23, 2020, 9:11am

Anomaly detection is online learning, so we constantly update the model to reflect the data we have seen.

We store snapshots of this model along the way, and it is possible to revert to a previous model snapshot, i.e. one taken before the faulty interval. From 7.9, we have a UI for model snapshot management, which will create a calendar event to skip the faulty period. Prior to 7.9, we have APIs that support restoring the model state.
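As a rough illustration of the API route, here is a minimal Python sketch that picks the most recent snapshot taken before the faulty interval and builds the revert request. It only constructs the request and does not send anything. The job ID, snapshot IDs and timestamps are hypothetical, and it assumes the snapshot list comes from `GET _ml/anomaly_detectors/<job_id>/model_snapshots` with `latest_record_time_stamp` in epoch milliseconds. The job needs to be closed before reverting.

```python
from typing import List, Optional, Tuple


def pick_snapshot_before(snapshots: List[dict], faulty_start_ms: int) -> Optional[dict]:
    """Return the newest snapshot whose latest processed record precedes the faulty interval."""
    candidates = [
        s for s in snapshots
        if "latest_record_time_stamp" in s
        and s["latest_record_time_stamp"] < faulty_start_ms
    ]
    # default=None covers the case where every snapshot already saw the bad data.
    return max(candidates, key=lambda s: s["latest_record_time_stamp"], default=None)


def revert_request(job_id: str, snapshot_id: str) -> Tuple[str, str, dict]:
    """Build (method, path, body) for the revert model snapshot API."""
    path = f"/_ml/anomaly_detectors/{job_id}/model_snapshots/{snapshot_id}/_revert"
    # delete_intervening_results removes results written after the snapshot,
    # so the job can be re-run over that period.
    body = {"delete_intervening_results": True}
    return "POST", path, body


# Hypothetical snapshot list, trimmed to the fields used here.
snapshots = [
    {"snapshot_id": "a", "latest_record_time_stamp": 100},
    {"snapshot_id": "b", "latest_record_time_stamp": 200},
    {"snapshot_id": "c", "latest_record_time_stamp": 300},
]
chosen = pick_snapshot_before(snapshots, faulty_start_ms=250)
if chosen is not None:
    print(revert_request("my-job", chosen["snapshot_id"]))
```

After reverting, a calendar event covering the faulty interval should be in place before the datafeed is restarted, otherwise the model will learn from the bad data again.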

Alternatively, you can clone the job and run it again over the same data. Please remember to set a calendar event covering the faulty interval beforehand, so that the new job skips it.
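For the clone approach, the calendar setup could look roughly like the sketch below. It builds the three calendar API requests (create the calendar, add a scheduled event for the faulty interval, assign the cloned job) without sending them. The calendar ID, job ID, event description and dates are all made up for illustration; substitute the real interval.

```python
from datetime import datetime, timezone
from typing import List, Tuple

Request = Tuple[str, str, dict]


def to_epoch_ms(dt: datetime) -> int:
    """Convert a timezone-aware datetime to epoch milliseconds."""
    return int(dt.timestamp() * 1000)


def skip_interval_requests(
    calendar_id: str,
    job_id: str,
    start: datetime,
    end: datetime,
    description: str = "Faulty data collection",
) -> List[Request]:
    """Build (method, path, body) requests that make a job skip [start, end)."""
    if end <= start:
        raise ValueError("end must be after start")
    event = {
        "description": description,
        "start_time": to_epoch_ms(start),
        "end_time": to_epoch_ms(end),
    }
    return [
        # 1. Create the calendar.
        ("PUT", f"/_ml/calendars/{calendar_id}", {}),
        # 2. Add the scheduled event covering the faulty interval.
        ("POST", f"/_ml/calendars/{calendar_id}/events", {"events": [event]}),
        # 3. Assign the (cloned) job to the calendar before starting its datafeed.
        ("PUT", f"/_ml/calendars/{calendar_id}/jobs/{job_id}", {}),
    ]


# Hypothetical one-hour faulty interval.
start = datetime(2020, 9, 20, 10, 0, tzinfo=timezone.utc)
end = datetime(2020, 9, 20, 11, 0, tzinfo=timezone.utc)
for method, path, body in skip_interval_requests("skip-faulty", "my-job-clone", start, end):
    print(method, path, body)
```

The ordering matters: the event has to exist and the job has to be attached to the calendar before the cloned job processes the faulty interval.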

heric · September 23, 2020, 10:43am

Thank you, @sophie_chang.

system · October 21, 2020, 10:43am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
