How to avoid false positive [Updated]

genPz01 · June 7, 2018, 1:56pm

Hi,

Elastic.co claims that ML reduces false positive but doesn't tell how.

To explain my question let's consider this scenario:

My aim is to monitor -using ML and ELK stack- a Java application installed on a Linux Host.

Let's say that suddenly there is a lot of traffic generated by/within the app (ex: a lot of visitors connecting to the GUI, JMS messages goes up,...), this means for example that the RAM usage (it can be the JVM instead, but let's keep the RAM) will grow significantly!

Is there any ML job applied to "RAM-used metric" that if the RAM usage grows and the traffic generated grows also, ML considers that situation normal and doesn't shoots a notification or consider it an anomaly ?!

Another general question: Can we (via api for example) tell the ML that a generated anomaly is a false positive and so delete it ?

With regards,

richcollier · June 8, 2018, 12:55pm

ML helps reduce false positives over other techniques (i.e. static threshold alerts or simplified stats like standard deviations) because of the very nature of the approach, which is to only "alert" when the behavior of something is statistically significantly different than it usually is. I put "alert" in quotes because ML isn't doing the alerting, the integration with X-Pack Alerting ("Watcher") is the mechanism to actually alert.

Your second question about doing an AND on two analyses (RAM usage and traffic) is currently best solved via a chained-input Watch in which you first query the results of the 1st ML job, and then use the contextual information (i.e. the hostname that the anomaly is for) to subsequently query for anomalies in the 2nd ML job for that entity - then only alert if both conditions match.

There is no current facility to tag an individual generated anomaly as a false positive. What's the concern here?

genPz01 · June 12, 2018, 8:24am

Thank you for your response.
For my last question, I thought it would be a good idea to mark a generated anomaly as a false positive in case of a known and unusual activity that we forgot to declare in "Calendar & scheduler events".

richcollier · June 12, 2018, 4:04pm

Ok - thanks for clarifying. We'll take that suggestion into consideration.

Topic		Replies	Views
Is ML capable to detect RAM saturation? Elasticsearch elastic-stack-machine-learning	10	706	October 30, 2018
ML/Alerting for amount of successful transactions Elastic Observability elastic-stack-machine-learning , elastic-stack-alerting	6	541	November 4, 2022
Anomaly Jobs - General Strategies to reduce false positives Elasticsearch elastic-stack-machine-learning	3	257	December 26, 2023
Trigering Alerts for Machine learning Jobs SIEM	3	119	August 1, 2024
Machine Learning module is triggering alerts when there is no anomaly Elasticsearch elastic-stack-machine-learning	27	2809	July 1, 2019

How to avoid false positive [Updated]

Related topics