Dear community, I'm feeding documents directly from a source index to an ML anomaly detection job, using population analysis for a high-cardinality use case. (I don't want to feed aggregated data directly or use a transform, as that takes too much time at the cardinality I have.) I use built in…

[ML] Custom function in anomaly detection job

richcollier (rich collier) February 23, 2023, 3:46pm 2

You should do this with a set of query aggregations in the datafeed of the ML job so that the sums and the ratio are calculated first, then fed to ML.

An older but still relevant example is here: Analyzing a ratio of documents over time with Anomaly Detection

In your case, you'd define two separate sum aggregations, then compute the ratio between them with a bucket_script aggregation.
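
As a rough sketch of what that datafeed aggregation could look like, here is a small Python helper that builds the aggregations body as a dict. The field names ("@timestamp", "user.name", "bytes_out", "bytes_in"), the 15m interval, the terms size, and the divide-by-zero guard are all placeholders I made up for illustration, not taken from your job. The aggregation types themselves (date_histogram, max, terms, sum, bucket_script) are standard Elasticsearch aggregations.

```python
import json


def build_ratio_aggs(time_field, entity_field, numerator_field,
                     denominator_field, interval="15m"):
    """Build a datafeed 'aggregations' body that computes a per-entity ratio.

    All field names passed in are hypothetical; substitute your own.
    """
    return {
        "buckets": {
            # Top-level time bucketing for the datafeed.
            "date_histogram": {"field": time_field, "fixed_interval": interval},
            "aggregations": {
                # Max of the time field, named after the time field itself.
                time_field: {"max": {"field": time_field}},
                # One sub-bucket per entity (the population member),
                # named after the entity field.
                entity_field: {
                    "terms": {"field": entity_field, "size": 1000},
                    "aggregations": {
                        "numerator_sum": {"sum": {"field": numerator_field}},
                        "denominator_sum": {"sum": {"field": denominator_field}},
                        # Ratio of the two sums, guarded against division by zero
                        # (the guard value 0 is an assumption).
                        "ratio": {
                            "bucket_script": {
                                "buckets_path": {
                                    "num": "numerator_sum",
                                    "den": "denominator_sum",
                                },
                                "script": "params.den == 0 ? 0 : params.num / params.den",
                            }
                        },
                    },
                },
            },
        }
    }


aggs = build_ratio_aggs("@timestamp", "user.name", "bytes_out", "bytes_in")
print(json.dumps(aggs, indent=2))
```

The detector would then analyze the "ratio" field over the entity field, and as far as I recall the job's analysis_config also needs summary_count_field_name set to doc_count when the datafeed uses aggregations. With very high cardinality, a fixed terms size may not cover every entity, so a composite aggregation may be a better fit; treat the terms bucket here as a starting point only.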
