ML: difference between partition_field_name and by_field_name?

CameronCenic · July 29, 2021, 8:13pm

I do not really understand the difference between these two settings. They seem to perform the same function from my perspective.

BenTrent · July 29, 2021, 8:28pm

This other Discuss thread may shed some light: ML Kibana: difference between by_field_name and partition_field_name (post #4, by richcollier)

CameronCenic · July 29, 2021, 8:45pm

Alright, thanks for the link. As I understand it, partition_field_name is a harder split in the model, then? So if I want the anomaly scores to be based solely on data matching the split field, I should use partition_field_name. And I should only use by_field_name if I want a softer split that lets data from the whole population affect the anomaly scores.

richcollier · July 30, 2021, 2:01pm

Yes, that's pretty much it. Think of partition_field_name as practically equivalent to running N single metric jobs, one for each value of the partition field (where N is that field's cardinality). Since version 6.5, the scoring of anomalies in one partition is largely independent of anomalies in other partitions.

So, use partition_field_name for logical splits that should be more independent of each other.
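To make the two options concrete, here is a minimal sketch of how the choice shows up in an anomaly detection job body. It only builds the JSON; it does not talk to a cluster. The helper `build_job`, the metric field `responsetime`, the split field `airline`, and the 15m bucket span are illustrative assumptions, not anything from this thread.

```python
import json

# Hypothetical helper: builds the body you would send when creating an
# anomaly detection job. Field names and bucket span are placeholders.
SPLIT_KEYS = ("partition_field_name", "by_field_name")


def build_job(split_kind, split_field):
    """Return a job body with one mean detector split on split_field."""
    if split_kind not in SPLIT_KEYS:
        raise ValueError(f"split_kind must be one of {SPLIT_KEYS}")
    detector = {
        "function": "mean",
        "field_name": "responsetime",  # assumed metric field
        split_kind: split_field,       # the only line that differs
    }
    return {
        "analysis_config": {"bucket_span": "15m", "detectors": [detector]},
        "data_description": {"time_field": "@timestamp"},
    }


# Harder split: each value of the field is scored largely on its own.
partition_job = build_job("partition_field_name", "airline")

# Softer split: each value is still modeled separately, but scoring is
# less independent across values.
by_job = build_job("by_field_name", "airline")

print(json.dumps(partition_job, indent=2))
print(json.dumps(by_job, indent=2))
```

The two bodies differ only in which key carries the split field, so switching between the harder and softer split is a one-line change in the detector.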

