Numeric Population fields in Population Job?

jong99 · March 6, 2019, 11:20am

We're evaluating machine learning to see if it will do the job we want it to, but we are coming against issues when creating Population Jobs. For the population field we can only select keywords. Since our user ids are integers we can't split the population this way. If we store our user id instead as a keyword then we can select this and we get good results.

Is this restricted for cardinality issues, i.e. the same as this issue?

richcollier · March 6, 2019, 11:59am

The Population UI doesn't allow it because often, a numerical field chosen as the population would be a misconfiguration (imagine selecting response_time as a population field, for example - it would not be sensible). In other words, it is trying to prevent you from making a mistake.

Now, if you really don't want to modify how that field is stored, you can still create a ML job in the UI - you just need to do it in the Advanced Job wizard (or the API). To make a job a population job, select the field that is the population as the over_field_name (link). The Advanced Job Wizard does not prevent you from selecting numerical fields.

system · April 3, 2019, 12:10pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Can not split by number fields in multi metric machine learning jobs Elasticsearch elastic-stack-machine-learning	4	989	August 11, 2017
ML multi metric split filed only have keywork Kibana elastic-stack-machine-learning	13	760	September 10, 2021
Fields are not visible in machine learning job Elasticsearch elastic-stack-machine-learning	14	959	January 17, 2019
Why can't I use user.name as field in machine learning job, but the standard jobs can? Kibana elastic-stack-machine-learning	25	948	June 14, 2022
Creating multi metric job can only use distinct count on IP Elasticsearch elastic-stack-machine-learning	8	1162	March 5, 2018

Numeric Population fields in Population Job?

Related topics