Sub-partitions in Machine Learning


Is it possible to make sub-partitions in Machine Learning? domain and event are fields in every document.
I created a categorization ML job with partition_field_name = domain, but this is not enough: I need to split by both domain and event. Since it's a categorization job, by_field_name is already taken by mlcategory, so I guess the only way to achieve this is to concatenate domain and event and partition on that?

This is the job config:

{
  "job_id": "test_ml",
  "description": "",
  "groups": [],
  "analysis_config": {
    "bucket_span": "15m",
    "detectors": [
      {
        "function": "count",
        "by_field_name": "mlcategory",
        "partition_field_name": "domain"
      }
    ],
    "influencers": [],
    "categorization_field_name": "error"
  },
  "data_description": {
    "time_field": "date_time"
  },
  "analysis_limits": {
    "model_memory_limit": "139MB"
  },
  "results_index_name": "test_ml"
}


Correct: since mlcategory already occupies the by_field_name, you lose the ability to do a second split on event.

You'd need to create a scripted field in your datafeed that concatenates domain and event, and then partition on this new domain_event field.
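As a sketch, the datafeed's script_fields section could look something like this (the domain_event name and the "_" separator are illustrative; adjust the field accessors to your mapping, e.g. domain.keyword if the fields are text with a keyword sub-field):

```
"script_fields": {
  "domain_event": {
    "script": {
      "lang": "painless",
      "source": "doc['domain'].value + '_' + doc['event'].value"
    }
  }
}
```

The detector would then use "partition_field_name": "domain_event" instead of "domain", while by_field_name stays mlcategory and categorization_field_name stays error.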

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.