How to split a field value into separated fields in elasticsearch

Hung_M_Le · July 13, 2020, 9:50am

Hi

I have the following issue that I hope to get some help resolving.

Background:

. I ingest a log file using Filebeat.
. Inside Elasticsearch, I defined grok and kv processors to split the incoming data into separate fields.

Questions:
. If I have a field that I want to split further into different fields, how can I do it?
. Is there a way to apply a regular expression to a field to determine a match and split this field into different values?
. Can I assign the new split values to different fields?

Example:

I have a field:
navlog.context.filename: https://xxx.yyy.com/NA/GEN4/LANDMARK/version.properties

I want to split the above field into:

navlog.context.region: NA
navlog.context.product: GEN4
navlog.context.layer: LANDMARK
navlog.context.filename: version.properties

Thank you in advance for your help.

Best Regards

Hung Le

S0ul · July 16, 2020, 2:34pm

Hi Hung_M_Le,

Have you considered using grok again on your newly generated fields? You could also split by "/", rename the fields you want to keep, and drop the others, but I don't see why you would do that if grok is usable.

Regards,
S0ul

Vinayak_Sapre · July 17, 2020, 4:13am

@Hung_M_Le
I would use a script processor to avoid running a regex multiple times. For example:

PUT _ingest/pipeline/filename_splitter
{
  "processors": [
    {
      "script": {
        "lang": "painless", 
        "source": """
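          // Full URL, e.g. https://host/REGION/PRODUCT/...
          String fn = ctx['navlog.context.filename'];
          // First '/' after the scheme, i.e. the end of the host name
          int loc = fn.indexOf('/', 'https://'.length());
          // Next '/', i.e. the end of the region segment
          int loc2 = fn.indexOf('/', loc+1);
          if (loc2 > -1) {
            ctx['navlog.context.region'] = fn.substring(loc+1, loc2);
            // Move on to the product segment
            loc = loc2;
            loc2 = fn.indexOf('/', loc+1);
            if (loc2 > -1) {
              ctx['navlog.context.product'] = fn.substring(loc+1, loc2);
            }
          }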
          """
      }
    }
  ]
}

POST navlog/_doc?pipeline=filename_splitter
{
  "navlog.context.filename" : "https://xxx.yyy.com/NA/GEN4/LANDMARK/version.properties"
}

spinscale · July 23, 2020, 8:45am

The split processor might be another way to go.

Vinayak_Sapre · July 23, 2020, 9:00am

The split processor generates an array, but we need a dictionary: each part of the string has to be set to a different named field.

spinscale · July 23, 2020, 9:21am

You can just pick the array elements then and set them to fields manually using a script processor.
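As a rough illustration of the split-then-script idea, here is a minimal sketch. It assumes the document stores the value as a nested object (navlog -> context -> filename) rather than as a single dotted key, and that the URL always has the host/region/product/layer/file shape from the original question. The pipeline name `filename_split_sketch` and the temporary field `navlog.context.parts` are made up for this example.

```console
PUT _ingest/pipeline/filename_split_sketch
{
  "processors": [
    {
      "split": {
        "field": "navlog.context.filename",
        "separator": "/",
        "target_field": "navlog.context.parts"
      }
    },
    {
      "script": {
        "lang": "painless",
        "source": """
          // Splitting 'https://host/NA/GEN4/LANDMARK/version.properties' on '/'
          // gives: ['https:', '', 'host', 'NA', 'GEN4', 'LANDMARK', 'version.properties']
          def parts = ctx.navlog.context.parts;
          // Assumption: only handle the exact 7-element shape shown above
          if (parts.size() == 7) {
            ctx.navlog.context.region = parts[3];
            ctx.navlog.context.product = parts[4];
            ctx.navlog.context.layer = parts[5];
            ctx.navlog.context.filename = parts[6];
          }
          // Drop the temporary array (hypothetical helper field)
          ctx.navlog.context.remove('parts');
          """
      }
    }
  ]
}
```

Checking the array size before indexing keeps the script from failing on URLs with a different number of segments; those documents simply keep the original filename.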

Hung_M_Le · August 6, 2020, 8:51am

Thank you, Vinayak and Alexander, for your recommendations. I tried both "script" and "split" and ran into some issues when the format of the field changes; however, I found a way to parse the field using grok. Here is the grok syntax that I used, and it works pretty well. I also like the fact that I can use the grok debugger to test out the grok pattern.

{
  "grok": {
    "if": "ctx.navlog?.message != null && ctx.navlog?.message =~ /^T\|/",
    "field": "navlog.context.filename",
    "patterns": [
      "https://%{DATA:navlog.context.web_server}/%{DATA:navlog.context.region}/%{DATA:navlog.context.project}/%{DATA:navlog.context.layer}/%{DATA:navlog.context.map_level}/%{DATA:navlog.context.map_sublevel}/%{DATA:navlog.context.tiles}/%{DATA:navlog.context.tile_id}/%{DATA:navlog.context.filename}",
      "https://%{DATA:navlog.context.web_server}/%{DATA:navlog.context.region}/%{DATA:navlog.context.project}/%{DATA:navlog.context.layer}/%{DATA:navlog.context.filename}"
    ]
  }
}
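Besides the grok debugger, a pattern like this can probably also be tried against a sample document with the simulate pipeline API before it goes into a real pipeline. The request below is only a sketch: it uses the shorter URL shape and the sample value from the original question, assumes the field is stored as a nested object, and leaves out the "if" condition. The ^ and $ anchors and the GREEDYDATA on the last capture are choices made for this sketch so that the final segment runs to the end of the string; they are not part of the pattern posted above.

```console
POST _ingest/pipeline/_simulate
{
  "pipeline": {
    "processors": [
      {
        "grok": {
          "field": "navlog.context.filename",
          "patterns": [
            "^https://%{DATA:navlog.context.web_server}/%{DATA:navlog.context.region}/%{DATA:navlog.context.project}/%{DATA:navlog.context.layer}/%{GREEDYDATA:navlog.context.filename}$"
          ]
        }
      }
    ]
  },
  "docs": [
    {
      "_source": {
        "navlog": {
          "context": {
            "filename": "https://xxx.yyy.com/NA/GEN4/LANDMARK/version.properties"
          }
        }
      }
    }
  ]
}
```

The response lists each sample document as it would look after the processors run, which makes it easy to add more sample URLs to "docs" when the field format changes.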

system · September 3, 2020, 8:51am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
