"processors" : [
{
"grok": {
"field": "log",
"patterns": ["%{TIME_STAMP:ts} %{GREEDYDATA:logtail}"],
"pattern_definitions" : {
"TIME_STAMP" : "%{YEAR}-%{MONTHNUM}-%{MONTHDAY} %{TIME}"
},
"ignore_failure" : true,
"ignore_missing" : true
}
},
{
"kv" : {
"field": "logtail",
"field_split": "\\s(?![^=]+?(\\s|$))",
"value_split": "=",
"ignore_failure" : true
}
},
{
"remove" : {
"field": "logtail",
"ignore_failure" : true
}
},
{
"date" : {
"field" : "ts",
"formats" : ["yyyy-MM-dd HH:mm:ss,SSS"],
"ignore_failure" : true
}
}
]
Above is our ingest pipeline (grok + kv).
Normally our logs are nice and clean, e.g.:

2024-09-24 15:07:59,572 level=INFO channel=wsgi.request method=GET path=/health/ user_agent="ELB-HealthChecker/2.0" request_action=finish duration=0.005 status=200 content_length=26

That works perfectly.
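For reference, this is roughly how we test it with _simulate (the pipeline id logs-kv-pipeline below is just a placeholder for whatever name the pipeline is stored under):

POST _ingest/pipeline/logs-kv-pipeline/_simulate
{
  "docs": [
    {
      "_source": {
        "log": "2024-09-24 15:07:59,572 level=INFO channel=wsgi.request method=GET path=/health/ user_agent=\"ELB-HealthChecker/2.0\" request_action=finish duration=0.005 status=200 content_length=26"
      }
    }
  ]
}

With that line, each key=value pair comes out as its own field, as expected.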
But if the log contains an extra = (for example in a URL), all hell breaks loose, e.g.:

2024-09-24 15:07:59,572 level=INFO channel=wsgi.request method=GET path=/job?id=12345 user_agent="ELB-HealthChecker/2.0" request_action=finish duration=0.005 status=200 content_length=26
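Running the same _simulate call with that line, what we would want to end up with (trimmed to the relevant fields, assuming the kv defaults) is something like:

{
  "ts": "2024-09-24 15:07:59,572",
  "level": "INFO",
  "channel": "wsgi.request",
  "method": "GET",
  "path": "/job?id=12345",
  "status": "200"
}

i.e. the full URL kept as a single path value, but instead the key/value split goes wrong.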
This seems like it must be a very common use case; is there an off-the-shelf fix for it?