I am looking for a solution to do the following: I have a log file where a single log entry can contain multiple key/value pairs that I want to extract. The problem is that I want to extract all occurrences within a single entry, not just the first one (much like the "g" flag in sed and vim).
For example, a log entry could look like this:
2018-03-28 14:23:56 something something user=foo something something more user=bar
It's trivial to write the grok filter that matches the first occurrence:
grok {
  match => [ "logText", "[^A-Za-z0-9]user=(?<user>[A-Za-z0-9]*)" ]
}
> Interesting, but I'm not looking to extract all possible kv pairs. For instance, a log entry often contains a lot of "uninterestingvariable=something" entries that I do not want to store.
The kv filter's include_keys option can deal with that.
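A minimal sketch of that approach, assuming the raw text lives in a field named `logText` as in the grok example above:

```
filter {
  kv {
    source       => "logText"
    include_keys => [ "user" ]
  }
}
```

When the same key appears more than once in an entry, the kv filter collects the values into an array, so `user=foo ... user=bar` would yield `["foo", "bar"]`.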
> Also, I would like to find something more generic, and not just applicable to key/value pairs. For example, there could be something like IDs in the logfile that have a specific, and easily matched alphanumeric format.
Short of a ruby filter I don't think there's a way of doing that with the standard filters.
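For reference, the core of such a ruby filter would be Ruby's `String#scan`, which returns every match rather than just the first. A minimal sketch using the sample entry and pattern from the question (inside an actual ruby filter you would read the field with `event.get("logText")` and write the result back with `event.set`):

```ruby
# scan returns all matches; with one capture group it yields
# an array of one-element arrays, hence the flatten
line  = "2018-03-28 14:23:56 something something user=foo something something more user=bar"
users = line.scan(/\buser=([A-Za-z0-9]+)/).flatten
# users is now ["foo", "bar"]
```

The same pattern works for any "specific, easily matched format" such as IDs, not just key/value pairs: whatever the regex captures, `scan` collects all occurrences.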