Logstash - extraction substring from the raw message

jfnoteba · May 1, 2018, 8:52pm

Hi Everyone,

I'm trying to see if there is any logstash plugin which we can use to perform a quick filtering on 2 substrings without need to parse the entire message using grok command:

An example of the log is below and I'm only interesting to extract the 2 substrings in Bold.

<12> May 1 11:46:42 EMCstorevntd: [fmt=evt] [evtid=1072] [date=2018-05-01T15:43:35Z] [symid=000196801796] [Device=00037] [sev=warning] = SRDF R2 device not ready."

With Logstatch, can I do similar regular expression like (symid=)[0-9]{12} to extract the symmetrix ID number and (sev=)(warning|critical) to extract the severity ?

thanks by advanced to share feedback

yaauie · May 1, 2018, 9:37pm

grok doesn't need to match against the whole pattern; it can be used to extract specific bits, and the break_on_match => false directive allows us to extract multiple bits independent of their order:

filter {
  grok {
    break_on_match => false
    match => {
      "message" => ["\[symid=%{NUMBER:symid}\]", "\[sev=%{LOGLEVEL:sev}\]"]
    }
  }
}

When using break_on_match => false, you'll want to make sure your the patterns begin with as specific a string as possible, which will empower the matcher to avoid unnecessary work.

jfnoteba · May 2, 2018, 12:07pm

Thanks you very much. It works perfectly.
Can I put in the same grok filter break_on_match =>fais and break_on_match =true to concatenate multiple filter matching ?
Something like that :

filter {
grok {
break_on_match => true
match => {
"message" => [xxxxx]
break_on_match => false
match => {
"message" => [xxxxx]
}
}

Thanks again to share your knowledge.

yaauie · May 2, 2018, 4:59pm

no; the break_on_match directive applies to the entire grok filter instance and cannot be flip/flopped.

You can, however, use multiple grok filters:

filter {
  # without the `break_on_match` directive (or when explicitly set to `true`),
  # once a match is found, the remaining patterns are not run
  grok {
    match => {
      "message" => [xxxxx, yyyy]
    }
  }

  # when `break_on_match` is set to `false`, grok will attempt to capture using
  # all patterns, even after it finds a match.
  grok {
    break_on_match => false
    match => {
      "message" => [zzzzz, wwww]
    }
}

system · May 30, 2018, 4:59pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Grok filter multiple match Logstash	4	18317	October 31, 2019
GROK multiple MATCH filters Logstash	3	5583	July 6, 2017
How to get all matches for a grok pattern in a multiline message Logstash	5	2809	July 6, 2017
Match specific word in grok filter Logstash	2	3180	August 22, 2019
Want to match against single grok pattern and multiple patterns in same filter Logstash	1	257	September 23, 2020

Logstash - extraction substring from the raw message

Related topics