Split grok-pattern into multiple lines

sthorn · August 27, 2025, 8:45am

Hi,

Is it possible to split a grok-pattern into multiple lines instead of have one big?

grok {
  pattern_definitions => {
    "CM" => "[/.\-\w\s]*"
  }
  match => {"syslog_message" => "%{CM:unknown_id} %{IP:this_ip},%{HOSTNAME:hostname},%{CM:resource_type},%{CM:some_name},%{CM:unknown_01},%{IP:src_ip},%{CM:unknown_02},%{IP:dst_ip},%{NUMBER:src_port:int},%{NUMBER:dst_port:int},%{CM:partition},%{CM:protocol},%{NUMBER:domain},%{CM:unknown_03},%{CM:unknown_04},%{CM:unknown_05},%{CM:unknown_06},%{CM:unknown_09},%{CM:unknown_10},%{CM:unknown_11},%{CM:policy_type},%{CM:policy_name},%{CM:rule_name},%{CM:unknown_15},%{CM:dev_action},%{CM:unknown_17},%{CM:unknown_18},%{CM:unknown_19},%{CM:unknown_20},%{CM:unknown_21},%{CM:unknown_22},%{CM:unknown_23},%{CM:unknown_24},%{CM:unknown_25},%{CM:unknown_26},%{CM:unknown_27},%{CM:unknown_28},%{CM:unknown_29},%{CM:unknown_30}"}
}

This line is way to long.

Rios · August 27, 2025, 9:10am

Most likely yes, we need to se the org. message and what should be result.

sthorn · August 27, 2025, 1:48pm

The parsing works fine, I see no need for a message.

The issue I’m finding is having a line with 700+ chars in git and editors is not optimal.
Is there a way to build the grok-pattern over multiple lines, with an << or += operator perhaps?

leandrojmp · August 27, 2025, 1:54pm

Can you share a sample of your message?

From the pattern you are using in the grok filter your message seems to be a csv message, you could use the csv filter to parse it instead of grok.

stephenb · August 27, 2025, 3:25pm

@leandrojmp has good suggestion to use csv...

But to answer your question ... Yes... but it will not be effecient....

It would be something like

grok {
  pattern_definitions => {
    "CM" => "[/.\-\w\s]*"
  }
  match => {"syslog_message" => "%{CM:unknown_id} %{IP:this_ip},%{HOSTNAME:hostname},...%{GREEDYDATA:msg_part2}
}


grok {
  pattern_definitions => {
    "CM" => "[/.\-\w\s]*"
  }
  match => {"msg_part2" => "<Grok Patterns>%{GREEDYDATA:msg_part3}
}


grok {
  pattern_definitions => {
    "CM" => "[/.\-\w\s]*"
  }
  match => {"msg_part3" => "<Grok Patterns>}
}

Not as efficient

Badger · August 27, 2025, 3:50pm

Yes, you can use custom pattern definitions within a custom pattern definition.

input { generator { count => 1 lines => [ 'Foo, Or Bar,Or Baz' ] } }

output { stdout { codec => rubydebug { metadata => false } } }
filter {
    grok {
        pattern_definitions => {
            ONE => "%{WORD}"
            TWO => "(?<Foo>[^,]*),%{GREEDYDATA}"
            OVERALL => "%{ONE},%{TWO}"
        }
        match => { "message" => "^%{OVERALL}" }
    }

will produce

       "Foo" => " Or Bar"

stephenb · August 27, 2025, 4:49pm

TIL!

sthorn · August 28, 2025, 10:04am

Great I will try that.

sthorn · August 28, 2025, 10:07am

Yes, the message looks very much as a csv-row in this example, but in our real config there is multiple “match”-lines and the input varies in number of columns.
The first and second column determines what column has what values.

The idea with CSV is good and I did not think of it, will try and see if we can use it.

Thank!

sthorn · August 28, 2025, 10:09am

This is one of the possible alternatives that we thought of.
We also was thinking of the performance of it.

Thanks!

Rios · August 28, 2025, 10:38am

Not sure which are better performances, CSV or dissect, you can try both in your case. The dissect filter is ~10x faster than grok. However csv is much more useful if you have pure csv format.

leandrojmp · August 28, 2025, 12:47pm

This is not an issue, if you have different types of message you can still combine othe filters or use conditional to correctly parse it.

Not clear what you mean with this, without you sharing sample of messages is pretty complicated to provide any insight.

The main thing is that while grok can parse almost anything, sometimes you can use other parse filters or combination of other parse filters to make things easier.

Personally I only use grok as the last option, when a message cannot be parsed using other filters or combination of filters.

Topic		Replies	Views
How to parse single log file with multiple grok pattern Logstash	4	3099	June 1, 2017
Logstash grok multiple pattern , multi-line Logstash	2	316	March 19, 2021
Multiline pattern with grok filter Logstash	2	1541	July 6, 2017
Multiple grok pattern, really multiple? Logstash	2	263	April 28, 2022
Grok pattern does not work in logstash however it works in kibana Logstash	12	474	October 21, 2019

Split grok-pattern into multiple lines

Related topics