Hard to stash a log file with varying field order using Logstash

I am trying to stash a log file to Elasticsearch using Logstash, and I am facing a problem while doing this.

If the log file has log lines that all follow the same structure, like the ones below,

[12/Sep/2016:18:23:07] VendorID=5037 Code=C AcctID=5317605039838520
[12/Sep/2016:18:23:22] VendorID=9108 Code=A AcctID=2194850084423218
[12/Sep/2016:18:23:49] VendorID=1285 Code=F AcctID=8560077531775179
[12/Sep/2016:18:23:59] VendorID=1153 Code=D AcctID=4433276107716482

where the order of occurrence of the date, VendorID, Code, and AcctID fields never changes and no new element is added, then the filter (given below) in the config file works well:

\[%{MONTHDAY}/%{MONTH}/%{YEAR}:%{TIME}\] VendorID=%{INT:VendorID} Code=%{WORD:Code} AcctID=%{INT:AcctID}
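For context, here is a minimal sketch of how that pattern sits in the grok filter of my config (nothing beyond the pattern above is implied):

    filter {
      grok {
        match => {
          "message" => "\[%{MONTHDAY}/%{MONTH}/%{YEAR}:%{TIME}\] VendorID=%{INT:VendorID} Code=%{WORD:Code} AcctID=%{INT:AcctID}"
        }
      }
    }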

But suppose the order changes, as in the example below, or a new element is added to one of the log lines; then a grok parse failure (the _grokparsefailure tag) occurs.

[12/Sep/2016:18:23:07] VendorID=5037 Code=C AcctID=5317605039838520
[12/Sep/2016:18:23:22] VendorID=9108 Code=A AcctID=2194850084423218
[12/Sep/2016:18:23:49] VendorID=1285 Code=F AcctID=8560077531775179
[12/Sep/2016:18:23:59] VendorID=1153 Code=D AcctID=4433276107716482
[12/Sep/2016:18:24:50] AcctID=3168124750473449 VendorID=1065 Code=L
[12/Sep/2016:18:24:50] AcctID=3168124750473449 VendorID=1065 Code=L
[12/Sep/2016:18:24:50] AcctID=3168124750473449 VendorID=1065 Code=L

In this example, the last three log lines differ from the first four in the order in which the fields occur. Because of this, the grok pattern in the filter cannot parse those three lines, since it was written for the structure of the first four.

How should I handle this scenario when I come across it? Please help me solve this problem, and if possible provide a link to a document with a detailed explanation and examples.

Thank you very much in advance.

Use a kv filter instead of grok.
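A minimal sketch of that idea: grok only captures the bracketed timestamp, and the kv filter then parses the remainder of the line in whatever order the pairs appear (the kvpairs field name is only for illustration):

    filter {
      grok {
        # Capture the timestamp; hand the rest of the line to the kv filter.
        match => {
          "message" => "\[%{MONTHDAY}/%{MONTH}/%{YEAR}:%{TIME}\] %{GREEDYDATA:kvpairs}"
        }
      }
      kv {
        # Picks up VendorID=..., Code=..., AcctID=... regardless of their order.
        source => "kvpairs"
        remove_field => ["kvpairs"]
      }
    }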

The kv filter works when the data is in key=value form, with the key first and the value after it.

My problem is the scenario where there are no key/value pairs, but other positional fields instead, like:

date username some message ----------- first log line
date username domain program port some message ----------- second log line

We can see the difference in fields within the same log file. What should I do in this case?

Use a grok filter with multiple expressions. The first one tries to match the kind of event that contains domain, program, and port and assumes the rest is key/value pairs, and the second expression assumes everything after the timestamp is key/value pairs.
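A hedged sketch of that approach, using the abstract layout from your example (the TIMESTAMP_ISO8601 date pattern and the field names are assumptions for illustration, not taken from a real log):

    filter {
      grok {
        # Expressions are tried in order; the first one that matches wins.
        match => {
          "message" => [
            "%{TIMESTAMP_ISO8601:date} %{USERNAME:username} %{WORD:domain} %{WORD:program} %{INT:port} %{GREEDYDATA:kvpairs}",
            "%{TIMESTAMP_ISO8601:date} %{GREEDYDATA:kvpairs}"
          ]
        }
      }
      kv {
        source => "kvpairs"
      }
    }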

Is using multiple expressions the only solution for this?

I am trying to handle a lot of log lines with multiple data structures, and it is not limited to just one or two structures.

I cannot read through the entire log file to write the expressions, and even if I did, what if there are 10 to 20 different structures of log lines in there?

The configuration for that case would run to a lot of lines; do you think that is a feasible solution?

Are you interested in saving the fields that come before the key/value pairs? If yes, then I don't see a way to write something generic, but if you don't care about those fields (or at least not all of them), it is of course possible to ignore tokens up until the first one in key=value form.
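A rough sketch of that "ignore the leading tokens" variant (it assumes none of the leading fields contain an = character; kvpairs is again just an illustrative name):

    filter {
      grok {
        # DATA is non-greedy, so the named capture starts at the first
        # token that looks like key=value; everything before it is dropped.
        match => {
          "message" => "^%{DATA}(?<kvpairs>\w+=.*)$"
        }
      }
      kv {
        source => "kvpairs"
      }
    }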

Yes, I need to save the fields. And as you said, if there is no generic way to do this, I will go with writing multiple expressions.

Thank you, Magnus.
