Logstash issue with dropping rows with a condition check on empty elements

Nick11 · January 7, 2020, 9:07pm

I have a CSV file with the following 3 rows (sample):

reservation date, reservationID
Jan 6th, res:id:adbcj-oksok-gjkk
Jan 10th,
Mar 10th, res:id:kkbcj-oksok-gjkk

My task is to drop empty rows and apply a grok filter on reservationID to extract the last element after the "-". This is what I tried, without success:

csv {
        separator => ","
        skip_header => "true"
        autodetect_column_names => "true"
        skip_empty_columns => "true"
        skip_empty_rows => "true"
    }  

if [reservationID] =~ "" {
	grok {
		reservationID => "MY GROK PATTERN HERE, WHICH IS WORKING FINE EXTERNALLY THROUGH THE DEBUGGER"
	}
}

I was expecting the first and the third row in the output (not worried about the grok). Instead I see all 3 rows. Am I missing anything? I do not want the 2nd row in my output.

Thanks

Badger · January 7, 2020, 9:15pm

You do not have any empty rows, so it will not skip any. The second line will not have a [reservationID] field (because you have set skip_empty_columns). So test that:

if ! [reservationID] { drop {} }

Nick11 · January 7, 2020, 9:44pm

Thanks for your reply.

I did try that and it did not work. I still get the 2nd row. I suspect that the skip_empty_columns condition is stripping the reservationID field even before I get a chance to do what you are suggesting.

Badger · January 7, 2020, 9:54pm

That's the idea. The conditional is meant to test whether the reservationID field exists, and to drop the event if it does not.

Nick11 · January 7, 2020, 11:03pm

Yes, makes sense. However, for the expression to evaluate, the reservationID field needs to be present.

What exactly does skip_empty_columns do? Does it strip out the column completely?

Badger · January 8, 2020, 12:09am

No, that is not correct.

If skip_empty_columns is set then columns containing no value will not get set.

Nick11 · January 8, 2020, 3:03am

If that's the case, then how can you use a conditional statement on that column in the very next line?

Did you try your solution?

Badger · January 8, 2020, 2:51pm

The conditional is testing whether the field exists.

Note that your header row has a leading space on the column name so

if ! [reservationID] { drop {} }

will drop everything.

if ! [ reservationID] { drop {} }

will just drop the second row. And yes, I tested it.
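
For reference, the pieces of this thread combine into a filter section along the following lines. This is only a sketch: it assumes the header row is exactly as posted, with a space after the comma, so that the autodetected field name is " reservationID". The csv options are the ones from the original post; the enclosing filter block and the comments are added for illustration.

```
filter {
  csv {
    separator => ","
    skip_header => "true"
    autodetect_column_names => "true"
    # An empty column is never set on the event, so the row "Jan 10th,"
    # ends up without a reservation ID field at all.
    skip_empty_columns => "true"
    skip_empty_rows => "true"
  }

  # The field name is taken from the header, including its leading space.
  # Drop every event where that field does not exist.
  if ! [ reservationID] {
    drop {}
  }
}
```

If the space after the comma is removed from the header line of the CSV file, the field is named reservationID and the test becomes if ! [reservationID] { drop {} }.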
