Remove first line csv logstash

frangolzmil · December 21, 2019, 6:08pm

Hi dudes!

I just wanna remove the header (first line) from my csv files. The first line is containing the column names. After parsing csv file, the visualization on Kibana shows the first line parsed as the column name indicates.

My code:
input {
file {
path => "/home/admxxx/xxxx/xxxx/*.csv"
start_position => "beginning"
sincedb_path => "/dev/null"
}
}

filter {
csv {
columns => ["id tweet","date","author","text","app","id user","followers","following","stauses","location","urls","geolocation","name","description","url_media","type media","quoted","relation","replied_id","user replied","retweeted_id","user retweeted","quoted_id","user quoted","first HT","lang","created_at","verified","avatar","link"]
separator => ";" #tab
skip_header => true
#autodetect_column_names => true
#autogenerate_column_names => true

			    }
		      date {
          #match => ["date","yyyy-MM-dd HH:mm:ss"]
          match => ["date","dd/MM/yyyy HH:mm","dd/MM/yyyy H:mm","yyyy-MM-dd HH:mm:ss"]
          timezone => "UTC"
          target => "@timestamp"
		      }
        
        mutate {
          #gsub => ["message","\","'"]
          #remove_field => ["message"]
        }
        ruby {
        code => "
            event.set('index_monitoring_twitter',event.get('path').split('/')[-1].gsub('.csv',''))
        "
        }
        #grok { 
        #  match => [ "path", "/(?<index_monitoring_twitter>[^/]+).csv" ] 
        # }
	    }

output {
elasticsearch {
hosts => ["http://localhost:9200"]
#index => "monitorizaciones_hoarder"
index => "monitorizaciones_%{index_monitoring_twitter}"

}
stdout{codec => rubydebug} #para comprobaciones
}

Kibana:

Badger · December 21, 2019, 7:19pm

You can check whether the parsed line looks like a header

if [id tweet] == "id tweet" { drop {} }

after the csv filter, or

if [message] =~ /^id tweet;/ { drop {} }

before the csv filter.

system · January 18, 2020, 7:19pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Csv filtering misses 2 first columns Logstash	1	399	November 7, 2017
Remove first few lines of csv in logstash Logstash	2	440	June 16, 2021
Skip header line in CSV input (v 1.5.0) Logstash	8	18820	July 6, 2017
Removing whitespace from column in csv import Logstash	4	2380	August 9, 2017
Parse 1st Line of Multiple CSV files and set as Columns Logstash	5	3457	July 6, 2017

Remove first line csv logstash

Related topics