Logstash read csv column problem

CCH0124 · February 14, 2019, 6:07am

I added autodetect_column_names to the csv block in the filter section to detect the header values automatically, but the index in Kibana shows fields named column2, column3, and so on. What is the cause?

config

input {
        file {
                path => ["/usr/share/logstash/DataSet/TBrain_IPS.csv"]
                close_older => 3600
                codec => "plain"
                delimiter => "\n"
                discover_interval => 30
                enable_metric => true
                id => "ips"
                max_open_files => 5
                sincedb_path => "/dev/null"
                sincedb_write_interval => 15
                start_position => "beginning"
                stat_interval => 7200
                tags => "ips"
                type => "ips"
        }
}
filter {
        csv {
                separator => ","
                autodetect_column_names => true
                skip_empty_columns => false
                skip_empty_rows => false
                skip_header => false
                periodic_flush => true
                id => "csv"
        }
        if "ips" in [tags] {
                mutate {
                        convert => {
                                "event_protocol_id" => "integer"
                        }
                        rename => {
                                "event_rule_reference" => "event_rule_referenceCVE"
                        }
                        split => {
                                "event_rule_reference" => ";"
                        }
                }

        }

}
output {
        elasticsearch {
                hosts => "elasticsearch:9200"
                document_type => "ips-csv"
                index => "ips-%{+YYYY.MM.dd}"
        }
        stdout {
                 codec => rubydebug
        }
}

danhermann · February 14, 2019, 3:39pm

The autodetect_column_names option on the CSV filter works reliably only if you set the number of pipeline worker threads to 1. With more than one worker thread, there is a race condition in which an indeterminate row is selected as the header row. There is a bug filed on the CSV filter for that, but a fix is very difficult.
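
As a rough sketch of the single-worker workaround described above: the worker count can be set with the `pipeline.workers` setting in logstash.yml or with the `-w` command-line flag. The file name `ips.conf` is an assumption made for illustration, and the filter is trimmed to the options relevant here.

```
# Run this pipeline with a single worker so that the rows reach the csv
# filter in file order and the first row is the one taken as the header:
#   bin/logstash -w 1 -f ips.conf     (ips.conf is a hypothetical file name)
# or set `pipeline.workers: 1` in logstash.yml.
filter {
        csv {
                separator => ","
                autodetect_column_names => true
        }
}
```

The trade-off is throughput: with one worker, filters and outputs for this pipeline run on a single thread.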

CCH0124 · February 15, 2019, 5:13am

So for now, are my only options to specify the columns explicitly or to set pipeline.workers to 1?
Is there a better way to solve this?
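
For the explicit-columns option mentioned in this question, a minimal sketch might look like the following. Only `event_protocol_id` and `event_rule_reference` appear in the original config; the real file presumably has more columns, and the list must name every column in file order, so treat this two-column list as a placeholder rather than the actual header.

```
filter {
        csv {
                separator => ","
                # Placeholder list: replace with the full header of
                # TBrain_IPS.csv, in order. Only these two names are
                # known from the original config.
                columns => ["event_protocol_id", "event_rule_reference"]
        }
        # With explicit columns the header line is parsed like any other
        # row, so drop the event whose value equals the column name.
        if [event_protocol_id] == "event_protocol_id" {
                drop { }
        }
}
```

Because the column names no longer depend on which row a worker happens to see first, this approach should not need the worker count to be limited.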

