Import CSV with different column names to same field

Fabio-sama · March 16, 2020, 12:35pm

Ok so, admitting you have files like the following:

You can even do a single pipeline like the following:

input {
  file {
    path => "/absolute_path_to_file1.csv"
    start_position => "beginning"
    sincedb_path => "/dev/null"
    tags => ["file1"]
  }

  file {
    path => "absolute_path_to_file2.csv"
    start_position => "beginning"
    sincedb_path => "/dev/null"
    tags => ["file2"]
  }
}

filter {
  if "file1" in [tags] {
    csv {
      separator => ","
      columns => ["ID","Date","Name"]
    }

    if [ID] == "ID" {
      drop{}
    }
  } else if "file2" in [tags] {
    csv {
      separator => ","
      columns => ["ID","Ref","Subject","Operator"]
    }

    if [ID] == "Identifier" {
      drop{}
    }
  }
}

output {
  elasticsearch {
    hosts => "localhost:9200"
    index => "test"
    document_id => "%{ID}"
    action => "update"
    doc_as_upsert => true
  }
}

This way, you don't even need to rename anything, since you're directly mapping the Identifier value to the ID field.

In your elasticsearch you'll have something like this:

Does this answer to all your questions?

Topic		Replies	Views
Load multiple csv files and update existing index Logstash	7	2704	November 1, 2019
Import csv files to two different index Logstash	6	971	August 31, 2020
Merging CSV files based on a common key column Logstash	2	1745	July 30, 2018
Multiple CSV files need to upload for same column names Logstash	10	2520	February 6, 2018
Import Multiple Files CSV - Aggregate Data Logstash	3	457	April 6, 2020

Import CSV with different column names to same field

Related topics