Hi all, I'm trying to create a processor for an ingest pipeline. I've gone through all the existing processors, but I can't tell whether it's possible to remove a line from a CSV.
The data we need to import has two header lines: line 1 contains the human-readable column names and line 2 contains the column codes (for example, "Quantity" and "QTY").
We need to keep only the first header line and drop the second. Do you think that's possible?
Hi @stephenb, you're right, I was confusing sending a whole CSV file with sending a single row of my CSV.
I can write my own script that parses the CSV, but then I don't understand what the processors are useful for. If I have to parse the CSV myself, I can simply build the document the way I need it before storing it, without any processor.
What I'd like to have is less code in my script and more configuration on Elasticsearch (with processors).
If I send the whole CSV as a single document and use the CSV processor, that brings me back to my first question: how can I instruct the pipeline to skip line 2 of the CSV? Maybe I should use the script processor before the CSV processor?
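For comparison, here is a minimal sketch of the client-side alternative I mentioned: dropping the second header line in my own script before anything reaches Elasticsearch. The function name `rows_without_code_header` and the "Price"/"PRC" column are made up for illustration; only "Quantity"/"QTY" come from my real data.

```python
import csv
import io


def rows_without_code_header(csv_text):
    """Return one dict per data row, keyed by the line-1 column names.

    Line 2 (the column codes) is read and thrown away.
    """
    reader = csv.reader(io.StringIO(csv_text))
    names = next(reader)   # line 1: human-readable names, e.g. "Quantity"
    next(reader, None)     # line 2: column codes, e.g. "QTY" (discarded)
    return [dict(zip(names, row)) for row in reader]


# Hypothetical sample: two header lines followed by two data rows.
sample = "Quantity,Price\nQTY,PRC\n3,9.50\n7,1.25\n"
docs = rows_without_code_header(sample)
```

Each entry in `docs` could then be sent as its own document, which is exactly the extra code I was hoping to replace with pipeline configuration.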
Thank you @stephenb your explanation cleared all my doubts!
I have another question, related to Filebeat; I hope you can help with this too. I have many CSV files in different folders, and each folder corresponds to a different index. So my question is: how can I pass this information to the ingest pipeline?
I don't need to dynamically read the name of the folders, I can manually create an input for each folder.
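To make the idea concrete, this is roughly what I have in mind, sketched as a small Python helper that generates one input definition per folder. The folder paths, the pipeline name `csv-pipeline`, and the custom field `target_index` are all placeholders I invented, and I'm assuming a `filestream` input with `id`, `paths`, `pipeline` and `fields` is the right shape; I haven't verified this against my setup.

```python
import json

# Hypothetical mapping: one folder per target index.
FOLDER_TO_INDEX = {
    "/data/orders": "orders",
    "/data/stock": "stock",
}


def build_inputs(folder_to_index, pipeline="csv-pipeline"):
    """Build one Filebeat-style input definition per folder."""
    inputs = []
    for folder, index in folder_to_index.items():
        inputs.append({
            "type": "filestream",
            "id": f"csv-{index}",               # one distinct id per input
            "paths": [f"{folder}/*.csv"],
            "pipeline": pipeline,               # assumed ingest pipeline name
            "fields": {"target_index": index},  # custom field carrying the index name
        })
    return inputs


inputs = build_inputs(FOLDER_TO_INDEX)
print(json.dumps(inputs, indent=2))
```

If I understand correctly, a custom field set this way would arrive in the document as `fields.target_index`, so the pipeline could read it from there.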