When I run it in my terminal, to stop logstash I have to press ctrl +C, which is quite annoying because I have to cronize my script so that it runs every day at a given time.
Every day I receive csvs in a directory. logstash ingests them into elasticsearch. How do I tell it that after integrating a csv into elasticsearch, it moves to another directory or simply that it doesn't ingest the same file that it already ingested the day before?
Logstash cannot move files, only delete, but it needs to use the file input with the mode set as read.
Per default it will use the mode as tail, which will constantly look for changes in the files, if the files in the directory are not being constantly updated, then you could change the mode to read and configure logstash to delete the files.
Check the documentiaton for more information about it.
This is done by the sincedb_path configuration, since you set it to /dev/null you are telling logstash to reprocess everything. If you want it to not process files that it already read you need to point the sincedb_path to a custom file or just remove this setting and logstash will create a sincedb file per default.
Yeah, but this doesn't matter if the logstash user cannot access the path, the logstash user needs to have permissions on the path as well, does it?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.