Hi Everyone,
I just have a query regarding last read s3 file or the last read line in the s3 file.
How or where is that data stored? and how is that shared among multiple pods running logstash?
What if the pod is replaced? How does the new pod know the last read location in s3?
The plugin stores the time of the last read inside a file in the configured sincedb_path.
It is not shared, the plugin was not build to have multiple inputs consuming from the same bucket, so to avoid duplication you can have only one input per bucket or prefix.
Thanks for the reply @leandrojmp .
A follow up question on that.
If the logstash container restarts, the file might get deleted from the configured or default sincedb_path. How is that handled in case of logstash running as a container?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.