I would like to move from Logstash to Filebeat as a log shipper.
I'm currently using the Logstash file input plugin to collect logs (Filebeat would do the same job) and send everything to a centralized logstash-shipper before writing to Elasticsearch.
The thing is, if I shut down Logstash and start a fresh Filebeat instance instead, Filebeat will start from the beginning of each file, leading to duplicate logs in Elasticsearch.
I could add a hash of the log content as the Elasticsearch document_id on the logstash-shipper side to avoid duplicates, but I have to admit I'd like an easier solution.
Would you have any idea how to "bootstrap" a Filebeat instance with the Logstash file cursor, maybe?
I never tried it, but I think it should be possible to write a small script in your preferred language that takes the sincedb from LS and converts it into a Filebeat registry file. An alternative is using tail_files in Filebeat, but if log lines were added between shutting down LS and booting up Filebeat, these are lost.
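To give an idea of what such a script could look like, here is a minimal sketch. It assumes the common four-column sincedb layout (inode, major device number, minor device number, byte offset) with the file path appended as a fifth column, and it emits the old single-JSON-object registry layout used by early Filebeat versions; the registry format has changed between versions, so check what your Filebeat actually writes before relying on this. The file path and device-number mapping are illustrative assumptions.

```python
# Hypothetical sketch: convert a Logstash file-input sincedb into a
# Filebeat registry file. Registry layout and sincedb columns are
# assumptions; verify against your Logstash/Filebeat versions.
import json


def sincedb_to_registry(sincedb_text):
    registry = {}
    for line in sincedb_text.splitlines():
        fields = line.split()
        if len(fields) < 5:
            # Older sincedb entries have no path column, so there is no
            # reliable way to map the inode back to a file from here.
            continue
        inode, major, _minor, offset = fields[:4]
        path = " ".join(fields[4:])
        registry[path] = {
            "source": path,
            "offset": int(offset),
            "FileStateOS": {
                "inode": int(inode),
                # Assumption: Filebeat's "device" corresponds to the
                # major device number in the sincedb.
                "device": int(major),
            },
        }
    return registry


if __name__ == "__main__":
    # Example sincedb line (values are made up for illustration).
    sample = "1835010 0 2049 4096 /var/log/app/app.log"
    print(json.dumps(sincedb_to_registry(sample), indent=2))
```

You would run this over the sincedb file, write the resulting JSON to Filebeat's registry path, and start Filebeat with the same log paths configured.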
Another solution could be to write the Filebeat logs to a different index, manually check (based on the timestamp?) what the time range of the duplicated events is, and then use delete_by_query to remove these events from one of the two indices. That would mean running both shippers for a certain time. This is also what I would recommend.
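For the cleanup step, the request could look roughly like this (index name and timestamp bounds are placeholders for your overlap window; note that on ES 2.x delete-by-query is a separate plugin, while from 5.0 it is the built-in `_delete_by_query` API):

```
POST /filebeat-temp/_delete_by_query
{
  "query": {
    "range": {
      "@timestamp": {
        "gte": "2017-01-01T10:00:00Z",
        "lte": "2017-01-01T10:05:00Z"
      }
    }
  }
}
```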
Maybe losing some logs by using tail_files will be acceptable for my client.
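If you go that way, a minimal prospector config with tail_files might look like this (the log path is a placeholder, and the exact config layout depends on your Filebeat version; also note tail_files only affects files that have no existing registry state):

```yaml
filebeat:
  prospectors:
    - paths:
        - /var/log/app/*.log
      # Start reading new files at the end instead of the beginning.
      tail_files: true
```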
Converting the sincedb into a filebeat registry file seems ok to me, I'll look into it.