I have a relatively large Elasticsearch cluster into which logs from many Filebeat instances are shipped. There are plenty of duplicates, and we can't identify their origin with certainty. Because of resource constraints, we can't use the ID that the add_id processor generates as the _id for Elasticsearch.
We added the add_id processor as a new unique ID so that we could get an idea of whether Filebeat or Logstash is producing the duplicates.
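For reference, a minimal add_id setup looks something like this (the field name `event_id` is just an example here; by default add_id writes to `@metadata._id`):

```yaml
# filebeat.yml (sketch, not our full config)
processors:
  - add_id:
      target_field: event_id   # custom field so the ID survives into the stored document
```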
So my question is: how does Filebeat handle resends? Does it generate a new unique ID with add_id when it ships a line for the second time, or does it resend the line with the same unique ID?