I have ECK deployed on a cluster with 10 nodes. There is a large file that I want to upload to Elasticsearch, and the file is stored on a network drive mounted on each machine, so every node can access it at the same path. When I use Filebeat, one instance runs on every node, and since they all share the same configuration, each instance reads the file and sends the same data to the same index. How can I tell Filebeat to read the file from only one instance? Any advice would be appreciated.
I don't think you can; if you have multiple Filebeat instances reading the same file, you will get duplicates.
In this case you need to have just one Filebeat instance reading from that file.
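For reference, a minimal sketch of what that single instance's input configuration could look like. The mount point and file name below are just placeholders, adjust them to wherever your network drive is mounted:

```yaml
filebeat.inputs:
  # Read only the shared file from the network mount
  # (placeholder path -- replace with your actual mount point and file)
  - type: filestream
    id: shared-network-file
    paths:
      - /mnt/share/bigfile.log
```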
But this means I cannot run Filebeat as a DaemonSet on Kubernetes, right? All the instances created that way would have the exact same configuration and hence read the same file.
I do not use Kubernetes, but as mentioned you need to have just one instance of Filebeat reading from that file.
Can you not configure the DaemonSet to run on just one node? From this example it seems possible to do something like that.
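Something along these lines (untested) might work with the ECK Beat resource: label a single node and use a nodeSelector in the pod template so only that node runs the instance that reads the file. The label key/value, Elasticsearch resource name, version, and paths below are all placeholders:

```yaml
# First, label one node so only it schedules this Filebeat (placeholder label):
#   kubectl label node <node-name> filebeat-reader=true

apiVersion: beat.k8s.elastic.co/v1beta1
kind: Beat
metadata:
  name: shared-file-reader
spec:
  type: filebeat
  version: 8.13.4
  elasticsearchRef:
    name: elasticsearch                # placeholder: your Elasticsearch resource name
  config:
    filebeat.inputs:
      - type: filestream
        id: shared-network-file
        paths:
          - /mnt/share/bigfile.log     # placeholder: file on the network mount
  daemonSet:
    podTemplate:
      spec:
        # Schedule only on the labeled node, so a single instance reads the file
        nodeSelector:
          filebeat-reader: "true"
        containers:
          - name: filebeat
            volumeMounts:
              - name: shared-data
                mountPath: /mnt/share
                readOnly: true
        volumes:
          - name: shared-data
            hostPath:
              path: /mnt/share         # placeholder: host path of the network mount
```

If I remember correctly, the Beat resource also accepts a deployment block with replicas: 1 instead of daemonSet, which would avoid the node label entirely, but check the ECK docs for your version.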