Filebeat not starting log aggregation on some kubernetes worker nodes

lbenton · February 23, 2021, 1:26pm

Hi All,

I've run into an issue with filebeat in kubernetes, but only on some worker nodes. If useful, here's a bit of background

-Deployed using helm, via terraform
-currently 16 worker nodes, 7 of which filebeat refuses to process logs.
-filebeat version 7.10.2
-Running in AKS

Looking through the logs I'm just not able to tell why some filebeat pods work and some don't.

I've created a gist with:

Logs from a good pod (DEBUG enabled)
Logs from a bad pod (DEBUG enabled)
filebeat.yaml used to deploy across all the nodes

Things I've tried:

Destroying and recreating the EFK stack and filebeat
Wiping the beats-data persistence storage location on worker nodes for both working and non-working filebeat pods
Rebooting
Verified logs are mounted and readable from within the pods

This will sound crazy but the only common thread I can find is that the filebeat pods that are not working are all on worker nodes whose name ends in a letter. I think it's coincidental but thought I'd mention it.

Thanks in advance for any help. I do appreciate it.

system · March 23, 2021, 3:27pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
When I restart the pod and move the worker node, filebeat does not start collecting logs Beats docker , filebeat	4	333	March 10, 2021
Can't get logs from some pods Beats filebeat	1	1149	February 7, 2020
Filebeat Pod not working properly Beats docker , filebeat , metricbeat	4	1094	March 10, 2021
Filebeat is partially collecting logs Beats filebeat	3	817	May 9, 2019
Filebeat does not harvest logs Beats filebeat	5	1545	February 23, 2019

Filebeat not starting log aggregation on some kubernetes worker nodes

Related topics