When running Filebeat on a Kubernetes node, the system spends a large amount of CPU cycles on iowait. According to iotop, the system is writing 10 MB/s during this time, while reads account for upwards of 150 MB/s. The Filebeat container gradually uses more memory as the I/O load increases, until the memory limit we set up on the pod is reached and the pod gets restarted. There are no logs in the Filebeat container indicating that the pod is entering an error state or failing to send logs to Kafka. Is there a setting we are overlooking that is causing this? Here is what we have for the two input types used by the Filebeat container. The node uses the Docker json-file logging driver with the default file size and count.
```yaml
- type: docker

- type: log
  paths:
    - "path 1"
    - "path 2"
    - "path 3"
```
Can you please use the </> button to correctly format the configuration?
Formatting has been corrected.
Is the docker spec under a Kubernetes template? If so, you may want to avoid using `*` for `containers.ids`, as noted in the docs. See the warning above this section: https://www.elastic.co/guide/en/beats/filebeat/7.0/configuration-autodiscover.html#_kubernetes
The spec is part of a Kubernetes resource definition that Filebeat reads from. We do not use autodiscover. Should we use it instead of matching all container IDs with `*`? The Docker input documentation tells us to use `containers.ids: '*'` to read from all containers.
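For reference, the autodiscover docs linked above scope the Docker input to each discovered container instead of using a global `'*'`. A minimal sketch of that shape (the namespace condition here is a placeholder for illustration, not our actual setup):

```yaml
filebeat.autodiscover:
  providers:
    - type: kubernetes
      templates:
        - condition:
            equals:
              kubernetes.namespace: default
          config:
            - type: docker
              containers.ids:
                # One input per discovered container, instead of '*'
                - "${data.kubernetes.container.id}"
```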
Sorry, I made a wrong assumption. Also, Filebeat tends to consume more memory and CPU cycles as more and more files are harvested. Do you know how many files are being processed?
We are creating roughly 5-8 new files every minute due to Docker log rotation. We have at most 42 Docker log files on disk, and 10-12 harvesters running when Filebeat starts consuming a lot of I/O. Filebeat has around 900 open files, many of which are Docker log files that have been deleted but whose inodes Filebeat still holds open. Filebeat does not show any error messages; it only shows metrics being published.
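The deleted-but-still-open files suggest looking at the harvester close and clean settings on the log input. A hedged sketch of the options that control this behavior (option names are from the Filebeat 7.x log input docs; the values below are illustrative, not recommended production settings):

```yaml
- type: log
  paths:
    - "path 1"
  # Close the harvester when the file is renamed or removed.
  # close_removed defaults to true, but close_renamed does not,
  # and rotated Docker logs are renamed before removal.
  close_renamed: true
  close_removed: true
  # Hard upper bound on how long any harvester stays open.
  close_timeout: 5m
  # Drop registry state for files that no longer exist on disk,
  # so the registry file does not grow without bound.
  clean_removed: true
```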
This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.