Filebeat can't keep up with logs volume

nsouth · October 22, 2021, 7:53pm

We are trying to run Filebeat on a production server to ship logs to Logstash. However, it is failing to ship logs as quickly as they are being written. If I inspect the Filebeat's registry, it shows that over the past 3 hours it has only processed 26 GB of an 81 GB log file. Our expected volume is 400 - 800 GB of logs per day from this one log file. We are also seeing Filebeat take 60-70% CPU utilization on this server, which is a very beefy server. This makes us wonder if Filebeat can handle the task at hand.

We have checked our Logstash ingester (which sits in front of RabbitMQ) and it does not seem to be bottlenecking. The rest of our pipeline seems healthy.

I thank you for any suggestions.

filebeat.inputs:
- type: log
  paths:
     - /mypath/log.csv
processors:
- add_fields:
    fields:
      myfield1: "1"
      myfield2: "2"
      myfield3: "3"
logging.level: info
logging.to_files: true
logging.files:
  path: /var/log/filebeat
  name: filebeat
  keepfiles: 7
  permissions: 0644
output.logstash:
  hosts: ["my-load-balancer.domain.com:5045"]
  ssl.certificate_authorities: ["/etc/pki/tls/certs/ca-bundle.crt"]
  worker: 2
  bulk_max_size: 3200

leandrojmp · October 22, 2021, 8:39pm

What is your Logstash config pipeline?

The bottleneck could be not on Filebeat, but on Logstash or on the outputs, for example, if you are using elasticsearc as an output and it can't keep up with the events per second rate, it will put back pressure on logstash that will then put back pressure on filebeat.

On most of the time the bottleneck is the logstash outputs or pipeline, not on filebeat.

What are your outputs?

nsouth · October 22, 2021, 8:43pm

Filebeat outputs to an ingestion Logstash layer which does not filter and only forwards to RabbitMQ, from which another Logstash layer applies filters and outputs to Elasticsearch. We have not seen any signs of bottlenecks in any of these downstream layers.

nsouth · October 24, 2021, 5:45pm

Update: We are in a good state now. I was able to use exclude_lines to filter out a lot of noise which didn't need to be shipped and that allowed Filebeat to keep pace. We also were able to decrease the log size by stopping some superfluous processes.

Also, I misspoke when I said that Filebeat took up 60-70% of the CPU. It took up that much on one core of many cores.

system · November 21, 2021, 7:46pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Filebeat unable to cope with incoming logs Beats filebeat	7	1538	February 8, 2018
Filebeat can't keep up with the logs Beats filebeat	3	1466	May 12, 2021
Overcome 2600 msg/seconds with Logstash/Filebeat Logstash	3	1222	July 6, 2017
Speed limitations of filebeat? Beats filebeat	14	15214	July 5, 2017
Filebeat get big log file Beats filebeat	2	3743	July 5, 2017

Filebeat can't keep up with logs volume

Related topics