Filebeat Intermittently Hanging with Increasing Memory Cache while Processing High-Traffic Nginx Accesslog

ryoni88 · July 26, 2023, 5:09pm

Hello,

I have set up Filebeat to ship a high-traffic Nginx access log to Logstash. However, Filebeat intermittently hangs, and each hang coincides with a steady increase in the container's memory cache.

Filebeat and Nginx run as separate containers in the same pod in a Kubernetes (k8s) setup, and they share the access log directory through a mounted volume.

Accesslog files are rotated every 4 hours. The rotation method is simple: rename, then gzip compression. It goes like this:

application.log -> application.{date}.log -> application.{date}.log.gz
and then we start writing a new application.log.

I've observed a peculiar pattern: a hung Filebeat resumes when logrotate runs. When the log file is rotated, Filebeat starts working again (with memory cache usage decreasing), and logs are sent until, after a certain period of time, it hangs again. I speculate that this could be a problem with how the harvester uses file descriptors.
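
One way to check the file descriptor theory, sketched here as an assumption rather than a confirmed diagnosis: Filebeat writes its periodic internal metrics at info level, so they are not logged while `logging.level` is `warning`. Temporarily raising the level should make harvester counters such as `filebeat.harvester.open_files` and `filebeat.harvester.running` visible around the time of a hang. The fragment below only changes logging and is not part of the configuration actually in use.

```yaml
# Diagnostic sketch only; values here are assumptions, not the running config.
logging:
  level: info            # periodic metrics are emitted at info level
  to_files: true
  to_syslog: false
  metrics:
    enabled: true        # periodic "non-zero metrics" reporting
    period: 30s          # reporting interval
  files:
    path: /myfilebeatpath/logs
    name: filebeat-plain.log
    keepfiles: 10
```

If the open file count keeps growing between rotations, that would support the speculation; if it stays flat, the cause is more likely elsewhere.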

In addition, Filebeat does not hang during early morning hours when traffic is low. During the daytime when traffic is high, the log file size increases to about 5GB every 4 hours.

Filebeat version: 7.12.1
Filebeat configuration:

queue.mem:
  events: 40960
  flush.min_events: 20480

filebeat.inputs:
  - type: log
    fields:
      _@type: nginx-log
      instance_name: myinstance
    fields_under_root: true
    exclude_files: ['\.gz$']
    paths:
      - mylogpath/application.log*

output:
  logstash:
    hosts: ["mylogstashinfo"]
    loadbalance: true
logging:
  level: warning
  to_files: true
  to_syslog: false
  files:
    path: /myfilebeatpath/logs
    name: filebeat-plain.log
    keepfiles: 10

Any insights or possible solutions for this hanging issue would be greatly appreciated.

Thank you!

ryoni88 · July 26, 2023, 5:10pm

Problematic Filebeat container's memory usage information:

[Image: memory usage graph]

ryoni88 · July 26, 2023, 5:10pm

Normal Filebeat container's memory usage information:

[Image: memory usage graph]

jsoriano · July 31, 2023, 4:35pm

Hey @ryoni88, welcome to Discuss!

Would you have the chance to update to Filebeat 7.17 (or 8.x)? The newer filestream input is GA in that version and may solve performance issues found in the log input. You can give this input a try on 7.12, but it was still in beta then.

You can read more about this input on the "filestream input" page of the Filebeat Reference [7.17].
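
As a rough sketch of what the switch could look like, here is the input from the original post rewritten for filestream. It assumes Filebeat 7.17; the `id` value is a hypothetical name chosen for illustration, and the option names should be checked against the reference for the version actually deployed. Note that in filestream the file exclusion moves under `prospector.scanner`.

```yaml
filebeat.inputs:
  - type: filestream
    id: nginx-access                # hypothetical ID; should be unique per filestream input
    paths:
      - mylogpath/application.log*
    # filestream equivalent of the log input's exclude_files
    prospector.scanner.exclude_files: ['\.gz$']
    fields:
      _@type: nginx-log
      instance_name: myinstance
    fields_under_root: true
```

The filestream input keeps its own registry state, so files already read by the log input may be read again after the switch.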

ryoni88 · August 21, 2023, 5:05pm

Hello, @jsoriano
I want to express my deepest gratitude once again for your recommendation.

I have upgraded Filebeat to version 7.17.12 and configured it to use the filestream input type. I applied this to half of the instances we operate and tested it over several days. As a result, there has been a significant performance improvement, with about a 20% increase in the volume of logs ingested.

However, log ingestion gaps are still occurring. If you look at the attached Kibana view, you will see that there are gaps in log ingestion that are resolved at the logrotate interval. I suspect that Filebeat may have a resource leak when handling large files.

This pattern occurs during high-traffic periods from 12:00 to 24:00 and is characterized by a rapid increase in container memory cache (page cache) usage, which is resolved at the logrotate interval.

Increases around 15:00 and resolves at 16:00 logrotate
Increases around 19:00 and resolves at 20:00 logrotate
Increases around 23:00 and resolves at 24:00 logrotate

I could run logrotate more frequently, but that doesn't seem like a fundamental solution. Do you have any other tuning suggestions to further improve Filebeat performance in this regard?
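
For context, a sketch of the throughput-related settings that exist for the memory queue and the Logstash output. The values marked illustrative are assumptions for experimentation, not tested recommendations for this workload, and none of them is confirmed to address the gaps described above.

```yaml
queue.mem:
  events: 40960
  flush.min_events: 20480
  flush.timeout: 1s          # illustrative; max wait before a partial batch is flushed

output:
  logstash:
    hosts: ["mylogstashinfo"]
    loadbalance: true
    worker: 2                # illustrative; default is 1 worker per host
    bulk_max_size: 4096      # illustrative; default is 2048 events per batch
    pipelining: 2            # default; number of in-flight batches per connection
```

Changing one setting at a time and comparing ingestion rates across the two halves of the fleet would make the effect of each easier to isolate.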

ryoni88 · August 21, 2023, 5:07pm

I will also attach the memory usage of filebeat.

system · September 18, 2023, 7:08pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
