Big delay in writing to Elasticsearch from some log inputs of Filebeat

r2r2 · April 14, 2020, 11:14pm

Hello!
I had to migrate my Elasticsearch 7.0.1 in a docker container from ssd to hdd.
After that I can see some filebeat log inputs are very late.
I use filebeat to send Nginx logs using a log file input for multiple files and use my custom pipeline for parsing them. Data from access log files with little write load appears in Elasticsearch very fast. But data from busy log files (from the same filebeat and the same server) is late for 4-6 hours.

How does the concurency work in this case? I don't know which parameters I need to tune. Is it possible to solve this problem by simply increasing count of workers?

Best regards,
Artur

Luca_Belluccini · April 14, 2020, 11:23pm

Hello @r2r2

First of all, I want to inform you Filebeat has a Nginx module. Consider it in the future.

Regarding your indexing delay, it is necessary to first understand if Filebeat is slow parsing the log files or if Elasticsearch is getting overwhelmed and it is sending 429 TOO MANY REQUESTS to Filebeat.

Check the Filebeat logs for errors or messages related to Elasticsearch or "output pipeline"
Check the Elasticsearch logs for errors related to write queue or EsRejectedExecutionException (see this blog post)

r2r2 · April 15, 2020, 11:41pm

@Luca_Belluccini
Thank you for your answer!
I know about Nginx module but we have too much customization in log fields order and pipeline. It's also very comfortable to put pipeline into ES from git-server instead of filebeats.

There aren't any warning or error messages in the filebeat journal. only INFO: count of metrics and harvester's start/close.
I need more time to check all the details described in the blog post. Now I just can say my log file doesn't give me any scary lines with grep -i "reject\|error\|fail\|429\|warn\|wrn". And today the delay of logs from the busiest access log file was in a range from 3 till 90 minutes.

Luca_Belluccini · April 16, 2020, 10:41pm

First check if Elasticsearch is queueing or not.
See the output of GET _cat/thread_pool/write?v, you should have no rejections and queue ~ 0.

If Elasticsearch is fine, you can start tweaking the Filebeat configuration.
By default the bulk sizes of Filebeat might be small.
Try with:

output.elasticsearch:
  bulk_max_size: 1000
  workers: 2

If you have multiple nodes in your cluster, provide the list of hosts (excluding dedicated masters).

Another good blog post about Beats is available here.

If possible, try to enable Beats Monitoring and Elasticsearch Monitoring to get more metrics on what is happening.

r2r2 · April 17, 2020, 3:28pm

Elasticsearch has neither rejects nor queue.
The reason was in Filebeat. These parameters helped me:

output.elasticsearch:
bulk_max_size: 1000
workers: 2

Thank you for you help!

system · May 15, 2020, 3:28pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Filebeat sends logs with delay Beats filebeat	2	1492	August 29, 2022
About the time spent when output to elasticsearch Beats filebeat	7	807	September 5, 2018
Filebeat slowing to a halt within 20-30 minutes of starting Beats filebeat	4	1362	May 19, 2017
How to push logs to elasticsearch in filebeat? Beats filebeat	2	987	March 9, 2020
Filebeat stops sending to es eventually, bulk index 400 Beats filebeat	10	4182	July 14, 2017

Big delay in writing to Elasticsearch from some log inputs of Filebeat

Related topics