Filebeat and Logstash tuning - How to align performance

bar0n36 · March 22, 2022, 5:19am

I am in the process of trying to ingest a massive backlog of logs (10s of TB) via Filebeat > Logstash > Ingest Node > Elasticsearch.

I have scaled out the pipeline extensively and am now at a point where I am struggling to get more throughput out of the pipeline. The current throughput is VERY peaky, despite the fact the logs being available on disk, and there being very little network latency.

The peaky-ness appears to be between Filebeat and Logstash. Basically I am getting a rate of between 10 and 25 thousand events per second, yet neither component is close to saturated for memory, CPU or IO. My question is, is there a guide or rule of thumb about aligning the batch sizes, worker threads etc etc between Filebeat and Logstash to ensure that they work efficiently together? Is there a relationship between bulk_max_size with Filebeat and the pipeline.batch.size in Logstash? I would have thought it would have made sense to have them set to similar values so that 1 filebeat batch triggers 1 logstash batch execution?

I have spent several hours fiddling with the various bits and pieces and am not having a great deal of luck. I had my pipeline thoroughly saturated at 2 Logstash nodes, but have added 2 additional nodes and I am not getting the gains I would have hoped to get.

Thanks in advance.

bar0n36 · March 24, 2022, 10:22pm

So I largely answered my own question with further exclusion testing on a single Filebeat > Logstash > Ingest node over several hours and by reviewing the fine print on https://www.elastic.co/blog/how-to-tune-elastic-beats-performance-a-practical-example-with-batch-size-worker-count-and-more

The gist of it is align the batch sizes right through the stack and ensure the worker counts are tested for the batch size for optimal throughput. Having differing batch sizes at different points in the pipeline is sub-optimal (as I would have expected) from my experience.

Key tuning points:

Filebeat queue.mem
Filebeat output.logstash.bulk_max_size / worker
Logstash pipeline.batch.size
Logstash pipeline.workers

system · April 22, 2022, 12:23am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Rightsizing elastic batch size and number of workers Logstash	5	496	December 21, 2021
Filebeat sending data to Logstash seems too slow Beats filebeat	20	22139	June 1, 2017
Tuning up filebeat Beats filebeat	2	1799	January 3, 2019
Filebeat 6.2 throughput and general performance Beats filebeat	7	4461	April 3, 2018
Increasing throughput from Filebeat to Logstash Beats filebeat	1	1191	November 1, 2019

Filebeat and Logstash tuning - How to align performance

Related topics