How to slow down a large amount of data coming from Filebeat?

Hi Team,

How would you slow down a large amount of data streaming from Filebeat to Logstash so that it can be processed accurately in the filter section?

Thanks and Regards,
Sagar Mandal

Hi @Sagar_Mandal,

That should be mostly automatic. Of course, it depends on how much data would have to be cached...

I can't find this in any official Elastic documentation, but as far as I remember it is part of the Lumberjack protocol that is used between Filebeat and Logstash.

From Send Your Data | Logz.io Docs

One of the facts that make Filebeat so efficient is the way it handles backpressure: if Logstash is busy, Filebeat slows down its read rate and picks up the beat once the slowdown is over.
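
The practical question is then how much Filebeat will buffer while Logstash pushes back, and whether Logstash itself buffers to disk. A minimal sketch of the relevant knobs, with placeholder hosts and example values (not recommendations):

  # filebeat.yml
  queue.mem:
    events: 4096               # events Filebeat holds in memory before backpressure reaches its inputs
    flush.min_events: 2048
    flush.timeout: 1s

  output.logstash:
    hosts: ["logstash:5044"]
    bulk_max_size: 2048        # events per batch sent over the Lumberjack protocol

  # logstash.yml -- optional disk buffer so slow filters don't immediately push back to Filebeat
  queue.type: persisted
  queue.max_bytes: 4gb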

But from personal experience, if the Logstash filter section is very process-heavy, you can still get into trouble. I have killed my Logstash instances with sub-optimal filters, especially grok filters with poorly written patterns and no anchoring.
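
For illustration, anchoring just means pinning the pattern to the start (and, where possible, the end) of the line so that non-matching events fail fast instead of being retried at every offset. A sketch with a made-up pattern and field names for a hypothetical "IP METHOD PATH" log line:

  filter {
    grok {
      # ^ and $ anchor the pattern, so a line that doesn't match fails immediately
      # instead of the regex engine re-trying the match at every position.
      match => { "message" => "^%{IPORHOST:client} %{WORD:method} %{URIPATHPARAM:request}$" }
    }
  }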

Okay, so the thing is that about 100GB of data comes in from Filebeat to Logstash every single day, and it then goes through a filter section where a lot of conditional processing is done.

100GB per day is a lot of data. I would recommend something like:

filebeat --> kafka --> multiple logstash instances --> elasticsearch

Bursts of messages can then be queued in Kafka, and multiple Logstash instances can be used to scale the post-processing of that data.
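
A minimal sketch of that chain, with placeholder host names and a hypothetical topic called filebeat-logs:

  # filebeat.yml -- ship to Kafka instead of Logstash
  output.kafka:
    hosts: ["kafka1:9092", "kafka2:9092"]
    topic: "filebeat-logs"
    compression: gzip

  # pipeline .conf on each Logstash consumer instance
  input {
    kafka {
      bootstrap_servers => "kafka1:9092,kafka2:9092"
      topics => ["filebeat-logs"]
      group_id => "logstash"      # same group on every instance so they share the partitions
      codec => "json"             # Filebeat writes JSON-encoded events to Kafka
    }
  }
  output {
    elasticsearch {
      hosts => ["http://elasticsearch:9200"]
      index => "filebeat-%{+YYYY.MM.dd}"
    }
  }

Because all the Logstash instances join the same consumer group, Kafka spreads the topic's partitions across them, which is what lets you scale the filter work horizontally.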

Rob

We are doing about 300GB of logs (about 400M documents) with 4 Logstash instances that have 12 CPU cores each. To be fair, the load is < 1 at the moment. I did spend a lot of time optimizing our Logstash filters.

We are working on adding Kafka to the mix, not so much to deal with spikes but to be able to queue all messages during maintenance or if for some reason Logstash or Elasticsearch breaks completely.

@A_B, as you make the move to Kafka, here are a few things that will really boost throughput...

  1. increase pipeline.batch.size from the default of 125 to at least 1024 (1280 was best in my environment)
  2. increase pipeline.batch.delay from the default of 50 ms to at least 500 ms (1000 ms was best in my environment)
  3. in the kafka input, set max_poll_records to the same value as pipeline.batch.size
  4. each thread defined by consumer_threads in the kafka input will be an instance of a consumer, so if you have 4 Logstash instances with 2 threads each, that is 8 consumer instances. Your Kafka topics must have at least 8 partitions for all consumer threads to ingest data. You will want more partitions than your current needs so you can easily scale in the future.
  5. the number of pipeline.workers should be at least equal to consumer_threads.
  6. the kafka output should set batch_size to at least 16384

You may end up tweaking some of the buffer settings as well, but the above will give you a good starting point.
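
Pulling the list together, a sketch of what that could look like; the hosts and topic name are placeholders and the numbers are just the examples from the points above, not universal recommendations:

  # logstash.yml (each instance)
  pipeline.workers: 2          # at least equal to consumer_threads below (point 5)
  pipeline.batch.size: 1024
  pipeline.batch.delay: 500

  # kafka input in the pipeline .conf
  input {
    kafka {
      bootstrap_servers => "kafka1:9092,kafka2:9092"
      topics => ["filebeat-logs"]
      consumer_threads => 2        # 4 instances x 2 threads = 8 consumers, so the topic needs >= 8 partitions
      max_poll_records => "1024"   # keep in step with pipeline.batch.size (point 3)
    }
  }

  # kafka output, only where a Logstash tier writes into Kafka (point 6)
  output {
    kafka {
      bootstrap_servers => "kafka1:9092,kafka2:9092"
      topic_id => "filebeat-logs"
      batch_size => 16384
    }
  }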

Rob
