Filebeat harvesting to Logstash is not fast enough

Hi all, when harvesting logs from my machines to Logstash using Filebeat, I noticed that logs are not delivered fast enough, especially when using TRACE-level logging and adding more machines (more Filebeat instances).
My stack is roughly ~40 machines, each writing ~20 log lines per second (so around 800 events/sec in total), going to a single Logstash instance with 2 CPUs and 4 GB RAM (AWS t2.medium). The machine metrics look fine and it doesn't look like Logstash is struggling to process the logs, but I still see a delay of roughly ~1 hour before logs are harvested and sent to Logstash.

What is a reasonable ratio of logs per second to Logstash instances? How many Logstash instances do I need, and at what size? And how can I make Filebeat harvest faster?
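
For reference, these are the Filebeat settings I understand affect throughput towards Logstash; the values below are just illustrative, not my actual config:

```yaml
# filebeat.yml - output/queue settings that typically govern Filebeat -> Logstash throughput
# (example values only, not the config from this setup)
output.logstash:
  hosts: ["logstash.example.internal:5044"]   # hypothetical host, standard Beats port
  # send larger batches per request (default is 2048)
  bulk_max_size: 4096
  # more parallel connections per Logstash host
  worker: 2
  # gzip level 0-9; lower values cost less CPU on the edge machines but use more bandwidth
  compression_level: 3

# internal memory queue feeding the output workers
queue.mem:
  events: 8192
  flush.min_events: 2048
  flush.timeout: 1s
```

Is tuning these the right direction, or should the default Filebeat config be enough at ~800 events/sec?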

I would be surprised if the Filebeat harvester is the bottleneck.

I do not think t2.medium is a very suitable instance type, as it has a quite limited, burstable CPU allocation, so it could very well be that your Logstash instance is the bottleneck. I would recommend upgrading to an m4/m5.large instance instead and seeing if that improves the throughput. You also need to make sure that Elasticsearch (or any other output used) is able to process data fast enough, as this will also limit Logstash throughput.
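
If you want to rule Logstash itself in or out, the usual knobs to check are the pipeline worker and batch settings in logstash.yml (example values below, not a recommendation for your workload):

```yaml
# logstash.yml - pipeline settings that control how much work Logstash does per cycle
# (illustrative values; defaults are workers = number of CPU cores, batch.size = 125, batch.delay = 50 ms)
pipeline.workers: 2
pipeline.batch.size: 250
pipeline.batch.delay: 50
```

Note that on a 2-CPU instance the defaults already cap you at 2 pipeline workers, which is another reason a larger instance type tends to help.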

Hi Christian, thanks for the reply. I'll try upgrading the machine to m5.large and see if there is any change, but as I said, the machine stats are very low, so I don't think the machine size is the issue here.

I'm using 2 outputs: S3 and a 3-node m5.large Elasticsearch cluster. Aren't the logs delivered to the outputs asynchronously? How could the outputs be the bottleneck, and how can I measure that?
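
Is the per-pipeline stats output from the Logstash monitoring API (something like the command below, assuming the default API port) the right place to look for which output is slow?

```sh
# Per-pipeline and per-plugin event counts / timings from the Logstash monitoring API
# (default API port is 9600; adjust the host if querying remotely)
curl -s 'http://localhost:9600/_node/stats/pipelines?pretty'
```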

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.