I have a setup where Logstash reads Kubernetes logs from 20 different S3 buckets and sends them to ELK. The logs seem to arrive in ELK 3-5 minutes late. Logstash runs in Docker on a VM with 31GB Xms/Xmx.
I am using one pipeline. I tried 6 pipelines with 2-3 buckets each and got duplicate/triplicate events from each pipeline.
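I suspect the duplicates came from the same bucket being listed in more than one pipeline: with delete => true, two pipelines polling the same bucket can both download an object before either one deletes it. If I retry the split, each bucket should appear in exactly one pipeline, something like this (pipeline ids and config paths are placeholders):

```
# pipelines.yml -- one pipeline per group of buckets, no bucket listed twice
- pipeline.id: s3-group-1
  path.config: "/usr/share/logstash/pipeline/s3-group-1.conf"   # buckets 1-4
- pipeline.id: s3-group-2
  path.config: "/usr/share/logstash/pipeline/s3-group-2.conf"   # buckets 5-8
```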
How can I speed up Logstash ingestion from S3 buckets?
The buckets may have had lots of files initially, but now we have a regular number of files. I have delete => true to delete files after processing. The VM has 10 CPUs assigned. I started with 16GB for the JVM and kept increasing it.
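For reference, each bucket is read with an input roughly like this (the bucket name and region are placeholders; interval is how often the plugin lists the bucket for new objects):

```
input {
  s3 {
    bucket   => "k8s-logs-cluster-a"   # placeholder bucket name
    region   => "us-east-1"            # placeholder region
    prefix   => "cluster_name/"
    delete   => true   # delete each object after it is processed
    interval => 60     # seconds between bucket listings
  }
}
```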
@leandrojmp do I need to change the way the Kubernetes clusters save logs to the buckets to lower the number of files? Right now files are saved in each bucket as "cluster_name/yyyy/mm/dd/yyyymmddxxxxxx__yy.gz".
If you can reduce the number of files, I think you should try to do it.
The Logstash s3 input has a couple of issues when working with buckets that contain a lot of files.
Personally, I do not use this input because the performance was pretty bad in my use case (logs from AWS services) and I was not able to fix or improve it, so I ended up writing a custom collector.