GC for persistent queue and LS clustering

stefsamm · October 4, 2019, 11:42pm

I'm working on switching our log aggregation from Graylog to a full ELK setup. While reading about LS clustering, I found a recommendation to add queue.checkpoint.writes: 1 to the config to increase durability: https://www.elastic.co/guide/en/logstash/current/deploying-and-scaling.html

Unfortunately, with this setting on, the performance drop is unacceptable. Throughput falls from an average of 15-4K events per second to 500, with gaps between batches sent to ES:

[Screenshot: without the setting]

[Screenshot: with the setting]

Meanwhile, the queue size stays pretty minimal:

My guess is that the culprit is I/O, since we're running LS in the cloud on a regular EBS-backed root volume.
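
One rough way to test the I/O guess is to time appends with an fsync after every write against an fsync after every 1024 writes (1024 being, as far as I know, the default for queue.checkpoint.writes). The sketch below is plain Python, not Logstash code. It only approximates the checkpoint pattern, and the helper names and the 512-byte payload are made up for illustration.

```python
import os
import tempfile
import time


def timed_writes(path, n_events, fsync_every, payload=b"x" * 512):
    """Append n_events payloads to path, forcing the data to disk after
    every fsync_every writes. Returns (elapsed_seconds, fsync_calls)."""
    fsync_calls = 0
    start = time.perf_counter()
    with open(path, "wb") as f:
        for i in range(1, n_events + 1):
            f.write(payload)
            if i % fsync_every == 0:
                f.flush()              # push Python's buffer to the OS
                os.fsync(f.fileno())   # ask the OS to push it to the device
                fsync_calls += 1
    return time.perf_counter() - start, fsync_calls


def compare(directory, n_events=2000):
    """Time per-event fsync against fsync every 1024 events in directory."""
    results = {}
    for label, every in (("every_event", 1), ("every_1024", 1024)):
        path = os.path.join(directory, f"probe_{label}.bin")
        elapsed, calls = timed_writes(path, n_events, every)
        os.remove(path)
        results[label] = {
            "seconds": elapsed,
            "fsync_calls": calls,
            "events_per_second": n_events / elapsed if elapsed > 0 else float("inf"),
        }
    return results
```

Pointing compare() at a directory on the same volume that holds the queue and printing the result gives a ballpark for how many synced writes per second that volume can sustain. If the every_event figure lands near the 500 events per second seen above, that would support the I/O explanation.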

So the first question is: how important is it to have persistent queues with queue.checkpoint.writes set to 1 when we want to run multiple instances of each pipeline?

The other question I'm struggling to find an answer to concerns GC when persistent queues are enabled. The same workers/batch.size settings show very different GC behavior without (left side) and with (right side) the persistent queue:

What is causing such spikes in GC with persistent queues? Is this something that needs to be addressed with other settings/tuning, or is it expected in this case?

Will appreciate any feedback. Thank you!

Badger · October 5, 2019, 12:23am

It doesn't look like you are measuring spikes in GC, you are measuring spikes in memory usage. Those are a really good thing! My guess is that on the left you are looking at more and more and more unpersisted events being stored on the heap, whilst on the right you are seeing them being stuffed in a queue so that logstash can get on and do something. Rapid sawtooth patterns in memory usage are usually really healthy.
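
To tell actual GC cost apart from heap usage, one option is to sample the JVM section of the Logstash node stats API twice and look at how much collector time accumulated in between. This is a minimal sketch: the URL assumes the API is listening on its default localhost:9600, the field names follow the node stats JVM output as I understand it, and the function names are hypothetical.

```python
import json
import time
import urllib.request

# Assumption: the Logstash monitoring API is on its default address.
STATS_URL = "http://localhost:9600/_node/stats/jvm"


def fetch_jvm_stats(url=STATS_URL):
    """Return the "jvm" section of the node stats response as a dict."""
    with urllib.request.urlopen(url, timeout=5) as resp:
        return json.load(resp)["jvm"]


def gc_overhead(before, after, interval_ms):
    """Compare two jvm stats samples taken interval_ms apart.

    For each collector, report how many collections ran and what fraction
    of the interval was spent collecting. Also report the latest heap usage
    so the two signals can be read side by side."""
    report = {}
    for name, new in after["gc"]["collectors"].items():
        old = before["gc"]["collectors"][name]
        spent_ms = new["collection_time_in_millis"] - old["collection_time_in_millis"]
        report[name] = {
            "collections": new["collection_count"] - old["collection_count"],
            "gc_time_fraction": spent_ms / interval_ms,
        }
    report["heap_used_percent"] = after["mem"]["heap_used_percent"]
    return report


def sample_and_report(interval_s=10):
    """Take two samples interval_s seconds apart and summarize them."""
    first = fetch_jvm_stats()
    time.sleep(interval_s)
    second = fetch_jvm_stats()
    return gc_overhead(first, second, interval_s * 1000)
```

A sawtooth in heap usage with a small gc_time_fraction means the collector is keeping up cheaply. A fraction that climbs toward a large share of the interval, especially for the old collector, would be the thing to worry about.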
