Hi,
I've seen some behaviour with persistent queues that surprises me. I noticed this when throughput was capped on a Logstash node after the EBS Burst Balance was exhausted for that node's volume. Here is a CloudWatch screenshot illustrating the behaviour:
Given that the persistent queue length is usually very low or zero, it makes me wonder whether every inbound message is written to and read back from disk before being indexed into Elasticsearch?
The persistent queue overview that correlates with the above screenshot can be seen here:
So, can someone clarify what my expectations should be when using persistent queues on EBS volumes? Why does persistent queueing consume so much burst quota even when the node isn't falling particularly far behind?
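For reference, persistent queueing is enabled with settings along these lines in logstash.yml (the values here are illustrative, not my actual config):

```yaml
# Switch the queue from the in-memory default to the on-disk persistent queue
queue.type: persisted
# Upper bound on the queue's total size on disk
queue.max_bytes: 1gb
# Force a checkpoint (fsync) after this many written events; lower values
# mean more durability but more disk I/O per event
queue.checkpoint.writes: 1024
```

My understanding is that with `queue.type: persisted` every event is persisted before it is handed to the pipeline, which would explain the steady disk I/O even with a near-empty queue.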
Thank you @leandrojmp for your prompt reply. How do users generally handle this? Use provisioned IOPS volumes for persistent queueing? Or put a message queue upstream and skip queueing in Logstash entirely? It seems that persistent queues are a bit of a double-edged sword.
It will depend on your infrastructure and preferences. I do not like to use the Logstash persistent queue; I choose to use Kafka as a message queue.
I send everything to a Kafka cluster and Logstash will consume from the topics in Kafka.
If for some reason I can't send directly to Kafka, I ship to Logstash into a pipeline that does nothing but forward to Kafka; another pipeline then consumes from Kafka and sends to Elasticsearch.
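A minimal sketch of that two-pipeline setup, using the standard `kafka` and `elasticsearch` plugins (hostnames, ports, and topic names here are made up):

```conf
# --- forward.conf: receive from shippers, only forward to Kafka ---
input {
  beats { port => 5044 }
}
output {
  kafka {
    bootstrap_servers => "kafka01:9092"   # hypothetical broker address
    topic_id => "logs"
  }
}

# --- index.conf: consume from Kafka, index into Elasticsearch ---
input {
  kafka {
    bootstrap_servers => "kafka01:9092"
    topics => ["logs"]
    group_id => "logstash-indexer"
  }
}
output {
  elasticsearch {
    hosts => ["http://es01:9200"]         # hypothetical ES node
  }
}
```

Each of these would be a separate pipeline in pipelines.yml, so Kafka acts as the durable buffer instead of the Logstash persistent queue.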