Filebeat: logstash.pipelining: how does it work?


(Maxim Gueivandov) #1

Hello,

Since the documentation is pretty succinct on the logstash.pipelining parameter (and Google searches mostly return articles about LS and ES pipelines), I'd like to know how these three parameters interact:

  • logstash.pipelining
  • filebeat.spool_size
  • logstash.bulk_max_size

Should one assume that, if pipelining is enabled, filebeat.spool_size must be equal to logstash.pipelining * logstash.bulk_max_size? In other words, what would be the size of one pipelined request "chunk"?

Also, any recommendations for the value of logstash.pipelining?
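
For context, here is a minimal filebeat.yml sketch (Filebeat 5.x option names; the host and values are purely illustrative, not recommendations) showing where the three settings live:

    filebeat.spool_size: 4096     # events buffered before a flush

    output.logstash:
      hosts: ["localhost:5044"]   # hypothetical Logstash endpoint
      bulk_max_size: 2048         # max events per request to Logstash
      pipelining: 3               # batches in flight before waiting for ACKs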

Thanks,
MG


(Steffen Siering) #2

See the Wikipedia article on pipelining to get an idea of what it is for.

Logstash internally uses some windowing, starting with 10 events and growing exponentially up to bulk_max_size. If pipelining is enabled, the 'windowed' batches will be pipelined.

The spool_size (removed in 6.0 beta1) is the maximum number of events pushed to the output on a flush. This batch is split into sub-batches of at most bulk_max_size events, which are again split according to the current window size. In 6.0 we will remove the spooler in favour of fully asynchronous sends.
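
To make the splitting concrete, a worked example with illustrative numbers (the doubling schedule is my assumption; the point above only says the window grows exponentially):

    spool_size    = 4096  -> one flush pushes up to 4096 events to the output
    bulk_max_size = 2048  -> the flush is split into 2 sub-batches
    window        = 10, 20, 40, ... capped at bulk_max_size, so each
                    sub-batch is sent as a series of batches of at most
                    the current window size

With pipelining enabled, up to pipelining of those windowed batches can be in flight on the connection before Filebeat waits for an ACK.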

I have found both 3 and 5 for pipelining to improve throughput at times. Bigger values don't gain you much of an advantage.
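
If you want to experiment, the setting sits under the Logstash output in filebeat.yml; a one-line sketch (value illustrative):

    output.logstash.pipelining: 5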

