How to use aggregate filter with multiple workers

Christian_Dahlqvist · November 21, 2018, 6:54am

The aggregate filter indeed has this limitation, which limits performance considerable and prevents scaling to multiple threads and Logstash instances. To get a solution that scales it is probably better to have a solution that does not rely on the ingest layer to handle this.

One option could be to have a batch process that periodically queries new data and updates documents where needed. This would typically run externally to Elasticsearch and be implemented using one of the language client.

You could also create an entity-centric index where you store a single document per UUID (and use this as the document ID). When you find a document that should be aggregated, you update this document (first time it would be indexed) while at the same time writing the document to the standard index.

Topic		Replies	Views
Aggregate filter plugin + Logstash	2	268	February 27, 2020
Pipeline.workers configuration and aggregation filter Logstash	9	981	October 15, 2021
Elapsed filter with multiple workers..does it work or not? Logstash	8	1898	December 29, 2017
Elapsed and aggregate filter with multiple workers Logstash	6	1658	November 1, 2018
Concerns with scaling Logstash when using the Aggregate filter Logstash	6	585	August 27, 2021

How to use aggregate filter with multiple workers

Related topics