Bulk index efficiency advice

Christian_Dahlqvist · April 25, 2024, 5:17am

Absolutely. If you are not sending bulk requests in parallel across multiple connections to Elasticsearch that lack of parallelism is likely your bottleneck. When I have run benchmarks I have always required multiple parallel indexing jobs to saturate Elasticsearch. Exacty how many parallel tasks are ideal and what the optimal bulk size is will depend on your cluster, data and sharding strategy as well as what other load the cluster is under, so this is something you need to test.

You can see this recommended in the official documentation.

Topic		Replies	Views
ES Indexing take huge time Elasticsearch	6	1685	July 5, 2017
How to increase indexing speed? Elasticsearch	5	5471	April 18, 2017
Slow bulk indexing performance Elasticsearch	6	1435	December 11, 2018
How Can I increase ES's indexing Data speed?Bulk can't achieve it! Elasticsearch	12	1319	July 5, 2017
Alternative bulk indexing implementations? Elasticsearch	10	2381	July 5, 2017

Bulk index efficiency advice

Related topics