How many workers to parallel bulk indexing in cluster?

misam · September 11, 2019, 8:07pm

I have a cluster with 8 nodes. I want to index a lot of docs. For the best performance,
how many workers (to parallel bulk indexing) must run on each node?

Christian_Dahlqvist · September 11, 2019, 8:52pm

It depends on a number of factors, e.g. document size, number of shards indexed into, mappings, hardware and bulk size. I would recommend running a benchmark with as realistic data and settings as possible.

misam · September 12, 2019, 6:46am

If N is the best number, I must send N parallel bulk to one node or split them between all nodes?

Christian_Dahlqvist · September 12, 2019, 7:08am

It will be the optimum for the way you tested it. I would recommend sending bulk requests to all data nodes that do indexing though.

system · October 10, 2019, 7:08am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Anyone with Petabyte indexing experience using parallel tasks? Elasticsearch	9	1183	May 25, 2017
Bulk Indexing Rate Elasticsearch	4	629	April 18, 2018
Fastest way to index huge data in elastic Elasticsearch	9	9606	November 11, 2018
ElasticSearch Bulk indexing is not scaling Elasticsearch	7	2984	July 5, 2017
Index paramerters Elasticsearch	2	296	July 6, 2017

How many workers to parallel bulk indexing in cluster?

Related topics