Set primary shards location

israel · November 18, 2015, 9:22am

Hi,

I have 10 machines and configured 10 shards (2 replicas for each). Is there a way to pin the primary of shard 1 to machine 1, the primary of shard 2 to machine 2, and so on?

Thanks,

dadoonet · November 18, 2015, 9:37am

Not really. But why do you want that?

If you have 10 nodes and 10 shards in total, you will end up with one shard per node.

israel · November 18, 2015, 9:45am

I need this to improve indexing time. I can route the relevant data to a local indexer process running on the same machine.
I could disable balancing and move each primary to its desired place, but then I would lose the automatic balancing. It would also need maintenance every time new indexes are created.
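
For readers wondering what the manual approach described above would involve, here is a rough sketch, not a recommendation. It only builds the JSON bodies one might send to `PUT /_cluster/settings` (to switch off rebalancing via `cluster.routing.rebalance.enable`) and to `POST /_cluster/reroute` (a list of `move` commands). The index name, node names, and the helper `build_move_commands` are hypothetical, and a `move` command relocates whichever copy of the shard sits on `from_node`, so you would first have to look up where each primary currently lives.

```python
import json

# Body for PUT /_cluster/settings: stop Elasticsearch from rebalancing shards.
REBALANCE_OFF = {"transient": {"cluster.routing.rebalance.enable": "none"}}


def build_move_commands(index, shard_to_nodes):
    """Build a body for POST /_cluster/reroute.

    shard_to_nodes maps a shard number to a (from_node, to_node) pair.
    The node names are placeholders chosen by the caller.
    """
    commands = []
    for shard, (from_node, to_node) in sorted(shard_to_nodes.items()):
        commands.append({
            "move": {
                "index": index,
                "shard": shard,
                "from_node": from_node,
                "to_node": to_node,
            }
        })
    return {"commands": commands}


if __name__ == "__main__":
    # Hypothetical layout: shard 0 currently on node-3, wanted on node-1.
    body = build_move_commands("myindex", {0: ("node-3", "node-1")})
    print(json.dumps(REBALANCE_OFF))
    print(json.dumps(body, indent=2))
```

This has to be repeated for every new index, which is the maintenance cost mentioned above.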

Christian_Dahlqvist · November 18, 2015, 9:53am

The best way to improve indexing performance is to use the _bulk API. As the documents in a single bulk request can belong to different shards, it is best to treat the cluster as a black box and let Elasticsearch manage distribution of data. Have you looked at the available documentation regarding indexing performance tips and performance considerations for Elasticsearch indexing?
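
As a minimal illustration of the `_bulk` format mentioned above: the request body is newline-delimited JSON, with an action line followed by the document source for each document, and the body must end with a newline. The helper name `build_bulk_body`, the index name, and the sample documents are made up for this sketch; older Elasticsearch versions also expect a `_type` in the action line.

```python
import json


def build_bulk_body(index, docs):
    """Return an NDJSON body for POST /_bulk.

    docs is an iterable of (doc_id, source_dict) pairs.
    """
    lines = []
    for doc_id, source in docs:
        # Action line: tells Elasticsearch what to do with the next line.
        lines.append(json.dumps({"index": {"_index": index, "_id": doc_id}}))
        # Source line: the document itself.
        lines.append(json.dumps(source))
    # Every line, including the last one, is terminated by a newline.
    return "".join(line + "\n" for line in lines)


if __name__ == "__main__":
    print(build_bulk_body("myindex", [("1", {"msg": "hello"}), ("2", {"msg": "world"})]), end="")
```

Documents in one body can hash to different shards; the node that receives the request splits them up and forwards each group to the right primary.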

israel · November 18, 2015, 11:14am

I am already using the bulk API. This is to improve performance further.
For example, we have an API in our app to scroll over all the data. Instead of having one request scrolling on ES, I have created parallel scroll streams, one for each primary shard. This gave us a huge boost in throughput.
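
The post does not say how the parallel streams were set up. One way to sketch the idea, under assumptions, is to open one scroll per shard number using the `preference=_shards:N` search parameter and run them concurrently. The base URL, index name, and both helper functions are hypothetical, and `_shards:N` restricts a search to shard N without by itself guaranteeing that the primary copy is used.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.parse import urlencode


def shard_scroll_urls(base_url, index, num_shards, keep_alive="1m"):
    """Build one initial scroll search URL per shard number."""
    urls = []
    for shard in range(num_shards):
        query = urlencode(
            {"scroll": keep_alive, "preference": f"_shards:{shard}"},
            safe=":",  # keep the colon in "_shards:N" readable
        )
        urls.append(f"{base_url}/{index}/_search?{query}")
    return urls


def run_parallel(urls, fetch):
    """Call fetch(url) for every URL on its own worker thread.

    fetch is supplied by the caller (for example an HTTP client call
    that keeps following the scroll id until no hits remain).
    """
    with ThreadPoolExecutor(max_workers=max(1, len(urls))) as pool:
        return list(pool.map(fetch, urls))


if __name__ == "__main__":
    for url in shard_scroll_urls("http://localhost:9200", "myindex", 10):
        print(url)
```

Each stream then pages through its own shard independently, so the shards are read in parallel rather than one after another.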
