Active_primary_shards > 1 when you have more than 1 data node

21908 · November 16, 2020, 1:43pm

I have a 3 node Elasticsearch cluster. All nodes are data bearing nodes.

If active_primary_shards is set to 1 my data would not be distributed among the 3 nodes, is that a correct understanding? My single shard would be sitting on a single node. Wouldn't setting number of shards to 1 be defeating the purpose of having multiple nodes?

DavidTurner · November 16, 2020, 4:04pm

No, you could (for instance) set number_of_shards: 1 and number_of_replicas: 2 to have a copy of the shard on every node, all of which will respond to searches.

21908 · November 16, 2020, 4:10pm

Thank you, I'm starting to understand better now.

Only after I start to hit data limits for a single shard, would increasing "number of shards" make sense.

If I have 150GB of data and a 3 node cluster, where all 3 nodes are data bearing, then it would make sense to set "number of shards" to 3, would you agree?

DavidTurner · November 16, 2020, 4:22pm

It depends™ Specifically the details of your data and how you are indexing & searching it will affect the answer here. But as a starting point number_of_shards: 3 on a 150GB index sounds reasonable to me.

system · December 14, 2020, 4:22pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Elasticsearch Index shards per nodes Elasticsearch	13	1232	October 5, 2020
Performance implications of multiple primary shards for one index on the same data node Elasticsearch	4	1466	June 24, 2021
Single Shard 0 replicas 3 nodes Elasticsearch	2	2402	September 19, 2018
Dynamic growing, a solution for a fixed shard number? Elasticsearch	2	1792	July 6, 2017
Is there a performance issue if all the primary shards are located on a single node? Elasticsearch	2	331	July 27, 2020

Active_primary_shards > 1 when you have more than 1 data node

Related topics