Primary shard rebalancing

Kellan · December 15, 2011, 10:34pm

I have a 2-node (1 process on each node) ES cluster setup with 2
shards and 1 replica per shard. With this configuration, I would think
that the ideal balance would be 1 primary shard and 1 replica shard on
each node, and indeed after the initial data insert, this is the case.
However, after one or both processes are restarted, the cluster seems
to "rebalance" itself with both primary shards on one node and both
replicas on the other. Is there a way to direct the cluster back
toward the 1 primary/1 replica per node configuration? Is it correct
that all updates go to the primary shard? My index configuration is
below.

Thanks for any help you can provide,
Kellan

index:
number_of_shards: 2
number_of_replicas: 1

bootstrap.mlockall: true

cluster.name: shardtest

network.host: ip1

http.port: 9200

transport.port: 9400

discovery.zen.ping.multicast.enabled: false
discovery.zen.ping.unicast.hosts: [ip1, ip2]
discovery.zen.minimum_master_nodes: 2

kimchy · December 16, 2011, 3:48pm

There is no meaning to have balanced primary allocation since primary and
replica shards do the same work.

On Fri, Dec 16, 2011 at 12:34 AM, Kellan wampleek@gmail.com wrote:

I have a 2-node (1 process on each node) ES cluster setup with 2
shards and 1 replica per shard. With this configuration, I would think
that the ideal balance would be 1 primary shard and 1 replica shard on
each node, and indeed after the initial data insert, this is the case.
However, after one or both processes are restarted, the cluster seems
to "rebalance" itself with both primary shards on one node and both
replicas on the other. Is there a way to direct the cluster back
toward the 1 primary/1 replica per node configuration? Is it correct
that all updates go to the primary shard? My index configuration is
below.

Thanks for any help you can provide,
Kellan

index:
number_of_shards: 2
number_of_replicas: 1

bootstrap.mlockall: true

cluster.name: shardtest

network.host: ip1

http.port: 9200

transport.port: 9400

discovery.zen.ping.multicast.enabled: false
discovery.zen.ping.unicast.hosts: [ip1, ip2]
discovery.zen.minimum_master_nodes: 2

Kellan · December 16, 2011, 7:44pm

Can replica shards handle data inserts? I thought the primary shard
handled all data inserts and reindexing.

On Dec 16, 10:48 am, Shay Banon kim...@gmail.com wrote:

There is no meaning to have balanced primary allocation since primary and
replica shards do the same work.

On Fri, Dec 16, 2011 at 12:34 AM, Kellan wampl...@gmail.com wrote:

I have a 2-node (1 process on each node) ES cluster setup with 2
shards and 1 replica per shard. With this configuration, I would think
that the ideal balance would be 1 primary shard and 1 replica shard on
each node, and indeed after the initial data insert, this is the case.
However, after one or both processes are restarted, the cluster seems
to "rebalance" itself with both primary shards on one node and both
replicas on the other. Is there a way to direct the cluster back
toward the 1 primary/1 replica per node configuration? Is it correct
that all updates go to the primary shard? My index configuration is
below.

Thanks for any help you can provide,
Kellan

index:
number_of_shards: 2
number_of_replicas: 1

bootstrap.mlockall: true

cluster.name: shardtest

network.host: ip1

http.port: 9200

transport.port: 9400

discovery.zen.ping.multicast.enabled: false
discovery.zen.ping.unicast.hosts: [ip1, ip2]
discovery.zen.minimum_master_nodes: 2

kimchy · December 16, 2011, 8:58pm

Yes, replicas also handle indexing in order to provide (near) realtime
support search and HA.

On Fri, Dec 16, 2011 at 9:44 PM, Kellan wampleek@gmail.com wrote:

Can replica shards handle data inserts? I thought the primary shard
handled all data inserts and reindexing.

On Dec 16, 10:48 am, Shay Banon kim...@gmail.com wrote:

There is no meaning to have balanced primary allocation since primary and
replica shards do the same work.

On Fri, Dec 16, 2011 at 12:34 AM, Kellan wampl...@gmail.com wrote:

I have a 2-node (1 process on each node) ES cluster setup with 2
shards and 1 replica per shard. With this configuration, I would think
that the ideal balance would be 1 primary shard and 1 replica shard on
each node, and indeed after the initial data insert, this is the case.
However, after one or both processes are restarted, the cluster seems
to "rebalance" itself with both primary shards on one node and both
replicas on the other. Is there a way to direct the cluster back
toward the 1 primary/1 replica per node configuration? Is it correct
that all updates go to the primary shard? My index configuration is
below.

Thanks for any help you can provide,
Kellan

index:
number_of_shards: 2
number_of_replicas: 1

bootstrap.mlockall: true

cluster.name: shardtest

network.host: ip1

http.port: 9200

transport.port: 9400

discovery.zen.ping.multicast.enabled: false
discovery.zen.ping.unicast.hosts: [ip1, ip2]
discovery.zen.minimum_master_nodes: 2

Lukas_Vlcek1 · December 16, 2011, 9:40pm

Kellan,

the data is first indexed on primary shard and the primary shard then makes
sure it is also replicated to all replicas. So if you have two nodes, two
indices each with 1 shard and 1 replica then even if each primary would be
located on different node the indexing would still propagate every document
to both nodes equally.

Regards,
Lukas

On Fri, Dec 16, 2011 at 9:58 PM, Shay Banon kimchy@gmail.com wrote:

Yes, replicas also handle indexing in order to provide (near) realtime
support search and HA.

On Fri, Dec 16, 2011 at 9:44 PM, Kellan wampleek@gmail.com wrote:

Can replica shards handle data inserts? I thought the primary shard
handled all data inserts and reindexing.

On Dec 16, 10:48 am, Shay Banon kim...@gmail.com wrote:

There is no meaning to have balanced primary allocation since primary
and
replica shards do the same work.

On Fri, Dec 16, 2011 at 12:34 AM, Kellan wampl...@gmail.com wrote:

I have a 2-node (1 process on each node) ES cluster setup with 2
shards and 1 replica per shard. With this configuration, I would think
that the ideal balance would be 1 primary shard and 1 replica shard on
each node, and indeed after the initial data insert, this is the case.
However, after one or both processes are restarted, the cluster seems
to "rebalance" itself with both primary shards on one node and both
replicas on the other. Is there a way to direct the cluster back
toward the 1 primary/1 replica per node configuration? Is it correct
that all updates go to the primary shard? My index configuration is
below.

Thanks for any help you can provide,
Kellan

index:
number_of_shards: 2
number_of_replicas: 1

bootstrap.mlockall: true

cluster.name: shardtest

network.host: ip1

http.port: 9200

transport.port: 9400

discovery.zen.ping.multicast.enabled: false
discovery.zen.ping.unicast.hosts: [ip1, ip2]
discovery.zen.minimum_master_nodes: 2

Topic		Replies	Views
Balanced shards and replicas in ES Elasticsearch	3	447	July 6, 2017
How to rebalance primary shards on elastic cluster Elasticsearch	5	11998	May 23, 2019
Elasticsearch 2.4 Shard Rebalancing Elasticsearch	7	889	May 27, 2019
One node one primary shard Elasticsearch	1	246	September 17, 2021
Rebalance primary shards Elasticsearch	4	3104	July 6, 2017

Primary shard rebalancing

Related topics