[SOLVED] Shard routing allocation pattern

mikula · November 30, 2015, 3:39pm

Hello,
I would like to ask about shard allocation:
We have ES cluster of five machines which four of them are "twins" - two machines per one "sleeve" with single PSU unit.
I would like to set replica allocation that none of replicas is allocated to second "twin", so in case of PSU filure We woul not possibly loose whole part of index.

Layout is this:

Sleeve1

Node1 Shard1 Shard2Replica
Node2 Shard2 Shard5Replica
Sleeve2
Node3 Shard3 Shard4Replica
Node4 Shard4 Sahrd1Replica
Node5 Shard5 Shard3Replica

I do not want replica of shard2 be on Node1 because its Twin with one PSU for node2....
How to get it to work this way?
It is for logstash indexing
Than you
AM

.

vtst2412 · November 30, 2015, 5:36pm

(Option 1)
The simplest solution is to have 2 replicas. That way, if any single PSU fail, you will still have 1 shard (assuming node 5 is on a different PSU?)

https://www.elastic.co/guide/en/elasticsearch/reference/current/indices-update-settings.html.

Note that the API above will only change the number of replica for existing indices. To persist this setting to new indices, you will also need to change the settings in the templates.

https://www.elastic.co/guide/en/elasticsearch/reference/current/indices-templates.html

(Option 2)
Otherwise, you can disable allocations on the cluster and only issue explicit shard allocations (this is a trade-off between control and convenience).

https://www.elastic.co/guide/en/elasticsearch/reference/current/cluster-reroute.html

Christian_Dahlqvist · November 30, 2015, 5:47pm

You can use shard allocation awareness to prevent that a primary and replica for any shard end up on nodes under the same PSU. Assign Node1 and Node 2 to 'rack1', Node3 and Node4 to 'rack2' and finally Node 5 to 'rack3'. The configure the shard allocation awareness to consider the rack id when placing shards. As long as you only have 1 replica configured, it should be possible to distribute the data fairly evenly even though one off the racks only has a single node. The uneven balance between racks can however become a problem if a PSU fails and takes down 2 nodes as recovery then will try to move a copy of all data over to Node5, which could cause problems.

mikula · December 1, 2015, 8:51am

Thank you very much that was what I was looking for.

Topic		Replies	Views
Shard allocation filtering for replications Elasticsearch	3	194	September 7, 2022
Shards get not distributed across the cluster Elasticsearch	1	450	July 6, 2017
Multiple nodes on same machine : replicas? Elasticsearch	4	1004	July 6, 2017
Primary shard allocation on a server running multiple ES nodes Elasticsearch	5	1602	July 5, 2017
Routing allocation shard doesn't work Elasticsearch	5	348	April 16, 2024

[SOLVED] Shard routing allocation pattern

Related topics