Shard Awareness Question

Dauni · September 22, 2021, 8:54pm

Hello,
We are running an ES 5.16 cluster across 3 DCs (15 hosts in total; each host has 8 CPUs and 32 GB RAM, with the heap set to 16 GB).

DC1 = zone1 (1 master node, 6 data nodes)
DC2 = zone2 (1 master node, 6 data nodes)
DC3 = zone1 (master only)

The reason DC3 has only a master is redundancy in the event we lose an entire DC (without having to enable multiple masters in each DC).

On all 15 nodes we have:
node.attr.rack: r1

Currently we have the awareness set to zone:
cluster.routing.allocation.awareness.attributes: zone

When we do maintenance work like OS patching or even bouncing a node, we go node by node to prevent any downtime. We turn off shard allocation before working on a node so that the shards that were on it do not get reallocated, then do the necessary work.

After doing the work, we then enable shard allocation once the node is up and joins the cluster.
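For illustration, the disable/enable steps above could be sketched as the request bodies for `PUT _cluster/settings`. This is a minimal sketch, not our exact procedure: it assumes the cluster-wide `cluster.routing.allocation.enable` setting applied as a transient setting, and the helper name `allocation_settings` is hypothetical. It only builds the JSON and does not contact a cluster.

```python
import json

# Values accepted by cluster.routing.allocation.enable.
ALLOWED = {"all", "primaries", "new_primaries", "none"}


def allocation_settings(enable: str) -> dict:
    """Build a body for PUT _cluster/settings (hypothetical helper).

    Assumption: the setting is applied as a transient cluster setting.
    """
    if enable not in ALLOWED:
        raise ValueError(f"unsupported allocation mode: {enable!r}")
    return {"transient": {"cluster.routing.allocation.enable": enable}}


if __name__ == "__main__":
    # Before taking a node down: stop shards from being reallocated.
    print(json.dumps(allocation_settings("none")))
    # After the node has rejoined the cluster: allow allocation again.
    print(json.dumps(allocation_settings("all")))
```

Note that this setting is cluster-wide rather than per node, so allocation is paused for the whole cluster while the node is down.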

Since our current setup will never have the primary and replica of a shard in the same DC (zone), it takes 30+ minutes for each replica shard of one index to sync up with its primary.

The index in question has 15 primary shards with 1 replica each. Each shard is approximately 8 GB and growing daily.

We increased the max_bytes_per_sec to 200mb, and the concurrent_streams to 4.

While this helped with the overall recovery time, it didn't help at the individual shard level (each shard still takes 30+ minutes to recover from the primary due to cross-DC copying).
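As a rough sanity check on those numbers, the arithmetic can be sketched as below. It assumes "200mb" means 200 MB per second, that a shard is exactly 8 GB (1 GB = 1024 MB), and that recovery takes exactly 30 minutes; both function names are hypothetical. Since the real figure is "30+" minutes, the computed throughput is an upper bound.

```python
def transfer_seconds(shard_gb: float, rate_mb_per_sec: float) -> float:
    """Time to copy one shard if the configured rate were actually reached."""
    return shard_gb * 1024 / rate_mb_per_sec


def effective_rate_mb_per_sec(shard_gb: float, minutes: float) -> float:
    """Average throughput implied by an observed recovery time."""
    return shard_gb * 1024 / (minutes * 60)


if __name__ == "__main__":
    # Best case at the 200 MB/s throttle.
    print(round(transfer_seconds(8, 200), 1), "seconds at the throttle limit")
    # What a 30-minute recovery of an 8 GB shard actually implies.
    print(round(effective_rate_mb_per_sec(8, 30), 1), "MB/s observed")
```

Under these assumptions a shard would copy in about 41 seconds at the throttle limit, while a 30-minute recovery implies roughly 4.6 MB/s per shard. That gap suggests the recovery throttle is not the limiting factor for an individual shard.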

Is there anything we could do by changing awareness settings that could help out? My thought is that we will always have the potential for cross-DC copying during recovery, since the primary could be in the other DC regardless of how we set up the zones.

We are not using snapshots, but if they are a viable option to speed up recovery, we could look into that.

Any ideas or suggestions would be appreciated.

warkolm · September 22, 2021, 10:33pm

Welcome to our community!

5.X has been EOL for a few years now; you really need to upgrade as a matter of serious urgency.

Running an Elasticsearch cluster across DCs is not recommended or supported.
Please upgrade and use CCR and/or CCS.

Upgrade, upgrade, upgrade. There has been a lot of work done around shard allocation between 5.X and the latest release, 7.15.

warkolm · September 23, 2021, 8:13am

Just an update here: in newer versions you can run cross-DC clusters. Please see "Resilience in larger clusters" in the Elasticsearch Guide [7.15].

Dauni · September 23, 2021, 12:11pm

Thank you Mark for the information and link!

system · October 21, 2021, 12:12pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
