Unassigned primary + replica shard, minimise data loss

tg295 · October 25, 2022, 10:03am

Hello, my cluster is currently in red state due to one of both the primary and replica shard of an index becoming unassigned. This happened after a number of large tasks were executed simultaneously by accident on the same index alias. The index in question that was affected was the current write index of that index alias. The index contains valuable data and I would like to try to minimise any data loss if possible, ideally with none.

Calling GET _cluster/allocation/explain on the the primary returns unassigned_info->reason:

"ALLOCATION_FAILED"

and allocate_explanation:

"cannot allocate because all found copies of the shard are either stale or corrupt"

finally within unassigned_info->details:

""failed shard on node [<node_id>]: shard failure, reason [merge failed], failure NotSerializableExceptionWrapper[merge_exception: org.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?)..."

Calling the same on the replica returns:

unassigned_info->reason: "ALLOCATION_FAILED"

allocate_explanation:"cannot allocate because allocation is not permitted to any of the nodes"

unassigned_info->details:

"failed shard on node [<node_id>]: failed to perform indices:data/write/bulk[s] on replica [<index_name>][<shard_num>], node[<node_id>], [R], s[STARTED], a[id=<>], failure IndexShardClosedException[CurrentState[CLOSED] Primary closed.]"

I attempted a dry run of manually reallocating the replica using the reroute API, and received a status 400 with: "[allocate_replica] trying to allocate a replica shard [<index_name>][<shard_num>], while corresponding primary shard is still unassigned"

What is the best course of action here? I gather I need to assign the primary shard before I can do anything with the replica. I am concerned given the CorruptIndex exception that the primary shard (and potentially the replica too..) has suffered data losses, so my thinking was that recovering from the replica was my best bet? Is my understanding incorrect here / am I going to have to be content with data losses?

Many thanks

warkolm · October 26, 2022, 1:16am

What version are you running?

That's not good and may indicate data loss. Do you have snapshots?

tg295 · October 26, 2022, 8:31am

I'm running 7.13.2. And no I do not have any snapshots...

tg295 · October 28, 2022, 11:05am

does anyone have any recommendations for how to proceed?

tg295 · October 28, 2022, 11:10am

My current thinking was to follow an approach similar to: When everything else fails. We are using Elasticsearch on a Google… | by Remco Verhoef | Medium

Use the CheckIndex tool to see whether the shards are actually corrupt, and then proceed from there?

leandrojmp · October 28, 2022, 2:04pm

You may use the elasticsearch-shard cli tool to see if you can recover something, here is the documentation.

Be aware of this warning in the documentation.

You will lose the corrupted data when you run elasticsearch-shard . This tool should only be used as a last resort if there is no way to recover from another copy of the shard or restore a snapshot.

From what you shared there is not much else you can do since it appears that your index is corrupted and you may already have some level o data loss.

tg295 · October 28, 2022, 3:52pm

Thank you! Wish me luck...

tg295 · November 2, 2022, 2:38pm

The documentation explicitly says to "Stop Elasticsearch before running elasticsearch-shard."

Does this apply to the particular node or my entire cluster?

leandrojmp · November 2, 2022, 2:47pm

Never used this command, but I would assume that it is the Elasticsearch node that has the shard you want to try to fix.

tg295 · November 4, 2022, 10:16am

Due to the size of our cluster, stopping Elasticsearch / reallocating an entire node is quite an operation - do you know of any method that might allow us to address the issue without doing so? I assume not but worth an ask.. Many thanks for all your help by the way

leandrojmp · November 4, 2022, 1:23pm

The elasticsearch-shard command is already a last resort for cases like yours, and there is no guarantee that it will work, but to be able to try it you will need to shutdown this node.

tg295 · November 4, 2022, 2:27pm

Ok, thanks for the info and your swift reply.

system · December 2, 2022, 2:28pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Unassigned primary and replica shards Elasticsearch	6	2068	July 6, 2017
Why shard unassigned after cluster restart completely? Elasticsearch	1	394	May 28, 2020
Unassigned missng shards after node failure Elasticsearch	1	254	February 18, 2023
Cluster Health RED unassigned primary and replica shards Elasticsearch	3	1481	January 18, 2021
UNASSIGNED replicas after reroute allocate_stale_primary Elasticsearch	1	464	March 18, 2022

Unassigned primary + replica shard, minimise data loss

Related topics