Indices in red state. "cannot allocate because all found copies of the shard are either stale or corrupt"

Bhuvesh_Seth · March 29, 2023, 10:29pm

Hi, we are facing one issue where some of indices health not getting updated to yellow or green due to this error "cannot allocate because all found copies of the shard are either stale or corrupt". Can someone please guide how to fix this issue without any data loss.
we tried reroute api with retry_failed=true but that didn't work.
We are using Elasticsearch version 6.2.3

warkolm · March 29, 2023, 10:33pm

Please share more of the response you are getting.

Please note that version is EOL and no longer supported, you should be looking to upgrade as a matter of urgency.

system · March 29, 2023, 10:33pm

Elasticsearch version 6.2.3 is EOL and no longer supported. Please upgrade ASAP.

(This is an automated response from your friendly Elastic bot. Please report this post if you have any suggestions or concerns )

Bhuvesh_Seth · March 29, 2023, 10:58pm

{
"index" : "IndexName",
"shard" : 0,
"primary" : true,
"current_state" : "unassigned",
"unassigned_info" : {
"reason" : "NODE_LEFT",
"at" : "2023-03-26T06:26:40.845Z",
"details" : "node_left[kKhS_E7FQDS_xVaLZTs4gg]",
"last_allocation_status" : "no_valid_shard_copy"
},
"can_allocate" : "no_valid_shard_copy",
"allocate_explanation" : "cannot allocate because all found copies of the shard are either stale or corrupt"
"node_allocation_decisions" : [
{
"node_id" : "22l8b-PaTAey6kfuWsx9uQ",
"node_name" : "node-c007-data-vm6",
"transport_address" : "10.7.11.16:9300",
"node_decision" : "no",
"store" : {
"found" : false
}
},

warkolm · March 29, 2023, 11:10pm

Is that node no longer part of the cluster? Do you have a replica?

Bhuvesh_Seth · March 30, 2023, 1:33am

that node is part of cluster. We do have replica copy of same index but that shows the status as cluster_Recovered.

Bhuvesh_Seth · March 30, 2023, 1:34am

Sorry, what do you mean by replica? Replica shard. right?

Bhuvesh_Seth · March 31, 2023, 4:55pm

can someone please help fix this issue?

warkolm · April 4, 2023, 1:28am

What's the output from this;

GET /_cluster/allocation/explain?pretty
{
  "index": "IndexName",
  "shard": 0,
  "primary": false
}

Bhuvesh_Seth · April 13, 2023, 2:43am

Sorry for late reply.
Here is the result I got.
primary shard for this replica is not yet active"},{"decider":"throttling","decision":"NO","explanation":"primary shard for this replica is not yet active"}]},{"node_id":"UnTTeuAkQsW_Qt_fyfiP1Q","node_name":"..-c007-data-vm13","transport_address":"...18:9300","node_decision":"no","deciders":[{"decider":"replica_after_primary_active","decision":"NO","explanation":"primary shard for this replica is not yet active"},{"decider":"throttling","decision":"NO","explanation":"primary shard for this replica is not yet active"}]},{"node_id":"Y916v1p1SIaOwhTcBGqHDQ","node_name":"..-c007-data-vm11","transport_address":"...20:9300","node_decision":"no","deciders":[{"decider":"replica_after_primary_active","decision":"NO","explanation":"primary shard for this replica is not yet active"},{"decider":"throttling","decision":"NO","explanation":"primary shard for this replica is not yet active"}]},{"node_id":"_25POYZISYaZkdUNAW0hfQ","node_name":"..-c007-data-vm8","transport_address":"...14:9300","node_decision":"no","deciders":[{"decider":"replica_after_primary_active","decision":"NO","explanation":"primary shard for this replica is not yet active"},{"decider":"throttling","decision":"NO","explanation":"primary shard for this replica is not yet active"}]},{"node_id":"in7cl1tERVapS5n6EofsrQ","node_name":"..-c007-data-vm3","transport_address":"...12:9300","node_decision":"no","deciders":[{"decider":"replica_after_primary_active","decision":"NO","explanation":"primary shard for this replica is not yet active"},{"decider":"throttling","decision":"NO","explanation":"primary shard for this replica is not yet active"}]},{"node_id":"jtFs4rt3S_e4VB6TSAhn1Q","node_name":"..-c007-data-vm16","transport_address":"...10:9300","node_decision":"no","deciders":[{"decider":"replica_after_primary_active","decision":"NO","explanation":"primary shard for this replica is not yet active"},{"decider":"throttling","decision":"NO","explanation":"primary shard for this replica is not yet active"}]},{"node_id":"kKhS_E7FQDS_xVaLZTs4gg","node_name":"..-c007-data-vm10","transport_address":"...22:9300","node_decision":"no","deciders":[{"decider":"replica_after_primary_active","decision":"NO","explanation":"primary shard for this replica is not yet active"},{"decider":"throttling","decision":"THROTTLE","explanation":"reached the limit of incoming shard recoveries [2], cluster setting [cluster.routing.allocation.node_concurrent_incoming_recoveries=2] (can also be set via [cluster.routing.allocation.node_concurrent_recoveries])"}]},{"node_id":"tE8ueSj9TgeyapbURZqnMw","node_name":"..-c007-data-vm19","transport_address":"...23:9300","node_decision":"no","deciders":[{"decider":"replica_after_primary_active","decision":"NO","explanation":"primary shard for this replica is not yet active"},{"decider":"throttling","decision":"NO","explanation":"primary shard for this replica is not yet active"}]},{"node_id":"ujM8E509TvysjIAbyslefQ","node_name":"..-c007-data-vm15","transport_address":"...13:9300","node_decision":"no","deciders":[{"decider":"replica_after_primary_active","decision":"NO","explanation":"primary shard for this replica is not yet active"},{"decider":"throttling","decision":"NO","explanation":"primary shard for this replica is not yet active"}]},{"node_id":"wlE49OBsSnubzwCYlOQH2A","node_name":"..-c007-data-vm4","transport_address":"...17:9300","node_decision":"no","deciders":[{"decider":"replica_after_primary_active","decision":"NO","explanation":"primary shard for this replica is not yet active"},{"decider":"throttling","decision":"NO","explanation":"primary shard for this replica is not yet active"}]},{"node_id":"xh21XA3_TzmsaamYMdZihQ","node_name":"..-c007-data-vm5","transport_address":"...11:9300","node_decision":"no","deciders":[{"decider":"replica_after_primary_active","decision":"NO","explanation":"primary shard for this replica is not yet active"},{"decider":"throttling","decision":"NO","explanation":"primary shard for this replica is not yet active"}]}]}

Christian_Dahlqvist · April 13, 2023, 6:20am

If all primaries and replicas are corrupted I am not sure that is possible. I would recommend reverting to a recent snapshot.

system · May 11, 2023, 6:20am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Indices stuck after recovery from backup Elasticsearch	10	2658	August 8, 2019
The shard cannot be allocated to the same node on which a copy of the shard already exists Elasticsearch docker	3	22982	June 17, 2020
Recovering from a potentially corrupt cluster state: UnavailableShardsException Elasticsearch	4	1184	July 6, 2017
How to find which shard got corrupt Elasticsearch	5	3015	July 6, 2017
Failing Replica Shards Elasticsearch	5	1207	July 6, 2017

Indices in red state. "cannot allocate because all found copies of the shard are either stale or corrupt"

Related topics