Shards unassigned after elasticsearch restart

Drift · August 5, 2017, 10:45pm

My elasticsearch instances were restarted, but it looks like after that I have about 10 unassisned shards
The reason seems to be node_left, how can I re-assign the shards to the respective nodes?

{
"cluster_name" : "santorini_rec",
"status" : "yellow",
"timed_out" : false,
"number_of_nodes" : 5,
"number_of_data_nodes" : 3,
"active_primary_shards" : 16,
"active_shards" : 18,
"relocating_shards" : 0,
"initializing_shards" : 4,
"unassigned_shards" : 10,
"delayed_unassigned_shards" : 0,
"number_of_pending_tasks" : 2,
"number_of_in_flight_fetch" : 0,
"task_max_waiting_in_queue_millis" : 0,
"active_shards_percent_as_number" : 56.25
}

Does this mean I lost all the data?

.monitoring-es-6-2017.08.05 0 r UNASSIGNED NODE_LEFT
santo_pipeline_data 5 r UNASSIGNED NODE_LEFT
santo_pipeline_data 3 r UNASSIGNED NODE_LEFT
santo_pipeline_data 8 r UNASSIGNED NODE_LEFT
santo_pipeline_data 2 r UNASSIGNED NODE_LEFT
santo_pipeline_data 1 r UNASSIGNED PRIMARY_FAILED
.monitoring-alerts-6 0 r UNASSIGNED NODE_LEFT
.watches 0 r UNASSIGNED NODE_LEFT
.watcher-history-3-2017.08.05 0 r UNASSIGNED NODE_LEFT
.monitoring-kibana-6-2017.08.05 0 r UNASSIGNED NODE_LEFT

I tried to enable re-allocation, but it dint help.

{
"transient": {
"cluster.routing.allocation.enable": "all"
}
}
'

Elasticsearch seems to be so sensitive about any operation. Everytime I touch it, something or the other goes wrong and its just so hard to recover it back.

Can someone please help?

warkolm · August 5, 2017, 11:04pm

What version are you on?

Drift · August 5, 2017, 11:05pm

@warkolm I am on 5.5.1

I just tried turning off all replicas and turning back on again, not sure if it will help.
Its still initializing the shards.

warkolm · August 5, 2017, 11:06pm

Why were the instances restarted?

Drift · August 5, 2017, 11:07pm

I disabled swapping on all hosts, and I had to tune some networking related parameters. So I gracefully shut down elastic using systemd , except for the master node.

warkolm · August 5, 2017, 11:07pm

Did you shut down all the data nodes at once?

Drift · August 5, 2017, 11:07pm

No, one by one, making sure each instance came up.

Drift · August 5, 2017, 11:09pm

Not sure if it was a good idea, i turned replica to 0 and back to 10 again...

"cluster_name" : "santorini_rec",
"status" : "yellow",
"timed_out" : false,
"number_of_nodes" : 5,
"number_of_data_nodes" : 3,
"active_primary_shards" : 16,
"active_shards" : 23,
"relocating_shards" : 0,
"initializing_shards" : 6,
"unassigned_shards" : 84,
"delayed_unassigned_shards" : 0,
"number_of_pending_tasks" : 0,
"number_of_in_flight_fetch" : 0,
"task_max_waiting_in_queue_millis" : 0,
"active_shards_percent_as_number" : 20.353982300884958
}

Drift · August 5, 2017, 11:16pm

I still see indexing work and the logs are being ingested.
Should I shut down logstash until I fix the state, or it should be ok?

Drift · August 6, 2017, 1:25am

All the unassigned are replicas. Unable to recover them. Appreciate any suggestions.

warkolm · August 6, 2017, 4:07am

Why do you need 10 replicas?

Drift · August 6, 2017, 4:14am

Actually I dont, I created 9 shards for 3 nodes, and it automatically created 10 replicas total.
Some of them recovered automatically

warkolm · August 6, 2017, 4:15am

You definitely have more than one replica set, and if you have more than 2 then those extras won't even be assigned.

Drift · August 6, 2017, 4:16am

Can I reduce the numnber of replicas now? What solution would you suggest

warkolm · August 6, 2017, 9:12am

Usually having more than 1 replica set is not required unless you have unstable infrastructure or low volumes of data queried at a high rate.

You have 3 data nodes, that means you can only store the primary and 2 replica sets of the data. Setting more will mean unassigned (replica) shards, which causes the yellow status until you either have more nodes to hold them or you reduce the replica count to fit in your cluster.

curl -XPUT localhost:9200/*/_settings -d '{ "index" : { "number_of_replicas" : N } }', where N is the number you want.

Drift · August 8, 2017, 12:28am

Thankyou @warkolm. That makes sense. I have adjusted my replicas accordingly and got it to green now

system · September 5, 2017, 12:28am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Unassigned shards, v2 Elasticsearch	5	1296	July 6, 2017
Unassigned shards after node was restarted Elasticsearch	3	546	June 8, 2020
Shards unassigned after node restarts - reason: NODE_LEFT Elasticsearch	16	36327	December 28, 2016
Unassigned Shards after After One of the Nodes Restart Elasticsearch	4	958	July 5, 2017
All Shards Unassigned due to Data Node Restarts Elasticsearch	3	1579	May 28, 2019

Shards unassigned after elasticsearch restart

Related topics