After a cluster restart, both primary and replica shards become unallocated after a long time

(WeiqiangYuan) #1

Version: 1.5.2
Data size: 60GB
Shard number: 30 primary, 30 replica
These values are default.

Description: We have 12 nodes: 3 master nodes and 9 data nodes. In order to uninstall some plugins, we have to do the cluster restart. We do the following steps:

  1. Uninstall the plugin and shutdown node one by one (We didn't disable shard reallocation).
    As a result, all the nodes are shutdown.
  2. Start the master nodes one by one.
  3. Start the data nodes one by one.
    However, after about an hour, some primary and replica nodes are still unallocated.
    The error logs are as followed:

So we have to reroute the unallocated shards using the following api. However, after rerouting the shards, the data in these shards are lost.

curl -XPOST 'localhost:9200/_cluster/reroute' -d '{ "commands" : [ { "allocate" : { "index" : "t37", "shard" : $shard, "node" : "datanode15", "allow_primary" : true } } ] }'

Therefore, my questions are as followed:

  1. Why primary shards become unallocated after a long time?
  2. How to do the cluster restart correctly and safely?
  3. If some primary shards become unallocated unfortunately, how to reroute them without losing data?

Thank you.

(Mark Walkom) #2

That's too high, who so many?

You should ideally follow this procedure -

(WeiqiangYuan) #4

Yes, I know this solution. However, we don't restart node one by one(rolling restart). What we have to do is that we shutdown every node one by one. And then all the nodes were shutdown. Then we start every node one by one. In this case, the solution will not work:

The reason why allocate such shards is that we may extend to more data nodes in the future

(system) #5