A snapshot process is stuck in "IN_PROGRESS" although the status of the snapshot is already "ABORTED". Canceling the snapshot or creating a new one is no longer possible.
I already tried the cleanup script from https://github.com/imotov/elasticsearch-snapshot-cleanup, with no change.
The error was described for versions before 1.5.x but it seems to still exist, so:
Is there a way to resolve this issue without a restart?
Would a rolling restart (first the master, then the data nodes) resolve the issue?
I've seen this a couple of times, and have resolved it by running GET /_snapshot/backup/_status to get the details of the snapshot that is stuck in the in-progress state and searching the output for "stage": "INIT". In each case, I found a shard in the initializing (INIT) stage for one of the indices in the stuck snapshot, listed together with a node value.
You can then run GET /_nodes/stats to get the friendly name associated with that node value, and then restart only that node. That has worked twice for me.
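If the status output is large, the search can be scripted. Below is a minimal Python sketch of the two lookups described above, working on the already-parsed JSON bodies of GET /_snapshot/backup/_status and GET /_nodes/stats. The function names, the sample node ID, and the sample index and node names are made up for illustration, and the field layout (snapshots → indices → shards → stage/node, and nodes → id → name) is what I'd expect from those APIs, so check it against the output of your own version.

```python
def find_init_shards(status):
    """Return every shard entry whose stage is INIT in a snapshot status response."""
    stuck = []
    for snap in status.get("snapshots", []):
        for index_name, index in snap.get("indices", {}).items():
            for shard_id, shard in index.get("shards", {}).items():
                if shard.get("stage") == "INIT":
                    stuck.append({
                        "snapshot": snap.get("snapshot"),
                        "index": index_name,
                        "shard": shard_id,
                        "node": shard.get("node"),  # node ID, not the friendly name
                    })
    return stuck


def node_names(nodes_stats, node_ids):
    """Map node IDs to friendly names using a nodes stats response."""
    nodes = nodes_stats.get("nodes", {})
    return {node_id: nodes.get(node_id, {}).get("name") for node_id in node_ids}


# Hypothetical, trimmed-down responses (all IDs and names are invented).
sample_status = {
    "snapshots": [{
        "snapshot": "snapshot_1",
        "state": "ABORTED",
        "indices": {
            "logs-2015.06": {
                "shards": {
                    "0": {"stage": "DONE"},
                    "1": {"stage": "INIT", "node": "node-id-abc"},
                }
            }
        },
    }]
}
sample_nodes = {"nodes": {"node-id-abc": {"name": "data-node-3"}}}

stuck = find_init_shards(sample_status)
names = node_names(sample_nodes, [s["node"] for s in stuck])
for s in stuck:
    print(s["snapshot"], s["index"], s["shard"], s["node"], names.get(s["node"]))
```

The node name printed in the last column is the one to restart. An unknown node ID maps to None rather than raising, so a node that has already left the cluster is easy to spot.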