If you abort this kind of stuck snapshot (by deleting it), does it eventually stop properly?
What are those 89 failed shards? Why did they fail? (can you share logs or the concrete failures?)
What should we do? Is it possible to rerun the snapshot?
Aborting the snapshot and running it again seems like the best option here if the snapshot isn't making any progress at all. If it is making some progress, letting it finish and then running another snapshot will be faster due to the incremental nature of snapshots. Even if one snapshot has some failures, the data it put in the repository will be reused by the next snapshot where possible, so even a partially failed snapshot contributes progress to future snapshots.
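For reference, aborting a stuck snapshot and starting a fresh one can be done through the snapshot APIs. A minimal sketch, assuming a repository named `my_repo` and snapshot names `snap_1`/`snap_2` (all placeholders) and a cluster on `localhost:9200`:

```shell
# Abort the stuck snapshot by deleting it; if it is still
# running, the delete also cancels it.
curl -X DELETE "localhost:9200/_snapshot/my_repo/snap_1"

# Start a new snapshot; data already written to the repository
# by the earlier (partial) snapshot is reused incrementally.
curl -X PUT "localhost:9200/_snapshot/my_repo/snap_2?wait_for_completion=false"

# Check progress, including per-shard state and any failures.
curl -X GET "localhost:9200/_snapshot/my_repo/snap_2/_status"
```

The `_status` output is also where the per-shard failure details (like the 89 failed shards mentioned above) show up.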
Likely this would have sufficed to fix the issue, as the snapshot implementation is designed to be resilient to errors. As a tip for the next time you run into trouble, I'd try this first before moving on to more time-consuming workarounds.