What are the best (preferred) methods to recover data from a corrupted shard?

I recently ran into an out of space issue (I know, bad planning) and I would like to recover as much data from the shard as possible. I can see the data files under the node directory for that particular index and was wondering if there were any tools available to attempt to recover data from a shard (using Elasticsearch 7.4 currently).

I've seen online tutorials such as this one: https://grh.am/2018/recovering-a-corrupted-elasticsearch-index-shard/

Is this still a viable method to recover data?

Thank you!


there is something that I do not understand. If you are running out space, elasticsearch will stop writing to the shards this instance holds. Everything should be readable and the elasticsearch instance will still be available and you can query it. At the moment you delete something from the harddisk or expand it somehow and elasticsearch checks that the harddisk has not reached it watermark, it will start writing to the shards again.

What version of elasticsearch are you running? Is it a cluster? How many nodes?

Hey there! Thanks for replying. I used the tool Glen mentioned here: Option missing "fix" on startup

The reason it was still writing is because I disabled watermark checks and forgot to re-enable them a while back. :-/

But the data was restored!

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.