How does a node behave with the failure of a data disk?

clark · July 9, 2015, 2:12pm

Does the node go offline?

or

Are only the shards in that path.data effected?

or

Something completely different?

Thanks.

shat · July 9, 2015, 7:51pm

I am curious what the consensus from the ES community is on disk failures. I've read some stories, both success and horror about replacing "failed" drives. Supposedly, if the cluster goes yellow and you have a failed disk -- you can replace it because yellow means the primary shards are still okay; but I do not have definitive answers.

Hopefully someone smarter than I will chime in. Hate depending upon random Google Group answers or non green checked stack overflow responses. ;(

Pradeep_Gowda · July 9, 2015, 8:55pm

Adding to this, I came across java.lang.OutOfMemoryError: Java heap space when my disc was full and all clients started throwing NoNodeAvailableException. Hope ES provide some way to catch this type of exception early so that we wont loose data.

warkolm · July 10, 2015, 1:28am

Depends on how you have things setup - are you using path.data settings to each disk? Are you using RAID, if so what level?

warkolm · July 10, 2015, 1:29am

This is not really related to the topic at hand.

However look for disk threshold watermarks in the docs, we do try to prevent this sort of thing.

Topic		Replies	Views
Data loss with 0.19.8 Elasticsearch	3	636	July 6, 2017
How to replace failed disk when using multiple path.data entries? Elasticsearch	4	2708	July 5, 2017
What happens if data folder is wiped out on individual nodes Elasticsearch	12	1554	December 25, 2018
Replace failing disks on a single node Elasticsearch	4	1391	July 6, 2017
Node Starts Without Its Shards Elasticsearch	11	1732	July 5, 2017

How does a node behave with the failure of a data disk?

Related topics