Cluster locks up if master node filesystem becomes read-only

jprante · January 2, 2016, 12:51am

There are two solutions.

First one is ES-only. The reason why ES is not shutting down automatically is because org.elasticsearch.env.NodeEnviroment keeps a java.nio.file.FileStore which is never monitored by calling isReadOnly() method regularly.

The java.nio.file.FileStore of all writable paths would have to be monitored for emergency stop in such kind of event.

To make the cluster drop a node with readonly file store, you would have modify the code and submit a patch.

The second solution: to detect general hardware malfunctions, JVM-based methods are quite not sufficient. Hence, ES is the not the best place to implement that, but the OS. You have to set up server monitoring software which can understand SNMP or IPMI or triggers for mcelog https://github.com/andikleen/mcelog that can kill ES (and other) processes in case of severe events.

Topic		Replies	Views
Read-only file system Elasticsearch	7	420	February 6, 2025
Handling unmounted data volume Elasticsearch	2	859	February 4, 2021
Healthy cluster is completely hosed by one node failing Elasticsearch docker	6	1555	November 4, 2019
ES node remained green on VM although underlying disk failed Elasticsearch	7	511	May 16, 2019
Node seems to lock up randomly Elasticsearch	4	512	January 5, 2017

Cluster locks up if master node filesystem becomes read-only

Related topics