Over the weekend, the 3 master nodes in a 7 node cluster would not start up due to lack of space in the data path and the logs path. Only one of the mast nodes came up. But the data nodes started to complain about bad state files.
I had to relocate the data and log paths (due to system issues(vormetric/encryption/security) I could not copy the data folder to new location). Once relocated, the master nodes came up but, the data nodes would not. They started to complain about state files.
A couple of questions :
Can state be recovered in such a situation ?
If I deleted the corrupt state files, would they be recreated ?
if they cannot be recreated, can the data be recovered ?
Any help/ insight into this is much appreciated.
ES version 5.2
running on 3 Master Nodes (VMs), 4 data Nodes and 4 client nodes running on 4 servers.
Kibana 5.2 running on the data node servers as well.
@warkolm, Thanks for that. However, the documentation does not ell me how to leverage the Dangling indicies functionality... any pointers to hat is much appreciated
The situation is this :
there aren't any dangling indices that show up. but the data is taking up physical space on the data disk. the structure seems to exists except ES has no idea how to recover it. Any pointers on that ? I could try going node by node and doingstuff.. because at this point I am willing to try anything.
I was able to recover the master nodes so it coud find what the state ws before the disaster. However when I get the shard status, it shows all shards for the indices as being UNASSIGNED. So now my data node is seeing the content but doe snot know how to get at it(??) Any way I force the data node to refresh state for those indices ?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.