Our cluster contains 3 master nodes and 4 data nodes ,with each node has 64G memory and 24 Cores.
We have 57 indexs and 570 shards (one primary and one replic), the number of docs is about 30 Billion.
The circumstances is following:
when I restart our cluster by some reason, the cluster is in yellow status and gets allocating.Everything seems to be fine. After several hours later , I found one of data nodes get crashed and the status of cluster is red.
I used jps command in THE crashed node, it didn't show me the elasticsearch process.
I am confused that I didn't do any operation and query to the cluster, only waiting it allocating, it still can get one nodes crashed.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.