I have an 8-node ELK cluster: 3 dedicated data nodes, 3 dedicated master nodes and 2 HTTP (Kibana) nodes. Indices are currently created daily with a date stamp. Each one holds around 21,293,408 docs on average and is 15 to 25 GB in size. Each index currently has 1 shard and 1 replica. I am trying to achieve fault tolerance, so that the cluster can recover smoothly if two data nodes fail. How many shards and replicas do I need?
If you want to be able to handle 2 data nodes going down at once without data loss, you need the number of replicas set to 2. I am, however, not sure that you would be able to continue indexing with only one shard copy remaining, as a quorum might be required. If you add another data node you should be fine though.
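To make the replica arithmetic concrete, here is a minimal Python sketch. It relies on the fact that Elasticsearch does not place two copies of the same shard on one node. The helper name `tolerated_node_failures` is made up for illustration, and the dictionary is only the request body you would send to the index settings endpoint (`PUT /<index>/_settings`); nothing here talks to a cluster, and it says nothing about whether indexing continues, only about data loss.

```python
import json


def tolerated_node_failures(replicas: int, data_nodes: int) -> int:
    """How many data nodes can be lost before some shard has no copy left.

    Hypothetical helper for illustration only.
    """
    if data_nodes < 1:
        raise ValueError("need at least one data node")
    # Each shard has 1 primary + `replicas` copies, and no node holds two
    # copies of the same shard, so at most `data_nodes` copies get assigned.
    assigned_copies = min(1 + replicas, data_nodes)
    return assigned_copies - 1


# Request body for the index settings endpoint; the index name is up to you.
settings_body = {"index": {"number_of_replicas": 2}}

print(json.dumps(settings_body))
print(tolerated_node_failures(replicas=1, data_nodes=3))  # current setup
print(tolerated_node_failures(replicas=2, data_nodes=3))  # with 2 replicas
```

With 3 data nodes, 1 replica survives the loss of one node and 2 replicas survive the loss of two, which matches the advice above.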
Thanks both. We only lose a data node while patching is in progress, and after patching the nodes come back online quickly.
Right now I have 3 data nodes. I am going to increase that to 4, or to 5 if ELK needs an odd number. I may also need to change to 3 shards (one per node) and 2 replicas (which will be placed on the other nodes, I guess).
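As a rough way to compare those options, the sketch below counts shard copies for a given number of primaries, replicas and data nodes. It assumes the copies end up spread as evenly as possible across nodes and that the index size splits evenly across primaries; real allocation also depends on disk usage and other settings. The function name `shard_layout` and its output keys are hypothetical. As far as I know, the odd-number rule of thumb concerns master-eligible nodes rather than data nodes.

```python
def shard_layout(primaries: int, replicas: int, data_nodes: int, index_gb: float) -> dict:
    """Rough per-index shard layout (illustrative helper, not an Elasticsearch API)."""
    # Every primary shard has `replicas` additional copies.
    total_copies = primaries * (1 + replicas)
    # Assumption: copies are balanced as evenly as possible across data nodes.
    base, extra = divmod(total_copies, data_nodes)
    return {
        "total_shard_copies": total_copies,
        "copies_per_node_min": base,
        "copies_per_node_max": base + (1 if extra else 0),
        # Assumption: documents are spread evenly over the primaries.
        "gb_per_primary": index_gb / primaries,
        # Replicas are full copies, so disk use scales with (1 + replicas).
        "total_disk_gb": index_gb * (1 + replicas),
    }


for nodes in (3, 4, 5):
    print(nodes, shard_layout(primaries=3, replicas=2, data_nodes=nodes, index_gb=25))
```

For a 25 GB daily index, 3 primaries with 2 replicas means 9 shard copies and about 75 GB on disk per day, so the replica count affects storage as well as fault tolerance.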