Hello,
We are in the process of building a new ELK cluster, we are looking to incorporate HA and data resiliency in a DR setup (across 2 sites).
We will be using a SAN in each of our 2 sites and will be looking to be ingesting anywhere between 500-700GB and retain the data in the cluster for about 1 year.
We are struggling to determine if we should be using containers or VMs.
Another thing we are trying to understand is if we can make the storage independent from the elasticsearch nodes - ex. if a node fails the data is preserved on the SAN.
In terms of DR, we have also been exploring the different options of using CCR, doing logshipping to the secondary site or simply leveraging the snapshot/restore feature from elasticsearch.
Few of our biggest goals here are: 1) Make it as easy as possible to scale up; 2) Make it as hard as possible to lose data; 3) Make it as easy as possible to restore in case of a DR scenario.
We wanted to see if the community can give us some advise/help us determine which of the above paths we should/shouldn't take.
Thanks!