We are migrating our old system to Elasticsearch. Our data centers are distributed across two locations (one primary and one DR). Below is the structure we are considering for production.
We have about 630 GB of existing data (approx. 14 million documents), which we are going to index into ES before putting it into production. Once it's in production, we expect to index about 1,000 documents daily, while reads can reach up to 50 hits per second.
I am planning to use the following configuration:
- Number of primary shards: 50
- Number of replicas: 1
- Each machine has 2 Elasticsearch instances running: 1 Master and 1 Primary
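To make the shard and replica numbers concrete, here is a minimal sketch of the index-creation request I have in mind. The index name `documents` is only a placeholder (an assumption, not our real name), and the snippet just builds and prints the request instead of sending it to a cluster.

```python
import json

# Placeholder index name (hypothetical, not the real one).
INDEX_NAME = "documents"

# Shard and replica counts taken from the list above.
settings_body = {
    "settings": {
        "index": {
            "number_of_shards": 50,
            "number_of_replicas": 1,
        }
    }
}


def create_index_request(index_name, body):
    """Return (method, path, json_payload) for a create-index REST call."""
    return "PUT", "/" + index_name, json.dumps(body, indent=2)


method, path, payload = create_index_request(INDEX_NAME, settings_body)
print(method, path)
print(payload)
```

The number of primary shards is fixed when the index is created, whereas the replica count can be changed later, so the value 50 is the part I most want feedback on.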
At regular intervals we will purge documents from ES, so the total index size should stay roughly the same.
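For a rough sense of scale, the numbers above work out as in the sketch below. It assumes the on-disk index size equals the raw 630 GB (in practice it can be larger or smaller), that shards are spread evenly, and that there are the two data nodes used in this setup.

```python
# Figures from this post; on-disk size == raw size is an assumption.
TOTAL_DATA_GB = 630
TOTAL_DOCS = 14_000_000
PRIMARY_SHARDS = 50
REPLICAS = 1
DATA_NODES = 2
WRITES_PER_DAY = 1_000
READS_PER_SECOND = 50

# Per-shard load for the primaries.
gb_per_primary = TOTAL_DATA_GB / PRIMARY_SHARDS
docs_per_primary = TOTAL_DOCS // PRIMARY_SHARDS

# Each primary has REPLICAS extra copies.
total_shard_copies = PRIMARY_SHARDS * (1 + REPLICAS)
total_storage_gb = TOTAL_DATA_GB * (1 + REPLICAS)

# Even spread across the data nodes (assumed).
shards_per_node = total_shard_copies // DATA_NODES
storage_per_node_gb = total_storage_gb / DATA_NODES

# Read/write balance over one day (86,400 seconds).
reads_per_day = READS_PER_SECOND * 86_400
read_write_ratio = reads_per_day / WRITES_PER_DAY

print(f"GB per primary shard: {gb_per_primary:.1f}")
print(f"Docs per primary shard: {docs_per_primary}")
print(f"Shard copies in total: {total_shard_copies}")
print(f"Shards per data node: {shards_per_node}")
print(f"Storage per data node (GB): {storage_per_node_gb:.0f}")
print(f"Reads per write: {read_write_ratio:.0f}")
```

So under these assumptions each primary shard holds about 12.6 GB, each data node carries 50 shard copies, and the workload is heavily read-dominated.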
HA is not a requirement for us, which is why I have only 1 master node, 1 master-eligible node (node.master = true), and 2 data nodes (node.data = true). I will keep the master-eligible node shut down and, if something happens to the active master, bring it up manually.
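The node layout described above could be written down roughly as follows. This is only a sketch: the node names are made up, and setting `node.data: false` on the master nodes and `node.master: false` on the data nodes is my assumption about how dedicated roles would be configured. It uses the `node.master` / `node.data` settings mentioned above; newer Elasticsearch versions express the same thing with `node.roles`.

```python
# Hypothetical node names; role flags follow the description above.
# The "false" values are an assumption about dedicated roles.
NODES = {
    "master-1": {"node.master": True, "node.data": False},  # active master
    "master-2": {"node.master": True, "node.data": False},  # standby, kept down
    "data-1": {"node.master": False, "node.data": True},
    "data-2": {"node.master": False, "node.data": True},
}


def render_yaml(settings):
    """Render a flat settings dict as elasticsearch.yml-style lines."""
    return "\n".join(f"{key}: {str(value).lower()}" for key, value in settings.items())


master_eligible = [name for name, s in NODES.items() if s["node.master"]]
data_nodes = [name for name, s in NODES.items() if s["node.data"]]

for name, settings in NODES.items():
    print(f"# elasticsearch.yml fragment for {name}")
    print(render_yaml(settings))
    print()

print("Master-eligible nodes:", master_eligible)
print("Data nodes:", data_nodes)
```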
Is this the right design, given these requirements?