Looking to further my foray into elasticsearch as we are pretty happy with our small scale tests.
As I will be moving into production (running in AWS on EC2) I am looking to make sure I have architected correctly. Part of my issue is that I don't really understand what the node descriptions mean for their hardware requirements - https://www.elastic.co/guide/en/elasticsearch/reference/current/modules-node.html
For master only nodes (of a small cluster) I am looking at having 3, starting with the t2.medium version as it doesn't appear that the Masters do a great deal when using a small cluster. When I need to increase my cluster size in the future I can increase the size of the master nodes. If i was to increase the size of these node, would they be skewed more towards greater RAM, CPU or somewhere in between (C4, M4 or R3)?
For Data + Client Nodes I am looking at starting with 2 Nodes of r3.xlarge. I wouldn't imagine that I wouldn't see much of a benefit in Client only nodes at the moment.
In the future however, would a Data only Node be skewed more towards greater RAM, CPU or somewhere in between (C4, M4 or R3)?
Ditto for Client Only Node.
My workload is heavily skewed towards search / aggregations rather than data ingest.
Should the following node types use M4, R4 or C3 instance types in AWS