Thank you for your detail reply @DavidTurner
Do you need the "warm" tier at all? Do you need three data nodes or would two be enough?
The Hot/Warm architecture was already in place when I took over this project, however based on your recommendations I will look at this again. It does certainly appear to be excessive on reflection. I'm considering a three node setup for the following two reasons:
- I'm thinking about using a RAID 0 array for Elasticsearch data configured with a replication factor of 2. I've based this on somewhat old documentation but the logic still seems valid: Hardware | Elasticsearch: The Definitive Guide [2.x] | Elastic
- We have an "N+2" philosophy for all production kit, unless there is a good reason not to do this I'm inclined to keep this the same. We could absolutely start with two nodes however and then scale up, my concern then would be not wanting to run a RAID 0 array.
I think again the logic of N+2 comes into play with regard to 3 nodes. I expect the search load to be light however the types of search will usually involves massive amounts of results of large date ranges. Think along the lines of number of times an IP has hit our servers over a 3 month period. (We average ~600 requests per second through our loadbalancers).
I did consider this approach however I thought for a relatively low spend we would get the added benefit of "smart loadbalancing" (Scatter/Gather benefits) with coordinator nodes which we wouldn't get with HAProxy.
Thank you for this, I'll keep this in mind.
Once again, appreciate your input ![]()