I know I have seen similar questions and am still struggling to figure out the best way to architect a 5 node cluster. This is for geo-spatial data and we expect very heavy indexing, roughly 200gb of data in 9 indexes, queries around 1100/second and, of course, minimal latency and ability to expand as needed. I want to follow best practices but it seems doing so would require a lot of overhead and resources (3 masters, however many data nodes necessary, a separate cluster just for marvel indices, Kibana on it's own etc.) and we are really trying to keep to the 5 nodes to start. Is it better to have 3 dedicated masters and two data nodes maxed out at 64GB of Ram with a 31GB heap? Or have one dedicated master and 4 data/master nodes? What about sending marvel indices elsewhere? Would you anticipate an issue if the marvel indices were actually sent to a cluster at another site (so all marvel indices could be consolidated across my environments)?
Any assistance with suggestions/explanations would be very helpful. Thank you.