A node in a cluster can be configured to serve exclusively as: a data,
master or "client" node. By deciding a node's role a-priori, I suspect one
should tweak its hardware (more RAM, less CPU, etc.) to fit. What would be
the (relative) recommendations per role?
data nodes (not master eligible, not serving requests):
- lots of RAM for faceting and sorting
- lots of CPU for indexing, analyzing and querying
master nodes (no data, not serving requests):
seems it's not doing much... lazy master nodes!
client nodes (not master eligible, no data):
- lots of CPU for aggregating and serving lots of parallel requests
- is RAM still important here (is it affected by faceting and sorting?)
If there are several master-only nodes, are they all idle except for one at
any given time?
Anyone have experience in deploying a cluster with "load-balancing" clients
for serving requests?