I have an index that is 1 shard with 2 replicas, and only 3 nodes in the cluster. If I make a query against this index, is it randomly routed to 1 node in the cluster, or is it routed to all nodes in the cluster (and a response accepted from whichever shard responds first)?
If I have a low-data, write-light, and query-heavy workload (like percolation), am I better served by having more nodes, and a replica on each? Or does adding another node with another copy of the data just mean that I have 4 nodes running every query instead of 3?