I have an index that is 1 shard with 2 replicas, and only 3 nodes in the cluster. If I make a query against this index, is it randomly routed to 1 node in the cluster, or is it routed to all nodes in the cluster (and a response accepted from whichever shard responds first)?
If I have a low-data, write-light, and query-heavy workload (like percolation), am I better served by having more nodes, and a replica on each? Or does adding another node with another copy of the data just mean that I have 4 nodes running every query instead of 3?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.