I noticed something intriguing. The numbers using a 3-node Elasticsearch is smaller than the number obtained using the default one node cluster. Why is that? Shouldn't a 3-node cluster provide better indexing numbers that a single node with default configuration?
Broadly speaking, yes, 3-node should provide higher indexing throughput.
Indeed this is the case with other tracks: nyc taxis, pmc.
However, one difference between the 3-node and the default benchmarks is that the former has 1 replica, which reduces the initial indexing performance (see also here). Additionally http_log, as well as geonames and geopoint have a very small doc size and get impacted more.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.