Hey,
I have about 7 million documents in my ES cluster using default
configurations (5 shards, 1 replica).
I set up 4 virtual machines with similar configurations on a 8 core server,
each virtual machine assigned with 2 virtual cores and 4 GB memory. On each
node I installed Elasticsearch.
I did several search tests to measure query response time and the results
are a bit confusing:
When using only a single node, I get the best response time.
When using a 2-node cluster response time increases significantly,
meaning the response time gets worse (the more found documents the worse
the response time)
For a three- or four-node cluster the response time improves again and
is at the level of the single-node configuration.
Any idea why the 2-node cluster has such a bad performance compared to the
other configurations?
Hey,
I have about 7 million documents in my ES cluster using default
configurations (5 shards, 1 replica).
I set up 4 virtual machines with similar configurations on a 8 core
server, each virtual machine assigned with 2 virtual cores and 4 GB memory.
On each node I installed Elasticsearch.
I did several search tests to measure query response time and the results
are a bit confusing:
When using only a single node, I get the best response time.
When using a 2-node cluster response time increases significantly,
meaning the response time gets worse (the more found documents the worse
the response time)
For a three- or four-node cluster the response time improves again and
is at the level of the single-node configuration.
Any idea why the 2-node cluster has such a bad performance compared to the
other configurations?
I guess I found the problem: The results that 2 nodes perform much worse,
cannot be reproduced.
There has been going on something else on the server at that time when I
executed the tests for 2 nodes, which influenced the results. (Probably
somebody copied a large file to or from the server, which made the disk
kind of busy)
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.