I'm running Elasticsearch 1.1.0 on 3 servers, each has 24 cores and 72GB
memory, Elasticsearch is given 16GB for heap space. Out of the default
settings, I got some tweaks in elasticsearch.yml:
I'm trying to index 150+ million records but when we reached 30 million
documents, one of the server got really high CPU load (20-30) while the
other two were not really busy (load was only 1-3).
Also, there's only one index with 20 shards in the cluster, replication
factor set to 1 and refresh_interval is 30 seconds.
Any suggestion to re-balance the load of this cluster ?
I'm running Elasticsearch 1.1.0 on 3 servers, each has 24 cores and 72GB
memory, Elasticsearch is given 16GB for heap space. Out of the default
settings, I got some tweaks in elasticsearch.yml:
I'm trying to index 150+ million records but when we reached 30 million
documents, one of the server got really high CPU load (20-30) while the
other two were not really busy (load was only 1-3).
Also, there's only one index with 20 shards in the cluster, replication
factor set to 1 and refresh_interval is 30 seconds.
Any suggestion to re-balance the load of this cluster ?
I'm running Elasticsearch 1.1.0 on 3 servers, each has 24 cores and 72GB
memory, Elasticsearch is given 16GB for heap space. Out of the default
settings, I got some tweaks in elasticsearch.yml:
I'm trying to index 150+ million records but when we reached 30 million
documents, one of the server got really high CPU load (20-30) while the
other two were not really busy (load was only 1-3).
Also, there's only one index with 20 shards in the cluster, replication
factor set to 1 and refresh_interval is 30 seconds.
Any suggestion to re-balance the load of this cluster ?
I think I know the reason of this issue, it looks that the busy server is
handling most of the Bulk request while other nodes rarely share the load
(screenshot attached)
I supposed that Bulk requests should be distributed evenly among the
servers but turns out that they are not. Any thoughts on this?
Thanks,
Huy Phan
On Thursday, 10 April 2014 22:09:17 UTC+8, Huy Phan wrote:
I'm running Elasticsearch 1.1.0 on 3 servers, each has 24 cores and 72GB
memory, Elasticsearch is given 16GB for heap space. Out of the default
settings, I got some tweaks in elasticsearch.yml:
I'm trying to index 150+ million records but when we reached 30 million
documents, one of the server got really high CPU load (20-30) while the
other two were not really busy (load was only 1-3).
Also, there's only one index with 20 shards in the cluster, replication
factor set to 1 and refresh_interval is 30 seconds.
Any suggestion to re-balance the load of this cluster ?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.