I'm doing consultant work for a large government organization that need
some guidelines for how to set up ElasticSearch or Solr for their projects.
One topic that has surfaced several times is the validity in running
several ES instances per node. The use-case is that several smaller
projects (about 10-15)
need access to a search cluster with replication and redundancy, so I'm
guessing that a cluster size of 3-4 nodes will be typical. The argument for
running several instances per node vs having multiple indices per instance
is that each project gets more "control" over the environment during events
such as full
cluster restarts or version upgrades.
However, I have read that the ES team does not recommend running multiple
instances per node unless you have tons of memory (to preserve compressed
oops).
It would be very helpful to know the reasoning behind this, so why is
running multiple instances not recommended?
Basically because of IO.
Each node could potentially being a noisy neighbor.
Because of HEAP as well. Instead of needing 8 Gb HEAP per project (it's a random number here - don't take it as a mandatory value!), may be sharing 16 Gb HEAP with 10 projects will be fine?
Le 26 novembre 2013 at 12:26:12, Gunnar Gauslaa Bergem (gauslaabergem@yahoo.no) a écrit:
I'm doing consultant work for a large government organization that need some guidelines for how to set up ElasticSearch or Solr for their projects.
One topic that has surfaced several times is the validity in running several ES instances per node. The use-case is that several smaller projects (about 10-15)
need access to a search cluster with replication and redundancy, so I'm guessing that a cluster size of 3-4 nodes will be typical. The argument for
running several instances per node vs having multiple indices per instance is that each project gets more "control" over the environment during events such as full
cluster restarts or version upgrades.
However, I have read that the ES team does not recommend running multiple instances per node unless you have tons of memory (to preserve compressed oops).
It would be very helpful to know the reasoning behind this, so why is running multiple instances not recommended?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.