I'll have 5 ES nodes in my cluster, at this time I do not know how many
documents we are going to have or what the average queries per second will
be as we haven't go live yet.
Should I set up one of the nodes as a HTTP node to take some of the
processing load of the data nodes or will having a cluster of 5 data nodes
(i.e. no HTTP nodes) be sufficient to handle the processing plus indexing
and searching?
If you will access with two clients only, there is almost no need in
preparing extra nodes for them.
Criteria for setting up HTTP proxy nodes are massive parallel HTTP accesses
(a few hundred requests in parallel) and large result sets, which can
legitimate a separation of index/search data nodes from proxy nodes for
HTTP traffic handling and response generation phase.
If I have 5 ES nodes in my cluster which ES node should should my clients
point to, the master node? If the master node goes down and a new one is
elected how will my client communicate with Elasticsearch, they will still
be pointing the the node that is down?
On Sunday, October 27, 2013 1:56:31 PM UTC, Jörg Prante wrote:
If you will access with two clients only, there is almost no need in
preparing extra nodes for them.
Criteria for setting up HTTP proxy nodes are massive parallel HTTP
accesses (a few hundred requests in parallel) and large result sets, which
can legitimate a separation of index/search data nodes from proxy nodes for
HTTP traffic handling and response generation phase.
If I have 5 ES nodes in my cluster which ES node should should my clients
point to, the master node? If the master node goes down and a new one is
elected how will my client communicate with Elasticsearch, they will still
be pointing the the node that is down?
On Sunday, October 27, 2013 1:56:31 PM UTC, Jörg Prante wrote:
If you will access with two clients only, there is almost no need in
preparing extra nodes for them.
Criteria for setting up HTTP proxy nodes are massive parallel HTTP
accesses (a few hundred requests in parallel) and large result sets, which
can legitimate a separation of index/search data nodes from proxy nodes for
HTTP traffic handling and response generation phase.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.