Aggregation and client node - does client pull in complete doc data for aggregation?

virtuman · July 2, 2015, 12:46am

We recently started running into network capacity limitations and are trying to figure out whether our deployment is set up the way it is intended to be:

We have 4 Elasticsearch data nodes.
All of our web servers are 64 GB machines with 2 LAN interfaces: one for the public network and one for the internal network switch.
Every web server runs an Elasticsearch client node on the same machine.

Question 1:
Is it a correct setup to have a client node running on each web server? I'm assuming the benefit is that we can rely on Elasticsearch's internal mechanisms to route requests to the proper server based on which nodes are up or down.

Question 2:
We have a few very heavy aggregations on the public-facing website. Do aggregation requests pull the data into the client node first, before the actual aggregation is performed, or do the aggregations happen on the data nodes themselves before the results are sent to the client node? My understanding is that the client node is the one that performs the aggregation, which would explain why our internal network gets overloaded whenever heavy aggregations run. Is that right?
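To make Question 2 concrete, here is a toy sketch in plain Python of the two possibilities I'm asking about. It makes no Elasticsearch calls, and the function names, the `category` field, and the document counts are all made up for illustration. It does not claim which model Elasticsearch actually follows; it only shows why the difference would matter for our internal network.

```python
from collections import Counter

# Hypothetical toy data: 4 "data nodes", each holding 10,000 small documents.
CATEGORIES = ["books", "music", "games"]

def make_shard(node_id, n_docs):
    return [{"id": f"{node_id}-{i}", "category": CATEGORIES[i % 3]}
            for i in range(n_docs)]

shards = [make_shard(node_id, 10_000) for node_id in range(4)]

def client_side_aggregation(shards):
    """Possibility A: every document travels to the client node, which counts."""
    items_sent = 0
    counts = Counter()
    for shard in shards:
        for doc in shard:
            items_sent += 1              # one whole document crosses the network
            counts[doc["category"]] += 1
    return counts, items_sent

def data_node_aggregation(shards):
    """Possibility B: each data node counts locally and sends only its buckets."""
    items_sent = 0
    counts = Counter()
    for shard in shards:
        partial = Counter(doc["category"] for doc in shard)  # computed on the data node
        items_sent += len(partial)       # only the bucket counts cross the network
        counts.update(partial)           # client node merges the partial results
    return counts, items_sent

counts_a, sent_a = client_side_aggregation(shards)
counts_b, sent_b = data_node_aggregation(shards)

print("same result:", counts_a == counts_b)
print("items sent, client-side:", sent_a)
print("items sent, data-node-side:", sent_b)
```

In this toy model both approaches give the same counts, but the first ships every document over the internal network while the second ships only a handful of bucket counts per data node.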

Thank you for any input
