Using client nodes for reducing memory pressure from data nodes

Eli_Revach · November 5, 2017, 6:03pm

Hi ,
ES version 1.7.5 .

We have high memory pressure in our data nodes mainly by segments memory ( around 65% of the memory ) .
This left enough very small memory for queris and we keep getting OLD GC's .

We considure using client nodes for reducing the pressure from the data nodes .

Our querys main use cases are :

pulling querys - its a simple querys that pull 5000 events from ES , order by desc by one of the the columns that we have on the events index .
We have some aggrigations .

We add one client node to our prod env , and run most of our use cases querys on top of it , when i monitor the JVM utilization of the client node during query execution , i see very minor utilization per query comparing to what i see when we run it directly on one of the data node . Its seems like the client node do very small part of the execution . For the first use case I would except to see some memory utilization as it should bring all the data from the data nodes and perform and sort it localy on the client node , but the utiliztion is so small , so i guess its run on one of the data nodes , and we get the end result ( our assumtion was that client node suppose to act as reducer node that get the data from the data nodes and do the sort locay )

Any idea if client nodes is good for our use case , or when client nodes will be useful

Thanks

Christian_Dahlqvist · November 5, 2017, 6:32pm

Dedicated client nodes take some load away from the data nodes, e.g. request parsing and collection of results from the data nodes. To what extent it will help depends on your query types and patterns. If it does not help much in your use case, it may be better to try and address the main source of the problem, which based on your description seems to be segment memory.

How much data do you have in the cluster? How many indices and shards do you have in the cluster? What is the size and specification of the cluster?

Eli_Revach · November 5, 2017, 6:34pm

where the sort suppose to run ? on the data node or on the data node (On our first use case )

Eli_Revach · November 5, 2017, 6:35pm

"collection of results from the data nodes." - can you elborate more on this part ?

And Thanks for helping

system · December 3, 2017, 6:36pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Opinions on using different client nodes for indexing and searching Elasticsearch	5	951	July 5, 2017
ElasticSearch client options Elasticsearch	3	420	July 6, 2017
Choosing between client nodes and data nodes Elasticsearch	4	831	July 5, 2017
Impact of client nodes on aggregations/Optimization of heavy aggregations Elasticsearch	6	3122	July 5, 2017
Elasticsearch client nodes Elasticsearch	2	879	July 5, 2017

Using client nodes for reducing memory pressure from data nodes

Related topics