2 Data nodes being used unevenly

pokaleshrey · September 21, 2019, 9:24am

ES v6.4.1
3 master, 2 data, 2 coordinating nodes
One replica per shard. I use _routing.

Can somebody please explain the behavior below?

When I send continuous requests to both of my coordinating nodes, I observe that only one data node is utilized at a time.
While the search threads on one data node are running, the ones on the other data node are in the parked state, and vice versa.

It's evident from the image below:

Christian_Dahlqvist · September 21, 2019, 9:40am

What is the configuration of your nodes? Do you use preference in addition to routing?

pokaleshrey · September 21, 2019, 9:43am

Master - 4 cores, 8 GB RAM
Data - 16 cores, 64 GB RAM (10 GB heap allocated)
Coord - 8 cores, 32 GB RAM

No, we do not use preference. Only routing is used.

Christian_Dahlqvist · September 21, 2019, 10:00am

What does your elasticsearch.yml look like? Any non-default settings?

pokaleshrey · September 21, 2019, 10:05am

My expectation is that both data nodes should serve requests equally at the same time.
From the utilization of the search threads, it seems that traffic is routed to either the primary shard or the replica.

The data node yml is as simple as below, literally nothing extra:

cluster.name: prodcluster
search.remote.connect: false
node.name: data-01
node.master: false
node.data: true
node.ingest: false
node.ml: false
xpack.ml.enabled: false
network.host: site
network.publish_host: site
discovery.zen.ping.unicast.hosts:
path.data:
path.logs:
path.repo:

Christian_Dahlqvist · September 21, 2019, 12:12pm

Have you set discovery.zen.minimum_master_nodes to 2? If not, it is possible you could suffer from network partitions, which can have strange and unexpected effects.

pokaleshrey · September 21, 2019, 12:16pm

discovery.zen.minimum_master_nodes=2 has been set in the elasticsearch.yml of the master nodes only.

Does it need to be in the yml of the data nodes as well?

Christian_Dahlqvist · September 21, 2019, 12:25pm

I don’t think so but am not sure.

pokaleshrey · September 21, 2019, 12:33pm

Keeping that setting in the master node configs has served us well for about 2 years in production.

I had never noticed this behavior until today, when I coincidentally saw that the two data nodes serve requests in an alternating manner. Strange one.

The background to this is a performance issue. Performance degrades when we send requests with a different routing value per request, mixed together.

Request 1: routing=a
Request 2: routing=b
Request 3: routing=c

When N requests with these values are sent in a mixed way, performance suddenly drops.

But if I send thousands of requests for, say, routing=a only, performance improves. The same holds for any single routing value on its own. Any lead as to why this issue might occur?

Christian_Dahlqvist · September 21, 2019, 1:10pm

Sounds like a caching issue. You may want to try using the routing value as the preference parameter as well.
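
A minimal sketch of that suggestion, assuming the searches go through the REST `_search` endpoint. The host `coord-01`, the index name `my-index`, and the helper `search_url` are hypothetical and only for illustration. The idea is that requests sharing a custom preference string should be sent to the same shard copies, so the caches on those copies stay warm for that routing value.

```python
from urllib.parse import urlencode


def search_url(base_url, index, routing):
    """Build a _search URL that reuses the routing value as the preference string."""
    # Preference values starting with "_" are reserved (for example _local),
    # so a custom string is prefixed defensively if it would collide.
    preference = routing if not routing.startswith("_") else "r" + routing
    query = urlencode({"routing": routing, "preference": preference})
    return f"{base_url.rstrip('/')}/{index}/_search?{query}"


# Hypothetical host and index, for illustration only.
print(search_url("http://coord-01:9200", "my-index", "a"))
```

Whether this helps depends on how much of the slowdown is really cache related, so it is worth measuring with and without the parameter.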

Yashwantbokadia · September 22, 2019, 5:46am

In the data nodes' yml file it is not required to enter "discovery.zen.minimum_master_nodes=2"

pokaleshrey · September 22, 2019, 2:30pm

I doubt preference will be of any help, as I need proper load balancing.

I was relying on the 2 coordinating nodes to do the smart load balancing for me.
I am connecting to both of my coordinating nodes from my Java client.
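
For illustration, a rough sketch of what alternating between the two coordinating nodes looks like on the client side. It is written in Python rather than Java to keep it short, and the class `RoundRobinHosts` and the host names are hypothetical. Note that this only balances which coordinating node receives a request; which data node's shard copy actually executes the search is decided on the Elasticsearch side, not by the client.

```python
from itertools import cycle


class RoundRobinHosts:
    """Hand out coordinating-node URLs in strict rotation."""

    def __init__(self, hosts):
        hosts = list(hosts)
        if not hosts:
            raise ValueError("at least one host is required")
        # cycle() repeats the list endlessly in its original order.
        self._hosts = cycle(hosts)

    def next_host(self):
        return next(self._hosts)


# Hypothetical coordinating node addresses.
hosts = RoundRobinHosts(["http://coord-01:9200", "http://coord-02:9200"])
for _ in range(4):
    print(hosts.next_host())
```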
