The indexing or search request send to down node

Chimu · July 24, 2023, 12:52am

I have an Elasticsearch (v5.6.10) cluster with 3 nodes.

Node A : Master
Node B : Master + Data
Node C : Master + Data

There are 6 shards per data node with replication set as 1. All 6 primary nodes are in Node B and all 6 replicas are in Node C.

When I shutdown one node for maintenance I can see indexing or search requests still trying to reach that node and failing. I suspect this is because the client connecting to elastic is configured with all three node IPs.

Is there any way to avoid the requests reaching that down node?

dadoonet · July 24, 2023, 5:56am

I guess it depends on the client.

You must definitely upgrade everything. The cluster, the client... It's a way too old.

Chimu · July 24, 2023, 9:38pm

Can you please elaborate on that?

I am trying to find a solution where client does not face errors in this situation.

I saw that cluster status being yellow does not cause any issue during the maintenance period. But if a request goes to the down node, that’s when the problem occurs.

And, yes they are very old. I am planning for the upgrade, but it will take some time. For now I need to keep things running..

dadoonet · July 24, 2023, 11:41pm

Which client are you using?

Chimu · July 25, 2023, 1:09am

It’s a java client - “elasticsearch-rest-high-level-client” and all the elastic node ips are provided as a list while creating the rest client.

dadoonet · July 25, 2023, 9:00am

Did you add the sniffer? Sniffer | Elasticsearch Java API Client [8.8] | Elastic

Christian_Dahlqvist · July 25, 2023, 9:07am

The client contains a connection pool, which should mark connections as down once this has been detected. You could therefore see a few requests target the downed node before this is detected. This assumes you are using tbe client correctlt as a singleton and not creating it for each request.

Chimu · July 25, 2023, 9:42pm

Yes, I did. I was also looking into that. It seems I missed something.

Just to confirm, sniffer will remove any down node from the active node list and also add it back once it is up?

Chimu · July 25, 2023, 9:46pm

Yes, it is implemented as a singleton, but the connection to the down node is not automatically withdrawn from the connection pool.

dadoonet · July 26, 2023, 7:34am

I never used it but I guess it works that way according to the doc

It is also possible to enable sniffing on failure, meaning that after each failure the nodes list gets updated straightaway rather than at the following ordinary sniffing round. In this case a SniffOnFailureListener needs to be created at first and provided at RestClient creation. Also once the Sniffer is later created, it needs to be associated with that same SniffOnFailureListener instance, which will be notified at each failure and use the Sniffer to perform the additional sniffing round as described.

Chimu · July 27, 2023, 3:35am

I hope so too. I will give it a try and update here. Thanks a lot for all the guidance.

system · August 24, 2023, 3:35am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Issue with Coordinator node down Elasticsearch	6	914	June 20, 2018
When a master node goes down, how the client query works? Elasticsearch	4	4354	July 6, 2017
Cluster management Elasticsearch	6	350	July 6, 2017
Cluster connection issues when the machines hosting the nodes are restarted for service maintanance Elasticsearch	7	1011	July 6, 2017
Failover mechanism of master nodes Elasticsearch	4	864	October 21, 2020

The indexing or search request send to down node

Related topics