Data Nodes disconnected randomly

Hi everyone,
I have an Elasticsearch (7.10.2) cluster with 11 cluster nodes (3 master, 8 data).
Randomly there are data nodes that start to disconnect from the cluster. The node is healthy but offline, it comes back online after about 15 minutes.
Seeing different discussions on the subject, configure the TCP Keepalive:

  •    net.ipv4.tcp_keepalive_time = 600
    
  •    net.ipv4.tcp_keepalive_intvl = 60
    
  •    net.ipv4.tcp_keepalive_probes = 20
    

Trying this configuration a few days the same problem occurred again.

Any other solution?

Thank you so much :slight_smile:

7.10 is EOL and no longer supported, you should be looking to upgrade as a matter of urgency.

You'd need to share your elasticsearch.yml and your logs for us to be able to comment further.

The output from the _cluster/stats?pretty&human API might also be helpful.

1 Like

What is in the logs of the nodes that are disconnecting? Given that you are using version 7.10.2, do you have any third party plugins installed that could affect the cluster behaviour?

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.