Hi everyone,
I have an Elasticsearch (7.10.2) cluster with 11 cluster nodes (3 master, 8 data).
Randomly there are data nodes that start to disconnect from the cluster. The node is healthy but offline, it comes back online after about 15 minutes.
Seeing different discussions on the subject, configure the TCP Keepalive:
-
net.ipv4.tcp_keepalive_time = 600 -
net.ipv4.tcp_keepalive_intvl = 60 -
net.ipv4.tcp_keepalive_probes = 20
Trying this configuration a few days the same problem occurred again.
Any other solution?
Thank you so much ![]()