I am stuck in a very weird situation.
My 3-node ES cluster abruptly fails after 8-10 days with the error:
[WARN ][o.e.c.c.ClusterFormationFailureHelper] [elasticsearch-0.es-service] this node is unhealthy: health check failed due to broken node lock
No changes are made to the cluster during that period, and no other Elasticsearch instance is running.
ES Version: 8.6.2
Deployment: 3-node cluster on k8s
Can anyone please suggest a solution or points to investigate?
That error means Elasticsearch saw a change in its data directory for which it was not responsible, so it stops all write activity to protect your data. To fix it, remove whatever other process is making such changes and then restart Elasticsearch.
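As a starting point, it can help to look at the node lock file itself. A minimal sketch, assuming the default container data path `/usr/share/elasticsearch/data` (your `path.data` may differ) and that `stat`/`lsof` are available inside the node:

```shell
# Assumed data path -- adjust to your path.data setting.
ES_DATA="${ES_DATA:-/usr/share/elasticsearch/data}"

# When was the node lock last touched, and who owns it?
# A timestamp or owner that does not line up with an Elasticsearch
# restart points at an outside process.
stat -c 'mtime: %y  owner: %U:%G' "$ES_DATA/node.lock"

# Which processes currently have files open under the data path?
# On a healthy node this should list only the elasticsearch process.
command -v lsof >/dev/null && lsof +D "$ES_DATA" | head
```

Run this both on a healthy node and on the failed one and compare.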
Thanks for the reply!!
Can you also please share some insight into how to identify other processes that could modify the data directory?
Just to add: it's a 3-node cluster with each node running on a separate worker node, and the issue generally occurs after 10-12 days of the cluster running.
Common culprits include misconfigured/buggy security scanners and backup tools, but it could be anything really. You'll need to work with your local sysadmin folks to pin it down.
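One low-effort way to narrow it down is to scan the data directory for files modified around the time of the failure. A hedged sketch, assuming a Linux node with GNU `find` and a hypothetical data path (substitute your real `path.data`):

```shell
# Hypothetical data path -- replace with your path.data (assumption).
ES_DATA="${ES_DATA:-/usr/share/elasticsearch/data}"

# List files modified in the last 24 hours, newest first, with
# timestamps. Cross-check anything here against the Elasticsearch
# logs: writes that don't match ES activity came from elsewhere.
find "$ES_DATA" -type f -mmin -1440 -printf '%T@ %p\n' | sort -rn | head -n 20
```

For an ongoing watch rather than an after-the-fact scan, tools like `auditd` (an `auditctl -w <path> -p wa` rule) or `inotifywait -m -r` on the data path can record which process performed each write, though both need to be installed and may require elevated privileges on the worker node.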