Failed to retrieve shard stats from node

amitsa · May 9, 2022, 6:23am

I am running es-cluster on kubernetes on secure mode(Basic security) . I am getting below warning on master es node logs. why am i getting this error during benchmarking with esrally.

i have 1 master and a data node running on different host.

{"type": "server", "timestamp": "2022-05-09T06:11:20,824Z", "level": "WARN", "component": "o.e.c.InternalClusterInfoService", "cluster.name": "elasticsearch", "node.name": "es-master", "message": "failed to retrieve shard stats from node [9_L5bY1kQA6s04k23DXPfQ]: [es-data][10.244.172.223:9300][indices:monitor/stats[n]] request_id [65720] timed out after [15006ms]", "cluster.uuid": "wBHxBmZ2SbClTiJ5dykWjQ", "node.id": "Sq7N0x61T2W6o5ZDr8KkVg"  }

dliappis · May 9, 2022, 7:49am

{"type": "server", "timestamp": "2022-05-09T06:11:20,824Z", "level": "WARN", "component": "o.e.c.InternalClusterInfoService", "cluster.name": "elasticsearch", "node.name": "es-master", "message": "failed to retrieve shard stats from node [9_L5bY1kQA6s04k23DXPfQ]: [es-data][10.244.172.223:9300][indices:monitor/stats[n]] request_id [65720] timed out after [15006ms]", "cluster.uuid": "wBHxBmZ2SbClTiJ5dykWjQ", "node.id": "Sq7N0x61T2W6o5ZDr8KkVg"  }

As shown in the message, It appears that the request for get stats from the node es-data (on IP address: 10.244.172.223) was very slow processing a shard stats request and the request timed out after 15seconds.

Are you monitoring the resource usage of your cluster (and Rally ) while running the benchmark? This warning indicates that this node is experiencing high load. You should also check the load of your master node(s). Hitting the nodes too hard is one of the deadly sins of benchmarking, I strongly recommend watching the recording (link to slides here).

system · June 6, 2022, 7:50am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
failed to retrieve shard stats from node [zxt4RAOiRZy9Lol9IdIGfg]: [node_2][10.202.152.18:9300][indices:monitor/stats[n]] request_id [77247 683] timed out after [15016ms] Elasticsearch	0	116	April 9, 2024
Received response for a request that has timed out and "failed to retrieve stats for node" Elasticsearch	8	883	October 26, 2023
Failed to retrieve shards stuck Elastic node and performance Elasticsearch elastic-stack-monitoring	0	115	June 18, 2024
[node2] collector [cluster_stats] timed out when collecting data Elasticsearch	3	4787	November 20, 2019
Data node removed; master_failed Elasticsearch	7	1584	July 4, 2017

Failed to retrieve shard stats from node

Related topics