Data node CPU constantly 90-98% on one node

Damelon · May 2, 2022, 5:46pm

So I have done several things to expand the cluster. I increased sharding, added warm nodes, and increased the CPU load on the hot nodes by doubling the instance type.

What I keep seeing is a single data node with around 90% cpu usage. there are 6 hot data nodes, but this specific one has almost twice the data on it than any of the others, and I don't know why. I haven't touched any of the default settings as far as re-balancing, and most of my smaller indices are 1 primary+1 replica shard. It's the large cluster sending it logs that is taking up all of the space. So it really comes down to a single multi-sharded index somehow taking up the space and not getting balanced properly.

Of course, I am not sure if one has nothing to do with the other... meaning the high CPU may not be because of the higher amount of space used, but I can't find anything else other than write and lucene taking up the hot threads.

Along with that, one specific node has a much higher document count, CPU load, IOPS, and segments. Maybe this specific node mostly has shards related to the large index and it is balancing on total shards for the cluster and not based on the 1 huge index.

Topic		Replies	Views
Very high CPU usage on one Elasticsearch data node Elasticsearch	18	33934	May 9, 2018
Mismatched CPU usages on data nodes Elasticsearch	6	596	January 28, 2019
High CPU usage on only 1 Data node Elasticsearch	7	1031	October 16, 2020
Only one of the data nodes has a significantly higher cpu usage than other data nodes Elasticsearch	1	203	March 27, 2023
Data node high CPU Elasticsearch	19	3750	February 26, 2018

Data node CPU constantly 90-98% on one node

Related topics