Latency and CPU spike on all nodes simultaneously

phill-tornroth · January 20, 2017, 9:27pm

Elasticsearch question. I'm not sure what background data is most helpful so I'll start with basics...

ES version: 1.3.1
Lucene: 4.9

We're seeing a situation periodically where our ES latency spikes dramatically, and the most interesting attribute of this pathology is that every node in the cluster spikes in CPU and load average at the same time.

I'm wondering what might cause that behavior? Here are some observations we've made to rule things out:

Memory and I/O look good
No significant change/spike in requests (though we do seem to see this happen mid-day when our load is higher than nights/weekends)
Segment Merge log doesn't appear to show an increase in merges, or merge latency
I can't imagine garbage collection is to blame since we see simultaneous behavior across nodes

Anyone have suggestions as to what to look at next? Any ideas about what sorts of problems tend to manifest in this behavior?

Thanks in advance.

system · February 17, 2017, 9:27pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
High CPU usage every day at the same time Elasticsearch	1	689	July 5, 2017
What's with these crazy CPU spikes? Elasticsearch	2	1758	October 23, 2018
ElasticSearch 1.7: query time spike, and the application processor crash after that Elasticsearch	10	755	December 29, 2017
Newbie performance troubleshooting, high load spikes on ES nodes Elasticsearch	5	5058	June 11, 2018
Distribution of work Elasticsearch	10	370	July 6, 2017

Latency and CPU spike on all nodes simultaneously

Related topics