Elasticsearch cluster performance degradation

We had a performance degradation for 5-10 minutes intervals. We looked into our monitoring tool and found there was high CPU utilization on the elasticsearch cluster. During this time we also noticed high fetch requests and cache eviction. What do the high fetch requests and high cache eviction implies?

Also, the rest of the metrics looked fine. Our slow query settings are set to 1 minute. So, we don't have all the queries which were executed during that time.

Where are you seeing these high metrics?

APM - SignalFX

I'm not familiar with that or how it collects those.

If there is an increase in requests and cache evictions (assuming the cache it's measuring is the Elasticsearch one, and not OS level), then that might suggest a large-ish number of unique queries coming in in a burst.

Is there a corresponding increase in GC?

There was increase in ES cache evictions. But, GC looked fine. I have not noticed the query requests going up during this time.

So what's the high fetch requests metric?

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.