Is 3k search/sec high volumn? (High CPU usage)

essis · November 23, 2015, 6:20pm

Hi
I have Elasticsearch 8 nodes cluster of 6 to 8 CPUs with 16GB memory and SSDs in them.
I am doing search query on the index which has 2 million documents in 8 shard + 1 replica. Document size is moderate so index size is around 2GB.
Current traffic makes 3000 search query per second, and overall CPU usage is around 50%. And, if the query rate goes up over 4000/s then some nodes reach 100% and start dropping queues which causes application failure.
There's no indexing during the period.
Each query takes less than 50ms. I tried to optimize search query, but simple match all query also takes almost half of current usage, which is still too high.
One interesting thing is that if I optimize index with max_num_segments=1 then CPU usage goes down to a half. So I reduced segments_per_tier to 3 but it didn't help.
Is this normal capacity of elasticsearch? Or is there something wrong with my cluster.
I used both Oracle and OpenJDK, and result is similar on both.
This is hot thread dump.

gist.github.com

https://gist.github.com/janghwan/351b1ed8b0f315361890

elastic search hot thread

::: [es-cluster][iTnWLzezQi2QUhFI2bmxBw][es-cluster.xxx][inet[/10.0.18.8:9300]]{master=true}
   Hot threads at 2015-11-23T16:45:07.894Z, interval=500ms, busiestThreads=3, ignoreIdleThreads=true:
   
   33.5% (167.5ms out of 500ms) cpu usage by thread 'elasticsearch[es-cluster][search][T#9]'
     2/10 snapshots sharing following 3 elements
       java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
       java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
       java.lang.Thread.run(Thread.java:745)
     7/10 snapshots sharing following 10 elements
       sun.misc.Unsafe.park(Native Method)

This file has been truncated. show original

nik9000 · November 23, 2015, 10:27pm

Depending on the version this doesn't kick in properly after the index is created. I don't have a link.

What you describe is fairly normal for when the cluster is at the edge.

Your index is fairly small so I'm not surprised I don't see IO load.

The hot_threads isn't doing well. It doesn't do a good job when you have many short running jobs. Your best bet is to use jstack on a node several times in a row while its under load and analyze that.

You'll have to post example search queries for us to help with those. Depending on what you are doing match all might not be a great indicator. Like if fetching from _source is taking a while then match_all isn't going to change anything. Really the stack traces are you best bet for figuring out what is up.

Another thing to check is jstat gcutil <pid> 3s 100. You can use that to figure out how much time is being taken up by gc. Its harder to figure out what is taking up the memory, but with the queries you could probably puzzle it out.

Topic		Replies	Views
Elasticsearch high cpu usage while searching Elasticsearch	5	2527	July 5, 2017
Elasticsearch full CPU utillization Elasticsearch	2	842	July 6, 2017
ES suddenly begin to consume CPU Elasticsearch	3	1507	July 5, 2017
My Elasticsearch is running at very high CPU (constantly 99%) - Need help understanding hot_threads Elasticsearch	2	1064	July 6, 2017
ElasticSearch High CPU usage 160 queries per second doesn't make sense Elasticsearch	1	997	July 6, 2017

Is 3k search/sec high volumn? (High CPU usage)

Related topics