CPU utilization is too low while indexing

suvarna · April 25, 2019, 3:46am

Hi

Please find the below configuration.

Rally:
1.Running esrally by using eventdata track.
2.Number of shards are 10:
3)Bulk size =2000000

system:
1)have 2 nodes 28 cores per socket.
2)have 1 nvme device of 1.8T .
3)Have 112 hyperthreads.
4)RAM 386GB (29gb is heap)

the Elasticsearch running for 2 hours and cpu ulitization is around 40% and most of the cpus are idle and its consuming around 50% memory ..

Can you please provide your input why the most of the cpus are idle and why the ES is taking 50% memory ..

Christian_Dahlqvist · April 25, 2019, 5:10am

Why are you using such extreme bulk size? How many rally workers are you using? What is the specification of the host rally is running on (or are you running it on the same host as Elasticsearch)? What type of network do you have in place? What indexing throughput are you seeing?

suvarna · April 26, 2019, 5:41am

I am using clients as 40 threads.
yes rally and Elasticsearch running on same server.
Index throughput is
All,Min Throughput,index-append-1000-elasticlogs_q_write,967.98,docs/s
All,Median Throughput,index-append-1000-elasticlogs_q_write,180778.04,docs/s
All,Max Throughput,index-append-1000-elasticlogs_q_write,209867.07,docs/s

Can you please suggest what is the ideal number for bulk size?

Christian_Dahlqvist · April 26, 2019, 5:56am

You typically see indexing throughput increase the bulk size, at least up to a certain level. It then usually flattens out before potentially even starting to decrease. I set the default to 1000 as I saw little gain after that but you could set it a bit higher if you want to. It is generally recommended to try and keep the size of the bulk request below a few MB in size which probably means around 10000 or so events. Also try with an even higher number of clients until you see no further improvement in throughput.

suvarna · April 26, 2019, 6:06am

Actually we are using one node cluster .
with above configuration .. i can see that , 10 indices created with 10 shards each..
I was able to see first 1 index will fill with indexing data and then the 2nd index will get create and so on ..

I observed one thing all 10 indices will work in sequence but 10 shards in each index will work in parallel .. it meas totally 10 write threads will be active right at a time ?
How to make the indices work parallel ? or else it suppose to be in this pattern only.

Christian_Dahlqvist · April 26, 2019, 6:10am

I moved this to the Rally category as that might give the thread more accurate visibility.

Given the size of the box it might make sense to run more than one node on it.

system · May 24, 2019, 6:10am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
High CPU on Bulk requests Elasticsearch	7	2774	July 5, 2017
Elasticsearch index throughtput Elasticsearch	15	1585	April 17, 2019
Increasing performance with idle resources Elasticsearch	4	314	July 6, 2017
Bulk Indexing Rate Elasticsearch	4	552	April 18, 2018
Idle Elastisearch cluster during Bulk loading Elasticsearch	2	597	August 19, 2017

CPU utilization is too low while indexing

Related topics