High cpu load elasticsearch on logstash output

tim.van.rooijen · December 15, 2018, 8:23pm

Hi,

Beginner with the elk stack.
Have an api thats sends messages to rabbit mq.
Logstash reads the messages and adds them to elasticsearch. When the api is under peak load the cpu load on elasticsearch is really high.
Its a single node cluster but it has some good hardware.

My logstash output looks like this:
For me it's not completely clear if logstash is sending these messages in bulk to elasticsearch or not.
Anybody could give me some tips on how to improve performance?

elasticsearch {
index => "api_requests-%{+YYYY.MM.dd}"
document_id => "%{id}"
hosts => "elastic-logging-01:9200"
document_type => "doc"
template => "/opt/plugin/MappingTemplates/api_requests.json"
template_name => "api_requests-template"
manage_template => true
template_overwrite => true
}

rugenl · December 15, 2018, 11:47pm

I see you are sending an ID, that could be a cause, see Bad bulk performance with self-generated id

tim.van.rooijen · December 17, 2018, 7:53am

Thanks for your reply. Sounds like a good point. Moving this into production today. Hopefully this will solve our problem

tim.van.rooijen · December 17, 2018, 9:01am

Moved it into production. But sadly not improvements in cpu usage elasticsearch. Any other suggestions?

Christian_Dahlqvist · December 17, 2018, 9:05am

Indexing can be quite CPU intensive. What is the average size of your documents? What indexing throughput are you seeing? What is the specification of the hardware your cluster is running on? Is there anything in the logs around long or frequent GC?

tim.van.rooijen · December 17, 2018, 9:36am

Arround 50 messages a second.
Doc size is arround 1000 bytes

Elasticsearch has 4 cpu cores 3.2 g
and 7 gigs of memory

Find nothing strange in logs elasticsearch or logstash

Christian_Dahlqvist · December 17, 2018, 9:39am

You mention that the CPU load is really high. How high is that? Do you have monitoring installed so you can see what is going on?

tim.van.rooijen · December 17, 2018, 9:51am

Under peak load cpu goes to 100%.
It still cannot keep up with rabbit and logstash starts throwing exceptions.
Will install monitoring later today

rugenl · December 18, 2018, 1:58am

Does your template define all fields or are you doing dynamic mapping? The mapping that analyzes text fields and maps them as keyword can do more than you sometimes need.

tim.van.rooijen · December 18, 2018, 9:33am

The mapping template contains:
"dynamic": "false"
So this should be oke.

@rugenl
Analysed the logs from yesterday. And removing the id did give us some improvements. Cpu load is still high but its better than before, so thanks for that one!!

system · January 15, 2019, 9:36am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
High CPU Utilized by JAVA in single node ELK #31089 Elasticsearch	12	2637	July 19, 2018
Elasticsearch huge CPU utilisation Elasticsearch	7	883	November 12, 2021
Help for extra large load Elasticsearch	5	343	March 9, 2022
Elasticsearch Hight used CPU Elasticsearch	3	299	December 2, 2020
Logstash performance drop with high CPU usage Logstash	2	935	July 6, 2017

High cpu load elasticsearch on logstash output

Related topics