This depends a lot on the use case, but also on the type of storage you have and the load you expect the cluster to be under. To get an answer I therefore think you need to provide more details or run some tests.
Map index: around 5 million docs, roughly 30 concurrent users expected, light search load.
Huge index: perhaps more than 10 million docs, used for data processing, so search requests will keep hitting the nodes. We have run into search queue capacity (rejected execution) errors before.
That is still not nearly enough information. The size of the data set and the type of data and queries have an impact, as do the query and indexing load and the latency requirements.
Ideally you want a well-balanced node. There is little point in having lots of CPU if you have slow storage, as that limits how fast you can retrieve data to process. If, on the other hand, you have a small data set that can be cached, you may be limited by CPU even with slow storage, since disk I/O will be infrequent.
I would recommend running a test to see how much CPU your use case uses, or is able to use, and make sure that you have at least that amount to ensure CPU is not a bottleneck.
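A minimal sketch of such a test in Python, assuming you swap the `run_query` stub for a real search call (e.g. `es.search(...)` from the elasticsearch client — the stub here just simulates ~5 ms of work so the script is self-contained):

```python
# Hedged load-test sketch: fire concurrent searches, report throughput and
# latency percentiles. Watch CPU on the cluster nodes while this runs.
import time
import statistics
from concurrent.futures import ThreadPoolExecutor

def run_query():
    # Placeholder (an assumption): replace with a real search request,
    # e.g. es.search(index="my-index", query={...}). Simulates ~5 ms work.
    time.sleep(0.005)

def load_test(concurrency=30, requests_per_worker=20):
    latencies = []  # list.append is thread-safe in CPython

    def worker():
        for _ in range(requests_per_worker):
            t0 = time.perf_counter()
            run_query()
            latencies.append(time.perf_counter() - t0)

    t0 = time.perf_counter()
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        for _ in range(concurrency):
            pool.submit(worker)
    elapsed = time.perf_counter() - t0

    total = concurrency * requests_per_worker
    return {
        "throughput_qps": total / elapsed,
        "p50_ms": statistics.median(latencies) * 1000,
        "p95_ms": sorted(latencies)[int(len(latencies) * 0.95)] * 1000,
    }

if __name__ == "__main__":
    print(load_test())
```

While the test runs, you can also check whether search queue rejections (the error mentioned above) are occurring, e.g. via `GET _cat/thread_pool/search?v&h=node_name,active,queue,rejected`. Ramp `concurrency` up until throughput stops increasing; at that point either CPU, storage, or the search queue is the bottleneck.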