Shards and replicas allocation in Elasticsearch

iamlearner123 · November 15, 2018, 2:51am

Hi,
I am new to Elasticsearch and still learning it. Could you please explain how to calculate the number of shards and replicas required for an index? I know that a number of factors should be considered, but I am confused after reading articles on the internet: some say to use 40% of data node capacity, while others use the JVM heap size to calculate the number of shards. For example, I have a 9-node cluster with 10GB of storage per node, I want to index 50GB of data, and the JVM heap size is 32GB. How many shards and replicas do I need? What is the best practice? Any detailed explanation would be really helpful.

Thank you

warkolm · November 15, 2018, 4:02am

10GB, are you sure that is correct?

iamlearner123 · November 15, 2018, 6:02am

Yes, each node has 10GB of storage.

warkolm · November 15, 2018, 7:28am

That's a huge heap size for a small amount of data. It's not really efficient.

iamlearner123 · November 15, 2018, 3:28pm

Thank you for the reply. So, do we assign shards based on the heap size of the node? What I am trying to understand is: on what factors does the assignment of shards and replicas depend?

warkolm · November 15, 2018, 11:50pm

Shard count per node, and disk space use.
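
As a rough illustration of those two factors with the numbers from this thread (9 nodes, 10GB of disk each, 50GB of data), here is a minimal Python sketch. The function names are hypothetical, the 5 primary shards are just an example value, and the 0.85 usable fraction is an assumption meant to leave some headroom on each disk. It ignores indexing overhead and only does the arithmetic.

```python
import math

def required_disk_gb(primary_data_gb, replicas):
    # Each replica is a full copy of the primary data.
    return primary_data_gb * (1 + replicas)

def fits_on_disk(primary_data_gb, replicas, nodes, disk_per_node_gb,
                 usable_fraction=0.85):
    # usable_fraction is an assumed headroom factor, not a measured value.
    usable_gb = nodes * disk_per_node_gb * usable_fraction
    return required_disk_gb(primary_data_gb, replicas) <= usable_gb

def max_shards_per_node(primaries, replicas, nodes):
    # Total shard copies spread as evenly as possible across the nodes.
    total_shards = primaries * (1 + replicas)
    return math.ceil(total_shards / nodes)

# Numbers from this thread: 50GB of data on 9 nodes with 10GB of disk each.
print(required_disk_gb(50, 1))       # 100 (GB needed with 1 replica)
print(fits_on_disk(50, 1, 9, 10))    # False: 100GB exceeds the 90GB raw total
print(fits_on_disk(50, 0, 9, 10))    # True, but with no redundancy
print(max_shards_per_node(5, 1, 9))  # 2 (5 primaries is an example value)
```

With one replica, 50GB of primary data needs about 100GB, which is more than the 90GB of raw disk in the whole cluster, so disk space is the limiting factor here rather than heap.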

iamlearner123 · November 19, 2018, 11:37pm

But in my case I have a data set of 50GB. Could you please tell me how many shards to allocate per node, and why?

Thank you

system · December 17, 2018, 11:37pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
