Heap usage vs number of shards

vahissan · October 20, 2017, 4:30am

We have a 3 node cluster with 128GB RAM in each node. Heap allocated for each node is 31GB. Our expected monthly record count is 8 billion per month. So we have created daily indices to hold data up to 2 years. Now, after 9 months heap usage always stays around 80% and sometimes all nodes crash at the same time due to OutOfMemory exception. Can the heap usage reduced by reindexing the documents with monthly indices? If so, what kind of heap usage improvement can I expect?

Is there any other way I can reduce heap usage without reducing number of shards?

Christian_Dahlqvist · October 20, 2017, 6:10am

How many indices and shards do you currently have in the cluster? What is the average shard size in the cluster? Have you read this blog post around shards and sharding?

vahissan · October 20, 2017, 6:49am

Thanks, I will read the blog post. Currently we have 285 indices and 1638 shards. Average shard size is around 8GB.

Christian_Dahlqvist · October 20, 2017, 7:39am

Have you run force merge with max_num_segments set to 1 on older indices that are no longer written to?

vahissan · October 20, 2017, 7:44am

Thanks. I will check.

vahissan · October 20, 2017, 7:49am

There is also a change in requirement to upgrade the cluster to hold 15 year data. In that case, I believe it is important to reindex with monthly indices. Do you agree? I am thinking about 18 nodes with 30GB heap each and 6 primary shards + 1 replica shard. Can I have your advice on this?

Christian_Dahlqvist · October 20, 2017, 2:24pm

Are you going to index 15 years worth of data now or keep just keep the data you are indexing now that long?

vahissan · October 20, 2017, 7:31pm

We are going to index 10 years data now, and going to index for 5 more years later on.

Christian_Dahlqvist · October 20, 2017, 8:21pm

How large do you estimate a shard for a monthly index would be?

vahissan · October 20, 2017, 8:43pm

Around 300GB for a single shard. I'm planning to have 12 shards per month (6 primary + 1 replica).

Christian_Dahlqvist · October 21, 2017, 7:55am

I would recommend performing a benchmark to determine the max shard size as described in this Elastic{ON} talk. 300GB is quite large, and may result in slow queries and issues when recovering.

If I calculate correctly, you estimate you will generate about 3.6TB of indexed data per month (primaries and replicas). Over 15 years that is 648TB. To handle that amount of data I suspect you will need considerably more than 18 data nodes.

vahissan · October 22, 2017, 7:52am

I still have to watch the talk as I am currently traveling. Just a quick question before I watch - do you think it is better to have nodes with lesser RAM than 128GB (i.e. 64GB) when planning the cluster to hold 15 years of data?

Christian_Dahlqvist · October 22, 2017, 8:16am

When holding lots of data you often want to maximize heap. You could do that by having smaller hosts or simply running 2 Elasticsearch instances on each host. I would recommend spinning up a cluster with a few nodes and run a benchmark to determine exactly how much data you will be able to hold per node based on your expected indexing and query load as described in the video I liked to. This will allow you to estimate how many nodes you will need for that amount of data.

system · November 19, 2017, 8:16am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Why is my heap usage always high? Elasticsearch	10	4978	July 5, 2017
Suggest for Heap size chnage and number of shards require Elasticsearch	6	3746	August 30, 2017
How to optimise heap usage on elasticsearch nodes? Elasticsearch	9	622	November 10, 2020
Elasticsearch heap issues Elasticsearch	4	441	July 5, 2017
Heap (Shard Amount and Close Index) Elasticsearch	4	630	September 14, 2020

Heap usage vs number of shards

Related topics