I agree with Aaron's recommendations: you have far too many shards and need to reduce the count dramatically. Please read this blog post on shards and sharding practices and then change how you create and manage indices. I would recommend switching to daily indices in order to increase the average shard size, as per the recommendations in the blog post. We are also running a webinar this Thursday on this topic, which may be useful.
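As a sketch of what that could look like (the `logs-*` pattern and shard counts here are assumptions, adjust them to your own naming and data volume), an index template that gives each new daily index a single primary shard might be:

```
PUT _template/logs_daily
{
  "index_patterns": ["logs-*"],
  "settings": {
    "number_of_shards": 1,
    "number_of_replicas": 1
  }
}
```

With daily indices named e.g. `logs-2018.10.15`, each day's data then lands in one shard, which pushes the average shard size up compared to many small shards per day.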
Elasticsearch keeps a lot of data off-heap, which means the size of the file system cache is important for good performance. If you run multiple nodes on a host with just 64GB of RAM, you will need to adjust the amount of heap you give each node. As Elasticsearch by default assumes all nodes are equal, you may want to run two nodes, each with a 16GB heap, on all hosts. Another option would be to rearrange your disks and run a single node, hot or warm, per host with a 30.5GB heap.
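For the two-nodes-per-host option, the heap is set in each node's `jvm.options` file (a sketch; the exact sizes depend on what else runs on the host):

```
# jvm.options for each of the two nodes on a 64GB host.
# 2 x 16GB heap leaves roughly 32GB for the file system cache.
-Xms16g
-Xmx16g
```

Setting `-Xms` and `-Xmx` to the same value avoids heap resizing pauses; whichever layout you pick, aim to leave around half the host's RAM to the file system cache.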
Forcemerging down to a single segment can save a lot of heap and is definitely recommended. It is, however, as Aaron describes, quite an expensive and slow process that uses a lot of disk I/O, so only run it on indices that are no longer being written to.
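With daily indices this is a single API call per index once the day has rolled over (the index name here is just an example):

```
POST /logs-2018.10.14/_forcemerge?max_num_segments=1
```

Given the I/O cost, it is worth scheduling this during off-peak hours rather than running it against many indices at once.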