We have several applications running on AWS machines whose data ends up being visualized from an Elasticsearch cluster. Currently we have a cluster of 3 nodes (3 EC2 r5.4xlarge instances), but we are having many performance problems with the visualization (graphs that take a long time to load, cluster crashes due to large queries, etc.).
So now we want to optimize the cluster and take better advantage of horizontal scalability by adding new nodes (probably we can also use less powerful machines than the current ones).
We have 4 apps ingesting data in Elasticsearch:
- App1: 25GB/day. (Currently we only keep the last week of data in Elasticsearch, ~175GB.)
- App2: 8GB/day. (Currently we only keep the last week of data in Elasticsearch, ~56GB.)
- App3: 400MB/day. (Currently we only keep the last 15 days of data in Elasticsearch, ~4GB.)
- App4: 50MB/day. (Currently we only keep the last 30 days of data in Elasticsearch, ~1.5GB.)
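For reference, the per-app figures above add up as follows (a minimal Python sketch using only the numbers stated; this counts primary data only, so replicas would multiply the stored size):

```python
# Totals from the per-app figures quoted above (primary data only).
daily_gb = {"App1": 25.0, "App2": 8.0, "App3": 0.4, "App4": 0.05}
stored_gb = {"App1": 175.0, "App2": 56.0, "App3": 4.0, "App4": 1.5}

total_daily = sum(daily_gb.values())    # ~33.45 GB ingested per day
total_stored = sum(stored_gb.values())  # ~236.5 GB kept in the cluster

print(f"Daily ingest: {total_daily:.2f} GB/day")
print(f"Data retained: {total_stored:.1f} GB")
```

So we are talking about roughly 33.45 GB/day of ingest and about 236.5 GB retained at any time, before counting replicas.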
To "design" the new cluster we have several questions:
- How many nodes should it have for this volume of data?
- How many master-eligible nodes (node.master), data nodes (node.data), etc.?
- How big should the shards be?
- Is it better to use one index per day, or not?
- What aspects should be evaluated when choosing the machines that will make up the cluster? For example: is it better to have 6 nodes with 16GB of RAM or 12 nodes with 8GB of RAM?
- Any other aspects that influence the performance of the visualization.
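To clarify what we mean by the node.master / node.data question, this is the kind of per-node configuration we are thinking about (an elasticsearch.yml fragment; the boolean node.master / node.data settings are the pre-7.x style, so this is just illustrative):

```yaml
# Dedicated master-eligible node (coordinates the cluster, holds no data)
node.master: true
node.data: false

# --- and on a separate machine, a dedicated data node ---
# node.master: false
# node.data: true
```

The question is whether splitting roles like this is worth it at our scale, or whether all nodes should keep both roles.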
I appreciate any kind of help, thank you very much in advance.