Elastic cluster capacity planning

vivektsb · January 19, 2019, 6:49am

Hi,

We have a requirement to index around 8 TB of data per day including replicas (4 TB per day of primary data).

We are planning a 12-node cluster, each node with 8 cores, a 30 TB HDD, and 64 GB of RAM; 5 of the nodes will be master nodes with SSDs.
Do we need to use JBOD or RAID? Since we have replicas, we think JBOD is sufficient. Please correct us if we are wrong.

We have one Logstash instance with 16 cores, 64 GB of RAM, and a 5 TB HDD. Each index has 2 primary shards and 1 replica. Is that a correct configuration for moderate querying? Is one Logstash instance sufficient, or do we need to use Redis or Kafka for fault tolerance?

Please let us know whether this Elasticsearch and Logstash configuration is appropriate.

Regards,
Vivek

Christian_Dahlqvist · January 19, 2019, 9:44am

Indexing in Elasticsearch is very I/O intensive, so for best performance it is recommended to use SSDs. If you are indexing into spinning disks it is important that you spread out the indexing load across as many disks as possible, e.g. by striping them.

It is difficult to determine exactly how much data a node can index and store, so I would recommend you perform some benchmarks if you have the hardware available. I would also recommend the following resources around sizing and best practices:

https://www.elastic.co/webinars/optimizing-storage-efficiency-in-elasticsearch

https://www.elastic.co/elasticon/conf/2016/sf/quantitative-cluster-sizing

https://www.elastic.co/webinars/using-rally-to-get-your-elasticsearch-cluster-size-right
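
Before benchmarking, a rough back-of-the-envelope calculation can show how far the proposed hardware would stretch. The sketch below uses the figures from the original post (8 TB per day including replicas, 12 nodes of which 5 are masters, 30 TB of disk per node). It assumes that only the 7 non-master nodes hold data, that the 8 TB figure is the on-disk index size, and that disks are filled to no more than 85%, which matches Elasticsearch's default low disk watermark. None of these assumptions comes from the thread, so treat the result as a starting point for benchmarks, not as a sizing answer.

```python
# Figures taken from the original post
DAILY_INGEST_TB = 8.0     # indexed data per day, replicas included
TOTAL_NODES = 12
MASTER_NODES = 5
DISK_PER_NODE_TB = 30.0

# Assumption (not stated in the thread): keep disk usage under 85%,
# the default low disk watermark in Elasticsearch.
USABLE_FRACTION = 0.85


def retention_days(daily_tb, data_nodes, disk_tb, usable_fraction):
    """Days of data that fit on the data nodes at the given daily volume."""
    usable_tb = data_nodes * disk_tb * usable_fraction
    return usable_tb / daily_tb


# Assumption: the 5 master nodes are dedicated and hold no data.
data_nodes = TOTAL_NODES - MASTER_NODES
days = retention_days(DAILY_INGEST_TB, data_nodes, DISK_PER_NODE_TB, USABLE_FRACTION)
per_node_daily_tb = DAILY_INGEST_TB / data_nodes

print(f"Data nodes: {data_nodes}")
print(f"Approximate retention: {days:.1f} days")
print(f"Write volume per data node: {per_node_daily_tb:.2f} TB/day")
```

Under these assumptions the cluster holds roughly three weeks of data, and each data node has to absorb more than 1 TB of writes per day on spinning disks, which is why spreading the indexing load across many disks matters.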

vivektsb · January 20, 2019, 7:39am

Hi Christian,

Thanks for your quick response. Do we need to use Redis or Kafka to handle failover, given that we have only a single Logstash instance?

Regards,
Vivek

