I have the following use case, please suggest if I am heading in the right direction.
Daily Data volume :- 140 GB
Total ES Clusters :- 7, this means I will have 7 different clusters (ES1 to ES7), and each cluster will ingest 20 GB of daily data, which comes to 140 GB in total.
Query Type :- Mostly dashboards on non-analyzed data, like aggregated data, top sources, destinations, ports etc.
I am planning to use
3 nodes, master/data eligible, Linux 64-bit architecture
CPU:- 4
Core :- 2
RAM :- 64 GB
I will be dedicating 8 GB RAM to each cluster. With this I will be consuming 56 GB RAM, as I have 7 clusters on these machines.
I hope my use case is clear. Can anyone give me some suggestions if I need something else here?
I will have 7 different scenarios which I do not want to club into one cluster, so one dedicated cluster per scenario, plus 7 different Kibana views, one per cluster.
So each cluster gets 3 nodes, each with 8 GB RAM and a 4 GB heap, across the 3 hosts?
Absolutely correct, Christian.
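The RAM layout confirmed above can be sanity-checked with a few lines. This is just arithmetic on the numbers from the thread; the "heap at ~50% of node RAM" split is the standard Elasticsearch guidance, not something stated here.

```python
# Sanity-check the layout: 7 clusters, each with 3 nodes spread across
# 3 physical hosts, so one node of every cluster runs on each host.
HOST_RAM_GB = 64
CLUSTERS = 7
RAM_PER_NODE_GB = 8

nodes_per_host = CLUSTERS                  # one node per cluster per host
ram_used_gb = nodes_per_host * RAM_PER_NODE_GB
ram_left_gb = HOST_RAM_GB - ram_used_gb    # for OS and filesystem cache
heap_per_node_gb = RAM_PER_NODE_GB // 2    # ~50% of node RAM to the JVM heap

print(ram_used_gb)       # 56 GB committed to ES nodes per host
print(ram_left_gb)       # 8 GB left over per host
print(heap_per_node_gb)  # 4 GB heap per node
```

Note that only 8 GB per host is left outside the ES nodes, which is tight once the filesystem cache is taken into account.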
What type of storage do you have? How long will you be keeping data in the cluster?
So my total disk will be approx 25 TB, as I want to hold data for 3 months (90 days). That is 20 GB x 90 days = 1800 GB per cluster, and for 7 clusters it will be 1800 GB x 7 = 12.6 TB total. I also use one replica, so I need about 25 TB.
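Written out, the retention math is (the 1:1 raw-to-indexed size ratio is an assumption, and real deployments should also leave headroom beyond this):

```python
# Disk needed for 90-day retention across 7 clusters with one replica,
# assuming indexed size on disk equals raw data size (a rough assumption).
DAILY_GB = 20
RETENTION_DAYS = 90
CLUSTERS = 7
COPIES = 2  # primary + one replica

per_cluster_gb = DAILY_GB * RETENTION_DAYS              # 1800 GB
total_primary_tb = per_cluster_gb * CLUSTERS / 1000     # 12.6 TB
total_with_replica_tb = total_primary_tb * COPIES       # 25.2 TB

print(per_cluster_gb, total_primary_tb, total_with_replica_tb)
```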
After 90 days, with Curator I will close the index, or I can shift it to a warm node.
Since I want to hold data for 90 days, there will be trending in Kibana as well.
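Curator itself does this with an age filter plus a close action in its YAML config; as a minimal sketch, the same selection logic looks like the following. The daily `logstash-YYYY.MM.DD` naming pattern and the function name are assumptions for illustration, not from the thread.

```python
from datetime import date, timedelta

def indices_to_close(index_names, today, retention_days=90):
    """Return date-based indices older than the retention window.

    Assumes daily indices named like 'logstash-YYYY.MM.DD' (hypothetical
    pattern); Curator's own 'age' filter does this selection for real.
    """
    cutoff = today - timedelta(days=retention_days)
    to_close = []
    for name in index_names:
        try:
            _, stamp = name.rsplit("-", 1)
            idx_date = date(*map(int, stamp.split(".")))
        except ValueError:
            continue  # skip indices that don't match the date pattern
        if idx_date < cutoff:
            to_close.append(name)
    return to_close

# With today = 2018-06-01 and 90-day retention, the cutoff is 2018-03-03,
# so only the January index is selected.
print(indices_to_close(
    ["logstash-2018.01.15", "logstash-2018.05.20", "kibana"],
    date(2018, 6, 1),
))  # ['logstash-2018.01.15']
```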
If I am not mistaken, that is 1800GB, not 180GB. This means that the total data volume, if we assume indexed size on disk is the same as the raw data volume, is about 25TB with replica configured.
Ahh!! My bad, you are correct, it is 1800 GB (modified my answer too). Actually size is not a problem, I can increase it to 25 TB as well.
So, I will not index the message field, and 90% of the data will be not analyzed, but the index size will still increase as I will use one replica.
If that is the volume that needs to be indexed and queried, I suspect you have too little CPU, and you may also need more RAM. As you will be indexing into a lot of clusters and indices, you may also be limited by disk performance, especially if you are planning to use spinning disks.