I am trying to find the most suitable configuration for a production server on which to deploy Elasticsearch. After searching through a lot of documentation, I still could not figure out the exact hardware requirements for my use case.
Below is my data volume:
Total data: 20 billion
Total disk space: 50,000 GB (50 TB)
Initially, I have a Linux server with the following configuration:
Processor: 4 cores
RAM: 16 GB
Disk space: 500 GB
Server: Intel(R) Xeon(R) CPU E5-2683 v3 @ 2.00GHz
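To put these numbers in perspective, here is a rough back-of-the-envelope calculation. It is only a sketch under stated assumptions: the 85% usable-disk fraction, the 30 GB target shard size, and the replica count are placeholder values I picked for illustration, and the function name `estimate_cluster` is hypothetical. It also assumes the index takes about the same space on disk as the 50 TB figure, which may not hold in practice.

```python
import math

# Figures taken from the post
TOTAL_ITEMS = 20_000_000_000   # 20 billion
TOTAL_DATA_GB = 50_000         # 50 TB, decimal units
DISK_PER_SERVER_GB = 500       # disk on the current server

# Assumptions for illustration only (not from the post)
USABLE_DISK_PERCENT = 85       # keep some disk free on every node
TARGET_SHARD_SIZE_GB = 30      # placeholder target size per shard
REPLICAS = 0                   # number of replica copies per primary shard


def estimate_cluster(total_items, total_data_gb, disk_per_server_gb,
                     usable_disk_percent=USABLE_DISK_PERCENT,
                     target_shard_size_gb=TARGET_SHARD_SIZE_GB,
                     replicas=REPLICAS):
    """Return a rough sizing estimate based purely on disk capacity."""
    # Average size of one item in bytes (1 GB = 10**9 bytes here)
    avg_item_bytes = total_data_gb * 10**9 / total_items

    # Each replica stores a full extra copy of the data
    data_with_replicas_gb = total_data_gb * (1 + replicas)

    # Disk that can actually be filled on each server
    usable_gb = disk_per_server_gb * usable_disk_percent / 100

    servers = math.ceil(data_with_replicas_gb / usable_gb)
    shards = math.ceil(data_with_replicas_gb / target_shard_size_gb)

    return {
        "avg_item_bytes": avg_item_bytes,
        "servers": servers,
        "shards": shards,
        "shards_per_server": math.ceil(shards / servers),
    }


if __name__ == "__main__":
    result = estimate_cluster(TOTAL_ITEMS, TOTAL_DATA_GB, DISK_PER_SERVER_GB)
    for key, value in result.items():
        print(f"{key}: {value}")
```

With these placeholder values, the estimate comes to about 2,500 bytes per item, 118 servers at 500 GB each, and 1,667 shards. It considers disk capacity only and ignores CPU, RAM, and query load, so it is at best a lower bound on the number of servers.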
My questions:
What would be the optimal disk space and number of shards per node?
How many servers would I need if I scale the disk space vertically?
What is the maximum disk space I should give a server with the above configuration, given that this much data will be distributed across multiple servers?
How many shards can I create for this much data?
What should the shard size be?
How much data (in GB) can be stored in a single shard?
Any help is highly appreciated.
Thanks in advance.