Hi,
Just want to ask your help what's the recommended cluster setup and settings for the data below:
Given:
To index: 32 billion records (around 7 TB)
Node specs: 8 CPUs x 64 RAM x 1.7 TB disk
Query complexity: 5 levels "has_child"
Queries include terms, range(number, date) and aggregations (date histogram, terms)
The ask:
- No. of nodes
- No. of primary shards
- No. of replicas
- other search optimization config
- etc
Thanks in advance!