Recommended Setup and Settings for Very Large Data

jessie.velayo · May 12, 2021, 4:56pm

Hi,

Just want to ask your help what's the recommended cluster setup and settings for the data below:
Given:
To index: 32 billion records (around 7 TB)
Node specs: 8 CPUs x 64 RAM x 1.7 TB disk
Query complexity: 5 levels "has_child"
Queries include terms, range(number, date) and aggregations (date histogram, terms)

The ask:
- No. of nodes
- No. of primary shards
- No. of replicas
- other search optimization config
- etc

Thanks in advance!

Christian_Dahlqvist · May 12, 2021, 6:33pm

You will need to benchmark to find out. I doubt anyone will be able to tell you with any accuracy. Using such deeply nested documents seem sound potentially problematic or inefficient. How have you determined that this is the optimal data model for the use case?

system · June 9, 2021, 6:33pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
ES Recommended Configuration? Elasticsearch	3	930	July 6, 2017
Scaling Elasticsearch for 40GB of data Elasticsearch	5	1194	July 6, 2017
Index settings recommendations Elasticsearch	3	358	April 17, 2019
ElasticSearch Performance Elasticsearch	4	353	October 12, 2020
Help pls with elasticsearch cluster config Elasticsearch	1	441	February 13, 2017

Recommended Setup and Settings for Very Large Data

Related topics