Hello,
How should I choose the size parameter when using a partitioned terms aggregation?
For example: suppose the aggregation returns 60K unique terms in total and I use num_partitions = 20. Is it correct to expect the total result to be spread across the 20 partitions, so that I can set size = 3000 (60K / 20) when making 20 queries with partition IDs 0 through 19?
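For concreteness, here is roughly what I have in mind, as a minimal sketch using the Python elasticsearch client (the index name "my-index" and the field "account_id" are placeholders for my real ones):

```python
from elasticsearch import Elasticsearch

es = Elasticsearch("http://localhost:9200")

NUM_PARTITIONS = 20
SIZE = 3000  # ~60K expected unique terms / 20 partitions

all_buckets = []
for partition in range(NUM_PARTITIONS):
    resp = es.search(
        index="my-index",
        body={
            "size": 0,  # only the aggregation results are needed, not hits
            "aggs": {
                "accounts": {
                    "terms": {
                        "field": "account_id",
                        # each request fetches one slice of the term space
                        "include": {
                            "partition": partition,
                            "num_partitions": NUM_PARTITIONS,
                        },
                        "size": SIZE,
                    }
                }
            },
        },
    )
    all_buckets.extend(resp["aggregations"]["accounts"]["buckets"])
```

My understanding is that partitioning assigns terms to partitions by hashing, so each partition should hold roughly, but not exactly, 60K / 20 terms; presumably setting size with some headroom above the exact quotient is safer.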
A related but different question:
I am using the current default (5 shards) for the index.
What is the recommended best practice for choosing the number of partitions relative to the number of shards in the index?
More partitions translate into more queries (i.e., more IOPS), each returning smaller results, versus running a smaller number of larger individual queries by using fewer partitions.
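For reference, the way I am currently estimating the total number of unique terms up front is with a cardinality aggregation, and then deriving num_partitions from a target per-partition size. A minimal sketch (again, "my-index", "account_id", and the target of 3000 terms per request are placeholder assumptions):

```python
from elasticsearch import Elasticsearch

es = Elasticsearch("http://localhost:9200")

resp = es.search(
    index="my-index",
    body={
        "size": 0,
        "aggs": {
            # cardinality gives an approximate count of unique terms
            "unique_accounts": {"cardinality": {"field": "account_id"}}
        },
    },
)
unique_terms = resp["aggregations"]["unique_accounts"]["value"]

TERMS_PER_PARTITION = 3000  # desired bucket count per request
num_partitions = -(-unique_terms // TERMS_PER_PARTITION)  # ceiling division
print(f"~{unique_terms} unique terms -> num_partitions = {num_partitions}")
```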