I’m looking for the fastest and safest way to retrieve all unique values of a field for a given time range, in batches.
My requirements are:

- Retrieve 100% of unique terms (no missing buckets)
- Support batching / pagination
- Be as fast as possible
- Avoid excessive heap usage
I’ve evaluated two approaches:
1. `terms` aggregation with `include.partition`

Example:

```json
"terms": {
  "field": "someField.keyword",
  "size": 10000,
  "include": {
    "partition": 0,
    "num_partitions": 20
  }
}
```

Iterating `partition` from 0 to `num_partitions - 1`.
I understand that:

- Each unique term is deterministically hashed into a partition
- Distribution may appear uneven for small cardinalities
- Larger cardinalities should distribute more evenly
- The same term always maps to the same partition across shards
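To make those properties concrete, here is a minimal Python sketch of hash-based partitioning. Elasticsearch uses its own internal hash, so the actual partition assignments will differ, but determinism and the evening-out at high cardinality behave the same way:

```python
import hashlib

def partition_for(term: str, num_partitions: int) -> int:
    """Illustrative stable hash -> partition mapping (not the hash
    Elasticsearch actually uses)."""
    digest = hashlib.md5(term.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions

# Determinism: the same term maps to the same partition every time,
# no matter which shard or node computes it.
assert partition_for("user-42", 20) == partition_for("user-42", 20)

# Distribution: with high cardinality, partitions fill roughly evenly.
counts = [0] * 20
for i in range(10_000):
    counts[partition_for(f"term-{i}", 20)] += 1
print(min(counts), max(counts))  # both close to 10_000 / 20 = 500
```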
However, this approach has several drawbacks:

- `num_partitions` must be chosen upfront
- A hard `size` limit per partition must be managed (risk of missing terms)
- Manual orchestration of partitions
- No cursor/resume mechanism
- Potentially higher heap usage due to in-memory bucket building
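The orchestration burden looks roughly like the sketch below. `run_terms_partition` is a stub standing in for a real terms-aggregation search call; the key point is that when a partition returns exactly `size` buckets, you cannot tell whether terms were truncated:

```python
import hashlib

def stable_partition(term: str, num_partitions: int) -> int:
    # Illustrative stable hash (Elasticsearch uses its own internally).
    digest = hashlib.md5(term.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions

def run_terms_partition(partition: int, num_partitions: int, size: int):
    # Stub for the actual search call: return this partition's buckets.
    corpus = [f"term-{i}" for i in range(95)]
    mine = sorted(t for t in corpus
                  if stable_partition(t, num_partitions) == partition)
    return mine[:size]  # a real terms agg also truncates at `size`

NUM_PARTITIONS, SIZE = 10, 50
seen = set()
for p in range(NUM_PARTITIONS):
    buckets = run_terms_partition(p, NUM_PARTITIONS, SIZE)
    if len(buckets) == SIZE:
        # Possible truncation: this partition may hold more than
        # `size` terms, so completeness is no longer guaranteed.
        raise RuntimeError(f"partition {p} may be truncated")
    seen.update(buckets)

assert len(seen) == 95  # complete only because no partition overflowed
```

There is no cursor to resume from: if the job dies at partition 7, you must either re-run partitions 0..7 or persist progress yourself.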
2. Composite aggregation with `after_key`

This seems to offer:

- Cursor-based pagination
- Unlimited buckets
- Natural batching
- Lower memory pressure
- Easy resumability
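For reference, the `after_key` loop can be sketched as below. The request body uses the field names from the example above plus an assumed `@timestamp` range filter; `search` is a stub simulating paged composite responses, and in practice would be a real client call whose returned `after_key` is fed back verbatim:

```python
def search(body):
    # Stub simulating a composite-aggregation response page.
    corpus = sorted(f"value-{i:03d}" for i in range(25))
    composite = body["aggs"]["uniques"]["composite"]
    size, after = composite["size"], composite.get("after")
    start = corpus.index(after["field"]) + 1 if after else 0
    page = corpus[start:start + size]
    agg = {"buckets": [{"key": {"field": v}} for v in page]}
    if len(page) == size:
        agg["after_key"] = {"key": page[-1], "field": page[-1]}["key"] and {"field": page[-1]}
    return {"aggregations": {"uniques": agg}}

body = {
    "size": 0,
    "query": {"range": {"@timestamp": {"gte": "now-1d"}}},
    "aggs": {"uniques": {"composite": {
        "size": 10,
        "sources": [{"field": {"terms": {"field": "someField.keyword"}}}],
    }}},
}

seen = []
while True:
    agg = search(body)["aggregations"]["uniques"]
    seen.extend(b["key"]["field"] for b in agg["buckets"])
    after = agg.get("after_key")
    if after is None:
        break  # no cursor returned: all pages consumed
    # Resume point: persisting `after` makes the extraction restartable.
    body["aggs"]["uniques"]["composite"]["after"] = after

assert len(seen) == 25  # every unique value, in order, no truncation risk
```

Because the cursor is just the last composite key, a crashed job can resume from the persisted `after` value instead of starting over.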
Question

For the general use case of retrieving all unique field values over a time range, at scale, with batching and maximum performance:
Is composite aggregation the recommended production approach over terms + partition?
Are there scenarios where terms + partition is preferable?
My primary goal is:
Fast, complete, resumable extraction of unique terms.
Thanks in advance for any guidance.