Hello, I have a requirement in which I need to aggregate over multiple indexes, each being independent from the others and each containing potentially millions of documents. Each index has its own Ids but contains a hash property that can be used to identify duplicated items across indexes. Documen…

Aggregation count unique values

Mark_Harwood (Mark Harwood) February 13, 2018, 8:54am 2

In later versions of elasticsearch we introduced the composite aggregation and terms agg partitioning to help break big requests like this one into smaller pieces.
Using 2.4 APIs you could look at using the scroll API, sorting docs by hash and stream them out to your client code to look for duplicates in the sequence of docs.

Topic		Replies	Views
Aggregations across multiple indices Elasticsearch	3	6159	July 6, 2017
Get number of unique results from multiple indices Elasticsearch	3	641	May 31, 2021
ElasticSearch use aggregation over another aggregation result Elasticsearch	2	472	April 14, 2020
Using aggregation in Elasticsearch to find duplicate data in the same index Elasticsearch painless	6	15805	January 22, 2021
Aggregation after dedup across indices Elasticsearch	1	475	June 12, 2018

Aggregation count unique values

Related topics