Hi, I was going through the following thread that talks about how you could get distinct documents with aggregation terms and top hits.
my question is, this will work with typically what size of documents ?
I have some million records and the above query yields me READTimeOut error from elasticsearch.
Is there some other way to address getting unique results for this huge dataset ?
just using aggregation terms to get the list of unique documents works fine for this dataset but I need to access the actual document values and using top_hits is giving readTimeout error.
Kindly advise.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.