I have a index with around 50Million data points, where I have a ID for each document.
I need unique id and its count , as it would be more than 10000, I used scrolling but unexpecteded, the scrolling gives me the same scrollid in the iteration
Hi, Thanks for your respone. I understand that scroll id is to scroll through the documents, so i'm expecting that in scroll_id would change for the next iterations, but it doesn't and always gives the first 10 results. I'm confused in it.
Aggregations summarize the entire result set, not the current page of documents.
If you want to page through aggregation results you need to see my previous advice.
Well, it returns the result with TDID and Count only for the first scroll, I request based on scroll_id on next. It doesn't return the TDID with Count but just lists the same documents. I'm bit confused. Sorry if I misunderstood something. Thanks for your support, I means a lot
Don't use scroll.
Use a regular search using the composite agg and then another search using a composite agg with the after parameter returned in the previous result. Repeat as necessary
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.