I need to compare ES with source data and I would like to do it in one pass
if possible which means I need to pull my data from ES in the doc key
order.
Is it possible to do it with SCAN API or scan order is undefined (based on
how it is stored in lucene segments?)
The Scan API indeed works based on the order of documents in the lucene
segments. This is the most efficient way to get bulks of data which is the
intended use case of this API.
On Monday, March 3, 2014 3:35:08 PM UTC+1, AlexR wrote:
Hello,
I need to compare ES with source data and I would like to do it in one
pass if possible which means I need to pull my data from ES in the doc key
order.
Is it possible to do it with SCAN API or scan order is undefined (based
on how it is stored in lucene segments?)
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.