Can I share a scroll id between multiple clients? If two clients ask the
next batch of the scroll at the same time, will they get different results
or is there a danger they will get duplicates? If there is no possibility
of duplicates, I could share a scroll id across machines and process all
records in my index in parallel...
I did a simple test using pmap to to make multiple scroll queries in
parallel. Out of 500 results, there were only 221 distinct values, so more
than half were duplicates
On Wednesday, January 28, 2015 at 12:56:21 PM UTC+1, David Smith wrote:
Can I share a scroll id between multiple clients? If two clients ask the
next batch of the scroll at the same time, will they get different results
or is there a danger they will get duplicates? If there is no possibility
of duplicates, I could share a scroll id across machines and process all
records in my index in parallel...
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.