Darn ok. Thank you.
If I'm retrieving large numbers of random largish (twitter river records)
documents, is there a particular pattern I should use for searching? That
is, does it make sense to send 20 sequential queries with size 10,000 and
random sorting, or a single query with a size of 200,000? What about up
into the millions? Obviously we're risking duplication of results when
sending multiple smaller queries, but this is OK for our purposes, or can
be dealt with at another stage of the process outside ES.
Thanks,
Josh
On Wednesday, February 19, 2014 12:41:58 PM UTC-8, Adrien Grand wrote:
Hi Josh,
In order to run efficiently, scan queries read records sequentially on
disk and keep a cursor that is used to maintain state between successive
pages. It would not be possible to get records in a random order as it
would not be possible to read sequentially anymore.On Wed, Feb 19, 2014 at 9:04 PM, Josh Harrison <hij...@gmail.com<javascript:>
wrote:
I need to be able to pull 100s of thousands to millions of random
documents from my indexes. Normally, to pull data this large I'd do a scan
query, but they don't support sorting, so the suggestions I've seen online
for randomizing your results don't work (such as those discussed here:
Random order & pagination Elasticsearch - Stack Overflow
).
Is there a way to introduce randomness into a basic scan query?--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearc...@googlegroups.com <javascript:>.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/b3971dda-2963-48ce-b7ed-f50e85b82a97%40googlegroups.com
.
For more options, visit https://groups.google.com/groups/opt_out.--
Adrien Grand
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/fabec423-97a6-4246-bf11-5d2899ca64b9%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.