Python library: search over large dataset sorted by timestamp


(Andrew Vine) #1

Hi all,

I would like to read over all the docs for an index in timestamp order.

Does anyone have a suggestion as to what would be the best way to do this
using the python client?

At the moment I'm using the searchhttp://elasticsearch-py.readthedocs.org/en/latest/api.html function
provided. From reading the api docs I should benefit by using the scroll
option however the docs on the python client api are unclear
as to how you use the scroll on repeated searches.

Regards
Andrew

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/b65c7e23-cd53-4a17-a7aa-597520ca473a%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(Honza Král) #2

Hi Andrews,

there is a helper function that will help you with the repeated
searches (do them for you) in the helpers module (0).

Unfortunately scroll/scan combination doesn't support sorting since it
would be very expensive to do. If you need the results sorted you have
to use normal search method and do the pagination yourself.

Hope this helps,

Honza

0 - http://elasticsearch-py.readthedocs.org/en/latest/helpers.html#elasticsearch.helpers.scan

On Wed, Feb 19, 2014 at 11:01 AM, Andrew Vine andrew.vine@gmail.com wrote:

Hi all,

I would like to read over all the docs for an index in timestamp order.

Does anyone have a suggestion as to what would be the best way to do this
using the python client?

At the moment I'm using the search function provided. From reading the api
docs I should benefit by using the scroll option however the docs on the
python client api are unclear
as to how you use the scroll on repeated searches.

Regards
Andrew

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/b65c7e23-cd53-4a17-a7aa-597520ca473a%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CABfdDirhGHvGiWRpHjPd5TS3438qpgr6OaSxD5pfK3g-XFgVtQ%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.


(Andrew Vine) #3

Thanks, I ended up doing repeated range queries by my timestamp field.

On Wed, Feb 19, 2014 at 1:39 PM, Honza Král honza.kral@elasticsearch.comwrote:

Hi Andrews,

there is a helper function that will help you with the repeated
searches (do them for you) in the helpers module (0).

Unfortunately scroll/scan combination doesn't support sorting since it
would be very expensive to do. If you need the results sorted you have
to use normal search method and do the pagination yourself.

Hope this helps,

Honza

0 -
http://elasticsearch-py.readthedocs.org/en/latest/helpers.html#elasticsearch.helpers.scan

On Wed, Feb 19, 2014 at 11:01 AM, Andrew Vine andrew.vine@gmail.com
wrote:

Hi all,

I would like to read over all the docs for an index in timestamp order.

Does anyone have a suggestion as to what would be the best way to do this
using the python client?

At the moment I'm using the search function provided. From reading the
api
docs I should benefit by using the scroll option however the docs on the
python client api are unclear
as to how you use the scroll on repeated searches.

Regards
Andrew

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit

https://groups.google.com/d/msgid/elasticsearch/b65c7e23-cd53-4a17-a7aa-597520ca473a%40googlegroups.com
.

For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/elasticsearch/ifVeqydS2Rc/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CABfdDirhGHvGiWRpHjPd5TS3438qpgr6OaSxD5pfK3g-XFgVtQ%40mail.gmail.com
.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAE6AM-gHyL5cN0mfFVVh2-A0Z6aiYiHr5RsQBysN9a2NY%2BFSTA%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.


(system) #4