What about the Scroll API makes it a bad choice for paging large result sets?

The fact that the Scroll API is not recommended for deep pagination in ES 8 is well-documented. I'm currently updating some legacy code that uses it to page through a result set much larger than 10k hits, and from what I've read so far, search_after + PIT seems to exist mainly to support real-time operations and lower request overhead.

My code is a long-running job, and I have the luxury of shutting off traffic to the cluster whenever it needs to run. With that in mind, is there any danger in continuing to use scroll this way until I have time to go back and replace it with search_after + PIT, or are there other issues that I am not considering?
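
For context, here is a rough sketch of the search_after + PIT loop I would eventually move to. It is not my actual job code. `search` is a hypothetical callable standing in for whatever client call sends a request body to `_search` and returns the parsed JSON response, and `build_page_request` and `iter_hits` are names I made up for illustration. It assumes a PIT has already been opened and that sorting on `_shard_doc` is acceptable for the job.

```python
from typing import Any, Callable, Dict, Iterator, List, Optional

SearchFn = Callable[[Dict[str, Any]], Dict[str, Any]]  # hypothetical client wrapper


def build_page_request(
    pit_id: str,
    size: int,
    search_after: Optional[List[Any]] = None,
    keep_alive: str = "1m",
) -> Dict[str, Any]:
    """Build one page's request body for a PIT-backed search."""
    body: Dict[str, Any] = {
        "size": size,
        # No index name is given in the request; the PIT already pins the target.
        "pit": {"id": pit_id, "keep_alive": keep_alive},
        # _shard_doc is a tiebreaker sort that is available with a PIT.
        "sort": [{"_shard_doc": "asc"}],
    }
    if search_after is not None:
        body["search_after"] = search_after
    return body


def iter_hits(search: SearchFn, pit_id: str, size: int = 1000) -> Iterator[Dict[str, Any]]:
    """Yield every hit, one page at a time, until an empty page comes back."""
    search_after: Optional[List[Any]] = None
    while True:
        resp = search(build_page_request(pit_id, size, search_after))
        hits = resp["hits"]["hits"]
        if not hits:
            return
        yield from hits
        # Resume from the sort values of the last hit on this page.
        search_after = hits[-1]["sort"]
        # The response may carry an updated PIT id; use it for the next page.
        pit_id = resp.get("pit_id", pit_id)
```

Opening the PIT before the loop and closing it afterwards are left out here, and the page size and `keep_alive` values are placeholders rather than recommendations.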

Bump. I would really like a straightforward answer on the pros and cons of this approach. I fully acknowledge it might be dumb, but I would like someone to explain why, beyond "You just shouldn't do it."
