Fetch 200M documents with slice and scroll

harshil · February 13, 2018, 6:51pm

Hi, we have a requirement to fetch ~200M documents. To do this in parallel, I am using the slice and scroll APIs, fetching 10,000 documents per page. I know the scroll API runs a query, takes a snapshot of the matched documents, and keeps it alive until the TTL expires. I want to understand how slicing works with scrolling. Say my query matches 25M documents and I pass slice: {"id": 0, "max": 5}, which roughly splits the 25M into 5M per slice. Does it keep a live snapshot of all 25M documents, or only of that slice (here, slice id 0)?
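
For reference, here is a minimal sketch of the request bodies the setup described above would send, one per slice. The helper name `build_sliced_scroll_bodies` and the `match_all` query are placeholders, not anything from the original post. Each body would go to the search endpoint with a `scroll` TTL parameter, and each slice is then paged independently with its own scroll ID.

```python
def build_sliced_scroll_bodies(query, max_slices, page_size=10_000):
    """Build one initial scroll search body per slice.

    Hypothetical helper for illustration: it only constructs the JSON
    bodies and does not talk to a cluster.
    """
    return [
        {
            # Slice ids run from 0 to max_slices - 1.
            "slice": {"id": slice_id, "max": max_slices},
            "size": page_size,  # documents per page for this slice
            "query": query,
        }
        for slice_id in range(max_slices)
    ]


# Placeholder query; the real query would be whatever matches the 25M documents.
bodies = build_sliced_scroll_bodies({"match_all": {}}, max_slices=5)
```

Each body can be handed to a separate worker, so the five slices are scrolled in parallel rather than one after another.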

system · March 13, 2018, 6:51pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
