Impact of multiple concurrent scroll API on ElasticSearch

Scrolls are expensive to run concurrently.
For each scroll ID there is a unique point-in-time view of the current set of segments preserved for that scroll. This hangs on to files and related caches that would otherwise be removed by the constant segment rewriting that happens while indexing is active. This is why it is especially resource-intensive to do concurrently.

1 Like