Reindexing with sliced scrolls vs. using custom ranges

hariso · December 14, 2016, 10:22am

Hi guys!

I'm reindexing a lot of data. To improve performance and make error handling more "elastic", I've divided every source index into parts (ranges of a field's values). I then parallelize the process by reindexing several ranges at a time. This also has the benefit that the ranges are isolated, so if a range fails, I can investigate it and restart just that range.
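For readers who want to picture the range-based approach, here is a minimal sketch. It assumes the partitioning field is numeric and that ranges are half-open; `split_range`, `range_reindex_body`, `run_ranges` and the `reindex_one` callable are hypothetical names, not anything from the post. The request body follows the general shape of a `_reindex` request with a `range` query in `source`, and actually sending it to the cluster is left to whatever client you use.

```python
from concurrent.futures import ThreadPoolExecutor


def split_range(lo, hi, parts):
    """Split the half-open interval [lo, hi) into `parts` contiguous ranges."""
    step, extra = divmod(hi - lo, parts)
    bounds, start = [], lo
    for i in range(parts):
        # The first `extra` ranges absorb the remainder, one value each.
        end = start + step + (1 if i < extra else 0)
        bounds.append((start, end))
        start = end
    return bounds


def range_reindex_body(source, dest, field, lo, hi):
    """Build a _reindex request body restricted to lo <= field < hi."""
    return {
        "source": {
            "index": source,
            "query": {"range": {field: {"gte": lo, "lt": hi}}},
        },
        "dest": {"index": dest},
    }


def run_ranges(ranges, reindex_one, workers=4):
    """Run `reindex_one(range)` in parallel; return the ranges that raised."""
    failed = []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = {pool.submit(reindex_one, r): r for r in ranges}
        for future, r in futures.items():
            try:
                future.result()
            except Exception:
                # One failing range does not stop the others.
                failed.append(r)
    return failed
```

The list returned by `run_ranges` is what makes the isolation useful: only those ranges need to be investigated and passed to `run_ranges` again.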

In Elasticsearch 5.1 we have reindexing with sliced scrolls. I have two questions:

How does slicing compare with the approach above (manually specifying ranges) in terms of performance?
How does slicing compare with the approach above (manually specifying ranges) in terms of error handling? That is, if there is a problem with a single slice, can I simply restart it while the rest of the documents continue to be reindexed?
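For comparison, a sliced reindex can also be driven manually, with one `_reindex` request per slice. The sketch below only builds the request bodies; it assumes the manual-slicing form, where `source` carries a `slice` object with `id` and `max`. The helper name `slice_reindex_bodies` is hypothetical.

```python
def slice_reindex_bodies(source, dest, max_slices):
    """Build one _reindex request body per slice (manual slicing)."""
    return [
        {
            # Each body selects slice `i` out of `max_slices` in total.
            "source": {"index": source, "slice": {"id": i, "max": max_slices}},
            "dest": {"index": dest},
        }
        for i in range(max_slices)
    ]
```

Since each body is a separate request, resubmitting a single slice id is at least mechanically possible. Whether that gives the same isolation as custom ranges is exactly what the second question asks, and this sketch does not settle it.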

Thanks in advance!

Haris

