Reindex API performance

Marcin_Kubica · June 16, 2016, 4:50am

Hi, I'm testing the Reindex API. I have a 400GB index with 6 shards to play with, and I'm copying it over with a changed template so that the new index has 20 shards.

I've got 16 nodes in the cluster, and I noticed that with the index set to 16 shards Elasticsearch was not distributing shards evenly across the whole cluster, whereas with 20 it somehow managed to distribute them much more evenly.

Problem:
I don't understand why I'm getting only around 1.5k docs/second with the Reindex API, whereas when I read from and write to the same cluster with Logstash I can get it running at 4k docs/second.

I'm not sure whether my cluster is the bottleneck. For instance, creating replicas of the 6-shard index mentioned above took only 1 hour to produce 400GB of new replicas, so I'd think my hardware is okay here.

Cheers
Marcin

nik9000 · June 16, 2016, 12:18pm

Reindex uses a relatively simplistic process: pull a batch from the scroll, then index it in bulk. It doesn't attempt to run those steps concurrently or parallelize the process.
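To make the sequential behaviour concrete, here is a conceptual sketch in Python. It is not Elasticsearch's actual implementation; `fetch_batch` and `bulk_index` are hypothetical callables standing in for "read the next scroll page" and "send one bulk request". The point is only that each read waits for the previous write to finish, so the two never overlap.

```python
def sequential_reindex(fetch_batch, bulk_index):
    """Copy documents one batch at a time, never overlapping read and write.

    fetch_batch: hypothetical callable returning the next list of docs,
                 or an empty list when the source is exhausted.
    bulk_index:  hypothetical callable that writes one list of docs.
    Returns the total number of documents copied.
    """
    total = 0
    while True:
        docs = fetch_batch()   # read one batch (stands in for a scroll page)
        if not docs:
            return total       # source exhausted
        bulk_index(docs)       # write it before the next read starts
        total += len(docs)


if __name__ == "__main__":
    # In-memory stand-ins for a source and destination index.
    pages = iter([[1, 2, 3], [4, 5], []])
    written = []
    copied = sequential_reindex(lambda: next(pages), written.extend)
    print(copied, written)
```

With a loop like this, throughput is bounded by the time to read a batch plus the time to write it, which is why a larger batch size or several independent copies of the loop can help.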

You can usually speed up reindex in a few ways:

Use a bigger batch size by setting "size" inside "source". Like this:

POST /_reindex
{
  "source": {
    "index": "foo",
    "size": 5000
  },
  "dest": {
    "index": "bar"
  }
}

Issue multiple reindexes in parallel. Use a query in the "source" that slices the data somehow and just run N of them.
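One way to slice the data is by a range on a numeric or date field. The following is a minimal sketch, assuming the documents carry a field such as "@timestamp" holding epoch milliseconds; the field name, the boundaries, and the helper `build_sliced_reindex_bodies` are all illustrative assumptions, not something taken from this thread. It only builds the request bodies; each one would then be sent to POST /_reindex from its own client or thread.

```python
import json


def build_sliced_reindex_bodies(source_index, dest_index, field,
                                start, end, n_slices, batch_size=5000):
    """Build one _reindex request body per contiguous slice of [start, end).

    `field` is assumed to be a numeric or date field present on every
    document; `start` and `end` are integer bounds (e.g. epoch millis).
    """
    if n_slices < 1 or end <= start:
        raise ValueError("need n_slices >= 1 and end > start")
    bodies = []
    for i in range(n_slices):
        # Integer boundaries chosen so slices are contiguous and non-overlapping.
        lo = start + (end - start) * i // n_slices
        hi = start + (end - start) * (i + 1) // n_slices
        bodies.append({
            "source": {
                "index": source_index,
                "size": batch_size,  # scroll batch size, as in the example above
                "query": {"range": {field: {"gte": lo, "lt": hi}}},
            },
            "dest": {"index": dest_index},
        })
    return bodies


if __name__ == "__main__":
    # Hypothetical bounds covering one month, split four ways.
    for body in build_sliced_reindex_bodies(
            "foo", "bar", "@timestamp", 1464739200000, 1467331200000, 4):
        print(json.dumps(body))
```

Slices of equal width only hold equal numbers of documents if the data is spread evenly over the field, so some of the parallel reindexes may finish well before others.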

Reindex in 2.3 has a default batch size that is too small, so that is where I'd start.

I'll be investigating adding concurrent scroll processing to reindex for 5.0, which ought to speed it up quite a bit.

Marcin_Kubica · June 17, 2016, 1:59pm

Cool! Waiting for es5 then!
