Indexing slows while rebalancing?

ecweaver · January 8, 2016, 2:29pm

We have a large-ish cluster used for log search, document size usually a few hundred bytes max.

We have had to re-spin nodes, causing recoveries and rebalances.

During rebalance, and sometimes at points in the recovery phase, indexing slows to the point where log lines get dropped (since the senders are configured to drop rather than queue).

Is this indexing slowdown expected? Is there any remedy for this other than set up Kafka (or Logstash 2.0)?

We are using AWS, 20 i2.8xlarge instances split across two availability zones. We have rebalancing throttled to 2 streams and recovery to 8. Would appreciate any insight into what's going on here.

ecweaver · January 9, 2016, 2:16am

Turns out rebalancing (as such) was not the issue, the issue was that a new day's index had all its shards on one or two nodes, and due to the imbalance in the data loading, the primary shards did not get distributed out in the normal way. One or two nodes were taking all the indexing.

I hand-rerouted those primary shards and indexing speed went back to normal.

It's a thing to watch out for...

Topic		Replies	Views
Share Rebalancing on large clusters (2.4) Elasticsearch	5	895	January 19, 2017
Elasticsearch not indexing while rebalancing Elasticsearch	5	954	July 5, 2017
Slow down/throttle indexing Elasticsearch	5	759	July 5, 2017
Slow indexing rate Elasticsearch	3	32	October 6, 2024
Pointers to Improve indexing performance? Elasticsearch	6	2941	February 28, 2017

Indexing slows while rebalancing?

Related topics