Indexing is throttled for all indices while opening a closed index?


I could be entirely wrong here, but my observation is that while an index is being opened, no other index can be actively indexed.

We have a lot of indices in our cluster and keep the majority of them closed, opening an index only when it needs to be updated or when someone wants to search it.

We have noticed that if a lot of open calls happen simultaneously or in quick succession, our indexing jobs start throwing these exceptions:

ProcessClusterEventTimeoutException[failed to process cluster event (put-mapping [response]) within 30s]

But the nodes themselves show no signs of stress: search queries execute quickly throughout, server/JVM metrics are all normal, disk I/O is fine, etc. Yet the indexing jobs still hit 30s timeouts.

We found we can recreate this issue easily by closing a bunch of indices and reopening them in a bash for loop. With a 2-second sleep between each open call, the indexing jobs complete without error; with a 1-second delay or less, they hit 30s timeouts again.
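For reference, the reproduction loop looks roughly like this. It's a sketch, not our exact script: it assumes a cluster reachable at localhost:9200 and closed indices named test-1 through test-20, so adjust the host and index names to your setup.

```shell
# Repro sketch: reopen a batch of closed indices back to back.
# Assumes localhost:9200 and closed indices test-1 .. test-20.
for i in $(seq 1 20); do
  curl -s -XPOST "localhost:9200/test-$i/_open"
  # sleep 2   # with a 2s gap, concurrent indexing jobs finish fine
  sleep 1     # at 1s or less, put-mapping tasks start hitting 30s timeouts
done
```

While this runs, concurrent indexing jobs that trigger dynamic mapping updates are what start timing out.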

Is there anything that I can do configuration-wise to remove this indexing throttle while other indices are being opened on the cluster? Or am I missing something else entirely here?

Thanks for any help!

I did some more investigation and found something that might be related.

  • Cluster state tasks are executed in a single thread.
  • Mapping updates from indexing jobs push tasks to the pending_tasks queue with a priority of "HIGH"
  • Opening a closed index pushes tasks to the pending_tasks queue with a priority of "URGENT"

So my guess is that a flood of "URGENT" tasks in this queue delays the "HIGH" tasks long enough that they exceed the 30s timeout.
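The starvation effect the bullets above describe can be shown with a toy model. This is not the real Elasticsearch executor, just a minimal single-worker priority queue where a lower number means higher priority (URGENT before HIGH), with FIFO ordering within a priority level:

```python
import heapq
from itertools import count

# Toy model of a single-threaded cluster-state executor (not the real
# Elasticsearch code). Lower number = higher priority, so URGENT tasks
# always run before HIGH tasks regardless of arrival order.
URGENT, HIGH = 0, 1

def run(tasks, cost_per_task=1):
    """tasks: list of (arrival_time, priority, name).
    Returns {name: completion_time} under a single worker."""
    queue, seq, now, done = [], count(), 0, {}
    pending = sorted(tasks)  # ordered by arrival time
    i = 0
    while i < len(pending) or queue:
        # Admit everything that has arrived by `now`.
        while i < len(pending) and pending[i][0] <= now:
            _, prio, name = pending[i]
            heapq.heappush(queue, (prio, next(seq), name))
            i += 1
        if not queue:
            now = pending[i][0]  # jump to the next arrival
            continue
        _, _, name = heapq.heappop(queue)
        now += cost_per_task
        done[name] = now
    return done

# One put-mapping (HIGH) queued alongside a burst of 40 open-index
# tasks (URGENT): every URGENT task runs first, so the mapping update
# waits for the entire burst before it even starts.
tasks = [(0, HIGH, "put-mapping")] + [(0, URGENT, f"open-{n}") for n in range(40)]
print(run(tasks)["put-mapping"])  # → 41
```

With a task cost of 1 unit, the put-mapping task finishes at t=41, after all 40 opens. Scale the burst size or task cost up and the HIGH task's wait grows past any fixed timeout, which matches the 30s failures we see.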

Is this accurate? And is any of it configurable? Can/should we make the "put-mapping" tasks equal in priority to the "URGENT" tasks that opening an index generates?