Best practices for bulk indexing/retry handling?

We're using BulkProcessor for indexing and stumbled upon a problem when doing rolling restarts of our Elasticsearch cluster consisting of three nodes.

It appears that once the master node goes down, we get
`ClusterBlockException[blocked by: [SERVICE_UNAVAILABLE/2/no master];]`
and obviously fail to index bulk requests.

Two questions here:

  1. Does this need to be handled on the client side or is there any way to avoid this via cluster configuration?
  2. If we need to handle this in code, are there any best practices or suggested ways of handling failed bulks? We want to retry the failed data, but that would probably involve some non-trivial implementation: re-queueing the data, delaying or applying exponential backoff, merging retried data with newly arriving index requests, etc.

Thanks

  1. You need to handle this yourself. The cluster is not aware that there are clients that should be suspended.

  2. You also have to handle failed bulks yourself. It should be fairly easy to add suspend/resume logic on the client side (a rough sketch follows below). If not client side, a server-side plugin that does book-keeping of client IDs would be an option: suspend = tell all clients to write to a local file, resume = tell all clients they can replay from that file. That is a sort of client translog, neglecting the edge case of full disks; in that case you'd be better off halting all your clients before the rolling restart.
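Here is a minimal sketch of the client-side part against the 2.x transport-client API. The class name, the in-memory queue, and the `replayInto` helper are just illustrations and not part of the BulkProcessor API; a real implementation would persist the queue to a local file so it survives client crashes.

```java
import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;

import org.elasticsearch.action.bulk.BulkItemResponse;
import org.elasticsearch.action.bulk.BulkProcessor;
import org.elasticsearch.action.bulk.BulkRequest;
import org.elasticsearch.action.bulk.BulkResponse;
import org.elasticsearch.action.index.IndexRequest;

/**
 * Buffers failed index requests so they can be replayed once the
 * cluster is reachable again (illustrative, not production code).
 */
public class RequeueingListener implements BulkProcessor.Listener {

    // Failed requests waiting to be replayed. A real "client translog"
    // would write these to a local file instead of keeping them in memory.
    private final Queue<IndexRequest> pending = new ConcurrentLinkedQueue<>();

    @Override
    public void beforeBulk(long executionId, BulkRequest request) {
        // nothing to do
    }

    @Override
    public void afterBulk(long executionId, BulkRequest request, BulkResponse response) {
        if (!response.hasFailures()) {
            return;
        }
        // The bulk went through but some items failed: re-queue only those.
        for (BulkItemResponse item : response.getItems()) {
            if (item.isFailed()) {
                Object original = request.requests().get(item.getItemId());
                if (original instanceof IndexRequest) {
                    pending.add((IndexRequest) original);
                }
            }
        }
    }

    @Override
    public void afterBulk(long executionId, BulkRequest request, Throwable failure) {
        // The whole bulk failed, e.g. ClusterBlockException while there is no
        // master: buffer everything for a later replay ("suspend").
        for (Object original : request.requests()) {
            if (original instanceof IndexRequest) {
                pending.add((IndexRequest) original);
            }
        }
    }

    /** "Resume": feed the buffered requests back into a BulkProcessor. */
    public void replayInto(BulkProcessor processor) {
        IndexRequest next;
        while ((next = pending.poll()) != null) {
            processor.add(next);
        }
    }
}
```

You would call `replayInto(...)` from whatever code decides the cluster is healthy again, e.g. after a successful cluster health check.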

The BulkProcessor in ES 2.2+ has new backoff handling for rejection exceptions, enabled by default, but it assumes a cluster that can still react properly, so it's of little use in situations where the cluster is degrading.
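For reference, that backoff policy can be tuned when building the processor. This is only a sketch: the `buildProcessor` wrapper and the concrete values are arbitrary, `client` and `listener` come from your own setup, and the retry only kicks in for rejected executions, not for cluster blocks like the one above.

```java
import org.elasticsearch.action.bulk.BackoffPolicy;
import org.elasticsearch.action.bulk.BulkProcessor;
import org.elasticsearch.client.Client;
import org.elasticsearch.common.unit.TimeValue;

class BulkProcessorFactory {

    // Wrapper and values are arbitrary; client and listener are assumed to exist.
    static BulkProcessor buildProcessor(Client client, BulkProcessor.Listener listener) {
        return BulkProcessor.builder(client, listener)
                .setBulkActions(1000)
                .setConcurrentRequests(1)
                // Retry bulks rejected by a busy node with exponential backoff:
                // 100ms, 200ms, 400ms, ... for at most 5 retries.
                .setBackoffPolicy(BackoffPolicy.exponentialBackoff(TimeValue.timeValueMillis(100), 5))
                .build();
    }
}
```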
