UpdateRequest upsert with retryOnConflict using BulkProcessor failed

edumucelli · May 23, 2019, 1:45pm

Althought ES documentation and staff suggests using retry_on_conflict to mitigate version conflict, this feature is broken.

I am using High Level Client 6.6.1 and here is the way I am building the request:

IndexRequest indexRequest = new IndexRequest(MY_INDEX, MY_MAPPING, myId)
            .source(gson.toJson(entity), XContentType.JSON);
        
UpdateRequest updateRequest = new UpdateRequest(MY_INDEX, MY_MAPPING, myId)
            .retryOnConflict(3)
            .doc(indexRequest)
            .upsert(indexRequest);

Without .retryOnConflict(3) it will "work", but some queries will complain about version conflict. I create multiple of those requests and then process it with

client.bulk(request, RequestOptions.DEFAULT);

However when I add .retryOnConflict(3) as shown above it completely breaks with the following stacktrace:

ElasticsearchStatusException[Elasticsearch exception [type=illegal_argument_exception,     reason=Action/metadata line [1] contains an unknown parameter [retry_on_conflict]]]
        at org.elasticsearch.rest.BytesRestResponse.errorFromXContent(BytesRestResponse.java:177)
        at org.elasticsearch.client.RestHighLevelClient.parseEntity(RestHighLevelClient.java:2050)
        at org.elasticsearch.client.RestHighLevelClient.parseResponseException(RestHighLevelClient.java:2026)
        at org.elasticsearch.client.RestHighLevelClient.internalPerformRequest(RestHighLevelClient.java:1775)
        at org.elasticsearch.client.RestHighLevelClient.performRequest(RestHighLevelClient.java:1732)
        at org.elasticsearch.client.RestHighLevelClient.performRequestAndParseEntity(RestHighLevelClient.java:1694)
        at org.elasticsearch.client.RestHighLevelClient.bulk(RestHighLevelClient.java:470)

Someone else already posted kind of same question before but unfortunately no response has been provided.

Is there a way to overcome this problem? Should I manually retry the query?

polyfractal · May 23, 2019, 2:21pm

Hmm, this looks like a bug, possibly in the HLRC itself. Would you mind opening a ticket in the ES repo?

edumucelli · May 23, 2019, 2:22pm

Alright, I will do it. Thanks!

edumucelli · May 24, 2019, 2:29pm

Further investigating I am wondering if the problem is actually due to the backend and high level client version mismatch. I have done those requests using High Level Rest Client 6.6.1 but using ES backend 5.5.3. Would you think there could exist such incompatibility?

Thanks!

yelin · May 24, 2019, 2:56pm

I got similar issue before even though we're using both 6.x server & client, reported on the forum but didn't get any feedback. As I'm using org.elasticsearch.action.bulk.BulkProcessor, for which alternatively you can set the number of retries through BulkProcessor.Builder.setBackoffPolicy() (example at https://www.elastic.co/guide/en/elasticsearch/client/java-rest/master/java-rest-high-document-bulk.html#java-rest-high-document-bulk-processor). However, for you to be aware of, there is another bug on the BulkProcessor reported at BulkProcessor hangs instead of timeout, which doesn't seem resolved yet

edumucelli · May 24, 2019, 3:27pm

Thanks for the info! Unfortunately I am not using BulkProcessor just the High Level client's client.bulk method. Considering what you have said there is not another way around to set a retry policy. I will probably come up with some home-made thing using Failsafe.

polyfractal · May 28, 2019, 1:56pm

Possibly... but I would be surprised. The HLRC just emits JSON and uses the standard REST endpoints, and "retry_on_conflict" has been in the REST endpoint for ages. That's actually the main reason we are moving to the HLRC, so that strict versioning becomes a bit more decoupled in java clients (like other languages, python/ruby/php/etc). As long as the JSON is valid for the endpoint the backend and client don't necessarily have to be on the same version.

So I think it's something about the HLRC, either emitting a bad parameter, or unable to parse the response, which is causing the issue.

But.... that's without looking into the issue at all, just a guess

edumucelli · June 19, 2019, 10:15am

I was finally able to narrow it down. By updating backend to 6.7 the problem has gone. So in my case having a backend 5.5.3 and client 6.7.2 breaks, but when I have aligned both backend and client on 6.7.2 it worked. Some imcompatible behavior is there, but I was not able yet to precisely find it.

system · July 17, 2019, 10:17am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
UpdateRequest with retryOnConflict using BulkProcessor failed Elasticsearch	1	1330	July 30, 2018
Version Conflicts in Bulk Upsert Elasticsearch	3	4818	July 23, 2017
ESRejectedExecution followed by Version Conflict Error for Bulk Requests Elasticsearch	3	776	July 5, 2017
Reasons for upsert failure in bulk request Elasticsearch	3	1288	February 9, 2018
What is the name of a parameter "retry_on_conflict" in Bulk Update API Elasticsearch	5	4904	December 5, 2016

UpdateRequest upsert with retryOnConflict using BulkProcessor failed

Related topics