Way to re-index failed documents using BulkProcessor

kpkvarma · November 4, 2015, 12:16pm

We are developing a application, where we ingest and bulk index time-series data using native JAVA client.
Here each and every document/event is very important to us, we can't miss single event.

As we are using BulkProcessor to index the data, in case of any failures to index data with malformed JSON or bulk queue unavailability or any other reason, is there any way to track the failed documents/events from BulkResponse?

I tried iterating through BulkResponse @ afterBulk() method, but couldn't find actual document/event.

Our plan is to index all such failed documents/events to separate INDEX (like unprocessed), which don't consider any mappings.

Please help me to identify the failed documents

jpountz · November 4, 2015, 3:39pm

In the afterBulk method, you should be able to check for BulkResponse.hasFailures(). In case it returns true, you could iterate over response items and index failed ones into your unprocessed index.

kpkvarma · November 4, 2015, 4:34pm

I tried your proposal already, but couldn't find a way to get actual document(in this case failed document) with BulkItemResponse. This object is just having id, index and type details, but not actual document.

Am I missing anything important here?

jpountz · November 4, 2015, 4:58pm

Oh I see. Something useful is that the response at index i in the response maps to the request at index i in the request, so you can get a reference to the ActionRequest that failed, then cast it to an IndexRequest (if you know it is an IndexRequest) and get the source using the .source() method.

kpkvarma · November 5, 2015, 8:29am

Thank you very much, finally able to get source/document with your suggestion.

Topic		Replies	Views
Identify, save and resend failed requests in BulkProcessor Elasticsearch	2	648	April 20, 2017
[Java] BulkProcessor- custom data in request-response items Elasticsearch	5	1564	December 4, 2017
BulkProcessor pest practices Elasticsearch	2	2488	July 6, 2017
How to identify message causing error in bulk request Elasticsearch	10	18306	July 5, 2017
[Java] Quick way to get the failure number in BulkResponse? Elasticsearch	3	1712	July 5, 2017

Way to re-index failed documents using BulkProcessor

Related topics