Low level operations during _delete_by_query

Rahul_Kumar3 · March 31, 2019, 5:30am

What are the sequence of low-level operations that happens during the execution of _delete_by_query operation? On a high-level _delete_by_query operates as follows:

Multiple search requests are sequentially executed in order to find all the matching documents to delete.
Every time a batch of documents is found, a corresponding bulk request is executed to delete all these documents.

Basically, I wanted to understand the following things:

Whether the search request is routed to primary shard or happens at the node where the request is received?
Does the delete operation wait for all the matching documents to be found or immediately deletes the matched document?
Whether the delete operation is routed to primary shard or happens at the node where the request is received?
After the search operation is done how does delete operation know whether the version of the document it is trying to delete is the latest one?
Also, let's say that if somehow delete operation figures out that the version has been incremented after search operation then why does it throw version conflict. Shouldn't it just try to delete the latest version of the document?

Christian_Dahlqvist · March 31, 2019, 5:39am

I believe it runs the query as a scroll query, which runs in a context of the segments that were available when the request was made. Any updates that are performed after the delete-by-query was started will therefore not feature in the search input, which is why you get a version conflict when an updated document is found as the version is used during the delete phase to keep this potentially long-running process safe.

Rahul_Kumar3 · March 31, 2019, 6:36am

Hi @Christian_Dahlqvist,

So basically search operation searches in the context when the search request was started. I have two doubts here:

The search operation is performed on translog or on the Lucene index?
I am guessing that delete operation persists information about the deletion of a particular version of a document to the translog. So how does the delete operation know whether that particular version of the document is updated during the time when search operation finishes and delete operation starts? Is there some kind of intelligence build into translog that does not allow persisting information about an older version of a document?

Christian_Dahlqvist · March 31, 2019, 7:32am

The search only considers documents that have been committed to Lucene and are searchable.

The delete takes documents in the translog into account. Get, set, update and delete operations on individual documents always consider all documents as it is not a search.

system · April 28, 2019, 7:32am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Elasticsearch delete_by_query 409 version conflict Elasticsearch	9	26649	April 27, 2019
Update_by_query/delete_by_query conflicts=proceed doesn't actually perform the operation? Elasticsearch	2	1459	December 16, 2020
How are version conflicts during _delete_by_query handled? Elasticsearch	3	432	March 5, 2019
Delete by query conflict Elasticsearch reindex	1	260	January 31, 2024
Elasticsearch Version Conflicts When Deleting Data Elasticsearch	4	2255	August 16, 2018

Low level operations during _delete_by_query

Related topics