My requirement is to delete a particular document (with doc_id) from multiple indices in one go. The catch is that I don't know which indices contain this doc, so I would have to fire a delete request for the doc_id against every index present in the system.
When I tried to delete via the bulk API by passing each index and doc ID, I got a "request too big" error (status code: 429). The only alternative I could think of is to send a single delete request (DeleteByQuery) targeting all the indices that have to be considered.
My question is whether DeleteByQuery is costlier than delete-by-doc-ID-and-index when it touches all the indices present.
Is there any alternative approach I could give a shot?
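For reference, a delete-by-query across all indices can match the document by its `_id` using an `ids` query, so only the indices that actually contain it are affected. Below is a minimal sketch of the request body for `POST /_all/_delete_by_query`; the doc ID is a hypothetical placeholder, and sending the request to your cluster is left out:

```python
import json

# Hypothetical document ID; replace with the real one.
DOC_ID = "my-doc-id"

# Body for POST /_all/_delete_by_query — the ids query matches the
# document by _id in whichever indices happen to contain it.
body = {
    "query": {
        "ids": {"values": [DOC_ID]}
    }
}

print(json.dumps(body, indent=2))
```

One request then covers every index, at the cost of the query phase having to run against all of them.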
Assuming that means 100,000 documents per index, it is still not necessarily a lot of data for an index unless the documents are massive. I would generally aim to have a shard size of 10GB to 50GB. What is your average shard size?
The shard size is around 1 GB. The current index design was required for a particular use case and all optimisations are 'work-in-progress'.
The main issue that I am facing is 'Request too Big' (status code 429). I was looking into ways to reduce the number of delete requests sent as part of a bulk request, or to send a DeleteByQuery instead, with the query pointing to the doc ID.
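If the bulk route is kept, one way to avoid the request-size error is to split the delete actions into several smaller bulk bodies instead of one huge request. This is a sketch, not an official client feature; the index names, doc ID, and size limit below are all assumptions for illustration (the real limit would be tuned below the cluster's `http.max_content_length`):

```python
import json

# Hypothetical list of (index, doc_id) pairs — in practice, one entry
# per index in the cluster, all for the same doc_id.
actions = [("index-%d" % i, "my-doc-id") for i in range(10)]

MAX_BODY_BYTES = 1024  # assumed per-request limit for this sketch


def bulk_bodies(pairs, max_bytes=MAX_BODY_BYTES):
    """Yield NDJSON bulk bodies, each kept under max_bytes."""
    chunk, size = [], 0
    for index, doc_id in pairs:
        # One delete action line per (index, doc_id) pair.
        line = json.dumps({"delete": {"_index": index, "_id": doc_id}}) + "\n"
        if chunk and size + len(line) > max_bytes:
            yield "".join(chunk)
            chunk, size = [], 0
        chunk.append(line)
        size += len(line)
    if chunk:
        yield "".join(chunk)


# Each yielded body would be sent as a separate POST /_bulk request.
bodies = list(bulk_bodies(actions))
print(len(bodies), "bulk request(s)")
```

Each smaller body is then sent as its own `_bulk` request, so no single request exceeds the limit.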