Bulk API update by query

seanlaff · July 19, 2017, 5:52pm

Is it possible to use update_by_query with the bulk api?

I'm constantly polling data from mysql and updating the relevant ES documents- however since I don't match on the document IDs (I match on various keys of a document), I have been using the update_by_query function.

I'd like to bulk these requests since I'm making thousands of them, but it doesn't seem possible to use these two features together.

Any advice would be appreciated.

polyfractal · July 20, 2017, 8:57pm

To my knowledge, I don't believe update-by-query works with bulk. In fact, it's essentially doing bulk updates under the hood. UPQ works by executing a query to find all matching documents, collecting the IDs, then issuing a bulk request with an update action for each document.

A bulk update-by-query could be very expensive, since it would send off many search and bulk requests simultaneously.

Is there any way that you can tie the document's ID in elasticsearch to an ID in mysql? Update-by-query is useful, but also rather expensive. Doing thousands of them sounds like it will be putting a lot of strain on your cluster.

system · August 17, 2017, 8:58pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Does ElasticSearch support _bulk API with update_by_query Elasticsearch	1	450	August 24, 2017
How update, update_by_query in ES really work? Elasticsearch	8	2501	October 4, 2022
Update By Query - performance Elasticsearch	1	800	May 17, 2017
Official Support for UpdateByQuery? Elasticsearch	1	273	July 6, 2017
Bulk Data Update Query? Elasticsearch	3	580	July 5, 2017

Bulk API update by query

Related topics