I am trying to perform an update operation with the elasticsearch-hadoop connector in PySpark. The documentation says that if no document is found, an exception is thrown. What is the best way to handle this exception? Or is it possible to pass something like the `raise_on_exception=False, raise_on_error=False` options provided by the Python Elasticsearch API?
@tree Currently there is no way to suppress the error when it occurs. If a document is missing when an update is executed, there is nothing for the connector to do but fail the task. A different option would be to avoid the `update` operation and instead use the `upsert` operation, since it has fewer cases where it can fail: if the document does not exist, it is created rather than causing an error.
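To make that concrete, here is a minimal sketch of switching a PySpark write from `update` to `upsert` via the connector's `es.write.operation` setting. The index name, ID field, and host are assumptions for illustration; only the `es.write.operation` and `es.mapping.id` options come from the elasticsearch-hadoop configuration documentation.

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("es-upsert-example").getOrCreate()

# Hypothetical data: each row carries the document id in the "id" column.
df = spark.createDataFrame(
    [(1, "alice", 10), (2, "bob", 20)],
    ["id", "name", "score"],
)

(df.write
    .format("org.elasticsearch.spark.sql")
    .option("es.nodes", "localhost:9200")   # assumed cluster address
    .option("es.mapping.id", "id")          # column used as the document _id
    # "upsert" inserts the document when it does not exist instead of
    # failing the task, which "update" would do for a missing document.
    .option("es.write.operation", "upsert")
    .mode("append")
    .save("my-index"))                      # assumed index name
```

With `es.write.operation` set to `update`, a missing document fails the Spark task; with `upsert`, the same row is simply indexed as a new document.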