Hello,
We are using Elasticsearch 5.0.
I am performing bulk writes from a DataFrame to Elasticsearch using Spark. The writes are performed with .option("es.write.operation", "upsert") and .mode("append"). Note that we also set other batch-size options (es.batch.size.bytes and es.batch.size.entries).
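For reference, here is roughly how the write looks (the index name, type, and id column below are placeholders, not our real names):

```scala
// Rough sketch of the current write. Requires the elasticsearch-spark
// connector for Spark 2.x / ES 5.x on the classpath,
// e.g. org.elasticsearch:elasticsearch-spark-20_2.11:5.0.0.
import org.apache.spark.sql.{DataFrame, SaveMode}

def writeToEs(df: DataFrame): Unit = {
  df.write
    .format("org.elasticsearch.spark.sql")
    .option("es.write.operation", "upsert")   // upsert instead of plain index
    .option("es.mapping.id", "doc_id")        // placeholder: document id column
    .option("es.batch.size.bytes", "1mb")     // bulk flush threshold by size
    .option("es.batch.size.entries", "1000")  // bulk flush threshold by count
    .mode(SaveMode.Append)
    .save("my-index/my-type")                 // placeholder: index/type
}
```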
I am trying to understand whether there is a way to capture the bulk output results in Spark, or a way to force Elasticsearch to store the results of all bulk operations (e.g., as tasks) automatically so that I can look them up afterwards. What is the best way to do this?
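The only workaround I can think of so far (I am not sure it is the right approach) is to bypass the connector for part of the data and send the _bulk request myself from foreachPartition, since the raw bulk response contains an "errors" flag and a per-action "items" array. A rough sketch with the low-level REST client, where the host is a placeholder:

```scala
// Rough sketch: send one bulk body directly and return the raw response,
// which includes "errors" and per-action "items". Assumes the low-level
// REST client (org.elasticsearch.client:rest, 5.x) is on the classpath.
import java.util.Collections
import org.apache.http.HttpHost
import org.apache.http.entity.ContentType
import org.apache.http.nio.entity.NStringEntity
import org.apache.http.util.EntityUtils
import org.elasticsearch.client.RestClient

def bulkWithResponse(ndjsonBody: String): String = {
  val client = RestClient.builder(new HttpHost("localhost", 9200, "http")).build()
  try {
    val entity = new NStringEntity(ndjsonBody, ContentType.APPLICATION_JSON)
    val response = client.performRequest(
      "POST", "/_bulk", Collections.emptyMap[String, String](), entity)
    // Persist or log this body; scan it for "errors":true to find failed items.
    EntityUtils.toString(response.getEntity)
  } finally {
    client.close()
  }
}
```

I could call something like this from df.toJSON.foreachPartition after building the action/metadata lines, but that means reimplementing part of what the connector already does, which is why I am asking whether there is a built-in way.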
Note: my eventual goal is to detect any consistency issues during parallel bulk operations that cause partial updates/inserts to documents, and to identify the affected documents.