Is it possible to perform bulk insert from Spark to ElasticSearch?

diplomaticguru · July 8, 2015, 5:52pm

Is it possible perform bulk insert from Spark to ElasticSearch?

At the moment, I'm using the 'saveToEsWithMeta' method for upserting the data(JavaPairRDD). Is there a way to bulk insert using the _bulk API? Are there any example that I could take a look?

costin · July 9, 2015, 9:27am

All the writes in Elasticsearch-Hadoop (including Spark) are done using the
bulk API underneath (through the REST protocol and thus use the _bulk
endpoint). Whether you saving 1, 100 or 10K, the procedure is the same.
Btw, I recommend spending some time reading the whole reference
documentation as it covers the architecture pretty well and provides plenty
of examples.

diplomaticguru · July 9, 2015, 10:53am

Thank you @costin for your reply. I'll check out the document but were you referring to this; https://github.com/elastic/elasticsearch-hadoop/blob/master/docs/src/reference/asciidoc/core/spark.adoc

costin · July 14, 2015, 6:33am

@diplomaticguru Why are you looking at the source and not the official, rendered doc which is available here? The docs are mentioned in the Github readme and on the project homepage.

How did you come across es-hadoop ? It's an honest question since it looks like the reference documentation was not advertised enough and I'd like to address that.

Topic		Replies	Views
Bulk insert to elasticsearch in spark using scala Elasticsearch es-hadoop	4	3248	March 28, 2017
Elasticsearch-hadoop and updating records Elasticsearch es-hadoop	3	1389	July 6, 2017
Can es-hadoop write bulk files to disk? Elasticsearch es-hadoop	2	758	July 6, 2017
Reagrding the bulk inserting data to ES Cluster Elasticsearch	2	542	July 5, 2017
Elastisearch-Hadoop how to do a bulk search in spark program Elasticsearch es-hadoop	2	994	October 10, 2017

Is it possible to perform bulk insert from Spark to ElasticSearch?

Related topics