Using ES Spark to copy data from one instance to another

Ramdev_Wudali · November 13, 2015, 10:15pm

Hi:
Is it possible to use the Spark API to read an index from one ES cluster and write the same data into a different ES cluster? If so, how can I do it? Basically, what would the configuration look like, and how can I get the RDD to line up with the different SparkContexts?

Thanks much

Ramdev

costin · November 14, 2015, 7:56pm

No, ES-Hadoop works only against a single cluster within a job. You could try using tribe nodes; however, this is an unsupported scenario.

You can simply read the data, store it on disk/Spark/HDFS/S3, etc., and then start another job to write it to the other ES cluster.
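As a rough illustration of that two-job approach, here is a minimal PySpark sketch. It assumes the elasticsearch-hadoop jar is on the Spark classpath and uses the Hadoop input/output format classes that ES-Hadoop ships. The host names, index names, staging path, and the helper names `es_conf`, `export_job`, and `import_job` are all made up for the example, and it has not been run against a real cluster.

```python
# Sketch only: hosts, index names and the staging path below are hypothetical.

def es_conf(nodes, resource, port="9200"):
    """Build the ES-Hadoop settings for one cluster (one dict per job)."""
    return {"es.nodes": nodes, "es.port": port, "es.resource": resource}


# Each job gets its own configuration, pointing at exactly one cluster.
SOURCE_CONF = es_conf("source-es.example.com", "myindex/mytype")
TARGET_CONF = es_conf("target-es.example.com", "myindex/mytype")
STAGING_PATH = "hdfs:///tmp/es-copy/myindex"  # hypothetical intermediate location


def export_job(sc, conf=SOURCE_CONF, path=STAGING_PATH):
    """Job 1: read (doc_id, document) pairs from the source cluster and stage them."""
    rdd = sc.newAPIHadoopRDD(
        inputFormatClass="org.elasticsearch.hadoop.mr.EsInputFormat",
        keyClass="org.apache.hadoop.io.NullWritable",
        valueClass="org.elasticsearch.hadoop.mr.LinkedMapWritable",
        conf=conf,
    )
    rdd.saveAsPickleFile(path)


def import_job(sc, conf=TARGET_CONF, path=STAGING_PATH):
    """Job 2: load the staged pairs and index them into the target cluster."""
    rdd = sc.pickleFile(path)
    rdd.saveAsNewAPIHadoopFile(
        path="-",  # ignored by EsOutputFormat, but the argument is required
        outputFormatClass="org.elasticsearch.hadoop.mr.EsOutputFormat",
        keyClass="org.apache.hadoop.io.NullWritable",
        valueClass="org.elasticsearch.hadoop.mr.LinkedMapWritable",
        conf=conf,
    )
```

Each function would be called from its own spark-submit run with that run's SparkContext, so no single job has to talk to two clusters. The intermediate format is a free choice; pickle files are used here only because they round-trip the Python dicts without extra schema work.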
