How to reindex from es cluster1 to es cluster2 with Spark

Jack_Wang · October 26, 2016, 7:22pm

I have two ES clusters, I wanna reindex the data from cluster1 to cluster2, but I found I only can setup one SparkContext with one ES cluster, such as:
var sparkConf:SparkConf = new SparkConf().setAppName("EsReIndex")
sparkConf.set("es.nodes", “node1:9200")

So how can I implement the data reindex between two ES clusters.

eperry · October 27, 2016, 3:07am

how about just using Logstash ( An oldie but goodie)

https://www.elastic.co/guide/en/logstash/2.4/plugins-inputs-elasticsearch.html

input {
elasticsearch {
.....details for cluster 1
}
}
output{
eleasticsearch {
...... Details for cluster2
}
}

this is an old how we did it in the old 1.3 days , I know some people keep all their documents in Kafka, and by just resetting the topic offset you can replay the whole Topic.

james.baiera · October 27, 2016, 8:11pm

@Jack_Wang Another option you could try is the ReIndex API in Elasticsearch.

Topic		Replies	Views
Reindex api - One ES cluster to another Elasticsearch	3	636	July 5, 2017
Is there a way to reindex to remote ES cluster with es-spark support? Elasticsearch es-hadoop	2	815	January 10, 2018
How to reindex data from cluster to another Elasticsearch	1	343	December 15, 2020
Using Spark DataSource with ES Hadoop Elasticsearch es-hadoop	2	677	July 6, 2017
Multiple ES clusters in SparkSQL Elasticsearch es-hadoop	9	2876	July 6, 2017

How to reindex from es cluster1 to es cluster2 with Spark

Related topics