I intend to schedule replication from one cluster into a "backup cluster".
I've considered the snapshot API, but it cannot back up from one ES cluster directly into another; going through a shared filesystem, S3 or HDFS repository are the only options.
Also, reading the forums, someone mentioned that rsync between the clusters' filesystems was an option, but the clusters may have different topologies, and I don't want inconsistent data or downtime on the "destination" cluster.
So, moving to a solution based on Logstash, my configuration file is:
input {
  elasticsearch {
    # Source cluster; the port can go directly into the "hosts" entry.
    hosts => [ "HOSTNAME_HERE:9200" ]
    index => "INDEXNAME_HERE"
    size => 1000      # documents per scroll page
    scroll => "5m"    # how long each scroll context is kept alive
    docinfo => true   # exposes _index/_type/_id under [@metadata] for the output below
    scan => true      # scan-type search; only supported on older plugin/ES versions
  }
}
output {
  elasticsearch {
    # Destination ("backup") cluster.
    hosts => [ "HOSTNAME_HERE:9200" ]
    # Keep each document's original index, type and id.
    index => "%{[@metadata][_index]}"
    document_type => "%{[@metadata][_type]}"
    document_id => "%{[@metadata][_id]}"
  }
  stdout {
    codec => "dots"   # prints one dot per event as a progress indicator
  }
}
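I start it with bin/logstash -f es-copy.conf (the filename is just an example); the stdout dots codec prints one dot per event so I can watch progress.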
But this does not seem to use the "scroll_id" at all, so each transfer is limited to "size" documents (1000 by default), and I obviously don't know the number of documents per index beforehand.
This should be as automatic as possible, ideally something like rsync but at the cluster level (all changes replicated to the backup cluster); a sketch of what I have in mind is below.
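For illustration, this is roughly the direction I'm thinking of, assuming a plugin version that supports the schedule and query options and that every document carries an indexed @timestamp field (the hosts, index name, time window and cron expression are all placeholders):

input {
  elasticsearch {
    hosts => [ "SOURCE_HOST:9200" ]
    index => "INDEXNAME_HERE"
    # Pull only documents changed recently (assumes a @timestamp field on every document).
    query => '{ "query": { "range": { "@timestamp": { "gte": "now-10m" } } } }'
    size => 1000
    scroll => "5m"
    docinfo => true
    # Re-run the query every five minutes (cron syntax).
    schedule => "*/5 * * * *"
  }
}
output {
  elasticsearch {
    hosts => [ "BACKUP_HOST:9200" ]
    index => "%{[@metadata][_index]}"
    document_type => "%{[@metadata][_type]}"
    document_id => "%{[@metadata][_id]}"
  }
}

Because the original _id is preserved, overlapping time windows would just overwrite the same documents, so repeated runs stay idempotent; deletes, however, would not be propagated this way.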
Any ideas / suggestions?
Thanks