Currently we have two clusters, in one we have 4 nodes(1 coordinating, 3 data nodes out of which two are master) and second cluster(1 coordinating, 5 data nodes out of which three are master). Can someone suggest me how to merge these two cluster as it is creating performance issues at our end.
OK. Then I suspect merging will need to be done as follows:
Take a full snapshot using the snapshot API of cluster 2.
Redirect writing of new data from cluster 2 to cluster 1.
Take a full snapshot of cluster 2 in order to capture the latest data written.
Add new empty nodes to cluster 1 so it has capacity to import the snapshot.
Restore the last cluster 2 snapshot to cluster 1. This may require some indices (or all) to be renamed to avoid clashes with the new indices being written to.
While the snapshot is being restored indexing of new data will work but old data will be temporarily unavailable.
If you can expand cluster 1 without decommissioning cluster 2 you should be able to speed up this process by migrating older indices before redirecting traffic to the new cluster.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.