Hi I need to change architecture of my cluster, thus I need to know Your's idea/approaches to do this.
So currently my cluster consist on 3 bare metal server with sdd and hdd disk.
But I need to increase memory of these nodes therefore it's needed to merge the data nodes.
Can You keep an eye on below schema of my cluster
Can you provide more context about your infrastructure?
You have 3 bare metals and the nodes you shared in this image are virtual machines running on those bare metals?
For example, if you need to remove the nodes data hot1_1, data hot2_1 and data hot3_1 to create a new node named data hot123_1 you will need to first remove all the data from these 3 nodes and allocate the data in the remaining data hot nodes, when there is no shard on those nodes you can remove them from the cluster and add the new node.
This is the first thing you need to do, you can't merge the disks while the nodes are running and have data, if you do that you will lose data, you need to remove the nodes from allocation, remove them from cluster, create your new node, add the new node to the cluster, wait for the rebalance to finish and them repeat this for all your other nodes.
You also need to have enough available space in the other nodes to allocate the shards.
I do not use docker, but the logic is the same you need to exclude from allocation the nodes you want to remove, you can exclude them by name or id as well, it is in the documentation.
This is multipath, it is still supported but it is deprecated and will be removed in the future you should avoid start using it now.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.