I have 4 nodes, where node 1 is the master and the others are data type. I have noticed that the same disk space occupied by data nodes are the same, ie they all have the same information.
This is correct? I have this doubt because in this way the use of disk will be enormous in all the machines of the cluster.
By default Elasticsearch keeps two copies of each shard (one primary and one replica) on separate nodes in case one fails. You can of course adjust this to taste. Perhaps this section of the manual is helpful?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.