This tells me that I have the option to backup to a shared disk location - I have 1 external disk mapped to every node in my cluster seperately and my hypervisor doesn't allow a shared disk - can I still backup to different disks - or will that not work i.e. will it create broken or duplicated data?
I'm afraid you'll need a shared disk location as a snapshot repository. Taking a snapshot is a distributed process and only if both master- and data-nodes have access to the same shared path will it work out correctly.
Maybe you can add e.g. a Minio server that mounts your external drive to your deployment and use the S3 repository plugin to work around this issue?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.