Hi,
I've been looking into the backup and restore plan for our modest elasticsearch cluster. We are currently setup in a "Hot/Warm" architecture with curator moving old indexs from hot to warm. We currently have 2x Hot and 2x Warm (We will be expanding this as soon as budget allows).
We use ZFS send/receive for most of our other services to perform snapshot and incremental backup (https://docs.oracle.com/cd/E23824_01/html/821-1448/gbciq.html). We then ship the snapshots off to a remote backup server using ZFS send/receive (https://docs.oracle.com/cd/E23824_01/html/821-1448/gbchx.html#scrolltoc). Services like like MySQL, Postgresql etc all work fine using this method.
After running a successful ZFS snapshot every hour and rolling back to a recent snapshot I'm unable to see any of the indexes, despite the data being back on the filesystem.
df -h
Filesystem Size Used Avail Use% Mounted on
elasticsearch 6.4T 128K 6.4T 1% /data/elasticsearch
zfs rollback elasticsearch@2018-03-19_08:00:01
df -h
Filesystem Size Used Avail Use% Mounted on
elasticsearch 6.4T 47M 6.4T 1% /data/elasticsearch
Has anyone ever been able to successfully backup and restore ES using ZFS snapshots? Is there a better way to achieve an automated, incremental backup system within ES? We have no access to a HDFS FS however we could possibly look at AWS S3 (This would be highly dependent on our data storage terms with our partners)
Thanks!