Hello,
Since S3 Gateway has been officially deprecated, how should one maintain
cluster persistence while running on Amazon cloud?
I can think of
- Once in a while, flush; disable_flush on a node.
- rsync index data to backup folder
- sync backup folder to S3 on background, to the folder named under the
node name. - Do that for every node
That way one can have the latest snapshot of the cluster backed up to S3 up
to the latest snapshot point.
But if I have 20 nodes in the cluster, restoring it from scratch will be a
lot of manual work.
Another way I can see:
- Run on EBS
- Periodically flush/disable_flush on a node
- sync
- create EBS snapshot
- enable_flush
But still
- Need to take care of older snapshots pruning
- Resting still looks like manual pane.
So what is the advised practice of running multi-node cluster on AWS with
ability to recover from cluster sudden death?
Can I still go with S3 gateway if I'll take particular precautions that
someone can outline?
Best regards,
Zaar
--