Best practices for using volumes across a cluster

Hi all. I wanted to ask: what is the best approach for using volumes within a cluster?

I have a cluster with 3 combined master/data nodes (not separate roles), and I want to grow this cluster without losing any data.

Currently my solution is to use a single EFS volume (I'm using AWS to manage the cluster) and attach it to all nodes. Whenever I add a new node, it mounts the same storage and all the data is available to it as well.
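For context, every node points its data path at the same shared mount, something like this (the mount point and names here are illustrative, not my exact config):

```yaml
# elasticsearch.yml — sketch of the current shared-EFS setup
cluster.name: my-cluster
node.name: node-1                    # unique per node
path.data: /mnt/efs/elasticsearch    # the same EFS mount on every node
```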

But the problem is that I got the error "obtaining shard lock for [starting shard] timed out after [5000ms], lock already held for [closing shard] with age [286549ms]", so I'm wondering whether it would be better to use a separate volume for each node.

Keep in mind that the number of nodes can increase and decrease, but the data should persist no matter what.

Thanks in advance.

Have a look at the recommendations around deploying Elasticsearch on AWS. As you can see there, EFS is not recommended for Elasticsearch storage.


Thanks, @Christian_Dahlqvist. I've checked it and understand that EFS is not the best solution, but I still don't see how to preserve all the data if I use the instance store. If one instance is terminated, where will Elasticsearch get the indices from?

In an Elasticsearch cluster shards are generally replicated, so even if you lose one node there is still a copy of each shard available. These can then be replicated again so that the cluster once more holds two copies. You can also use EBS volumes for storage, and these are more resilient than instance store.
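For example, assuming an index named my-index (the name is illustrative), you can make sure every primary shard has one replica copy and then verify that all shards are assigned:

```
# Ensure each primary shard has one replica (index name is illustrative)
PUT /my-index/_settings
{
  "index": {
    "number_of_replicas": 1
  }
}

# Cluster status should be "green" once all primaries and replicas are assigned
GET /_cluster/health
```

As long as a replica lives on a different node than its primary, losing a single node still leaves a complete copy of the data, and Elasticsearch rebuilds the missing copies on the remaining nodes.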

Thanks a lot for your help, @Christian_Dahlqvist.
