Are replicas required when using external volumes like EBS?

Thijsvdp · September 13, 2022, 8:37am

Hi all,

We are currently running our Elasticsearch cluster in Kubernetes and use EBS gp3 volumes as storage. We keep reading that one can disable replicas during the initial data load to speed up indexing. In our application we also update our documents on a regular basis (e.g. monthly).

Disabling replicas for that feels like a little bit of a risk because if the server gets overloaded and some nodes fail we might lose data. However, we were wondering if replicas altogether are needed if you use external volumes like EBS?

Thanks in advance!

warkolm · September 13, 2022, 8:39am

Disabling replicas can increase indexing speed, which is useful for an initial loading of the index.

Removing them entirely is not recommended. Even EBS can go bad.

Thijsvdp · September 13, 2022, 8:51am

Thanks, that makes sense. In our case, we are considering temporarily disabling replicas during a full update of some fields of the documents, and enabling them again afterwards. We are also taking snapshots constantly. We have to test this though.
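
For reference, a minimal sketch of what that toggle could look like, using the index settings API (`PUT /<index>/_settings` with `index.number_of_replicas`). The cluster URL, the index name `my-index`, the restore value of 1, and the helper names are all assumptions for illustration, and authentication/TLS is left out:

```python
import json
import urllib.request

# Assumptions for illustration only: adjust to your own cluster and index.
ES_URL = "http://localhost:9200"
INDEX = "my-index"  # hypothetical index name


def replica_settings_request(base_url, index, replicas):
    """Build (but do not send) a PUT /<index>/_settings request."""
    body = json.dumps({"index": {"number_of_replicas": replicas}}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/{index}/_settings",
        data=body,
        method="PUT",
        headers={"Content-Type": "application/json"},
    )


def set_replicas(base_url, index, replicas):
    """Send the settings update and return the parsed JSON response."""
    req = replica_settings_request(base_url, index, replicas)
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def update_without_replicas(base_url, index, run_update, restore_to=1):
    """Drop replicas to 0, run the update, then always restore them."""
    set_replicas(base_url, index, 0)
    try:
        run_update()
    finally:
        # Restore replicas even if the update fails part-way through.
        set_replicas(base_url, index, restore_to)
```

Something like `update_without_replicas(ES_URL, INDEX, my_update_job)` would wrap the monthly job, where `my_update_job` is whatever callable performs the update. The `finally` block is there so a failed update does not leave the index without replicas.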

warkolm · September 13, 2022, 9:26am

That seems like a sound approach.

system · October 11, 2022, 9:27am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
