Are replicase required when using external volumes like EBS?

Hi all,

We are currently running our Elasticsearch cluster in Kubernetes and use EBS gp3 volumes as storage. We are reading everywhere that during initial load of the data one could disable replicas to speed up indexing. In our application we are also updating our documents on a regular basis (e.g monthly).

Disabling replicas for that feels like a little bit of a risk because if the server gets overloaded and some nodes fail we might lose data. However, we were wondering if replicas altogether are needed if you use external volumes like EBS?

Thanks in advance!

Disabling replicas can increase indexing speed, which is useful for an initial loading of the index.

Removing them entirely is not recommended. Even EBS can go bad.

Thanks, that makes sense. In our case, we then are considering to temporarily disable replicas during a full update of some fields of the documents, and enable it back again after that. We are also making snapshots constantly. We have to test this though.

That seems like a sound approach.

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.