Why NFS is to be avoided for data directories

malshan · January 16, 2020, 5:30am

ES: 7.5

I have seen the recommendations in official documentation to 'avoid NFS' when using them as data directories for elasticsearch. I have also seen almost everywhere in the forums to 'avoid NFS' but couldn't find a proper explanation.

Is it due to the fact that NFS is exported as a file system, and not a block device? Since there would be higher number of concurrent writes and reads, NFS would have protocol level bottlenecks.
Is it due to network latency related issues of any network related storage (against local storage).

Is it only the (1) reason, if that so, can we use a network bloack storage like ceph or SAN.

Thanks

dadoonet · January 16, 2020, 5:42am

Main reason to me is the latency.

DavidTurner · January 16, 2020, 6:25am

Yes, latency is a big factor. Another is correctness: Elasticsearch expects the filesystem under the data path to act like a local filesystem, and uses some fairly advanced features that are typically not well supported by nonlocal storage. For instance it needs locking and atomic file creation to work right; it's notoriously tricky to get NFS to do these things correctly.

Distributed storage like Ceph and GlusterFS is something I'd avoid. These technologies are still maturing IMO and have been linked to lost or corrupt data in the recent past. You don't need distributed storage since Elasticsearch handles the distributed side of things for you.

SANs work ok where performance is less important (e.g. the cold tier). I haven't heard of as many correctness issues with SANs as with your other suggestions.

Christian_Dahlqvist · January 16, 2020, 6:25am

As far as I recall it is both. There used to be issues around NFS causing problems due to not behaving and offering the same mechanisms as a block store, but that may have been addressed in more recent versions. Performance is a however still a major consideration as Elasticsearch tends to perform a lot of small random reads and writes rather that large consecutive operations. I do not recall seeing ceph used but many users have clusters backed by SAN. SAN performance can vary a lot so make sure you test properly.

malshan · January 16, 2020, 6:39am

Hi Guys,

Thanks for the quick help.
Our infrastructure team wants to use already available NFS storage for this task. I'll let them know that this unsuitable for elasticsearch.

Jugsofbeer · January 16, 2020, 7:28am

.... and while you are avoiding NFS, also avoid spinning hard drives - the pain is real.

SSD is the best starting point .

DavidTurner · January 16, 2020, 10:42am

Spinning disks are certainly a poor choice for nodes that see heavy indexing, but they can work well for nodes in the warm and cold tiers that see very little write traffic.

Jugsofbeer · January 16, 2020, 5:30pm

Your right, for warm or cold tiers spinning drives might be perfectly fine.

system · February 13, 2020, 5:30pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Bad indexing performance of elasticsearch Elasticsearch	7	2654	July 5, 2017
Elasticsearch wich SAN storage? Elasticsearch	7	2715	August 9, 2019
SAN storage - I/O values Elasticsearch	2	11	March 27, 2025
ES and SAN Elasticsearch	7	4306	July 6, 2017
Newbie question - need suggestion - NFS share + 1 node Elasticsearch	2	412	August 31, 2020

Why NFS is to be avoided for data directories

Related topics