Using huge NVMe disks with elasticsearch

Timur_Makarchuk · June 17, 2021, 7:59am

Hello everyone.

We're currently in process of choosing new hardware for our elasticsearch cluster.

Our current cluster consists of 32 nodes and holds 42TB of data across 2000 indices.

We're choosing hardware from the specs our provider has.
One of options we're considering is the one that has 256GB RAM and 4x2TB NVMe SSD.
We're planning to join those in RAID0 which would get us 8TB NVMe SSD per node.
My questing is is it maybe a bit too much since some of our shards are pretty small and we may cross 20 shards or fewer per GB of heap memory boundary.

And seeing is this node has too much RAM as well (although it can be used as cache) we were considering splitting those nodes into 4 LXC containers 64GB RAM and 2TB each. Which option would be preferable from elasticsearch perspective?

warkolm · June 17, 2021, 8:18am

Bare metal. But honestly, containerising things makes much more logical sense.

elasticforme · June 17, 2021, 12:58pm

why use raid0. you should use single disk /data01, /data02, /data03 etc... and elasticsearch will manage them. if you loose one disk you are only loosing 25% of shard on that node. if you use 8TB with Raid0 then one dead disk and you have 100% shard lost for that node.
256 RAM might be overkill and on that sense your logic is write to split it.

I am also in process of setting up same amount of NVME but 98gig ram 20 node cluster.

elasticforme · June 17, 2021, 1:01pm

Performomance vs other benefit

https://discuss.elastic.co/t/large-cluster-on-vm-vs-bare-metal/276017/2

Christian_Dahlqvist · June 17, 2021, 1:05pm

I believe multiple data paths is getting deprecated so raid0 is likely a better option. Recall seeing a discussion about that around here somewhere…

Timur_Makarchuk · June 17, 2021, 1:23pm

Hi! Thank you for your reply.

@elasticforme Your questing mentions VMs which implies performance overhead much larger then one of LXC (which is not VMs, but Containers), so I'm not sure if reply to your question is applicable here.

elasticforme · June 17, 2021, 1:58pm

whole, I didn't see that anywhere and I do have all my system with multiple data path.

Timur, yes vm/container not same but close because using same resource on same hardware

DavidTurner · June 19, 2021, 8:28am

Christian is right, they are deprecated as of 7.13 and 8.0 will require a node per data path instead:

github.com/elastic/elasticsearch

Deprecate and remove Multiple Data Paths

opened 10:46PM - 01 Apr 21 UTC

rjernst

:Core/Infra/Core Meta Team:Core/Infra

Multiple Data Paths (MDP) is a pseudo-software-RAID-0 feature within Elasticsear…ch allowing multiple paths to be specified in the path.data setting (which usually point to different disks). Although it has been used in the past as a simple way to run a multi-disk setups, it has long been a source of user complaints due to confusing or unintuitive behavior. Additionally, the implementation is complex, and not well-tested nor maintained, with practically no benefit over spanning the data path filesystem across multiple drives and/or running one node for each data path. We have long advised against using MDP, and are now ready to deprecate and remove it. This is a meta-issue to track that work. - [x] Deprecate MDP in 7.13 - [x] Document migration path #71871 - [ ] ~Remove documentation from 8.0~ - [ ] ~Block MDP in 8.0~ - [ ] ~Remove MDP from 8.0~

elasticforme · June 20, 2021, 1:03am

IMHO this is wrong move.

system · July 18, 2021, 1:03am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Hybrid RAID 0 and Mutliple Data Paths Elasticsearch	10	1687	July 5, 2017
Raid 0 SSD? Elasticsearch	19	6584	July 5, 2017
Elasticsearch hardware planning Elasticsearch	5	835	July 6, 2017
Using RAID 0 vs multiple data paths after commit #10461 Elasticsearch	2	450	July 6, 2017
RAID question Elasticsearch	2	399	July 5, 2017

Using huge NVMe disks with elasticsearch

Related topics