Storage Best Practices

SKumarMN · June 30, 2017, 7:05am

As part of core operations class, i see a slide with the below details .

path.data vs. RAID0
‒ RAID0 will be slightly more performant
‒ path.data will allow a node to continue to function
‒ e.g. a machine with 4 2TB drives and at most only 25% (2TB) of
data will need to relocate

Can you explain the above in detail or with some examples so that I can understand it well

warkolm · June 30, 2017, 7:11am

If you point Elasticsearch to multiple path.data mount paths and if one of those paths disappears (ie the disk fails) then you lose that single disk (25%).

If you have RAID0 and a disk fails then the entire array and all data on it are lost.

Attila_Nagy · June 30, 2017, 7:55am

How does elasticsearch handle the case where IO just freezes to that single disk and the path doesn't disappear (cached entries can be read, some writes may succeed until they are fsynced etc)?

warkolm · June 30, 2017, 8:43am

It'd be better if you created another thread in the #elasticsearch category. We're trying to keep this one to specific questions about our online training

SKumarMN · June 30, 2017, 8:54am

Thanks. Say i have a cluster with 3 nodes and a single index with 3 primary and one replica. When we configure multi paths, i believe index is stripped into multi paths but not shards. i.e one complete shard will remain in one path.

Say i have configured multi paths ex path.data : /ds1, /ds2 say one the disks failed(/ds1) in one of the nodes. Now will the shard reallocation happen still i.e make replicas in other nodes primary and create missing replicas( shards that were lost due to disk failure) or does this shard reallocation happen only when node fails.

warkolm · June 30, 2017, 9:09am

Shard reallocation happens on a shard level, not a node. So if the shard on that bad disk is lost then it will recreate one, it does not wait for the host to also drop off.

system · July 30, 2017, 9:09am

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Elasticsearch multi path and RAID-0 Elasticsearch	6	1800	July 12, 2018
Raid 0 SSD? Elasticsearch	19	6451	July 5, 2017
Hybrid RAID 0 and Mutliple Data Paths Elasticsearch	10	1642	July 5, 2017
To multi path.data or hardware RAID or not? Elasticsearch	4	2913	July 5, 2017
Behaviour of ES when using multiple path in path.data Elasticsearch	2	308	December 14, 2020

Storage Best Practices

Related topics