Elasticsearch deployment

porscheme · June 22, 2021, 7:49pm

We wanted to build Elasticsearch on cloud Azure.
Source data size: 50TB
Replicas count: 3
Shard count: 1000

For doing this, we are considering below two VM SKUs.

L32sv2 is very processing we wanted to know SLAs for this VM SKU
Any experience with using this SKU is really helpful

DS5v2 – Intel Xeon processor
RAM: 56 GB
vCPU(s): 16
Managed Disk: Premium SSD

L32s v2: AMD EPYC 7551 processor
RAM: 256 GB
vCPU(s): 32
Local NVMe premium SSD disks attached to VMs (ephemeral)

warkolm · June 22, 2021, 9:56pm

Welcome to our community!

Are you saying you will have 2 nodes, with 3000 shards?

porscheme · June 25, 2021, 7:27am

No, we are thinking of using 10 nodes with 1000 shards.
We want each shard to be 75GB.

But the real question here is...
By default, I was assuming we’d have to use managed disks (using DS5v2 VM SKU) as we couldn’t rely on the local temporary storage. That could get blown up at any time. Attached managed storage seems to be the way most people talk about supporting this scenario online.

However, the Lsv2-series has both temporary storage and NVMe disk. The spec sheet for it talks about it being ideal for “Big Data, SQL, and NoSQL databases.”. Which seems to fit our problem space

Should use DS5v2 (with managed disks) or L32s (NVMe storage)

system · July 23, 2021, 7:27am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
L32sv2 vs DS5v2 on Azure Elasticsearch	7	1028	July 27, 2021
Using huge NVMe disks with elasticsearch Elasticsearch	9	1622	July 18, 2021
Elasticsearch data node sizing on Azure infrastructure Elasticsearch	1	492	May 15, 2018
Elasticsearch performance in HDD vs SSD and 32 GB vs 64 GB of RAM Elasticsearch	25	2979	June 30, 2023
Disk usage difference between data nodes Elasticsearch	6	1808	August 3, 2021

Elasticsearch deployment

Related topics