Elasticsearch: What's the most cost-effective way to store big data?

We are using Elasticsearch v7.4 (Basic license) on a single node with nearly 2 TB of data. We are planning to increase our retention, but we are constrained by the node's storage capacity. Adding disks and using multiple data paths is an option, but it is not a recommended one; the alternative of pooling the disks with LVM is something I found quite troublesome.
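For context, this is how I check how full the node currently is; a minimal sketch assuming the elasticsearch-py 7.x client and a cluster reachable at localhost:9200:

```python
from elasticsearch import Elasticsearch

# Assumed connection details; adjust for your cluster.
es = Elasticsearch(["http://localhost:9200"])

# Disk usage and shard count per node (equivalent to GET _cat/allocation?v).
print(es.cat.allocation(v=True))

# Per-index store size, largest first (equivalent to GET _cat/indices?v&s=store.size:desc).
print(es.cat.indices(v=True, s="store.size:desc"))
```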

We are considering data tiers and adding new nodes in tiers like 'cold' or 'frozen'. Is this the best way to achieve this? I have tested the cold tier, but does this version of ES support the frozen tier? We do not expect the same search performance for very old data as for recently created indices. Are HDDs the best option for nodes in these tiers to save cost? Is heap memory still a performance factor in these tiers?
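For reference, this is roughly the kind of ILM policy I have been testing for the cold tier; a minimal sketch assuming the elasticsearch-py 7.x client, with the node attribute `box_type`, the policy name, and all timings made up for illustration:

```python
from elasticsearch import Elasticsearch

es = Elasticsearch(["http://localhost:9200"])  # assumed connection details

# Hypothetical policy: roll over hot indices, then after 30 days move the data
# to nodes started with `node.attr.box_type: cold` and drop replicas; delete
# after one year. Names and timings are illustrative only.
policy = {
    "policy": {
        "phases": {
            "hot": {
                "actions": {
                    "rollover": {"max_size": "50gb", "max_age": "7d"}
                }
            },
            "cold": {
                "min_age": "30d",
                "actions": {
                    "allocate": {
                        "require": {"box_type": "cold"},
                        "number_of_replicas": 0
                    }
                }
            },
            "delete": {
                "min_age": "365d",
                "actions": {"delete": {}}
            }
        }
    }
}

es.ilm.put_lifecycle(policy="logs-retention", body=policy)
```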

What are some alternatives for storing big data in ES, if there are any?

Hello,

Data tiering also depends on how you query: do you have a notion of time or age in your data?

You could think about frozen and/or cold indices if you don't need to access the data on a regular basis.

Data tiering is an effective way to increase Elasticsearch storage capacity, based on the fact that you might only need to query the last 7 days regularly and the frozen data perhaps once a year.
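In 7.x you can also freeze an index directly once it falls out of your regular query window; frozen indices drop most of their heap footprint and rebuild their in-memory structures on demand when searched. A minimal sketch, again assuming the elasticsearch-py 7.x client and a hypothetical index name:

```python
from elasticsearch import Elasticsearch

es = Elasticsearch(["http://localhost:9200"])  # assumed connection details

# Hypothetical index that is older than your regular query window.
old_index = "logs-2019.01"

# Freeze it: its in-memory structures are released and reloaded per search.
es.indices.freeze(index=old_index)

# Frozen indices are skipped by default; include them explicitly for that
# once-a-year query.
hits = es.search(
    index=old_index,
    ignore_throttled=False,  # required to search frozen indices in 7.x
    body={"query": {"match_all": {}}},
    size=1,
)
print(hits["hits"]["total"])
```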

Why would LVM save you any space or increase storage effectiveness?

Also, if you're using enterprise-grade storage solutions, you should look into the deduplication technology on your storage hardware; it's remarkable how much space this can save, as much as 60%+ on some datasets.
