We use Elasticsearch as a time-series database (TSDB), and our documents look like this:
{
  "metric": "metric_name",
  "@timestamp": "2020-01-01T01:01:01",
  "t": {
    "key1": "val1",
    "key2": "val2"
  },
  "f": {
    "name1": 1.2,
    "name2": 2.3
  }
}
But as the data grows (to billions of datapoints), the index files grow huge, even though we already roll indices by date:
Points (0.07): 1,657,438,615 bytes
DocValues (0.18): 3,876,527,301 bytes
Term Dictionary (0.20): 4,237,561,161 bytes
Field Data (0.44): 9,265,590,659 bytes
Frequencies (0.08): 1,733,107,687 bytes
The queries we run are simple: term filters (yes/no matches), prefix queries, and a few basic aggregations (terms and date_histogram, then sum/avg).
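As a sketch, the query shapes described above can be written as Elasticsearch query-DSL bodies. The field names come from the example documents; the exact filter values and the interval are hypothetical:

```python
import json

# A term filter plus a prefix filter, then a terms -> date_histogram
# aggregation with sum/avg metrics, matching the query pattern we use.
query_body = {
    "query": {
        "bool": {
            "filter": [
                {"term": {"metric": "metric_name"}},      # exact (yes/no) match
                {"prefix": {"t.key1": "val"}},            # prefix match on a tag
            ]
        }
    },
    "aggs": {
        "by_tag": {
            "terms": {"field": "t.key2"},
            "aggs": {
                "per_minute": {
                    "date_histogram": {
                        "field": "@timestamp",
                        "fixed_interval": "1m",
                    },
                    "aggs": {
                        "sum_name1": {"sum": {"field": "f.name1"}},
                        "avg_name1": {"avg": {"field": "f.name1"}},
                    },
                }
            },
        }
    },
}

print(json.dumps(query_body, indent=2))
```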
So my question is: can Elasticsearch translate keyword values into dictionary IDs first and store the mapped IDs instead of the original keyword content at the storage layer (transparently to users), to save space and I/O? For example, storing:
{
  "metric": "0",
  "@timestamp": "2020-01-01T01:01:01",
  "t": {
    "key1": "1",
    "key2": "2"
  },
  "f": {
    "name1": 1.2,
    "name2": 2.3
  }
}
instead of:
{
  "metric": "long_metric_name",
  "@timestamp": "2020-01-01T01:01:01",
  "t": {
    "key1": "long_val1",
    "key2": "long_val2"
  },
  "f": {
    "name1": 1.2,
    "name2": 2.3
  }
}
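To make the idea concrete, here is a minimal sketch of the transparent dictionary encoding I mean: keyword values are swapped for small integer IDs at write time and restored from the dictionary at read time. All names here are hypothetical; this is not an existing Elasticsearch feature, just an illustration of the requested behavior:

```python
import json

class KeywordDictionary:
    """Maps long keyword terms to short string IDs and back."""

    def __init__(self):
        self._to_id = {}
        self._to_term = []

    def encode(self, term: str) -> str:
        # Assign the next integer ID on first sight of a term.
        if term not in self._to_id:
            self._to_id[term] = str(len(self._to_term))
            self._to_term.append(term)
        return self._to_id[term]

    def decode(self, term_id: str) -> str:
        return self._to_term[int(term_id)]

def encode_doc(doc, d):
    """What the storage layer would persist: IDs instead of keywords."""
    out = dict(doc)
    out["metric"] = d.encode(doc["metric"])
    out["t"] = {k: d.encode(v) for k, v in doc["t"].items()}
    return out

def decode_doc(doc, d):
    """What the user would see: the original keywords, transparently."""
    out = dict(doc)
    out["metric"] = d.decode(doc["metric"])
    out["t"] = {k: d.decode(v) for k, v in doc["t"].items()}
    return out

d = KeywordDictionary()
original = {
    "metric": "long_metric_name",
    "@timestamp": "2020-01-01T01:01:01",
    "t": {"key1": "long_val1", "key2": "long_val2"},
    "f": {"name1": 1.2, "name2": 2.3},
}
encoded = encode_doc(original, d)

# Round-trips losslessly, and the stored form is smaller.
assert decode_doc(encoded, d) == original
assert len(json.dumps(encoded)) < len(json.dumps(original))
```

The longer and more repetitive the keyword values are, the larger the saving, which is why it seems attractive for tag-heavy TSDB workloads like ours.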