Disk usage: tags vs fields

YvorL · October 29, 2018, 11:23pm

Hi!

Is there a significant difference in disk usage between keeping some data in an array and keeping it in individual fields? The data won't be queried frequently, but it may sometimes act as a filter. So I'm not concerned about search speed in this case, only about whether I can save a couple of GB per X million documents.

Thank you!

polyfractal · November 2, 2018, 2:41pm

I'm not entirely sure I understand. Are you asking the difference between:

{
  "tags": ["foo", "bar"]
}

and

{
  "tag1": "foo",
  "tag2": "bar"
}

??

YvorL · November 5, 2018, 8:09am

Yes. Currently, I have about 5 values that aren't that important (e.g., I don't use scoring). With around 15,000,000,000 documents, would I save disk space if those were in an array (like the tags example)?

s1monw · November 5, 2018, 1:48pm

The tags example will be more efficient.
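
For a rough feel of one part of the difference, here is a minimal Python sketch that compares only the raw JSON size of the two document shapes from the thread. The five values and the helper names (`as_array`, `as_fields`, `source_bytes`) are made up for illustration. It says nothing about the on-disk index structures, and stored source is typically compressed, so the real saving will differ. The only reliable measurement is to index a sample of real documents both ways and compare the index sizes.

```python
import json

# Hypothetical values; the thread mentions "about 5 values" per document.
values = ["foo", "bar", "baz", "qux", "quux"]


def as_array(vals):
    """Shape 1: all values in a single array field."""
    return {"tags": list(vals)}


def as_fields(vals):
    """Shape 2: one individual field per value (tag1, tag2, ...)."""
    return {f"tag{i}": v for i, v in enumerate(vals, start=1)}


def source_bytes(doc):
    """Size of the compact JSON encoding of the document, in bytes."""
    return len(json.dumps(doc, separators=(",", ":")).encode("utf-8"))


array_bytes = source_bytes(as_array(values))
fields_bytes = source_bytes(as_fields(values))

print("array shape: ", array_bytes, "bytes")
print("fields shape:", fields_bytes, "bytes")
print("difference:  ", fields_bytes - array_bytes, "bytes per document (uncompressed JSON)")
```

The array shape repeats the field name once per document instead of once per value, which is where the raw-JSON difference comes from. Multiplying the per-document difference by the document count gives only an upper-bound style estimate for the source portion, before compression.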

