Mapping: Array Type vs Text Type(with fielddata set to true)

avr · February 2, 2017, 8:58pm

I've a use case where I want to use text field in aggregations. By default text fields are not aggregatable and to make them aggregatable we need to enable fielddata parameter in the mapping.
As enabling fielddata comes with cost of HEAP, I'm wondering if following workaround makes any sense!

Pre-Analyse( tokenize text field) and make array out of it and index it as Array Type
After indexing text field as array type(into field called "text_array") I'll get two fields one is text_array (Text Type) and another one is text_array.keyword(Keyword Type)
As text_array.keyword is type Keyword I can use this in aggregations.

Is creating text_array.keyword makes sense? or does it also consumes HEAP when it is used in aggregations?

Any help on this much appreciated!

mnozawa · February 3, 2017, 12:49am

Hi,

By default, the doc_values setting is true In keyword datatype field .
So it doesn't consume so many heap at aggregation.

On the other hand, text type field may not necessary (it depends on your use case)

system · March 3, 2017, 12:49am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Making fields aggregate-able Elasticsearch	4	1004	December 14, 2016
ElasticSearch term Aggregation on text fields Elasticsearch	1	400	June 25, 2020
Using keyword type for mapping Elasticsearch	7	2183	July 31, 2018
Fielddata error indexing string fields Elasticsearch	3	951	December 14, 2021
Converting from 2.x to 5.x: "type": "string", "index": "no" -- use type or keyword? Elasticsearch	4	644	March 7, 2017

Mapping: Array Type vs Text Type(with fielddata set to true)

Related topics