Fielddata on a custom analyzer that is of keyword tokenizer

drjz · February 17, 2023, 1:35pm

I created a custom analyzer that lowercases its input. I did not use a normalizer because I also need a stopword filter, which normalizers do not support.

"lowercase_analyzer": {
    "type": "custom",
    "tokenizer": "keyword",
    "filter": [
        "truncate",
        "stopwords",
        "lowercase"
    ]
},
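
To make the behaviour of this filter chain concrete, here is a rough Python model of it. It is only a sketch and not how Elasticsearch implements analysis. The truncate length of 10 and the stopword list are assumptions, since the definitions of the `truncate` and `stopwords` filters are not shown above.

```python
from typing import Optional

# Assumed settings: the filter definitions are not part of the post.
TRUNCATE_LENGTH = 10             # assumption for the "truncate" filter
STOPWORDS = {"the", "a", "an"}   # hypothetical list for the "stopwords" filter


def lowercase_analyzer(value: str) -> Optional[str]:
    """Model of: keyword tokenizer -> truncate -> stopwords -> lowercase."""
    # keyword tokenizer: the whole input becomes a single token
    token = value
    # truncate: cut the token to the configured length
    token = token[:TRUNCATE_LENGTH]
    # stop filter: drops a token only when the entire token matches a stopword;
    # it runs before lowercase here, so the comparison is case-sensitive
    if token in STOPWORDS:
        return None
    # lowercase: applied last
    return token.lower()


for sample in ["The Quick Brown Fox", "the", "The"]:
    print(repr(sample), "->", repr(lowercase_analyzer(sample)))
```

Two things stand out in this model: because the keyword tokenizer emits one token per value, a stop filter can only remove a value that is a stopword in its entirety, and because it sits before `lowercase`, "The" is kept while "the" is dropped.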

I define a subfield with the custom analyzer as follows:

"normalized": {
    "analyzer": "lowercase_analyzer",
    "type": "text",
    "fielddata": true
},
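
For context, this is roughly how the two fragments fit into one create-index request body, written as a Python dict. It is a sketch under assumptions: the parent field name `title`, its `keyword` type, and the definition of the `stopwords` filter are hypothetical and not taken from my actual index.

```python
import json

index_body = {
    "settings": {
        "analysis": {
            "filter": {
                # Hypothetical definition of the custom "stopwords" filter
                "stopwords": {"type": "stop", "stopwords": "_english_"}
            },
            "analyzer": {
                "lowercase_analyzer": {
                    "type": "custom",
                    "tokenizer": "keyword",
                    "filter": ["truncate", "stopwords", "lowercase"],
                }
            },
        }
    },
    "mappings": {
        "properties": {
            # "title" and its keyword type are placeholders for the parent field
            "title": {
                "type": "keyword",
                "fields": {
                    "normalized": {
                        "type": "text",
                        "analyzer": "lowercase_analyzer",
                        "fielddata": True,
                    }
                },
            }
        }
    },
}

# Print the body as JSON, e.g. to paste into a PUT <index> request
print(json.dumps(index_body, indent=2))
```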

But I have doubts about using fielddata. Ideally, I would use doc_values for the aggregations, but that is not possible because the field type is text. How much will enabling fielddata hurt stability and performance? Does anyone know of another solution?

