Sorting on keyword field with accents

guilherme_maranhao · January 3, 2019, 3:30pm

Hi there,

My index has a keyword field whose values sometimes contain accented words, e.g. 'Águas Lindas', and sometimes don't, e.g. 'Aracaju'.

I'm using that field to sort my results. The problem is that, when sorting in ascending order, the documents whose value starts with an accented character are returned in the last positions, e.g.:

doc_1: {
  my_text_field.keyword: 'Aracaju'
},
doc_2: {
  my_text_field.keyword: 'Belo Horizonte'
},
doc_3: {
  my_text_field.keyword: 'Águas Lindas'
}
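
The order above is consistent with keyword values being compared by their encoded bytes rather than by language rules. A minimal Python sketch of that comparison, as an illustration only (it does not call Elasticsearch, and the variable names are made up):

```python
# Illustration only: mimics a byte-wise comparison of the three values above.
values = ["Belo Horizonte", "Águas Lindas", "Aracaju"]

# 'Á' (U+00C1) becomes the bytes 0xC3 0x81 in UTF-8, which compare higher
# than any ASCII letter, so it ends up after 'A...' and 'B...'.
byte_sorted = sorted(values, key=lambda v: v.encode("utf-8"))
print(byte_sorted)
```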

Is it possible to make Elasticsearch ignore accents when sorting, given that I can't set an analyzer with an asciifolding filter on my keyword field?

Or, for this use case, do I have to use a text field with an analyzer and fielddata=true? I was trying to avoid that because of the performance issues related to fielddata.

Does anybody know what the best solution is for me?

Thanks,

Guilherme

abdon · January 3, 2019, 3:43pm

You indeed cannot apply an analyzer to a keyword field, but you can apply a normalizer. Think of a normalizer as an analyzer for keyword fields (with some restrictions).

The example in our documentation shows exactly your use case: using a normalizer to apply ASCII folding to a keyword field, so you can use that keyword field for sorting.
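
As a rough sketch of what that could look like for the field in the question: the Python below builds an index-creation body with a custom normalizer on the keyword sub-field, then approximates the folding locally to show the resulting order. The normalizer name `folding_normalizer` and the helper `fold` are made up for illustration, the `lowercase` filter is optional, and the mapping is written in the typeless form (on 6.x the `properties` block would sit under a mapping type name). The local `fold` is only a stand-in for the real `asciifolding` filter, not the same implementation.

```python
import json
import unicodedata

# Request body for creating the index. "folding_normalizer" is a hypothetical
# name; "my_text_field" and its "keyword" sub-field come from the question.
index_body = {
    "settings": {
        "analysis": {
            "normalizer": {
                "folding_normalizer": {
                    "type": "custom",
                    "char_filter": [],
                    "filter": ["lowercase", "asciifolding"],
                }
            }
        }
    },
    "mappings": {
        "properties": {
            "my_text_field": {
                "type": "text",
                "fields": {
                    "keyword": {
                        "type": "keyword",
                        "normalizer": "folding_normalizer",
                    }
                },
            }
        }
    },
}


def fold(value: str) -> str:
    """Rough local approximation of lowercase + ASCII folding."""
    decomposed = unicodedata.normalize("NFKD", value)
    # Drop combining marks (the accents), then lowercase.
    return "".join(c for c in decomposed if not unicodedata.combining(c)).lower()


cities = ["Aracaju", "Belo Horizonte", "Águas Lindas"]
folded_order = sorted(cities, key=fold)

print(json.dumps(index_body, indent=2))
print(folded_order)
```

Sorting on `my_text_field.keyword` would then use the folded terms. Keep in mind that the normalizer is applied at index time, so existing documents would need to be reindexed into an index created with this mapping, and terms aggregations on the field would return the folded values.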

guilherme_maranhao · January 3, 2019, 4:00pm

Great, @abdon! It worked.

Thank you very much!

system · January 31, 2019, 4:00pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
