Sort and icu problem

Weiwei_Wang · June 16, 2011, 1:11pm

in order to sort on chinese, i used keworkd-tokenizer and icu_collator
filter, however, i also need to do facet on the same field, the
problem now comes that the facet result is very not friendly as i can
not read it.

besides i aslo want to search on this field, if i passed the result by
icu_collation to the query string, es will complains query parser
failing.

i want to know why localized sort not supported and keep the original
input not encoded by icu_collator?

rmuir · June 16, 2011, 1:46pm

On Thu, Jun 16, 2011 at 9:11 AM, Weiwei Wang ww.wang.cs@gmail.com wrote:

in order to sort on chinese, i used keworkd-tokenizer and icu_collator
filter, however, i also need to do facet on the same field, the
problem now comes that the facet result is very not friendly as i can
not read it.

besides i aslo want to search on this field, if i passed the result by
icu_collation to the query string, es will complains query parser
failing.

i want to know why localized sort not supported and keep the original
input not encoded by icu_collator?

Maybe ES needs to hide more from you, but at the low level you need 2 fields:

the original text for faceting (keywordtokenizer)
the sort key field for sorting (collated)

the sort key field is really not useful for anything but sorting. its
a binary sort key. this is the same way it works with all databases
too (they just hide the process from you)

Topic		Replies	Views
[Ann] ICU facet allows sorting based on ICU collations Elasticsearch	4	458	July 6, 2017
Icu_collation as keyword normalizer Elasticsearch	3	1123	March 13, 2017
ICU sorting of terms aggregation with multi-valued fields Elasticsearch runtime-fields	7	700	March 24, 2022
Issue with elasticsearch-analysis-icu plugin Elasticsearch	20	3513	June 27, 2017
Terms aggregation with ICU multi-field and arrays Elasticsearch runtime-fields	2	688	March 22, 2022

Sort and icu problem

Related topics