I would like to create an ingest pipeline (or a reindex-with-script task) that lowercases and ASCII-folds a string field in the document. I've found a lowercase processor, but I did not find any way of calling an ASCII-folding or ICU-folding token filter. However, there is a script processor, so I was wondering whether there is a way of calling a token filter from within such a script.
I don't think there is a way to call a token filter from an ingest script today. On a related note, there is a feature request that aims to make it possible to configure analyzers on keyword fields: https://github.com/elastic/elasticsearch/issues/18064.
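In the meantime, one possible workaround is to normalize the value on the client side before indexing. The following is only a rough sketch in Python, not the Lucene ASCII-folding filter: the function name `lowercase_and_fold` is hypothetical, and it approximates folding by decomposing characters with NFKD and dropping combining marks. Characters that have no Unicode decomposition (for example `ø` or `ß`) are left unchanged, so the result will differ from the real token filter for those.

```python
import unicodedata


def lowercase_and_fold(text: str) -> str:
    """Approximate lowercase + ASCII folding (hypothetical helper).

    This is NOT equivalent to Lucene's ASCII-folding filter; it only
    strips combining marks after Unicode decomposition.
    """
    # NFKD splits accented letters into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFKD", text)
    # Drop the combining marks, keeping only the base characters.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.lower()


if __name__ == "__main__":
    print(lowercase_and_fold("Crème Brûlée"))
```

You would apply this to the field in your indexing client before sending the document, which keeps the normalized value in `_source` just as an ingest processor would.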