I now the asciifolding filter docs are really very clear on this, but it
took me an embarrassingly long time to realise I was losing my currency
symbol (£) to the ASCII folding filter.
Other than creating my own character map with the char map filter, does
there exist something of production quality that would translate accented
UTF8 characters of the Latin-alphabet into non-accented characters in the
ASCII range?
Overall, the explanation language is a little hairy and you may need
to chase through the Unicode pages, but it should be the
production-ready approach in the end.
I now the asciifolding filter docs are really very clear on this, but it
took me an embarrassingly long time to realise I was losing my currency
symbol (£) to the ASCII folding filter.
Other than creating my own character map with the char map filter, does
there exist something of production quality that would translate accented
UTF8 characters of the Latin-alphabet into non-accented characters in the
ASCII range?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.