I'am currently try to use elastic to store a huge amount of text. I do not need UTF things I'am would be fine with ASCII. Also I would be fine if all chars are lower case because I dont care about uppercase and lowercase.
Since the amount of data are some TB these little things really matter
Is there a why to configure which charset elasticsearch is using internaly? To save storage and to boost searches?
Elasticsearch uses UTF8 only. I'm not sure that running with ASCII would change anything in term of storage or performance as the underlying data structures will be the same I believe.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.