Configure Charset used by Elasticsearch to minimize storage and boost searches

Hi all,

I'm currently trying to use Elasticsearch to store a huge amount of text. I don't need UTF-8; I would be fine with ASCII. I would also be fine if all characters were lowercase, because I don't care about the difference between uppercase and lowercase.

Since the amount of data is several TB, these little things really matter :slight_smile:

Is there a way to configure which charset Elasticsearch uses internally, to save storage and to speed up searches?

Welcome!

Elasticsearch uses UTF-8 only. I'm not sure that running with ASCII would change anything in terms of storage or performance, as I believe the underlying data structures would be the same.
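
One way to see why ASCII would probably not help with storage: UTF-8 encodes every ASCII character as the same single byte that ASCII uses, so ASCII-only text has the same size in both encodings. Here is a minimal Python sketch of that point. The helper name `encoded_sizes` and the sample string are made up for illustration, and this only compares raw encoded bytes; it says nothing about how Elasticsearch actually lays out or compresses data on disk.

```python
def encoded_sizes(text: str) -> tuple[int, int]:
    """Return (ascii_byte_count, utf8_byte_count) for ASCII-only text."""
    # Hypothetical helper for illustration only, not part of any Elasticsearch API.
    ascii_bytes = text.encode("ascii")  # raises UnicodeEncodeError on non-ASCII input
    utf8_bytes = text.encode("utf-8")
    return len(ascii_bytes), len(utf8_bytes)


sample = "some plain lowercase ascii text"
ascii_len, utf8_len = encoded_sizes(sample)

# Both counts are equal: one byte per character in either encoding.
print(ascii_len, utf8_len)

# The encoded byte strings are identical, not just the same length.
print(sample.encode("ascii") == sample.encode("utf-8"))
```

So for text that is already pure ASCII, switching the charset would not shrink the raw bytes at all.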
