Hi, I am still learning Elasticsearch by following the official documentation, and I have a question.

  • Under the title "Living in the world of Unicode" it says:

what’s the difference between é and é?

According to Elasticsearch,
the first one consists of the two bytes 0xC3 0xA9, and
the second one consists of three bytes, 0x65 0xCC 0x81.

How do the two characters actually differ?

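They look the same on screen but are encoded differently. The first is a single precomposed code point, U+00E9 (LATIN SMALL LETTER E WITH ACUTE), which UTF-8 encodes as the two bytes 0xC3 0xA9. The second is a plain "e" (U+0065, one byte 0x65) followed by U+0301 (COMBINING ACUTE ACCENT, two bytes 0xCC 0x81). Compared byte for byte, the two strings are not equal unless both are normalized to the same Unicode form first.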
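
To see this for yourself, here is a small sketch in Python (not Elasticsearch code, just the standard library's unicodedata module). The variable names and the utf8_hex helper are made up for illustration. It prints the UTF-8 bytes of each form and shows that normalizing to NFC or NFD makes the two strings compare equal.

```python
import unicodedata

# Two ways of writing the same visible character.
composed = "\u00e9"      # single code point: LATIN SMALL LETTER E WITH ACUTE
decomposed = "e\u0301"   # "e" followed by COMBINING ACUTE ACCENT


def utf8_hex(text):
    """Return the UTF-8 bytes of text as space-separated hex values."""
    return " ".join(f"0x{b:02X}" for b in text.encode("utf-8"))


print(utf8_hex(composed))      # 0xC3 0xA9
print(utf8_hex(decomposed))    # 0x65 0xCC 0x81

# A plain comparison sees two different strings.
print(composed == decomposed)  # False

# After normalization to the same form they are identical.
nfc_equal = unicodedata.normalize("NFC", decomposed) == composed
nfd_equal = unicodedata.normalize("NFD", composed) == decomposed
print(nfc_equal, nfd_equal)    # True True
```

This is why text is usually normalized to a single form before it is indexed and searched. As far as I understand, Elasticsearch handles this at analysis time rather than in application code, so treat the snippet only as a demonstration of the idea.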

