Unicode characters and spaces in elasticsearch field names

Hello everybody,

I've researched a little bit what characters are allowed in elasticsearch
field names.
However, I couldn't find any official documentation only some posts which
mentioned that '.', '#' and '*' are discouraged.
See http://elasticsearch-users.115913.n3.nabble.com/Illegal-characters-in-elasticsearch-field-names-td4054773.html.

I've indexed some fields which contained spaces and unicode
characters with elasticsearch 1.4.2 ("lucene_version": "4.10.2"). I was
able to retrieve the documents with
term query without any problems.

My question would be, are there any pitfalls when using unicode characters
and spaces in elasticsearch field names? or is this discouraged?

Many thanks,
George

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/0c2a3b7b-0b3c-47e3-b149-ecad7d1e9a30%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Does anybody have an idea at least where in the elasticsearch code this is
handled?

Thanks!

On Friday, January 16, 2015 at 9:21:34 AM UTC+1, George wrote:

Hello everybody,

I've researched a little bit what characters are allowed in elasticsearch
field names.
However, I couldn't find any official documentation only some posts which
mentioned that '.', '#' and '*' are discouraged. See
http://elasticsearch-users.115913.n3.nabble.com/Illegal-characters-in-elasticsearch-field-names-td4054773.html
.

I've indexed some fields which contained spaces and unicode
characters with elasticsearch 1.4.2 ("lucene_version": "4.10.2"). I was
able to retrieve the documents with
term query without any problems.

My question would be, are there any pitfalls when using unicode characters
and spaces in elasticsearch field names? or is this discouraged?

Many thanks,
George

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/d4a24d5a-c932-45c9-81d8-59323dd767bf%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

You can find this in the source code.

E.g.

org.elasticsearch.index.mapper.ContentPath -> see delimiter variable, it is
'.' by default

org.elasticsearch.index.mapper.Uid -> see DELIMITER, it is set to '#'

and for '*'

org.elasticsearch.index.mapper.FieldMappersLookup and
org.elasticsearch.index.mapper.object.DynamicTemplate -> are using
org.elasticsearch.common.reges.Regex.simpleMatch on field names, a
simplified regex routine which supports abc, abc, and abc* patterns

Jörg

On Sun, Jan 18, 2015 at 10:36 PM, George george.pucea@gmail.com wrote:

Does anybody have an idea at least where in the elasticsearch code this is
handled?

Thanks!

On Friday, January 16, 2015 at 9:21:34 AM UTC+1, George wrote:

Hello everybody,

I've researched a little bit what characters are allowed in elasticsearch
field names.
However, I couldn't find any official documentation only some posts which
mentioned that '.', '#' and '*' are discouraged. See
http://elasticsearch-users.115913.n3.nabble.com/Illegal-characters-in-
elasticsearch-field-names-td4054773.html.

I've indexed some fields which contained spaces and unicode
characters with elasticsearch 1.4.2 ("lucene_version": "4.10.2"). I was
able to retrieve the documents with
term query without any problems.

My question would be, are there any pitfalls when using unicode
characters and spaces in elasticsearch field names? or is this discouraged?

Many thanks,
George

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/d4a24d5a-c932-45c9-81d8-59323dd767bf%40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/d4a24d5a-c932-45c9-81d8-59323dd767bf%40googlegroups.com?utm_medium=email&utm_source=footer
.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAKdsXoEQJJSBEjAg0a-hSny%3DbmM-oD5Rp1OBHmNXUs-_-SoSaA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.