Unable to escape special characters for a query

JP_Toto · April 9, 2012, 8:40pm

Hi all!

Sorry if this has been covered. I tried to do my searching first.

I have the query:

{"query":{"bool":{"must":[{"term":{"SID":"3962"}},{"term":{"BI":"True"}},{"query_string":{"default_field":"TA","query":"order-staging-541"}},{"term":{"TY":"2"}}]}}}

TA is a string field but when I search on it my query will also return any
results that look like the TA contents with other special characters.

So, for example, "order-staging-541" also matches "order staging-541" and
"order(staging-541".

I realize these are special characters but when I try to escape them as
such:

{"query":{"bool":{"must":[{"term":{"SID":"3962"}},{"term":{"BI":"True"}},{"query_string":{"default_field":"TA","query":"order-staging-541"}},{"term":{"TY":"2"}}]}}}

It throws a syntax error. I am trying to follow the Lucene special
characters escaping protocol laid out here
http://lucene.apache.org/core/old_versioned_docs/versions/3_0_0/queryparsersyntax.html at
the bottom but I'm still receiving syntax errors.

Any help would be greatly appreciated!!

atlaste · April 10, 2012, 6:34am

Funny enough I just queried on t-mobile and it seems to work:
{"query":{"query_string":{"default_field":"text","query":"t\-mobile"}}}

The reason is because json also uses escape characters. Combined with your link I guess this is correct.

Kind regards,
Stefan.

Igor_Motov · April 10, 2012, 8:28pm

By default, string fields are analyzed in elasticasearch. It means that
"order-staging-541" is indexed as 3 terms "order", "staging" and "541". You
query is also analyzed, and it's also converted into 3 terms "order",
"staging" and "541" no matter which special characters are in between.
That's why they are matching. If you want to search this field exactly as
it is, you should reindex it as "index":"not_analyzed".

On Monday, April 9, 2012 4:40:23 PM UTC-4, JP Toto wrote:

Hi all!

Sorry if this has been covered. I tried to do my searching first.

I have the query:

{"query":{"bool":{"must":[{"term":{"SID":"3962"}},{"term":{"BI":"True"}},{"query_string":{"default_field":"TA","query":"order-staging-541"}},{"term":{"TY":"2"}}]}}}

TA is a string field but when I search on it my query will also return any
results that look like the TA contents with other special characters.

So, for example, "order-staging-541" also matches "order staging-541" and
"order(staging-541".

I realize these are special characters but when I try to escape them as
such:

{"query":{"bool":{"must":[{"term":{"SID":"3962"}},{"term":{"BI":"True"}},{"query_string":{"default_field":"TA","query":"order-staging-541"}},{"term":{"TY":"2"}}]}}}

It throws a syntax error. I am trying to follow the Lucene special
characters escaping protocol laid out here
Apache Lucene - Query Parser Syntax at
the bottom but I'm still receiving syntax errors.

Any help would be greatly appreciated!!

JP_Toto · April 10, 2012, 11:11pm

Thanks, Igor! I came to that realization today. Got it fixed. I appreciate
the feedback

On Tue, Apr 10, 2012 at 4:28 PM, Igor Motov imotov@gmail.com wrote:

By default, string fields are analyzed in elasticasearch. It means that
"order-staging-541" is indexed as 3 terms "order", "staging" and "541". You
query is also analyzed, and it's also converted into 3 terms "order",
"staging" and "541" no matter which special characters are in between.
That's why they are matching. If you want to search this field exactly as
it is, you should reindex it as "index":"not_analyzed".

On Monday, April 9, 2012 4:40:23 PM UTC-4, JP Toto wrote:

Hi all!

Sorry if this has been covered. I tried to do my searching first.

I have the query:

{"query":{"bool":{"must":[{"term":{"SID":"3962"}},{"term":
{"BI":"True"}},{"query_string":{"default_field":"TA","query"
:"order-staging-541"}},{"term"**:{"TY":"2"}}]}}}

TA is a string field but when I search on it my query will also return
any results that look like the TA contents with other special characters.

So, for example, "order-staging-541" also matches "order staging-541" and
"order(staging-541".

I realize these are special characters but when I try to escape them as
such:

{"query":{"bool":{"must":[{"term":{"SID":"3962"}},{"term":
{"BI":"True"}},{"query_string":{"default_field":"TA","query"
:"order-staging-541"}},{"**term":{"TY":"2"}}]}}}

It throws a syntax error. I am trying to follow the Lucene special
characters escaping protocol laid out here http://lucene.apache.org/**
core/old_versioned_docs/**versions/3_0_0/**queryparsersyntax.htmlhttp://lucene.apache.org/core/old_versioned_docs/versions/3_0_0/queryparsersyntax.html at
the bottom but I'm still receiving syntax errors.

Any help would be greatly appreciated!!

--
JP Toto | james.p.toto@gmail.com | JP Toto | about.me

Topic		Replies	Views
Cannot escape special characters in query using Java API Elasticsearch	5	6608	July 6, 2017
How to properly escape special characters? Elasticsearch	4	14852	July 6, 2017
Escaping Special Characters in Wildcard Query Elasticsearch	6	136960	July 6, 2017
Unable to escape special character using Java REST API Elasticsearch language-clients	4	609	March 1, 2023
Querystring search on special characters Elasticsearch	3	1164	August 23, 2019

Unable to escape special characters for a query

Related topics