Wildcard datatype lowercase

covolution · August 20, 2020, 8:14am

I am looking at the Wildcard datatype introduced in 7.9.

I would like to normalize it first by lowercasing and removing whitespace. I can't work out the syntax.

"normalizer" and "analyzer" don't seem to be supported.

Mark_Harwood · August 20, 2020, 9:00am

Hi Gethin,

Correct, they're not.
The ngram index wildcard field uses behind the scenes lower-cases already but the verification phase is currently case sensitive. A regex query can be made case insensitive using [Ff][Oo][Oo] type syntax (which many existing systems use today ). Before long we hope to add a case insensitive flag to queries which will allow you to search with a flag eg /Foo/i and this will produce the same accelerated query as the [Ff][Oo][Oo] syntax you have to use today. However, given people query across legacy indexes and new ones I expect we'll see [Ff][Oo][Oo] queries in use for some time to come as people query across old keyword field and new wildcard fields with one common search.

This would need to be removed as part of an ingest pipeline or could be ignored at query time using a suitable regex.

system · September 17, 2020, 9:00am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Template wildcard data type, normalizer/analyzer for case insensitive search Elasticsearch	2	882	September 17, 2020
Whitespace tokenizer doesn't allow lowercase search? Elasticsearch	2	2992	October 4, 2017
Keyword with lowercase wildcard search Elasticsearch	2	421	August 2, 2019
Wildcard search case insensitive Elasticsearch	8	17825	March 31, 2018
WIldcard case insensitive query string Elasticsearch	6	22003	April 20, 2017

Wildcard datatype lowercase

Related topics