Relevation on wildcard results and wildcard speed

Michael · May 24, 2011, 6:32pm

Hi guys!
I want to do something like this.
When I search for example on field first_name by wildcard "A" I want
to see my result sortable like this

Andrew
Frank
Michael
Brenda

Can I do this by the naitive tools of elastic search?

And second question

I have some not_analyzed string fields, and I try to search by this
fields on the 500 000 items on ES, and I have 2 second on my computer
that is not realy fast
What I need to do, to make it faster?

kimchy · May 24, 2011, 8:25pm

wildcard searches are going to be slow, especially when doing leading wildcards. You have several options:

Use text_phrase_prefix query (new in 0.16.1), where you can control the number of max_expansions to reduce the time it takes to query.
Use ngrams to try and provide different type of suggestions. That will need to be done by creating an analyzer for that, and defining it in the mappings. And then do a simple text query (both index text and search text will be analyzed and "ngrammed").
On Tuesday, May 24, 2011 at 9:32 PM, Michael wrote:

Hi guys!
I want to do something like this.
When I search for example on field first_name by wildcard "A" I want
to see my result sortable like this

Andrew
Frank
Michael
Brenda

Can I do this by the naitive tools of Elasticsearch?

And second question

I have some not_analyzed string fields, and I try to search by this
fields on the 500 000 items on ES, and I have 2 second on my computer
that is not realy fast
What I need to do, to make it faster?

Andrew_Degtiariov · May 27, 2011, 1:30pm

On Tue, May 24, 2011 at 11:25 PM, Shay Banon
shay.banon@elasticsearch.comwrote:

wildcard searches are going to be slow, especially when doing leading
wildcards. You have several options:

Use text_phrase_prefix query (new in 0.16.1), where you can control the
number of max_expansions to reduce the time it takes to query.

Use ngrams to try and provide different type of suggestions. That will
need to be done by creating an analyzer for that, and defining it in the
mappings. And then do a simple text query (both index text and search text
will be analyzed and "ngrammed").

Does text_phrase_prefix query works on not analyzed fields? My tests show it
work like prefix when field is not analyzed:

For example:
The query {'query': {'text_phrase_prefix': {'fields.name': {'query': 'yuriy
os' } } } matched document with fields.name 'yuriy ostapyuk'

But the query {'query': {'text_phrase_prefix': {'fields.name': {'query':
'os' } } } doesn't match any documents.

--
Andrew Degtyarev
DA-RIPE

kimchy · May 27, 2011, 1:32pm

Yes, it works on non analyzed fields, but then, the entire content is checked against as a single term. so you need to provide a full prefix. If you want the text to be "broken" down into more granular terms, then you need to have it analyzed.

On Friday, May 27, 2011 at 4:30 PM, Andrew Degtyarev wrote:

On Tue, May 24, 2011 at 11:25 PM, Shay Banon <shay.banon@elasticsearch.com (mailto:shay.banon@elasticsearch.com)> wrote:

wildcard searches are going to be slow, especially when doing leading wildcards. You have several options:

Use text_phrase_prefix query (new in 0.16.1), where you can control the number of max_expansions to reduce the time it takes to query.

Use ngrams to try and provide different type of suggestions. That will need to be done by creating an analyzer for that, and defining it in the mappings. And then do a simple text query (both index text and search text will be analyzed and "ngrammed").

Does text_phrase_prefix query works on not analyzed fields? My tests show it work like prefix when field is not analyzed:

For example:
The query {'query': {'text_phrase_prefix': {'fields.name (http://fields.name)': {'query': 'yuriy os' } } } matched document with fields.name (http://fields.name) 'yuriy ostapyuk'

But the query {'query': {'text_phrase_prefix': {'fields.name (http://fields.name)': {'query': 'os' } } } doesn't match any documents.

--
Andrew Degtyarev
DA-RIPE

Andrew_Degtiariov · May 27, 2011, 2:20pm

On Fri, May 27, 2011 at 4:32 PM, Shay Banon shay.banon@elasticsearch.comwrote:

Yes, it works on non analyzed fields, but then, the entire content is
checked against as a single term. so you need to provide a full prefix. If
you want the text to be "broken" down into more granular terms, then you
need to have it analyzed.

Hm...
Don't understand. Could you explain about "full prefix"?

On Friday, May 27, 2011 at 4:30 PM, Andrew Degtyarev wrote:

On Tue, May 24, 2011 at 11:25 PM, Shay Banon <shay.banon@elasticsearch.com

wrote:

wildcard searches are going to be slow, especially when doing leading
wildcards. You have several options:

Use text_phrase_prefix query (new in 0.16.1), where you can control the
number of max_expansions to reduce the time it takes to query.

Use ngrams to try and provide different type of suggestions. That will
need to be done by creating an analyzer for that, and defining it in the
mappings. And then do a simple text query (both index text and search text
will be analyzed and "ngrammed").

Does text_phrase_prefix query works on not analyzed fields? My tests show
it work like prefix when field is not analyzed:

For example:
The query {'query': {'text_phrase_prefix': {'fields.name': {'query':
'yuriy os' } } } matched document with fields.name 'yuriy ostapyuk'

But the query {'query': {'text_phrase_prefix': {'fields.name': {'query':
'os' } } } doesn't match any documents.

--
Andrew Degtyarev
DA-RIPE

--
Andrew Degtyarev
DA-RIPE

kimchy · May 28, 2011, 9:33am

Since the entire content of the field is a single token (not indexed), then a prefix will only work on all of it. Exactly the behavior you saw.

On Friday, May 27, 2011 at 5:20 PM, Andrew Degtyarev wrote:

On Fri, May 27, 2011 at 4:32 PM, Shay Banon <shay.banon@elasticsearch.com (mailto:shay.banon@elasticsearch.com)> wrote:

Yes, it works on non analyzed fields, but then, the entire content is checked against as a single term. so you need to provide a full prefix. If you want the text to be "broken" down into more granular terms, then you need to have it analyzed.

Hm...
Don't understand. Could you explain about "full prefix"?

On Friday, May 27, 2011 at 4:30 PM, Andrew Degtyarev wrote:

On Tue, May 24, 2011 at 11:25 PM, Shay Banon <shay.banon@elasticsearch.com (mailto:shay.banon@elasticsearch.com)> wrote:

wildcard searches are going to be slow, especially when doing leading wildcards. You have several options:

Use text_phrase_prefix query (new in 0.16.1), where you can control the number of max_expansions to reduce the time it takes to query.

Use ngrams to try and provide different type of suggestions. That will need to be done by creating an analyzer for that, and defining it in the mappings. And then do a simple text query (both index text and search text will be analyzed and "ngrammed").

Does text_phrase_prefix query works on not analyzed fields? My tests show it work like prefix when field is not analyzed:

For example:
The query {'query': {'text_phrase_prefix': {'fields.name (http://fields.name)': {'query': 'yuriy os' } } } matched document with fields.name (http://fields.name) 'yuriy ostapyuk'

But the query {'query': {'text_phrase_prefix': {'fields.name (http://fields.name)': {'query': 'os' } } } doesn't match any documents.

--
Andrew Degtyarev
DA-RIPE

--
Andrew Degtyarev
DA-RIPE

Topic		Replies	Views
Elasticsearch Wildcard fieldtype has slow performance for wildcard queries Elasticsearch	5	2978	January 26, 2021
Slow Query Performance Elasticsearch	2	74	October 21, 2024
Relevation on wildcard results Elasticsearch	1	236	July 6, 2017
Leading wildcard search handling Elasticsearch	3	4329	May 2, 2017
Performance of filtered wildcard queries Elasticsearch	2	2705	June 29, 2018

Relevation on wildcard results and wildcard speed

Related topics