Hi
Lets say I have indexed a field person_name as a string, with a custom
analyzer. person_name is stored int the index in one of the documents as:
"Harry Greenberg"
I make a match query on the field : "harry g"
I have a custom edgengram tokenizer which breaks the query down as follows:
{
"tokens": [
{
"token": "h",
"start_offset": 0,
"end_offset": 1,
"type": "word",
"position": 1
},
{
"token": "ha",
"start_offset": 0,
"end_offset": 2,
"type": "word",
"position": 2
},
{
"token": "har",
"start_offset": 0,
"end_offset": 3,
"type": "word",
"position": 3
},
{
"token": "harr",
"start_offset": 0,
"end_offset": 4,
"type": "word",
"position": 4
},
{
"token": "harry",
"start_offset": 0,
"end_offset": 5,
"type": "word",
"position": 5
},
{
"token": "g",
"start_offset": 6,
"end_offset": 7,
"type": "word",
"position": 6
}
]
}
Will all of these tokens be matched agains "Harry Greenberg" or person_name
will also be broken down as defined by my custom analyzer?
If not, how can I make it so that it will also be broken down? Will it make
the search significantly slower?
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/af5354e7-5f7b-4b6e-96e6-f5e81df825db%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.