Searching Human Names in Elasticsearch


(Nathan Smith) #1

Hi,

I'm currently building a replacement search service. It searches Scottish
human names, and it's essential that the data being returned is as close to
the current solution as possible.

Imagine this -

I put in a search for "Michael Heaney". I can either use a bool query across
separate forename and surname fields, or use a single concatenated
forename/surname field.

The current search solution returns all matching "Michael Heaney" records but
also a few "Michael Heavey" ones. How would I best approach this, not only for
this case but to get similar names returned in my results in general?

I have tried a bool fuzzy query over the two fields; however, I'm finding it
difficult to get appropriate results.
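
For illustration, a bool query of the kind described above could be built
roughly like this. This is only a sketch: the field names "forename" and
"surname" and the helper build_name_query are assumptions for the example,
not the real mapping, and the right fuzziness value (and whether the match
query accepts it) may depend on the Elasticsearch version in use.

```python
import json


def build_name_query(forename, surname, fuzziness=1):
    """Build a bool query requiring both name parts, each matched fuzzily.

    The field names "forename" and "surname" are hypothetical.
    """

    def fuzzy_match(field, text):
        # A match query with a fuzziness setting tolerates small edit
        # differences, e.g. "Heaney" vs "Heavey" (one substituted letter).
        return {"match": {field: {"query": text, "fuzziness": fuzziness}}}

    return {
        "query": {
            "bool": {
                "must": [
                    fuzzy_match("forename", forename),
                    fuzzy_match("surname", surname),
                ]
            }
        }
    }


body = build_name_query("Michael", "Heaney")
print(json.dumps(body, indent=2))
```

The printed JSON is the request body that would be sent to the search
endpoint. Keeping the two fields separate means the fuzziness applies to
each name part on its own rather than to the concatenated string.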

Thanks, Nathan.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/a3ca18d9-5ba2-45f0-acd1-23306e30e1d6%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(Itamar Syn-Hershko) #2

Try phonetic search: Soundex and the like. I do hope it works for Scottish
names as well :)


(Nathan Smith) #3

Thanks for your reply. I just realised I didn't mention it, but I am already
using the phonetic plugin for Elasticsearch, currently with Metaphone.

"The metaphone code for Heavey is HF.

The metaphone code for Heaney is HN.

These surnames do not have the same metaphone code."

The codes are different, and the same is true for Soundex. I think the query
is what needs adjusting.
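
As a rough check of the point above, here is a small Python sketch. It is not
the phonetic plugin's code: soundex below is a plain implementation of classic
American Soundex, and edit_distance is an ordinary Levenshtein distance; both
function names are just for this example. It shows the two surnames getting
different Soundex keys while being only one edit apart as raw strings.

```python
# Classic Soundex digit groups; vowels, Y, H and W carry no digit.
CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(name):
    """Return the 4-character American Soundex key for a name."""
    letters = [c for c in name.upper() if c.isalpha()]
    if not letters:
        return ""
    first = letters[0]
    digits = []
    prev = CODES.get(first, "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same code
        code = CODES.get(ch, "")
        if code and code != prev:
            digits.append(code)
        prev = code  # a vowel resets prev, so a repeated code counts again
    return (first + "".join(digits) + "000")[:4]


def edit_distance(a, b):
    """Levenshtein distance: insertions, deletions and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # delete from a
                           cur[j - 1] + 1,              # insert into a
                           prev[j - 1] + (ca != cb)))   # substitute
        prev = cur
    return prev[-1]


print(soundex("Heaney"), soundex("Heavey"))   # H500 H100
print(edit_distance("heaney", "heavey"))      # 1
```

Since the phonetic keys differ but the edit distance is 1, an exact match on
the phonetic field alone cannot bring back "Heavey" for "Heaney"; some
edit-distance tolerance on the surname itself would be needed for that.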

Does anyone else have any suggestions?


