Identify word as dominant word in search


(rotem123) #1

hi,
Is there a search in elasticsearch that support the behavior that when a
user enter a string to search, the ES recognize words as important words in
search.
for example: the user enter the string NEXUS COVER FOR EVERY DAY USE SILK
SOFT BLUE, and I want that the brand(NEXUS) and color(BLUE) will be more
dominant in search, and I have a list of all the colors and all the brand
that exist.

is it possible? and if yes, how do I implement that?

thank you!

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/2948b28d-6c1e-490d-bdbb-80df5d7b0ebd%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(Itamar Syn-Hershko) #2

Depending on your corpus, this should happen automatically. That's what
TF/IDF is about.

What you can do further is use NLP methods to tag those items in search and
indexing. Look up POS tagging and entity extraction.

--

Itamar Syn-Hershko
http://code972.com | @synhershko https://twitter.com/synhershko
Freelance Developer & Consultant
Author of RavenDB in Action http://manning.com/synhershko/

On Tue, Jun 3, 2014 at 12:22 PM, Rotem Haber haber.rotem@gmail.com wrote:

hi,
Is there a search in elasticsearch that support the behavior that when a
user enter a string to search, the ES recognize words as important words in
search.
for example: the user enter the string NEXUS COVER FOR EVERY DAY USE SILK
SOFT BLUE, and I want that the brand(NEXUS) and color(BLUE) will be more
dominant in search, and I have a list of all the colors and all the brand
that exist.

is it possible? and if yes, how do I implement that?

thank you!

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/2948b28d-6c1e-490d-bdbb-80df5d7b0ebd%40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/2948b28d-6c1e-490d-bdbb-80df5d7b0ebd%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAHTr4ZsWggX9KP_fd75Qbfgk4uph9VNMbXUyaQQ3obMresdVyA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


(Ivan Brusic) #3

I agree with Itamar. It sounds like you do have a list of colors and brands
(tagging), so you can add a boost value as a payload to the relevant terms.
You can use these payloads with a function score script or a custom
similarity. Not an easy solution. If you can maintain a mapping of values
in Elasticsearch (via a plugin), you can bypass the payload and lookup the
terms yourself. Once again, not easy.

http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/analysis-delimited-payload-tokenfilter.html

--
Ivan

On Tue, Jun 3, 2014 at 3:00 AM, Itamar Syn-Hershko itamar@code972.com
wrote:

Depending on your corpus, this should happen automatically. That's what
TF/IDF is about.

What you can do further is use NLP methods to tag those items in search
and indexing. Look up POS tagging and entity extraction.

--

Itamar Syn-Hershko
http://code972.com | @synhershko https://twitter.com/synhershko
Freelance Developer & Consultant
Author of RavenDB in Action http://manning.com/synhershko/

On Tue, Jun 3, 2014 at 12:22 PM, Rotem Haber haber.rotem@gmail.com
wrote:

hi,
Is there a search in elasticsearch that support the behavior that when a
user enter a string to search, the ES recognize words as important words in
search.
for example: the user enter the string NEXUS COVER FOR EVERY DAY USE SILK
SOFT BLUE, and I want that the brand(NEXUS) and color(BLUE) will be more
dominant in search, and I have a list of all the colors and all the brand
that exist.

is it possible? and if yes, how do I implement that?

thank you!

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/2948b28d-6c1e-490d-bdbb-80df5d7b0ebd%40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/2948b28d-6c1e-490d-bdbb-80df5d7b0ebd%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAHTr4ZsWggX9KP_fd75Qbfgk4uph9VNMbXUyaQQ3obMresdVyA%40mail.gmail.com
https://groups.google.com/d/msgid/elasticsearch/CAHTr4ZsWggX9KP_fd75Qbfgk4uph9VNMbXUyaQQ3obMresdVyA%40mail.gmail.com?utm_medium=email&utm_source=footer
.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CALY%3DcQAhjj%2BetB%2BNVNHtY6bvvA1de29jbOO2csmnEU%2B67Jxh-w%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


(system) #4