Only count matches once


(David Squires) #1

Hello,

I've tried a few different analyzers thinking that the stop words were
causing the results to be poor, but now it seems that it's a result of the
same token being matched multiple times, being given more weight.

For example a search for "this is the end", returns the following results:

  1. TutsPlus - End To End Analytics
  2. This Is The End (2013) Movie HD

As you can see, TutsPlus is pretty unrelated to what a user would be
searching for. I guess it's just because 'End' is found twice.

Is there anyway to only give weight to a single instance of a term or could
you recommend what I should look at to provide a more relevant result?
Really, the TutsPlus should not even be returned, or maybe way down the
line.

Thank you!

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(Jörg Prante) #2

Can you give the query you used?

Jörg

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(Nik Everett) #3

You could use a boost query or a rescore to match the phrase the user searched.

Sent from my iPhone

On Oct 13, 2013, at 8:16 PM, David Squires dave@bluetopmedia.com wrote:

Hello,

I've tried a few different analyzers thinking that the stop words were causing the results to be poor, but now it seems that it's a result of the same token being matched multiple times, being given more weight.

For example a search for "this is the end", returns the following results:

  1. TutsPlus - End To End Analytics
  2. This Is The End (2013) Movie HD

As you can see, TutsPlus is pretty unrelated to what a user would be searching for. I guess it's just because 'End' is found twice.

Is there anyway to only give weight to a single instance of a term or could you recommend what I should look at to provide a more relevant result? Really, the TutsPlus should not even be returned, or maybe way down the line.

Thank you!

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(David Squires) #4

I'm just using a standard query, I've tried messing filters and match, but
nothing that I played with seemed to get it...

{
"query": {
"name" : "this is the end"
}
}

On Sunday, October 13, 2013 8:26:51 PM UTC-4, Nikolas Everett wrote:

You could use a boost query or a rescore to match the phrase the user
searched.

Sent from my iPhone

On Oct 13, 2013, at 8:16 PM, David Squires <da...@bluetopmedia.com<javascript:>>
wrote:

Hello,

I've tried a few different analyzers thinking that the stop words were
causing the results to be poor, but now it seems that it's a result of the
same token being matched multiple times, being given more weight.

For example a search for "this is the end", returns the following results:

  1. TutsPlus - End To End Analytics
  2. This Is The End (2013) Movie HD

As you can see, TutsPlus is pretty unrelated to what a user would be
searching for. I guess it's just because 'End' is found twice.

Is there anyway to only give weight to a single instance of a term or
could you recommend what I should look at to provide a more relevant
result? Really, the TutsPlus should not even be returned, or maybe way down
the line.

Thank you!

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearc...@googlegroups.com <javascript:>.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(system) #5