Any way to ensure diversity in search results?


(Andy-2) #1

My documents have a field "category".

When a user conducts a search, I want to make sure that the results
are diverse and aren't dominated by 1 category. Say I want to make
sure that category "A" doesn't have more than 3 results in the top 10.

Is there any way to do that? Thanks.


(Shay Banon) #2

No built-in support. The workaround is to ask for more docs than you are going to present, then filter the extras out on the client side.
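
A minimal sketch of that workaround, assuming the over-fetched hits arrive as a list of dicts in rank order, each with a "category" key. The function name `diversify` and its parameters are made up for illustration and are not part of any Elasticsearch client.

```python
from collections import Counter


def diversify(hits, page_size=10, max_per_category=3, key="category"):
    """Walk hits in rank order and skip any hit whose category is already at its cap.

    `hits` is assumed to be an over-fetched result list (e.g. 50 docs for a
    page of 10), already sorted by relevance.
    """
    counts = Counter()
    page = []
    for hit in hits:
        category = hit.get(key)
        if counts[category] >= max_per_category:
            continue  # this category already has its share of the page
        counts[category] += 1
        page.append(hit)
        if len(page) == page_size:
            break
    return page
```

Note that this caps every category, not just "A", and that the page can come back short if the over-fetched list doesn't contain enough docs from other categories. Fetching a larger multiple of the page size makes that less likely.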

On Saturday, July 2, 2011 at 12:39 AM, Andy wrote:

My documents have a field "category".

When a user conducts a search, I want to make sure that the results
are diverse and aren't dominated by 1 category. Say I want to make
sure that category "A" doesn't have more than 3 results in the top 10.

Is there any way to do that? Thanks.


(Paul Loy) #3

In our project we used a custom script to score docs after filtering
(the score was actually random). That helped us give 'fresh' results.
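
The script itself isn't shown here, so the following is only a rough sketch of the same idea and not what our project ran. It assumes a later Elasticsearch release that has the `function_score` query with `random_score`; the helper name `random_order_query` and the choice of `_seq_no` as the random source field are assumptions.

```python
def random_order_query(filter_clause, seed):
    """Build a search body that filters docs, then replaces their score with a random one.

    Assumes the `function_score` query with `random_score`, which was added
    to Elasticsearch after this thread was written.
    """
    return {
        "query": {
            "function_score": {
                # Filter first; filter context does not contribute to the score.
                "query": {"bool": {"filter": [filter_clause]}},
                # Same seed gives the same ordering, so paging stays stable.
                "random_score": {"seed": seed, "field": "_seq_no"},
                # Use the random value as the final score.
                "boost_mode": "replace",
            }
        }
    }


body = random_order_query({"term": {"category": "A"}}, seed=42)
```

Changing the seed per user or per session gives each a different but repeatable ordering.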




(Otis Gospodnetić) #4

Hi,

I don't think there is anything built into ES for that. We've done this
before for Solr, though. It was a fun project with pluggable
diversification of search results. Please get in touch if you want us
to develop something like that for you for Elasticsearch.

Otis



