How to design schema for boosting/ranking logic?

abhinavkulkarni · September 14, 2021, 6:38am

Hi,

Let us say, I am indexing a bunch of podcasts in my Elasticsearch cluster:

    Podcast:
      - _id (keyword)
      - email (keyword)
      - webLink (keyword)
      - rssLink (keyword)
      - shortDescription (text)
      - longDescription (text)
      - artistIds (array of integers)
      - imageLink (keyword)
      - numEpisodes (integer)

I want to submit queries to select podcasts for a text query and optionally boost score based on presence of certain fields. For e.g., I'd like to boost scores if a podcast has a link, short Description or image.

For a faster execution, should I have hasWebLink, hasShortDescription and hasImageLink fields or exists clause for these fields.

I am wondering if having separate fields and setting index=True for those would result in faster execution.

Thanks

spinscale · September 14, 2021, 7:59am

exists clause in the should part of a boolean query sounds like a good way to do this. No need for further tuning.

abhinavkulkarni · September 14, 2021, 8:37am

Thanks @spinscale for the reply.

As you mentioned, I am currently including the exists conditions with score boosting in a should clause.

According to the documentation here, a filter bitset is cached for potential reuse. If I do not explicitly define hasXYZ fields and an index on them, do my queries benefit from such filter bitset caching?

Thanks!

spinscale · September 14, 2021, 9:39am

Those bitsets are only used for a filter context, however in your case the exists query is part of the scoring.

system · October 12, 2021, 9:40am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Boost query by another field Elasticsearch	6	414	July 6, 2017
Does there exists an exists query Elasticsearch	3	464	July 6, 2017
Understanding how to use the indexed _boost field Elasticsearch	5	347	July 6, 2017
Boosting slows down query Elasticsearch	1	268	July 6, 2017
Boosting queries by type Elasticsearch	5	312	July 6, 2017

How to design schema for boosting/ranking logic?

Related topics