Querying an alias throws off scoring completely?

pudo · January 19, 2024, 8:28am

Hey all!

I’m seeing some really weird behaviour around index aliases, maybe I’m doing something conceptually dumb.

We have two indexes of very different size (example: 4mn docs in entities-a and 2(!) docs in entities-b ), which both point to an alias, entities.

Now when I query entities for a query that perfectly matches one of the 2 docs in entities-b, I get junk results from entities-a instead. If I remove entities-a from the alias, the query returns the doc from entities-b properly.

The sense I’m getting is that the imbalance of the indexes completely throws off relevancy scoring - is that likely? Is there any way to address it?

yago82 · January 19, 2024, 8:55am

Hi,

If you want to prioritize results from entities-b over entities-a , you can use the indices_boost parameter in your search query.

Regards

pudo · January 19, 2024, 9:03am

Thanks for that hint! While that may be a possible work-around, I would like to understand the problem a bit more before hacking it. The thing is that the match in entities-b is a perfect result (unique match on a boosted keyword field), so it should come out on top even if both indexes are queried...

Christian_Dahlqvist · January 19, 2024, 9:05am

Have you tried setting the search_type query parameter to dfs_query_then_fetch? Does that make any difference?

pudo · January 19, 2024, 9:08am

That looks to have made a difference!! My sample queries are coming back with the correct doc now. Is this safe to do? Does it point to an underlying issue, or is it just a little price we'll have to pay for this whacky setup?

Christian_Dahlqvist · January 19, 2024, 9:18am

Yes. It is a different query mode especially designed for better handling shards with different sizes and distributions of data.

No, not really. Have a look at the docs and the links therein to distributes term frequences and relevancy scoring.

pudo · January 19, 2024, 9:20am

Thank you very much for your competent help, @Christian_Dahlqvist! Appreciate you taking the time.

system · February 16, 2024, 9:21am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Baselining relevancy scoring by creating one massiveindex instead of separate indices Elasticsearch	5	827	July 5, 2017
Is consistent scoring across 2 documents that match either 1 of 2 properties possible? Elasticsearch	1	297	July 6, 2017
Issues with scoring and query boost Elasticsearch	2	403	July 6, 2017
Scoring not behaving as expected Elasticsearch	4	533	July 6, 2017
Intermittent scoring returned Elasticsearch	3	264	July 6, 2017

Querying an alias throws off scoring completely?

Related topics