Is the score consistent for different MLT-queries?

sandros · May 13, 2016, 8:39am

Hi all,

I use the more_like_this query in order to recommend similar content. The content is news-articles. Every now and then it happens that there is a very special article which does not really have similar content (i.e all the top terms of the mother article never appear together anywhere else) and then it uses not very specific terms from the mother to recommend kids (aka related articles).

So I was wondering whether the score is reliable and consistent for all possible mother articles? Such that it can be used as a filter, i.e: if (score_of_kid <3) then discard it.

In the call I use "max_query_terms"=52, so there are not always the same number of terms for each mother article (because some fields are quite short).

Topic		Replies	Views
Questions about MoreLikeThis Elasticsearch	3	476	July 6, 2017
More like this scoring algorithm unclear Elasticsearch	5	2547	July 6, 2017
Compare multiple MLT queries score Elasticsearch	1	814	February 5, 2017
Getting similarity scores by issuing MLT-queries doesn't work for some documents Elasticsearch	4	424	May 18, 2020
MoreLikeThis similarity ranking Elasticsearch	1	255	July 6, 2017

Is the score consistent for different MLT-queries?

Related topics