Score and relevance across the shards

konstantin · September 13, 2011, 1:14pm

Hi guys,

I need help understanding how ES merges scores across different
shards. As far as I know, each shard is a Lucene index instance, so each
one has its own tf-idf normalization that is independent of the other
shards. So, here is the first question: does the score depend on how
the data is distributed across the shards? Second, does the rank of
documents depend on how the data is distributed across the shards? Third,
what is the algorithm for merging scores across the shards?

Sorry if my questions are vague. Just ask me and I'll clarify the
points.
Thanks!

kimchy · September 13, 2011, 9:00pm

Hey, by default, the scores from each shard are used as is, and the merged
results are sorted by them. So, the scores use tf-idf as it is computed on
each shard, which means they will depend on how the data is distributed.
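To make the effect concrete, here is a minimal sketch of the default behaviour described above. It is not Elasticsearch or Lucene code: the scoring is a simplified tf-idf, and the function names (`score_shard`, `merge_hits`) and the sample shards are made up for illustration. Two documents with identical text end up with different scores only because they live on shards with different term statistics.

```python
import heapq
import math


def idf(doc_count, doc_freq):
    # Simplified idf in the classic Lucene style: 1 + ln(N / (df + 1)).
    return 1.0 + math.log(doc_count / (doc_freq + 1))


def score_shard(docs, term):
    """Score the documents of one shard using only that shard's statistics."""
    doc_freq = sum(1 for _, text in docs if term in text.split())
    weight = idf(len(docs), doc_freq)
    hits = []
    for doc_id, text in docs:
        tf = text.split().count(term)
        if tf:
            hits.append((math.sqrt(tf) * weight, doc_id))
    return sorted(hits, reverse=True)


def merge_hits(per_shard_hits, size):
    """Merge shard results by raw score, with no cross-shard renormalization."""
    all_hits = (hit for hits in per_shard_hits for hit in hits)
    return heapq.nlargest(size, all_hits)


# Hypothetical data: "lucene" is common on shard A and rare on shard B.
shard_a = [("a1", "lucene search"), ("a2", "lucene index"), ("a3", "lucene shard")]
shard_b = [("b1", "lucene search"), ("b2", "index shard"), ("b3", "query shard")]

hits_a = score_shard(shard_a, "lucene")
hits_b = score_shard(shard_b, "lucene")
merged = merge_hits([hits_a, hits_b], size=10)
```

In this toy setup "a1" and "b1" contain exactly the same text, yet "b1" ranks first in `merged` because the term is rarer on its shard. So both the score and the rank depend on how the data is distributed.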

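If you want, you can set the search type to one that has a DFS phase (for
example dfs_query_then_fetch). That phase first collects the term
frequencies from all the shards and aggregates them, so every shard scores
with the same statistics, but it also means a slower search. See the search
type documentation on the Elastic site.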
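A rough sketch of what the DFS phase changes, under the same caveats: this is a toy model with a simplified tf-idf, not the actual Elasticsearch implementation, and `shard_stats` and `dfs_search` are hypothetical names. The point is the extra round trip: gather the document counts and document frequencies from every shard first, then score everywhere with the aggregated values.

```python
import math
from collections import Counter


def idf(doc_count, doc_freq):
    # Simplified idf in the classic Lucene style: 1 + ln(N / (df + 1)).
    return 1.0 + math.log(doc_count / (doc_freq + 1))


def shard_stats(docs, term):
    """DFS phase on one shard: report local doc count and doc frequency."""
    doc_freq = sum(1 for _, text in docs if term in text.split())
    return Counter(docs=len(docs), df=doc_freq)


def dfs_search(shards, term, size):
    # Phase 1 (the extra cost): aggregate the frequencies from all shards.
    totals = Counter()
    for docs in shards:
        totals.update(shard_stats(docs, term))
    weight = idf(totals["docs"], totals["df"])

    # Phase 2: every shard scores with the same global weight.
    hits = []
    for docs in shards:
        for doc_id, text in docs:
            tf = text.split().count(term)
            if tf:
                hits.append((math.sqrt(tf) * weight, doc_id))
    return sorted(hits, reverse=True)[:size]


# Hypothetical data: "lucene" is common on shard A and rare on shard B.
shard_a = [("a1", "lucene search"), ("a2", "lucene index"), ("a3", "lucene shard")]
shard_b = [("b1", "lucene search"), ("b2", "index shard"), ("b3", "query shard")]

results = dfs_search([shard_a, shard_b], "lucene", size=10)
```

With aggregated frequencies, documents with the same text get the same score regardless of which shard holds them. In Elasticsearch itself this is selected per request with the `search_type` parameter, e.g. `search_type=dfs_query_then_fetch`.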

