Hello,
I am having a problem understanding how scoring of nested documents works.
I have found other people with similar questions which have remained
unanswered:
The relevant section of my current mapping (with nested parts) is:
mappings: {
If I have three people who have visited different numbers of cities and I
search for a common city they have all visited I get different score
values. The person who visited the greatest number of cities is ranked
first, with the person who visited only one city getting a score of 1
(currently ranked lowest). The output of the explanation is that hthe score
is based on 'child doc range from 0 to x'. My question is how do TF, IDF
and Field Norm work for nested documents when the score is being
calculated?
Edit: There is only one shard being used in this mapping.
On Tuesday, October 21, 2014 9:56:51 AM UTC+1, ba...@intalex.com wrote:
Hello,
I am having a problem understanding how scoring of nested documents works.
I have found other people with similar questions which have remained
unanswered:
If I have three people who have visited different numbers of cities and I
search for a common city they have all visited I get different score
values. The person who visited the greatest number of cities is ranked
first, with the person who visited only one city getting a score of 1
(currently ranked lowest). The output of the explanation is that hthe score
is based on 'child doc range from 0 to x'. My question is how do TF, IDF
and Field Norm work for nested documents when the score is being
calculated?
You can for example choose to take the average, max or sum of all the child
documents that match your nested query and reward the parent doc with that
value
On Tuesday, October 21, 2014 9:56:51 AM UTC+1, ba...@intalex.com wrote:
Hello,
I am having a problem understanding how scoring of nested documents works.
I have found other people with similar questions which have remained
unanswered:
If I have three people who have visited different numbers of cities and I
search for a common city they have all visited I get different score
values. The person who visited the greatest number of cities is ranked
first, with the person who visited only one city getting a score of 1
(currently ranked lowest). The output of the explanation is that hthe score
is based on 'child doc range from 0 to x'. My question is how do TF, IDF
and Field Norm work for nested documents when the score is being
calculated?
Thanks for the help Mark.
When calculating relevance can I assume that TF is the number of times that
the term appears in the collapsed nested field? I.e. all of the city names
get merged into one field, or is it handled a different way? Is the Field
Length Norm calculated in the same way?
Barry
On Tuesday, October 21, 2014 3:48:15 PM UTC+1, Mark Harwood wrote:
You can for example choose to take the average, max or sum of all the
child documents that match your nested query and reward the parent doc with
that value
On Tuesday, October 21, 2014 9:56:51 AM UTC+1, ba...@intalex.com wrote:
Hello,
I am having a problem understanding how scoring of nested documents
works. I have found other people with similar questions which have remained
unanswered:
If I have three people who have visited different numbers of cities and I
search for a common city they have all visited I get different score
values. The person who visited the greatest number of cities is ranked
first, with the person who visited only one city getting a score of 1
(currently ranked lowest). The output of the explanation is that hthe score
is based on 'child doc range from 0 to x'. My question is how do TF, IDF
and Field Norm work for nested documents when the score is being
calculated?
After some investigation, the number of nested docs get counted
individually along with the root doc.
On Tuesday, October 21, 2014 4:55:56 PM UTC+1, ba...@intalex.com wrote:
Thanks for the help Mark.
When calculating relevance can I assume that TF is the number of times
that the term appears in the collapsed nested field? I.e. all of the city
names get merged into one field, or is it handled a different way? Is the
Field Length Norm calculated in the same way?
Barry
On Tuesday, October 21, 2014 3:48:15 PM UTC+1, Mark Harwood wrote:
You can for example choose to take the average, max or sum of all the
child documents that match your nested query and reward the parent doc with
that value
On Tuesday, October 21, 2014 9:56:51 AM UTC+1, ba...@intalex.com wrote:
Hello,
I am having a problem understanding how scoring of nested documents
works. I have found other people with similar questions which have remained
unanswered:
If I have three people who have visited different numbers of cities and
I search for a common city they have all visited I get different score
values. The person who visited the greatest number of cities is ranked
first, with the person who visited only one city getting a score of 1
(currently ranked lowest). The output of the explanation is that hthe score
is based on 'child doc range from 0 to x'. My question is how do TF, IDF
and Field Norm work for nested documents when the score is being
calculated?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.