I am trying to figure out whether query scores are ordinal or not. Trying
to figure this out, but have a problem with the effects coord function in
Lucene's forumla would do. Basically, if I have query A that says document
X is relevant with some score W, and query B which is relevant to document
Y with some score V, and W > V can I say that document X is more related A,
than a document Y is related to B.
Query scores are not ordinal. Lucene discourages comparing scores from
different queries since the search context is important in scoring. Query
norms are supposed to help normalize scores across queries, but as you
discovered, that is not normally what happens. Your theory will probably
hold true for many cases, but it is not 100% correct in all cases.
I am trying to figure out whether query scores are ordinal or not. Trying
to figure this out, but have a problem with the effects coord function in
Lucene's forumla would do. Basically, if I have query A that says document
X is relevant with some score W, and query B which is relevant to document
Y with some score V, and W > V can I say that document X is more related A,
than a document Y is related to B.
Thanks for confirming my suggestions. Even though the statement would stand
in general, since it is not ordinal it can only produce ranking, right?
So, if I would change the scoring function to be simple cosine of tf.idf
weights it should be ordinal. Can I do this in MLT query?
Thanks again
On Friday, January 31, 2014 12:42:38 AM UTC+1, Ivan Brusic wrote:
Query scores are not ordinal. Lucene discourages comparing scores from
different queries since the search context is important in scoring. Query
norms are supposed to help normalize scores across queries, but as you
discovered, that is not normally what happens. Your theory will probably
hold true for many cases, but it is not 100% correct in all cases.
--
Ivan
On Thu, Jan 30, 2014 at 7:06 AM, Ilija Subasic <subasi...@gmail.com<javascript:>
wrote:
I am trying to figure out whether query scores are ordinal or not. Trying
to figure this out, but have a problem with the effects coord function in
Lucene's forumla would do. Basically, if I have query A that says document
X is relevant with some score W, and query B which is relevant to document
Y with some score V, and W > V can I say that document X is more related A,
than a document Y is related to B.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.