I'm wondering is there away to calculate BM25 score for the query itself
(query-against-query)?
Adding it to index seems to be invalid solution as it will influent the
tf-idf of the index and make it corrupted.
Isn't the point of BM25 to use variable document length normalization? It
works when used on the entire index/corpus. It is meant to influence the TF
values.
Comparing results between Lucene queries is not advisable. Why did you
switch to BM25? Do you field lengths vary much?
I'm wondering is there away to calculate BM25 score for the query itself
(query-against-query)?
Adding it to index seems to be invalid solution as it will influent the
tf-idf of the index and make it corrupted.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.