how can I obtain BM25 score between two arbitrary documents D1 and D2. please note that I don't want to find Documents similar to D1 or D2. so I think more_like_this query is not proper for my case.
I don't knew that, thank you! but it is not important right now, because as you see structure of documents are same. I can store my documents in one type.
are you have any solution for main problem?
for calculating BM25 by myself, I need length of document and average document length on whole documents in collection. according to formula, length of a document is number of it's word. I search a lot, but no success to find a way to elastic return this values to me.
could you please give me a solution?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.