I don't quite understand the BM25 formula explained here:
What is meant by the field length?
Field length is the number of terms in the current field of the current document.
For example:
If doc1 has a field field1 with the value "foo foo bar", its field length is 3.
If doc2 has a field field1 with the value "foo foo foo bar bar", its field length is 5.
If these are the only two documents in the index, the avgFieldLen of field1 is (3 + 5) / 2 = 4.
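The arithmetic above can be reproduced with a short Python sketch. It assumes plain whitespace tokenization, which is a simplification: the real term count depends on the analyzer configured for the field. The helper names (`field_length`, `length_norm`) are made up for illustration and are not part of any Elasticsearch API. The last function shows how the two lengths are commonly combined in BM25's length normalization, using b = 0.75 (the Elasticsearch default).

```python
# Hypothetical helpers for illustration only; not an Elasticsearch API.

def field_length(value: str) -> int:
    """Count terms in a field value, assuming whitespace tokenization."""
    return len(value.split())


# The two documents from the example, each with a single field "field1".
docs = {
    "doc1": "foo foo bar",
    "doc2": "foo foo foo bar bar",
}

# Field length per document.
lengths = {doc_id: field_length(value) for doc_id, value in docs.items()}

# avgFieldLen: mean field length over all documents in the index.
avg_field_len = sum(lengths.values()) / len(lengths)


def length_norm(field_len: int, avg_len: float, b: float = 0.75) -> float:
    """Length-normalization term commonly written as 1 - b + b * fieldLen / avgFieldLen."""
    return 1 - b + b * field_len / avg_len


for doc_id, n in lengths.items():
    print(doc_id, "fieldLen =", n, "norm =", length_norm(n, avg_field_len))
print("avgFieldLen =", avg_field_len)
```

A document shorter than average (doc1) gets a normalization value below 1, and a longer one (doc2) gets a value above 1, which is how field length dampens the score of long fields.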
Thank you very much!