How document similarity works


#1

Forgive me for the beginner question but can someone explain how basic document similarity works in Elastic search. Specifically, is something like KNN used to compute the similarity score between two document vectors or instead Naive Bayes, SVM or some other algorithm is used. Thanks


(Mark Walkom) #2

We use BM25, you can read more on this page https://www.elastic.co/guide/en/elasticsearch/reference/5.5/index-modules-similarity.html


(system) #3

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.