I plan to store serveral millions of news documents using ES.
Now if a user queries something like "obama", hundreds of thousand
documents will be returned to the user. This will definitely take a
lot of time. (If I am wrong about this, plz correct me)
So I wanted to limit the size of documents to be searched and
returned.
Obviously I want the most relevant documents to be returned. And I
came across "min_score" function. (http://www.elasticsearch.org/guide/
reference/api/search/min-score.html)
According to the example, looks like I get to choose the score. But I
am wondering what the definition of this "score" is.
Is it the degree of relevance of the document to the query? (like,
cosine similarity?)
Or is it a some kind of percentage of the whole documents that are
hit?
I plan to store serveral millions of news documents using ES.
Now if a user queries something like "obama", hundreds of thousand
documents will be returned to the user. This will definitely take a
lot of time. (If I am wrong about this, plz correct me)
No. By default only the 10 first relevant will be returned.
It will be very fast.
Just try to play with it and you will see how easy ES is.
I plan to store serveral millions of news documents using ES.
Now if a user queries something like "obama", hundreds of thousand
documents will be returned to the user. This will definitely take a
lot of time. (If I am wrong about this, plz correct me)
By default, Elasticsearch will return you the 10 most relevant docs,
which sounds like exactly what you want
I plan to store serveral millions of news documents using ES.
Now if a user queries something like "obama", hundreds of thousand
documents will be returned to the user. This will definitely take a
lot of time. (If I am wrong about this, plz correct me)
By default, Elasticsearch will return you the 10 most relevant docs,
which sounds like exactly what you want
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.