Could anyone please refer me to documentation on how Elasticsearch
works "under the hood", especially how it decides on ranking when
collecting results from several nodes?
Could the ranking be slightly different from the one we'd get on a
single centralized index? For example, will we have to give up Lucene's
"IDF" weighting (the boost for rare words)?
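
To make the concern concrete, here is a minimal sketch of what I mean.
It assumes the classic Lucene TF-IDF formula, idf = 1 + ln(numDocs /
(docFreq + 1)), and a made-up corpus split across two hypothetical
shards. It is only an illustration of the question, not a claim about
how Elasticsearch actually computes scores:

```python
import math

def classic_idf(doc_freq: int, num_docs: int) -> float:
    """Classic Lucene-style IDF: 1 + ln(numDocs / (docFreq + 1))."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))

# Hypothetical data: 1,000 documents, the term occurs in 10 of them,
# but the occurrences are unevenly spread over two shards.
shards = [
    {"num_docs": 500, "doc_freq": 9},
    {"num_docs": 500, "doc_freq": 1},
]

# IDF as a single centralized index would compute it.
global_idf = classic_idf(
    sum(s["doc_freq"] for s in shards),
    sum(s["num_docs"] for s in shards),
)

# IDF if each shard only used its own local statistics.
local_idfs = [classic_idf(s["doc_freq"], s["num_docs"]) for s in shards]

print(f"global idf: {global_idf:.3f}")
for i, idf in enumerate(local_idfs):
    print(f"shard {i} local idf: {idf:.3f}")
```

If each shard scored with only its local statistics, the same term
would get a different weight on each shard, so the merged ranking
could differ from the single-index one. Whether that happens in
practice is exactly what I would like to understand.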
The background: I have an application that uses Lucene, and we expect
we'll need to shard because of a leap in the amount of data (moving
from the "startup / proof of concept" stage to an internationally
marketed, stable application).
I believe in understanding how stuff works before basing my
application on it...