Common cause for discrepancies while testing. You can enable distributed
term frequencies (only a slight performance hit), but in general, IDF
values normalize as you add more non-trivial content. If you have only a
handful of test documents, then IDF values will almost always be different
(with the default 5 shards).
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.