mention that "One of the benefits of cutoff_frequency is that you get
domain-specific stopwords for free."
It seems like the index-per-user approach is required here in order to make
the term frequencies accurate. If you used a shared index, or even faked an
index per user, your TF counts for a given field would reflect the index as a
whole (aggregated across the counts for each shard in that index), not just
that user's documents. If you tended to query the documents for one user at a
time using some filter field, the common terms query would probably not return
the results you are expecting.
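To make the concern concrete, here is a rough Python sketch. It is a simplified model, not Elasticsearch's actual implementation: the helper `high_frequency_terms`, the sample documents, the user names, and the 0.5 cutoff are all made up for illustration, and a relative document-frequency ratio stands in for however cutoff_frequency really classifies terms.

```python
from collections import Counter


def high_frequency_terms(docs, cutoff_frequency):
    """Return the terms that appear in more than `cutoff_frequency`
    (a ratio between 0 and 1) of the given documents."""
    doc_freq = Counter()
    for text in docs:
        # Count each term once per document (document frequency).
        doc_freq.update(set(text.lower().split()))
    total = len(docs)
    return {term for term, count in doc_freq.items()
            if count / total > cutoff_frequency}


# Hypothetical shared index: (user, document text) pairs.
corpus = [
    ("alice", "kernel patch review"),
    ("alice", "kernel scheduler bug"),
    ("alice", "kernel memory leak"),
    ("bob", "recipe for bread"),
    ("bob", "recipe for soup"),
    ("bob", "garden notes"),
    ("carol", "travel plans"),
    ("carol", "travel budget"),
    ("carol", "meeting notes"),
]

# Statistics as a shared index would see them: every user's documents.
shared_index = [text for _, text in corpus]
# Statistics as an index-per-user setup would see them for alice.
alice_only = [text for user, text in corpus if user == "alice"]

shared_common = high_frequency_terms(shared_index, 0.5)
alice_common = high_frequency_terms(alice_only, 0.5)

print("shared index:", shared_common)
print("alice only:  ", alice_common)
```

In this toy data, "kernel" appears in every one of alice's documents, so it would act like a stopword for her, but it appears in only a third of the shared index, so statistics taken from the shared index would not treat it as common.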
Am I understanding this correctly?