I filter the index for documents containing AAPL(the ticket symbol) (as
part of a field that is filterable).
I get back 1000 documents in no particular order as the request was just a
filter. To this filter, I would like to add a "discriminating/significant"
text that would be found in the 1000 documents. So that the documents
returned are in a sense only those that are significant.
I do not want the terms to be significant against the whole index, but only
against the documents that are returned for the query. Hence I would like
to run some extra analysis against this filter request result to identify
these "discriminating/significant" terms.
I was wondering if I can access the elastic API /underlying implementation
to do the calculations.
On Friday, 25 April 2014 13:09:35 UTC-5, Ivan Brusic wrote:
Can you provide a small example of what you are trying to achieve? Are the
discriminating terms known beforehand or is it dependent on the document?
Have you looked into the new text scoring features which have been released
since the original post? It is worth looking into:
You can probably calculate the TF values during indexing, but not the IDF
since that value is based on all of the documents in a shard.
A variant on this particular request:
I would like to get the tf-idf for an indexed field. (the field is a body
of a news document). I would like to find discriminating terms in the
document set (the document set is a result of executing a filter on the
The discriminating terms are to help with improving the query as the
number of documents returned are too many and relevant documents are
getting lost in the search result (of executing a filter).
Is it possible to run the tf-idf calculations that Elastic does while
indexing the document.(the API to access the TF-IDF calculations)
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to email@example.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/81a1726e-3b08-4de8-b9ea-28b159516e40%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.