Term Frequency of Different Entities

benny_vaks · February 6, 2013, 6:22am

Hi,

I wanted to find out if the term frequency (that is used to score the IDF) is stored for each different entity or for the whole index?

If it's stored for the whole index. is there any way that I can have a unique terms frequency vector for each type within this index?

My Problem is that I have an application with many different document types. Each type has its own corpus and I don't want that they will affect each other.

For example, if one type contains many occurrences of the term X then I don't want that this will lower the IDF score of X in other types.

I know that this can be achieved using multiple indices but I have many types and some of them contain low number of documents. Hence an index per each type will have bad performance impact.

Thanks guys!

Topic		Replies	Views
Want term frequency in a individual document text fields with my query results. How? Elasticsearch	1	247	October 25, 2022
How to get the term frequency in ES? Elasticsearch	1	1492	February 17, 2017
Different IDF for different documents Elasticsearch	2	452	July 27, 2018
Scoring based on existence of all terms even if one term appears multiple times Elasticsearch	2	408	July 5, 2017
Term Vector ttf from all shards Elasticsearch	1	672	July 6, 2017

Term Frequency of Different Entities

Related topics