Document frequency of phrases

Yingkai_Gao_2 · December 5, 2014, 12:35pm

I am implementing a model called sequential dependency model in
ElasticSearch using dynamic script, and I need to get the collection
frequency (shard frequency in ES) of a phrase. It is possible to get the
term frequency of phrase by comparing the positions of each term in the
phrase, but how could I get collection frequency or document frequency of
phrases? If it is possible, how could I get the collection frequency of
two sloppy near terms?

I guess one solution is to extend a Lucene similarity class. However, can
I do this just using dynamic script?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/fe9dcc76-75b8-4e57-9ade-26134b703e57%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Topic		Replies	Views
Phrase frequency in a document and in the whole collection Elasticsearch	3	1584	October 5, 2016
Score based on phrase frequency only Elasticsearch	0	644	April 21, 2014
Counting the frequency of a term Elasticsearch	1	1865	March 14, 2015
Count of phrase matches per document Elasticsearch	1	3307	August 15, 2017
Getting phrase count for each document separately Elasticsearch	0	319	April 18, 2014

Document frequency of phrases

Related topics