Calculate term co-occurrence matrix


(Aeneas Wiener) #1

Hi,

Is there an efficient way to calculate a term co-occurrence matrix
http://lucene.472066.n3.nabble.com/Calculate-Term-Co-occurrence-Matrix-td1226990.html
(i.e. a matrix showing how often a pair of terms of occurred in the same
document.)

I tried fetching a list of relevant terms using a terms aggregation, and
then I have two nested loops to calculate co-occurence of these terms via a span
near query
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/query-dsl-span-near-query.html.
Is there a faster way to do this in elasticsearch?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/fe184e70-4343-4a16-b941-0f31de19064e%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(Hossein Nasr Esfahani) #2

Hi Aeneas,

I was wonder if you have found any efficient way to solve this problem or not?

Thanks,


(Mark Harwood) #3

See https://www.elastic.co/guide/en/elasticsearch/reference/current/search-aggregations-bucket-adjacency-matrix-aggregation.html


(system) #4