Find duplicate


(mr_max) #1

I have a new task find duplicates in contacts.
I have a store of contacts in ElasticSearch

contact : {
  name : 'Peter",
  emails: ["test@test.ru","test@gmail.com"],
  phones: [3242343,4234324]
}

Can i find duplicate contacts using ElasticSearch?


(Loren Siebert) #2

aggregate on the three fields, setting min_doc_count to 2 on the sub-sub-aggregation field?


(system) #3