Delete duplicate docs in ES 1.7

muhamadli302 · September 18, 2016, 12:38pm

Hello,

I'm having a problem finding the right query for searching and deleting duplicated documents in my index.
I'm using ES 1.7.

Thank you for your help

mainec · September 19, 2016, 12:17pm

How do you define a duplicate?

A brief search across the history of the discuss forum I'm currently typing this answer in revealed the following threads which might be helpful for you:

Hope this helps to get you started,
Isabel

muhamadli302 · September 22, 2016, 8:15am

Hi,
Thank you for your reply.
By a duplicate I mean the same document, with the same value in its unique field, stored more than once in the same index.
This situation is caused by problems in inserting the data. So now I need to find an efficient way to find all the duplicates and delete them (leaving only one copy). It would be best if you could help us create a query that will find those duplicates and delete them.

Thank you!
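
One possible approach, sketched here in Python and not taken from any reply in this thread: first ask Elasticsearch which values of the unique field occur in more than one document (a `terms` aggregation with `min_doc_count: 2`), then fetch the documents for those values and keep one copy of each. The field name `uid` and both helper functions are hypothetical, and the sketch assumes the unique field is mapped as `not_analyzed`, since an analyzed string field would be bucketed by token instead of by whole value. The code only builds the request body and picks the IDs to remove; sending the search and the deletes is left to whichever client you use.

```python
def duplicate_buckets_query(field, size=1000):
    """Build a search body whose aggregation lists values of `field`
    that appear in at least two documents."""
    return {
        "size": 0,  # no hits needed, only the aggregation buckets
        "aggs": {
            "dupes": {
                "terms": {"field": field, "min_doc_count": 2, "size": size}
            }
        },
    }


def ids_to_delete(hits, field):
    """Given search hits (dicts with "_id" and "_source"), keep the first
    document seen for each value of `field` and return the IDs of the rest."""
    seen = set()
    doomed = []
    for hit in hits:
        key = hit["_source"].get(field)
        if key is None:
            continue  # documents without the field are left alone
        if key in seen:
            doomed.append(hit["_id"])
        else:
            seen.add(key)
    return doomed


# Example with hand-written hits; "uid" is an assumed field name.
sample_hits = [
    {"_id": "1", "_source": {"uid": "a"}},
    {"_id": "2", "_source": {"uid": "a"}},
    {"_id": "3", "_source": {"uid": "b"}},
    {"_id": "4", "_source": {"uid": "a"}},
]
print(ids_to_delete(sample_hits, "uid"))
```

Which copy survives depends on the order of the hits, so sort the search (for example by a timestamp field, if you have one) when it matters which one is kept. Try it on a copy of the index before deleting anything for real.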
