Query to get one doc out of its duplicate

Manikantaks · August 26, 2020, 6:48am

I have ELK set up . In specific index there are around 70Lac records , out of which some are duplicates . I need to create a filter in kibana so that only uniq records should be return. I need to check for two field for the records uniqness . Example , phone number and city fields of Document 1 and Document 2 are same then we consider those Documents are same , in this case query should return any one of these to Documents. here Id is auto generated we can't use that for uniqness. Need to help on this.

Nathan_Reese · August 28, 2020, 2:02pm

Why does your data have duplicates? Are the duplicates needed for other use cases? How is your data getting ingested? I would recommend removing duplicates during ingest. You can specify your own _id field. Maybe it makes sense to make this field be the concatenation of phone number and city.

Manikantaks · September 1, 2020, 7:33am

@Nathan_Reese Data already ingested. Now I need to read data based on the above condition . I need to do operations already existing data.

system · September 29, 2020, 7:33am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Retrieve duplicate data using Kibana search bar Kibana	2	8204	March 9, 2017
Remove Duplicate records Kibana	3	4502	October 8, 2018
Delete API for duplicate records Kibana	3	222	September 23, 2021
Display all documents with duplicated value of a given field Kibana	10	4529	June 6, 2019
Dropping duplicate documents Kibana	4	246	September 12, 2019

Query to get one doc out of its duplicate

Related topics