We are using Elasticsearch 7.11.1 and recently ran into an issue. The details are below.
- We store data at 15-minute intervals, and the timestamp comes from our input file (e.g. 05:00, 23:15, 20:30, 11:45).
- We recently noticed that our input file for 23:15 has 1890 records, but the index has 3533 records for the same interval.
- We now want to delete the 1643 duplicate records from the index without disturbing the other 1890 records.
We need an API query for that.
For example, given this input file:
name product sale id
sai pen 100 1
kumar car 30 2
sai pen 100 1
sai pen 100 1
ram bike 288 3
kumar car 30 2
After deleting the duplicates, my index should look like this:
name product sale id
sai pen 100 1
ram bike 288 3
kumar car 30 2
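To make the expected result concrete, here is a minimal Python sketch of the de-duplication rule applied to the sample rows above: keep the first record seen for each (name, product, sale, id) combination and treat every later copy as a duplicate. The `_id` values and the helper name `split_duplicates` are made up for illustration; they only stand in for Elasticsearch document ids.

```python
from typing import Dict, List, Sequence, Tuple

# Sample rows from the input file above. "_id" is a hypothetical stand-in
# for the Elasticsearch document id (assumption, not real data).
rows: List[Dict] = [
    {"_id": "a", "name": "sai", "product": "pen", "sale": 100, "id": 1},
    {"_id": "b", "name": "kumar", "product": "car", "sale": 30, "id": 2},
    {"_id": "c", "name": "sai", "product": "pen", "sale": 100, "id": 1},
    {"_id": "d", "name": "sai", "product": "pen", "sale": 100, "id": 1},
    {"_id": "e", "name": "ram", "product": "bike", "sale": 288, "id": 3},
    {"_id": "f", "name": "kumar", "product": "car", "sale": 30, "id": 2},
]


def split_duplicates(
    docs: Sequence[Dict], key_fields: Sequence[str]
) -> Tuple[List[Dict], List[str]]:
    """Return (records to keep, _ids of the extra copies to delete)."""
    seen = set()
    keep: List[Dict] = []
    extra_ids: List[str] = []
    for doc in docs:
        key = tuple(doc[field] for field in key_fields)
        if key in seen:
            # A record with the same key was already kept: this one is a duplicate.
            extra_ids.append(doc["_id"])
        else:
            seen.add(key)
            keep.append(doc)
    return keep, extra_ids


keep, extra_ids = split_duplicates(rows, ("name", "product", "sale", "id"))
```

With the sample data, three records are kept (one each for sai, kumar and ram) and three are marked for deletion. The order of the kept records may differ from the listing above, which does not matter for the index.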
I need help with:
- a query to find only the duplicates at 23:15
- a query to delete those duplicates
Can you please share the API queries for the above issue?
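For reference, one approach I am considering is sketched below in Python, as an untested outline rather than a confirmed solution. It builds a search body that uses a `terms` aggregation with `min_doc_count: 2` and a nested `top_hits` to list the documents sharing the same key within one timestamp, then turns every copy except the first into `delete` actions for the `_bulk` API. The index name `sales`, the field names `timestamp` and `id`, and the timestamp value format are assumptions; they must match the real mapping.

```python
import json
from typing import Dict, List


def find_duplicates_body(timestamp: str, key_field: str = "id",
                         max_groups: int = 1000, max_docs: int = 100) -> Dict:
    """Body for POST /<index>/_search (field names are assumptions)."""
    return {
        "size": 0,  # only the aggregation result is needed
        "query": {"term": {"timestamp": timestamp}},
        "aggs": {
            "dupes": {
                # Only keys that occur at least twice are returned.
                "terms": {"field": key_field, "min_doc_count": 2,
                          "size": max_groups},
                "aggs": {
                    "docs": {"top_hits": {"size": max_docs, "_source": False}}
                },
            }
        },
    }


def ids_to_delete(response: Dict) -> List[str]:
    """Keep the first hit of every bucket and collect the remaining _ids."""
    extra: List[str] = []
    for bucket in response["aggregations"]["dupes"]["buckets"]:
        hits = bucket["docs"]["hits"]["hits"]
        extra.extend(hit["_id"] for hit in hits[1:])
    return extra


def bulk_delete_body(index: str, doc_ids: List[str]) -> str:
    """NDJSON body for POST /_bulk, one delete action per line."""
    lines = [json.dumps({"delete": {"_index": index, "_id": doc_id}})
             for doc_id in doc_ids]
    return "\n".join(lines) + "\n"  # the bulk API expects a trailing newline


# Hand-written response in the shape _search returns (not real data).
sample_response = {"aggregations": {"dupes": {"buckets": [
    {"key": 1, "doc_count": 3, "docs": {"hits": {"hits": [
        {"_id": "a"}, {"_id": "c"}, {"_id": "d"}]}}},
    {"key": 2, "doc_count": 2, "docs": {"hits": {"hits": [
        {"_id": "b"}, {"_id": "f"}]}}},
]}}}

search_body = find_duplicates_body("23:15")
extra = ids_to_delete(sample_response)
bulk_body = bulk_delete_body("sales", extra)
```

This assumes `id` alone identifies a record within a timestamp. If the full row has to match, the key would need to combine several fields, and groups with more copies than `max_docs` would need more than one pass.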