Hot-Warm-Cold - avoid duplicates

Linuus · April 7, 2020, 6:08am

Hi!

I'm moving to a new ES cluster which also uses the Hot-Warm-Cold architecture. Now, when we imported our old data (600 million documents) it took quite a long time and after a while we noticed that we have duplicate documents, with the same _id field.

It seems like a few documents are present in both the "Hot" index as well as the "Warm" index. Previously this wasn't possible since we only had one index, but now there are separate indexes for Hot and Warm so duplicates are possible.

Are there any ways of avoiding these things happening?

system · May 5, 2020, 6:08am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Hot/warm data idempotency and vertical scaled ES infrastructure using Curator Elasticsearch	19	1276	February 5, 2019
Hot and Warm architecture Elasticsearch	1	737	June 27, 2017
Multiple clusters for hot warm architecture Elasticsearch	9	864	December 12, 2018
How to set-up Hot, Warm and cold for single node Elasticsearch? Elasticsearch ilm-index-lifecycle-management	9	1631	March 10, 2022
Please help newbie to create hot and warm cluster nodes Elasticsearch	5	1092	March 11, 2019

Hot-Warm-Cold - avoid duplicates

Related topics