Keep the "last" dump only in an elasticsearch index


(RayC) #1

Hi guys,

Let's say I have an index "A" that needs to keep the last dump of data. Data come in dumps with a few days of interval (irregular). The events of entities have entity ID's and I put each event by its ID (as document_id) in the index A, Over the days there are existing entities and new entities coming in the dumps.

Whatever the case, I'd like to keep the index containing only the last dump of entities. But problem kicks in if some entities got deleted over the dumps since the index will still keep all the deleted ones.

I know it's not easy to remove the "deleted" entities from the index. How could you guys do in such situation?

Thanks!


(system) #2

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.