Ingest issue during re-indexing/cloning?

GenSSC · February 24, 2023, 7:03pm

Hello!

I need to re-index multiple indices prior to an update of our stack.

To test the reindexing process, I am cloning an index. However, cloning requires the index to be read-only. My question is: what happens if data is ingested during the read-only phase? Will it be lost?

Thank you!

leandrojmp · February 24, 2023, 9:18pm

Probably yes. To clone an index, it needs to be in read-only mode, which blocks writes, so new data will not be added to the index.
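For reference, the clone sequence described above can be sketched as three REST requests: block writes, clone, then remove the block. This is only an illustration that builds and prints the requests, it does not contact a cluster. The index names `logs-old` and `logs-old-clone` and the helper `clone_steps` are hypothetical.

```python
import json


def clone_steps(source, target):
    """Return the ordered (method, path, body) requests for cloning an index.

    Index names are placeholders; nothing is sent to a cluster here.
    """
    return [
        # 1. Block writes on the source index (required before cloning).
        ("PUT", f"/{source}/_settings", {"settings": {"index.blocks.write": True}}),
        # 2. Clone the source into a new target index.
        ("POST", f"/{source}/_clone/{target}", None),
        # 3. Reset the write block (null restores the default) so writes resume.
        ("PUT", f"/{source}/_settings", {"settings": {"index.blocks.write": None}}),
    ]


for method, path, body in clone_steps("logs-old", "logs-old-clone"):
    print(method, path, json.dumps(body) if body is not None else "")
```

Any write that reaches the source index between steps 1 and 3 is rejected, which is why the behaviour of the ingesting client matters.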

Depending on how you are indexing the data, it may still be written once you remove the read-only block, but you may also lose that data.

Clone and reindex are different operations. You can reindex an index that is still receiving data, but it will only reindex the data that already exists in the index when you trigger the reindex action. Any data added after that will not be present in the reindexed index, and you will need to do another reindex to get it.
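As a rough sketch of what "another reindex" could look like, the snippet below builds two `_reindex` request bodies: a first full copy, and a follow-up pass that sets `op_type` to `create` with `conflicts` set to `proceed`, so only documents missing from the destination are added. The index names and the helper `reindex_body` are hypothetical, and the follow-up pass assumes documents are only ever added to the source, not updated, since existing documents are skipped.

```python
import json


def reindex_body(source, dest, only_missing=False):
    """Build a _reindex request body (index names are placeholders)."""
    body = {"source": {"index": source}, "dest": {"index": dest}}
    if only_missing:
        # Only create documents that are not yet in the destination,
        # and keep going when a document already exists there.
        body["conflicts"] = "proceed"
        body["dest"]["op_type"] = "create"
    return body


print("POST /_reindex")
print(json.dumps(reindex_body("logs-old", "logs-new"), indent=2))

print("POST /_reindex  (follow-up pass)")
print(json.dumps(reindex_body("logs-old", "logs-new", only_missing=True), indent=2))
```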

GenSSC · February 27, 2023, 1:11pm

Thank you for your response.

Regarding the reindexing, do you mean that data added, let's say even one week later, will not be reindexed? In my environment, data is being ingested all the time.

leandrojmp · February 27, 2023, 1:13pm

I mean any data added after the reindex request is made.

If you make a reindex request for an index now, and 1 millisecond later you add a new document, this new document will not be part of that reindex request and will not be reindexed.

GenSSC · February 27, 2023, 1:45pm

So basically when you reindex, it sort of locks your index forever?
I need to reindex everything because my indices were created with an older version. Someone from Elastic told me I need to reindex pretty much everything, but never told me I wouldn't be able to work with my indices afterwards...

leandrojmp · February 27, 2023, 2:17pm

If you need to reindex your indices to upgrade their index version, you should stop writing to the old indices and start writing to new ones.

If you keep writing to the old indices, it makes no sense to reindex, because a reindex is a snapshot of the data at the moment the request is made. You would need to keep doing reindex after reindex to keep your data up to date.

In your case, you should start ingesting your data into new indices and reindex the old data from indices that are no longer being written to.
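One possible way to lay out that plan, shown only as a sketch: create a new index, move writes to it, then reindex the old index once nothing writes to it anymore. The alias step is an assumption that the ingest pipeline writes through an alias (here the hypothetical `logs-write`); if it writes to an index name directly, the switch has to happen in the ingest configuration instead. All names and the helper `migration_requests` are hypothetical, and nothing is sent to a cluster.

```python
import json


def migration_requests(old_index, new_write_index, reindex_dest, write_alias):
    """Return ordered (method, path, body) requests for switch-then-reindex.

    Assumes ingestion writes through `write_alias`; all names are placeholders.
    """
    switch_writes = {
        "actions": [
            # New documents now go to the new index...
            {"add": {"index": new_write_index, "alias": write_alias, "is_write_index": True}},
            # ...while the old index stays reachable through the alias for reads.
            {"add": {"index": old_index, "alias": write_alias, "is_write_index": False}},
        ]
    }
    copy_old = {"source": {"index": old_index}, "dest": {"index": reindex_dest}}
    return [
        ("PUT", f"/{new_write_index}", None),  # create the new write index
        ("POST", "/_aliases", switch_writes),  # stop writing to the old index
        ("POST", "/_reindex", copy_old),       # old index is now static, copy it once
    ]


for method, path, body in migration_requests(
    "logs-old", "logs-new", "logs-old-reindexed", "logs-write"
):
    print(method, path, json.dumps(body) if body is not None else "")
```

Because the old index no longer receives writes when the reindex starts, a single pass copies everything and no follow-up reindex is needed.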

system · March 27, 2023, 2:17pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
