Hi Everyone, Currently in our Elasticsearch cluster we have a lot of documents which need to deleted, so we are looking to re-index the used documents to a new index and delete the old index. We will be doing this in production environment, the approach is to create a new index, re-index the used documents to a new index, once done we will point the alias to new index to avoid any downtime. Questions for this approach:
-
What configuration can we use during re-indexing to not cause any latency issues to the queries during this task? (like scorll_size, requests_per_second etc)
-
In order to speed up the re-indexing we are thinking about stopping refresh on the new index, will there be any issues when ingest a large amount of documents and turn on the refresh later.
-
Will there be a increase in CPU Utilization or query latency issues when we delete a large index?
Any suggestions are welcome.
Thanks.