I'm currently evaluating Elasticsearch for our project. In this project we want to completely reindex our database from time to time, which will probably take anywhere from a few hours to a couple of days.
This must not affect the production system, as users are working with it 24/7.
And here is my question:
- Is there a best practice for achieving this?
Any hints or keywords to search for are highly appreciated.
P.S.: My current thoughts on this:
- Should we index the data into another cluster, node or index?
- Can we replicate it into our live system?
- Should we write data with some sort of notYetVisible boolean?
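To make the "another index" idea concrete, this is roughly what I have in mind: build the fresh data in a separate index while the live one keeps serving, then switch an alias over in one request. Below is a minimal Python sketch, assuming the alias actions endpoint (POST /_aliases) works the way I understand it. The alias and index names are made up for illustration, and the code only builds the request body; it does not talk to a cluster.

```python
import json


def build_alias_swap(alias, old_index, new_index):
    """Build the body for one POST /_aliases request that moves
    `alias` from `old_index` to `new_index`.

    Both actions travel in the same request, so the idea is that
    searches against the alias never see a moment without an index.
    """
    return {
        "actions": [
            {"remove": {"index": old_index, "alias": alias}},
            {"add": {"index": new_index, "alias": alias}},
        ]
    }


# Hypothetical names: the application would only ever query "products",
# while "products_v2" is filled in the background.
body = build_alias_swap("products", "products_v1", "products_v2")
print(json.dumps(body, indent=2))
```

If something like this is viable, the application would always query the alias and never a concrete index name, and the old index could be deleted after the switch.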