I have an ES index which is continuously receiving updates. Apart from inserts and deletes, my index accepts a lot of update operations. Over a period of time, I see a lot of disk IO performance issues because of which I am planning to reindex my cluster every 6 months or so.
I checked out the Reindex API, but this seems to use the current snapshot of the cluster to start copying data. What about the data that is continuously streaming in ? Whats the best way to handle that ?
Also, when you say ReIndex API copies data from one index to another, is it only the data that is copied ? Would disk defragments carry over to the new index ?