I have a running Elastic node with an index. I'm constantly inserting documents into the index in a process that I can't stop. The whole process is running in a remote server. I need to make a copy of the index into my local machine in order to experiment with the data
Is it possible to make a copy of the index into my local machine without interrupting the data ingestion in the server's node using _reindex?
Yes. You can reindex from the existing remote cluster. This will reindex all the data that was persent in the index at the time you started the reindexing. The remote cluster you are reading from can still continue to receive new data.
As Christian said you can use reindex from remote, on your local node you will need to configure reindex.remote.whitelist pointing to your remote host to allow the reindex, you will run the reindex request in your local node.
Keep in mind that the reindex will not get new documents created after you started the reindex process.
Oh cool, I was under the impression that reindexing has more implications than snapshot/restore and that it can be used only between two Elastic cloud clusters for data migration, but not with self managed deployment.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.