I want to copy the data in a distant Es (which I have a limited access ) to another ES where I have full rights.
So,the input Es index contains 50000 documents , and I manage to get only 10000 hits using the following configuration:
use the scroll API, search the forums here and you will get answers. Its always a good practice to search the forum before posting or google it
If still need help please ask here
if I don't specify scroll , then logstash will use the default value which is equal to "1m".
I've searched google and forums but I didn't find any answers
You could use cron or similar to schedule a script that calls the reindex API once a day. The reindex API supports queries, which would allow you to only transfer those entries that were done since the last run (if your data contains timestamps).
When transferring large result sets, you should also look into using the scroll API.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.