In our prod ES cluster we have 5 nodes, and I have to re-index all the
documents. I am doing bulk indexing through a river, and while it runs the
load on the ES nodes increases to 10-15+, which obviously slows down
search. Can anyone suggest what I should do to bulk index through the
river while keeping the load on the nodes under control? I have to
re-index 50 million documents.