I am reading data from Elasticsearch using Spark (ES-Spark). After I get
the data with sc.esRDD(".../..."), I want to store everything in HDFS, so I
use the saveAsTextFile method, but it is very slow.
Am I doing this the right way? It takes 15 minutes to save 11 GB.
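
For reference, here is a minimal sketch of the kind of job described above. It is not the exact code: the application name, the es.nodes value, and the HDFS output path are hypothetical placeholders, and the resource string is left elided as in the question. Since Spark evaluates lazily, the saveAsTextFile call is what triggers the read from Elasticsearch, so the 15 minutes probably covers both the read and the write. Printing the partition count may help show how much parallelism the job actually has.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.elasticsearch.spark._ // brings esRDD into scope on SparkContext

object EsToHdfs {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("es-to-hdfs")           // hypothetical application name
      .set("es.nodes", "localhost:9200")  // assumption: address of an Elasticsearch node

    val sc = new SparkContext(conf)

    // esRDD yields (documentId, fieldMap) pairs; nothing is read yet (lazy).
    val docs = sc.esRDD(".../...")

    // Number of Spark partitions, i.e. how many tasks will read and write in parallel.
    println(s"partitions: ${docs.partitions.length}")

    // Action: this is where Elasticsearch is actually read and HDFS is written.
    docs.saveAsTextFile("hdfs:///tmp/es-dump") // hypothetical output path

    sc.stop()
  }
}
```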