Store indexes in ES while the data stays in HDFS


(Athul Raj) #1

Is there a way to index data from an HDFS location into elasticsearch without moving/copying the same to ES? es-hadoop seems to copy the data into ES cluster there by making a copy of the same big chunk of data. Is there a config that I am missing out? Or is it that the whole idea is absurd?

It'd be helpful to know if this is possible, and if yes, a detailing about the same would be much appreciated.


(Mark Walkom) #2

No there is not.


(Athul Raj) #3

So, there's no way but to have two individual copies of the data in HDFS and ES, right?


(Mark Walkom) #4

That is correct.


(system) #5