Store indexes in ES while the data stays in HDFS

(Athul Raj) #1

Is there a way to index data from an HDFS location into elasticsearch without moving/copying the same to ES? es-hadoop seems to copy the data into ES cluster there by making a copy of the same big chunk of data. Is there a config that I am missing out? Or is it that the whole idea is absurd?

It'd be helpful to know if this is possible, and if yes, a detailing about the same would be much appreciated.

(Mark Walkom) #2

No there is not.

(Athul Raj) #3

So, there's no way but to have two individual copies of the data in HDFS and ES, right?

(Mark Walkom) #4

That is correct.

(system) #5