Index HDFS data


(Prasad N) #1

Sorry if this repeats an earlier question, but I am trying to get my answers straight. I have CSV files coming into Hadoop/HDFS, and I would like to build an index over this data in Elasticsearch, updated incrementally as new files arrive.

One option is to create an external Hive table on the folder and use the Hive plugin.
Another is Pig.

Is there a way to index directly without using Hive or Pig? If so, do I have to convert the data to JSON?

Thanks
Logic4fun


(Joshua Rich) #2

Have you tried the ES-Hadoop connector? It can read data from HDFS and index it into Elasticsearch.
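
On the JSON part of the question: Elasticsearch stores JSON documents, so each CSV row ends up as one JSON object whichever route you take. As a rough sketch only (this is not the ES-Hadoop connector, and it does not talk to HDFS or Elasticsearch), here is how CSV text could be turned into a request body for the Elasticsearch bulk API. The function name, the `hdfs-csv` index name, and the choice of a content hash as `_id` are my assumptions for illustration; the hash is there so that re-sending a row during an incremental load overwrites the document rather than duplicating it.

```python
import csv
import hashlib
import io
import json


def csv_to_bulk_ndjson(csv_text, index_name="hdfs-csv"):
    """Convert CSV text (first line is the header) into a _bulk request body.

    index_name is a hypothetical index; pick whatever fits your data.
    """
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Serialize the row with sorted keys so the same row always
        # produces the same JSON string, and therefore the same hash.
        source = json.dumps(row, sort_keys=True)
        # Assumption: a SHA-1 of the row content is used as the document id,
        # so indexing the same row twice overwrites instead of duplicating.
        doc_id = hashlib.sha1(source.encode("utf-8")).hexdigest()
        # The bulk format is one action line followed by one source line.
        lines.append(json.dumps({"index": {"_index": index_name, "_id": doc_id}}))
        lines.append(source)
    if not lines:
        return ""
    # The bulk body must be newline-delimited and end with a newline.
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    sample = "id,name\n1,alice\n2,bob\n"
    print(csv_to_bulk_ndjson(sample), end="")
```

The resulting text would be sent to the `_bulk` endpoint with the `application/x-ndjson` content type. All values come out as strings here because the CSV reader does no type conversion; numeric or date fields would need an explicit mapping or a conversion step.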


(Prasad N) #3

I did not. Let me download and look into it.

