Sorry if this is a gross repeat of the question but I am trying to get my answers straight. I have CSV files coming into hadoop/hdfs. I would like to create an index for this data (incremental index)
One option is Create External table on folder and use Hive Plugin
Another is PIG.
Is there an way to directly index without using Hive or Pig ? If so do i have to convert data to JSON ?