ElasticSearch and Hadoop

Hi everyone,

I'd like to share with you some changes that we have made to the
ElasticSearch Hadoop project:

  • Old code base moved to 'elasticsearch-hdfs' project
    Currently 'elasticsearch-hadoop' provides an HDFS shared gateway which, as
    you know, is deprecated and will be replaced with snapshot/restore feature
    in ES. As Hadoop is much more then HDFS, we decided to to rename the GitHub
    project to 'elasticsearch-hdfs'.

  • 'Fresh' code integrating ES with MapReduce, Hive and Pig. More to come in
    the near future.
    However, we're not doing the rename for nothing - starting today, in
    'elasticsearch-hadoop' you can find fresh code providing integration
    between ElasticSearch and Hadoop (MapReduce, Hive, Pig with more to come).
    We have just started working on it but it already has more features then
    any integration out there that we're aware of - just take a look at the
    docs [1].

For the time being, we will keep the HDFS integration (elasticsearch-hdfs)
as is but once the snapshot/restore feature is added to ES, we will add an
HDFS implementation for it within ElasticSearch Hadoop project.

We know that renames break thing but in this case, we think it's for the
best and hopefully the new features will make up for it :slight_smile:

Cheers,
Costin
http://twitter.com/costinl

[1] http://github.com/elasticsearch/elasticsearch-hadoop

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.