I've read through much of the documentation for es-hadoop, but I might be
coming away with some misunderstandings.
The setup docs for Elasticsearch for Apache Hadoop (es-hadoop) use the
word "interact", which is a bit vague:
Elasticsearch for Apache Hadoop is an open-source, stand-alone,
self-contained, small library that allows Hadoop jobs (whether using
Map/Reduce or libraries built upon it such as Hive, Pig or Cascading) to
interact with Elasticsearch. Data flows bi-directionally so that
applications can leverage transparently the Elasticsearch engine
capabilities to significantly enrich their capabilities and increase the
So, does this mean I have a separate Hadoop instance (potentially built
upon HDFS or AWS EMR) and I can query data using either the Elasticsearch
(REST/Java/etc.) or Hadoop (Hive, Pig, Cascading) environments?