I see that Hive and Elasticsearch are almost equivalent except that
Elasticsearch supports near real time queries. Moreover, Elasticsearch can
run independently to store and analyze data. So why people use both Hive
and Elasticsearch on Hadoop ?
Actually they are completely different.
Hive is a library built on top of Hadoop that uses a SQL-like query language to transform (mainly read) data.
Elasticsearch is a real-time search and analytics engine.
You can read the docs of each library/product to see the differences or better yet, take a look at the various demos out
there.
As for why folks use Hive and Elasticsearch? Because as a Hadoop user (using Hive) by using Elasticsearch one can
leverage its
powerful search capabilities and easily slice and dice data.
Arguably one could do the same with Hive however it's not at all trivial - a simple example is doing geo-search.
Hope this clarifies this a bit.
P.S. Elasticsearch does not depend on Hadoop however it is integrated with Hadoop (Map/Reduce, Hive, Pig, Spark,
Cascading ) through
Elasticsearch-Hadoop project.
On 8/31/14 2:01 PM, vak5d6 wrote:
I see that Hive and Elasticsearch are almost equivalent except that Elasticsearch supports near real time queries.
Moreover, Elasticsearch can run independently to store and analyze data. So why people use both Hive and Elasticsearch
on Hadoop ?
Could you give a specific use case that combining Hive and ES to enhance
each other ?. I wonder that why don't we just use ES to retrieve data, big
data ? Thank you.
Actually they are completely different.
Hive is a library built on top of Hadoop that uses a SQL-like query
language to transform (mainly read) data.
Elasticsearch is a real-time search and analytics engine.
You can read the docs of each library/product to see the differences or
better yet, take a look at the various demos out there.
As for why folks use Hive and Elasticsearch? Because as a Hadoop user
(using Hive) by using Elasticsearch one can leverage its
powerful search capabilities and easily slice and dice data.
Arguably one could do the same with Hive however it's not at all trivial -
a simple example is doing geo-search.
Hope this clarifies this a bit.
P.S. Elasticsearch does not depend on Hadoop however it is integrated with
Hadoop (Map/Reduce, Hive, Pig, Spark, Cascading ) through
Elasticsearch-Hadoop project.
On 8/31/14 2:01 PM, vak5d6 wrote:
I see that Hive and Elasticsearch are almost equivalent except that
Elasticsearch supports near real time queries.
Moreover, Elasticsearch can run independently to store and analyze data.
So why people use both Hive and Elasticsearch
on Hadoop ?
--
You received this message because you are subscribed to a topic in the
Google Groups "elasticsearch" group.
To unsubscribe from this topic, visit https://groups.google.com/d/
topic/elasticsearch/4jIju1jY6v0/unsubscribe.
To unsubscribe from this group and all its topics, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/
msgid/elasticsearch/540498A4.5010202%40gmail.com.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.