Elasticsearch and Hive work together


(vak5d6) #1

As I understand it, Hive and Elasticsearch are almost equivalent, except that
Elasticsearch supports near-real-time queries. Moreover, Elasticsearch can
run independently to store and analyze data. So why do people use both Hive
and Elasticsearch on Hadoop?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/d69148e8-603a-4dc8-9e67-c89d37a43dca%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(Costin Leau) #2

Actually, they are completely different.
Hive is a data warehouse layer built on top of Hadoop that uses a SQL-like query language to transform (mainly read) data.
Elasticsearch is a real-time search and analytics engine.

You can read the docs of each library/product to see the differences or, better yet, take a look at the various demos out there.
As for why folks use Hive and Elasticsearch together: a Hadoop user working in Hive can use Elasticsearch to leverage its powerful search capabilities and easily slice and dice data.
Arguably one could do the same with Hive alone; however, it's not at all trivial. A simple example is geo-search.
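
To make the geo-search point concrete, here is a minimal sketch in Python. The field name `location`, the coordinates, and both helper names are assumptions made up for illustration; nothing here comes from a real dataset. The first function computes a great-circle distance by hand, which is roughly the logic one would have to express in a Hive UDF or SQL expression. The second only builds the request body for Elasticsearch's `geo_distance` query, where the engine does that work against a `geo_point` field. The `bool`/`filter` wrapper follows the current query DSL; older releases used a different wrapper.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, good enough for a sketch


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two lat/lon points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def geo_distance_query(lat, lon, distance, field="location"):
    """Build a search body that keeps documents within `distance` of a point.

    `field` is a hypothetical geo_point field name.
    """
    return {
        "query": {
            "bool": {
                "filter": {
                    "geo_distance": {
                        "distance": distance,
                        field: {"lat": lat, "lon": lon},
                    }
                }
            }
        }
    }


if __name__ == "__main__":
    # Hand-rolled distance: what a Hive-side implementation must compute per row.
    print(round(haversine_km(48.8566, 2.3522, 51.5074, -0.1278)), "km")
    # Elasticsearch-side: the same intent expressed as a declarative filter.
    print(geo_distance_query(48.8566, 2.3522, "10km"))
```

The hand-rolled version has to run against every row, while the query body lets Elasticsearch use its own geo index, which is the practical difference being described.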

Hope this clarifies things a bit.

P.S. Elasticsearch does not depend on Hadoop; however, it is integrated with Hadoop (Map/Reduce, Hive, Pig, Spark, Cascading) through the Elasticsearch-Hadoop project.
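
As a rough illustration of what the Hive side of that integration looks like, the sketch below uses Python to render the HiveQL for an external table backed by an Elasticsearch index. The storage handler class and the `es.resource` / `es.nodes` properties are the ones Elasticsearch-Hadoop documents for Hive; the table name, columns, index name, host, and the helper function itself are hypothetical, and the exact form of `es.resource` depends on the Elasticsearch version in use.

```python
def es_backed_table_ddl(table, columns, es_resource, es_nodes="localhost:9200"):
    """Render HiveQL for an external table stored in Elasticsearch.

    columns: list of (name, hive_type) pairs, e.g. [("id", "BIGINT")].
    All argument values are illustrative assumptions.
    """
    cols = ",\n  ".join(f"{name} {hive_type}" for name, hive_type in columns)
    return (
        f"CREATE EXTERNAL TABLE {table} (\n  {cols}\n)\n"
        "STORED BY 'org.elasticsearch.hadoop.hive.EsStorageHandler'\n"
        f"TBLPROPERTIES('es.resource' = '{es_resource}', "
        f"'es.nodes' = '{es_nodes}');"
    )


if __name__ == "__main__":
    # Hypothetical table and index names, for illustration only.
    print(es_backed_table_ddl(
        "artists",
        [("id", "BIGINT"), ("name", "STRING")],
        "radio/artists",
    ))
```

Once such a table exists, ordinary Hive statements that read from or insert into it are served by Elasticsearch, which is how a Hive job can hand data over for search.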

On 8/31/14 2:01 PM, vak5d6 wrote:

I see that Hive and Elasticsearch are almost equivalent except that Elasticsearch supports near real time queries.
Moreover, Elasticsearch can run independently to store and analyze data. So why people use both Hive and Elasticsearch
on Hadoop ?

--
Costin



(vak5d6) #3

Could you give a specific use case in which combining Hive and ES lets each
enhance the other? I wonder why we don't just use ES alone to retrieve data,
even big data. Thank you.

On Mon, Sep 1, 2014 at 11:02 PM, Costin Leau costin.leau@gmail.com wrote:

Actually they are completely different.
Hive is a library built on top of Hadoop that uses a SQL-like query
language to transform (mainly read) data.
Elasticsearch is a real-time search and analytics engine.

You can read the docs of each library/product to see the differences or
better yet, take a look at the various demos out there.
As for why folks use Hive and Elasticsearch? Because as a Hadoop user
(using Hive) by using Elasticsearch one can leverage its
powerful search capabilities and easily slice and dice data.
Arguably one could do the same with Hive however it's not at all trivial -
a simple example is doing geo-search.

Hope this clarifies this a bit.

P.S. Elasticsearch does not depend on Hadoop however it is integrated with
Hadoop (Map/Reduce, Hive, Pig, Spark, Cascading ) through
Elasticsearch-Hadoop project.


