Elasticsearch Hadoop

Badal_Mohapatra · January 9, 2014, 7:49am

Hi,

To index Hadoop data into elasticsearch as I understand,
We create an external table with essstorage handler and then copy the data
from another internal hive table doesn't it duplicate the data in HDFS?
Is there any way to use the hive internal tables directly to index instead
of having two tables with same data?

Kind Regards,
Badal

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/ed08fd38-05e4-437a-a8e2-3295f2195e2a%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

costin · February 3, 2014, 10:44am

There is no duplication per-se in HDFS. Hive tables are just 'views' of data - one sits unindexed, in raw format in HDFS
the other one is indexed and analyzed in Elasticsearch.

You can't combine the two since they are completely different things - one is a file-system, the other one is a search
and analytics engine.

On 09/01/2014 9:49 AM, Badal Mohapatra wrote:

Hi,
To index Hadoop data into elasticsearch as I understand,
We create an external table with essstorage handler and then copy the data from another internal hive table doesn't it
duplicate the data in HDFS?
Is there any way to use the hive internal tables directly to index instead of having two tables with same data?

Kind Regards,
Badal

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/ed08fd38-05e4-437a-a8e2-3295f2195e2a%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
Costin

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/52EF730F.4060508%40gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.

Topic		Replies	Views
Read from elasticsearch into Hive Elasticsearch es-hadoop	3	1149	February 10, 2017
Duplicate data on hadoop Elasticsearch	2	813	July 6, 2017
How to push data from Hadoop to ES? Elasticsearch es-hadoop	6	4156	July 21, 2017
Store indexes in ES while the data stays in HDFS Elasticsearch es-hadoop	4	965	July 6, 2017
Hive external table automatically send data to elasticsearch Elasticsearch es-hadoop	2	851	July 6, 2017

Elasticsearch Hadoop

Related topics