HIve on tez or Hive on Spark?

sdavy · November 20, 2017, 1:23pm

Hello everybody,

I'm trying to play with Hive and Elastic. I've been able to deploy a basic Hive config and adding the elastic-hadoop-hive jars I can create an external table mapped on Hadoop. Now, I have some messages telling that using mr as execution engine will be deprecated. Then I understand that I should use tez or Spark.

Which one should I use? Tez seems a little bit complex to install (need to build from source using Maven), but people says that performances are better.

Does anyone have some opinion on this? What is the optimal setup for Hive on Elasticsearch?

Thanks a lot,

Stéphane

james.baiera · December 13, 2017, 6:01pm

Elastic doesn't take an official standpoint on whether to use MR, Tez, or Spark for Hive's execution layer. We primarily test our Hive solution on a local distribution that is based on MR.

system · January 10, 2018, 6:01pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Is there any jar to support elasticsearch integrated to tez on hadoop? Elasticsearch es-hadoop	2	298	July 11, 2023
Hive (HDP 2.3) and ES-Hadoop Integration Issue Elasticsearch es-hadoop	9	3897	July 6, 2017
Whether I should use elasticsearch-spark-20_2.11-5.2.2.jar other than elasticsearch-hadoop-hive-.5.2.2.jar for loading hive table into Elasticsearch? Elasticsearch es-hadoop	2	1167	May 5, 2017
Hive integration - Which jar do we need? Elasticsearch es-hadoop	3	924	October 2, 2018
Data Integration between Hadoop - Hive and Elastic Search Elasticsearch es-hadoop	3	830	February 10, 2022

HIve on tez or Hive on Spark?

Related topics