How to monitor or find logs regarding elasticsearch-hadoop plugin for hive

Harbeer_Kadian · December 1, 2017, 6:34am

I am using the elasticsearch-hadoop plugin inside Hive to insert records from Hive into Elasticsearch. I need to get hold of the insertion logs and, if possible, some way to monitor how the insertion is going. I checked the elasticsearch-hadoop documentation, but I was not able to understand how to check the logs.

If somebody has used this tool, please help me find out how to monitor the execution.

Here are the documentation links I referred to:
https://www.elastic.co/guide/en/elasticsearch/hadoop/current/logging.html
https://www.elastic.co/guide/en/elasticsearch/hadoop/current/metrics.html

james.baiera · December 13, 2017, 7:33pm

For finding the actual logs, you will have to check how your logging is configured for Hive. ES-Hadoop will log messages in two locations depending on what is being done. The first logging location will be on the HiveServer for any job configuration and simple job execution. The second logging location will be on the executors that Hive spins up on your cluster to perform more complicated distributed operations. If enabled, the logs will most certainly be mixed in with the regular Hive logs, so you'll have to dig a bit to get at them.

Harbeer_Kadian · December 14, 2017, 9:25am

I found the logs that contain the elasticsearch-hadoop metrics. In my case, I am running Hive on top of AWS EMR.

The metric logs can be found as follows:
1. Find the application id of your insert-into-Elasticsearch operation from hive-server2.log.
2. Go to the AWS console and open the EMR console.
3. Open your EMR cluster.
4. Open the LOGURI containing the log files under S3.
5. Go to node//applications/hive/hive-server2.out.gz
6. Find the application id of your step.
7. Now go to containers//<container_id>/syslog_attempt...
This file will give you the metrics results.
There can be many application ids, depending on how many queries Hive has run, as well as many container ids, depending on how much parallel execution is happening.
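
Since there can be many container log files, the last step above could be scripted once the log folder has been copied from S3 to a local directory. The following is only a rough sketch under some assumptions: the function names, the default search pattern "elasticsearch", and the "syslog" file-name prefix are illustrative choices and not something defined by elasticsearch-hadoop or EMR, so adjust them to whatever your metric lines and file names actually look like.

```python
import gzip
import re
from pathlib import Path


def open_log(path):
    """Open a plain or gzip-compressed log file as text."""
    if path.suffix == ".gz":
        return gzip.open(path, "rt", encoding="utf-8", errors="replace")
    return open(path, "rt", encoding="utf-8", errors="replace")


def find_matching_lines(log_root, pattern=r"elasticsearch", name_prefix="syslog"):
    """Yield (path, line_number, line) for each line that matches `pattern`.

    log_root    -- local directory holding the downloaded logs (assumption:
                   they were copied from the cluster's S3 log location first)
    pattern     -- case-insensitive regular expression to look for; the
                   default is a guess, not an official metric marker
    name_prefix -- only files whose names start with this prefix are scanned
    """
    regex = re.compile(pattern, re.IGNORECASE)
    # Walk every application/container sub-folder below log_root.
    for path in sorted(Path(log_root).rglob(name_prefix + "*")):
        if not path.is_file():
            continue
        with open_log(path) as handle:
            for number, line in enumerate(handle, start=1):
                if regex.search(line):
                    yield path, number, line.rstrip("\n")
```

For example, looping over find_matching_lines("path/to/downloaded/containers") and printing each tuple lists every matching line together with the file it came from, which makes it easier to see which container reported what.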

james.baiera · December 14, 2017, 7:37pm

Thanks for sharing your finds here!

system · January 11, 2018, 7:37pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
