ELK and Hadoop integration

ajays · August 29, 2015, 2:54pm

Hi,

We are having one ELK setup and logs from various sources like apache, nginx, system messages etc are processed. We keep the ES data for 1 month and dump the rest to s3 bucket. For analysing the history data we need to again restore the content of s3 to elasticsearch. Now I want to integrate ELK with hadoop so that we can see the old data in kibana. I heard of elasticsearch-hadoop. Can anybody plz help how to dump all data to hadoop via elasticsearch and vice versa?

LogStash--->ElasticSearch--->Hadoop

AjayS

ajays · August 29, 2015, 6:54pm

Hi,

I saw on web that we can use hive/pig to get the data from hadoop to elasticsearch but what are the ways to do other way round. Can we use elasticsearch-hadoop connectors for this purpose? Can somebody give me some example/link?

Regards,
AjayS

costin · August 29, 2015, 7:16pm

Have you looked at the project reference documentation ? It explains the architecture, configuration and how to both read and write data through the connector for each library with code examples.

ajays · August 30, 2015, 2:16pm

@costin,

I watched your webinar video and I am trying your example but getting this error.

[root@localhost bin]# ./hive -hiveconf hive.aux.jars.path=///opt/elasticsearch-hadoop-2.1.1/dist/elasticsearch-hadoop-2.1.1.jar

Logging initialized using configuration in jar:file:/usr/local/apache-hive-1.2.1-bin/lib/hive-common-1.2.1.jar!/hive-log4j.properties
hive> CREATE EXTERNAL TABLE eslogs (time STRING, extension STRING, clientip STRING, request STRING, response INT, agent STRING) STORED BY 'org.elasticsearch.hadoop.hive.ESStorageHandler' TBLPROPERTIES('es.nodes' = '192.168.44.175','es.port' = '9200','es.resource' = 'demo/hive','es.mapping.names' = 'time:@timestamp');
FAILED: SemanticException Cannot find class 'org.elasticsearch.hadoop.hive.ESStorageHandler'

Regards,
AjayS

costin · August 31, 2015, 8:55pm

The classpath is not properly set; use an hdfs classpath instead of a local one since Hive tends to be picky about these things:
https://www.elastic.co/guide/en/elasticsearch/hadoop/current/hive.html#_installation_3

Your distro documentation might/should provide more info

ajays · September 1, 2015, 3:26am

@costin,

Thanks. Let me check and get back to you with the results.

Regards,
AjayS

Topic		Replies	Views
Connecting hadoop and elasticsearch Elasticsearch es-hadoop	3	1453	September 20, 2019
Query on Indexing using es-hadoop Elasticsearch es-hadoop	6	1957	July 6, 2017
Data Integration between Hadoop - Hive and Elastic Search Elasticsearch es-hadoop	3	830	February 10, 2022
Collect data from HIVE Elasticsearch es-hadoop	4	1403	August 22, 2018
Elasticsearch Hadoop Connection Elasticsearch es-hadoop	2	1373	November 13, 2019

ELK and Hadoop integration

Related topics