Integration of hadoop (specifically HDFS files) with ELK stack

Yukti_Agrawal · July 29, 2019, 3:01am

I am trying to integrate hadoop with ELK stack. My use case is " i have to get a data from a file present in HDFS path (AVRO format) and show the contents on kibana dashboard"

Anybody is having any article with step by step process?

james.baiera · August 14, 2019, 7:07pm

Unfortunately I don't have any step-by-step content for doing this, but assuming that you are hosting YARN with your distribution of HDFS, the easiest process for this might be to use something like Spark to read the AVRO files in parallel and ship it to Elasticsearch using ES-Hadoop.

I think the fact that the data is in AVRO format will be more of a limiting factor than the fact that it lives on HDFS. Spark and other Hadoop ecosystem technologies are usually better suited to ETL of that kind of data than other out of the box tools.

system · September 11, 2019, 7:07pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Hadoop/Mapr and ELK Elasticsearch es-hadoop	2	1276	December 12, 2019
Query on Indexing using es-hadoop Elasticsearch es-hadoop	6	1957	July 6, 2017
How to deserialize avro file and use in elastic search Elasticsearch	1	768	August 18, 2018
Data Integration between Hadoop - Hive and Elastic Search Elasticsearch es-hadoop	3	830	February 10, 2022
Pulling data from HDFS to elasticsearch Elasticsearch es-hadoop	2	1231	July 6, 2017

Integration of hadoop (specifically HDFS files) with ELK stack

Related topics