For large volumes of logs we are thinking of using HDFS as the data repository for Elasticsearch. Apache Flume is one option suggested by a few blogs on the internet. I would like to get more information from the community on this before coming to a conclusion. How do I instruct Elasticsearch to store data in HDFS and query/index that same data every time?
Logstash ---> Elasticsearch (want to use Kibana for visualization) ---> HDFS
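One way to sketch that pipeline is a single Logstash config with two outputs: one to Elasticsearch (for Kibana) and one to HDFS via the logstash-output-webhdfs plugin. The hosts, port, path, and user below are hypothetical placeholders, and the input is just an example:

```
# Hypothetical Logstash pipeline sketch: index logs into Elasticsearch
# for Kibana, and write a copy to HDFS through the WebHDFS REST API.
input {
  beats { port => 5044 }          # example input; adjust to your source
}
output {
  elasticsearch {
    hosts => ["http://localhost:9200"]
    index => "logs-%{+YYYY.MM.dd}"
  }
  webhdfs {
    host => "namenode.example.com"  # hypothetical NameNode host
    port => 50070                   # default WebHDFS port
    path => "/logs/%{+YYYY-MM-dd}/logstash-%{+HH}.log"
    user => "hdfs"
  }
}
```

Note this keeps Elasticsearch's own data on local disk; HDFS here is only an archival copy, not the live index storage.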
Thanks @magnusbaeck. ES-Hadoop, as far as I understood from the documentation, is for doing search on the Hadoop ecosystem, whereas my requirement is just to use the high storage capacity of Hadoop.
I will be collecting huge volumes of logs from 'n' number of microservices and need to store them in fast storage for better retrieval. Any thoughts on that?
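If the goal is only to use HDFS for its storage capacity rather than to search data living in Hadoop, one option worth looking at is the Elasticsearch repository-hdfs plugin, which lets you snapshot indices to HDFS while keeping the live indices on local disk for fast queries. A rough sketch, where the NameNode URI and repository path are hypothetical:

```
# Install the HDFS snapshot repository plugin (run from the ES home dir)
bin/elasticsearch-plugin install repository-hdfs

# Register an HDFS snapshot repository (hypothetical NameNode URI)
curl -XPUT 'localhost:9200/_snapshot/hdfs_repo' \
  -H 'Content-Type: application/json' -d '{
  "type": "hdfs",
  "settings": {
    "uri": "hdfs://namenode.example.com:8020/",
    "path": "elasticsearch/snapshots"
  }
}'

# Snapshot all indices to HDFS
curl -XPUT 'localhost:9200/_snapshot/hdfs_repo/snapshot_1?wait_for_completion=true'
```

This matches the "fast retrieval, cheap bulk storage" split: recent logs stay hot in Elasticsearch, older ones can be snapshotted out and restored on demand.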
I am leaning a bit towards MongoDB; I am doing research on Elasticsearch and MongoDB at the moment.