How to do bulk insert from Hive to Elasticsearch?

ravi_yadav · July 7, 2016, 10:28am

I know that there is a bulk API in ES which can load data from a file, stating that file should have metadata/action before actual data for all the records.

However my use case is to load data from Hive to ES which i'm doing by creating an External Hive table with ES serde. How can I leverage ES bulk load API to load data faster from Hive to ES?

I'm using ES2.1

Thanks!
Ravi

cbuescher · July 7, 2016, 10:49am

Hi,

are you aware of the Hive Integration in ES Hadoop? I'm not too familiar with it so I don't know if this might solve your problem. If not, it would be interesting to know why.

ravi_yadav · July 7, 2016, 1:22pm

Yes, I'm aware of Hive Integration with ES. We are using it to push data successfully to ES from Hive. But the process takes hours. I wanted to know whether bulk load API can improve performance of data load to ES from Hive. If yes, how to use it with Hive? I couldn't find any documentation on that.

cbuescher · July 7, 2016, 8:36pm

btw. there is a dedicated sub-forum for Hadoop-related questions like this at https://discuss.elastic.co/c/elasticsearch-and-hadoop, maybe folks there have more of an opinion about this question than here in the Elasticsearch forum.

ravi_yadav · July 8, 2016, 2:18am

Reposted there.. Thanks!

Topic		Replies	Views
How to do bulk insert from Hive to Elasticsearch for better data load performance? Elasticsearch es-hadoop	2	2983	July 6, 2017
How to get a better performance to load ElasticSearch data into Hive? Elasticsearch es-hadoop	1	399	February 22, 2021
Insert data into Elasticsearch from Hive in real-time Elasticsearch es-hadoop	4	2262	July 6, 2017
Collect data from HIVE Elasticsearch es-hadoop	4	1403	August 22, 2018
How to push data from Hadoop to ES? Elasticsearch es-hadoop	6	4156	July 21, 2017

How to do bulk insert from Hive to Elasticsearch?

Related topics