Throttling write speed in ES-Hadoop Connector

Kamaldeep_Singh · October 7, 2016, 8:10pm

Hello,

I was working on ES - hadoop connector and I see that if you server has less memory writes keep on getting dropped.
org.elasticsearch.hadoop.EsHadoopException: Could not write all entries (maybe ES was overloaded?). Bailing out...

As mentioned in Pushback to hadoop from es on bulk load there's no bi-directional communication between Hadoop and the connector - the connector cannot say, there's too much data, slow down.

Does anyone think it might be a good idea to use sth. like Blocking Queues here and add acks while writing (kafka 101) so as to let the consumer (thread on ES) read at slower pace.

Else we would have to tune the batch size, write speed, http timeouts ourself.

I'm open to building/contributing to this its a good idea.

zhifengMaBeijing · November 6, 2016, 11:57pm

I like the idea!

Topic		Replies	Views
Throttle the ES-Hadoop write speed Elasticsearch es-hadoop	3	651	September 29, 2020
Pushback to hadoop from es on bulk load Elasticsearch es-hadoop	9	10630	July 6, 2017
Pushback to hadoop Elasticsearch	2	1047	July 6, 2017
Overloading during bulk write from Spark Elasticsearch es-hadoop	5	3448	June 27, 2017
[HADOOP] Anyone used TransportClient for writing to ES from Hadoop mappers? Elasticsearch	3	452	July 6, 2017

Throttling write speed in ES-Hadoop Connector

Related topics