When we load data from Hadoop into Elasticsearch (via es-hadoop), we keep seeing errors like this in the tasks:
org.elasticsearch.hadoop.EsHadoopException: Could not write all entries [99/347072] (maybe ES was overloaded?). Bailing out...
Since our Hadoop cluster can read and write data at an enormous rate, I am not surprised that our (much smaller) Elasticsearch cluster cannot keep up. Fair enough. So this question is not about tuning Elasticsearch for faster indexing.
My question is: why can Elasticsearch not push back in some way and slow the Hadoop job down to a rate that Elasticsearch can sustain? It seems Elasticsearch will happily keep accepting data at a rate it simply cannot handle...
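For reference, this is roughly how the job writes. This is only a minimal sketch of the MapReduce EsOutputFormat path; the hosts, index name, and setting values below are illustrative placeholders rather than our actual configuration, and the es.batch.* options are the standard es-hadoop bulk settings.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;
import org.elasticsearch.hadoop.mr.EsOutputFormat;

public class EsLoadJob {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        conf.set("es.nodes", "es-node1:9200");        // placeholder ES host
        conf.set("es.resource", "myindex/mytype");    // placeholder index/type

        // Standard es-hadoop bulk settings (values here are illustrative):
        conf.set("es.batch.size.entries", "1000");    // docs per bulk request
        conf.set("es.batch.size.bytes", "1mb");       // bytes per bulk request
        conf.set("es.batch.write.retry.count", "3");  // retries before the task bails out
        conf.set("es.batch.write.retry.wait", "10s"); // wait between retries

        // es-hadoop docs recommend disabling speculative execution for ES writes
        conf.setBoolean("mapreduce.map.speculative", false);
        conf.setBoolean("mapreduce.reduce.speculative", false);

        Job job = Job.getInstance(conf, "load-into-es");
        job.setOutputFormatClass(EsOutputFormat.class);
        // mapper and input setup omitted; job.waitForCompletion(true) would submit
    }
}
```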