Error Hadoop/ElasticSearch Too Many Requests

Guillermo_Ortiz · March 29, 2016, 8:21am

I'm executing Spark againts ElasticSearch using the ElasticSearch API.

I have 6 executors with one core each one. There are not queued tasks. I only have two ElasticNodes with 8 cores and 32 Gb but it seems that they should handle that traffic.

I have checked the elasticsearch logs as well but there aren't any log.

Right now, I have reduce the number of executors to 3 to see what it happens.

Is it really too many producers? it seems that there are not since they are not queued tasks and I checked as well the CPU usage for the ElasticSearch nodes and it's about 30%.

User class threw exception: org.apache.spark.SparkException: Job aborted due to stage failure: Task 4 in stage 82742.0 failed 4 times, most recent failure: Lost task 4.3 in stage 82742.0 (TID 262382,xxxx): org.elasticsearch.hadoop.rest.EsHadoopInvalidRequest: Found unrecoverable error [xxx:9200] returned Too Many Requests(429) - rejected execution of org.elasticsearch.transport.TransportService$4@2c70992a on EsThreadPoolExecutor[bulk, queue capacity = 50, org.elasticsearch.common.util.concurrent.EsThreadPoolExecutor@294f7f8b[Running, pool size = 8, active threads = 8, queued tasks = 50

Guillermo_Ortiz · March 30, 2016, 9:23am

I changed the number of executors to three executors, one core each one and after 10 hours I got the same error.

Any idea?

costin · April 5, 2016, 2:52pm

It looks like you are actually using ES-Hadoop.

Unfortunately ES seems to be failing behind - slowly but surely. In this case, it seems after 10h. You could of course, use 2 executors however I strongly recommend monitoring the cluster to understand what's the cause of it:

a. does the cluster remain out of memory and the GC cause the nodes to slow down
b. based on your initial error, it looks like the queue grows to big - this means indexing is starting to slow down; maybe the disks are too slow?
c. any other processes that are stealing CPU and IO?

Considering the long time - 10h - maybe there's some external processes (crontab, antivirus, etc...) that kicks in while you are ingesting data, and causes the OS to slow down which in turn affects ES which ends up rejecting requests and thus aborting the job.

Topic		Replies	Views
Elasticsearch es-spark too many request Elasticsearch	5	1513	October 18, 2019
Hive -> ES - Too Many Requests(429) Elasticsearch es-hadoop	8	3358	July 6, 2017
org.elasticsearch.common.util.concurrent.EsRejectedExecutionException: rejected execution of org.elasticsearch.transport.TransportService Elasticsearch	25	8450	February 27, 2018
Too Many Requests error from bulk thread pool rather than index thread pool Elasticsearch	1	1822	March 31, 2018
ThreadPoolExecutor overused Elasticsearch	8	3405	July 5, 2017

Error Hadoop/ElasticSearch Too Many Requests

Related topics