org.elasticsearch.common.util.concurrent.EsRejectedExecutionException: rejected execution of org.elasticsearch.transport.TransportService

reeenz20 · January 29, 2018, 10:23am

Hi guys, I have this error on elasticsearch. The logs keeps on increasing. Urgently needed help here. Thank you!

org.elasticsearch.common.util.concurrent.EsRejectedExecutionException: rejected execution of org.elasticsearch.transport.TransportService$7@52bbdff1 on EsThreadPoolExecutor[search, queue capacity = 1000, org.elasticsearch.common.util.concurrent.EsThreadPoolExecutor@49c9d421[Running, pool size = 7, active threads = 7, queued tasks = 1000, completed tasks = 7265472

Bernt_Rostad · January 29, 2018, 10:40am

It means your Elasticsearch cluster has too many tasks to handle, your task queue is constantly full and the node will reject new tasks until its task queue drops below 1000 (the default max).

You basically have three choices:

Reduce the workload.
Add more nodes to the cluster to share the workload.
Get stronger hardware.

The first option may not always be feasible, if so you need to grow the clusters work capacity by either adding more nodes or improving the hardware. I see your pool size is just 7 which indicates a 4 CPU hardware. In general, Elasticsearch will use a pool size that is (1.5 x number of cores) + 1. For instance, if you have 24 CPUs the pool size will be 37, giving you 37 worker threads to handle the queued tasks.

Christian_Dahlqvist · January 29, 2018, 10:54am

How many concurrent queries are you serving? How many shards does each query typically address?

reeenz20 · January 29, 2018, 10:58am

Hi @Bernt_Rostad, sorry I'm kind of new here. How can I add more nodes to the cluster to share workloads? The hardware I have right now is
Intel Xeon E5-2666 v3 (Haswell)
4 vCPU
7.5 Mem

@Christian_Dahlqvist, How can I check the concurrent queries and shards?

Thank you.

Christian_Dahlqvist · January 29, 2018, 11:10am

How many shards typically match the index pattern your queries are using? You can list shards using the cat shards API.

How are you querying the cluster? Kibana?

If you have X-Pack monitoring installed, this can tell you how many queries the cluster is serving.

reeenz20 · January 29, 2018, 11:14am

I don't have XPACK installed. Here's the output:

filebeat-2017.11.23 2 p STARTED 37617 26.5mb 127.0.0.1 Eq-uD9o
filebeat-2017.11.23 2 r UNASSIGNED
filebeat-2017.11.23 0 p STARTED 37620 26.5mb 127.0.0.1 Eq-uD9o
filebeat-2017.11.23 0 r UNASSIGNED
filebeat-2017.10.06 1 p STARTED 437866 169.4mb 127.0.0.1 Eq-uD9o
filebeat-2017.10.06 1 r UNASSIGNED
filebeat-2017.10.06 3 p STARTED 438124 169.7mb 127.0.0.1 Eq-uD9o
filebeat-2017.10.06 3 r UNASSIGNED
filebeat-2017.10.06 4 p STARTED 437226 168.7mb 127.0.0.1 Eq-uD9o
filebeat-2017.10.06 4 r UNASSIGNED
filebeat-2017.10.06 2 p STARTED 436374 169.2mb 127.0.0.1 Eq-uD9o
filebeat-2017.10.06 2 r UNASSIGNED
filebeat-2017.10.06 0 p STARTED 437640 168.5mb 127.0.0.1 Eq-uD9o
filebeat-2017.10.06 0 r UNASSIGNED
filebeat-2018.01.24 1 p STARTED 58524 46.9mb 127.0.0.1 Eq-uD9o
filebeat-2018.01.24 1 r UNASSIGNED
filebeat-2018.01.24 3 p STARTED 58018 46.1mb 127.0.0.1 Eq-uD9o
filebeat-2018.01.24 3 r UNASSIGNED
filebeat-2018.01.24 4 p STARTED 58608 46.7mb 127.0.0.1 Eq-uD9o
filebeat-2018.01.24 4 r UNASSIGNED
filebeat-2018.01.24 2 p STARTED 58439 46.5mb 127.0.0.1 Eq-uD9o
filebeat-2018.01.24 2 r UNASSIGNED
filebeat-2018.01.24 0 p STARTED 58401 46.5mb 127.0.0.1 Eq-uD9o
filebeat-2018.01.24 0 r UNASSIGNED
filebeat-2017.11.19 1 p STARTED 15609 11mb 127.0.0.1 Eq-uD9o
filebeat-2017.11.19 1 r UNASSIGNED
filebeat-2017.11.19 3 p STARTED 15663 11.2mb 127.0.0.1 Eq-uD9o
filebeat-2017.11.19 3 r UNASSIGNED
filebeat-2017.11.19 4 p STARTED 15969 11.3mb 127.0.0.1 Eq-uD9o
filebeat-2017.11.19 4 r UNASSIGNED
filebeat-2017.11.19 2 p STARTED 15887 11.2mb 127.0.0.1 Eq-uD9o
filebeat-2017.11.19 2 r UNASSIGNED
filebeat-2017.11.19 0 p STARTED 15843 11.2mb 127.0.0.1 Eq-uD9o
filebeat-2017.11.19 0 r UNASSIGNED
filebeat-2017.11.08 1 p STARTED 32759 22.8mb 127.0.0.1 Eq-uD9o
filebeat-2017.11.08 1 r UNASSIGNED
filebeat-2017.11.08 3 p STARTED 32592 22.7mb 127.0.0.1 Eq-uD9o
filebeat-2017.11.08 3 r UNASSIGNED
filebeat-2017.11.08 4 p STARTED 33020 22.9mb 127.0.0.1 Eq-uD9o
filebeat-2017.11.08 4 r UNASSIGNED
filebeat-2017.11.08 2 p STARTED 32819 22.8mb 127.0.0.1 Eq-uD9o
filebeat-2017.11.08 2 r UNASSIGNED
filebeat-2017.11.08 0 p STARTED 32708 22.7mb 127.0.0.1 Eq-uD9o
filebeat-2017.11.08 0 r UNASSIGNED

There's a lot more. thanks!

Christian_Dahlqvist · January 29, 2018, 11:16am

Can you provide the full output of the cluster stats API? I suspect you have too many shards as your indices seem to use the default 5 shards and go back some time. Please read this blog post with guidelines about shards and sharding and try to reduce you shard count. This should allow you to query longer time periods without addressing too many shards.

reeenz20 · January 29, 2018, 12:25pm

Here the output for cluster stats. Btw, I'm using Elasticsearch 5.4.2.

I'm looking at the post you just sent right now.

Christian_Dahlqvist · January 29, 2018, 12:34pm

Yes, it looks like you have a lot of very small shards, which is causing problems. You should be able to create an index template that sets the number of primary shards to 1 for all new indices and then use the shrink index API to reduce the primary shard count to 1 for all existing indices. It may be worthwhile for you to consider using monthly indices with a single shard.

reeenz20 · January 29, 2018, 12:46pm

Will shrinking the shards affect the logs? Because I wanted to keep my logs intact for atleast a year.

Christian_Dahlqvist · January 29, 2018, 12:52pm

It will not affect the data, just reduce the shard count.

reeenz20 · January 29, 2018, 1:19pm

Okay. I've created a template. How do I use the shrink Index API to reduce the primary shard count to 1?

{
"filebeat" : {
"order" : 0,
"template" : "filebeat-",
"settings" : {
"index" : {
"number_of_shards" : "1",
"refresh_interval" : "5s"
}
},
"mappings" : {
"default" : {
"dynamic_templates" : [
{
"template1" : {
"mapping" : {
"ignore_above" : 1024,
"index" : "not_analyzed",
"type" : "{dynamic_type}",
"doc_values" : true
},
"match" : ""
}
}
],
"_all" : {
"norms" : {
"enabled" : false
},
"enabled" : true
},
"properties" : {
"@timestamp" : {
"type" : "date"
},
"geoip" : {
"dynamic" : true,
"type" : "object",
"properties" : {
"location" : {
"type" : "geo_point"
}
}
},
"offset" : {
"type" : "long",
"doc_values" : "true"
},
"message" : {
"index" : "analyzed",
"type" : "string"
}
}
}
},
"aliases" : { }
}
}

Christian_Dahlqvist · January 29, 2018, 1:34pm

Did you look at the documentation I linked to?

reeenz20 · January 29, 2018, 1:38pm

Yes, but I'm quite confused on the shrink API part. Btw, did i get the template correctly or I messed it up? Thanks!

Bernt_Rostad · January 29, 2018, 1:39pm

Adding a new node to a cluster is just as easy as starting up a single node, just make sure to use the same cluster.name in the elasticsearch.yml file of the new server and that the other server is listed in the discovery.zen.ping.unicast.hosts field (also in elasticsearch.yml).

As soon as you start up the new Elasticsearch instance it will try to connect to the named cluster by asking the server listed in the discovery host field if such a cluster is available. For how Elasticsearch discovers a cluster have a look at Discovery.

reeenz20 · January 30, 2018, 6:42am

I have a lot of indices, Do I need to shrink the index one by one?

Christian_Dahlqvist · January 30, 2018, 8:21am

Yes, I believe so, but you should be able to script it. Another option would be to use the reindex API to reindex your daily indices into monthly indices, after which you can simply delete the daily indices.

reeenz20 · January 30, 2018, 8:38am

How will I configure the index pattern on kibana to me new index created?

Christian_Dahlqvist · January 30, 2018, 8:43am

The filebeat-* index pattern will match both filebeat-2018.11.19 and filebeat-2018.11. While you are creating the new index your data for that month will exist in 2 matching indices, which means you will get incorrect results until it has completed. As it looks like you have quite small indices I would expect the reindex operation to be quite quick though.

reeenz20 · January 30, 2018, 8:59am

So basically I need to do.

Create a template

{
"filebeat" : {
"order" : 0,
"template" : "filebeat-",
"settings" : {
"index" : {
"number_of_shards" : "1",
"refresh_interval" : "5s"
}
},
"mappings" : {
"default" : {
"dynamic_templates" : [
{
"template1" : {
"mapping" : {
"ignore_above" : 1024,
"index" : "not_analyzed",
"type" : "{dynamic_type}",
"doc_values" : true
},
"match" : ""
}
}
],
"_all" : {
"norms" : {
"enabled" : false
},
"enabled" : true
},
"properties" : {
"@timestamp" : {
"type" : "date"
},
"geoip" : {
"dynamic" : true,
"type" : "object",
"properties" : {
"location" : {
"type" : "geo_point"
}
}
},
"offset" : {
"type" : "long",
"doc_values" : "true"
},
"message" : {
"index" : "analyzed",
"type" : "string"
}
}
}
},
"aliases" : { }
}
}

is that correct? How will I know if the for all new indices they will use that template?

Shrink the indices 1 by 1. (My current index is filebeat-YYYY.MM.dd) would it be okay to shrink it with the same index?
Reindex.

Topic		Replies	Views
ElasticSearch with > 40 nodes, missing shards and indexing troubles Elasticsearch	11	652	July 6, 2017
Changing shard number per index due to EsRejectedExecutionException Kibana	3	1268	July 6, 2017
20 shards per 5 nodes, thoughts Elasticsearch	9	566	July 6, 2017
Slow Shard Assignment Elasticsearch	6	1808	July 6, 2017
New index immediately becomes red Elasticsearch	8	2061	July 6, 2017

org.elasticsearch.common.util.concurrent.EsRejectedExecutionException: rejected execution of org.elasticsearch.transport.TransportService

Related topics