Direct correlation between pending tasks and GC

tutunak · January 11, 2021, 10:42am

Hi for all my clusters I see the direct correlation between pending tasks and GC time.

What can I do with that? How can avoid this situation?

tutunak · January 12, 2021, 12:08pm

Most of the pending tasks are snapshot related. We are decided to disable snapshots.

DavidTurner · January 12, 2021, 2:30pm

Snapshots are the last line of defence against data loss so its important to use them.

What's the actual issue here? A few ms of GC every 30 minutes or so doesn't sound like a problem that needs solving, and disabling snapshots will cause pain in future.

tutunak · January 16, 2021, 3:42pm

Snapshots aren't mandatory for us. We can easily restore data. The main problem, that our application stop writes to elastic search when we have pending tasks. In this situation, we have to "freeze" work to 5 minutes every hour and it isn't acceptable.

DavidTurner · January 16, 2021, 4:17pm

Pending tasks are processed on the master and are therefore always a bottleneck since there's only ever one master. If you are performance-sensitive then you should definitely avoid having master-related actions on your indexing pathway - this mainly means dynamic index creation and dynamic mapping updates.

I also don't see how a few ms of GC translates into a five-minute "freeze", that's definitely not normal.

tutunak · January 17, 2021, 12:18pm

There are minutes when GC works (spikes on the graphs). During this time you can see pending tasks spike. Our current solution stops write the data to the elastic if sees that we have pending tasks. From the elastic side, I suppose data will proceed. But we overloaded it in the past, so we decided to make the current solution as you see. I'm not sure that it was the right solution.

DavidTurner · January 17, 2021, 12:23pm

Ok, that makes more sense. Don't do that. You absolutely can carry on indexing whether the master is busy or not.

system · February 14, 2021, 12:23pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Pending Tasks increase more quickly and master do garbagge collector with failures Elasticsearch	2	804	July 5, 2017
Pending tasks accumulating with ES 8.17.3 Elasticsearch	1	31	December 4, 2025
Elasticsearch cluster have millions of pending tasks Elasticsearch	15	1216	June 8, 2021
Elasticsearch has accumulated a lot of pending tasks, and stop indexing Elasticsearch	13	1141	October 4, 2023
Increasing number of pending tasks despite small number of shards Elasticsearch	4	1218	June 23, 2021

Direct correlation between pending tasks and GC

Related topics