Reindex gets stuck

nayestaran · May 25, 2020, 8:56am

Hello Elastic community,

Context

We have two Elasticsearch clusters with 6 and 3 nodes each. The cluster with 6 nodes is the one we use in production environment and we use the one with 3 nodes for testing purposes. (We have the same problem in both clusters). All the nodes have the following characteristics:

Elasticsearch 7.4.2
1TB HDD disk
8 GB RAM

In our case, we need to reindex some of the indexes. Those indexes have billions of documents and a size between 50GB and 250GB.

Problem

Whenever we start reindexing, internally or from a remote source, the task starts working correctly but it reaches a point where it stops reindexing, without apparent reason. We can´t see anything in the logs. The task is not cancelled or anything, it only stops reindexing documents, it looks like the task gets stuck. We tried changing GC strategies, we used CMS and Shenandoah but nothing changes.

Has anyone run into the same problem?

system · June 22, 2020, 8:56am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Reindex GC overhead Elasticsearch	1	424	April 3, 2018
Reindexing stuck at some batch and fails with 'search context missing exception' Elasticsearch	3	1711	June 13, 2019
Reindex stopping on multiples of 5000 Elasticsearch	1	730	June 2, 2017
Why is reindex from remote constantly slowing down on large indices? Elasticsearch reindex	2	630	December 31, 2020
Improve reindex speed into new cluster Elasticsearch	4	1106	January 5, 2019

Reindex gets stuck

Related topics