Bulk index slowing down as index size increases

Pyppe · May 27, 2020, 1:34pm

Phew, I think I finally found the root cause of the indexing speed slowing down. It's ICU4J transliteration, and the fact that after about 13.2M documents we start to have a lot of Chinese data.

I asked in a new post how to avoid duplicate transliterations (I am assuming the use of icu_transform in one property but with multiple fields results in running transliteration multiple times for the same identical text): ICU transform filters slowing down indexing: how avoid duplicate transliterations?

Thanks!

PS. @Christian_Dahlqvist feel free to continue with the good insights in the new post!

Topic		Replies	Views
Bulk indexing slow down when data amount increase Elasticsearch	6	2948	July 6, 2017
Indexing speed slowdown Elasticsearch	12	891	October 19, 2017
Slowly Indexing speed Elasticsearch	26	857	August 18, 2020
ElasticSearch Node goes down Elasticsearch	6	3714	July 27, 2019
Rapidly Degrading Bulk Indexing Performance Elasticsearch	7	368	July 6, 2017

Bulk index slowing down as index size increases

Related topics