Huge performance degradation during bulk indexing

cfeenstra67 · April 9, 2019, 3:41pm

Thanks for the reply! I have a couple of follow-up questions, if you don't mind. What sort of things affect disk I/O usage: the number of shards you're indexing into, the total size of those shards, the number of documents being indexed? Intuitively I would guess that each of these has some impact, but is one of them typically the driver of high I/O?

There are definitely spikes in I/O throughput (I can't immediately figure out how to look at iowait), but they don't seem to explain the spikes in search latency. For example, during our nightly job, when we reindex the entire dataset in the background, read and write throughput are both much higher, yet we don't experience the same latency spikes. That's not to say I/O isn't the problem, just that it isn't immediately obvious, or it could be a related issue.
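On the iowait question: assuming the nodes run Linux (the thread doesn't say), one rough way to get a number is to sample the aggregate `cpu` line of `/proc/stat` twice and compare the iowait ticks to the total ticks elapsed. The sketch below is only an illustration of that idea; the function names are made up for this example, and `iostat -x` from the sysstat package reports a similar `%iowait` figure if it is installed.

```python
import time


def parse_cpu_line(line):
    """Return (iowait_ticks, total_ticks) from the aggregate 'cpu' line of /proc/stat."""
    fields = line.split()
    if fields[0] != "cpu":
        raise ValueError("expected the aggregate 'cpu' line")
    values = [int(v) for v in fields[1:]]
    # Field order: user nice system idle iowait irq softirq steal guest guest_nice.
    # guest and guest_nice are already included in user and nice, so only the
    # first eight fields are summed for the total.
    return values[4], sum(values[:8])


def iowait_percent(before, after):
    """Percentage of CPU time spent in iowait between two samples of the 'cpu' line."""
    io0, total0 = parse_cpu_line(before)
    io1, total1 = parse_cpu_line(after)
    elapsed = total1 - total0
    if elapsed <= 0:
        return 0.0
    return 100.0 * (io1 - io0) / elapsed


def read_cpu_line(path="/proc/stat"):
    """Read the first line of /proc/stat, which is the aggregate 'cpu' line."""
    with open(path) as f:
        return f.readline()


if __name__ == "__main__":
    first = read_cpu_line()
    time.sleep(1.0)  # sampling interval; an arbitrary choice for this sketch
    second = read_cpu_line()
    print(f"iowait: {iowait_percent(first, second):.1f}%")
```

Running this in a loop on a data node while a bulk request is in flight, and again during the nightly reindex, would give a comparable iowait figure for the two cases.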
