How to stop index from getting throttled?

hjazz6 · June 1, 2023, 8:11am

Hi,

I am ingesting netflow data into my ES 8.3.3 node at a very high rate. As I increase the ingest to ES where the index rate is about 30+K/s, I started to get the messages below:

[INFO] [o.e.i.e.I.EngineMergeScheduler] ... now throttling indexing: numMergesInFlight=10, maxNumMerges=9
[INFO] [o.e.i.e.I.EngineMergeScheduler] ... stop throttling indexing: numMergesInFlight=8, maxNumMerges=9

I have tried to increase the refresh_interval to 30s, but that did not help.

I noticed that as I increase the ingest rate, I can never go above 34K/s index rate, and I would like to index at a much higher rate.

There is a StackOverflow post that talks about increasing index.merge.scheduler.max_merge_count and index.merge.scheduler.max_thread_count, but as the answer is a few years old, I'm not sure if it is still valid.

I'm simply using the netflow module that came with filebeat and the default mapping. Would specifying the index mapping help (currently, most fields are by default mapped to both text and keyword, which is not necessary).

What are some of the things I can do to prevent the index throttling and increase the index rate?

Thank you.

Christian_Dahlqvist · June 1, 2023, 8:15am

What is the size and specification of your cluster? What type of hardware are you using? Are you using local SSDs?

hjazz6 · June 1, 2023, 8:25am

I only have 1 node ("number_of_replicas": "0").

It is on a Linux server running RHEL 7.9, with 6TB of local SSDs, 755GB of RAM (64GB ringfenced for ES), and 96 CPUs.

Christian_Dahlqvist · June 1, 2023, 8:29am

How many shards are you actively indexing into?

How many concurrent indexing threads are you using in the process indexing data into Elasticsearch?

What bulk size are you using?

What is the average size of the documents you are indexing?

hjazz6 · June 1, 2023, 8:31am

I'm only indexing into 1 shard (index).

How do I find out how many concurrent indexing threads and bulk size I'm using?

Christian_Dahlqvist · June 1, 2023, 8:32am

That depends on what you are using to index data into Elasticsearch, e.g. Logstash or one of the beats.

hjazz6 · June 1, 2023, 8:39am

I'm using filebeat.

Christian_Dahlqvist · June 1, 2023, 8:40am

How is Filebeat configured?

hjazz6 · June 1, 2023, 8:44am

I configured the following in my filebeat.yml.

output.elasticsearch:
  bulk_max_size: 4000
  worker: 8

In netflow.yml, I also have queue_size: 64000.

Christian_Dahlqvist · June 1, 2023, 8:50am

I have not tuned Filebeat in a very long time so may have to leave that for someone else. Have you tried increasing the number of workers or the bulk size? Did that have any effect?

hjazz6 · June 1, 2023, 8:55am

I have yet to tune these parameters, as I did not think that the problem might lie with filebeat. I'll try increasing their values and see if it works. Thanks!

Christian_Dahlqvist · June 1, 2023, 8:57am

What type of SSD are you using? Am surprised to see indexing throttled due to merging when using SSDs.

hjazz6 · June 1, 2023, 9:04am

They're SATA SSD, each 960GB.

Christian_Dahlqvist · June 1, 2023, 9:12am

Can you try increasing the number of primary shards to 2 for the next active index and see if that makes any difference?

hjazz6 · June 1, 2023, 9:33am

Ok, I'll give that a try.

hjazz6 · June 6, 2023, 6:00am

Increasing the number of primary shards to 2 seems to help. Previously, I would get a lot of "throttling indexing" messages when the indexing rate goes above 30K/s. Now, it was only after 3 hours that I got a "throttling indexing" message (indexing rate consistently above 30K/s), then another hour before getting another 2 such messages. I was even able to reach 35K/s without getting the message.

I'm not sure if the size of the index matters too? As the index was a custom index, I have not been able to apply the ILM on it yet. So the last "throttling indexing" message occurred when the index is 480GB containing 490M documents.

warkolm · June 6, 2023, 6:04am

I'd reduce the bulk_max_size to 3000, I think the default if 2000 so doubling it might not be the first best step.

Christian_Dahlqvist · June 6, 2023, 6:19am

That could very well explain it. I assumed you were using time based indices. You will not be able to apply ILM to an existing index, so I would recommend setting up a new data stream with ILM and direct all new data there. Once the existing index only contains old data I would remove it manually.

hjazz6 · June 7, 2023, 3:26am

I've applied ILM on the index so that it rollovers every 50GB, and I have not seen the "throttling indexing" message anymore. Thanks for your help!

system · July 5, 2023, 3:26am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Merge throttling is preventing heavy bulk indexing (ES 1.7.5) Elasticsearch	5	1977	July 5, 2017
"now throttling indexing" Elasticsearch	13	13468	July 5, 2017
Elasticsearch indexing performance: throttle merging Elasticsearch	10	8319	February 13, 2017
Elasticsearch Index throttling info message comes in es 1.3.1 version Elasticsearch	3	482	July 6, 2017
Indices.store.throttle.max_bytes_per_sec config setting and 2.2 Elasticsearch	3	2361	July 5, 2017

How to stop index from getting throttled?

Related topics