transport.tcp.compress slowing down shard relocation

It looks like enabling TCP compression actually slows down the transfer rate of shards. With transport.tcp.compress enabled I'm basically stuck at 13 MB/s, even though I've set indices.recovery.max_bytes_per_sec to -1. As a result, a 50 GB shard now takes about 1.2 h to transfer when it used to take 36 min. Is there anything we can do to get around this? Is max_bytes_per_sec not respected when compression is enabled? On a large cluster in AWS compression saves a fair amount of money on transfer costs, but it's disappointing to see such a slow transfer rate. It's important to note that the CPU is nearly idle, so I wouldn't think the transfer is CPU-bound.
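For anyone wanting to reproduce this, here is a minimal sketch of how the recovery throttle can be set at runtime through the cluster settings API. The endpoint address and the use of the `requests` library are assumptions for illustration; adjust for your own cluster and authentication.

```python
# Minimal sketch: disabling the recovery throttle via the cluster
# settings API, mirroring the -1 value from the post above.
# The localhost:9200 endpoint is an assumption; adjust as needed.
import requests

resp = requests.put(
    "http://localhost:9200/_cluster/settings",
    json={
        "transient": {
            # -1 removes the recovery bandwidth cap entirely.
            "indices.recovery.max_bytes_per_sec": -1
        }
    },
)
resp.raise_for_status()
print(resp.json())  # should acknowledge the settings change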

What version of Elasticsearch are you using? If you're running 6.x, this thread might be helpful: slow 6.x shard recovery. There, the only way I was able to attain the configured max_bytes_per_sec was to disable compression, and that was due to a change in recovery concurrency between versions. If you're running a different version, adjusting settings may improve shard recovery/relocation time with compression enabled, specifically indices.recovery.concurrent_streams.
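If you want to try the concurrency angle, a sketch of bumping that setting through the same API is below. Note that indices.recovery.concurrent_streams only exists on older releases (it was removed in a later settings rework), and the endpoint and value here are illustrative assumptions, not recommendations.

```python
# Sketch only: raising recovery stream concurrency on clusters that
# still support indices.recovery.concurrent_streams. The endpoint
# and the value of 5 are assumptions for illustration.
import requests

resp = requests.put(
    "http://localhost:9200/_cluster/settings",
    json={"transient": {"indices.recovery.concurrent_streams": 5}},
)
resp.raise_for_status()
print(resp.json())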

Yeah, it looks like compression causes the data to be compressed twice during recovery. There's a GitHub issue about this. Guess I'll just disable it for now and pay the bandwidth costs :frowning:
