CCR with an BI-directional replication setup and performance impact

kley · August 14, 2019, 7:50am

Hi
So, we are currently performance testing our CCR setup to get some kind of understanding on how much more hardware we need to accomplish the same indexing numbers as without CCR.
As said in the title, we are running a CCR setup like this:

Where each datacenter (DC) has 3 ES nodes (3 primary shards one replica) on m5.large AWS machines.

Ok, when putting on an unbound load to index some documents (CCR disabled) we see a CPU consumption which looks like this:

As expected, indexing uses most of the CPU which is fine.

If we turn on CCR now, we see a consumption like this:

Way less "writing" but the Background thread CPU kicks in - which I assume are CCR related tasks like reading, unpacking lucene stored blocks and indexing (?) those.
We did see, that the indexing rate is going down for like ~40-45% which is expected, just because of the nature of our BI-directional setup.

So, my two questions are:

What steps in CCR are involved that generate such a high load for the CPU ?
Is there a rule of thump for calculating on how much hardware we have to add that we can accomplish the same numbers as without CCR ? Right now, we cannot tackle that one down. We increased the power of the AWS machines which basically had no impact at all.

Thank you in advance!

nhat · August 16, 2019, 8:46pm

@kley Thank you for your interest in Elasticsearch and CCR.

You're right. There's a reading task in the leader and indexing on the follower. It would be great if you can share with us privately the hot threads and the diagnostic. You can email it to me at nhat.nguyen at elastic.co Thank you!

kley · August 19, 2019, 1:15pm

Hey @nhat!

Sorry for replying late, had some days off

I will gather all the information you need and hit you up privately as soon as I got everything.

Thanks for helping me out!

system · September 16, 2019, 1:16pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Performance management in CCR Elasticsearch ccr-cross-cluster-replication	5	1001	August 2, 2021
Dec 22nd, 2018: [CN][Elasticsearch] 跨集群复制（CCR）的使用 Advent Calendar ccr-cross-cluster-replication	1	4422	December 1, 2019
Unique search for BiDirectional CCR Elasticsearch ccr-cross-cluster-replication	1	721	June 23, 2020
CCR replication paused indices Kibana ccr-cross-cluster-replication	1	503	February 21, 2023
High CPU load (3-4) and 50% CPU utilization Elasticsearch	5	3963	July 6, 2017

CCR with an BI-directional replication setup and performance impact

Related topics