Does adding data nodes increase write throughput?

rex-remind · January 15, 2021, 4:39am

The question: all other things being equal, and assuming an even distribution of shards, does adding data nodes increase write throughput?

My assumption is yes. I'd imagine the new data nodes could take on part of the burden of flushing data, merging segments, and servicing bulk requests, but maybe I am wrong.

Additionally, what about adding ingest nodes?

warkolm · January 15, 2021, 4:43am

If you add more nodes and the shards being written to end up spread across more of them, then yes.
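
As a rough illustration, here is a toy Python model. It is not Elasticsearch code: the function names and the round-robin placement are assumptions made up for the sketch. It counts how many actively written shards land on the busiest node as data nodes are added.

```python
from collections import Counter


def shards_per_node(num_shards: int, num_nodes: int) -> Counter:
    """Place shards round-robin on nodes (a toy stand-in for an even allocation)."""
    return Counter(shard % num_nodes for shard in range(num_shards))


def busiest_node_load(num_shards: int, num_nodes: int) -> int:
    """Return the number of actively written shards on the most loaded node."""
    return max(shards_per_node(num_shards, num_nodes).values())


if __name__ == "__main__":
    # Hypothetical index with 6 shards receiving writes.
    for nodes in (2, 3, 6, 12):
        print(f"{nodes} data nodes: busiest node holds {busiest_node_load(6, nodes)} shard(s)")
```

In this model, going from 2 to 3 to 6 nodes lowers the busiest node's share from 3 to 2 to 1 shards. Going to 12 nodes changes nothing, because there are no more shards left to spread out.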

Operations like flushing, merging, and servicing bulk requests only happen on nodes that hold the data (i.e. the shards) or that receive the bulk request.

As for ingest nodes, this relates to my first comment.
Yes, they help if the node getting the request from the client doesn't hold data for the required shard.
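
To picture what the node receiving a bulk request does, here is a minimal sketch in Python. It is a simplification under stated assumptions: `zlib.crc32` stands in for Elasticsearch's real routing hash, the modulo rule is the simplified form of document routing, and `target_shard`, `split_bulk`, and the `shard_to_node` mapping are hypothetical names. The receiving node groups documents by the node that holds each target shard, and anything it does not hold locally has to be forwarded.

```python
import zlib
from collections import defaultdict


def target_shard(doc_id: str, num_primary_shards: int) -> int:
    """Pick a shard for a document. crc32 is a stand-in, not Elasticsearch's hash."""
    return zlib.crc32(doc_id.encode("utf-8")) % num_primary_shards


def split_bulk(doc_ids, num_primary_shards, shard_to_node):
    """Group the documents of one bulk request by the node holding their shard."""
    per_node = defaultdict(list)
    for doc_id in doc_ids:
        shard = target_shard(doc_id, num_primary_shards)
        per_node[shard_to_node[shard]].append(doc_id)
    return dict(per_node)


if __name__ == "__main__":
    # Hypothetical layout: 4 primary shards spread over 2 data nodes.
    layout = {0: "data-1", 1: "data-2", 2: "data-1", 3: "data-2"}
    docs = [f"doc-{i}" for i in range(10)]
    for node, batch in split_bulk(docs, 4, layout).items():
        print(node, len(batch))
```

If the client sends the request to a node that holds none of these shards, every sub-batch is forwarded, and that grouping and forwarding is the work a non-data node can take off the data nodes.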

Ingest-only nodes might be worth looking at if you have a lot of pipelines.

rex-remind · January 15, 2021, 4:49am

I may not be following, but from what I'm hearing:

More data nodes will help spread the work out in general, as long as shards are evenly distributed.
Ingest nodes will also help spread out load, but only for servicing requests and running pipelines.

Did I get that right?

warkolm · January 15, 2021, 4:50am

Yep.

