Hi all, I've read that the best practice for querying/searching is to use a coordinator node, which makes sense to me (a node that's not busy on disk operations for indexing and uses memory for gather phase of searches). However, when indexing data the best practice is to write directly to data no…

[image] 1fe60245fe160bc0f8ab: when indexing data the best practice is to write directly to data nodes. I'm curious where this idea has come from. Indexing can also benefit from going via a coordinating node indeed, for exactly the reasons you describe. You might want to use dedicated ingest n…

Thanks for the reply! I got the idea from the following posts: Correct usage of Coordinating role node (in Christian_Dahlqvist 's 2nd reply) When to use Coordinating vs Data Node There were some other posts which I can't find at the moment, but I'll look it up later.

Hmm. Both from experienced Elasticsearch people. I wonder if I'm missing something. @thiago or @Christian_Dahlqvist can you explain your thinking there?

I think we're all learning :slight_smile: Ingest is hopefully in the form of a bulk request, several docs to be indexed. Whatever node gets this request has to decide where to send each item. We know that shards are assigned by modulo math on the doc id. I've wondered if doc id's are generated to…

Coordinating only nodes can certainly be useful when indexing as well as querying, but I would generally expect that to be the case for larger clusters. For small clusters I think the addition of dedicated coordinating nodes often adds relatively little value and an additional data node may have a …

Thank you for the clarification! A few small questions, if I may - What is considered a "large cluster"? Do you have a thumb rule for this scenario? My cluster is currently indexing 5TB per day, would you consider that large? Would you recommend on ingest nodes that aren't doing any processing or …

[image] 1fe60245fe160bc0f8ab: What is considered a "large cluster"? Do you have a thumb rule for this scenario? My cluster is currently indexing 5TB per day, would you consider that large? Yes, I would consider that large. I assume you have a reasonably large number of nodes in the cluster. …

Usage of coordinator node for indexing

Elastic Stack Elasticsearch

1fe60245fe160bc0f8ab (עידו בוקר) February 12, 2020, 9:36pm 3

Thanks for the reply! I got the idea from the following posts:

Correct usage of Coordinating role node (in Christian_Dahlqvist's 2nd reply)
When to use Coordinating vs Data Node

There were some other posts which I can't find at the moment, but I'll look it up later.

Topic		Replies	Views
Use master/data or cordinating node for search? Elasticsearch	7	72	July 11, 2026
When to use Coordinating only node Elasticsearch	6	3622	November 1, 2022
When to use Coordinating vs Data Node Elasticsearch	5	4569	June 13, 2018
Ingestion through coordinate node Elasticsearch	3	781	September 26, 2019
Performance tuning for indexing between using dedicated coordinator code and using data nodes directly Elasticsearch	0	362	August 27, 2019

Usage of coordinator node for indexing

Related topics