Relevance of having Cordinating Node and Data Node

rohitarora275 · May 19, 2020, 12:42pm

Hi Team,

I am trying to implement ELK on 6 servers in a cluster. I have below configuration

Two master Nodes
One Coordinating Node
Three Data Nodes

I have inserted around 100 records from logstash in elasticsearch, I have tried few scenarios.

1)If I am using Coordinating node in Kibana.yml(elasticsearch.hosts = Coordinating node URL property) then I am getting all the records

If I am using any Data node in Kibana.yml(elasticsearch.hosts = Data node URL property) then I am getting all the records
If two out of Three Data nodes are down then also I am getting same results.
Even if coordinating node is down , I am getting same results

So I have below queries here.

Is it inserting 100 records in all the three data nodes?
If I am getting Data by setting property in Kibana.yml(elasticsearch.hosts = Data node URL property) as well then what is the use of coordinating node

warkolm · May 19, 2020, 11:09pm

Elasticsearch works as a cluster, so the data is shared across all (data) nodes that are in that cluster. If the node you talk to doesn't have the data you want, then it will retrieve it from the node that does and then return that to the client.

Coordinating nodes are used to reduce the load on data nodes. They are usually only needed for high volume clusters.

What version are you on?

rohitarora275 · May 20, 2020, 8:04am

@warkolm Thanks for your reply, I am using the latest 7.7 version.

Also , you mentioned that it will retrieve from the node that has data(how will it retrieve if other data node is down).

I am just concerned , if there is any case where we are not able to display data

Should we use coordinating node in that case?

warkolm · May 20, 2020, 8:07am

The only case to really worry about is if nodes drop off the cluster and you don't have replicas.

rohitarora275 · May 21, 2020, 12:32pm

@warkolm : I am not creating any replica, I believe ELK must be creating it.

I have done testing with light load(100 records). While inserting the data there were 3 active data nodes in the cluster, after inserting I kept one active and two were down. Even two of the three were down, I was able to display correct and complete data

Can you tell me case when we don't have any replicas?

warkolm · May 22, 2020, 12:44am

The only time you won't have a replica is if you have a single node cluster, or you set the replica count to 0 for the index.

Christian_Dahlqvist · May 22, 2020, 3:41am

This is not good. You should always aim to have at least 3 master eligible nodes in any cluster. Two nodes does not give any high availability as Elasticsearch uses consensus algorithms and require a strict majority of master eligible nodes to be available to function fully.

Just because you can have dedicated node types does not mean that you should. The easiest way to get started is to have 3 nodes that hold data and are master eligible. Unless you expect to significantly expand the cluster this is a configuration that suits a large number of users.

system · June 19, 2020, 3:41am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Logstash and Kibana as coordinating nodes - Does it makes senses? Elasticsearch	10	409	September 15, 2020
ELK architucture Elasticsearch	5	323	March 31, 2020
Minimum number of Coordinating nodes Elasticsearch	7	1940	June 1, 2017
Coordination Node Where I put them? Logstash	8	492	October 17, 2020
ELK Cluster - Role of master node Elasticsearch	3	906	November 7, 2019

Relevance of having Cordinating Node and Data Node

Related topics