Cluster discovery configuration

pinakinab · October 21, 2019, 4:40pm

We are trying to host an elasticsearch cluster with 3 Master Nodes and 6 Data Nodes. In order to establish a cluster, we are providing discovery.seed_hosts and cluster.initial_master_nodes in the elasticsearch.yml file.

In discovery.seed_hosts, we are currently providing hostnames for all the nodes (master as well as data).

In cluster.initial_master_nodes, we are currently providing node.name info for all the master nodes.

The question here is, do we really need to include data nodes in the discovery.seed_hosts? or is it okay if we just specify only master nodes?

DavidTurner · October 21, 2019, 5:15pm

You should only refer to the master nodes in discovery.seed_hosts. From the docs:

... you must use the discovery.seed_hosts setting to provide a list of other nodes in the cluster that are master-eligible [...] This setting should normally contain the addresses of all the master-eligible nodes in the cluster.

pinakinab · October 22, 2019, 6:57pm

@DavidTurner - So is there any configuration parameter where in we need to specify list of hostnames of data nodes?

DavidTurner · October 22, 2019, 7:11pm

No, there is no such parameter.

pinakinab · October 23, 2019, 6:07am

@DavidTurner - Thanks! That helps a lot.

ted.fed · October 25, 2019, 4:36pm

@DavidTurner, it is a bit confusing then. There seems to be very less difference between discovery.seed_hosts and cluster.initial_master_nodes.
For production mode:

seed_hosts should contain the hostnames or the IPs of all master eligible nodes and
initial_master_nodes should contain the node.names of all master eligible nodes.

Is that a correct understanding?
And if yes, then why can't one be derived from the other automatically by ES since the mapping is already known to ES?

Thinking aloud, the property names like cluster.master_hostnames and cluster.master_names might be more clear perhaps.

DavidTurner · October 25, 2019, 6:07pm

There's a superficial similarity but they're really very different settings.

discovery.seed_hosts is about discovery, i.e. finding the master nodes, so belongs in the discovery.* settings namespace. It must be set on every node whether master-eligible or not because every node must perform discovery. It should be kept up to date as the cluster evolves, but it tolerates mistakes (particularly extra nodes) and need not be precisely synchronised across all nodes. It can involve an external service (e.g. DNS) which may not give wholly consistent answers, and is just one of a number of pluggable mechanisms for discovering the master nodes in a cluster.

cluster.initial_master_nodes is about cluster bootstrapping, i.e. the first election in the cluster, so belongs in the cluster.* settings namespace. It need not be set on master-ineligible nodes because these nodes do not take part in the first election. It absolutely must not be adjusted as the cluster evolves and can be removed once the first election has taken place. It does not tolerate mistakes and must be precisely synchronised across all nodes on which it is set. It must not involve external services like DNS for consistency reasons, and it cannot be supplied by a plugin.

The mapping between master names and addresses is not "already known" to ES and cannot be automatically discovered. A node may ask another node for its name but only once it knows its address, but addresses do not uniquely and consistently identify nodes.

system · November 22, 2019, 6:07pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Elasticsearch: Configurations when adding new nodes to a running cluster (Discovery) Elasticsearch	3	358	March 2, 2022
Discovery.seed_hosts and cluster.initial_master_nodes Elasticsearch	3	812	May 25, 2023
New cluster best practice - discovery.zen.ping.unicast.hosts: Elasticsearch	10	4792	July 22, 2019
Discovery.seed_hosts Elasticsearch	2	372	April 14, 2021
Cluster config? Elasticsearch	4	488	March 17, 2017

Cluster discovery configuration

Related topics