Suggestions for appropriate number of nodes

manohar_deepu · September 5, 2020, 7:02am

Hi,

I want to set up my Elasticsearch cluster. I have 10 machines for non-data roles, plus 2 machines reserved exclusively for data nodes.

I was thinking of using the machines in the following way:

4 machines running both master + ingest nodes
4 machines running both Kibana + a coordinating node
2 machines exclusively for APM Server
2 machines exclusively for data nodes (I cannot allocate more here, as I don't have other machines with enough disk space)

Is this a good setup? Can you please suggest improvements?

axw · September 7, 2020, 5:58am

Sizing will depend on the details of your environment. Generally I would recommend that you experiment with real workloads (or a simulation thereof), using an approach such as the one described at Quantitative Cluster Sizing | Elastic.

Typically you would have three master-eligible nodes for Elasticsearch, as described at https://www.elastic.co/blog/a-new-era-for-cluster-coordination-in-elasticsearch:

Typically we recommend that clusters have three master-eligible nodes so that if one of the nodes fails then the other two can still safely form a quorum and make progress. If a cluster has fewer than three master-eligible nodes, then it cannot safely tolerate the loss of any of them. Conversely if a cluster has many more than three master-eligible nodes, then elections and cluster state updates can take longer.
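The quorum reasoning in the quoted passage can be sketched with simple majority arithmetic. This is only an illustration under the assumption that a quorum is a strict majority of master-eligible nodes; the function names are hypothetical, and Elasticsearch's actual voting configuration handling is more involved than this.

```python
def quorum_size(master_eligible: int) -> int:
    """Smallest strict majority of the master-eligible nodes."""
    return master_eligible // 2 + 1


def tolerable_failures(master_eligible: int) -> int:
    """How many master-eligible nodes can fail while a majority remains."""
    return master_eligible - quorum_size(master_eligible)


if __name__ == "__main__":
    for n in (1, 2, 3, 4, 5):
        print(
            f"{n} master-eligible: quorum={quorum_size(n)}, "
            f"can lose {tolerable_failures(n)}"
        )
```

In this simplified model, four master-eligible nodes (as in the original plan) tolerate the same single failure as three, so the fourth machine adds no extra fault tolerance for elections.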

The rest depends on:

how capable the hardware is
how much data you're sending to APM Server and Elasticsearch (how many events/sec)
how many end-users will be accessing Kibana
desired fault tolerance

Side note: if you only have 2 data nodes then that doesn't leave a lot of room for failure. If you haven't already, consider also using snapshot and restore, for backing up data to slower network/cloud storage.

manohar_deepu · September 7, 2020, 7:20am

We will consider these points. I guess what I was looking for is which services I can co-host on one machine (let's say my machine is powerful enough to host 2 services given our load).

Excluding data nodes, I was thinking of co-hosting

Master + Ingestor
APM + Kibana
Coordinator separately

Will it cause any problems or do you see any red flags?

axw · September 7, 2020, 8:10am

OK, I see.

If you have the hardware available, then I would say it's generally a good idea to have dedicated master-eligible nodes. The reasons are discussed under Node | Elasticsearch Guide [8.11] | Elastic.

For Kibana, the simplest way to load-balance over multiple Elasticsearch nodes is by running a co-located coordinating-only Elasticsearch node, as described at Use Kibana in a production environment | Kibana Guide [8.11] | Elastic

APM Server and ingest nodes are both CPU-heavy. The more CPU you can allocate to them the better, so it may be better to run them on dedicated machines.

So (without knowing the finer details of your environment) I'd probably go with something more like:

3 master-eligible nodes
2-3 dedicated Ingest nodes
2-3 Kibana servers with local coordinating-only Elasticsearch nodes for load-balancing
2 dedicated APM Servers

If you intend to run ML or Transforms, you might also run those on the Ingest nodes.
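As a rough check of the layout above against the 10 non-data machines mentioned earlier in the thread, here is a minimal sketch. The `Tier` class and helper names are hypothetical. The `node.roles` values assume an Elasticsearch version that supports the `node.roles` setting (older versions use separate boolean settings such as `node.master` and `node.ingest`), so treat the printed lines as a starting point rather than a verified configuration.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class Tier:
    name: str
    count_range: Tuple[int, int]  # (minimum, maximum) machines suggested
    # Roles for the Elasticsearch node on the machine; None means the
    # machine runs no Elasticsearch node at all (e.g. APM Server only).
    node_roles: Optional[Tuple[str, ...]]


SUGGESTED = (
    Tier("master-eligible", (3, 3), ("master",)),
    Tier("ingest", (2, 3), ("ingest",)),
    # An empty role list makes a node coordinating-only.
    Tier("Kibana + coordinating-only", (2, 3), ()),
    Tier("APM Server", (2, 2), None),
)


def machine_range(tiers: Sequence[Tier]) -> Tuple[int, int]:
    """Total machines needed at the low and high end of each range."""
    low = sum(t.count_range[0] for t in tiers)
    high = sum(t.count_range[1] for t in tiers)
    return low, high


def roles_setting(tier: Tier) -> Optional[str]:
    """elasticsearch.yml line for the tier, or None if not an ES node."""
    if tier.node_roles is None:
        return None
    return f"node.roles: [{', '.join(tier.node_roles)}]"


if __name__ == "__main__":
    available = 10  # non-data machines from the original question
    low, high = machine_range(SUGGESTED)
    print(f"Layout needs {low}-{high} machines; {available} available")
    for tier in SUGGESTED:
        print(f"{tier.name}: {roles_setting(tier) or '(no Elasticsearch node)'}")
```

With these ranges the layout needs between 9 and 11 machines, so it fits in 10 only if at most one of the ingest or Kibana tiers uses its upper count.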

manohar_deepu · September 7, 2020, 8:31am

This helps, thank you.

manohar_deepu · September 7, 2020, 8:55am

One clarification: everywhere it is suggested to allocate 50% of system memory to the JVM heap for Elasticsearch. Is there a specific reason for the 50% figure, and is there any issue if we go to, let's say, 70% of memory?

axw · September 7, 2020, 9:05am

It is important to leave enough memory for things other than the JVM heap, as described at https://www.elastic.co/guide/en/elasticsearch/reference/current/heap-size.html#heap-size

I don't know where the 50% number comes from. I recommend opening a new topic at https://discuss.elastic.co/c/elastic-stack/81 if you would like more details on running Elasticsearch.
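To make the trade-off concrete, here is a small sketch of how much memory is left outside the heap at different heap fractions. It assumes the guidance on the linked heap-size page as I understand it: keep the heap at no more than 50% of RAM and below the JVM's compressed object pointer threshold. The 26 GB cap used here is an assumed conservative value for that threshold, and the function names are hypothetical; verify both against the linked page for your version.

```python
def suggested_heap_gb(ram_gb: float, fraction: float = 0.5,
                      cap_gb: float = 26.0) -> int:
    """Heap size in whole GB: a fraction of RAM, capped at cap_gb.

    cap_gb is an assumed safe value below the compressed object
    pointer threshold; it is not read from any real system.
    """
    heap = int(min(ram_gb * fraction, cap_gb))
    if heap < 1:
        raise ValueError("machine too small for a whole-GB heap")
    return heap


def jvm_options(ram_gb: float, fraction: float = 0.5) -> list:
    """Matching -Xms/-Xmx lines (min and max heap set equal)."""
    heap = suggested_heap_gb(ram_gb, fraction)
    return [f"-Xms{heap}g", f"-Xmx{heap}g"]


def left_outside_heap_gb(ram_gb: float, fraction: float = 0.5) -> float:
    """RAM remaining for the filesystem cache and everything else."""
    return ram_gb - suggested_heap_gb(ram_gb, fraction)


if __name__ == "__main__":
    for fraction in (0.5, 0.7):
        print(
            f"32 GB machine at {fraction:.0%}: {jvm_options(32, fraction)}, "
            f"{left_outside_heap_gb(32, fraction):g} GB left outside the heap"
        )
```

On a 32 GB machine, going from 50% to 70% shrinks the memory left for the filesystem cache and other off-heap use from 16 GB to 10 GB, which is the kind of squeeze the linked page warns about.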

system · September 28, 2020, 5:05am

This topic was automatically closed 20 days after the last reply. New replies are no longer allowed.
