I am preparing the following Elasticsearch cluster architecture:
Total 6 Nodes:
1 Node with roles: master and remote_cluster_client
3 Nodes with roles: data, data_hot, data_content and ingest
1 Node with role data_warm
1 Node with Kibana
Ideally, the goal is to store logs from custom applications developed by me. There will be something like 15k-20k log entries per day. The logs will stay on the hot nodes for 2 months, because they will be accessed from Kibana for reporting. After these 2 months, the logs will move to the warm node (using ILM). After 6 months on the warm node, the index will be archived to S3.
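The hot-for-2-months, warm-for-6-months, then-archive lifecycle could be sketched as an ILM policy along these lines. This is a sketch, not a verified configuration: the policy name `app-logs`, the rollover thresholds, and the SLM policy name `s3-archive` are assumptions; the phase timings follow the plan above and are counted from rollover. The `wait_for_snapshot` action in the delete phase makes ILM wait for the named snapshot policy (backed by an S3 repository) to run before the index is deleted.

```json
PUT _ilm/policy/app-logs
{
  "policy": {
    "phases": {
      "hot": {
        "actions": {
          "rollover": {
            "max_age": "30d",
            "max_primary_shard_size": "50gb"
          }
        }
      },
      "warm": {
        "min_age": "60d",
        "actions": {}
      },
      "delete": {
        "min_age": "240d",
        "actions": {
          "wait_for_snapshot": {
            "policy": "s3-archive"
          },
          "delete": {}
        }
      }
    }
  }
}
```

With data tiers in use, the warm phase moves the index to the `data_warm` node automatically via tier preference, so no explicit allocation rules are needed.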
The questions that I have are the following:
Is the architecture suitable for the required job?
Is 1 Node with Master role enough? What will happen if the master goes down?
According to the documentation, what I understood is that you can configure the ILM policy so that the warm node keeps the data for XXX period and, before deleting it, archives it to S3. What is the procedure to restore logs for a specific date that has already been sent to S3 (i.e., no longer in the warm phase)?
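For reference, restoring from S3 generally means restoring the snapshotted index that covers the date in question back into the cluster. A sketch, assuming a registered S3 snapshot repository named `s3_repo`, a snapshot named `snapshot-2023.01.15`, and daily indices named like `app-logs-2023.01.15` (all of these names are assumptions, not verified against this cluster):

```json
POST _snapshot/s3_repo/snapshot-2023.01.15/_restore
{
  "indices": "app-logs-2023.01.15",
  "rename_pattern": "(.+)",
  "rename_replacement": "restored-$1"
}
```

The `rename_pattern`/`rename_replacement` pair restores the index under a new name (`restored-app-logs-2023.01.15`), which avoids clashing with any live index managed by ILM.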
You should always look to have 3 master-eligible nodes, as that will allow the cluster to continue operating if 1 is unavailable. It would therefore be better to have 3 nodes with the master, data_hot, data_content and ingest roles. Unless you are going to have multiple clusters and use cross-cluster search, I am not sure why you would use remote_cluster_client. If you want warm data to also be highly available, I would have 2 data nodes with the data_warm role.
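That recommended layout could be expressed in each node's elasticsearch.yml roughly like this (a sketch; node names are placeholders):

```yaml
# elasticsearch.yml on each of the 3 hot nodes:
# master-eligible, hot/content data, and ingest in one role set
node.roles: [ master, data_hot, data_content, ingest ]

# elasticsearch.yml on the warm node(s):
node.roles: [ data_warm ]
```

With 3 master-eligible nodes, the cluster can still elect a master (a majority of 2 out of 3) if any single one of them fails.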
No. That would make the cluster unavailable, and you would lose all data if the master node was permanently lost.
Thank you very much, Christian.
I will put the master role also on the data nodes. In this case, is it possible to set a priority so that the master role is held only by the dedicated master node, and transferred to the other nodes only if a failure happens?
What kind of backup strategy (apart from snapshots) is good to implement on the nodes at the server level?
Make the 3 hot data nodes master-eligible and do not use another dedicated master node. You want 3 master-eligible nodes, not 4. You cannot (and do not need to) control which node is elected master at any point.