Understanding the recommended ES cluster architecture

I am currently working on designing an ELK domain via the AWS ES Service, and I am trying to decide how many dedicated master, data, ingest, etc. nodes I require.

I am ingesting data from AWS CloudTrail via Lambda, bringing in log data across several different accounts, and I will be adding log data ingestion from several IIS servers.

Currently, I see nearly 1,000,000 documents being indexed per 24 hours. As of yesterday, my ELK domain had 3 default nodes with 10 GB of storage each, and no dedicated node types of any kind.

I am currently waiting for my ELK domain to finish adding an additional 5 nodes and increasing each node's storage to 30 GB.

I am waiting to see how many logs to expect from IIS, though I would assume another 1,000,000 per day is a good approximation, erring slightly on the high side.

I am thinking of following the rule of thumb and including 3 dedicated master nodes alongside the 8 default nodes currently being provisioned, and I am also considering an ingest node, but I am not sure what the rule of thumb is for that node type.

Additionally I expect to have around 50 indices.

What would you recommend for my ELK domain?

Thanks to whoever can assist me!

For that little storage I would recommend going with a basic cluster of 3 identical nodes, all of which hold data and are master-eligible. At this size there is no point adding dedicated node types. How large you need to make the nodes depends on how much storage you need.

Is that 50 indices in total or 50 different time-based indices?

Try to keep the number of indices to a minimum, as having lots of small indices and shards in a cluster is very inefficient and can cause performance problems.
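To illustrate why this matters, here is a rough shard-count calculation. The per-index shard and replica counts below are assumptions based on common older Elasticsearch defaults, not figures from your cluster:

```python
# Rough shard-count arithmetic (all figures are illustrative assumptions).
indices = 50        # number of indices mentioned above
primaries = 5       # assumed default primary shard count per index
replicas = 1        # assumed default replica count

shards_per_index = primaries * (1 + replicas)
total_shards = indices * shards_per_index
print(total_shards)            # 500 shards cluster-wide
print(total_shards / 3)        # shards per node on a 3-node cluster
```

Several hundred mostly tiny shards on a 3-node cluster is exactly the kind of overhead that causes problems.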

Thank you. Just want to confirm 3 nodes total.. correct?

And yes, 50 different indices. I am indexing based on the source of the log event, i.e. IAM, EC2, S3, etc.

There is often no need to have a separate index per log type, so I would recommend you consolidate. Also, try to adjust the time period covered by each index so you get a shard size ideally over 1GB.
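To make the time-period point concrete, here is a back-of-the-envelope sketch. The document count comes from this thread; the average indexed size per document is purely an assumption you should replace with your own measurement:

```python
# Estimate how many days one index must cover for a single primary
# shard to reach roughly 1 GB (avg_doc_bytes is an assumption).
docs_per_day = 2_000_000     # ~1M CloudTrail + ~1M IIS, per the thread
avg_doc_bytes = 500          # assumed average indexed size per document
target_shard_gb = 1.0

daily_gb = docs_per_day * avg_doc_bytes / 1024**3
days_needed = target_shard_gb / daily_gb
print(f"~{daily_gb:.2f} GB/day -> cover at least {days_needed:.1f} day(s) per index")
```

Under these assumptions, daily indices with a single primary shard would land close to the 1GB target; if your real per-document size is smaller, weekly or monthly indices would be a better fit.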

Can you elaborate on your latter statement please?

Please read this blog post.

I understand. If you don't mind, could you take a look at this to help me understand it better? I am trying to estimate storage requirements. (I have had my ELK domain taking in data for roughly 3 days, and I have pulled up my cluster health and node health.)

I just need to know whether the disk space used per node should be summed together or considered separately when working out the actual minimum storage needed for this period (in this case 3 days). The average source bucket for the logs is around 2 GB, so I am trying to figure out how these numbers add up...
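For what it's worth, a common back-of-the-envelope way to estimate this is sketched below. The disk usage on each node sums toward the cluster total, and replicas count toward it too. The expansion factor and replica count here are assumptions; check them against your actual cluster stats:

```python
# Back-of-the-envelope cluster storage estimate (assumed parameters).
raw_gb_per_day = 2.0   # avg source bucket size from the thread
expansion = 1.1        # assumed ratio of indexed size to raw size (varies by mapping)
replicas = 1           # each primary shard has this many extra copies
retention_days = 3     # period observed so far
nodes = 3              # data nodes sharing the load

primary_gb = raw_gb_per_day * expansion * retention_days
total_gb = primary_gb * (1 + replicas)   # replicas double the footprint
per_node_gb = total_gb / nodes           # assuming shards spread evenly
print(f"~{total_gb:.1f} GB across the cluster, ~{per_node_gb:.1f} GB per node")
```

So the per-node figures are slices of one cluster-wide total, not independent copies of the full data set (the replica copies are what make the total larger than the primary data alone).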

Also thanks so much for the help!! :slight_smile:

Sorry, another question! Are shards index-dependent? That is to say, the consolidated indices I have now would easily accumulate more than 1GB of data each, but a future index for a different source might not. I am wondering whether that would cause any instability.