I am a self-funded student, and I need to use this system as part of my final year pursuing a master's in software and systems security. I would like assistance calculating system requirements, assuming the load on the system remains consistent. I am running the Elastic Stack on this system and am facing the following issues:
System load is always over 4.0. Is this due to an IOPS issue? If so, will adding RAM during migration to another system help, or does it require faster storage such as an SSD?
Will the following configuration reduce the load and improve performance without over-provisioning?
Compute: 3 processors × 2 cores per processor = 6 cores
Memory: 16 GB RAM
Storage: local (SATA, 7200 RPM NAS-grade drive)
The underlying operating system will continue to be Ubuntu 20.04 LTS. I want to learn the Elastic Stack comprehensively, so I am hoping to set up multiple nodes to test node roles, query scheduling, etc., with a view to using the Elastic Stack as an enterprise SIEM.
The current primary purpose of the system is to ingest logs from 50 honeypots deployed around the world in AWS and Azure, averaging 500 EPS (events per second). These events go through a Logstash pipeline running on a Raspberry Pi before being sent to the Elastic Stack.
What steps should I take to ensure I am not over-provisioning and can use the system for other projects too?
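For reference, here is the back-of-the-envelope ingest estimate I am working from. It's a minimal sketch, and the event size is my own guess of roughly 1 KB, not something I have measured:

```python
# Rough daily ingest estimate for capacity planning.
# Assumptions (mine, not measured): 500 EPS sustained, ~1 KB per event,
# 1 replica, indexing overhead roughly offset by compression.
EPS = 500
EVENT_SIZE_BYTES = 1024
REPLICAS = 1

daily_events = EPS * 60 * 60 * 24              # 43,200,000 events/day
daily_bytes = daily_events * EVENT_SIZE_BYTES  # ~44 GB/day raw
with_replicas = daily_bytes * (1 + REPLICAS)   # ~88 GB/day on disk

print(f"{daily_events:,} events/day")
print(f"{daily_bytes / 1e9:.1f} GB/day raw, "
      f"{with_replicas / 1e9:.1f} GB/day with {REPLICAS} replica(s)")
```

If those assumptions are anywhere near right, a single spinning disk would fill up and fall behind fairly quickly, which is why I'm asking about storage.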
Any reason not to run this on Docker? It's way more flexible for controlling versions, allocations, etc. 4 GB is not a lot for a real ELK stack, though your 64 GB workstation is nice.
IOWait that high at only 50 IOPS suggests small I/O operations and possibly fsync() calls going on; an SSD will certainly fix that.
Yeah, I'm a little confused about your goals: if you are just starting out, you have a ways to go before having an enterprise SIEM. And 500 EPS is not high, but it could be for spinning disks; it's hard to know, as they are so rare these days on primary data stores, and all you can do is test and benchmark.
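If you want to confirm the box is actually disk-bound before migrating, here's a minimal sketch using Python's psutil (an assumption on my part that you can install it with `pip install psutil`; the iowait field is Linux-only):

```python
import psutil  # third-party: pip install psutil

# Sample CPU time percentages a few times; on Linux, 'iowait' is the
# share of time the CPU sat idle waiting on disk I/O to complete.
for _ in range(5):
    cpu = psutil.cpu_times_percent(interval=2)
    print(f"user={cpu.user:5.1f}%  system={cpu.system:5.1f}%  "
          f"iowait={cpu.iowait:5.1f}%  idle={cpu.idle:5.1f}%")
    # Consistently high iowait (say, >20%) alongside a high load average
    # points at the spinning disk, not CPU or RAM, as the bottleneck.
```

You can get the same picture from `iostat` or `vmstat`; the point is to measure before buying hardware.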
So my goal is to carry out my final year research on security monitoring. I've selected ELK as one of the platforms to benchmark and compare against traditional SIEMs such as IBM's QRadar.
As part of my research, I have a series of honeypot systems deployed on AWS/Azure/Google Cloud that send telemetry data. Currently I have 25+ such systems, including a few of my family's laptops, which also log NetFlow data. I hope this clears up my use case; I apologise for the ambiguity in my initial post.
You can calculate EPS by running cluster stats a few seconds apart, taking the difference in indices.docs.count, and dividing it by the elapsed time. Our ELKman.io tool (which you can use for free) also shows this on its dashboard.
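For example, a minimal sketch against the REST API (assuming an unsecured cluster at localhost:9200; adjust the URL and add auth for your setup):

```python
import time
import requests  # third-party: pip install requests

ES = "http://localhost:9200"  # adjust to your cluster's address

def doc_count():
    # _cluster/stats reports the total document count across all indices
    stats = requests.get(f"{ES}/_cluster/stats").json()
    return stats["indices"]["docs"]["count"]

INTERVAL = 30  # seconds between the two samples
first = doc_count()
time.sleep(INTERVAL)
second = doc_count()

# EPS = documents added during the interval / interval length
print(f"~{(second - first) / INTERVAL:.1f} events per second")
```

Note this counts deletions and updates into the delta too, so run it a few times during steady ingest for a usable average.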