Which is better: many small instances vs one single beefy instance for real-time indexing and query?

Jonathan_Moo · July 4, 2012, 3:17am

Just to give a background, I am currently using a Logstash setup, hence my
current version of Elasticsearch (ES) is at 18.7. Everything is housed
under one large instance in AWS http://aws.amazon.com/ec2/instance-types/,
having only 300gb of EBS Volume http://aws.amazon.com/ebs/. Going to
switch to S3 storage http://aws.amazon.com/s3/once I scale.

There are 30 rolling indices in my ES instance, each index represents one
day of record and I combined 3 servers' worth of logs into a rolling index
daily. I am chalking up at least 150gb to 210gb in this ES instance, each
index at least having 5gb.

I am asked to scale this project to house more servers' logs, but I'm not
sure which route to go. I have problems querying facets in real-time per
minute because it takes up too much RAM just to query the data and to put
them into a graph, but I hope to scale it such that I could create a date
histogram per minute on 30 days into 1 graph.

So which route should I take? Many small instances of ES or one single
beefy AWS/ES instance for my use-case? I need it to be as cost-efficient as
possible.

Topic		Replies	Views
ElasticSearch on Amazon EC2 tips Elasticsearch	4	1570	July 6, 2017
Scaling: Cluster for speed or for size? Elasticsearch	6	356	July 6, 2017
Elasticsearch on EC2. What kind of instance types to use? Elasticsearch	7	17648	July 6, 2017
Quick question on which Amazon instance type to use Elasticsearch	1	366	July 6, 2017
30 million documents in one index best hardware to use Elasticsearch	6	719	May 11, 2020

Which is better: many small instances vs one single beefy instance for real-time indexing and query?

Related topics