Scaling time series indices

shushu · July 10, 2016, 5:02pm

Hi,
I am running a time series elasticsearch cluster (on top of AWS service).
Using template, I create a daily index. 5 shards, 2 replicas, on 10 nodes + 4 masters.
Once the number of clients went high up - all stopped working - CPU maximum went to 100%, while CPU average kept low (~40%).

My guess is that the main searches are done against the latest days, so it focus on the nodes that has latest data - while the rest stays idle.

My question is - what would be the right scale mechanism ?
I think that I should by default, in the template, give the maximum number of replicas (10), so latest data will have as many replicas as possible.
Once data become old - in couple of days - reduce the number of replicas to 2.

Is this sounds like a decent methodology ?
Any other recommendations ?

nik9000 · July 10, 2016, 8:04pm

It is likely a lot more efficient to use bigger hardware (ssds, ram, CPU)
for the new indices and use forced allocation awareness to control indices
location.

I don't know how you'd do that with any cloud service.

Replicas scale reads well but not writes because replicas just perform the
same write as the primary.
shushu http://discuss.elastic.co/users/shushu Shushu Inbar
http://discuss.elastic.co/users/shushu
July 10

Hi,
I am running a time series elasticsearch cluster (on top of AWS service).
Using template, I create a daily index. 5 shards, 2 replicas, on 10 nodes +
4 masters.
Once the number of clients went high up - all stopped working - CPU maximum
went to 100%, while CPU average kept low (~40%).

My guess is that the main searches are done against the latest days, so it
focus on the nodes that has latest data - while the rest stays idle.

My question is - what would be the right scale mechanism ?
I think that I should by default, in the template, give the maximum number
of replicas (10), so latest data will have as many replicas as possible.
Once data become old - in couple of days - reduce the number of replicas to
2.

Is this sounds like a decent methodology ?
Any other recommendations ?

Visit Topic
http://discuss.elastic.co/t/scaling-time-series-indices/55135/1 or reply
to this email to respond.

You are receiving these emails cause you have enabled mailing list mode.

To unsubscribe from these emails, click here
http://discuss.elastic.co/email/unsubscribe/218e18220e3ce0d999d16a32978f5dcc3428ad3a81bc02edb9c8a875720d4730
.

Topic		Replies	Views
Scaling elasticsearch queries Elasticsearch	5	355	July 6, 2017
Experiences in "how to manage much data" needed Elasticsearch	8	575	August 10, 2018
Scaling ES Cluster and balacing shards (primary, replica) Elasticsearch	1	622	July 5, 2017
Using ES in a dynamic EC2 environment Elasticsearch	7	414	July 6, 2017
Few queries on setting up a high performing and scalable ES setup Elasticsearch	3	356	July 6, 2017

Scaling time series indices

Related topics