Formula to estimate the optimal number of shards given input parameters (heap size, average index size, etc.)

Hello guys,
I have read some discussion topics on how to tune an Elasticsearch/Logstash configuration, and from what I understand there are a bunch of variables and metrics that should be related.
For example:

  • the average shard size should be between 20 GB and 40 GB
  • the total number of shards on a node should be at most 20 × (GB of heap)
  • ......

I wonder whether, under some fixed assumptions (e.g. indexes are daily, time-based), there exists a mathematical formula for estimating the optimal number of shards from input parameters (number of nodes, GB of heap per node, average daily index size, total number of indexes created daily, retention period for an index).

I'd like to have some automatic computation that could help me visualize different tuning options for different input parameter values.
Just a starting formula to get rapid feedback; I'm not looking to build machine-learning prediction models :slight_smile:
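
To make it concrete, here is the kind of rough computation I have in mind, a minimal Python sketch built only from the two rules of thumb quoted above (the 20-40 GB target shard size and the 20-shards-per-GB-heap limit); the function and parameter names are just placeholders, not anything official:

```python
import math

def estimate_shards(daily_index_size_gb,   # average size of one daily index
                    indexes_per_day,       # indexes created per day
                    retention_days,        # how long each index is kept
                    node_count,            # number of data nodes
                    heap_gb_per_node,      # JVM heap per data node
                    target_shard_gb=30):   # midpoint of the 20-40 GB rule of thumb
    # Primary shards per index, sized so each shard stays near the target.
    shards_per_index = max(1, math.ceil(daily_index_size_gb / target_shard_gb))

    # Shards alive at any moment = shards per index * indexes per day * retention.
    # (Replica shards would multiply this further; ignored here for simplicity.)
    total_shards = shards_per_index * indexes_per_day * retention_days

    # Rule of thumb: at most 20 shards per GB of heap on each node.
    max_shards_per_node = 20 * heap_gb_per_node
    shards_per_node = total_shards / node_count

    return {
        "shards_per_index": shards_per_index,
        "total_shards": total_shards,
        "shards_per_node": shards_per_node,
        "max_shards_per_node": max_shards_per_node,
        "within_heap_limit": shards_per_node <= max_shards_per_node,
    }

# Example: 5 daily indexes of 60 GB each, kept 30 days, on 3 nodes with 30 GB heap.
print(estimate_shards(60, 5, 30, 3, 30))
```

Playing with the inputs to something like this would already let me compare tuning options side by side.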

Thank you all for the feedback.
Regards,
Diego B