Hello,
We wish to set up an entire ELK system with the following features:
- Input from Logstash shippers located on 400 Linux VMs. Only a handful
 of log sources on each VM.
- Data retention for 30 days, which is roughly 2TB of data in indexed ES
 JSON form (not including replica shards)
- Estimated input data rate of 50 messages per second at peak hours.
 Mostly short or medium length one-line messages but there will be Java
 traces and very large service responses (in the form of XML) to deal with
 too.
- The entire system would be on our company LAN.
- The stored data will be a mix of application logs (info, errors etc)
 and server stats (CPU, memory usage etc) and would mostly be accessed
 through Kibana.
This is our current plan:
- Have the LS shippers perform minimal parsing (but would do multiline).
 Have them point to two load-balanced servers containing Redis and LS
 indexers (which would do all parsing).
- 2 replica shards for each index, which ramps the total data storage up
 to 6TB
- ES cluster spread over 6 nodes. Each node is 1TB in size
- LS indexers pointing to cluster.
So I have a couple questions regarding the setup and would greatly
appreciate the advice of someone with experience!
- 
Does the balance between the number of nodes, the number of replica 
 shards, and storage size of each node seem about right? We use
 high-performance equipment and would expect minimal downtime.
- 
What is your recommendation for the system design of the LS indexers 
 and Redis? I've seen various designs with each indexer assigned to a single
 Redis, or all indexers reading from all Redises.
- 
Leading from the previous question, what would your recommend data 
 size for the Redis servers be?
- 
Not sure what to do about master/data nodes. Assuming all the nodes 
 are on identical hardware would it be beneficial to have a node which is
 only a master which would only handle requests?
- 
Do we need to do any additional load balancing on the ES nodes? 
We are open to any and all suggestions. We have not yet committed to any
particular design so can change if needed.
Thank you for your time and responses,
Alex
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/b0aee66a-35bb-4770-927b-d9c7e13ad9fc%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.