currently we have a big machine running a single instance of logstash, kibana and elasticsearch.
It has 128GB RAM 700GB storage and 8 CPU. It's a virtual machine, disk is located in a NAS.
I think we are currently not working with replica shards. At least I got output of following kind:
GET _cat/shards other-prod-2017.12.05 0 p STARTED 285311 110.7mb 127.0.0.1 node-1 other-prod-2017.12.05 0 r UNASSIGNED other-staging-2017.09.30 0 p STARTED 279 238.9kb 127.0.0.1 node-1 other-staging-2017.09.30 0 r UNASSIGNED
Now I am thinking about building a cluster, maybe on a single machine. My goal is to increase performance for indexing and querying.
- If I build a cluster of two nodes for example, than I need twice of the storage, correct?
- Will it decrease indexing speed because Data needs to be written to primary and replica shards?
- Increasing the number of replica will increase query performance as long as the disk is not fully loaded, right?
- Is it possible to use replica shards by multiple nodes (because we store in NAS), or will I have a physical bottleneck in my NAS because the file is only once on disk?
I would like to understand what results I my doings probalbly will have because although we are on a VM our datacenter is sending us a bill for each change. So I do not want to waste money for trial and error