I'm a little fuzzy on the concept of sharding and have a few questions.
My ElasticSearch cluster consists of 4 Linux nodes. Each node has a 1 TB data disk. I'm shipping IIS logs to the cluster using Filebeat. I have the following environments...
NA1 - 12 web servers
NA2 - 9 web servers
NA3 - 12 web servers
NA4 - 9 web servers
STG - 4 web servers
When I started shipping logs I noticed that only two ElasticSearch nodes were getting data. There was only a single primary shard and single replica shard (on separate servers). I was under the impression that ElasticSearch would have evenly distributed the data across all nodes. I want to achieve the following.
Index Shards
na1_iis_logs split into 4 shards
na2_iis_logs split into 4 shards
na3_iis_logs split into 4 shards
na4_iis_logs split into 4 shards
stg_iis_logs split into 4 shards
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.