According to this post:
"Generally speaking, you'll receive the optimal performance by using the same number of shards as nodes."
I have three dedicated master nodes and one data node in my cluster.
So I have four nodes in my cluster.
Does this mean I should set number_of_shards to 4?
Or is it safe to ignore the data node and set number_of_shards to 3?
Another similar question is about how to determine the value of
"The formula for determining the value to use is: N / 2 + 1, where N is the total number of nodes in your cluster."
Should the value be 3 since there are four nodes in my cluster?
Or is it safe to ignore the data node and set the value to 2?
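For reference, this is how I arrived at the two candidate values. It is only a sketch of the arithmetic: the function name `majority` is my own, and I am assuming the division in N / 2 + 1 rounds down, which the quoted formula does not state explicitly.

```python
def majority(total_nodes: int) -> int:
    """Apply N / 2 + 1, assuming the division rounds down (my assumption)."""
    return total_nodes // 2 + 1


# Counting every node in my cluster (3 dedicated master nodes + 1 data node).
print(majority(4))  # 3

# Counting only the three dedicated master nodes.
print(majority(3))  # 2
```

So the question comes down to which value of N is the right one to plug in.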