I am doing high volume inserts into a cluster. Sometimes we take a node down for maintenance, or failure. When the node comes back up, the disk utilization on the node is lower than the other nodes. This causes new shards to be created on this node at a much higher rate, and the volume of new data hitting this node is very high after rejoining the cluster.
Is there a setting to ensure new shard creation is evenly balanced across the cluster, and let the re-balance (which we throttle, and run heavier at night) worry about disk utilization differences. Since we delete data after X days, rebalance will fix itself over time as well. It would be best for us to ensure good balance at insert time.