It all depends on your query patterns and expected query concurrency. If you just serve a single request fewer shards will distribute load less but if you have many queries running in parallel against different indices the nodes may still be quite heavily loaded even with a few shards per index. I think you will need to test to find out.