That's a relatively low shard size. Have you considered reducing that by half?
I can see you edited your post after my response and added what you had tried. Given they are 2TB, that sort of time doesn't seem too unreasonable. Why do you need it to run faster?