Clustered indexing with large index


I have a large index roughly 120 gigs, I am having problems indexing. As we
run the index the process hits an OOM and dies and it also runs very slow.
Do you have any strategies that you could share with how to create a large
index using multiple river nodes to pull in the data? How about strategies
for proper sharding of this index?