We're having some performance/stability problems in our cluster while indexing data. There is especially two fields with pretty large html content with a custom analyzer as below.
The contents in those fields are around 1Mb - 3Mb large.
What we're seeing is nodes dropping out of the cluster frequently while adding docs. Logs show longish garbage collections.. The cluster is 5 nodes of 31Gb heap.
Any suggestions to make this easier on the cluster? I don't mind it being slow, but instability i want to avoid.