Hello ES Team,
I am using Elasticsearch version 6.5.4.
When I crawl 11,000 documents, crawling stops at 555 and I get the following exception:
[2020-03-24T23:38:48,325][DEBUG][o.e.a.b.TransportShardBulkAction] [ESC-CND-EXTSH01] [documentsearchindex_pub-itsupport][1] failed to execute bulk item (index) index {[documentsearchindex_pub-itsupport][_doc][38730], source[n/a, actual length: [43.7kb], max length: 2kb]}
java.lang.IllegalArgumentException: Document contains at least one immense term in field="documentTextPages.lowercase" (whose UTF8 encoding is longer than the max length 32766), all of which were skipped. Please correct the analyzer to not produce such terms. The prefix of the first immense term is: '[13, 10, 13, 10, 32, 32, 32, 32, 13, 10, 32, 32, 32, 32, 32, 32, 32, 32, 13, 10, 32, 32, 32, 32, 32, 32, 32, 32, 32, 32]...', original message: bytes can be at most 32766 in length; got 40977
Can you please help me handle the above exception so that all documents are crawled successfully?
I would appreciate your suggestions.
Let me know if you require more information.
Thanks,
Shweta