I'm indexing a collection of PDF documents. The ingestion Attachment processing pipeline is working fine, but there are a few of the documents that are rather large (because they contain images, etc.). This seems to run into an upper-limit on size that ES can handle.
- Is there a way to modify this size limitation for documents loaded via the attachments pipeline plugin?
- Is there a way to get the plugin to completely ignore images during processing?
Thanks in advance for any suggestions.
UPDATE:
Status Code = 413,
Message = u'{"Message":"Request size exceeded 10485760 bytes"}'