PDFBox parser exception

Hi,

Parsing an encrypted PDF document causes ES to crash after a few attempts. Below is one of the follow-up exceptions triggered by the main cause:
org.apache.pdfbox.pdfparser.XrefTrailerResolver] Did not found XRef object at specified startxref position 0

Is there a workaround for this? Can we make ES skip such files?

I think you are using the mapper attachments plugin, aren't you?
If so, can you open an issue in the project repo and provide details such as:

  • version (es and plugin)
  • a link to a failing file

We have a unit test for that so I'd like to understand what is happening here.

Thanks

Yes, we are using the mapper plugin.

The plugin version is 2.5 and the ES version is 1.5.2. For policy reasons I cannot share the customer file with you. I am trying to see whether I can reproduce this with a corrupt file.

Opened an issue: "Mapper Plugin crashes ES when corrupt file. Msg Document is encrypted" (#160)

I need a file that helps reproduce this.
Also consider upgrading and testing the same file again.
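
On the original question about avoiding such files: until the root cause is known, one possible client-side workaround is to screen files before sending them to ES. The sketch below is only a heuristic and is not part of the plugin or of PDFBox. It assumes an encrypted PDF carries an `/Encrypt` key in its trailer in plain text and that the `%PDF-` header appears within the first 1024 bytes. The function names `looks_encrypted_pdf` and `should_index` are hypothetical. It can misclassify files (for example, a PDF whose content merely contains the string `/Encrypt`), and it does not detect corrupt xref tables.

```python
def looks_encrypted_pdf(data: bytes) -> bool:
    """Heuristic check: PDF header present and an /Encrypt key somewhere in the file.

    Assumption: the /Encrypt key is written literally (no #xx name escapes).
    """
    has_header = b"%PDF-" in data[:1024]
    return has_header and b"/Encrypt" in data


def should_index(data: bytes) -> bool:
    """Hypothetical pre-filter: skip files that look like encrypted PDFs."""
    return not looks_encrypted_pdf(data)


if __name__ == "__main__":
    import sys

    # Usage: python precheck.py file1.pdf file2.pdf ...
    for path in sys.argv[1:]:
        with open(path, "rb") as fh:
            content = fh.read()
        verdict = "index" if should_index(content) else "skip (looks encrypted)"
        print(f"{path}: {verdict}")
```

Files flagged as "skip" could be logged and reviewed separately instead of being indexed as attachments.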