PDFBox parser exception


(Deepikaa Subramaniam) #1

Hi,

Parsing an excrypted PDF Document is causing ES to crash after a few attempts. Below is some followup exceptions because of the main cause.
org.apache.pdfbox.pdfparser.XrefTrailerResolver] Did not found XRef object at specified startxref position 0

Is there a workaround for this. Can we make to avoid such files?


(David Pilato) #2

I think you are using mapper attachments plugin, do you?
If so, can you open an issue in the project repo and provide details such as:

  • version (es and plugin)
  • a link to a failing file

We have a unit test for that so I'd like to understand what is happening here.

Thanks


(Deepikaa Subramaniam) #3

yes we are using mapper plugin.

Plugin version is 2.5 and ES version is 1.5.2. For policy reasons I cannot share the customer file with you. Trying to see if I can repro with some corrupt file.

Opened issue - Mapper Plugin crashes ES when corrupt file. Msg Document is encrypted #160


(David Pilato) #4

I need a file which helps to reproduce.
Also consider upgrading and test again the same file.


(system) #5