Attachment mapper parsing exception


(Andrew Soep-2) #1

Fatal error: Uncaught exception 'Elastica\Exception\ResponseException' with
message 'MapperParsingException[Failed to extract [100000] characters of
text for [null]]; nested: TikaException[TIKA-198: Illegal IOException from
org.apache.tika.parser.microsoft.ooxml.OOXMLParser@71097ae8]; nested:
IOException[Unable to read entire header; 0 bytes read; expected 512
bytes];

Any ideas? I saw a Git issue about this, but it seemed to have been
closed. I'm able to import many docs, but this particular PPT is causing
issues.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(David Pilato) #2

You probably hit the limit and your doc is bigger than 100000 chars.
See README on mapper attachment project. There is a section about that.

HTH

David :wink:
Twitter : @dadoonet / @elasticsearchfr / @scrutmydocs

Le 15 oct. 2013 à 00:30, Andrew Soep siraris@gmail.com a écrit :

Fatal error: Uncaught exception 'Elastica\Exception\ResponseException' with message 'MapperParsingException[Failed to extract [100000] characters of text for [null]]; nested: TikaException[TIKA-198: Illegal IOException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@71097ae8]; nested: IOException[Unable to read entire header; 0 bytes read; expected 512 bytes];

Any ideas? I saw a Git issue about this, but it seemed to have been closed. I'm able to import many docs, but this particular PPT is causing issues.

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(system) #3