I use the plugin Rui mentioned in our production environment, and
most files work as expected. It uses Tika internally. Installing it
into Elasticsearch is easy: stop your Elasticsearch server and run
the following command in your Elasticsearch folder:
./bin/plugin install mapper-attachments
Then restart the server and push your documents to the index (make
sure the server has enough heap memory for large documents, since
each file is parsed in memory).
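
Since you asked about Java: as far as I know, the plugin expects the raw file content as a base64 string in a field whose mapping type is "attachment". Below is a minimal sketch of building such a document body with only the JDK. The field name "file", the class name AttachmentBody and the helper buildSource are my own placeholders, not part of the plugin, and you would still send the resulting JSON with whatever index call you normally use.

```java
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Base64;

// Hypothetical helper: builds the JSON source for one document.
// Assumes the index mapping declares the field with "type": "attachment".
class AttachmentBody {

    // Encodes the file bytes as base64 and wraps them in a one-field JSON object.
    // Standard base64 output contains no characters that need JSON escaping;
    // fieldName is assumed to be a plain identifier such as "file".
    static String buildSource(String fieldName, byte[] fileBytes) {
        String encoded = Base64.getEncoder().encodeToString(fileBytes);
        return "{\"" + fieldName + "\":\"" + encoded + "\"}";
    }

    public static void main(String[] args) throws Exception {
        // Read the file given on the command line, or fall back to a tiny sample.
        byte[] bytes = args.length > 0
                ? Files.readAllBytes(Paths.get(args[0]))
                : "hello".getBytes(StandardCharsets.UTF_8);
        System.out.println(buildSource("file", bytes));
    }
}
```

Note that the whole file is held in memory twice here (raw bytes plus the base64 string), which is one reason large documents need extra memory on the client side as well.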
On Jun 7, 9:10 am, slavag slav...@gmail.com wrote:
Is there any convenient way to index office documents rather than
parsing them myself (using Tika or Aperture) and then feeding
Elasticsearch with the parsed data? If yes, a reference to the Java
API would be very helpful.
Thank You and Best Regards.