Document indexing

You could also use the attachment plugin, which does the Tika work for you.
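
For illustration, here is a minimal sketch of the two JSON payloads involved, assuming the mapper attachments plugin is installed: a mapping that declares a field of type "attachment", and a document carrying the file content as a base64 string. The index, type, and field names (docs, document, file) are hypothetical and chosen only for this example.

```python
import base64
import json

# Hypothetical names, used only for this sketch.
DOC_TYPE = "document"
FIELD = "file"


def attachment_mapping(doc_type=DOC_TYPE, field=FIELD):
    """Build a mapping that declares `field` as an attachment field."""
    return {doc_type: {"properties": {field: {"type": "attachment"}}}}


def attachment_document(raw_bytes, field=FIELD):
    """Build a document body; the file content is sent base64-encoded."""
    return {field: base64.b64encode(raw_bytes).decode("ascii")}


if __name__ == "__main__":
    sample = b"%PDF-1.4 example bytes"  # stand-in for real file content
    print(json.dumps(attachment_mapping(), indent=2))
    print(json.dumps(attachment_document(sample)))
```

The mapping would be sent once when the index is set up, and each file would then be indexed as a document like the second payload. The plugin runs Tika on the server side to extract the text.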

David :wink:
Twitter : @dadoonet / @elasticsearchfr

On 10 May 2012 at 16:05, Rafał Kuć r.kuc@solr.pl wrote:

Hello!

Take a look at the Apache Tika framework (http://tika.apache.org/). You
can use it to extract text from files such as PDF or DOC, and then index
that text into Elasticsearch.
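
A rough sketch of that extract-then-index flow, under several assumptions: the Tika app jar is available locally as tika-app.jar and is run with its --text option, Elasticsearch listens on http://localhost:9200, and the index, type, and field names (docs, document, filename, content) are hypothetical. The /index/type/id URL form matches older Elasticsearch releases and may differ in the version you run.

```python
import json
import subprocess
import urllib.request

TIKA_JAR = "tika-app.jar"          # assumption: path to a local Tika app jar
ES_URL = "http://localhost:9200"   # assumption: local Elasticsearch node


def extract_text(path, tika_jar=TIKA_JAR):
    """Run the Tika app on a file and return the extracted plain text."""
    result = subprocess.run(
        ["java", "-jar", tika_jar, "--text", path],
        capture_output=True,
        check=True,
    )
    return result.stdout.decode("utf-8", errors="replace")


def build_index_request(doc_id, filename, text, es_url=ES_URL,
                        index="docs", doc_type="document"):
    """Build (but do not send) a PUT request that indexes the text."""
    body = json.dumps({"filename": filename, "content": text}).encode("utf-8")
    url = f"{es_url}/{index}/{doc_type}/{doc_id}"
    return urllib.request.Request(
        url,
        data=body,
        method="PUT",
        headers={"Content-Type": "application/json"},
    )


def index_file(doc_id, path):
    """Extract text from `path` with Tika, then index it."""
    request = build_index_request(doc_id, path, extract_text(path))
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling index_file("1", "report.pdf") would need both Java with the Tika jar and a running Elasticsearch node. build_index_request can be inspected on its own without either.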

--
Regards,
Rafał Kuć
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - Elasticsearch

I'd like to use Elasticsearch to index my documents.
I have documents such as .doc files, .pdf files, and so on.
Is there any way or tool to index these kinds of documents?
Thanks,
Tullio