You could also use attachment plugin which will do the Tika job for you.
David ![]()
Twitter : @dadoonet / @elasticsearchfr
Le 10 mai 2012 à 16:05, Rafał Kuć r.kuc@solr.pl a écrit :
Hello!
Take a look at http://tika.apache.org/ framework. You can extract data
from files like PDF or DOC and then index that data into
Elasticsearch.--
Regards,
Rafał Kuć
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticsearchI'd like to use elasticsearch to store indexes about my documents.
I've documents like .doc files or .pdf files or whatever.
Is there any way/tool to index such kind of documents ?
Tks
Tullio