How can I upload/index PDF/HTML/XML format documents to Elastic Search.
Use Mapper attachments plugin to index the attachement type docs in ES.
https://github.com/elasticsearch/elasticsearch-mapper-attachments
If you are a Java developer, you could use Apache Tika.
This is what I'm doing here: https://github.com/dadoonet/fsriver/blob/master/src/main/java/fr/pilato/elasticsearch/river/fs/river/FsRiver.java#L689
--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr
Le 14 mars 2014 à 11:35:26, Sandeep (sandeep.test.id@gmail.com) a écrit:
How can I upload/index PDF/HTML/XML format documents to Elastic Search.
--
View this message in context: http://elasticsearch-users.115913.n3.nabble.com/Upload-index-document-to-Elastic-Search-tp4051542.html
Sent from the ElasticSearch Users mailing list archive at Nabble.com.
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/1394542439478-4051542.post%40n3.nabble.com.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.5322deef.2eb141f2.1ccf%40MacBook-Air-de-David.local.
For more options, visit https://groups.google.com/d/optout.