Indexing pdf, word, text, image files

hi,
can anyone please give an example of how to extract content from pdf, word, text etc. using elasticsearch and indexing it?
i know that elasticsearch uses the TIKA plugin but i was not able to see how to use it in the indexing process.
what formats does TIKA support for content extraction for indexing in ES?

thanks!

Have a look at https://www.elastic.co/guide/en/elasticsearch/plugins/current/ingest-attachment.html

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.