Index mapping with fscrawler

@dadoonet How do I correctly map an index with fscrawler so that pdf, doc and rtf files open on screen in the same format(visually) as they are on the original document?

(Please note that the documents are of various formats .. some have paragraphs while others are tables etc)

You can activate this option (https://fscrawler.readthedocs.io/en/latest/admin/fs/local-fs.html#store-binary) which will store the original binary content in elasticsearch.
That being said, I don't recommend this option. It's better to have the URL of the source document and load the binary from its source than storing big blobs in Elasticsearch/Lucene which has not really being designed for that purpose.

Not sure this is what you are looking after though.

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.