@dadoonet How do I correctly map an index with FSCrawler so that PDF, DOC and RTF files open on screen in the same visual format as the original document?
(Please note that the documents come in various layouts: some have paragraphs, others are tables, etc.)
You can activate this option (https://fscrawler.readthedocs.io/en/latest/admin/fs/local-fs.html#store-binary), which will store the original binary content in Elasticsearch.
That being said, I don't recommend this option. It's better to keep the URL of the source document and load the binary from its source than to store big blobs in Elasticsearch/Lucene, which were not really designed for that purpose.
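To give an idea, here is a rough Python sketch of what the viewing side could look like once you have the `_source` of a search hit. It assumes FSCrawler's default field names, so please check them against your own mapping: `attachment` for the base64-encoded binary (only present when the store-binary option is on) and `path.real` for the location of the original file. The helper name `load_original_bytes` is made up for this example.

```python
import base64
from pathlib import Path


def load_original_bytes(source: dict) -> bytes:
    """Return the original file bytes for one FSCrawler-indexed document.

    `source` is the `_source` dict of a search hit. The field names used
    here ("attachment", "path.real") are assumptions based on FSCrawler's
    default output; adjust them to your mapping.
    """
    # Binary stored in the index (store-binary option enabled):
    # assumed to be base64-encoded in the "attachment" field.
    encoded = source.get("attachment")
    if encoded:
        return base64.b64decode(encoded)

    # Recommended route: read the file from where it originally lives.
    real_path = source.get("path", {}).get("real")
    if not real_path:
        raise ValueError("document has neither an attachment nor a source path")
    return Path(real_path).read_bytes()
```

Either way you get back the untouched PDF/DOC/RTF bytes, and it is then up to your application (browser, PDF viewer, Word, ...) to render them as in the original.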
Not sure this is what you are looking for though.