I'd like to store files (.csv, .pdf...) into ES database so I can search on their content and visualize relevant informations into a kibana dashboard. Those files are reports of transactions.
After a few searches, I read about mapper-attachment and ElasticWareHouse.
Do anyone know what is the best way to store those files?
If you go with the mapper attachments way, please be aware that it's deprecated and you are encouraged using ingest-attachment from 5.0 (GA not available yet).
Storing big BASE64 Content (binary files) in Elasticsearch is not ideal IMO. If you need to store the content somewhere, I'd use something like HDFS or Couchdb. But definitely I'd extract the content and index into Elasticsearch.
Is it possible then to make the link between a content and index into Elasticsearch and the file stored in the other db? Like a path or something like thatso the user knows where the file he is looking for is in case this user wants to read the file in addition to the kibana view?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.