I have a situation where we need to index the contents of a PDF file in
Elasticsearch. This can easily be achieved with the FS river plugin, but the
PDF file is stored as a BLOB in a database, which means the JDBC river
plugin is also in the picture now.
Please let me know your thoughts on this. I will have a look at the
Attachments Mapper plugin to see if it can replace the FS river plugin and
read a BLOB directly.
With a MySQL BLOB, you do not need the database to do the base64 conversion,
because the JDBC driver handles BLOBs correctly. The JDBC river performs the
base64 conversion when indexing. Other databases may work like MySQL too.
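For anyone who ends up doing that base64 step by hand instead of relying on the JDBC river, here is a minimal sketch in Python of what the conversion amounts to. It is only an illustration: the function name `build_attachment_doc`, the field name `file`, and the sample bytes are hypothetical, and the bytes stand in for whatever your database driver returns for the BLOB column.

```python
import base64
import json


def build_attachment_doc(blob_bytes, field="file"):
    """Return a JSON document whose `field` holds the base64-encoded BLOB.

    `field` is a hypothetical name; use whatever field your mapping
    declares as the attachment field.
    """
    # base64 turns arbitrary binary content into plain ASCII text,
    # which can then travel inside a JSON document.
    encoded = base64.b64encode(blob_bytes).decode("ascii")
    return json.dumps({field: encoded})


# Stand-in for the bytes read from the BLOB column (not a real PDF).
pdf_bytes = b"%PDF-1.4 example content"
doc = build_attachment_doc(pdf_bytes)
```

The resulting `doc` string is what would be sent as the document body when indexing; decoding the field value with `base64.b64decode` gives back the original bytes.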