I would like to know whether PDF and image documents (stored on a local file system or in AWS S3) can be indexed to enable search on key entities, where the entities depend on the type of document (school admission forms, examination forms, purchase orders, telephone bills, electricity bills, etc.).