Hi,
I need to index both a database and files in Elasticsearch. The database holds information about the file system: it stores each file's path and name along with other fields.
I need to index the database fields, take the file name from the database, extract the content of that file (for example, Word or PDF), and index it in Elasticsearch using mapper-attachments/ingest-attachment.
I tried Logstash and FSCrawler. Both work fine, but they index the database records and the file records separately.
Is there any way to index the database content and the file system content as a single record in Elasticsearch, since the two are linked?
If there is no out-of-the-box component, how can I index the combined (database + file system) records in Elasticsearch?
Can I use Apache Tika to extract the content and index it directly, or is there a better approach?
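
To make the question concrete, here is a minimal sketch of the kind of custom script I have in mind if no ready-made component exists. It reads each database row, attaches the base64-encoded bytes of the file that row points to, and builds a bulk request body so that each row and its file end up in one document. The table name `files`, the columns `id`, `file_path`, `file_name` and `title`, the index name `documents`, and the field name `data` are all assumptions for illustration, and SQLite only stands in for the real database. The sketch builds the request body but does not send it.

```python
import base64
import json
import sqlite3
from pathlib import Path

# Pipeline definition for the ingest-attachment plugin. It would be registered
# once with: PUT _ingest/pipeline/attachment
# The source field name "data" is an assumption made for this sketch.
ATTACHMENT_PIPELINE = {
    "description": "Extract text from the base64 content in 'data'",
    "processors": [{"attachment": {"field": "data"}}],
}


def build_document(row, base_dir="."):
    """Merge one database row with the base64-encoded bytes of its file."""
    doc = dict(row)
    # An absolute file_path overrides base_dir; a missing file raises an error.
    file_path = Path(base_dir) / row["file_path"] / row["file_name"]
    doc["data"] = base64.b64encode(file_path.read_bytes()).decode("ascii")
    return doc


def build_bulk_body(conn, index="documents", base_dir="."):
    """Return an NDJSON bulk body with one combined document per row.

    The table and column names in the query are hypothetical.
    """
    cursor = conn.execute("SELECT id, file_path, file_name, title FROM files")
    columns = [col[0] for col in cursor.description]
    lines = []
    for values in cursor:
        row = dict(zip(columns, values))
        # Action line: use the database id as the document id so that
        # re-running the script overwrites instead of duplicating.
        lines.append(json.dumps({"index": {"_index": index, "_id": row["id"]}}))
        # Source line: database fields plus the encoded file content.
        lines.append(json.dumps(build_document(row, base_dir)))
    # The bulk API expects every line, including the last, to end in a newline.
    return "".join(line + "\n" for line in lines)
```

The resulting body would be sent to `POST _bulk?pipeline=attachment`, so the attachment processor extracts the text on the Elasticsearch side and stores it, by default, under `attachment.content` next to the database fields. As far as I understand, ingest-attachment uses Apache Tika internally, so calling Tika myself would only be needed if the extraction has to happen before the data reaches Elasticsearch.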