I'm using the ingest-attachment plugin to parse PDF files in an Elasticsearch 7 cluster. Each PDF file provides additional information about an already existing document.
I'm trying to write a query that retrieves all documents containing a given text, either in their own properties or in their corresponding PDF file.
Ideally, I would like to store the PDF file content as a field of the already existing document, but I can't find a way to do it with the ingest-attachment plugin.
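To make the goal concrete, here is a rough sketch of the search body I have in mind, assuming the PDF text ended up in a field of the existing document. The property names `title` and `description` are hypothetical placeholders, and `attachment.content` is simply where the attachment processor puts the extracted text by default; my real mapping may differ.

```python
import json


def build_search_body(text, property_fields=("title", "description"),
                      pdf_field="attachment.content"):
    """Build a search body matching `text` in the document's own
    properties or in the extracted PDF content.

    The field names are assumptions made for illustration only.
    """
    fields = list(property_fields) + [pdf_field]
    return {
        "query": {
            "multi_match": {
                "query": text,
                "fields": fields,
            }
        }
    }


body = build_search_body("invoice")
print(json.dumps(body, indent=2))
```

With everything in one document, a single `multi_match` would be enough and no join would be needed.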
As a workaround, I thought of using a kind of one-to-one join query, but some sources say that joins should be avoided if possible.
Is there a proper solution for this use case?