I run via the curl this code: PUT _ingest / pipeline / attachment { "description": "Extract attachment information", "processors": [ { "attachment": { "field": "date" } } ] } PUT my_index / _doc / my_id? Pipeline = attachment { "data": "e1xydGYxXGFuc2kNCkxvcmVtIGlwc3VtIGRvbG9yIHNpdCBhbWV0DQpccGFyIH0 =" } GET my_index / _doc / my_id
but I do not know how to send my base64 pdf file to the body.
with python I got it, however I have to mount with node.js, and use the elasticsearch client, and I'm not getting results. I saw that fsclawler might be a solution, however I wanted to make sure it could solve everything just with the elasticsearch
The code posted was the only one I found by example and apparently would do what I like: Read the contents of a pdf file and make it available for search.
But it does not fulfill what you would like, so I would like an example equivalent to the code made available in the documentation for the elasticsearch plugin.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.