Search the content of a PDF file

Hi,


I have created a trial account in ES, installed the Elasticsearch package, and configured it.
To insert the file, I converted it to base64, indexed it, and created a type. The file can be retrieved from the API console in Elastic Cloud using the search URL, since I indexed it with keywords.
But I need to search the content of the file, which is stored in base64 format.
Please help me with this.

Did you install the ingest attachment plugin and follow the example in the plugin guide?

There is an example here: https://www.elastic.co/guide/en/elasticsearch/plugins/current/using-ingest-attachment.html

PUT _ingest/pipeline/attachment
{
  "description" : "Extract attachment information",
  "processors" : [
    {
      "attachment" : {
        "field" : "data"
      }
    }
  ]
}
PUT my_index/_doc/my_id?pipeline=attachment
{
  "data": "e1xydGYxXGFuc2kNCkxvcmVtIGlwc3VtIGRvbG9yIHNpdCBhbWV0DQpccGFyIH0="
}
GET my_index/_doc/my_id

The data field is basically the BASE64 representation of your binary file.
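
As a rough illustration of how that value can be produced, here is a minimal Python sketch using only the standard library. The helper name `build_attachment_doc` is made up for this example, and the stand-in bytes are the small RTF document that the sample string above decodes to; a real file would be read with `open(path, "rb").read()`. It only builds and prints the request body; sending it to the cluster is left out.

```python
import base64
import json


def build_attachment_doc(file_bytes: bytes) -> dict:
    """Return an index request body whose "data" field holds the file as base64.

    Hypothetical helper for illustration; the field name "data" matches the
    pipeline definition shown above.
    """
    encoded = base64.b64encode(file_bytes).decode("ascii")
    return {"data": encoded}


# Stand-in bytes (a tiny RTF document); in practice read them from the PDF
# with open(path, "rb").read().
sample_bytes = b"{\\rtf1\\ansi\r\nLorem ipsum dolor sit amet\r\n\\par }"
body = build_attachment_doc(sample_bytes)

# Body for: PUT my_index/_doc/my_id?pipeline=attachment
print(json.dumps(body))
```

The file must be opened in binary mode, otherwise the encoded value will not match the original bytes.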

You can also use FSCrawler. There's a tutorial to help you get started.

Is the ingest plugin installed by the API call below?
PUT _ingest/pipeline/attachment
{
  "description" : "Extract attachment information",
  "processors" : [
    {
      "attachment" : {
        "field" : "data"
      }
    }
  ]
}

This can be run in the API console, right? It also required a "value" key. Yes, I ran the endpoint in the API console, in this format:

PUT _ingest/pipeline/attachment
{
  "description" : "Extract attachment information",
  "processors" : [
    {
      "set" : {
        "field" : "data",
        "value" : "boo"
      }
    }
  ]
}

I did insert my base64 file in the "data" field, and I am still not able to search the file content.

Is the ingest plugin installed by the API call below?

No. You need to install it from the cloud interface.

I am not able to find the installation setup in the cloud interface. Can you guide me?

Go to the "Edit" page of the cluster.
You should see this part on the page:

Just click on the Manage text and activate the plugin.

Thank you, it worked. File content search works perfectly; the issue was the plugin.
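
For anyone landing here later, a search against the extracted text might look like the sketch below. It assumes the pipeline uses the attachment processor's default target, so the extracted text ends up in `attachment.content`; the helper name `build_content_query` and the search term are made up for illustration. The code only builds the request body.

```python
import json


def build_content_query(text: str, field: str = "attachment.content") -> dict:
    """Return a search body that runs a full-text match query on the extracted text.

    Hypothetical helper; the default field assumes the attachment processor
    was configured without a custom target_field.
    """
    return {"query": {"match": {field: text}}}


# Body for: GET my_index/_search
search_body = build_content_query("ipsum")
print(json.dumps(search_body, indent=2))
```

Querying the `data` field itself will not help, since it only holds the base64 string and not the text extracted from the file.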

Hi, how can we do a partial search of words irrespective of case, plurals, etc.? For example, if I search for the key "sh", all words in a file containing "sh" or "SH" should be found.

Could you open another question for this?
