Ingesting files via ingest plugin

Hi

We have some folks trying to ingest files(pdfs) using the ingest plugin however the data is coming in as encrypted rawContent instead of searchable text - at least in the case of PDFs. Is there a way to convert from that to searchable text?

Here are the settings they're using:

{
 "attachments": {
   "description": "Document attachment pipeline",
   "processors": [
     {
       "attachment": {
         "field": "rawContent",
         "target_field": "attachment"
       }
     },
     {
       "remove": {
         "field": "rawContent"
       }
     }
   ]
 },

Here is sample daa:

{
  "_index" : "index-name",
  "_type" : "contentdto",
  "_id" : "eneZp2kB2sqO6dVfmlYf",
  "_version" : 1,
  "_seq_no" : 8299,
  "_primary_term" : 2,
  "found" : true,
  "_source" : {
    "contentID" : 45629,
    "keywords" : "Twenty Eight",
    "rawContent" : "UEsDBBQABgAIAAAAIQCv/zHFjwEAAJkGAAATANcBW0NvbnRlbnRfVHlwZXNdLnhtbCCi0wEooAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAALyVTU/CQBCG7yb+h2avhi5gYoyhcPDjqCRi4nXtDnTDfmVnQPj3bheshiCIEC/dtJ1532emM2lvsDA6m0NA5WzBOnmbZWBLJ5WdFOxl9NC6ZhmSsFJoZ6FgS0A26J+f9UZLD5jFbIsFq4j8DedYVmAE5s6DjW/GLhhB8TZMuBflVEyAd9vtK146S2CpRbUG6/fuYCxmmrL7RXy8IgmgkWW3q8Daq2DCe61KQZGUz63ccGmtHfKYmWKwUh4vIgbjWx3qNz8brPOeYmuCkpANRaBHYSIGX2j+7sL0zblpvltkC6Ubj1UJ0pUzEzuQow8gJFYAZHSeztwIZT+5d/inYOTp6JwYpK4vCe/hoPi9gafr8QhJZo8h0lIDnrjaleg+50oEkM8U4macHOC79i6OODfD4DzGDQpweBc+V6TObvkoBIEUNEuybdgax7h9hxtuTDvU+y1B/tK7nCE582o0VwQmFd49mqERrfX219+EfzEcP+mN6J8ZLv+7D80crNhPZL9lEHj6sfQ/AAAA//8DAFBLAwQUAAYACAAAACEAE16+ZQUBAADfAgAACwD3AV9yZWxzLy5yZWxzIKLzASigAAIAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACsks9OwzAMxu9IvEOU++puIITQ0l0mpN0QKg9gEveP2sZRkkH39gQkBJVGuwPH2J8///wp29049OKNfGjZKrnOcinIajatrZV8KR9X91KEiNZgz5aUPFGQu+L6avtMPcY0FJrWBZFcbFCyidE9AATd0IAhY0c2dSr2A8b09DU41B3WBJs8vwP/20MWE09xMEr6g7mRojy5tHnZm6uq1bRnfRzIxjMrgMZI1pBZOZ/YfGzTNaJEX1NU0rB+SuUA6FyWsCWcJ9pcTvT3tTBQRIMRQbOneZ5PxRzQ+nKg5Yimip90xh7e2XevzN0cy+1/suhjiDwshPOl+UaCybcsPgAAAP//AwBQSwMEFAAGAAgAAAAhAHx3iGaIAgAArwUAAA8AAAB4bC93b3JrYm9vay54bWysVF1vmzAUfZ+0/4D8TrHNRxIUUi0l1SJtU7V27WPlghOsAEa2kxBV/e+7hpL046XqhsAX+8LxOfde3+l5W5XOjistZJ0gcoaRw+tM5qJeJ+jPzaU7Ro42rM5ZKWueoAPX6Hz29ct0L9XmQcqNAwC1TlBhTBN7ns4KXjF9Jhteg2clVcUMTNXa043iLNcF56YqPYpx5FVM1KhHiNVHMORqJTKeymxb8dr0IIqXzAB9XYhGD2hV9hG4iqnNtnEzWTUA8SBKYQ4dKHKqLF6ua6nYQwmyWxI6rYI7godgGOiwE7jebVWJTEktV+YMoL2e9Dv9BHuEvApB+z4GH0MKPMV3wubwyEpFn2QVHbGiExjB/4xGoLS6WokheJ9EC4/cKJpNV6Lkt33pOqxpfrHKZqpETsm0WeTC8DxBI5jKPT8tgCq1beZbUYKXEkwnyJsdy/lKOQBruLpSYseyA5wJ625VPET4yigH3pfpD9jlmu1gT1CWP5fkEkCJf19nKib3jxc4nWN/RN3LFAdukBLijkM/cv2RH45otPD9efAEYVFRnEm2NcWzHAudoMBG7K3rJ2sHD8HxVuQnGo/4+XKtfTMMvicrxx7cW
}
}

Thanks

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.