Hi
We have some folks trying to ingest files(pdfs) using the ingest plugin however the data is coming in as encrypted rawContent instead of searchable text - at least in the case of PDFs. Is there a way to convert from that to searchable text?
Here are the settings they're using:
{
"attachments": {
"description": "Document attachment pipeline",
"processors": [
{
"attachment": {
"field": "rawContent",
"target_field": "attachment"
}
},
{
"remove": {
"field": "rawContent"
}
}
]
},
Here is sample daa:
{
"_index" : "index-name",
"_type" : "contentdto",
"_id" : "eneZp2kB2sqO6dVfmlYf",
"_version" : 1,
"_seq_no" : 8299,
"_primary_term" : 2,
"found" : true,
"_source" : {
"contentID" : 45629,
"keywords" : "Twenty Eight",
"rawContent" : "UEsDBBQABgAIAAAAIQCv/zHFjwEAAJkGAAATANcBW0NvbnRlbnRfVHlwZXNdLnhtbCCi0wEooAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAALyVTU/CQBCG7yb+h2avhi5gYoyhcPDjqCRi4nXtDnTDfmVnQPj3bheshiCIEC/dtJ1532emM2lvsDA6m0NA5WzBOnmbZWBLJ5WdFOxl9NC6ZhmSsFJoZ6FgS0A26J+f9UZLD5jFbIsFq4j8DedYVmAE5s6DjW/GLhhB8TZMuBflVEyAd9vtK146S2CpRbUG6/fuYCxmmrL7RXy8IgmgkWW3q8Daq2DCe61KQZGUz63ccGmtHfKYmWKwUh4vIgbjWx3qNz8brPOeYmuCkpANRaBHYSIGX2j+7sL0zblpvltkC6Ubj1UJ0pUzEzuQow8gJFYAZHSeztwIZT+5d/inYOTp6JwYpK4vCe/hoPi9gafr8QhJZo8h0lIDnrjaleg+50oEkM8U4macHOC79i6OODfD4DzGDQpweBc+V6TObvkoBIEUNEuybdgax7h9hxtuTDvU+y1B/tK7nCE582o0VwQmFd49mqERrfX219+EfzEcP+mN6J8ZLv+7D80crNhPZL9lEHj6sfQ/AAAA//8DAFBLAwQUAAYACAAAACEAE16+ZQUBAADfAgAACwD3AV9yZWxzLy5yZWxzIKLzASigAAIAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACsks9OwzAMxu9IvEOU++puIITQ0l0mpN0QKg9gEveP2sZRkkH39gQkBJVGuwPH2J8///wp29049OKNfGjZKrnOcinIajatrZV8KR9X91KEiNZgz5aUPFGQu+L6avtMPcY0FJrWBZFcbFCyidE9AATd0IAhY0c2dSr2A8b09DU41B3WBJs8vwP/20MWE09xMEr6g7mRojy5tHnZm6uq1bRnfRzIxjMrgMZI1pBZOZ/YfGzTNaJEX1NU0rB+SuUA6FyWsCWcJ9pcTvT3tTBQRIMRQbOneZ5PxRzQ+nKg5Yimip90xh7e2XevzN0cy+1/suhjiDwshPOl+UaCybcsPgAAAP//AwBQSwMEFAAGAAgAAAAhAHx3iGaIAgAArwUAAA8AAAB4bC93b3JrYm9vay54bWysVF1vmzAUfZ+0/4D8TrHNRxIUUi0l1SJtU7V27WPlghOsAEa2kxBV/e+7hpL046XqhsAX+8LxOfde3+l5W5XOjistZJ0gcoaRw+tM5qJeJ+jPzaU7Ro42rM5ZKWueoAPX6Hz29ct0L9XmQcqNAwC1TlBhTBN7ns4KXjF9Jhteg2clVcUMTNXa043iLNcF56YqPYpx5FVM1KhHiNVHMORqJTKeymxb8dr0IIqXzAB9XYhGD2hV9hG4iqnNtnEzWTUA8SBKYQ4dKHKqLF6ua6nYQwmyWxI6rYI7godgGOiwE7jebVWJTEktV+YMoL2e9Dv9BHuEvApB+z4GH0MKPMV3wubwyEpFn2QVHbGiExjB/4xGoLS6WokheJ9EC4/cKJpNV6Lkt33pOqxpfrHKZqpETsm0WeTC8DxBI5jKPT8tgCq1beZbUYKXEkwnyJsdy/lKOQBruLpSYseyA5wJ625VPET4yigH3pfpD9jlmu1gT1CWP5fkEkCJf19nKib3jxc4nWN/RN3LFAdukBLijkM/cv2RH45otPD9efAEYVFRnEm2NcWzHAudoMBG7K3rJ2sHD8HxVuQnGo/4+XKtfTMMvicrxx7cW
}
}
Thanks