Change mapping in pipeline for ingest-attachment plugin

ottadini · January 29, 2019, 2:55am

I have a workflow for storing a range of binary document types: some are handled by Tika, others need pre-processing, with JSON generated in a script and indexed using PUT. For the documents handled by the ingest-attachment plugin, I want to 'flatten' the JSON for the document. Instead of having the "attachment" field with nested keys, I want those keys at the top level of the document. So instead of something like this in the indexed document:

{
    "attachment": {
        "date": "2017-03-28T05:12:37Z",
        "content_type": "application/pdf",
        "language": "en",
        "content": "Text from PDF document"
    }
}

I would want something like this (with some of the field names changed):

{
    "ModDate": "2017-03-28T05:12:37Z",
    "content_type": "application/pdf",
    "language": "en",
    "content": "Text from PDF document"
}

How would I achieve this? Is there anything stupid about this idea that I've not realised?
Can I change the field structure in the pipeline definition used with the plugin (I don't even know if "mapping" is the right terminology here), or do I need to reindex after the documents have been indexed?

The requirement comes about because most of the documents that I'm indexing need pre-processing in closed-source software. For those I don't send the binary document to ES, just a JSON document with a set of common fields (mostly the file metadata) and any number of auto-generated fields.

dadoonet · January 29, 2019, 4:06am

I believe that after the attachment processor you need to add the rename processor: https://www.elastic.co/guide/en/elasticsearch/reference/6.5/rename-processor.html
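
A minimal sketch of what such a pipeline body might look like, built in Python so it can be printed and sent with PUT _ingest/pipeline/<pipeline-name>. The source field name "data" and the helper build_flatten_pipeline are assumptions for illustration, not something taken from the original posts; the target names follow the example in the question.

```python
import json

# Hypothetical mapping from attachment sub-fields to flat top-level names.
# Only "date" is renamed to something different ("ModDate"), as in the question.
FIELD_MAP = {
    "date": "ModDate",
    "content_type": "content_type",
    "language": "language",
    "content": "content",
}


def build_flatten_pipeline(source_field="data", field_map=FIELD_MAP):
    """Return an ingest pipeline body: attachment first, then one rename per field.

    source_field is an assumption: the field holding the base64-encoded file.
    """
    processors = [{"attachment": {"field": source_field}}]
    for nested_key, flat_name in field_map.items():
        processors.append({
            "rename": {
                "field": f"attachment.{nested_key}",
                "target_field": flat_name,
                # Skip quietly when Tika did not produce this field.
                "ignore_missing": True,
            }
        })
    return {
        "description": "Extract attachment fields and move them to the top level",
        "processors": processors,
    }


if __name__ == "__main__":
    # Print the JSON body for the pipeline definition.
    print(json.dumps(build_flatten_pipeline(), indent=2))
```

Note that the rename processor throws an error if the target field already exists in the document, so the flat names should not collide with fields sent in the original request. Any attachment sub-fields not listed in the mapping stay under "attachment"; a remove processor could probably be appended to drop what is left.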

ottadini · January 31, 2019, 5:12am

Perfect, thank you.

