Reindex with text replace

Martin_Fredriksson · May 27, 2019, 12:23pm

I was trying to reindex a large index when I discovered that every value of one field has a trailing \r\n. Since the mapping for this field in the new index is a double, not a string, the reindex fails.

How do I replace/trim these values during the reindex process?

gbrown · May 28, 2019, 11:10pm

If all you want to do is trim whitespace on that one field, you can create an ingest pipeline with a trim processor for the field, then specify the pipeline in your reindex request like so:

POST _reindex
{
  "source": {
    "index": "source_index"
  },
  "dest": {
    "index": "dest_index",
    "pipeline": "trim_my_double_field_pipeline"
  }
}

(The example above is adapted, with minor changes, from the Reindex docs.)
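
The reindex request above assumes the pipeline already exists. A minimal sketch of what it might look like, assuming the affected field is called my_double_field (a placeholder, since the real field name isn't given in this thread):

```
PUT _ingest/pipeline/trim_my_double_field_pipeline
{
  "description": "Strip leading/trailing whitespace (including \\r\\n) before indexing",
  "processors": [
    {
      "trim": {
        "field": "my_double_field"
      }
    }
  ]
}
```

Once the value is trimmed, a numeric string should be accepted by the double mapping as long as coercion is left at its default. If some documents may lack the field, the trim processor's ignore_missing option is worth a look.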

Alternatively, you could use a script in the reindex request, but using an ingest pipeline is probably easier.
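
For comparison, a rough sketch of the script approach, again assuming the field is named my_double_field (a placeholder) and that the bad values are stored as strings in _source:

```
POST _reindex
{
  "source": {
    "index": "source_index"
  },
  "dest": {
    "index": "dest_index"
  },
  "script": {
    "lang": "painless",
    "source": "if (ctx._source.my_double_field instanceof String) { ctx._source.my_double_field = ctx._source.my_double_field.trim(); }"
  }
}
```

The instanceof check is there so that documents where the field is missing or already numeric pass through unchanged.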

Martin_Fredriksson · May 29, 2019, 7:30am

Thank you Gordon!

Martin_Fredriksson · May 29, 2019, 7:40am

It works! And I learned something...

