Reprocess documents

Zachary_Buckholz · September 27, 2018, 8:55pm

I have documents that currently exist with a single _message field containing raw json. I'd like to reprocess all these documents and run them through a json parser to update the document and add fields / values.

What's the best approach for this?
External script to query, find a record needing updates, update it, then push it back to es as an update, or some other way?

Thanks
Zach

Christian_Dahlqvist · September 28, 2018, 7:33am

You may want to look into using the reindex API together with an ingest node pipeline. This should work as long as all data is present within each document. If you want to parse and enrich it based on external data, you may need a different approach, e.g. Logstash.

Zachary_Buckholz · September 28, 2018, 1:04pm

That seems like a great idea, thank you for pointing it out.

system · October 26, 2018, 1:04pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How to process again documents already ingested Elasticsearch	4	479	October 30, 2020
Elasticsearch Update Doc String Replacement Elasticsearch	6	864	March 22, 2018
Suggestions for reindexing individual documents Elasticsearch	4	315	July 6, 2017
Update documents with reindex Elasticsearch	2	768	January 26, 2018
Re: Only update a single field in existing document Elasticsearch	2	391	July 6, 2017

Reprocess documents

Related topics