I did something dumb and deployed a service that populates a new field in our Elasticsearch index before applying the corresponding mapping to the index. As a result, about 40 documents were indexed with the default string analyzer instead of not_analyzed. I understand the reasoning for not allowing field mappings to be changed:
If a mapping already exists for a field, data from that field has probably been indexed. If you were to change the field mapping, the indexed data would be wrong and would not be properly searchable.
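For reference, this is roughly the mapping I meant to apply before deploying. Just a sketch using the Python client against a pre-5.x cluster; my_index, my_type, and my_new_field stand in for the real names:

```python
# The mapping that should have gone out first: the new field as a
# not_analyzed string (pre-5.x syntax; names are placeholders).
from elasticsearch import Elasticsearch

es = Elasticsearch()  # defaults to localhost:9200

es.indices.put_mapping(
    index="my_index",
    doc_type="my_type",
    body={
        "my_type": {
            "properties": {
                "my_new_field": {"type": "string", "index": "not_analyzed"}
            }
        }
    },
)
```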
What I'm wondering is whether I can just reindex the offending 40 documents, or even just delete the indexed data for that field, so that the corrupted data from that field no longer matters. This is a production index with almost 300k documents, so reindexing the entire thing and causing a temporary outage would be inconvenient.
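To make it concrete, this is roughly what I was picturing for handling just those 40 documents. Again only a sketch with the Python client, assuming a 2.x-era cluster and placeholder names, and it obviously only helps if the mapping could somehow be corrected first, which is the part I'm unsure about:

```python
# Find the ~40 docs that actually have the offending field and re-put their
# _source. Sketch only: assumes a 2.x-era cluster and placeholder names.
from elasticsearch import Elasticsearch

es = Elasticsearch()

offenders = es.search(
    index="my_index",
    body={"query": {"exists": {"field": "my_new_field"}}},
    size=100,  # comfortably more than the ~40 I expect
)["hits"]["hits"]

for hit in offenders:
    # Re-index each document in place; this only fixes anything if the field's
    # mapping could be corrected beforehand, which is the open question here.
    es.index(index="my_index", doc_type=hit["_type"], id=hit["_id"],
             body=hit["_source"])
```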
I never "fixed" them, they just never had the field to begin with. I would also be fine with deleting the field in the problem docs, but I'm not sure if that would solve my problem either.
Basically, what I was hoping was that if I deleted any references to that field, Elasticsearch would let me delete the mapping, since the indexed data for that field would be empty. But it sounds like that isn't possible.
I wouldn't mind if searching on that field didn't work for the 40 documents that were accidentally indexed, as long as docs going forward are indexed correctly.