Elastic Mapping explosion

Nikesh · January 2, 2019, 12:03pm

Hi,

I have few questions related to number of fields present in mapping and addition of new fields dynamically.
What can be the causes of mapping explosions?
Is it the high number of fields(in my case more than 1000 fields) present in the mapping file or huge number of documents present?
Is there any other reasons for mapping explosion?

Mark_Harwood · January 2, 2019, 12:11pm

Your source of data.
An example - getting the keys and values of something like "customer_id": "N343242394638" the wrong way round in your application code would be a good way of generating a lot of unique field names from the keys, all with the same customer_id value string.

Nikesh · January 2, 2019, 12:33pm

@Mark_Harwood Thanks for the response
It is understandable from your reply to avoid generating lot of unique fields names unnecessarily.
But in situations where in, it is not possible to avoid any fields and the number is over 1000, how can we prevent mapping explosion?
Is there any other reasons that can cause mapping explosions?
Also, What can be the consequences of mapping explosions?

Mark_Harwood · January 2, 2019, 12:41pm

By carefully controlling what JSON you pass or, if you can't, by declaring what your indexing policy is for any new fields - ignore, accept or error?

If you're not interested in searching or aggregating certain fields that may appear in your docs you can simply choose to ignore them in your index mappings. They'll still exist in the stored JSON blob but won't be unpacked and added to any kind of index or doc-values storage.

Anything that can introduce new fields into the provided JSON.

Elasticsearch rejections because you exceeded the permitted number of mapped fields. Each mapped field comes with overheads (disk + RAM) so it shouldn't become an unbounded collection.

Nikesh · January 2, 2019, 12:48pm

How/Where is mapping stored within Elastic? How much of overhead does it cause on disk and RAM?

Nikesh · January 2, 2019, 1:08pm

Thanks for quick replies
To add to my previous doubt,

Is there a fixed value for number of fields that has to be stored? I see the default value is 1000 fields. But I have a situation where I have to store 1500 fields.
Is there any alternative to this mapping explosion prevention?

Mark_Harwood · January 2, 2019, 1:53pm

In the "cluster state" which is shared with every node.

A small part of the overhead is fixed (the set of fields definitions in cluster state) and the larger part varies with the number of documents in the index. More fields = more entries in the search index data structures and RAM-based caches.

See Mapping | Elasticsearch Guide [8.11] | Elastic
Example use here

Nikesh · January 3, 2019, 6:33am

Thanks for the response.
I have gone through the provided links, I understand that default limit of 1000 fields.
I have 2500 static fields specified at the time index creation. Is there a specific number of static field that can cause mapping explosion ?

Mark_Harwood · January 3, 2019, 9:33am

No, the same way there isn't a specific number that causes "a large crowd of people".

Nikesh · January 8, 2019, 6:51am

okay Thanks for the reply.
An add on to my previous question,
If there are 2000 fields in my mapping file but the number of documents I am indexing is low (50,000).
Can with such low data a mapping explosion occur?

Mark_Harwood · January 8, 2019, 10:00am

We may be talking at cross-purposes.
I don't think of "a mapping explosion" as a specific error or event.
I think of it as a general condition of having a lot of fields.

It's a condition that can lead to a number of problems (memory pressure, delays publishing cluster state..) and is the reason we introduced a soft-limit to the number of fields in mappings.

If there are 2000 fields in my mapping file but the number of documents I am indexing is low (50,000).

Sounds like a lot of fields for users to consider/search but shouldn't be too much of a problem.

Nikesh · January 14, 2019, 1:43pm

Thanks @Mark_Harwood for the response. I want to understand few more things. Can you please provide a link of things that can help me understand the following?
How does Mapping work internally within Elastic?
Is it referred to, for every search query?

system · February 11, 2019, 1:43pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Elasticsearch - Is there any certain number for mapping explosion Elasticsearch	2	654	February 11, 2019
Reason behind Mapping Explosion in Elasticsearch Elasticsearch	1	548	February 11, 2019
Mapping explosion, how to refactor? Elasticsearch	6	424	October 7, 2020
How do I handle mapping explosion? Elasticsearch	3	1437	February 21, 2018
Implications and reasons of field mapping limit Elasticsearch	1	24	October 7, 2024

Elastic Mapping explosion

Related topics