12K fields in the mapping


We're approaching 12K fields in our mapping. Our cluster does not hold that many documents (about 1 billion, one document per user, so it is not growing very rapidly).

I'd like to know: how much further can we grow the number of fields before the cluster becomes unusable? Which metrics will we see start to degrade first?

Another question: how can we redesign our cluster once we hit this limit? We would like to avoid partitioning our customers across several clusters/indices just to keep the mapping size low.


Welcome to our community! :smiley:

What's the reasoning behind having so many fields?


The reason is that the index serves a lot of customers, where each customer can define their own set of fields (up to 100), and we use dynamic mapping for this.
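To illustrate why the field count grows: with dynamic mapping, indexing a document that contains a previously unseen field silently adds that field to the index mapping. A minimal sketch (the index and field names here are made up for illustration):

```json
PUT customer-data/_doc/1
{
  "customer_id": "acme",
  "custom_priority": "urgent"
}

GET customer-data/_mapping
```

After the first request, `custom_priority` shows up in the mapping (by default as a `text` field with a `keyword` subfield), so every new per-customer field permanently enlarges the mapping.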

Indeed, mapping explosion is a valid concern. A large mapping takes heap memory; it is part of the cluster state that gets synchronized between nodes, and the bigger the cluster state, the slower that synchronization can be. The size of the mapping also affects indexing speed, because as you index a new document, its fields are checked against the existing index mapping.

For an index with a very large number of fields, we recommend using the ES flattened field type. It has some limitations for search, but from the ES point of view it is a single field: all of its subfields, with their names and values, are stored on disk, which allows an effectively "unlimited" number of subfields.
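A sketch of what such a mapping could look like (the index and field names are hypothetical):

```json
PUT customer-data
{
  "mappings": {
    "properties": {
      "customer_id": { "type": "keyword" },
      "custom": { "type": "flattened" }
    }
  }
}

PUT customer-data/_doc/1
{
  "customer_id": "acme",
  "custom": { "priority": "urgent", "region": "emea" }
}
```

Here `custom` counts as a single field in the mapping no matter how many subfields customers add. Leaf values are indexed as keywords, so exact-match queries like a term query on `custom.priority` work, but analyzed full-text search on subfields does not.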


Thanks. How does it compare to the method where you create an array like this:

    { "keyAndValue": "priority:urgent" },
    { "keyAndValue": "priority:high" }

in terms of performance?

It depends on what kind of queries you are using.
If you filter or search on that field, keyAndValue may not be a good option in most cases, and a flattened field is better.
If you only filter or search on other fields and just store and retrieve this data, you can use an object field with enabled: false.
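For the second case, a minimal sketch of such a mapping (index and field names are made up):

```json
PUT customer-data
{
  "mappings": {
    "properties": {
      "custom": {
        "type": "object",
        "enabled": false
      }
    }
  }
}
```

With `enabled: false`, everything under `custom` is kept in `_source` and returned when you fetch the document, but it is not parsed or indexed at all: it adds nothing to the mapping and cannot be searched.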

