Terms aggregation on field with types in different mapping

elssar · July 27, 2016, 8:03am

So I have a bunch of indices, rotated monthly, with two mappings - server, client. They both have a field user_id. In server, the user_id is a string, and in client, it is mostly a string, but in one months index, it got indexed as an integer.

When I try to do a terms aggregation on the user_id field, I get an error - ClassCastException[org.elasticsearch.search.aggregations.bucket.terms.StringTerms$Bucket cannot be cast to org.elasticsearch.search.aggregations.bucket.terms.LongTerms$Bucket]

When I query the monthly indices which have the correct mapping, I don't get the error.

This is fine, and expected.

But what I don't understand is why do I get this error even when I specify a mapping?

That is, /events-*/server/_search, and then do a terms aggregation on the user_id field.

1222kkk · July 27, 2016, 8:10am

StringTerms ,LongTerms ,field type different and confilts?

danielmitterdorfer · July 27, 2016, 8:46am

Hi @elssar,

Any chance you allow dynamic field mapping, the user id was not in the mapping and encountered for the first time?

Anyway, you can correct the problem by creating a new index with the correct mapping and using the reindex API to reindex the data. See index aliases and zero downtime on how to achieve this without affecting your users.

Daniel

elssar · July 27, 2016, 10:23am

Hi @danielmitterdorfer,

I understand that I have to reindex (I've reindexed more times than I'd like to admit ).

I think my last post didn't explain my query well enough.

What I don't understand is why would a field having conflicting types in two different mappings cause problems when only aggregating over one mapping.

That is,

{
  "events": {
    "mappings": {
      "server": {
        "user_id": {
          "full_name": "user_id",
          "mapping": {
            "user_id": {
              "type": "string",
              "index": "not_analyzed"
            }
          }
        }
      },
      "client": {
        "user_id": {
          "full_name": "user_id",
          "mapping": {
            "user_id": {
              "type": "long"
            }
          }
        }
      }
    }
  }
}

Now when I run a termsn aggregation on the user_id field, in /events/server, why does the mapping for user_id in client matter? Shouldn't they be independent of each other. I under that I'd get an error if I sent an query to /events.

danielmitterdorfer · July 27, 2016, 10:35am

Hi @elssar,

oh, yes. I totally misunderstood.

when I run a termsn aggregation on the user_id field, in /events/server, why does the mapping for user_id in client matter? Shouldn't they be independent of each other.

Your assumption is not correct. Fields with the same name must have the same mapping (if they are in the same index). For more details see:

Conflicts between fields in different types in the reference documentation
The blog post index vs type

Daniel

elssar · July 27, 2016, 10:41am

@danielmitterdorfer ah, that explains it then. Thank you

danielmitterdorfer · July 27, 2016, 10:44am

Sure, you're welcome.

Topic		Replies	Views
Possible aggregation bug Elasticsearch	3	1234	July 6, 2017
Elastic mapping Issue Elasticsearch	6	29	September 6, 2024
Fields with same name mapped differently in different types Elasticsearch	4	399	July 6, 2017
Mapping conflict! Elasticsearch	7	29043	July 5, 2017
JSON mapping limitations, was: Getting MapperParsingException while parsing a string and a number? Elasticsearch	5	644	July 6, 2017

Terms aggregation on field with types in different mapping

Related topics