Coerce object to String

mhughes · July 29, 2016, 3:51pm

I'm currently ingesting documents that for the most part are very structured. However, there are two fields in the document that contain JSON documents. With dynamic mapping, ES is treating each key of the JSON document as a field. As each JSON document has different keys, this is leading to a mapping explosion that affects Kibana and ES search performance. To give you some idea, the mapping JSON for this index is 1.5 MB.

I'm trying to write a template that will treat the entire JSON document as a string:

{
  "type": "string",
  "coerce": true
}

but ES doesn't like this. I could use {"type": "object", "enabled": false} but that's not exactly what I'm looking for. That keeps the field as part of the document, but it's not searchable. I want full-text search on the JSON document as a string. I don't need to be able to search by JSON.someKey.someNestedKey. Is this possible?

polyfractal · July 29, 2016, 4:06pm

Nope, it's not unfortunately. =( ES will always try to treat the JSON as an actual JSON object.

You'll have to preprocess your documents somehow and serialize the JSON into a string. Either in your application, or perhaps something like Logstash.
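
A minimal sketch of the application-side approach, assuming Python and two hypothetical field names, `payload` and `metadata` (neither comes from the original post). It serializes only the object-valued fields into JSON strings before the document is indexed:

```python
import json

# Hypothetical field names; replace with the two fields that hold free-form JSON.
JSON_FIELDS = ("payload", "metadata")


def stringify_json_fields(doc, fields=JSON_FIELDS):
    """Return a copy of doc with the given object-valued fields serialized to JSON strings."""
    out = dict(doc)
    for name in fields:
        value = out.get(name)
        # Only serialize containers; leave strings, numbers and missing fields untouched.
        if isinstance(value, (dict, list)):
            out[name] = json.dumps(value, sort_keys=True)
    return out


doc = {"id": 1, "payload": {"someKey": {"someNestedKey": "value"}}}
print(stringify_json_fields(doc))
```

Because the field now arrives as a string, it can be mapped as a plain string field and searched as full text, and the nested keys no longer add entries to the mapping.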

mhughes · July 29, 2016, 4:49pm

Would there be interest in a PR? I would think others have this same problem. Perusing XContentParser, it doesn't look that complicated.

polyfractal · July 29, 2016, 5:05pm

Maybe? To be honest, I'm not super familiar with the parsing code and the roadmap there. I know a lot of work has gone into simplifying it and making it more consistent. This may take it in the opposite direction, as it makes the parsing more ambiguous.

I'd open an issue first, instead of a PR, to gauge interest from devs more involved in that part of the code. That way you won't waste time on a PR if there is strong resistance.

Assuming it's a desired feature though, I'm sure a PR would be very appreciated!

mhughes · July 29, 2016, 6:40pm

Created issue: https://github.com/elastic/elasticsearch/issues/19691

IMO it seems more consistent to support coerce on as many data types as possible. Why limit it to only String -> Number?
