Field maxlength?

CraigFoote · March 24, 2016, 2:56pm

I have a java log with some entries having stacktraces. I'm using the multiline option in filebeat and a grok filter in logstash with the pattern ending in GREEDYDATA, i.e.:

SNIP%{GREEDYDATA:logmessage}

This works fine with logmessage sometimes having up to 100 lines though. In Kibana's Discover tab I see the whole stacktrace but in visualizations they appear empty, as if there was no data in that field. Could this be caused by a maximum field length in Kibana visualizations? I'm using version 4.4.0.

LeeDr · March 24, 2016, 10:38pm

I'm going to try to reproduce your issue but need to create some test data... But let me know if you already figured it out.

LeeDr · March 24, 2016, 11:23pm

Can you show a screenshot of what you see? Do you have the .raw field for the long string field?
In my test I don't (yet) and so I only get the first word of the long string field instead of the whole string.

LeeDr · March 24, 2016, 11:40pm

Once I added the .raw field I can see a very long string (over 4000 characters) in a visualization (data table) (I actually did one more with a string over 8000 characters and it all shows up);

CraigFoote · March 29, 2016, 1:08pm

Thanks Lee. BTW, I'm speaking with Jay Greenberg about this. Sorry about the crosspost. The only difference I see compared to yours is mine is a Java stacktrace and so typically starts with something like "org.something.something.SomeException: some message. \n\t some other lines". That and one stacktrace is 14000 characters.

PhaedrusTheGreek · March 29, 2016, 5:11pm

There is a maximum term length under Lucene.

You can avert this behaviour by using the ignore-above setting to ensure you come in under the limit. Something like this should work:

{
"exception": {
     "mapping": {
        "type": "string",
        "fields": {
           "raw": {
              "ignore_above": 10922,
              "index": "not_analyzed",
              "type": "string"
           }
        }
     }
  }
}

From the docs:

The value for ignore_above is the character count, but Lucene counts bytes. If you use UTF-8 text with many non-ASCII characters, you may want to set the limit to 32766 / 3 = 10922 since UTF-8 characters may occupy at most 3 bytes.

Topic		Replies	Views
Kibana not displaying fields with more than 300 characters Kibana	3	633	September 6, 2018
Context length is bigger than maximum allowed string Kibana	4	255	May 30, 2024
Not able to process large lines of log data into elasticsearch Elasticsearch	5	3142	December 21, 2016
Logstash: max_bytes_length_exceeded exception Logstash	4	4124	February 16, 2018
Any way to limit field length? Logstash	5	9429	July 6, 2017

Field maxlength?

Related topics