Logstash 2.1: Dynamically Altering Field Names

dawiro · December 10, 2015, 8:43am

Hi,
I've seen an issue in recent days with sending data to elasticsearch 2.1 where inbound messages contained an "_uid" field which clashed with the type of the in-built meta-field "_uid". This led to indexing failing, shards becoming unassigned and the cluster going into a red state.

Given elasticsearch's apparent heightened sensitivity around type handling I feel I need to strip any leading underscores from inbound field names as a safety measure. However, as our log formats vary widely we do not know in andvance what the field names will be. Also, we do virtually no filtering in logstash as we pre-format our log lines as json. So our exposure to filtering is very limited...

So, can someone show me how I can parse inbound messages to strip leading underscores from field names? Also, am I taking the right approach here? Could I mitigate this issue by modifying the logstash template?

Regards,
David

magnusbaeck · December 10, 2015, 10:29am

You can use a ruby filter for this. This should be reasonably close to actually working:

ruby {
  code => "
    event.to_hash.each_item {|k, v|
      if k.start_with? '_'
        event.remove(k)  # or is it .delete, I don't remember
        event[k.gsub('^_', '')] = v
      end
    }
  "
}

dawiro · December 10, 2015, 11:34am

Thanks for that but it doesn't work. "k" is getting set to the name of the first argument of the input I'm using to test this, which is the path to the input data.

magnusbaeck · December 10, 2015, 11:37am

Sorry, I don't get it. What do your messages look like?

dawiro · December 11, 2015, 3:28pm

I got it to work with this:

filter {
ruby {
code => "
event.to_hash.select{|k,v| k.start_with?('_')}.each() {|k, v| event.remove(k); event.append({k.slice(1,k.length) => v}) }
"
add_tag => "stripped"
}
}

whybangbang · January 28, 2016, 4:46am

thank your submit,we find the same problem,this is a bug in 2.1?

Topic		Replies	Views
Elasticsearch 2.1 _uid is reformated, shard will failed Elasticsearch	2	675	July 5, 2017
Logstash 2.x : Dynamic Mapping Logstash	10	1384	July 6, 2017
Elasticsearch 2.1: Regarding Meta-Fields and Dynamic Mapping Elasticsearch	2	522	July 5, 2017
Underscore preventing us from upgrading from 1.7.1 to 2.x Logstash	1	419	July 6, 2017
Remove part of field name from json inputted fields Logstash	9	3768	July 6, 2017

Logstash 2.1: Dynamically Altering Field Names

Related topics