Troubles with Cyrillic data

Hello!

I have problems putting Cyrillic data into indices.
I have messages like this one:

13:32.907017-17,CALL,1,p:processName=ut,Context=ОбщийМодуль.Вызов : ОбщийМодуль.прОбщегоНазначенияСервер.Модуль.ПолучитьТекстОповещенийПользователю,MemoryPeak=926485,InBytes=0,OutBytes=0,CpuTime=0

filebeat.yml

filebeat.inputs:
- type: log
  enabled: true
  paths:
    - C:\1C\calls\rphost**.log
  fields:
    type: onec_call
  fields_under_root: true
  encoding: utf-8

I created this Logstash config:

input {
  stdin {
    codec => json { charset=>"UTF-8" }
  }
}
filter {
  if [type] == "onec_call" {
    grok {
      match => { "message" => ["%{NUMBER:num_min}:%{BASE10NUM:num_sec}-%{WORD:duration},%{WORD:event1c},%{WORD:level_event},p:processName=%{WORD:process1c},Context=%{WORD:context}"] }
    }
  }
}
output {
  stdout { codec => rubydebug }
  elasticsearch {
    index => "onectj-%{+yyyy.MM.dd}"
  }
}
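
As I understand it, with the stdin/json input above a test event has to arrive as a JSON document carrying the type field, otherwise the conditional in the filter block never matches. Something like this should exercise the filter (onec.conf is just a placeholder name for the file with this config):

# onec.conf is a placeholder file name for the config above; on Windows use bin\logstash.bat
echo '{"type":"onec_call","message":"13:32.907017-17,CALL,1,p:processName=ut,Context=ОбщийМодуль.Вызов : ОбщийМодуль.прОбщегоНазначенияСервер.Модуль.ПолучитьТекстОповещенийПользователю,MemoryPeak=926485,InBytes=0,OutBytes=0,CpuTime=0"}' | bin/logstash -f onec.conf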

In Kibana, my records are tagged with beats_input_codec_plain_applied and _grokparsefailure.
If I delete Context=%{WORD:context} (the Cyrillic part) from the filter block, it works fine.
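
My guess is that %{WORD} (\b\w+\b) simply cannot cover the Context value, which contains dots, spaces and a colon (and, as far as I can tell, \w does not match the Cyrillic letters here either). A variant I am considering, as a sketch rather than a tested fix, is to capture that field with a custom pattern that runs up to the next comma; this assumes the Context value itself never contains a comma:

filter {
  if [type] == "onec_call" {
    grok {
      # (?<context>[^,]+) captures everything up to the next comma instead of a single "word";
      # assumption: the Context value never contains a comma of its own
      match => { "message" => ["%{NUMBER:num_min}:%{BASE10NUM:num_sec}-%{WORD:duration},%{WORD:event1c},%{WORD:level_event},p:processName=%{WORD:process1c},Context=(?<context>[^,]+)"] }
    }
  }
}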

I tried setting codec => plain { charset => "UTF-8" } in the input and removing all codec instructions, but without any result.
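
For reference, the plain-codec variant I tried looked roughly like this; the beats input and port 5044 are assumptions on my side (the real pipeline receives from Filebeat, judging by the beats_input_codec_plain_applied tag), not a confirmed working setup:

input {
  beats {
    # 5044 is the usual Beats port; adjust to whatever port Filebeat actually ships to
    port => 5044
    codec => plain { charset => "UTF-8" }
  }
}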

What am I doing wrong?
