Logstash filter and output use only one CPU core

saisimo02 · November 6, 2018, 4:01pm

Hello, I am using Logstash to parse my XML files, but increasing the number of workers has no effect: when I run top -H, only one worker thread is busy (~98% CPU).

I am using 8 GB for the JVM and 4 workers in logstash.yml.

logstash.conf

input {
  beats {
    port => 5044
    client_inactivity_timeout => 5000
  }
}
filter {......}
output {
  if [theXML][measValue][r][p] == [theXML][measType][p] {
    elasticsearch {
      action => "index"
      hosts => ["172.25.196.103:9200"]
      index => "nokia-%{SDL}-%{+YYYY.MM.dd}"
    }
    stdout { codec => rubydebug }
  }
}

logstash.yml

The output of top -H

My question is: why do I see only one worker running even though I have set 4 workers in my logstash.yml?

I am using Logstash 6.4.

Thanks

Christian_Dahlqvist · November 6, 2018, 4:19pm

What does your filter section look like?

saisimo02 · November 7, 2018, 8:10am

My filter section:

filter {
  xml {
    source => "message"
    store_xml => true
    target => "theXML"
    force_array => false
  }

  split { field => "[theXML][measValue][r]" }
  split { field => "[theXML][measValue]" }
  split { field => "[theXML][measType]" }

  kv {
    source => "[theXML][measValue][measObjLdn]"
    field_split => "=,"
  }

  if "access" in [theXML][measValue][measObjLdn] {
    mutate { add_field => { "VNFc" => "access" } }
  }
  if "storage" in [theXML][measValue][measObjLdn] {
    mutate { add_field => { "VNFc" => "storage" } }
  }
  if "diag" in [theXML][measValue][measObjLdn] {
    mutate { add_field => { "VNFc" => "diag" } }
  }
  if "ops" in [theXML][measValue][measObjLdn] {
    mutate { add_field => { "VNFc" => "ops" } }
  }
  if "tele" in [theXML][measValue][measObjLdn] {
    mutate { add_field => { "VNFc" => "tele" } }
  }
  if "ntf" in [theXML][measValue][measObjLdn] {
    mutate { add_field => { "VNFc" => "ntf" } }
  }

  mutate { convert => { "[theXML][measValue][r][content]" => "float" } }
  mutate { convert => { "[theXML][measValue][r][p]" => "integer" } }
  mutate { convert => { "[theXML][measType][p]" => "integer" } }

  mutate {
    remove_field => [
      "AZ", "CoreId", "ConfiguredMemory", "path", "RGN",
      "[beat][hostname]", "[beat][hostname][keyword]",
      "[beat][name]", "[beat][name][keyword]",
      "[beat][version]", "[beat][version][keyword]",
      "[host][name]", "[host][name][keyword]",
      "[input][type]", "[input][type][keyword]",
      "[offset]",
      "[prospector][type]", "[prospector][type][keyword]",
      "[tags][keyword]",
      "[theXML][granPeriod][duration]", "[theXML][granPeriod][duration][keyword]",
      "[theXML][repPeriod][duration][keyword]", "[theXML][repPeriod][duration]",
      "[theXML][measType][content][keyword]"
    ]
  }

  date {
    match => ["[theXML][granPeriod][endTime]", "M dd yy HH:mm"]
  }

  mutate { rename => { "[theXML][granPeriod][endTime]" => "time" } }
  mutate { rename => { "[theXML][measInfoId]" => "measInfoId" } }
  mutate { rename => { "[theXML][measType][content]" => "Type" } }
  mutate { rename => { "[theXML][measValue][measObjLdn]" => "ObjLdn" } }
  mutate { rename => { "[theXML][measValue][r][content]" => "Value" } }
}

Christian_Dahlqvist · November 7, 2018, 8:16am

The fact that 98% of CPU is used does not necessarily mean that only one processing thread is active. Logstash can only process as fast as the downstream systems are able to accept the data, so it is useful to verify that Elasticsearch is not the bottleneck here. What do CPU usage, disk I/O and iowait look like on your Elasticsearch cluster? How many indices and shards are you actively writing into? Do you see any errors or warnings in the Elasticsearch logs? What is the specification of your Elasticsearch cluster and what throughput are you seeing?

If this is not the bottleneck, have you verified that you have enough data coming in to keep all pipeline threads busy?
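
One way to put numbers on both questions, sketched under assumptions: Logstash exposes a monitoring API (port 9600 by default), and `GET /_node/stats/pipelines` reports cumulative event counters per pipeline. Sampling it twice and taking the difference gives events per second in and out. The snippet below only does the arithmetic on two already-fetched snapshots; the pipeline name `main`, the helper name `event_rates` and the sample numbers are illustrative and not taken from this thread.

```python
def event_rates(before, after, seconds, pipeline="main"):
    """Return (in_rate, out_rate) in events/second between two snapshots.

    `before` and `after` are parsed JSON bodies of
    GET http://localhost:9600/_node/stats/pipelines (assumed default port),
    taken `seconds` apart.
    """
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    ev_before = before["pipelines"][pipeline]["events"]
    ev_after = after["pipelines"][pipeline]["events"]
    in_rate = (ev_after["in"] - ev_before["in"]) / seconds
    out_rate = (ev_after["out"] - ev_before["out"]) / seconds
    return in_rate, out_rate


# Made-up snapshots, 10 seconds apart, for illustration only.
snap_1 = {"pipelines": {"main": {"events": {"in": 1000, "out": 900}}}}
snap_2 = {"pipelines": {"main": {"events": {"in": 6000, "out": 5900}}}}
print(event_rates(snap_1, snap_2, seconds=10))  # (500.0, 500.0)
```

A low incoming rate would suggest the Beats side is not sending enough to keep several workers busy. If the rates plateau while Elasticsearch shows high iowait or rejections, the output is the more likely limit.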

saisimo02 · November 7, 2018, 8:32am

I have 3 VMs, each running one instance of Logstash, Elasticsearch and Kibana.
The first machine: 8 vCPUs, 48 GB memory and a 200 GB HDD (I send data to the Logstash running here, which forwards it to the third machine; the Elasticsearch node here is a master).
The second machine: 4 vCPUs, 48 GB memory and a 200 GB HDD (another Elasticsearch instance running here as a master).
The third machine: 4 vCPUs, 48 GB memory and a 200 GB HDD (another Elasticsearch instance running here with node.master: false and node.data: true).

Filebeat ships data from 3 servers and sends it to Logstash (VM1).

I used this config with another Filebeat that shipped data from only one server and it worked fine, but with 3 servers I am losing a lot of data in Elasticsearch.

As you can see, there are only 3 indices, because the index name in my Logstash output contains the name of the server.

How can I check that Logstash is receiving enough data? In this case I am sending 3 times more data than in my first test.

Christian_Dahlqvist · November 7, 2018, 8:37am

Having 2 master-eligible nodes in a cluster is bad practice as you should always aim to have at least 3. I would therefore recommend you make all 3 nodes master eligible and also make sure you set minimum_master_nodes to 2.
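
The arithmetic behind that recommendation, as a small sketch: the usual rule for minimum_master_nodes is a majority of the master-eligible nodes, that is (N / 2) + 1 with integer division. The helper name `quorum` is made up for illustration.

```python
def quorum(master_eligible_nodes):
    """Majority of master-eligible nodes: (N // 2) + 1."""
    if master_eligible_nodes < 1:
        raise ValueError("need at least one master-eligible node")
    return master_eligible_nodes // 2 + 1


for n in (2, 3):
    q = quorum(n)
    # n - q is how many nodes can fail while a majority can still elect a master.
    print(f"{n} master-eligible nodes: quorum {q}, tolerates {n - q} failure(s)")
```

With 2 master-eligible nodes the quorum is also 2, so losing either node blocks master election, and lowering the setting to 1 instead opens the door to split brain. With 3 nodes and a quorum of 2, one node can fail safely.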

It would be great if you could also answer my other questions, e.g. about disk I/O and observed throughput.

saisimo02 · November 7, 2018, 8:45am

Do you mean this?

Christian_Dahlqvist · November 7, 2018, 8:46am

Is this during indexing? Is this on the node being indexed into?

saisimo02 · November 7, 2018, 9:02am

I restarted ELK to check the value; I am getting an iowait between 0% and 2% at most.

But I am seeing something weird: [screenshot: Capture5]

Christian_Dahlqvist · November 7, 2018, 10:53am

It may be that your VMs are overcommitted with respect to CPU and/or memory, so the VMs do not actually have access to the resources you have assigned. That could certainly explain why you do not get higher utilisation, and is something you should check.
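
One rough way to check for CPU overcommit from inside a Linux guest is the steal counter. The aggregate `cpu` line of `/proc/stat` lists cumulative ticks in the order user, nice, system, idle, iowait, irq, softirq, steal, and a steal share that stays high under load suggests the hypervisor is giving the VM's CPU time to other guests. A sketch that compares two samples of that line; the function name and the sample values are illustrative:

```python
def steal_percent(cpu_line_before, cpu_line_after):
    """Percentage of CPU time stolen between two `cpu ...` lines of /proc/stat."""
    def parse(line):
        fields = line.split()
        if fields[0] != "cpu":
            raise ValueError("expected the aggregate 'cpu' line")
        # First eight counters: user nice system idle iowait irq softirq steal.
        # guest/guest_nice are skipped because they are already part of user/nice.
        return [int(v) for v in fields[1:9]]

    before, after = parse(cpu_line_before), parse(cpu_line_after)
    deltas = [a - b for a, b in zip(after, before)]
    total = sum(deltas)
    return 100.0 * deltas[7] / total if total else 0.0


# Made-up samples for illustration.
before = "cpu 1000 0 500 8000 100 0 0 400 0 0"
after = "cpu 1100 0 550 8600 110 0 0 640 0 0"
print(round(steal_percent(before, after), 1))  # 24.0
```

On a live system the two inputs would be the first line of /proc/stat read a few seconds apart. The %st column in top reports the same quantity.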

saisimo02 · November 8, 2018, 2:57pm

Problem resolved: I am now using another instance running Ubuntu instead of Debian. This instance is running on OpenStack, so it was not a hardware problem. I don't know why the instance with the Debian image couldn't use more than one CPU, while the Ubuntu instance is running very well.

saisimo02 · November 15, 2018, 9:04am

Hello, just for info, I found the problem. I was using an old kernel version; after I upgraded it, Logstash was able to use more than one CPU. However, systemd-journal was still using 90-100% of one CPU even after the upgrade. There was another problem in my journald.conf; I edited it and now my ELK stack is working very well.

system · December 13, 2018, 9:08am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.