I am trying to use Filebeat to extract lines that contain a particular string and send those lines to Elasticsearch (or Logstash).
I can use either "include_lines" or "exclude_lines" to grab the relevant lines and send them. The problem is that using either option causes the CPU on the Filebeat machine (Windows Server 2008) to climb above 70% and stay there for as long as the logs are being generated.
The logs are application specific and are generated at a rate of roughly one 50 MB log file every 10-12 minutes while the application is running. Each log file contains ~600,000 lines, of which only around 200-300 match the "include_lines" string.
If I run Filebeat without the "include_lines" match and just send everything, it happily runs at about 2-4% CPU and under 30 MB of memory. But Logstash at the far end crashes under the strain of all those messages in no time at all (even though every message that does not match my string is dropped immediately).
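For context, a prospector configured this way would look something like the sketch below (the path, input type, and match string are illustrative placeholders, not the actual values, and the exact config layout depends on the Filebeat version):

```yaml
filebeat.prospectors:
  - input_type: log
    paths:
      - 'C:\app\logs\*.log'
    # include_lines takes a list of regular expressions;
    # only lines matching at least one of them are shipped.
    include_lines: ['MY_STRING']
```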
Is this normal?
I would be very grateful for any suggestions for how this could be tuned for better performance.
I think the regex matcher in Filebeat matches any substring of a line by default. That is, if you want to match the beginning of a line you have to use ^, and $ for the end of a line. This makes leading and trailing .* patterns effectively no-ops, but the engine still has to execute them (with longest-match semantics by default).
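Concretely, the suggestion amounts to dropping the redundant wildcards (the pattern shown is a placeholder):

```yaml
    # Patterns are matched as substrings by default, so wrapping
    # a literal in .* adds work without changing what matches:
    # include_lines: ['.*MY_STRING.*']   # slow: the .* still executes
    include_lines: ['MY_STRING']         # same matches, less work
    # To require the match at the start of a line, anchor explicitly:
    # include_lines: ['^MY_STRING']
```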
Thank you for the reply.
I just tried changing the regex as you described and restarted the service. CPU usage for the Filebeat process touched 81% at the beginning, then settled down to about 60% after about a minute.