Unusual Process For a Windows Host (rare_process_by_host_windows_ecs)

ARDiver86 · June 25, 2021, 3:03pm

This job appears to be looking at a list of process create events to determine if a process is new or existed previously. The issue I think we are having is it is alerting us on a lot of processes that existed previously because the service/process has been running for an extended period of time (I.E. the server hasn't been restarted for a while).

I think to resolve this issue the timeframe the job compares the data with should be extended but I'm not sure how to adjust this.

So my question is:

Is the model snapshot retention days (default to 10) basically saying it will compare the previous 10 days? I read the articles but having a difficult time understanding. In my case, if those processes aren't restarted every 10 days then it is going to alert about it being a "new" process when it actually isn't. I would like to expand it to 30 days if that is what that value is.

probson · June 25, 2021, 3:07pm

Similar issue to us, even had it with things such as excel.exe. User only uses's it once a month or so and it flags

We find the ML jobs useful inline with other detections to maybe highlight something going on

ARDiver86 · June 25, 2021, 3:55pm

Luckily we can add exclusions on the machine learning job section but figured there is a better way. I'm still confused on the timeframe it is using. Is it comparing to all documents in the time range you select when starting the job or is it only comparing it to the 10 model snapshots?

ARDiver86 · June 30, 2021, 4:15pm

I cloned the job and made some changes to the retention days and also started putting in a bunch of exclusions. I guess that is all we can do but I wish there was a way it could monitor running processes and not just when they started from the sysmon logs

sophie_chang · July 1, 2021, 8:18am

Changing the model snapshot retention days will not alter how data is modelled or the anomalies found.

Model snapshots are a "point in time" copy of the model. Consider them a bit like a backup. A snapshot is stored periodically to disk and used in the event of the job being restarted or moving due to a node failure or reverted. Changing the model snapshot retention days, changes the length of time for which old versions of the model are kept as backup.

The job analyses data from a starting point, going forwards in time. You can see the value of earliest_record_timestamp in the ML UI by expanding the row in the Job List and looking at the Counts tab. I'm not familiar with this particular job and am not sure if running processes are being logged or if it is just when a process is started, so I cannot comment on the results. However more information on how rare processes are modelled is described here. Detecting rare and unusual processes with OOTB machine learning | Elastic Blog

Hope this helps answer part of your question - concerning whether or not changing model snapshot retention days will change the results - unfortunately, it does not.

system · July 29, 2021, 8:18am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Export model Elasticsearch elastic-stack-machine-learning	6	552	July 19, 2019
ML jobs Elasticsearch elastic-stack-machine-learning	10	635	November 4, 2022
Gaps in machine learning data when reverting to snapshot with high query delay Elasticsearch	9	1134	December 8, 2017
Dec 7th, 2019 [EN] Looking behind the scenes of anomaly detector models Advent Calendar elastic-stack-machine-learning	1	2167	November 4, 2022
Does applying custom rules on machine learning jobs altert the ML model? Elasticsearch elastic-stack-machine-learning	6	457	October 13, 2021

Unusual Process For a Windows Host (rare_process_by_host_windows_ecs)

Related topics