Packetbeat Rare DNS Questions ML Job Customization

willemdh · September 4, 2020, 1:44pm

Hello,

I have an issue with the Packetbeat rare DNS question ML job: it generates quite a few anomalies because our hosts frequently contact *.avqs.mcafee.com URLs, which contain a random part. For example:

8m6-0.13-0.800.17d4.25eb.27e3.0.0.166z6e233qvcvbebe5f2wi76.avqs.mcafee.com

These anomalies are picked up by SIEM, and since a SIEM ML detection has no way to filter them, I will need to tune or filter the ML job itself.

The query used in the ml job is:

{"bool":{"filter":[{"term":{"event.dataset":"dns"}},{"term":{"agent.type":"packetbeat"}}],"must_not":[{"bool":{"filter":{"term":{"dns.question.type":"PTR"}}}}]}}

So I'd like to discuss what would be the best long-term and flexible solution, so that I can exclude certain domains when needed without having to rebuild the ML job.

Some possible solutions:

I could filter out *.avqs.mcafee.com in dns.question.name in the ML datafeed query.
Even better (so I don't have to use an expensive leading wildcard query), I could filter out mcafee.com in dns.question.registered_domain.

But both options above would require me to stop the datafeed and the job and then update the datafeed query, which is not really user-friendly.
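
For reference, the second option could be sketched as follows. This is a minimal sketch, not a tested procedure: it only builds the request bodies and prints them, the datafeed ID is hypothetical, and it assumes dns.question.registered_domain is indexed as a keyword field so that a terms clause matches it exactly.

```python
import copy
import json

# Query copied from the job's datafeed, as quoted above.
ORIGINAL_QUERY = {
    "bool": {
        "filter": [
            {"term": {"event.dataset": "dns"}},
            {"term": {"agent.type": "packetbeat"}},
        ],
        "must_not": [
            {"bool": {"filter": {"term": {"dns.question.type": "PTR"}}}}
        ],
    }
}


def exclude_registered_domains(query, domains):
    """Return a copy of `query` with an extra must_not clause for `domains`."""
    updated = copy.deepcopy(query)
    updated["bool"]["must_not"].append(
        {"terms": {"dns.question.registered_domain": sorted(domains)}}
    )
    return updated


def datafeed_update_requests(datafeed_id, query):
    """List the (method, path, body) calls: stop, update, then start again."""
    base = f"_ml/datafeeds/{datafeed_id}"
    return [
        ("POST", f"{base}/_stop", None),
        ("POST", f"{base}/_update", {"query": query}),
        ("POST", f"{base}/_start", None),
    ]


new_query = exclude_registered_domains(ORIGINAL_QUERY, ["mcafee.com"])
# "datafeed-packetbeat_rare_dns_question" is a hypothetical datafeed ID.
for method, path, body in datafeed_update_requests(
    "datafeed-packetbeat_rare_dns_question", new_query
):
    print(method, path, json.dumps(body) if body else "")
```

The printed requests can be pasted into Kibana Dev Tools or sent with any HTTP client. Every new domain still means another stop, update, and start cycle, which is the usability problem described above.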

Ideally I'd love to use a whitelist filter list like this:

But dns.question.registered_domain is not available as a scope option. Feedback on how to filter dynamically on dns.question.registered_domain is welcome.
Or is my only option to update the datafeed query in the ml job?
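
To make the filter list idea concrete, here is a rough sketch of the two request bodies involved. It assumes the job's detector uses dns.question.name as its by-field (a rule can only be scoped to a field the detector actually uses, which would explain why dns.question.registered_domain is not offered). The filter ID, description, and job ID placeholder are illustrative, and the leading `*` relies on filter items accepting a wildcard at the start or end of an item.

```python
import json

FILTER_ID = "whitelist_server_domain"  # illustrative filter list ID
SCOPE_FIELD = "dns.question.name"      # assumption: the detector's by-field

# Body for PUT _ml/filters/<filter_id>: the list of values to ignore.
filter_body = {
    "description": "Domains to ignore in the rare DNS question job",
    "items": ["*.avqs.mcafee.com"],
}

# Body for POST _ml/anomaly_detectors/<job_id>/_update: attach a custom rule
# to the first detector that skips results whose scope field is in the filter.
job_update_body = {
    "detectors": [
        {
            "detector_index": 0,
            "custom_rules": [
                {
                    "actions": ["skip_result"],
                    "scope": {
                        SCOPE_FIELD: {
                            "filter_id": FILTER_ID,
                            "filter_type": "include",
                        }
                    },
                }
            ],
        }
    ]
}

print(f"PUT _ml/filters/{FILTER_ID}")
print(json.dumps(filter_body, indent=2))
print("POST _ml/anomaly_detectors/<job_id>/_update")
print(json.dumps(job_update_body, indent=2))
```

With this layout, adding a domain later only means updating the filter list items; the rule and the datafeed query stay untouched.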

Grtz

Willem

Frank_Hassanabad · September 8, 2020, 3:15pm

Have you used the filter lists under Machine Learning settings? They might help with what you're trying to do. I haven't used them directly myself, but I hear good things about them from others. Note that they filter those things out before the anomalies are produced, but for a lot of people that's exactly what they're aiming for.

willemdh · September 9, 2020, 2:37pm

Currently on holiday, but I'll definitely investigate the filter list capabilities further. Thanks.

willemdh · September 23, 2020, 7:49am

So I tried to use the filter list, but it doesn't seem to work as expected.

whitelist_server_domain contains

But I still encounter anomalies with *.mcafee.com URLs.

Am I missing something?

richcollier · September 24, 2020, 4:17pm

Looks like a good reason to open a support ticket

willemdh · September 24, 2020, 8:50pm

@richcollier Ticket 00614098 has been created. Grtz

willemdh · September 29, 2020, 10:31am

@richcollier Just an FYI, I stumbled on this: https://github.com/elastic/elasticsearch/issues/62948

After closing / reopening the job, it works.
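
The close and reopen workaround can be sketched as a sequence of ML API calls. This is only an outline under assumptions: both IDs are hypothetical, and the datafeed is stopped first because a job cannot normally be closed while its datafeed is still running.

```python
def reopen_job_requests(job_id, datafeed_id):
    """Return the (method, path) calls to close and reopen a job, in order."""
    return [
        ("POST", f"_ml/datafeeds/{datafeed_id}/_stop"),
        ("POST", f"_ml/anomaly_detectors/{job_id}/_close"),
        ("POST", f"_ml/anomaly_detectors/{job_id}/_open"),
        ("POST", f"_ml/datafeeds/{datafeed_id}/_start"),
    ]


# Both IDs below are hypothetical placeholders.
for method, path in reopen_job_requests(
    "packetbeat_rare_dns_question", "datafeed-packetbeat_rare_dns_question"
):
    print(method, path)
```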

While working on this, I came up with some additional questions.

Is it possible to configure a rule for an ML job before the job has been started, for example at creation time or while editing? I'm asking because I created a new job from scratch and want to prevent internal URLs and other known domains that should be whitelisted from 'polluting' my ML model.

Afaik this is not possible yet. Is this already on Elastic's to-do list? If not, should I open a GitHub issue for it?

Greetings.
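
As a possible route for the creation-time question: the create anomaly detection job API accepts custom_rules inside each detector, so a rule can be included in the job configuration before any data is analyzed, even where the UI does not offer it. The sketch below is an assumption-heavy illustration, not the prebuilt job's configuration: the job ID, bucket span, by-field, and influencer are hypothetical, and the referenced filter list must already exist.

```python
import json


def rare_dns_job_config(filter_id, scope_field="dns.question.name"):
    """Build a body for PUT _ml/anomaly_detectors/<job_id> with a rule attached."""
    return {
        "description": "Rare DNS questions, ignoring whitelisted domains",
        "analysis_config": {
            "bucket_span": "15m",  # hypothetical value
            "detectors": [
                {
                    "function": "rare",
                    "by_field_name": scope_field,
                    "custom_rules": [
                        {
                            # skip_model_update keeps matching values out of
                            # the model; skip_result suppresses their results.
                            "actions": ["skip_result", "skip_model_update"],
                            "scope": {
                                scope_field: {
                                    "filter_id": filter_id,
                                    "filter_type": "include",
                                }
                            },
                        }
                    ],
                }
            ],
            "influencers": ["host.name"],  # hypothetical influencer
        },
        "data_description": {"time_field": "@timestamp"},
    }


# "my_rare_dns_job" is a hypothetical job ID.
print("PUT _ml/anomaly_detectors/my_rare_dns_job")
print(json.dumps(rare_dns_job_config("whitelist_server_domain"), indent=2))
```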

system · October 27, 2020, 10:31am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
