Create a machine learning job with aggregation

TheHunter1 · October 28, 2020, 2:40pm

Hello everybody,

So I just began with machine learning jobs and I wanna create a job to detect port scans.
I wanna aggregate data by source.ip and then by destination.ip and finally count the number of destination.port
Could you tell me how can I make an aggregation in machine learning jobs !

Thanks.

richcollier · October 28, 2020, 3:30pm

Well, to answer your question, information about how to use an elasticsearch query aggregation as part of your ML job can be found here: https://www.elastic.co/guide/en/machine-learning/7.9/ml-configuring-aggregation.html

However, you likely have a very high cardinality of IP addresses. May I suggest that you instead use Population Analysis and configure something like the following:

detector: distinct_count(destination.port) over destination.ip
influencers: destination.ip, source.ip

The population analysis will effectively ease the burden on the high-cardinality destination IP field and the source IP as an influencer will only get analyzed if there's an anomaly on the distinct count, as defined by the detector.

TheHunter1 · October 28, 2020, 9:30pm

Thanks a lot for your reply and for your advices

system · November 25, 2020, 9:30pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
[Population Job]: port scanner Elasticsearch elastic-stack-machine-learning	3	909	February 10, 2021
How to create machine learning job to compare two counts? Elasticsearch elastic-stack-machine-learning	3	830	September 13, 2017
Question on how to create a simple ML job Elasticsearch elastic-stack-machine-learning	12	1114	October 29, 2018
How to solve hard and soft limit machine learning jobs Elasticsearch elastic-stack-machine-learning	3	1821	April 29, 2021
Help: Create multi metric machine learning job Elasticsearch elastic-stack-machine-learning	5	905	January 4, 2021

Create a machine learning job with aggregation

Related topics