Hi All,
I'm attempting to find a way to find anomalies within some data, but I'm having some trouble understanding the correct way to go about it.
I want to effectively determine the weight of an anomaly by comparing the data against both the entity and the entity's population.
Example:
If I have user login data, I want to detect when a user logins in from a rare source (e.g.: Country), but I want the weight anomaly value to do something like:
- If login source is not rare for the user, and not rare for the population, then its not an anomaly
- If login source is rare for the user, but not rare for the population weight the anomaly "lower"
- If login source is not rare for the user, but rare for the population weight the anomaly "lower"
- If login source is rare for the user, and rare for the population weight the anomaly "higher"
I thought about using a detector like:
rare by "source.geo.country_name" over "user.name"
But my understanding of the above, is that this will only compare the user against the population and not the user against the user.
Is the above a correct understanding? And if it is correct, is it currently possible to achieve what I am looking for?
I'm looking at testing out some Elastic Security (SIEM) stuff, if this helps at all with the context.