How do the influencers work or do they impact on the anomaly score?


(Rockybean) #1

According to Machine Learning Lab2 Section 15, the key field can impact on the final anomaly score like partition field. I'm confused about this. If key field works like partition field, then why does it exist?

Can any body explain how influencers work behind?

The question and answer is as below:

Create a Multi Metric job that only uses the Count but also sets a Keyfield or airline.keyword. Compare the results of this to the previous Single Metric job you created for the farequote data. Both show a single critical anomaly for the same airline on the same date but they have a different Anomaly Score. Why is this?
Show answer:
The Simgle Metric job created a score of 80 and the Multi Metric job using the Keyfield generated one of 98. This is because with the Multi Metric job you compared the airlines to their own past behavior rather than the entire dataset as a whole. When creating Machine Learning jobs you will have to consider if it makes sense to partition or not. And sometimes you'll want a job that does both!


(Rockybean) #2

up

any body can help?


(Nathan Zamecnik) #3

Hello,

Key Fields are "influencers" and when creating a "Multi-metric Job" when you select which field to "split" on, that is the partition field.

When you use a partition field the analysis is performed for each field based on only it's own history - that is, documents with the same value for the partition field.

Influencers are used to determine if values from other fields have a correlation with the anomalies discovered.

In the case of using partition fields with influencers (splitting a field and using key fields in the multi-metric job) you're able to see if certain fields correlate with anomalies in each partition field.


(system) #4

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.