Any classification procedure may make mistakes. In a binary classification, two kinds of errors may be distinguished. These have different terms in different disciplines but the underlyig concept is the same;
- In Statistics : Type 1 /Type 2 errors or alpha and beta errors
- In Testing: false positive, false negative
- In Information Retrieval : Recall and Precision
True | False | |
---|---|---|
Positive | True Positive (TP) | False Positive -Type 1 Error (FP) |
Negative | False Negative : Type 2 error (FN) | True Negative (TN) |
Examples of such propositions might be:
- The person has malaria
- The email is spam
- The word is spelt correctly
- The article is relevant
- The accused is guilty
- The traveller is a terrorist - FIA watchlist
- The student is of honours quality
- Efficiency = (FP + FN) / (TP + FP + TN + FN) = FP + FN/ All
- Precision = TP/ (TP + FP) = TP / Positives
- Recall = TP / (TP + FN) = TP / True
The costs of these two types of errors will be seen differently by different stakeholders in a system, leading to inherent conflict in systems design.
•
No comments:
Post a Comment