Professional Documents
Culture Documents
Contribution
We develop an evaluation method which:
extends the ROC to include membership scores
allows the visualization of individual scores
depicts the combined performance of
classification, ranking and scoring
Consider what information can be obtained from
testing a given learning method.
Learning Tasks
Low
Information
Content
High
Information
Content
Prediction Outcomes
Learning Tasks
Classification
Low
Information
Content Labels
High
Information
Content
Prediction Outcomes
Learning Tasks
Ordinal
Classification
Classification
Low
Information
Content Labels
High
Information
Content
Labels
Prediction Outcomes
Learning Tasks
Ordinal Ranking
Classification
Classification
Low
Information
Content Labels
Labels An order
on instances
Prediction Outcomes
High
Information
Content
Learning Tasks
Ordinal Ranking
Classification
Classification
Low
Information
Content Labels
Labels An order
on instances
Prediction Outcomes
Probability
Estimation
High
Information
Content
Probabilities
Learning Tasks
Ordinal Ranking
Classification
Classification
Low
Information
Content Labels
Probability
Estimation
Scoring
Labels An order
on instances
Scores
High
Information
Content
Probabilities
Prediction Outcomes
Motivation
With scores, one can:
compare classifications in terms of decisions,
ranking, and scores (confidence)
visualize the margins of scores
find gaps in scores
Applications
Comparing user preferences
Assessing relevance scores in search engines
Magnitude-preserving ranking (Cortes et. al ICML07)
Research Tool (PET / DT / Nave Bayes)
Bioinformatics (gene expression)
Jan
Methodology
+
L
+
H
H
smTPR =
smFPR =
smAUC
Experiment