PROS - Ivanna Kristianti T - Predicting Receiver Operating Characteristic - Fulltext
PROS - Ivanna Kristianti T - Predicting Receiver Operating Characteristic - Fulltext
Abstract-Measuring the performance of a classifier is an arithmetic means of accuracies (Ameans) [2] based on the
:ssential step in building a classification method for a two class distributions of the data samples.
:lassification problem. The Receiver Operating Characteristic
035
results are similar with the empirical results using the testing
data set. Therefore the methods are applicable in predicting the
classifier performance without conducting an experiment. The 005
fraction (FPF) is defined as: probability density function of the feature samples. Also if a
classifier is based on a parameter calculated from the feature
samples, fÿx) and f/>(x) are the probability density function of
number of negatives incorrectly classified the parameter. Given a decision threshold, th, the predicted TN
FP rate = -- & J- (2)
__
number of total negatives rate, FP rate, TP rate, and FN rate are calculated by the
following equations.
In a two-class classifier using a decision threshold each ,
value of decision threshold will give a pair of TP rate and FP Predicted TN rate = FN(th) (5)
rate, then consequently will give a point in the ROC space. By
conducting experiments using several decision thresholds, the
classifier will produce several points in the ROC space. These
points can be connected to produce a curve which is called an Predicted FP rate = 1- Predicted TN rate (6)
ROC curve.
The Area Under ROC Curve (AUC) is one possible method By changing the decision threshold, th, several points in the
to compare classifiers based on their ROC curves [4] [6]. Each ROC space can be generated. Thus, an ROC curve can be
ROC curve produces one value of AUC. The range of AUC is predicted. By knowing the ROC curve, the predicted AUC of
between 0 and 1 since AUC is a portion of ROC space having the ROC curve can be calculated. The Ameans of each point in
the ROC curve and the maximum value of the Ameans can be
an area of a unit square.
calculated as well.
III. Arithmetic Means of Accuracies One possible method to choose the best decision threshold
in a classifier is by observing the point in ROC curve which
The arithmetic means of accuracies (Ameans) is one
has the maximum value of Ameans. This paper proposes a
possible method in comparing classifiers between several method of choosing this decision threshold, tÿ ,, so that the
points in the ROC space [2]. A non-weighted Ameans for a maximum value.of Ameans can be calculated by the following
two-class classification is defined as:
equation.
Ameans =-(TP rate + TN rate) (3) Predicted Max Ameans = (FN (tbes,) + (1 - FP (tbes,)) (9)
where
The method of choosing the decision threshold, //*,„, starts
with predicting that the highest Ameans is obtained if the t/,es, is
TN rate = 1 - FP rate set at the intersection of the negative and positive probability
density functions as shown in the following equation.
_
number of negatives correctly classified (4)
number of total negatives
/»(0 = /,(0 (10)
The value of Ameans is ranging between 0 and 1. Ameans
can be used to measure the performance of classifiers that can In the case that fÿ(x) is a Gaussian probability density
only create one point in the ROC space. A better classifier is a function [7] having mean mN and standard deviation cfy, and
classifier producing a higher Ameans. fiix) is a Gaussian probability density function having mean mi>
and standard deviation ar, the best decision threshold, /(*,„ can
be calculated using the following equation.
61
2 predicted maximum Ameans is similar with the experiment
m
P
-
a, =al-ai (12)
1 2 i
c, = mNap - mpcrN -
(17) The effects of the data sample mean and standard deviation
to the predicted ROC curve, AUC, and maximum Ameans are
shown in Fig. 3 and Fig. 4. The circles shown in Fig. 3 and Fig.
v Experiments and Results
4 depict the points in ROC space having the maximum Ameans
at the associated ROC curves. The intersection appeared in Fig.
.
The experiments are conducted by generating Gaussian 4 indicates that there is a point with equal performace resulted
iistributed random data. The class distributions have mN = 4 , from different values of standard deviation and different values
nP = 8, aN = 3, and o> = 2. The first experiment generates 1000 of decision threshold, th.
data for each class. The second experiment used an imbalanced
lata set (1000 data for the positive class and 500 data for the
legative class). By changing the decision threshold the ,
62
third feature which has the highest average difference between
the classes.
09
.
08
.
TABLE II. The statistical value of the second and third class
IN IRIS DATA SET
07
.
Class
04
.
Average of the 3"' 6 59.
2 97 5 55
.
2 03.
.
Class
03
.
Average difference
f
of the 2 d and 3
" '
0 65.
0 20
.
1 29
.
0 70.
02
. class
Standard deviation
0 1 0 52 0 31 0 47 0 20
of the 2 d class
.
" . . . .
Standard deviation
0 0 64 0 32 0 55 0 27
of the 3 J class
' . . . .
Standard
The ROC curve, AUC, and Ameans significantly depend Distance Average Deviation
on the mean and standard deviation of the data samples.
2"d class to the average of 2 J
"
class 0 71 0 34
Commonly, the data samples with higher mean difference and . .
The next experiments are done using the second and third 2"J class to the average of 3 class
"'
1 70
.
0 60
.
class of Iris flower data set [8]. Each class of the Iris data set "'
contains 50 data. The data set consists of information about the 3,d class to the average of 3 class 0 82
.
0 45
.
IV and Fig. 5. In Fig. 5 the predicted ROC curves from the first Third feature (petal length) 96.22% 89.74%
feature until the fourth feature are respectively denoted by
ROC1, ROC2, ROC3, and ROC4. The predicted points with Fourth feature (petal width) 97.93% 93.20%
.
..
.
-
J
between the data and the average of the second and third class. 09
.
ÿx _ -
ÿ ÿ -
/ -
ROC1
j y
in Table IV and Fig. 5. In Fig. 5 the predicted ROC curves 06
.
: / ' / O Ameansl
.
. * Ameans3
1 / /
I , / .
ROC4
points having the maximum value of Ameans assosiated with 03
.
"
i
1 /
/ V Ameans4
the ROC curves are denoted respectively by Ameansc2 and 02
t r ROCc2
/
.
Ameansc2
Ameansc3 in Fig. 5.
01
.
-
ROCc3
ÿ Ameansc3
The results in Table IV and Fig. 5 give us information 0
about the discriminability of the features. The highest 0 0.1 0.2 0.3 0.4 0.5
FP Rate
0.6 0.7 0.8 09 1
63
VI. Conclusions [2] I. K. Timotius and S. G. Miaou, "Arithmetic means of accuracies: a
"
The predicted performace measurements might be useful as [4] T. Fawcett, "An introduction to ROC analysis," Pattern Recognition
tools for feature sample discriminability examination. This is Letters, vol. 27 issue 8, pp. 861 - 874, 2006.
,
for the reason that the measurements depend on not simply by [5] F. Provost and T. Fawcett, "Analysis and visualization of classifier
"
the mean difference between classes , but also the standard performance: comparison under imprecise class and cost distributions,
Proc. of the Third Int. Conf. in Knowledge Discovery and Data Mining,
deviation within classes (if the feature samples are assumed to
pp. 43-48, 1997.
have Gaussian distributions). In the future, we will conduct
[6] C. X. Ling, J. Huang, and H. Zhang, "AUC: a statistically consistent and
further research on the feature discriminability examination. more discriminating measure than accuracy, Proc. of Int Joint Conf. on
"
64