0% found this document useful (0 votes)

39 views21 pages

Comparing Two ROC Curves-Independent Groups Design

This document outlines a procedure for comparing two ROC curves derived from independent groups, providing various statistical tests and confidence intervals related to the area under the ROC curve (AUC). It discusses both empirical and binormal ROC curve estimation methods, detailing how to interpret ROC curves in the context of diagnostic testing. Additionally, it explains the significance of AUC as a measure of diagnostic accuracy and the methods for estimating it.

Uploaded by

haqinaam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views21 pages

Comparing Two ROC Curves-Independent Groups Design

Uploaded by

haqinaam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

NCSS Statistical Software NCSS.

com

Chapter 548

Comparing Two ROC

Curves – Independent
Groups Design
Introduction
This procedure is used to compare two ROC curves generated from data from two independent groups. In
addition to producing a wide range of cutoff value summary rates for each group, this procedure produces
difference tests, equivalence tests, non-inferiority tests, and confidence intervals for the difference in the area
under the ROC curve. This procedure includes analyses for both empirical (nonparametric) and Binormal ROC
curve estimation.

548-1
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Comparing Two ROC Curves – Independent Groups Design

Discussion and Technical Details

Although ROC curve analysis can be used for a variety of applications across a number of research fields, we will
examine ROC curves through the lens of diagnostic testing. In a typical diagnostic test, each unit (e.g., individual
or patient) is measured on some scale or given a score with the intent that the measurement or score will be useful
in classifying the unit into one of two conditions (e.g., Positive / Negative, Yes / No, Diseased / Non-diseased).
Based on a (hopefully large) number of individuals for which the score and condition is known, researchers may
use ROC curve analysis to determine the ability of the score to classify or predict the condition.

ROC Curve and Cutoff Analysis for each Diagnostic Test

The details of the many summary measures and rates for each cutoff value are discussed in the chapter One ROC
Curve and Cutoff Analysis. We invite the reader to go to that chapter for details on classification tables, as well as
true positive rate (sensitivity), true negative rate (specificity), false negative rate (miss rate), false positive rate
(fall-out), positive predictive value (precision), negative predictive value, false omission rate, false discovery rate,
prevalence, proportion correctly classified (accuracy), proportion incorrectly classified, Youden index, sensitivity
plus specificity, distance to corner, positive likelihood ratio, negative likelihood ratio, diagnostic odds ratio, and
cost analysis for each cutoff value.
The One ROC Curve and Cutoff Analysis chapter also contains details about finding the optimal cutoff value, as
well as hypothesis tests and confidence intervals for individual areas under the ROC curve.

ROC Curves
A receiver operating characteristic (ROC) curve plots the true positive rate (sensitivity) against the false positive
rate (1 – specificity) for all possible cutoff values. General discussions of ROC curves can be found in Altman
(1991), Swets (1996), Zhou et al. (2002), and Krzanowski and Hand (2009). Gehlbach (1988) provides an
example of its use.
Two types of ROC curves can be generated in NCSS: the empirical ROC curve and the binormal ROC curve.

Empirical ROC Curve

The empirical ROC curve is the more common version of the ROC curve. The empirical ROC curve is a plot of
the true positive rate versus the false positive rate for all possible cut-off values.

548-2
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Comparing Two ROC Curves – Independent Groups Design

That is, each point on the ROC curve represents a different cutoff value. The points are connected to form the
curve. Cutoff values that result in low false-positive rates tend to result low true-positive rates as well. As the
true-positive rate increases, the false positive rate increases. The better the diagnostic test, the more quickly the
true positive rate nears 1 (or 100%). A near-perfect diagnostic test would have an ROC curve that is almost
vertical from (0,0) to (0,1) and then horizontal to (1,1). The diagonal line serves as a reference line since it is the
ROC curve of a diagnostic test that randomly classifies the condition.

Binormal ROC Curve

The Binormal ROC curve is based on the assumption that the diagnostic test scores corresponding to the positive
condition and the scores corresponding to the negative condition can each be represented by a Normal
distribution. To estimate the Binormal ROC curve, the sample mean and sample standard deviation are estimated
from the known positive group, and again for the known negative group. These sample means and sample
standard deviations are used to specify two Normal distributions. The Binormal ROC curve is then generated
from the two Normal distributions. When the two Normal distributions closely overlap, the Binormal ROC curve
is closer to the 45-degree diagonal line. When the two Normal distributions overlap only in the tails, the Binormal
ROC curve has a much greater distance from the 45-degree diagonal line.

It is recommended that researchers identify whether the scores for the positive and negative groups need to be
transformed to more closely follow the Normal distribution before using the Binormal ROC Curve methods.

Area under the ROC Curve (AUC)

The area under an ROC curve (AUC) is a popular measure of the accuracy of a diagnostic test. In general, higher
AUC values indicate better test performance. The possible values of AUC range from 0.5 (no diagnostic ability)
to 1.0 (perfect diagnostic ability).
The AUC has a physical interpretation. The AUC is the probability that the criterion value of an individual drawn
at random from the population of those with a positive condition is larger than the criterion value of another
individual drawn at random from the population of those where the condition is negative.
Another interpretation of AUC is the average true positive rate (average sensitivity) across all possible false
positive rates.
Two methods are commonly used to estimate the AUC. One method is the empirical (nonparametric) method by
DeLong et al. (1988). This method has become popular because it does not make the strong normality

548-3
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Comparing Two ROC Curves – Independent Groups Design

assumptions that the Binormal method makes. The other method is the Binormal method presented by Metz
(1978) and McClish (1989). This method results in a smooth ROC curve from which the complete (and partial)
AUC may be calculated.

AUC of an Empirical ROC Curve

The empirical (nonparametric) method by DeLong et al. (1988) is a popular method for computing the AUC. This
method has become popular because it does not make the strong Normality assumptions that the Binormal method
makes.
The value of AUC using the empirical method is calculated by summing the area of the trapezoids that are formed
below the connected points making up the ROC curve. From DeLong et al. (1988), define the T1 component of the
ith subject, V(T1i) as
𝑛𝑛0
1
𝑉𝑉(𝑇𝑇1𝑖𝑖 ) = � Ψ�𝑇𝑇1𝑖𝑖 , 𝑇𝑇0𝑗𝑗 �
𝑛𝑛0
𝑗𝑗=1
th
and define the T0 component of the j subject, V(T0j) as
𝑛𝑛1
1
𝑉𝑉�𝑇𝑇0𝑗𝑗 � = � Ψ�𝑇𝑇1𝑖𝑖 , 𝑇𝑇0𝑗𝑗 �
𝑛𝑛1
𝑖𝑖=1

where
Ψ(𝑋𝑋, 𝑌𝑌) = 0 if 𝑌𝑌 > 𝑋𝑋,
Ψ(𝑋𝑋, 𝑌𝑌) = 1/2 if 𝑌𝑌 = 𝑋𝑋,
Ψ(𝑋𝑋, 𝑌𝑌) = 1 if 𝑌𝑌 < 𝑋𝑋
The empirical AUC is estimated as
𝑛𝑛1 𝑛𝑛0

𝐴𝐴𝐸𝐸𝐸𝐸𝐸𝐸 = � 𝑉𝑉(𝑇𝑇1𝑖𝑖 )/𝑛𝑛1 = � 𝑉𝑉�𝑇𝑇0𝑗𝑗 �/𝑛𝑛0

𝑖𝑖=1 𝑗𝑗=1

The variance of the estimated AUC is estimated as

1 2 1
𝑉𝑉�𝐴𝐴𝐸𝐸𝐸𝐸𝐸𝐸 � = 𝑆𝑆𝑇𝑇1 + 𝑆𝑆𝑇𝑇20
𝑛𝑛1 𝑛𝑛0
where 𝑆𝑆𝑇𝑇21 and 𝑆𝑆𝑇𝑇20 are the variances
1 𝑛𝑛 2
𝑆𝑆𝑇𝑇2𝑖𝑖 = 𝑛𝑛 −1 ∑𝑖𝑖=1
𝑖𝑖
�𝑉𝑉(𝑇𝑇1𝑖𝑖 ) − 𝐴𝐴𝐸𝐸𝐸𝐸𝐸𝐸 � , 𝑖𝑖 = 0,1
𝑖𝑖

548-4
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Comparing Two ROC Curves – Independent Groups Design

AUC of a Binormal ROC Curve

The formulas that we use here come from McClish (1989). Suppose there are two populations, one made up of
individuals with the condition being positive and the other made up of individuals with the negative condition.
Further, suppose that the value of a criterion variable is available for all individuals. Let X refer to the value of the
criterion variable in the negative population and Y refer to the value of the criterion variable in the positive
population. The binormal model assumes that both X and Y are normally distributed with different means and
variances. That is,

( )
X ~ N µx , σ x2 , Y ~ N µ y , σ y2 ( )
The ROC curve is traced out by the function
    µ y − c  
{FP(c), TP(c)} = Φ  µσ− c  , Φ x
 , − ∞ < c < ∞
 x  σ y  

where Φ ( z ) is the cumulative normal distribution function.

The area under the whole ROC curve is

∞
A = ∫ TP(c) FP' (c) dc
−∞
∞   µy − c   µ − c  
= ∫− ∞ Φ  σ y  φ  xσ x  dc

 
 a 
= Φ 2 
 1+ b 
where
µ y − µx ∆ σ
a= = , b = x , ∆ = µ y − µx
σy σy σy
The area under a portion of the AUC curve is given by
c2

A = ∫ TP(c) FP' (c) dc

1   µy − c   µ − c  
c1

σ x c∫2   σ y   σ x  
= Φ  φ x  dc

The partial area under an ROC curve is usually defined in terms of a range of false-positive rates rather than the
criterion limits c1 and c2 . However, the one-to-one relationship between these two quantities, given by

ci = µx + σ xΦ −1 ( FPi )
allows the criterion limits to be calculated from desired false-positive rates.
The MLE of A is found by substituting the MLE’s of the means and variances into the above expression and using
numerical integration. When the area under the whole curve is desired, these formulas reduce to
 a 
A = Φ  
 1 + b 
2

548-5
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Comparing Two ROC Curves – Independent Groups Design

Note that for ease of reading we will often omit the use of the hat to indicate an MLE in the following.

The variance of A is derived using the method of differentials as

2
 ∂A 
2
 ∂A   ∂A 
() ()
2

V A =   V ∆ +  2  V sx2 +  2  V s 2y
 ∂∆   ∂σ x   ∂σ y 
( ) ( )
where
∂A
=
E
[Φ (c~1 ) − Φ (c~0 )]
∂∆ 2π (1 + b )σ y
2 2

∂A
2
=
E
∂σ x 4π (1 + b )σ xσ y
2
e− k − e− k − [ abE0 1

2σ xσ y 2π (1 + b )
]
2 3/ 2
[Φ (c~1 ) − Φ (c~0 )]
 a2 
E = exp − 
 2 1 + b2
 ( ) 


∂A a  ∂A  2  ∂A 
=−   −b  2
∂σ y
2
2σ y  ∂∆   ∂σ x 

 
) ( )
ab
c~i = Φ −1 ( FPi ) +  1 + b2
 1 + b2 ( 

c~i 2
ki =
2
σ σ
()
2 2

V ∆ = x + y
nx n y

2σ x4
( )
V sx2 =
nx − 1

2σ y4
( )
Vs = 2
y
ny − 1

548-6
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Comparing Two ROC Curves – Independent Groups Design

Comparing the AUC of Independent Sample ROC Curves

Comparing ROC curves may be done using either the empirical (nonparametric) methods described by DeLong
(1988) or the Binormal model methods as described in McClish (1989).

Comparing Independent Sample AUCs based on Empirical ROC Curve Estimation

Following Zhou et al. (2002) page 185, a z-test may be used for comparing AUC of two diagnostic tests in a
paired design
A1 − A2
z=
V( A1 − A2 )
where
V( A1 − A2 ) = V( A1 ) + V( A2 ) − 2Cov( A1 , A2 )
Each Variance is defined as
S Tk 1 S Tk 0
V( Ak ) = +
nk 1 nk 0
where
2

[( ) ]
n
1
∑ V Tkij − Ak , k = 1,2 i = 0,1
ki

STki =
nki − 1 j =1
n
V(Tk1i ) =
1
(
∑ψ Tk1i , Tk 0 j , k = 1,2 )
k0

nk 0 − 1 j = 1
n

( ) 1
(
∑ψ Tk1i , Tk 0 j , k = 1,2 )
k1

V Tk 0 j =
nk 1 − 1 i = 1
nk 0

∑ V(T ) ∑ V(T )
nk1

k 1i k0 j
j =1
Ak = i =1
= , k = 1,2
nk 1 nk 0

0 if Y > X

ψ ( X , Y ) =  12 if Y = X
1 if Y < X


Here 𝑇𝑇𝑘𝑘0𝑗𝑗 represents the observed diagnostic test result for the jth subject in group k without the condition and
𝑇𝑇𝑘𝑘1𝑗𝑗 represents the observed diagnostic test result for the jth subject in group k with the condition.

548-7
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Comparing Two ROC Curves – Independent Groups Design

Comparing Independent Sample AUCs based on Binormal ROC Curve Estimation

When the binormal assumption is viable, the hypothesis that the areas under the two ROC curves are equal may
be tested using
A1 − A2
z=
V( A1 − A2 )
where
𝑉𝑉(𝐴𝐴1 − 𝐴𝐴2 ) = 𝑉𝑉(𝐴𝐴1 ) + 𝑉𝑉(𝐴𝐴2 )
where V( A1 ) and V( A2 ) are calculated using the formula for V( A) given above in the section on a single
Binormal ROC curve.
McClish (1989) ran simulations to study the accuracy of the normality approximation of the above z statistic for
various portions of the AUC curve. She found that a logistic-type transformation resulted in a z statistic that was
closer to normality. This transformation is

 FP − FP1 + A 
θ ( A) = ln  2 
 FP2 − FP1 − A 
which has the inverse version
eθ − 1
A = ( FP2 − FP1 )
eθ + 1
The variance of this quantity is given by

 2( FP2 − FP1 ) 
2

V(θ ) =   V( A)
2
 ( FP2 − FP1 ) − A 
2

The adjusted z statistic is

𝜃𝜃1 − 𝜃𝜃2 𝜃𝜃1 − 𝜃𝜃2
𝑧𝑧 = =
�𝑉𝑉(𝜃𝜃1 − 𝜃𝜃2 ) �𝑉𝑉(𝜃𝜃1 ) + 𝑉𝑉(𝜃𝜃2 )

Data Structure
The data are entered in three columns. One column specifies the true condition of the individual. Another column
contains the criterion values. The third column defines the groups.

Criterion Groups dataset

Condition Score Group
0 1 1
0 3 1
0 4 1
1 7 1
0 4 1
0 5 1
1 9 1
. . .
. . .
. . .

548-8
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Comparing Two ROC Curves – Independent Groups Design

ROC Plot Format Window Options

This section describes some of the options available on the ROC Plot Format window, which is displayed when
the ROC Plot Chart Format button is clicked. Common options, such as axes, labels, legends, and titles are
documented in the Graphics Components chapter.

ROC Plot Tab

Empirical ROC Line Section

You can specify the format of the empirical ROC curve lines using the options in this section.

Binormal ROC Line Section

You can specify the format of the Binormal ROC curves lines using the options in this section.

548-9
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Comparing Two ROC Curves – Independent Groups Design

Symbols Section
You can modify the attributes of the symbols using the options in this section.

Reference Line Section

You can modify the attributes of the 45º reference line using the options in this section.

Titles, Legend, Numeric Axis, Group Axis, Grid Lines, and Background
Tabs
Details on setting the options in these tabs are given in the Graphics Components chapter.

548-10
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Comparing Two ROC Curves – Independent Groups Design

Example 1 – Comparing Two ROC Curves

This section presents an example of a producing a statistical comparison of two ROC curves using a Z-test. In the
Criterion Groups dataset, a 1 for Condition indicates the condition is present, while a 0 indicates the condition is
absent. It is anticipated that higher Score values are associated with the condition being present.

Setup
To run this example, complete the following steps:

1 Open the Criterion Groups example dataset

• From the File menu of the NCSS Data window, select Open Example Data.
• Select Criterion Groups and click OK.

2 Specify the Comparing Two ROC Curves – Independent Groups Design procedure options
• Find and open the Comparing Two ROC Curves – Independent Groups Design procedure using the
menus or the Procedure Navigator.
• The settings for this example are listed below and are stored in the Example 1 settings template. To load
this template, click Open Example Template in the Help Center or File menu.

Option Value
Variables Tab
Condition Variable .................................. Condition
Positive Condition Value ......................... 1
Criterion Variable .................................... Score
Criterion Direction ................................... Higher values indicate a Positive Condition
Group Variable........................................ Group
Cutoff Reports Tab
Cutoff Value List ..................................... Data
Counts, TPR (Sensitivity), ...................... Checked
TNR (Specificity), PPV, Accuracy,
TPR + TNR, Prevalence
All Other Reports .................................... Unchecked
AUC Reports Tab
Area Under Curve (AUC) Analysis ......... Checked
(Empirical Estimation)
Test Comparing Two AUCs .................... Checked
(Empirical Estimation)
Confidence Intervals for Comparing ....... Checked
Two AUCs (Empirical Estimation)
All Other Reports .................................... Unchecked

3 Run the procedure

• Click the Run button to perform the calculations and generate the output.

548-11
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Comparing Two ROC Curves – Independent Groups Design

Common Rates and Indices for each Cutoff Value

Common Rates and Indices for each Cutoff Value ──────────────────────────────────────
Criterion Variable: Score when Group = 1
Estimated Prevalence = 19 / 50 = 0.3800
Estimated Prevalence is the proportion of the sample with a positive condition of 1, or (A + C) / (A + B + C + D) for
all cutoff values. The estimated prevalence should only be used as a valid estimate of the population prevalence
when the entire sample is a random sample of the population.

────── Table Counts ──────

Cutoff TPs FPs FNs TNs TPR TNR Accur- TPR +
Value A B C D (Sens.) (Spec.) PPV acy TNR
≥ 1.00 19 31 0 0 1.0000 0.0000 0.3800 0.3800 1.0000
≥ 2.00 19 28 0 3 1.0000 0.0968 0.4043 0.4400 1.0968
≥ 3.00 18 24 1 7 0.9474 0.2258 0.4286 0.5000 1.1732
≥ 4.00 17 19 2 12 0.8947 0.3871 0.4722 0.5800 1.2818
≥ 5.00 14 12 5 19 0.7368 0.6129 0.5385 0.6600 1.3497
≥ 6.00 12 9 7 22 0.6316 0.7097 0.5714 0.6800 1.3413
≥ 7.00 11 4 8 27 0.5789 0.8710 0.7333 0.7600 1.4499
≥ 8.00 8 2 11 29 0.4211 0.9355 0.8000 0.7400 1.3565
≥ 9.00 5 1 14 30 0.2632 0.9677 0.8333 0.7000 1.2309
≥ 10.00 2 1 17 30 0.1053 0.9677 0.6667 0.6400 1.0730

Common Rates and Indices for each Cutoff Value

Criterion Variable: Score when Group = 2
Estimated Prevalence = 28 / 60 = 0.4667
Estimated Prevalence is the proportion of the sample with a positive condition of 1, or (A + C) / (A + B + C + D) for
all cutoff values. The estimated prevalence should only be used as a valid estimate of the population prevalence
when the entire sample is a random sample of the population.

────── Table Counts ──────

Cutoff TPs FPs FNs TNs TPR TNR Accur- TPR +
Value A B C D (Sens.) (Spec.) PPV acy TNR
≥ 1.00 28 32 0 0 1.0000 0.0000 0.4667 0.4667 1.0000
≥ 2.00 28 25 0 7 1.0000 0.2188 0.5283 0.5833 1.2188
≥ 3.00 28 18 0 14 1.0000 0.4375 0.6087 0.7000 1.4375
≥ 4.00 28 13 0 19 1.0000 0.5938 0.6829 0.7833 1.5938
≥ 5.00 27 6 1 26 0.9643 0.8125 0.8182 0.8833 1.7768
≥ 6.00 21 5 7 27 0.7500 0.8438 0.8077 0.8000 1.5938
≥ 7.00 18 2 10 30 0.6429 0.9375 0.9000 0.8000 1.5804
≥ 8.00 15 1 13 31 0.5357 0.9688 0.9375 0.7667 1.5045
≥ 9.00 7 0 21 32 0.2500 1.0000 1.0000 0.6500 1.2500
≥ 10.00 2 0 26 32 0.0714 1.0000 1.0000 0.5667 1.0714

Definitions:
Cutoff Value indicates the criterion value range that predicts a positive condition.
A is the number of True Positives.
B is the number of False Positives.
C is the number of False Negatives.
D is the number of True Negatives.
TPR is the True Positive Rate or Sensitivity = A / (A + C).
TNR is the True Negative Rate or Specificity = D / (B + D).
PPV is the Positive Predictive Value or Precision = A / (A + B).
Accuracy is the Proportion Correctly Classified = (A + D) / (A + B + C + D).
TPR + TNR is the Sensitivity + Specificity.

The report displays, for each group, some of the more commonly used rates for each cutoff value.

548-12
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Comparing Two ROC Curves – Independent Groups Design

Area Under Curve Analysis (Empirical Estimation)

Area Under Curve Analysis (Empirical Estimation) ──────────────────────────────────────
Estimated Prevalence (1) = 19 / 50 = 0.3800
Estimated Prevalence (2) = 28 / 60 = 0.4667
Estimated Prevalence is the proportion of the sample with a positive condition of 1. The estimated prevalence
should only be used as a valid estimate of the population prevalence when the entire sample is a random sample
of the population.

Z-Value Upper
Standard to Test 1-Sided 95% Confidence Limits
Group Count AUC Error AUC > 0.5 P-Value Lower Upper
1 50 0.7640 0.0710 3.720 0.0001 0.5860 0.8717
2 60 0.9314 0.0304 14.172 0.0000 0.8392 0.9715

Definitions:
Group is the Criterion group label.
Count is the number of the individuals used in the analysis.
AUC is the area under the ROC curve using the empirical (trapezoidal) approach.
Standard Error is the standard error of the AUC estimate.
Z-Value is the Z-score for testing the designated hypothesis test.
P-Value is the probability level associated with the Z-Value.
The Lower and Upper Confidence Limits form the confidence interval for AUC.

This report gives statistical tests comparing the area under the curve to the value 0.5. The small P-values indicate
a significant difference from 0.5 for both groups. The report also gives the 95% confidence interval for each
estimated AUC.

Test Comparing Two AUCs (Empirical Estimation)

Test Comparing Two AUCs (Empirical Estimation) ──────────────────────────────────────
H0: AUC1 = AUC2
H1: AUC1 ≠ AUC2
Total Sample Size: 110

Group Variable: Group

Difference Difference Difference

Group 1 Group 2 AUC1 AUC2 AUC1 - AUC2 Std Error Percent Z-Value P-Value
1 2 0.7640 0.9314 -0.1674 0.0772 21.905 -2.167 0.0302

This report gives a two-sided statistical test comparing the area under the curve of Group 1 to the area under the
curve of Group 2. The small P-value indicates a significant difference between the AUCs.

548-13
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Comparing Two ROC Curves – Independent Groups Design

Confidence Intervals for Comparing Two AUCs (Empirical Estimation)

Confidence Intervals for Comparing Two AUCs (Empirical Estimation) ────────────────────────
Total Sample Size: 110

Group Variable: Group

Difference Difference 95% Confidence Limits

Group 1 Group 2 AUC1 AUC2 AUC1 - AUC2 Std Error Lower Upper
1 2 0.7640 0.9314 -0.1674 0.0772 -0.3187 -0.0160

This report provide the confidence interval for the difference of the area under the curve of Group 1 and the area
under the curve of Group 2.

ROC Plot Section

ROC Plot Section ───────────────────────────────────────────────────────────

The plot can be made to contain the empirical ROC curve, the Binormal ROC curve, or both, by making the proper
selection after clicking the ROC Plot Format button.

The coordinates of the points of the ROC curves are the TPR and FPR for each of the unique Score values. The
diagonal (45 degree) line is an ROC curve of random classification, and serves as a baseline. Each ROC curve
shows the overall ability of using the score to classify the condition. The Group 2 curve appears to show better
classification ability and the Group 1 curve.

Example 2 – Comparing Two ROC Curves using Binormal

Estimation
This section presents an example of a producing a statistical comparison of two ROC curves using Binormal
estimation methods. The dataset used is the Criterion Groups dataset.

Setup
To run this example, complete the following steps:

1 Open the Criterion Groups example dataset

• From the File menu of the NCSS Data window, select Open Example Data.
• Select Criterion Groups and click OK.

2 Specify the Comparing Two ROC Curves – Independent Groups Design procedure options
• Find and open the Comparing Two ROC Curves – Independent Groups Design procedure using the
menus or the Procedure Navigator.
• The settings for this example are listed below and are stored in the Example 2 settings template. To load
this template, click Open Example Template in the Help Center or File menu.

Option Value
Variables Tab
Condition Variable .................................. Condition
Positive Condition Value ......................... 1
Criterion Variable .................................... Score
Criterion Direction ................................... Higher values indicate a Positive Condition
Group Variable........................................ Group
AUC Reports Tab
Area Under Curve (AUC) ........................ Checked
Analysis (Binormal Estimation)
Test Comparing Two AUCs .................... Checked
(Binormal Estimation)
Confidence Intervals for Comparing ....... Checked
Two AUCs (Binormal Estimation)
Plots Tab
ROC Plot ................................................. Checked
ROC Plot Format (Click the Button)
Empirical ROC Line ............................. Checked
Binormal ROC Line ............................. Checked

3 Run the procedure

• Click the Run button to perform the calculations and generate the output.

Area Under Curve Analysis (Binormal Estimation)

Area Under Curve Analysis (Binormal Estimation) ──────────────────────────────────────
Estimated Prevalence (1) = 19 / 50 = 0.3800
Estimated Prevalence (2) = 28 / 60 = 0.4667
Estimated Prevalence is the proportion of the sample with a positive condition of 1. The estimated prevalence
should only be used as a valid estimate of the population prevalence when the entire sample is a random sample
of the population.

Z-Value Upper
Standard to Test 1-Sided 95% Confidence Limits
Group Count AUC Error AUC > 0.5 P-Value Lower Upper
1 50 0.7654 0.0686 3.868 0.0001 0.5944 0.8702
2 60 0.9411 0.0274 16.106 0.0000 0.8560 0.9765

Definitions:
Group is the Criterion group label.
Count is the number of the individuals used in the analysis.
AUC is the area under the ROC curve using the Binormal estimation approach.
Standard Error is the standard error of the AUC estimate.
Z-Value is the Z-score for testing the designated hypothesis test.
P-Value is the probability level associated with the Z-Value.
The Lower and Upper Confidence Limits form the confidence interval for AUC.

This report gives a statistical test comparing the area under the curve to the value 0.5 for each group. The small P-
values indicate a significant difference from 0.5 for both groups. The report also gives the 95% confidence
interval for each estimated AUC.

Test Comparing Two AUCs (Binormal Estimation)

Test Comparing Two AUCs (Binormal Estimation) ──────────────────────────────────────
H0: AUC1 = AUC2
H1: AUC1 ≠ AUC2
Total Sample Size: 110

Group Variable: Group

Difference Difference Difference

Group 1 Group 2 AUC1 AUC2 AUC1 - AUC2 Std Error Percent Z-Value P-Value
1 2 0.7654 0.9411 -0.1757 0.0739 22.953 -2.536 0.0112

Confidence Intervals for Comparing Two AUCs (Binormal Estimation)

Confidence Intervals for Comparing Two AUCs (Binormal Estimation) ─────────────────────────
Total Sample Size: 110

Group Variable: Group

Difference Difference 95% Confidence Limits

Group 1 Group 2 AUC1 AUC2 AUC1 - AUC2 Std Error Lower Upper
1 2 0.7654 0.9411 -0.1757 0.0739 -0.3205 -0.0309

This report provides the confidence interval for the difference of the area under the curve of Group 1 and the area
under the curve of Group 2.

ROC Plot Section

ROC Plot Section ───────────────────────────────────────────────────────────

The Binormal estimation ROC plot is a smooth curve estimation of the true ROC curves. The diagonal (45
degree) line is an ROC curve of random classification, and serves as a baseline. The Binormal estimation ROC
plot and the empirical estimation ROC plot can be superimposed in one plot using the plot format button:

ROC Plot Section ───────────────────────────────────────────────────────────

Example 3 – Equivalence Test for Two AUCs

This section presents an example of testing the equivalence of two areas under the ROC curve. Suppose
researchers wish to show that a new, less expensive classification method works equally well to that of the current
method. The equivalence margin is set at 0.15. The dataset used is the Disease Diagnosis dataset.

Setup
To run this example, complete the following steps:

1 Open the Disease Diagnosis example dataset

• From the File menu of the NCSS Data window, select Open Example Data.
• Select Disease Diagnosis and click OK.

2 Specify the Comparing Two ROC Curves – Independent Groups Design procedure options
• Find and open the Comparing Two ROC Curves – Independent Groups Design procedure using the
menus or the Procedure Navigator.
• The settings for this example are listed below and are stored in the Example 3 settings template. To load
this template, click Open Example Template in the Help Center or File menu.

Option Value
Variables Tab
Condition Variable .................................. Disease
Positive Condition Value ......................... Yes
Criterion Variable .................................... Score
Criterion Direction ................................... Higher values indicate a Positive Condition
Group Variable........................................ Method
AUC Reports Tab
Area Under Curve (AUC) Analysis ......... Checked
(Empirical Estimation)
Equivalence Test for Two AUCs............. Checked
(Empirical Estimation)
Lower Equivalence Bound ...................... -0.15
Upper Equivalence Bound ...................... 0.15

3 Run the procedure

• Click the Run button to perform the calculations and generate the output.

Area Under Curve Analysis (Empirical Estimation)

Area Under Curve Analysis (Empirical Estimation) ──────────────────────────────────────
Estimated Prevalence (Current) = 10 / 40 = 0.2500
Estimated Prevalence (New) = 9 / 40 = 0.2250
Estimated Prevalence is the proportion of the sample with a positive condition of YES. The estimated prevalence
should only be used as a valid estimate of the population prevalence when the entire sample is a random sample
of the population.

Z-Value Upper
Standard to Test 1-Sided 95% Confidence Limits
Method Count AUC Error AUC > 0.5 P-Value Lower Upper
Current 40 0.8883 0.0534 7.268 0.0000 0.7246 0.9571
New 40 0.8710 0.0578 6.416 0.0000 0.7002 0.9475

Equivalence Test for Two AUCs (Empirical Estimation)

Equivalence Test for Two AUCs (Empirical Estimation) ───────────────────────────────────
Lower Equivalence Bound (LEB): -0.1500
Upper Equivalence Bound (UEB): 0.1500
H0: AUC1 - AUC2 ≤ -0.1500 or AUC1 - AUC2 ≥ 0.1500
H1: -0.1500 < AUC1 - AUC2 < 0.1500
Total Sample Size: 80

Group Variable: Method

── Two One-Sided Tests ──

Difference Lower Upper Equiv. ─── 90% C. I. ─── Conclusion
Group 1 Group 2 AUC1 - AUC2 P-Value P-Value P-Value Lower Upper (α = 0.05)
Current New 0.0174 0.0168 0.0460 0.0460 -0.1121 0.1469 Reject H0

Definitions:
Group 1 is the category of the Group Variable assigned to Group 1.
Group 2 is the category of the Group Variable assigned to Group 2.
AUC1 is the calculated area under the ROC curve for Group 1.
AUC2 is the calculated area under the ROC curve for Group 2.
Difference (AUC1 - AUC2) is the simple difference AUC1 minus AUC2.
Lower P-Value is the P-value for testing H0: AUC1 - AUC2 ≤ LEB vs. H1: AUC1 - AUC2 > LEB.
Upper P-Value is the P-value for testing H0: AUC1 - AUC2 ≥ UEB vs. H1: AUC1 - AUC2 < UEB.
Equivalence P-Value is the P-Value for testing overall equivalence. It is the larger of the Lower and Upper
P-Values.
If the Equivalence P-Value is less than α, H0 is rejected and equivalence may be concluded.
The Lower and Upper Confidence Limits form the equivalence confidence interval. The confidence level
(100 * (1 - 2α)) corresponds to a test based on α. If the equivalence confidence interval is inside the equivalence
bounds, H0 is rejected and equivalence may be concluded.
Conclusion is the determination concerning H0, based on the Equivalence P-Value (or the equivalence confidence
interval).

The Equivalence P-value indicates evidence that the two areas under the curve are equal. Also, the 90%
confidence interval is contained by the equivalence bounds.

ROC Plot Section

ROC Plot Section ───────────────────────────────────────────────────────────

The ROC plot shows the similarity of the two areas under the ROC curve.

Curvas ROC
No ratings yet
Curvas ROC
2 pages
Understanding ROC Curves in Diagnostics
No ratings yet
Understanding ROC Curves in Diagnostics
3 pages
Tests For Two ROC Curves: PASS Sample Size Software
No ratings yet
Tests For Two ROC Curves: PASS Sample Size Software
16 pages
Confidence Intervals For The Area Under An ROC Curve
No ratings yet
Confidence Intervals For The Area Under An ROC Curve
7 pages
AUC and Concordance
No ratings yet
AUC and Concordance
16 pages
Kja 21209
No ratings yet
Kja 21209
12 pages
ROC Curve
No ratings yet
ROC Curve
20 pages
Flach Roc Analysis
No ratings yet
Flach Roc Analysis
12 pages
Habibzadeh 2016
No ratings yet
Habibzadeh 2016
11 pages
Predicting Cesarean with ROC Curve
No ratings yet
Predicting Cesarean with ROC Curve
9 pages
ROC Curve and Interpretation. For Medical Statistics. Research Analysis
No ratings yet
ROC Curve and Interpretation. For Medical Statistics. Research Analysis
3 pages
Diagnostic Test: Magdalena Sidhartani
No ratings yet
Diagnostic Test: Magdalena Sidhartani
17 pages
International Biometric Society
No ratings yet
International Biometric Society
10 pages
13-Module 5 - ROC Curve Analysis - Introduction and Motivation-26-09-2023
No ratings yet
13-Module 5 - ROC Curve Analysis - Introduction and Motivation-26-09-2023
8 pages
Journal of Statistical Software: Plotroc: A Tool For Plotting Roc Curves
No ratings yet
Journal of Statistical Software: Plotroc: A Tool For Plotting Roc Curves
19 pages
Lecture 7
No ratings yet
Lecture 7
26 pages
PROS - Ivanna Kristianti T - Predicting Receiver Operating Characteristic - Fulltext
No ratings yet
PROS - Ivanna Kristianti T - Predicting Receiver Operating Characteristic - Fulltext
5 pages
Measurement Validity Explained
No ratings yet
Measurement Validity Explained
12 pages
Comparing The Areas Under Two or More Correlated Receiver Operating Characteristic Curves A Nonparametric Approach
No ratings yet
Comparing The Areas Under Two or More Correlated Receiver Operating Characteristic Curves A Nonparametric Approach
10 pages
83 ROCCurves
No ratings yet
83 ROCCurves
9 pages
A Study On The Bi-Rayleigh ROC Curve Model
No ratings yet
A Study On The Bi-Rayleigh ROC Curve Model
6 pages
One ROC Curve and Cutoff Analysis
No ratings yet
One ROC Curve and Cutoff Analysis
30 pages
Reciever Operating Characeteristics
No ratings yet
Reciever Operating Characeteristics
2 pages
BM28 1 010101
No ratings yet
BM28 1 010101
10 pages
AUC ROC Curve for ML Enthusiasts
No ratings yet
AUC ROC Curve for ML Enthusiasts
5 pages
Auc Roc Curve Machine Learning
No ratings yet
Auc Roc Curve Machine Learning
12 pages
The ROC Curve
No ratings yet
The ROC Curve
5 pages
Guide To Interprate ROC Analysis
No ratings yet
Guide To Interprate ROC Analysis
4 pages
ICA and Curve ROC.: P Erez Mart Inez Luis Alberto March 4, 2024
No ratings yet
ICA and Curve ROC.: P Erez Mart Inez Luis Alberto March 4, 2024
5 pages
ROC Curve for Medical Research
No ratings yet
ROC Curve for Medical Research
16 pages
ROC Covariado
No ratings yet
ROC Covariado
20 pages
Guide To AUC ROC Curve in Machine Learning
No ratings yet
Guide To AUC ROC Curve in Machine Learning
10 pages
UNSTUCK Summary ROC Curves
No ratings yet
UNSTUCK Summary ROC Curves
4 pages
Module 5 ML
No ratings yet
Module 5 ML
12 pages
Evolution of Disgnostic Testing
No ratings yet
Evolution of Disgnostic Testing
41 pages
Progress Assesment (ROV Curve and AUC)
No ratings yet
Progress Assesment (ROV Curve and AUC)
2 pages
Roc Curve in Python
No ratings yet
Roc Curve in Python
58 pages
The Optimization Problem
No ratings yet
The Optimization Problem
45 pages
Logistic Regression:: PGP Dse Bangalore July 2018
No ratings yet
Logistic Regression:: PGP Dse Bangalore July 2018
62 pages
ROC Curve Guide for Researchers
No ratings yet
ROC Curve Guide for Researchers
11 pages
Ca 3 Merged
No ratings yet
Ca 3 Merged
275 pages
Bradley PR97 PDF
No ratings yet
Bradley PR97 PDF
15 pages
Test Accuracy: Sensitivity & Specificity
No ratings yet
Test Accuracy: Sensitivity & Specificity
26 pages
PARODI - Not Proper ROC Curves As New Tool For The Analysis of Differentially Expressed Genes in Microarray Experiments
No ratings yet
PARODI - Not Proper ROC Curves As New Tool For The Analysis of Differentially Expressed Genes in Microarray Experiments
30 pages
Seminar Maschinellem Lernen: An Improved Model Selection Heuristic For AUC
No ratings yet
Seminar Maschinellem Lernen: An Improved Model Selection Heuristic For AUC
19 pages
pROC: An Open-Source Package For R and S+ To Analyze and Compare ROC Curves
No ratings yet
pROC: An Open-Source Package For R and S+ To Analyze and Compare ROC Curves
8 pages
Lecture11evaluationmetricsforclassification 240913060639 0c766554
No ratings yet
Lecture11evaluationmetricsforclassification 240913060639 0c766554
28 pages
Binary Classification Machine Learning Models
No ratings yet
Binary Classification Machine Learning Models
4 pages
Adjusting For Covariate Effects On Classification Accuracy Using The Covariate-Adjusted Receiver Operating Characteristic Curve
No ratings yet
Adjusting For Covariate Effects On Classification Accuracy Using The Covariate-Adjusted Receiver Operating Characteristic Curve
12 pages
Statistics in Medicine - 2005 - Walter - The Partial Area Under The Summary ROC Curve
No ratings yet
Statistics in Medicine - 2005 - Walter - The Partial Area Under The Summary ROC Curve
16 pages
QFR rp181
No ratings yet
QFR rp181
32 pages
ROC Auc
No ratings yet
ROC Auc
5 pages
Kohl PerformanceMeasures2012
No ratings yet
Kohl PerformanceMeasures2012
4 pages
Understanding Receiver Operating Characteristic Roc Curves
No ratings yet
Understanding Receiver Operating Characteristic Roc Curves
2 pages
Logistic Regression With R
No ratings yet
Logistic Regression With R
58 pages
MPC 006 5yr Sol - Pyq
No ratings yet
MPC 006 5yr Sol - Pyq
60 pages
Chapter 15
No ratings yet
Chapter 15
32 pages
Journal of Statistical Software: Thresholdroc: Optimum Threshold Estimation
No ratings yet
Journal of Statistical Software: Thresholdroc: Optimum Threshold Estimation
21 pages
Loboetal 2008 AUCymodelosdedistribucin GEB
No ratings yet
Loboetal 2008 AUCymodelosdedistribucin GEB
8 pages
Chapter 3
No ratings yet
Chapter 3
12 pages
Business Research Testbank
No ratings yet
Business Research Testbank
3 pages
Taking The Q
No ratings yet
Taking The Q
30 pages
Regression Analysis for Academics
No ratings yet
Regression Analysis for Academics
96 pages
Final-Term I-2425
No ratings yet
Final-Term I-2425
4 pages
GMATH Correlation Analysis
No ratings yet
GMATH Correlation Analysis
3 pages
Biostatistics - I
No ratings yet
Biostatistics - I
46 pages
RM Chap 3
No ratings yet
RM Chap 3
5 pages
Sur15 1 Sol
No ratings yet
Sur15 1 Sol
17 pages
A View of The Nature of Design Research - Archer
No ratings yet
A View of The Nature of Design Research - Archer
18 pages
Jackson Problem Solution 6 17
100% (3)
Jackson Problem Solution 6 17
2 pages
Part 1 - Basic Research Method
100% (2)
Part 1 - Basic Research Method
222 pages
MDSC 508 Thesis Proposal Guide
No ratings yet
MDSC 508 Thesis Proposal Guide
2 pages
Engineering Stats for Students
No ratings yet
Engineering Stats for Students
7 pages
History Classroom Engagement
No ratings yet
History Classroom Engagement
20 pages
The Exact Sciences
No ratings yet
The Exact Sciences
5 pages
Data Collection Method
100% (1)
Data Collection Method
21 pages
At The End of This Lesson, The Students Will Be Able To
No ratings yet
At The End of This Lesson, The Students Will Be Able To
18 pages
Grounded Theory in Medical Education Research: AMEE Guide No. 70
No ratings yet
Grounded Theory in Medical Education Research: AMEE Guide No. 70
12 pages
Sociology - Unit 2 - Primary Research Methods Sample - Case Studies
No ratings yet
Sociology - Unit 2 - Primary Research Methods Sample - Case Studies
2 pages
Article Review Guidelines 2
No ratings yet
Article Review Guidelines 2
2 pages
Data Collection
No ratings yet
Data Collection
3 pages
Senior R&D Director Application
100% (1)
Senior R&D Director Application
1 page
Six Methods for Sample Size Justification
No ratings yet
Six Methods for Sample Size Justification
31 pages
Goode and Hatt
90% (10)
Goode and Hatt
10 pages
The Whole Art of Detection - by Sherlock Holmes
100% (2)
The Whole Art of Detection - by Sherlock Holmes
55 pages
Soriano Research Work
No ratings yet
Soriano Research Work
16 pages
Name: Aditya Parade Roll No: 281047 PRN: 22311577 Batch: A-2 Assignment 6
No ratings yet
Name: Aditya Parade Roll No: 281047 PRN: 22311577 Batch: A-2 Assignment 6
2 pages
ANOVA Guide for Statisticians
No ratings yet
ANOVA Guide for Statisticians
12 pages
Jurnal Pendidikan: Oleh
No ratings yet
Jurnal Pendidikan: Oleh
8 pages

Comparing Two ROC Curves-Independent Groups Design

Uploaded by

Comparing Two ROC Curves-Independent Groups Design

Uploaded by

NCSS Statistical Software NCSS.

Comparing Two ROC

Discussion and Technical Details

ROC Curve and Cutoff Analysis for each Diagnostic Test

Empirical ROC Curve

Binormal ROC Curve

Area under the ROC Curve (AUC)

AUC of an Empirical ROC Curve

𝐴𝐴𝐸𝐸𝐸𝐸𝐸𝐸 = � 𝑉𝑉(𝑇𝑇1𝑖𝑖 )/𝑛𝑛1 = � 𝑉𝑉�𝑇𝑇0𝑗𝑗 �/𝑛𝑛0

The variance of the estimated AUC is estimated as

AUC of a Binormal ROC Curve

where Φ ( z ) is the cumulative normal distribution function.

The area under the whole ROC curve is

A = ∫ TP(c) FP' (c) dc

The variance of A is derived using the method of differentials as

Comparing the AUC of Independent Sample ROC Curves

Comparing Independent Sample AUCs based on Empirical ROC Curve Estimation

Comparing Independent Sample AUCs based on Binormal ROC Curve Estimation

The adjusted z statistic is

Criterion Groups dataset

ROC Plot Format Window Options

ROC Plot Tab

Empirical ROC Line Section

Binormal ROC Line Section

Reference Line Section

Example 1 – Comparing Two ROC Curves

1 Open the Criterion Groups example dataset

3 Run the procedure

Common Rates and Indices for each Cutoff Value

────── Table Counts ──────

Common Rates and Indices for each Cutoff Value

────── Table Counts ──────

Area Under Curve Analysis (Empirical Estimation)

Test Comparing Two AUCs (Empirical Estimation)

Group Variable: Group

Difference Difference Difference

Confidence Intervals for Comparing Two AUCs (Empirical Estimation)

Group Variable: Group

Difference Difference 95% Confidence Limits

ROC Plot Section

Example 2 – Comparing Two ROC Curves using Binormal

1 Open the Criterion Groups example dataset

3 Run the procedure

Area Under Curve Analysis (Binormal Estimation)

Test Comparing Two AUCs (Binormal Estimation)

Group Variable: Group

Difference Difference Difference

Confidence Intervals for Comparing Two AUCs (Binormal Estimation)

Group Variable: Group

Difference Difference 95% Confidence Limits

ROC Plot Section

ROC Plot Section ───────────────────────────────────────────────────────────

Example 3 – Equivalence Test for Two AUCs

1 Open the Disease Diagnosis example dataset

3 Run the procedure

Area Under Curve Analysis (Empirical Estimation)

Equivalence Test for Two AUCs (Empirical Estimation)

Group Variable: Method

── Two One-Sided Tests ──

ROC Plot Section

You might also like