0% found this document useful (0 votes)

231 views16 pages

Tests For Two ROC Curves: PASS Sample Size Software

Uploaded by

Sol Camus

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

231 views16 pages

Tests For Two ROC Curves: PASS Sample Size Software

Uploaded by

Sol Camus

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

PASS Sample Size Software NCSS.

com

Chapter 265

Tests for Two ROC

Curves
Introduction
Receiver operating characteristic (ROC) curves are used to summarize the accuracy of diagnostic tests. The
technique is used when a criterion variable is available which is used to make a yes or no decision. The area under
the ROC curve (AUC) is a popular summary index of an ROC curve.
This module computes power and sample size for comparing the AUC’s of two diagnostic tests obtained from the
same patients. The methodology of Obuchowski and McClish (1997) is used when the criterion variable yields a
discrete value. The methodology of Hanley and McNeil (1983) is used when the criterion variable yields a
continuous value.

Technical Details
In the following, we suppose that we have two groups of patients, those with a condition of interest (the disease)
and those without it. A patient’s classification may be known from extensive diagnosis or based on the value of
another diagnostic test. The diagnostic tests of interest are performed on each patient and the resulting test values
are recorded. At each specified cutoff value of the criterion variable, the true positive rate (TPR) and the false
positive rate (FPR) are calculated. An ROC curve is generating by plotting TPR versus FPR. The plot allows the
consequences of using various cutoff values to be evaluated. The area under the ROC curve, either for the whole
or partial range, is often used as a summary measure of the accuracy of the test.
It should be noted that TPR is similar to the statistical power of the diagnostic test at a particular cutoff value of
the criterion variable. Similarly, FPR is an estimate of the probability that the diagnostic test results in a type I
(alpha) error. Thus the ROC curve may be interpreted as a plot of the diagnostic test’s power versus it’s
significance level at various possible criterion cutoff values.
Users of ROC curves have developed special names for TPR and FPR. They call TPR the sensitivity of the test
and 1 - FPR the specificity of the test. Statisticians will be more familiar with using the word power instead of
sensitivity and the phrase ‘1 - alpha’ instead of specificity.
An ROC curve may be summarized by the area under it (AUC). This area has an additional interpretation.
Suppose that a rater is asked to study two subjects, one that is actually disease positive and one that is disease
negative. The AUC is equal to the probability that the rater will give the disease positive subject a higher score
than the disease negative subject. That is, the AUC is the probability that the rater will correctly order the two
subjects as to which is more likely to have the disease.
Several methods of computing the AUC have been proposed. One method uses the trapezoidal rule to calculate
the AUC directly. Another method, called the binormal model, computes the area by fitting two normal
distributions to the data.

265-1
© NCSS, LLC. All Rights Reserved.
PASS Sample Size Software NCSS.com
Tests for Two ROC Curves

The Binormal Model

Let X denote the distribution of the criterion variable for normal (non-diseased) patients and Y denote the
distribution of the criterion variable for abnormal (diseased) patients. It is assumed that

(
X ~ N µ− , σ −2 )
and

(
Y ~ N µ+ , σ +2 )
The partial area under the ROC curve, AUC, is defined as
c2

θi = ∫ Φ ( Ai + Bi v )φ ( v )dv
c1

(
where Φ( z ) is the cumulative normal distribution, c j = Φ −1 FPR j , and )
µi + − µi −
Ai =
σ i+
σ i−
Bi =
σ i+
Note that for the full range area under the curve, c1 = −∞ and c2 = ∞ .
Maximum likelihood estimates of A and B can be computed. The variances and covariance of these MLE’s can be
estimated from Fisher’s information matrix.
Define ∆ = θ1 − θ2 to be the difference in the accuracies (AUC’s) of the two tests. A test of whether the two AUC’s
are different amounts to testing whether ∆ = 0 . The test statistic for this test is

∆ − 0
Z=
var ∆0 ()
()
 is the variance of ∆ under the null hypothesis of equality. The above test statistic results in the
where var0 ∆
following formula for computing sample size

() ()
2
 z V ∆ + z V ∆ 
 α 0 β Alt

N+ =
∆2

Rating Data
When the criterion values are discrete rating values, Obuchowski and McClish (1997) showed that the variances
could be calculated using

() ( ) ( ) ( )
V0 ∆ = V θ1 + V θ1 − 2C θ1 ,θ1

VAlt(∆ ) = V (θ ) + V (θ ) − 2C(θ ,θ )

1 2 1 2

265-2
© NCSS, LLC. All Rights Reserved.
PASS Sample Size Software NCSS.com
Tests for Two ROC Curves

where

 B 2 A2 
( )  R 2
  1 + R 
V θi = f i 2  1 + i + i  + gi2  Bi2  
  2R  

g1 g 2 B1 B2 (r−2 + Rr+2 ) f1 g 2 A1 B2 r+2

( ) 
C θ1 , θ 2 = f1 f 2  r+ + r−
ˆ ˆ B1 B2
+ r+2 A1 A2 
+ +
 R 2  2R 2
f 2 g1 A2 B1r+2
+
2
E1i E3i
fi =
2πE2 i
E1i E4 i ABE E
gi = − i i 1i 3i
2πE2 i 2πE23i

 Ai2 
E1i = exp − 
 2 + 2 Bi2 

E2 i = 1 + Bi2
E3i = Φ( c2 ) − Φ( c1 )

 c2   c2 
E4 i = exp − 1  − exp − 2 
 2  2

( )
Aj B j
Φ −1 FPR j +
1 + B 2j
cj =
1 + B 2j

N−
R=
N+

Ai = Bi Φ −1 (TNRi ) − Φ −1 ( FPRi )
r− and r+ are the correlations between the results of the two diagnostics tests for normal and abnormal patients,
respectively. For the most conservative results, set Bi = 1 .

Continuous Data
When the criterion values are continuous, Obuchowski (1998) suggests that the following formulas of Hanley and
McNeil (1983) are more appropriate. Note that these formulas cannot be used for evaluating the AUC for a partial
range.

() ( ) ( ) (
V ∆ = V θ1 + V θ2 − 2C θ1 ,θ2 )

265-3
© NCSS, LLC. All Rights Reserved.
PASS Sample Size Software NCSS.com
Tests for Two ROC Curves

where

θi 2θi2
( )
V θi =
R( 2 − θi )
+
1 + θi
 1 + R
− θi2 
 R 


( )
C θ1 ,θ2 = 2r V (θ1 )V (θ2 )

and r is derived from a special table provided by Hanley and McNeil (1983).

Procedure Options
This section describes the options that are specific to this procedure. These are located on the Design tab. For
more information about the options of other tabs, go to the Procedure Window chapter.

Design Tab
The Design tab contains most of the parameters and options that you will be concerned with.

Solve For
Solve For
This option specifies the parameter to be solved for from the other parameters. Under most situations, you will
select either Power or Sample Size (N+).
Select Sample Size (N+) when you want to calculate the sample size needed to achieve a given power and alpha
level.
Select Power when you want to calculate the power of an experiment that has already been run.

Test
Alternative Hypothesis
Specify whether the test is one-sided or two-sided. When a two-sided test is selected, the value of alpha is divided
by two.
Note that most researchers assume that, unless stated otherwise, all statistical tests are two-sided. If you use a one-
sided test, you should clearly state and justify this in all reports.

Power and Alpha

Power
This option specifies one or more values for power. Power is the probability of rejecting a false null hypothesis,
and is equal to one minus Beta. Beta is the probability of a type-II error, which occurs when a false null
hypothesis is not rejected.
Values must be between zero and one. Historically, the value of 0.80 (Beta = 0.20) was used for power. Now,
0.90 (Beta = 0.10) is also commonly used.
A single value may be entered here or a range of values such as 0.8 to 0.95 by 0.05 may be entered.

265-4
© NCSS, LLC. All Rights Reserved.
PASS Sample Size Software NCSS.com
Tests for Two ROC Curves

Alpha
This option specifies one or more values for the probability of a type-I error. A type-I error occurs when a true
null hypothesis is rejected.
Values must be between zero and one. Historically, the value of 0.05 has been used for alpha. This means that
about one test in twenty will falsely reject the null hypothesis. You should pick a value for alpha that represents
the risk of a type-I error you are willing to take in your experimental situation.
You may enter a range of values such as 0.01 0.05 0.10 or 0.01 to 0.10 by 0.01.

Sample Size (When Solving for Sample Size)

Group Allocation
Select the option that describes the constraints on N+ or N- or both.
The options are

• Equal (N+ = N-)

This selection is used when you wish to have equal sample sizes in each group. Since you are solving for both
sample sizes at once, no additional sample size parameters need to be entered.

• Enter N+, solve for N-

Select this option when you wish to fix N+ at some value (or values), and then solve only for N-. Please note
that for some values of N+, there may not be a value of N- that is large enough to obtain the desired power.

• Enter N-, solve for N+

Select this option when you wish to fix N- at some value (or values), and then solve only for N+. Please note
that for some values of N-, there may not be a value of N+ that is large enough to obtain the desired power.

• Enter R = N-/N+, solve for N+ and N-

For this choice, you set a value for the ratio of N- to N+, and then PASS determines the needed N+ and N-,
with this ratio, to obtain the desired power. An equivalent representation of the ratio, R, is
N- = R * N+.

• Enter percentage in Group 1, solve for N+ and N-

For this choice, you set a value for the percentage of the total sample size that is in Group 1, and then PASS
determines the needed N+ and N- with this percentage to obtain the desired power.
N+ (Sample Size, Group 1)
This option is displayed if Group Allocation = “Enter N+, solve for N-”
N+ is the number of items or individuals sampled from the Group 1 population.
N+ must be ≥ 2. You can enter a single value or a series of values.
N- (Sample Size, Group 2)
This option is displayed if Group Allocation = “Enter N-, solve for N+”
N- is the number of items or individuals sampled from the Group 2 population.
N- must be ≥ 2. You can enter a single value or a series of values.

265-5
© NCSS, LLC. All Rights Reserved.
PASS Sample Size Software NCSS.com
Tests for Two ROC Curves

R (Group Sample Size Ratio)

This option is displayed only if Group Allocation = “Enter R = N-/N+, solve for N+ and N-.”
R is the ratio of N- to N+. That is,
R = N- / N+.
Use this value to fix the ratio of N- to N+ while solving for N+ and N-. Only sample size combinations with this
ratio are considered.
N- is related to N+ by the formula:
N- = [R × N+],
where the value [Y] is the next integer ≥ Y.
For example, setting R = 2.0 results in a Group 2 sample size that is double the sample size in Group 1 (e.g., N+ =
10 and N- = 20, or N+ = 50 and N- = 100).
R must be greater than 0. If R < 1, then N- will be less than N+; if R > 1, then N- will be greater than N+. You can
enter a single or a series of values.
Percent in Group 1 (+)
This option is displayed only if Group Allocation = “Enter percentage in Group 1, solve for N+ and N-.”
Use this value to fix the percentage of the total sample size allocated to Group 1 while solving for N+ and N-.
Only sample size combinations with this Group 1 percentage are considered. Small variations from the specified
percentage may occur due to the discrete nature of sample sizes.
The Percent in Group 1 must be greater than 0 and less than 100. You can enter a single or a series of values.

Sample Size (When Not Solving for Sample Size)

Group Allocation
Select the option that describes how individuals in the study will be allocated to Group 1 and to Group 2.
The options are

• Equal (N+ = N-)

This selection is used when you wish to have equal sample sizes in each group. A single per group sample
size will be entered.

• Enter N+ and N- individually

This choice permits you to enter different values for N+ and N-.

• Enter N+ and R, where N- = R * N+

Choose this option to specify a value (or values) for N+, and obtain N- as a ratio (multiple) of N+.

• Enter total sample size and percentage in Group 1

Choose this option to specify a value (or values) for the total sample size (N), obtain N+ as a percentage of N,
and then N- as N - N+.

265-6
© NCSS, LLC. All Rights Reserved.
PASS Sample Size Software NCSS.com
Tests for Two ROC Curves

Sample Size Per Group

This option is displayed only if Group Allocation = “Equal (N+ = N-).”
The Sample Size Per Group is the number of items or individuals sampled from each of the Group 1 and Group 2
populations. Since the sample sizes are the same in each group, this value is the value for N+, and also the value
for N-.
The Sample Size Per Group must be ≥ 2. You can enter a single value or a series of values.
N+ (Sample Size, Group 1)
This option is displayed if Group Allocation = “Enter N+ and N- individually” or “Enter N+ and R, where N- =
R * N+.”
N+ is the number of items or individuals sampled from the Group 1 population.
N+ must be ≥ 2. You can enter a single value or a series of values.
N- (Sample Size, Group 2)
This option is displayed only if Group Allocation = “Enter N+ and N- individually.”
N- is the number of items or individuals sampled from the Group 2 population.
N- must be ≥ 2. You can enter a single value or a series of values.
R (Group Sample Size Ratio)
This option is displayed only if Group Allocation = “Enter N+ and R, where N- = R * N+.”
R is the ratio of N- to N+. That is,
R = N-/N+
Use this value to obtain N- as a multiple (or proportion) of N+.
N- is calculated from N+ using the formula:
N-=[R x N+],
where the value [Y] is the next integer ≥ Y.
For example, setting R = 2.0 results in a Group 2 sample size that is double the sample size in Group 1.
R must be greater than 0. If R < 1, then N- will be less than N+; if R > 1, then N- will be greater than N+. You can
enter a single value or a series of values.
Total Sample Size (N)
This option is displayed only if Group Allocation = “Enter total sample size and percentage in Group 1.”
This is the total sample size, or the sum of the two group sample sizes. This value, along with the percentage of
the total sample size in Group 1, implicitly defines N+ and N-.
The total sample size must be greater than one, but practically, must be greater than 3, since each group sample
size needs to be at least 2.
You can enter a single value or a series of values.
Percent in Group 1 (+)
This option is displayed only if Group Allocation = “Enter total sample size and percentage in Group 1.”
This value fixes the percentage of the total sample size allocated to Group 1. Small variations from the specified
percentage may occur due to the discrete nature of sample sizes.
The Percent in Group 1 must be greater than 0 and less than 100. You can enter a single value or a series of
values.

265-7
© NCSS, LLC. All Rights Reserved.
PASS Sample Size Software NCSS.com
Tests for Two ROC Curves

Effect Size – Area Under the Curve

AUC1 (Area Under Curve 1)
Specify one or more values of the AUC for diagnostic test 1. The range of values is from 0.5 (indicative of a test
useless in diagnosis) to 1.0 (indicative of a test that is perfect in diagnosis).
Since the AUC may include a portion of the ROC curve that is not of interest because the FPR values are
unrealistic, you may be interested in only a portion of the area. In this case, you can specify a range of FPR values
for which the area is to be calculated. Unfortunately, the definition of the area becomes more difficult. When
analyzing the whole ROC curve, the area is known to be between 0.50 and 1.0. Following the suggestion of
Obuchowski and McClish (1997), the following transformation is applied so that the values of AUC remain
between 0.5 and 1.0.
1 AUC − min 
AUC ′ = 1 + 
2  max − min 
where
max = FPR2 − FPR1
max
min = ( FPR2 + FPR1)
2
Thus, when a partial range is entered for FPR1 and FPR2, the values entered here are assumed to be AUC' and are
translated to AUC using the above formulas.
AUC2 (Area Under Curve 2)
Specify one or more values of the AUC for diagnostic test 2. The range of values is from 0.5 (indicative of a test
useless in diagnosis) to 1.0 (indicative of a test that is perfect in diagnosis). Note that, as discussed above, this is
the value of AUC’ when a partial area is being analyzed.

Effect Size – False Positive Rate Limits

Lower FPR
This option specifies the lower (left) limit of the false positive rate (FPR) for which the area is to be computed. If
the area under the whole ROC curve is wanted, set this value to 0.0. If the partial area is wanted, set this value to
the desired left limit.
Note that the range of possible values is 0.0 ≤ Lower FPR < Upper FPR ≤ 1.0
Upper FPR
This option specifies the upper (right) limit of the false positive rate (FPR). If the area under the whole ROC
curve is wanted, set this value to 1.0. If the partial area is wanted, set this value to the desired right limit.
Note that the range of possible values is 0.0 ≤ Lower FPR < Upper FPR ≤ 1.0

Effect Size – Correlations

Correlation+
This is the correlation between the two diagnostic-test scores for the positive group. Although correlations can
range between -1 and 1, typical values are from 0.3 to 0.6.
Note that if you want to analyze a design in which a separate set of patients receive each diagnostic test, this may
be done by setting this correlation value to 0.

265-8
© NCSS, LLC. All Rights Reserved.
PASS Sample Size Software NCSS.com
Tests for Two ROC Curves

Correlation-
This is the correlation between the two diagnostic-test scores for the negative group. Although correlations can
range between -1 and 1, typical values are from 0.3 to 0.6.
Note that if you want to analyze a design in which a separate set of patients receive each diagnostic test, this may
be done by setting this correlation value to 0.

Effect Size – Type of Data

Type of Data
Specify the type of data that will be collected from the tests. The formulas for the variance are determined by this
option. Possible types are:

• Continuous
The test results are from a continuum of possible values. The Hanley and McNeil (1983) variance formulas
are used. Note that this option does not allow a partial range of FPR values to be analyzed.

• Discrete (Ratings)
The test results are from a small set of rating values such as 1, 2, 3, 4, 5. The Obuchowski & McClish (1997)
variance formulas are used.
B1 (SD Ratio)
B1 is the ratio of the standard deviation of the negative group to the positive group (SD-/SD+) for diagnostic test
1. That is, assuming the binormal model
σ 1−
B1 =
σ 1+
Note that this parameter is ignored for continuous data.
Although B1 can be any positive number, typical values are between 0.3 and 3.0. Obuchowski suggests that if the
value of B1 is not known, a value of 1.0 is used since this will result in a conservative (extra large) sample size.
She reports that in her experience, typical values are much less than 1.0, often near 0.3.
B2 (SD Ratio)
B2 is the ratio of the standard deviation of the negative group to the positive group (SD-/SP+) for diagnostic test
2. That is, assuming the binormal model
σ 2−
B2 =
σ 2+
Note that this parameter is ignored for continuous data.
Although B2 can be any positive number, typical values are between 0.3 and 3.0. Obuchowski suggests that if the
value of B2 is not known, a value of 1.0 is used since this will result in a conservative (extra large) sample size.
She reports that in her experience, typical values are much less than 1.0, often near 0.3.

265-9
© NCSS, LLC. All Rights Reserved.
PASS Sample Size Software NCSS.com
Tests for Two ROC Curves

Example 1 – Calculating Power

An investigator wants to compare the accuracy of two diagnostic tests which yield measurements on a rating scale
from 1 to 5. Historically, such tests have had an AUC of 0.80. The investigator wants to investigate three
alternative AUC values: 0.825, 0.850, and 0.900. A two-sided test is planned with a significance level of 0.05.
Historically, both the positive and negative correlations between the responses on two such tests have been close
to 0.60. Since no other information is available, B1 and B2 are both set to 1.0. The investigator would like to
achieve a power of 90% in the study. Patients without the disease under study are about twice as frequent as
patients with the disease. The investigator wants to see results for a sample size of up to 6000 patients.

Setup
This section presents the values of each of the parameters needed to run this example. First, from the PASS Home
window, load the Tests for Two ROC Curves procedure window by clicking on ROC, and then clicking on
Tests for Two ROC Curves. You may then make the appropriate entries as listed below, or open Example 1 by
going to the File menu and choosing Open Example Template.
Option Value
Design Tab
Solve For ................................................ Power
Alternative Hypothesis ............................ Two-Sided Test
Alpha ....................................................... 0.05
Group Allocation ..................................... Enter N+ and R, where N- = R x N+
N+ (Size of Positive Group) .................... 20 50 100 250 500 1000 2000
R (Sample Allocation Ratio) ................... 2
AUC1 (Area Under Curve 1) ................... 0.80
AUC2 (Area Under Curve 2) ................... 0.825 0.85 0.9
Lower FPR .............................................. 0.00
Upper FPR .............................................. 1.00
Correlation+ ............................................ 0.6
Correlation- ............................................. 0.6
Type of Data ........................................... Discrete (Ratings)
B1 (SD Ratio) .......................................... 1
B2 (SD Ratio) .......................................... 1

Annotated Output
Click the Calculate button to perform the calculations and generate the following output.

Numeric Report
Numeric Results for Testing AUC1 = AUC2 with Discrete (Rating) Data
Test Type = Two-Sided. FPR1 = 0.0. FPR2 = 1.0. B1 = 1.000. B2 = 1.000. Allocation Ratio = 2.000.
Target Actual
Power N+ N- N R R AUC1' AUC2' Diff' AUC1 AUC2 Diff Alpha
0.0501 20 40 60 2.0 2.0 0.8000 0.8250 0.0250 0.8000 0.8250 0.0250 0.050
0.0733 50 100 150 2.0 2.0 0.8000 0.8250 0.0250 0.8000 0.8250 0.0250 0.050
0.1084 100 200 300 2.0 2.0 0.8000 0.8250 0.0250 0.8000 0.8250 0.0250 0.050
0.2104 250 500 750 2.0 2.0 0.8000 0.8250 0.0250 0.8000 0.8250 0.0250 0.050
0.3744 500 1000 1500 2.0 2.0 0.8000 0.8250 0.0250 0.8000 0.8250 0.0250 0.050
0.6426 1000 2000 3000 2.0 2.0 0.8000 0.8250 0.0250 0.8000 0.8250 0.0250 0.050
0.9090 2000 4000 6000 2.0 2.0 0.8000 0.8250 0.0250 0.8000 0.8250 0.0250 0.050
(report continues)

Report Definitions
Power is the probability of rejecting a false null hypothesis.
N+ and N- are the number of items sampled from each population.
N is the total sample size, N+ + N-.
Target R is the desired ratio (or ratios) of R entered in the procedure. R is the ratio of N- to N+, so that
N- = R × N+.
Actual R is the value for R obtained in this scenario. Because N+ and N- are discrete, this value is sometimes
slightly different than the target R.
AUC1' and AUC2' are the adjusted areas under the ROC curve for diagnostic tests 1 and 2, respectively.
Diff' is AUC2 - AUC1. This is the adjusted difference to be detected.
AUC1' and AUC2' are the actual areas under the ROC curve for diagnostic tests 1 and 2, respectively.
Diff is AUC2 - AUC1. This is the difference to be detected.
Alpha is the probability of rejecting a true null hypothesis.
FPR1, FPR2 are the lower and upper bounds on the false positive rates.
B1 and B2 are the ratios of the standard deviations of the negative and positive groups for each test.

Summary Statements
A sample of 20 from the positive group and 40 from the negative group achieve 5% power to
detect a difference of 0.0250 between a diagnostic test with an area under the ROC curve (AUC)
of 0.8000 and another diagnostic test with an AUC of 0.8250 using a two-sided z-test at a
significance level of 0.0500. The data are discrete (rating scale) responses. The AUC is
computed between false positive rates of 0.000 and 1.000. The ratio of the standard deviation
of the responses in the negative group to the standard deviation of the responses in the
positive group for diagnostic test 1 is 1.000 and for diagnostic test 2 is 1.000. The
correlation between the two diagnostic tests is assumed to be 0.600 for the positive group and
0.600 for the negative group.

This report shows the power for each of the sample sizes. Most of the definitions are standard. However, a special
explanation must be given for AUC and AUC’.
AUC’
This is the adjusted area under the curve. A rescaling, discussed earlier, has been applied so that the minimum
area is 0.5 and the maximum area is 1.0.
AUC
This is the actual area under the curve. This value will equal the adjusted area when the FPR range is set from 0.0
to 1.0. Otherwise, these values will be different.

Plots Section

These plots show the power versus the sample size for the three values of AUC1.

Example 2 – Calculating Sample Size

Continuing Example 1, the investigator wants to know the exact sample size needed for each of the three values
of AUC2. The investigator wants to look at the Numeric Report.

Setup
This section presents the values of each of the parameters needed to run this example. First, from the PASS Home
window, load the Tests for Two ROC Curves procedure window by clicking on ROC, and then clicking on
Tests for Two ROC Curves. You may then make the appropriate entries as listed below, or open Example 2 by
going to the File menu and choosing Open Example Template.
Option Value
Design Tab
Solve For ................................................ Sample Size
Alternative Hypothesis ............................ Two-Sided Test
Power ...................................................... 0.90
Alpha ....................................................... 0.05
Group Allocation ..................................... Enter R = N-/N+, solve for N+ and N-
R (Sample Allocation Ratio) ................... 2
AUC1 (Area Under Curve 1) ................... 0.80
AUC2 (Area Under Curve 2) ................... 0.825 0.85 0.9
Lower FPR .............................................. 0.00
Upper FPR .............................................. 1.00
Correlation+ ............................................ 0.6
Correlation- ............................................. 0.6
Type of Data ........................................... Discrete (Ratings)
B1 (SD Ratio) .......................................... 1
B2 (SD Ratio) .......................................... 1

Output
Click the Calculate button to perform the calculations and generate the following output.

Numeric Results
Numeric Results for Testing AUC1 = AUC2 with Discrete (Rating) Data
Test Type = Two-Sided. FPR1 = 0.0. FPR2 = 1.0. B1 = 1.000. B2 = 1.000. Allocation Ratio = 2.000.

Target Actual Target Actual

Power Power N+ N- N R R AUC1' AUC2' Diff' AUC1 AUC2 Diff Alpha
0.90 0.9001 1937 3874 5811 2.0 2.0 0.8000 0.8250 0.0250 0.8000 0.8250 0.0250 0.050
0.90 0.9002 480 960 1440 2.0 2.0 0.8000 0.8500 0.0500 0.8000 0.8500 0.0500 0.050
0.90 0.9012 117 234 351 2.0 2.0 0.8000 0.9000 0.1000 0.8000 0.9000 0.1000 0.050

This report shows the sample size needed to achieve 90% power for each value of AUC2.

Example 3 – Partial Area Under Curve

Continuing Example 2, the investigator knows that FPR values between 0.0 and 0.20 are the only values of
interest. Hence, he wants to investigate the sample size needed when the FPR range is confined to this range.

Setup
This section presents the values of each of the parameters needed to run this example. First, from the PASS Home
window, load the Tests for Two ROC Curves procedure window by clicking on ROC, and then clicking on
Tests for Two ROC Curves. You may then make the appropriate entries as listed below, or open Example 3 by
going to the File menu and choosing Open Example Template.
Option Value
Design Tab
Solve For ................................................ Sample Size
Alternative Hypothesis ............................ Two-Sided Test
Power ...................................................... 0.90
Alpha ....................................................... 0.05
Group Allocation ..................................... Enter R = N-/N+, solve for N+ and N-
R (Sample Allocation Ratio) ................... 2
AUC1 (Area Under Curve 1) ................... 0.80
AUC2 (Area Under Curve 2) ................... 0.825 0.85 0.9
Lower FPR .............................................. 0.00
Upper FPR .............................................. 0.20
Correlation+ ............................................ 0.6
Correlation- ............................................. 0.6
Type of Data ........................................... Discrete (Ratings)
B1 (SD Ratio) .......................................... 1
B2 (SD Ratio) .......................................... 1

Output
Click the Calculate button to perform the calculations and generate the following output.

Numeric Results
Numeric Results for Testing AUC1 = AUC2 with Discrete (Rating) Data
Test Type = Two-Sided. FPR1 = 0.0. FPR2 = 0.200. B1 = 1.000. B2 = 1.000. Allocation Ratio = 2.000.

Target Actual Target Actual

Power Power N+ N- N R R AUC1' AUC2' Diff' AUC1 AUC2 Diff Alpha
0.90 0.9000 4095 8190 12285 2.0 2.0 0.8000 0.8250 0.0250 0.1280 0.1370 0.0090 0.050
0.90 0.9002 1012 2024 3036 2.0 2.0 0.8000 0.8500 0.0500 0.1280 0.1460 0.0180 0.050
0.90 0.9001 242 484 726 2.0 2.0 0.8000 0.9000 0.1000 0.1280 0.1640 0.0360 0.050

Note that the necessary sample size has more than doubled.

Example 4 – Validation using Obuchowski

The formulas used in this module were given in Obuchowski and McClish (1997). On pages 1538 - 1540, they
provide an example which will be duplicated here. The study compared an automated classification system with
an expert mammographer in their ability to find malignant breast lesions. The measure of diagnostic accuracy is
the AUC from an FPR of 0.0 to an FPR of 0.2. The allocation ratio is 2. B1 = B2 = 1.0. Correlation+ =
Correlation- = 0.6. The values of A1 and A2 are found to be 2.6 and 1.9. These translate to adjusted AUC’s of
0.922222 and 0.819444. A two-tailed test is envisioned in which alpha is 0.05. A power of 80% is desired. In their
article, they found N+ = 109 and N- = 218.

Setup
This section presents the values of each of the parameters needed to run this example. First, from the PASS Home
window, load the Tests for Two ROC Curves procedure window by clicking on ROC, and then clicking on
Tests for Two ROC Curves. You may then make the appropriate entries as listed below, or open Example 4 by
going to the File menu and choosing Open Example Template.
Option Value
Design Tab
Solve For ................................................ Sample Size
Alternative Hypothesis ............................ Two-Sided Test
Power ...................................................... 0.80
Alpha ....................................................... 0.05
Group Allocation ..................................... Enter R = N-/N+, solve for N+ and N-
R (Sample Allocation Ratio) ................... 2
AUC1 (Area Under Curve 1) ................... 0.922222
AUC2 (Area Under Curve 2) ................... 0.819444
Lower FPR .............................................. 0.00
Upper FPR .............................................. 0.20
Correlation+ ............................................ 0.6
Correlation- ............................................. 0.6
Type of Data ........................................... Discrete (Ratings)
B1 (SD Ratio) .......................................... 1
B2 (SD Ratio) .......................................... 1

Output
Click the Calculate button to perform the calculations and generate the following output.

Numeric Results
Numeric Results for Testing AUC1 = AUC2 with Discrete (Rating) Data
Test Type = Two-Sided. FPR1 = 0.0. FPR2 = 0.200. B1 = 1.000. B2 = 1.000. Allocation Ratio = 2.000.

Target Actual Target Actual

Power Power N+ N- N R R AUC1' AUC2' Diff' AUC1 AUC2 Diff Alpha
0.80 0.8027 109 218 327 2.0 2.0 0.9222 0.8194 -0.1028 0.1720 0.1350 -0.0370 0.050

Note that the sample sizes of 109 and 218 match exactly with the results of Obuchowski.

Example 5 – Validation using Hanley

The formulas for continuous data were given in Hanley and McNeil (1982). On page 34 of their article they
provide a table of sample sizes calculated using their formulas. We will duplicate their results for AUC1 = 0.70
and AUC2 = 0.75. Using a one-sided test of significance with alpha = 0.05 and a sample allocation ratio of 1.0,
they found the number of subjects for both the positive and negative groups to be 652, 897, and 1131 for
statistical powers of 80%, 90%, and 95%, respectively.
When using Hanley and McNeil’s formulation, the values of B1, B2, FPR1, and FPR2 are ignored. Also, in this
case, the correlations are set to 0.0.

Setup
This section presents the values of each of the parameters needed to run this example. First, from the PASS Home
window, load the Tests for Two ROC Curves procedure window by clicking on ROC, and then clicking on
Tests for Two ROC Curves. You may then make the appropriate entries as listed below, or open Example 5 by
going to the File menu and choosing Open Example Template.
Option Value
Design Tab
Solve For ................................................ Sample Size
Alternative Hypothesis ............................ One-Sided Test
Power ...................................................... 0.8 0.9 0.95
Alpha ....................................................... 0.05
Group Allocation ..................................... Equal (N+ = N-)
AUC1 (Area Under Curve 1) ................... 0.7
AUC2 (Area Under Curve 2) ................... 0.75
Lower FPR .............................................. 0.00
Upper FPR .............................................. 1.00
Correlation+ ............................................ 0.0
Correlation- ............................................. 0.0
Type of Data ........................................... Continuous

Output
Click the Calculate button to perform the calculations and generate the following output.

Numeric Results 1
Numeric Results for Testing AUC1 = AUC2 with Continuous Data
Test Type = One-Sided. FPR1 = 0.0. FPR2 = 1.0. B1 = 1.000. B2 = 1.000. Allocation Ratio = 1.000.

Target Actual
Power Power N+ N- N AUC1' AUC2' Diff' AUC1 AUC2 Diff Alpha
0.80 0.8003 652 652 1304 0.7000 0.7500 0.0500 0.7000 0.7500 0.0500 0.050
0.90 0.9001 897 897 1794 0.7000 0.7500 0.0500 0.7000 0.7500 0.0500 0.050
0.95 0.9501 1129 1129 2258 0.7000 0.7500 0.0500 0.7000 0.7500 0.0500 0.050

Note that the sample sizes of 897 and 652 match exactly with the results of Hanley and McNeil. The 1129 is two
less than their 1131. This difference may be due to refinements in computing the normal probability distribution
used in PASS. You can compare these sample sizes by calculating the power.

Numeric Results 2
Numeric Results for Testing AUC1 = AUC2 with Continuous Data
Test Type = One-Sided. FPR1 = 0.0. FPR2 = 1.0. B1 = 1.000. B2 = 1.000. Allocation Ratio = 1.000.

Power N+ N- N AUC1' AUC2' Diff' AUC1 AUC2 Diff Alpha

0.9499 1128 1128 2256 0.7000 0.7500 0.0500 0.7000 0.7500 0.0500 0.050
0.9501 1129 1129 2258 0.7000 0.7500 0.0500 0.7000 0.7500 0.0500 0.050
0.9502 1130 1130 2260 0.7000 0.7500 0.0500 0.7000 0.7500 0.0500 0.050
0.9504 1131 1131 2262 0.7000 0.7500 0.0500 0.7000 0.7500 0.0500 0.050
0.9505 1132 1132 2264 0.7000 0.7500 0.0500 0.7000 0.7500 0.0500 0.050

Note that the power for 1129 is 0.9501 while the power for 1131 is 0.9505. This is only a slight difference and
explains why this value showed up in their table.

Confidence Intervals For The Area Under An ROC Curve
No ratings yet
Confidence Intervals For The Area Under An ROC Curve
7 pages
Comparing Two ROC Curves-Independent Groups Design
No ratings yet
Comparing Two ROC Curves-Independent Groups Design
21 pages
Lecture 7
No ratings yet
Lecture 7
26 pages
Habibzadeh 2016
No ratings yet
Habibzadeh 2016
11 pages
Understanding ROC Curves in Diagnostics
No ratings yet
Understanding ROC Curves in Diagnostics
3 pages
Predicting Cesarean with ROC Curve
No ratings yet
Predicting Cesarean with ROC Curve
9 pages
Two-Sample T-Tests Assuming Equal Variance
No ratings yet
Two-Sample T-Tests Assuming Equal Variance
19 pages
Two-Sample T-Tests Assuming Equal Variance
No ratings yet
Two-Sample T-Tests Assuming Equal Variance
19 pages
83 ROCCurves
No ratings yet
83 ROCCurves
9 pages
International Biometric Society
No ratings yet
International Biometric Society
10 pages
Curvas ROC
No ratings yet
Curvas ROC
2 pages
Bradley PR97 PDF
No ratings yet
Bradley PR97 PDF
15 pages
Evolution of Disgnostic Testing
No ratings yet
Evolution of Disgnostic Testing
41 pages
AUC and Concordance
No ratings yet
AUC and Concordance
16 pages
Comparing The Areas Under Two or More Correlated Receiver Operating Characteristic Curves A Nonparametric Approach
No ratings yet
Comparing The Areas Under Two or More Correlated Receiver Operating Characteristic Curves A Nonparametric Approach
10 pages
1603 - EvaluatingDiagnosis - PDF Version 1
No ratings yet
1603 - EvaluatingDiagnosis - PDF Version 1
5 pages
Confidence Intervals For The Difference Between Two Proportions
No ratings yet
Confidence Intervals For The Difference Between Two Proportions
17 pages
Null Vs Alternative Hypothesis, Rejection Region, and Significance Level Type I Error and Type II Error, Test For The Mean. Population Variance Known, P-Value
No ratings yet
Null Vs Alternative Hypothesis, Rejection Region, and Significance Level Type I Error and Type II Error, Test For The Mean. Population Variance Known, P-Value
14 pages
Diagnostic Test: Magdalena Sidhartani
No ratings yet
Diagnostic Test: Magdalena Sidhartani
17 pages
ROC Curve
No ratings yet
ROC Curve
20 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
60 pages
Tests For Paired Sensitivities
No ratings yet
Tests For Paired Sensitivities
10 pages
Two-Sample T-Tests Using Effect Size
No ratings yet
Two-Sample T-Tests Using Effect Size
11 pages
Statistical Inference and Hypothesis Testing
No ratings yet
Statistical Inference and Hypothesis Testing
34 pages
Kja 21209
No ratings yet
Kja 21209
12 pages
PROS - Ivanna Kristianti T - Predicting Receiver Operating Characteristic - Fulltext
No ratings yet
PROS - Ivanna Kristianti T - Predicting Receiver Operating Characteristic - Fulltext
5 pages
Inference Using Normal and T Distribution
No ratings yet
Inference Using Normal and T Distribution
9 pages
Flach Roc Analysis
No ratings yet
Flach Roc Analysis
12 pages
Binary Diagnostic Tests - Single Sample
No ratings yet
Binary Diagnostic Tests - Single Sample
6 pages
Tests For Two Proportions
No ratings yet
Tests For Two Proportions
29 pages
Non - Parametic Test
No ratings yet
Non - Parametic Test
5 pages
Equivalence Tests For The Ratio of Two Means in A 2x2 Cross-Over Design
No ratings yet
Equivalence Tests For The Ratio of Two Means in A 2x2 Cross-Over Design
8 pages
Regression Analysis for Economists
No ratings yet
Regression Analysis for Economists
31 pages
Statistical Application Da 2 Srishti
No ratings yet
Statistical Application Da 2 Srishti
7 pages
Two-Sample Z-Tests Allowing Unequal Variance
No ratings yet
Two-Sample Z-Tests Allowing Unequal Variance
14 pages
Power
No ratings yet
Power
29 pages
The Optimization Problem
No ratings yet
The Optimization Problem
45 pages
Statistical Inference
No ratings yet
Statistical Inference
14 pages
Normal Distribution
No ratings yet
Normal Distribution
8 pages
Kohl PerformanceMeasures2012
No ratings yet
Kohl PerformanceMeasures2012
4 pages
Guide To Interprate ROC Analysis
No ratings yet
Guide To Interprate ROC Analysis
4 pages
Test Accuracy: Sensitivity & Specificity
No ratings yet
Test Accuracy: Sensitivity & Specificity
26 pages
Hypothesis Testing in Machine Learning Using Python - by Yogesh Agrawal - 151413
No ratings yet
Hypothesis Testing in Machine Learning Using Python - by Yogesh Agrawal - 151413
15 pages
12 Stats Review
No ratings yet
12 Stats Review
51 pages
ST332 - Lecture07 - 2 Variances & Paired T-Test
No ratings yet
ST332 - Lecture07 - 2 Variances & Paired T-Test
7 pages
Hypothesis Testing For Means & Proportions
No ratings yet
Hypothesis Testing For Means & Proportions
20 pages
Course Code: 8614 Course Name: Educational Statistics Assignment: 2 Semester: Spring 2022 Program: B.Ed
No ratings yet
Course Code: 8614 Course Name: Educational Statistics Assignment: 2 Semester: Spring 2022 Program: B.Ed
19 pages
Quantitative Methods Sessions 11 - 21
No ratings yet
Quantitative Methods Sessions 11 - 21
41 pages
Walpole Chapter 10
No ratings yet
Walpole Chapter 10
13 pages
Minitab 16: ANOVA, Normality, Tukey, Control Charts
No ratings yet
Minitab 16: ANOVA, Normality, Tukey, Control Charts
63 pages
Statistics For Management Unit 3 2marks
No ratings yet
Statistics For Management Unit 3 2marks
4 pages
5 - Stat Lecture..
No ratings yet
5 - Stat Lecture..
44 pages
Multiple Comparisons Testing
No ratings yet
Multiple Comparisons Testing
7 pages
Statistics
No ratings yet
Statistics
66 pages
Statistical Inference and Testing of Single Mean
No ratings yet
Statistical Inference and Testing of Single Mean
50 pages
Course Unit 8 - Summary of Basic Statistical Tests III-2
No ratings yet
Course Unit 8 - Summary of Basic Statistical Tests III-2
21 pages
CH 10
No ratings yet
CH 10
43 pages
Mathematics Soln
No ratings yet
Mathematics Soln
10 pages
Module07 Notes
No ratings yet
Module07 Notes
14 pages
Quiz - 1 - XLRI - BMD - 22-24
No ratings yet
Quiz - 1 - XLRI - BMD - 22-24
22 pages
Student Group Project Guide
No ratings yet
Student Group Project Guide
3 pages
Bsc-Statistics - 'Syllabus PDF
No ratings yet
Bsc-Statistics - 'Syllabus PDF
141 pages
TV Shows' Impact on Serotonin Levels
No ratings yet
TV Shows' Impact on Serotonin Levels
4 pages
Social Change: Decomposition Method
No ratings yet
Social Change: Decomposition Method
48 pages
Experimental Research: By: Neline Amelano
No ratings yet
Experimental Research: By: Neline Amelano
21 pages
Mcclure Som
No ratings yet
Mcclure Som
12 pages
Comprehensive Applied Basic Sciences For MDS Students
69% (13)
Comprehensive Applied Basic Sciences For MDS Students
354 pages
Eapp Q2 W1
50% (2)
Eapp Q2 W1
37 pages
Res 03: Experimental Research: By: George T. Cadungon JR
No ratings yet
Res 03: Experimental Research: By: George T. Cadungon JR
32 pages
Guide For PETA 1 4th Quarter
No ratings yet
Guide For PETA 1 4th Quarter
4 pages
Palabra de Ladrones / Vow of Thieves Mary Pearson Download
100% (1)
Palabra de Ladrones / Vow of Thieves Mary Pearson Download
123 pages
Conditional Probability
100% (1)
Conditional Probability
13 pages
66r-11selecting Probability Distribution Functions For Use in Cost& Schedule Risk Simulaiton Mofels
No ratings yet
66r-11selecting Probability Distribution Functions For Use in Cost& Schedule Risk Simulaiton Mofels
7 pages
6 DM
No ratings yet
6 DM
2 pages
6 One Hot Encoding
No ratings yet
6 One Hot Encoding
3 pages
Uma Sekaran
67% (3)
Uma Sekaran
158 pages
Econ 210 Exam 4 7
No ratings yet
Econ 210 Exam 4 7
18 pages
Using Gretl For POE4
No ratings yet
Using Gretl For POE4
500 pages
Human Body DK Eyewitness Richard Walker PDF Download
100% (1)
Human Body DK Eyewitness Richard Walker PDF Download
153 pages
Sentiment Analysis on Nias MSME Sales
No ratings yet
Sentiment Analysis on Nias MSME Sales
8 pages
A Modern Theory of Factorial Design - R. Mukerjee & C. F. Jeff Wu PDF
No ratings yet
A Modern Theory of Factorial Design - R. Mukerjee & C. F. Jeff Wu PDF
231 pages
W12 Reading Quiz Probability and Confidence Intervals Math For The Real World PDF
No ratings yet
W12 Reading Quiz Probability and Confidence Intervals Math For The Real World PDF
5 pages
Data Science Foundations Guide
100% (2)
Data Science Foundations Guide
143 pages
Environment and Citizenship 1st Edition Benito Cao PDF Download
No ratings yet
Environment and Citizenship 1st Edition Benito Cao PDF Download
116 pages
Case Study SCM
No ratings yet
Case Study SCM
2 pages
GST 221 History and Philosophy of Science
No ratings yet
GST 221 History and Philosophy of Science
57 pages
Correction Factor
No ratings yet
Correction Factor
3 pages
Concept Maps As A Tool For Teaching Organic Chemical Reactions
No ratings yet
Concept Maps As A Tool For Teaching Organic Chemical Reactions
11 pages
Parameter Estimation Techniques
No ratings yet
Parameter Estimation Techniques
8 pages

Tests For Two ROC Curves: PASS Sample Size Software

Uploaded by

Tests For Two ROC Curves: PASS Sample Size Software

Uploaded by

PASS Sample Size Software NCSS.

Tests for Two ROC

The Binormal Model

VAlt(∆ ) = V (θ ) + V (θ ) − 2C(θ ,θ )

g1 g 2 B1 B2 (r−2 + Rr+2 ) f1 g 2 A1 B2 r+2

Power and Alpha

Sample Size (When Solving for Sample Size)

• Equal (N+ = N-)

• Enter N+, solve for N-

• Enter N-, solve for N+

• Enter R = N-/N+, solve for N+ and N-

• Enter percentage in Group 1, solve for N+ and N-

R (Group Sample Size Ratio)

Sample Size (When Not Solving for Sample Size)

• Equal (N+ = N-)

• Enter N+ and N- individually

• Enter N+ and R, where N- = R * N+

• Enter total sample size and percentage in Group 1

Sample Size Per Group

Effect Size – Area Under the Curve

Effect Size – False Positive Rate Limits

Effect Size – Correlations

Effect Size – Type of Data

Example 1 – Calculating Power

Example 2 – Calculating Sample Size

Target Actual Target Actual

Example 3 – Partial Area Under Curve

Target Actual Target Actual

Example 4 – Validation using Obuchowski

Target Actual Target Actual

Example 5 – Validation using Hanley

Power N+ N- N AUC1' AUC2' Diff' AUC1 AUC2 Diff Alpha

You might also like