
Lecture 10:
Logistic Regression II — Multinomial Data

Prof. Sharyn O'Halloran
Sustainable Development U9611
Econometrics II
Logit vs. Probit Review
- Use with a dichotomous dependent variable
- Need a link function F(Y) going from the original Y to a continuous Y′
  - Probit: F(Y) = Φ⁻¹(Y)
  - Logit: F(Y) = log[Y/(1−Y)]
- Do the regression and transform the findings back from Y′ to Y, interpreted as a probability
  - Unlike linear regression, the impact of an independent variable X depends on its value
  - And on the values of all other independent variables
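As a quick numerical check, Stata's built-in logit() and invlogit() functions compute the logit link and its inverse (a minimal sketch):

. display logit(.7)         // log(.7/.3) = .8473, the log-odds of Y = .7
. display invlogit(.8473)   // transforms the log-odds back to .7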
Classical vs. Logistic Regression
- Data structure: continuous vs. discrete
  - Logistic/probit regression is used when the dependent variable is binary or dichotomous.
- Different assumptions between traditional regression and logistic regression
  - The population means of the dependent variable at each level of the independent variable are not on a straight line, i.e., no linearity.
  - The variance of the errors is not constant, i.e., no homogeneity of variance.
  - The errors are not normally distributed, i.e., no normality.
Logistic Regression Assumptions
1. The model is correctly specified, i.e.,
   - The true conditional probabilities are a logistic function of the independent variables;
   - No important variables are omitted;
   - No extraneous variables are included; and
   - The independent variables are measured without error.
2. The cases are independent.
3. The independent variables are not linear combinations of each other.
   - Perfect multicollinearity makes estimation impossible,
   - While strong multicollinearity makes estimates imprecise.
About Logistic Regression
- It uses maximum likelihood estimation rather than the least squares estimation used in traditional multiple regression.
- The general form of the distribution is assumed.
- Starting values of the estimated parameters are used, and the likelihood that the sample came from a population with those parameters is computed.
- The values of the estimated parameters are adjusted iteratively until the maximum likelihood value for the estimated parameters is obtained.
  - That is, maximum likelihood approaches try to find estimates of parameters that make the data actually observed "most likely."
Interpreting Logistic Coefficients
- Logistic slope coefficients can be interpreted as the effect of a one-unit change in the X variable on the predicted logits, with the other variables in the model held constant.
  - That is, how a one-unit change in X affects the log of the odds when the other variables in the model are held constant.
Interpreting Odds Ratios
- Odds ratios in logistic regression can be interpreted as the multiplicative effect of a one-unit change in X on the predicted odds, with the other variables in the model held constant.
Interpreting Odds Ratios
- An important property of odds ratios is that they are constant.
  - It does not matter what values the other independent variables take on.
- For instance, say you estimate the following logistic regression model:
  - logit(p) = -13.70837 + .1685*x1 + .0039*x2
  - The effect on the odds of a 1-unit increase in x1 is exp(.1685) = 1.18
  - Meaning the odds increase by 18%
- Incrementing x1 increases the odds by 18% regardless of the value of x2 (0, 1000, etc.)
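To see why the other variables drop out, write the odds at x1 and at x1 + 1 and take the ratio:

odds(x1+1, x2) / odds(x1, x2) = exp(β0 + β1*(x1+1) + β2*x2) / exp(β0 + β1*x1 + β2*x2) = exp(β1) = exp(.1685) ≈ 1.18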
Example: Admissions Data
- 20 observations of admission into a graduate program
- Data collected includes whether admitted, gender (1 if male), and the student's aptitude on a 10-point scale.

aptitude   gender   admit
    8        1        1
    7        1        0
    5        1        1
    3        1        0
    3        1        0
    5        1        1
    7        1        1
    8        1        1
    5        1        1
    5        1        1
    4        0        0
    7        0        1
    3        0        1
    2        0        0
    4        0        0
    2        0        0
    3        0        0
    4        0        1
    3        0        0
    2        0        0
Admissions Example – Calculating the Odds Ratio
- Example: admissions to a graduate program
  - Assume 70% of the males and 30% of the females are admitted in a given year
  - Let P equal the probability a male is admitted.
  - Let Q equal the probability a female is admitted.
- Odds males are admitted: odds(M) = P/(1−P) = .7/.3 = 2.33
- Odds females are admitted: odds(F) = Q/(1−Q) = .3/.7 = 0.43
- The odds ratio for male vs. female admits is then
  - odds(M)/odds(F) = 2.33/0.43 = 5.44
- The odds of being admitted to the program are about 5.44 times greater for males than for females.
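A one-line check of this arithmetic in Stata:

. display (.7/.3)/(.3/.7)   // 5.4444444, the odds ratio calculated by hand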
Ex. 1: Categorical Independent Var.
. logit admit gender
Logit estimates Number of obs = 20
LR chi2(1) = 3.29
Prob > chi2 = 0.0696
Log likelihood = -12.217286 Pseudo R2 = 0.1187
------------------------------------------------------------------------------
admit | Coef. Std. Err. z P>|z| [95% Conf. Interval]
---------+--------------------------------------------------------------------
gender | 1.694596 .9759001 1.736 0.082 -.2181333 3.607325
_cons | -.8472979 .6900656 -1.228 0.220 -2.199801 .5052058
------------------------------------------------------------------------------

Formula to back out Y from logit estimates: Y = exp(Xβ) / [1 + exp(Xβ)]

. dis exp(_b[gender]+_b[_cons])/(1+exp(_b[gender]+_b[_cons]))
.7
. dis exp(_b[_cons])/(1+exp(_b[_cons]))
.3
Ex. 1: Categorical Independent Variable
To get the results in terms of odds ratios:
logit admit gender, or

Logit estimates Number of obs = 20
LR chi2(1) = 3.29
Prob > chi2 = 0.0696
Log likelihood = -12.217286 Pseudo R2 = 0.1187

------------------------------------------------------------------------------
admit | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval]
---------+--------------------------------------------------------------------
gender | 5.444444 5.313234 1.736 0.082 .8040183 36.86729
------------------------------------------------------------------------------

- Translates the original logit coefficient into an odds ratio on gender
- Same as the odds ratio we calculated by hand above
- So 5.4444 is the "exponentiated coefficient": exp(1.694596) = 5.444444
- Don't confuse this with the logit coefficient (1.6945)

Ex. 2: Continuous Independent Var.
logit admit apt

Look at the probability of being admitted to graduate school given the candidate's aptitude.

Iteration 0: log likelihood = -13.862944
Iteration 1: log likelihood = -9.6278718
Iteration 2: log likelihood = -9.3197603
Iteration 3: log likelihood = -9.3029734
Iteration 4: log likelihood = -9.3028914
Logit estimates Number of obs = 20
LR chi2(1) = 9.12
Prob > chi2 = 0.0025
Log likelihood = -9.3028914 Pseudo R2 = 0.3289
------------------------------------------------------------------------------
admit | Coef. Std. Err. z P>|z| [95% Conf. Interval]
---------+--------------------------------------------------------------------
apt | .9455112 .422872 2.236 0.025 .1166974 1.774325
_cons | -4.095248 1.83403 -2.233 0.026 -7.689881 -.5006154
------------------------------------------------------------------------------

- Aptitude is positive and significantly related to being admitted into the graduate program
Ex. 2: Continuous Independent Var.
logit admit apt, or

Logit estimates Number of obs = 20
LR chi2(1) = 9.12
Prob > chi2 = 0.0025
Log likelihood = -9.3028914 Pseudo R2 = 0.3289

------------------------------------------------------------------------------
admit | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval]
---------+--------------------------------------------------------------------
apt | 2.574129 1.088527 2.236 0.025 1.123779 5.8963
------------------------------------------------------------------------------

This means:
[Pr(admit | apt+1) / (1 - Pr(admit | apt+1))] / [Pr(admit | apt) / (1 - Pr(admit | apt))] = 2.57
Ex. 2: Continuous Independent Var.

[Figure: predicted Pr(admit) plotted against aptitude (0 to 10); the fitted logistic curve rises from near 0 to near 1, passing through a 50% chance of being admitted in the middle of the aptitude range]

. predict p
. line p aptitude, sort
Example 3: Categorical & Continuous
Independent Variables
logit admit gender apt
Logit estimates Number of obs = 20
LR chi2(2) = 9.16
Prob > chi2 = 0.0102
Log likelihood = -9.2820991 Pseudo R2 = 0.3304
------------------------------------------------------------------------------
admit | Coef. Std. Err. z P>|z| [95% Conf. Interval]
---------+--------------------------------------------------------------------
gender | .2671938 1.300899 0.205 0.837 -2.282521 2.816909
apt | .8982803 .4713791 1.906 0.057 -.0256057 1.822166
_cons | -4.028765 1.838354 -2.192 0.028 -7.631871 -.4256579
------------------------------------------------------------------------------

Gender is now insignificant!


Once aptitude is taken into account gender plays no role
Likelihood Ratio Test
- Log-likelihoods can be used to test hypotheses about nested models.
- Say we want to test the null hypothesis H0 about one or more coefficients
  - For example, H0: β1 = 0, or H0: β1 = β2 = 0
- Then the likelihood ratio is the ratio of the likelihood of the model restricted by H0 over the likelihood of the unrestricted model:
  - L(model restricted by H0) / L(unrestricted model)
- If H0 is true, then this ratio should be near 1
Likelihood Ratio Test
- Under general assumptions,
  -2 * log(likelihood ratio) ~ χ2(k)
  - Where the k degrees of freedom are the number of restrictions specified in H0
- This is called a likelihood ratio test
- Call the restricted likelihood L0 and the unrestricted likelihood L.
- Then we can rewrite the equation above as:
  -2*log(L0 / L) = [-2*log(L0)] - [-2*log(L)] ~ χ2(k)
- So the difference of the log-likelihoods (each multiplied by -2) will be distributed χ2
Likelihood Ratio Test
- In our admissions example, take
  - logit[Pr(admit)] = β0 + β1*gender + β2*aptitude
  - The log-likelihood of this model was -9.282

logit admit gender apt
Logit estimates Number of obs = 20
LR chi2(2) = 9.16
Prob > chi2 = 0.0102
Log likelihood = -9.2820991 Pseudo R2 = 0.3304   <- log-likelihood with no restrictions
------------------------------------------------------------------------------
admit | Coef. Std. Err. z P>|z| [95% Conf. Interval]
---------+--------------------------------------------------------------------
gender | .2671938 1.300899 0.205 0.837 -2.282521 2.816909
apt | .8982803 .4713791 1.906 0.057 -.0256057 1.822166
_cons | -4.028765 1.838354 -2.192 0.028 -7.631871 -.4256579
------------------------------------------------------------------------------
Likelihood Ratio Test
- First look at H0: β2 = 0

logit admit gender, or
Logit estimates Number of obs = 20
LR chi2(1) = 3.29
Prob > chi2 = 0.0696
Log likelihood = -12.217286 Pseudo R2 = 0.1187   <- log-likelihood with β2 (aptitude) restricted to 0
------------------------------------------------------------------------------
admit | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval]
---------+--------------------------------------------------------------------
gender | 5.444444 5.313234 1.736 0.082 .8040183 36.86729
------------------------------------------------------------------------------
Likelihood Ratio Test
- First look at H0: β2 = 0
  - Recall the unrestricted log-likelihood was -9.282
  - The log-likelihood of the regression with gender but not aptitude was -12.217
- Likelihood ratio test:
  - [-2 * (-12.217)] - [-2 * (-9.282)] = 5.87
  - From Stata:
    . dis 1 - chi2(1, 5.87)
    .01540105
- Significant at the 5% level. Therefore we can reject the null hypothesis that β2 = 0.
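Stata's built-in lrtest command automates the same computation; a minimal sketch, assuming the admissions data are in memory:

. logit admit gender apt    // unrestricted model
. estimates store full
. logit admit gender        // restricted model, dropping apt
. lrtest full .             // reports LR chi2(1) = 5.87, Prob > chi2 = 0.0154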
Likelihood Ratio Test
- Now look at H0: β1 = 0

logit admit apt, or
Logit estimates Number of obs = 20
LR chi2(1) = 9.12
Prob > chi2 = 0.0025
Log likelihood = -9.3028914 Pseudo R2 = 0.3289   <- log-likelihood with β1 (gender) restricted to 0
------------------------------------------------------------------------------
admit | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval]
---------+--------------------------------------------------------------------
apt | 2.574129 1.088527 2.236 0.025 1.123779 5.8963
------------------------------------------------------------------------------
Likelihood Ratio Test
- Now look at H0: β1 = 0
  - The log-likelihood of the regression with aptitude but not gender was -9.303
- Likelihood ratio test:
  - [-2 * (-9.303)] - [-2 * (-9.282)] = 0.042
  - From Stata:
    . dis 1 - chi2(1, .042)
    .83761977
- Not significant at the 5% level. Therefore we fail to reject the null hypothesis that β1 = 0.
Example 4: Honors Composition using High School and Beyond Dataset
use http://www.gseis.ucla.edu/courses/data/hsb2

Variable Obs Mean Std. Dev. Min Max

id 200 100.5 57.87918 1 200
female 200 .545 .4992205 0 1
race 200 3.43 1.039472 1 4
ses 200 2.055 .7242914 1 3
schtyp 200 1.16 .367526 1 2
prog 200 2.025 .6904772 1 3
read 200 52.23 10.25294 28 76
write 200 52.775 9.478586 31 67
math 200 52.645 9.368448 33 75
science 200 51.85 9.900891 26 74
socst 200 52.405 10.73579 26 71
honors 200 .265 .4424407 0 1
ses1 200 .235 .4250628 0 1
ses2 200 .475 .5006277 0 1
ses3 200 .29 .4549007 0 1
Example 4: Categorical and continuous independent variables
generate honors = (write>=60)

/* create dummy coding for ses */
tabulate ses, generate(ses)   <- creates new variables ses1, ses2, and ses3

ses | Freq. Percent Cum.
------------+-----------------------------------
low | 47 23.50 23.50
middle | 95 47.50 71.00
high | 58 29.00 100.00
------------+-----------------------------------
Total | 200 100.00

tabulate honors
honors | Freq. Percent Cum.
------------+-----------------------------------
0 | 147 73.50 73.50
1 | 53 26.50 100.00
------------+-----------------------------------
Total | 200 100.00
Example 4: Categorical and continuous independent var.
describe honors female ses1 ses2 read math

storage display value
variable name type format label variable label
-------------------------------------------------------------------------------
honors float %9.0g
female float %9.0g fl
ses1 byte %8.0g ses==low
ses2 byte %8.0g ses==middle
read float %9.0g reading score
math float %9.0g math score

tab1 honors female ses1 ses2 read math

-> tabulation of honors
honors | Freq. Percent Cum.
------------+-----------------------------------
0 | 147 73.50 73.50
1 | 53 26.50 100.00
------------+-----------------------------------
Total | 200 100.00

-> tabulation of female
female | Freq. Percent Cum.
------------+-----------------------------------
male | 91 45.50 45.50
female | 109 54.50 100.00
------------+-----------------------------------
Total | 200 100.00

-> tabulation of ses1
ses==low | Freq. Percent Cum.
------------+-----------------------------------
0 | 153 76.50 76.50
1 | 47 23.50 100.00
------------+-----------------------------------
Total | 200 100.00

-> tabulation of ses2
ses==middle | Freq. Percent Cum.
------------+-----------------------------------
0 | 105 52.50 52.50
1 | 95 47.50 100.00
------------+-----------------------------------
Total | 200 100.00
Example 4: Categorical and continuous independent var.

[Figure: density histograms of reading score and math score, roughly 30 to 80 on the horizontal axis]

- We would normally worry about this, but….
- The logit link function takes logs of the series.
Example 4: Categorical and continuous independent variables
logit honors female ses1 ses2 read math

Logit estimates Number of obs = 200
LR chi2(5) = 87.30
Prob > chi2 = 0.0000
Log likelihood = -71.994756 Pseudo R2 = 0.3774

------------------------------------------------------------------------------
honors | Coef. Std. Err. z P>|z| [95% Conf. Interval]
---------+--------------------------------------------------------------------
female | 1.145726 .4513589 2.538 0.011 .2610792 2.030374
ses1 | -.0541296 .5945439 -0.091 0.927 -1.219414 1.111155
ses2 | -1.094532 .4833959 -2.264 0.024 -2.04197 -.1470932
read | .0687277 .0287044 2.394 0.017 .0124681 .1249873
math | .1358904 .0336874 4.034 0.000 .0698642 .2019166
_cons | -12.49919 1.926421 -6.488 0.000 -16.27491 -8.723475
------------------------------------------------------------------------------

test ses1 ses2
( 1) ses1 = 0.0
( 2) ses2 = 0.0
chi2( 2) = 6.13
Prob > chi2 = 0.0466

- So the socioeconomic variables are significant as a group.
Example 4: Categorical and continuous independent variables
logistic honors female ses1 ses2 read math   <- equivalent to logit with the ", or" option

Logit estimates Number of obs = 200
LR chi2(5) = 87.30
Prob > chi2 = 0.0000
Log likelihood = -71.994756 Pseudo R2 = 0.3774

------------------------------------------------------------------------------
honors | Odds Ratio Std. Err. z P>|z| [95% Conf. Interval]
---------+--------------------------------------------------------------------
female | 3.144725 1.4194 2.538 0.011 1.29833 7.616932
ses1 | .9473093 .563217 -0.091 0.927 .2954031 3.037865
ses2 | .3346963 .1617908 -2.264 0.024 .1297728 .8632135
read | 1.071145 .0307466 2.394 0.017 1.012546 1.133134
math | 1.145556 .0385909 4.034 0.000 1.072363 1.223746
-------------------------------------------------------------------------------

test ses1 ses2
( 1) ses1 = 0.0
( 2) ses2 = 0.0
chi2( 2) = 6.13
Prob > chi2 = 0.0466

- So the socioeconomic variables are significant as a group.
Graphing the Results
- Let's say we want to see how the probability of honors changes with the reading score
- Stata's postgr3 command will create a new variable giving the probability after a logit

[Figure: "Impact of Reading Score on Probability of Honors" - probability of honors (0 to .5) rising with reading score (30 to 80)]

. postgr3 read, gen(avg)
. line avg read, sort
Graphing the Results
- Can do this separately for males & females
- Marginal impact is higher for females than for males

[Figure: "Impact of Reading Score on Probability of Honors" - Average, Male, and Female curves rising with reading score (30 to 80); the female curve lies above the male curve]

. postgr3 read, gen(male) x(female=0) nodraw
. postgr3 read, gen(fem) x(female=1) nodraw
. graph twoway (line avg read, sort) (line male read, sort) (line fem read, sort)
Assessing Model Fit
- How good a job does the model do of predicting outcomes?
- General answer is "hits and misses"
  - What percent of the observations the model correctly predicts
- How to calculate (see the sketch below):
  - Use the model to generate the probability p that each observation will have Y=1
    - If p ≥ 0.5, predict Y=1
    - If p < 0.5, predict Y=0
  - Check predictions against the actual outcomes in the data
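One way to compute the hit rate by hand in Stata (a minimal sketch, run after fitting the logit of honors):

. predict p                        // predicted Pr(Y=1) for each observation
. gen byte yhat = (p >= .5)        // predicted outcome using the .5 cutoff
. gen byte hit = (yhat == honors)  // 1 if the prediction matches the actual outcome
. summarize hit                    // the mean of hit is the fraction correctly classified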
Assessing Model Fit
- Can do this by checking predictions
  - Events that happened that were predicted to happen
    - E.g., model correctly predicts honors
  - Events that didn't happen that were predicted not to happen
    - E.g., model correctly predicts no honors
- Or can go the other way around
  - The probability of a positive prediction given honors
    - This is the model's sensitivity
  - The probability of a negative prediction given no honors
    - This is the model's specificity
Example 4: Categorical and continuous independent variables
lstat

Logistic model for honors

-------- True --------
Classified | D ~D | Total
-----------+--------------------------+-----------
+ | 31 12 | 43
- | 22 135 | 157
-----------+--------------------------+-----------
Total | 53 147 | 200

Classified + if predicted Pr(D) >= .5
True D defined as honors ~= 0
--------------------------------------------------
Sensitivity Pr( +| D) 58.49%
Specificity Pr( -|~D) 91.84%
Positive predictive value Pr( D| +) 72.09%
Negative predictive value Pr(~D| -) 85.99%
--------------------------------------------------
False + rate for true ~D Pr( +|~D) 8.16%
False - rate for true D Pr( -| D) 41.51%
False + rate for classified + Pr(~D| +) 27.91%
False - rate for classified - Pr( D| -) 14.01%
--------------------------------------------------
Correctly classified 83.00%
--------------------------------------------------

- D is defined as a student getting honors; a case is classified + if predicted Pr(D) >= .5
- Sensitivity through negative predictive value summarize the correct predictions; the false + and false - rates summarize the incorrect predictions
- Overall success rate: (31 + 135) / 200 = 83%
Assessing Model Fit
- This is all calculated using 50% as the cutoff point for positive predictions
- But this isn't set in stone; depending on your application, you might want to change it
- You might want to avoid false positives
  - For example, don't convict innocent people
  - Then you would set the cutoff higher than 50%
- Or you might want to avoid false negatives
  - For example, don't report that someone who has a disease is actually healthy
  - Then you would set the cutoff lower than 50%
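In Stata, lstat takes a cutoff() option for this (a sketch, assuming the honors logit is the active model):

. lstat, cutoff(.75)   // fewer false positives: classify + only if Pr(D) >= .75
. lstat, cutoff(.25)   // fewer false negatives: classify + whenever Pr(D) >= .25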
Assessing Model Fit
- We can imagine changing the cutoff point π continuously from 0 to 1
- Recall that
  - Sensitivity = Prob( + | D )
  - Specificity = Prob( - | ~D )
- At π=0, everything is predicted to be positive
  - That means you will misclassify all the negatives
  - So sensitivity=1, specificity=0
- At π=1, everything is predicted to be negative
  - That means you will misclassify all the positives
  - So sensitivity=0, specificity=1
Assessing Model Fit
- In between, you can vary the number of false positives and false negatives
  - If your model does a good job of predicting outcomes, these should be low for all π
- The ROC curve plots sensitivity against 1 - specificity as π goes from 0 to 1
  - The better the model does at predicting, the greater the area under the ROC curve
- Produce these with the Stata command "lroc"
Example 4: Categorical and continuous independent variables

[Figure: ROC curve, sensitivity vs. 1 - specificity; area under the ROC curve = 0.8912]

lroc
Logistic model for honors
number of observations = 200
area under ROC curve = 0.8912
Example 4: Categorical and continuous independent variables
- Or, you can use the "lsens" command to directly plot the sensitivity and specificity as your cutoff changes from 0 to 1.

[Figure: sensitivity and specificity (0 to 1) plotted against the probability cutoff (0 to 1); sensitivity falls and specificity rises as the cutoff increases]

. lsens
Diagnostic Plots
- Can obtain predicted values in the usual way, with command "predict p"
- Two methods to calculate residuals
  - Pearson residuals: "predict x, dx2"
  - Deviance residuals: "predict z, ddeviance"
- Leverage: "predict b, dbeta"
- Draw the graphs (see the sketch below):
  - Pearson residuals vs. predicted probabilities
  - Deviance residuals vs. predicted probabilities
  - Leverage vs. predicted probabilities
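Putting the commands above together, a minimal runnable sequence (a sketch, assuming the logistic model of honors has just been fit):

. predict p              // predicted probabilities
. predict x, dx2         // H-L dX^2 (Pearson-type) residual contributions
. predict z, ddeviance   // H-L dD (deviance-type) residual contributions
. predict b, dbeta       // Pregibon's dbeta influence measure
. scatter x p, ti(Pearson Residuals vs. Predicted Probabilities)
. scatter z p, ti(Deviance Residuals vs. Predicted Probabilities)
. scatter b p, ti(Influence vs. Predicted Probabilities)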
Diagnostic Plots

[Figure: "Pearson Residuals vs. Predicted Probabilities" - H-L dX^2 (0 to 30) against Pr(honors); two distinct bands of points, the residuals for Y=1 (honors) and the residuals for Y=0 (no honors)]

- Two distinct patterns of residuals: one for Y=1, the other for Y=0
- As with all logits and probits, the residuals are definitely heteroskedastic

scatter x p, ti(Pearson Residuals vs. Predicted Probabilities)
Diagnostic Plots

[Figure: the same Pearson residual plot, with observations 83 and 60 marked as large residuals]

- The high-residual points were predicted to be Y=0, but got honors anyway

scatter x p, ti(Pearson Residuals vs. Predicted Probabilities)
Diagnostic Plots

[Figure: "Deviance Residuals vs. Predicted Probability" - H-L dD (0 to 8) against Pr(honors); observations 83 and 60 again marked as large residuals]

- Same pattern as before; same two points show up as outliers

scatter z p, ti(Deviance Residuals vs. Predicted Probabilities)
Diagnostic Plots

[Figure: "Influence vs. Predicted Probabilities" - Pregibon's dbeta (0 to .4) against Pr(honors); observations 83, 62, and 60 stand out]

- Different points have large influence.
- Could eliminate these and see if results change.

scatter b p, ti(Influence vs. Predicted Probabilities)
Diagnostic Plots

[Figure: "Pearson Residuals vs. Predicted Probabilities" - the same scatter (H-L dX^2, 0 to 35), with each marker sized by its influence]

- One way to show both residuals and influence on one graph is to weight each residual marker by the value of its influence.

scatter x p [weight=b], msymbol(oh) ylab(0 (5) 35)
Multinomial Data
- We now move on to study logits when there are more than 2 possible outcomes
- There are two major categories of analysis: ordered and unordered outcomes
- Examples of unordered outcomes
  - Religion: Protestant, Catholic, or other
  - Mode of transportation: bus, car, subway, walking
- Examples of ordered outcomes
  - Regime type: Autocracy, Partial Dem., Full Dem.
  - Socioeconomic status: High, Medium, Low
Unordered Outcomes
- Pick a base category and calculate the odds of the other possible outcomes relative to it
  - For example, say a student can enter a general, vocational, or academic program
  - Use academic as the base category
- Then we will use multinomial logit to estimate two separate regressions:
  - Prob(general)/Prob(academic)
  - Prob(vocational)/Prob(academic)
- That is, the probability of choosing a general or vocational relative to an academic program
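In equation form, with academic as the base category, multinomial logit fits one log-odds equation per comparison, each with its own coefficient vector (a sketch in the slides' notation):

log[ Pr(general) / Pr(academic) ]  = Xβ_general
log[ Pr(vocation) / Pr(academic) ] = Xβ_vocation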
Unordered Outcomes
- Can interpret the results from a multinomial logit as relative risk ratios (RRR)
- Or they can be interpreted as conditional odds ratios
Multinomial Logit Example
. mlogit prog female math socst

Multinomial logistic regression Number of obs = 200


LR chi2(6) = 65.51
Prob > chi2 = 0.0000
Log likelihood = -171.34162 Pseudo R2 = 0.1605

------------------------------------------------------------------------------
prog | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
general |
female | -.0840263 .3806826 -0.22 0.825 -.8301505 .6620979
math | -.0739045 .0254512 -2.90 0.004 -.1237879 -.0240211
socst | -.0370939 .0217034 -1.71 0.087 -.0796319 .0054441
_cons | 5.130723 1.392646 3.68 0.000 2.401188 7.860258
-------------+----------------------------------------------------------------
vocation |
female | -.0177488 .4085162 -0.04 0.965 -.8184258 .7829282
math | -.1127775 .0289322 -3.90 0.000 -.1694836 -.0560714
socst | -.079675 .0227946 -3.50 0.000 -.1243516 -.0349984
_cons | 9.106635 1.545711 5.89 0.000 6.077098 12.13617
------------------------------------------------------------------------------
(Outcome prog==academic is the comparison group)
Multinomial Logit Example
. mlogit, rrr

Multinomial logistic regression Number of obs = 200


LR chi2(6) = 65.51
Prob > chi2 = 0.0000
Log likelihood = -171.34162 Pseudo R2 = 0.1605

------------------------------------------------------------------------------
prog | RRR Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
general |
female | .9194071 .3500023 -0.22 0.825 .4359837 1.938856
math | .9287604 .023638 -2.90 0.004 .8835673 .9762651
socst | .9635856 .0209131 -1.71 0.087 .9234562 1.005459
-------------+----------------------------------------------------------------
vocation |
female | .9824078 .4013295 -0.04 0.965 .4411255 2.18787
math | .8933494 .0258466 -3.90 0.000 .8441006 .9454716
socst | .9234164 .0210489 -3.50 0.000 .8830693 .9656069
------------------------------------------------------------------------------
(Outcome prog==academic is the comparison group)

Same results, but with RRR interpretation
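Each RRR is just the exponentiated mlogit coefficient: for example, exp(-.0739045) = .9288, the RRR on math in the general equation.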


Multinomial Logit Example
. listcoef

mlogit (N=200): Factor Change in the Odds of prog

Variable: female (sd=.4992205)

Odds comparing|
Group 1 vs Group 2| b z P>|z| e^b e^bStdX
------------------+---------------------------------------------
general -vocation | -0.06628 -0.155 0.877 0.9359 0.9675
general -academic | -0.08403 -0.221 0.825 0.9194 0.9589
vocation-general | 0.06628 0.155 0.877 1.0685 1.0336
vocation-academic | -0.01775 -0.043 0.965 0.9824 0.9912
academic-general | 0.08403 0.221 0.825 1.0877 1.0428
academic-vocation | 0.01775 0.043 0.965 1.0179 1.0089
----------------------------------------------------------------

(similar results for other two independent variables omitted)

- "listcoef" gives all the relevant comparisons
- Also gives p-values and exponentiated coefficients
Multinomial Logit Example
. prchange

mlogit: Changes in Predicted Probabilities for prog

female
Avg|Chg| general vocation academic
0->1 .0101265 -.01518974 .00147069 .01371908

math
Avg|Chg| general vocation academic
Min->Max .49023263 -.23754089 -.49780805 .73534894
-+1/2 .01500345 -.0083954 -.01410978 .02250516
-+sd/2 .13860906 -.07673311 -.13118048 .20791358
MargEfct .01500588 -.00839781 -.01411102 .02250882

(socst omitted)

general vocation academic


Pr(y|x) .25754365 .19741122 .54504514

female math socst


x= .545 52.645 52.405
sd(x)= .49922 9.36845 10.7358

- "prchange" gives the probability changes directly
Multinomial Logit Example
- Stata's "mlogplot" illustrates the impact of each independent variable on the probabilities of each value of the dependent variable

[Figure: mlogplot of the change in predicted probability for prog (-.13 to .21), marking outcomes G (general), V (vocation), and A (academic) for female (0/1), math (std), and socst (std)]

mlogplot female math socst, std(0ss) p(.1) dc ntics(9)
Multinomial Logit Example
- Same plot, with odds-ratio changes rather than discrete changes

[Figure: mlogplot on the factor-change scale relative to category academic (.35 to 1), with the corresponding logit-coefficient scale (-1.06 to 0) below, marking G, V, and A for each variable]

mlogplot female math socst, std(0ss) p(.1) or ntics(9)
Multinomial Logit Example
- Use "prgen" to show how probabilities change with respect to one variable
. mlogit prog math science, nolog

(output omitted)

------------------------------------------------------------------------------
prog | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
general |
math | -.1352046 .0305449 -4.43 0.000 -.1950716 -.0753376
science | .0602744 .0254395 2.37 0.018 .0104139 .1101348
_cons | 3.166452 1.298818 2.44 0.015 .6208165 5.712088
-------------+----------------------------------------------------------------
vocation |
math | -.1690188 .0331945 -5.09 0.000 -.2340789 -.1039588
science | .0170098 .0250403 0.68 0.497 -.0320684 .0660879
_cons | 7.053851 1.37717 5.12 0.000 4.354647 9.753055
------------------------------------------------------------------------------
(Outcome prog==academic is the comparison group)

. prgen math, gen(m) x(science=50) from(25) to(75) n(100)


Multinomial Logit Example
- Use prgen to show how probabilities change with respect to one variable

[Figure: predicted probabilities (0 to 1) of pr(general), pr(academic), and pr(vocation) as the value of math changes from 20 to 80]
