MOHAMMED BORHANDDEN MUSAH
FACULTY OF EDUCATION
UNIVERSITI TEKNOLOGI MALAYSIA
Introduction to Principal Component
Analysis (PCA)
OUTLINE
Purpose
Definition
Procedure
 Assumptions
 Requirement
 Graphical Application
Purpose
1. Data reduction.
 redundant variables
 unclear variables
 irrelevant variables
2. Structural detection.
Definition
PCA is a linear combination of weighted
observed variables.
Note
 PCA, PA, EFA and CFA
 Measured variable, observed variable,
indicator variable, item and manifest variable.
Cont.
 Latent construct/variable, factor,
component, underlying construct and
unobserved variable.
 Case, observation, respondent, participant
and subject.
Procedure
1. Assumptions (data screening)
Normality; the assumption is that the
combination of variables follows a multivariate
normal distribution.
Both graphical and statistical methods are used to
evaluate normality.
Cont.
Homoscedasticity, refers to the assumption that
the dependent variable exhibits similar amounts of
variance across the range of values for an
independent variable.
Both graphical and statistical methods are used to
evaluate homoscedasticity.
Cont.
Linearity, assumes that the relationship between
variables is linear, or they perform better if the
relationships are linear.
Both graphical and statistical methods are used to
evaluate linearity.
Cont.
2. Requirements
 The variables included must be metric level or
dichotomous (dummy-coded) nominal level.
 The sample size must be greater than 50
(preferably 100).
 The ratio of cases to variables must be 5 to 1
or larger.
 The correlation matrix for the variables must
contain 2 or more correlations of 0.30 or
greater.
Cont.
 Variables with MSA less than 0.50 must be
removed.
 The overall MSA should be 0.50 or higher.
 The Bartlett test of sphericity should
statistically significant.
 Components weighted with only one
variable, should be discarded.
Graphical Application
Introduction to principal component analysis (pca)

Introduction to principal component analysis (pca)

  • 1.
    MOHAMMED BORHANDDEN MUSAH FACULTYOF EDUCATION UNIVERSITI TEKNOLOGI MALAYSIA Introduction to Principal Component Analysis (PCA)
  • 2.
  • 3.
    Purpose 1. Data reduction. redundant variables  unclear variables  irrelevant variables 2. Structural detection.
  • 4.
    Definition PCA is alinear combination of weighted observed variables.
  • 5.
    Note  PCA, PA,EFA and CFA  Measured variable, observed variable, indicator variable, item and manifest variable.
  • 6.
    Cont.  Latent construct/variable,factor, component, underlying construct and unobserved variable.  Case, observation, respondent, participant and subject.
  • 7.
    Procedure 1. Assumptions (datascreening) Normality; the assumption is that the combination of variables follows a multivariate normal distribution. Both graphical and statistical methods are used to evaluate normality.
  • 8.
    Cont. Homoscedasticity, refers tothe assumption that the dependent variable exhibits similar amounts of variance across the range of values for an independent variable. Both graphical and statistical methods are used to evaluate homoscedasticity.
  • 9.
    Cont. Linearity, assumes thatthe relationship between variables is linear, or they perform better if the relationships are linear. Both graphical and statistical methods are used to evaluate linearity.
  • 10.
    Cont. 2. Requirements  Thevariables included must be metric level or dichotomous (dummy-coded) nominal level.  The sample size must be greater than 50 (preferably 100).  The ratio of cases to variables must be 5 to 1 or larger.  The correlation matrix for the variables must contain 2 or more correlations of 0.30 or greater.
  • 11.
    Cont.  Variables withMSA less than 0.50 must be removed.  The overall MSA should be 0.50 or higher.  The Bartlett test of sphericity should statistically significant.  Components weighted with only one variable, should be discarded.
  • 12.