Factor Analysis - SPSS
Take a look at the initial communalities (for each variable, this is the R² for
predicting that variable from an optimally weighted linear combination of the remaining
variables). Recall that they were all 1s for the principal components analysis we did
earlier, but now each is less than 1. If we sum these communalities we get 5.675. We
started with 7 units of standardized variance and we have now reduced that to 5.675
units of standardized variance (by eliminating unique variance).
Communalities

            Initial   Extraction
COST         .738       .745
SIZE         .912       .914
ALCOHOL      .866       .866
REPUTAT      .499       .385
COLOR        .922       .892
AROMA        .857       .896
TASTE        .881       .902

Extraction Method: Principal Axis Factoring.
For an iterated principal axis solution SPSS first estimates communalities with
squared multiple correlations (R²s) and then conducts the analysis. It then takes the
communalities from that first analysis, inserts them into the main diagonal of the
correlation matrix in place of the R²s, and does the analysis again. The variables' SSLs
from this second solution are then inserted into the main diagonal, replacing the
communalities from the previous iteration, and so on, until the change from one
iteration to the next is trivial.
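If you are curious what that iteration looks like computationally, here is a minimal
sketch in Python/NumPy (this is my own illustration, not SPSS's code; the function name
and defaults are mine), assuming the 7 x 7 correlation matrix is available as a NumPy
array R:

```python
import numpy as np

def iterated_paf(R, n_factors=2, tol=1e-6, max_iter=100):
    """Iterated principal axis factoring of a correlation matrix R (rough sketch)."""
    R = np.asarray(R, dtype=float)
    # Initial communality estimates: each variable's squared multiple correlation
    # with the remaining variables, 1 - 1/diag(R^-1).
    h2 = 1.0 - 1.0 / np.diag(np.linalg.inv(R))
    for _ in range(max_iter):
        R_reduced = R.copy()
        np.fill_diagonal(R_reduced, h2)          # put communalities on the diagonal
        eigval, eigvec = np.linalg.eigh(R_reduced)
        idx = np.argsort(eigval)[::-1]           # largest eigenvalues first
        eigval, eigvec = eigval[idx], eigvec[:, idx]
        loadings = eigvec[:, :n_factors] * np.sqrt(np.clip(eigval[:n_factors], 0, None))
        new_h2 = (loadings ** 2).sum(axis=1)     # each variable's SSL = its communality
        if np.max(np.abs(new_h2 - h2)) < tol:    # stop when the change is trivial
            h2 = new_h2
            break
        h2 = new_h2
    return loadings, h2
```

The loadings returned here are unrotated; a varimax or promax rotation would be applied
to them afterwards.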
Look at the communalities after this iterative process and for a two-factor
solution. They now sum to 5.60. That is, 5.6/7 = 80% of the variance is common
variance and 20% is unique. Here you can see how we have packaged that common
variance into two factors after a varimax rotation:
Rotated Factor Matrix

             Factor
               1       2
TASTE        .950    -.022
AROMA        .946     .021
COLOR        .942     .068
SIZE         .073     .953
ALCOHOL      .030     .930
COST        -.046     .862
REPUTAT     -.431    -.447

Extraction Method: Principal Axis Factoring.
Rotation Method: Varimax with Kaiser Normalization.
Rotation converged in 3 iterations.
These loadings are very similar to those we obtained previously with a principal
components analysis.
Below are the pattern matrix and the structure matrix for a promax (oblique) rotation of
the same two-factor solution, followed by the factor correlation matrix.

Pattern Matrix

             Factor
               1       2
TASTE        .955    -.071
AROMA        .949    -.028
COLOR        .943     .019
SIZE         .022     .953
ALCOHOL     -.021     .932
COST        -.093     .868
REPUTAT     -.408    -.426

Extraction Method: Principal Axis Factoring.
Rotation Method: Promax with Kaiser Normalization.
Rotation converged in 3 iterations.

Structure Matrix

             Factor
               1       2
TASTE        .947     .030
AROMA        .946     .072
COLOR        .945     .118
SIZE         .123     .956
ALCOHOL      .078     .930
COST        -.002     .858
REPUTAT     -.453    -.469

Extraction Method: Principal Axis Factoring.
Rotation Method: Promax with Kaiser Normalization.
Factor Correlation Matrix

Factor       1        2
1          1.000     .106
2           .106    1.000

Extraction Method: Principal Axis Factoring.
Rotation Method: Promax with Kaiser Normalization.
Notice that this solution is not much different from the previously obtained
varimax solution, so little was gained by allowing the factors to be correlated.
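As a check on how the two oblique matrices are related: the structure matrix is the
pattern matrix postmultiplied by the factor correlation matrix, S = PΦ. For TASTE, for
example, .955 + (.106)(-.071) ≈ .947 and (.106)(.955) + (-.071) ≈ .030, which reproduces
the TASTE row of the structure matrix above.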
SPSS will also compute factor scores for you, which you can then use in other
procedures. In the Factor Analysis window, click Scores and select Save As Variables,
Regression, and Display Factor Score Coefficient Matrix.
Factor Score Coefficient Matrix

             Factor
               1       2
COST         .026     .157
SIZE        -.066     .610
ALCOHOL      .036     .251
REPUTAT      .011    -.042
COLOR        .225    -.201
AROMA        .398     .026
TASTE        .409     .110

Extraction Method: Principal Axis Factoring.
Rotation Method: Varimax with Kaiser Normalization.
Factor Scores Method: Regression.
Look back at your data sheet. You will find that two columns have been added to
the right, one for scores on Factor 1 and another for scores on Factor 2.
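For those who want to see what the Regression scoring method is doing, here is a
minimal sketch (the function name is mine), assuming Z holds the standardized scores on
the observed variables and that you have the correlation matrix and the rotated loadings:

```python
import numpy as np

def regression_factor_scores(Z, R, loadings):
    """Regression-method factor score estimates (a sketch, not SPSS's code).

    Z        -- n x p array of standardized scores on the observed variables
    R        -- p x p correlation matrix of the observed variables
    loadings -- p x m matrix of correlations between variables and factors
                (the rotated loading matrix for an orthogonal rotation)
    """
    W = np.linalg.solve(R, loadings)   # scoring coefficients, W = R^-1 * loadings
    return Z @ W                       # n x m array of estimated factor scores
```

The matrix W computed this way should match, to within rounding, the Factor Score
Coefficient Matrix displayed above.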
SPSS also gives you a Factor Score Covariance Matrix. On the main diagonal of
this matrix are, for each factor, the R² between the factor and the observed variables.
This is treated as an indicator of the internal consistency of the solution. Values below
.70 are considered undesirable.
Factor Score Covariance Matrix

Factor       1       2
1           .966    .003
2           .003    .953

Extraction Method: Principal Axis Factoring.
Rotation Method: Varimax with Kaiser Normalization.
Factor Scores Method: Regression.
These squared multiple correlation coefficients are equal to the variance of the
factor scores:
Descriptive Statistics

            N        Mean      Variance
FAC1_1     220     .0000000      .966
FAC2_1     220     .0000000      .953
The input data included two variables (SES and Group) not included in the factor
analysis. Just for fun, try conducting a multiple regression predicting subjects' SES
from their factor scores, and also try using Student's t to compare the two groups' means
on the factor scores. Do note that the scores for factor 1 are not correlated with those
for factor 2. Accordingly, in the multiple regression the squared semipartial correlation
coefficients are identical to the squared zero-order correlation coefficients, and
R² = rY1² + rY2².
ANOVA

Model                Sum of Squares     df    Mean Square        F        Sig.
1   Regression           1320.821        2      660.410      4453.479     .000
    Residual               32.179      217         .148
    Total                1353.000      219

Predictors: (Constant), FAC2_1, FAC1_1
Dependent Variable: SES
Coefficients

                   Standardized
                   Coefficients                         Correlations
Model                  Beta          t        Sig.    Zero-order    Part
1   (Constant)                    134.810     .000
    FAC1_1             .681        65.027     .000        .679      .681
    FAC2_1            -.718       -68.581     .000       -.716     -.718

Dependent Variable: SES
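As a check on the identity noted above: R² ≈ .679² + (-.716)² ≈ .974, essentially the
1320.821/1353 ≈ .976 implied by the ANOVA table. The small difference is rounding plus
the very small (but nonzero) correlation between the two sets of factor scores.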
8
Group Statistics

          GROUP     N       Mean        Std. Deviation    Std. Error Mean
FAC1_1    1        121    -.4198775       .97383364          .08853033
          2         99     .5131836       .71714232          .07207552
FAC2_1    1        121     .5620465       .88340921          .08030993
          2         99    -.6869457       .55529938          .05580969
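If you want to verify the t test outside of SPSS, the summary statistics above are all
you need. Here is a sketch using scipy's summary-statistics t test (pooled variances
assumed), with the Factor 1 values from the table:

```python
from scipy.stats import ttest_ind_from_stats

# Pooled-variance t test comparing the two groups on the Factor 1 scores,
# using the means, SDs, and ns from the Group Statistics table above.
t, p = ttest_ind_from_stats(mean1=-0.4198775, std1=0.97383364, nobs1=121,
                            mean2=0.5131836, std2=0.71714232, nobs2=99,
                            equal_var=True)
print(t, p)
```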
Unit weights based simply on the loadings perform poorly under conditions of non-simple
structure and variable loadings, which are typical of the conditions most often found in
actual practice. Grice and Harris developed an alternative unit-weighting scheme which
produced factor scores that compared favorably with exact factor scores -- they based
the weightings on the factor score coefficients rather than on the loadings.
Grice's article extended the discussion to the case of oblique factor analysis,
where one could entertain several different sorts of unit-weighting schemes -- for
example, basing them on the pattern matrix (loadings, standardized regression
coefficients for predicting), the structure matrix (correlations of variables with factors), or
the factor score coefficients. Grice defined a variable as salient on a factor if it had a
weighting coefficient whose absolute value was at least 1/3 as large as that of the
variable with the largest absolute weighting coefficient on that factor. Salient items'
weights were replaced with +1 or -1, and nonsalient variables' weights with 0. The
results of his Monte Carlo study indicated that factor scores using this unit-weighting
scheme based on scoring coefficients performed better than those using various other
unit-weighting schemes and at least as well as exact factor scores (by most criteria
and under most conditions). He did note, however, that exact factor scores may be
preferred under certain circumstances -- for example, when using factor scores on the
same sample as that from which they were derived, especially when sample size is
relatively small. If we followed Grice's advice we would drop Reputation from both
subscales and Cost from the second subscale.
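Grice's salience rule is easy to apply yourself. Here is a minimal sketch (the function
name and NumPy implementation are mine) that takes a matrix of factor score coefficients
and returns the unit weights:

```python
import numpy as np

def unit_weights(score_coefs, threshold=1/3):
    """Grice-style unit weights built from a matrix of factor score coefficients.

    A variable is salient on a factor if the absolute value of its scoring
    coefficient is at least `threshold` times the largest absolute coefficient
    on that factor.  Salient variables are weighted +1 or -1 (the sign of the
    coefficient); nonsalient variables are weighted 0.
    """
    C = np.asarray(score_coefs, dtype=float)
    cutoffs = threshold * np.abs(C).max(axis=0)   # one cutoff per factor (column)
    salient = np.abs(C) >= cutoffs
    return np.where(salient, np.sign(C), 0.0)
```

Applied to the factor score coefficient matrix shown earlier, only color, aroma, and
taste pass the cut on Factor 1, and only size and alcohol pass it on Factor 2 (cost's
.157 and color's -.201 fall just short of the .203 cutoff there), which is why
Reputation drops from both subscales and Cost from the second.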
Cronbach's Alpha
If you have developed subscales such as the Aesthetic Quality and Cheap Drunk
subscales above, you should report an estimate of the reliability of each subscale. Test-retest
reliability can be employed if you administer the scale to the same persons twice, but usually
you will only want to administer it once to each person. Cronbach's alpha is an easy and
generally acceptable estimate of reliability.
Suppose that we are going to compute AQ (Aesthetic Quality) as color + taste + aroma
- reputat and CD as cost + size + alcohol - reputat. How reliable would such subscales be?
We conduct an item analysis to evaluate the reliability (and internal consistency) of each
subscale.
Before conducting the item analysis, we shall need to multiply the Reputation variable
by minus 1, since it is negatively weighted in the AQ and CD subscale scores: Transform,
Compute NegRep = -1*reputat.
Analyze, Scale, Reliability Analysis. Scoot color, aroma, taste, and NegRep into the
items box.
Reliability Statistics

Cronbach's Alpha    N of Items
      .886               4
Item-Total Statistics
Notice that NegRep is not as well correlated with the corrected total scores as are the
other items and that dropping it from the scale would increase the value of alpha considerably.
That might be enough to justify dropping the reputation variable from this subscale.
If you conduct an item analysis on the CD items you will find that alpha = .878 and that
it increases to .941 if Reputation is dropped from the scale.
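If you want to see what SPSS is computing here, this is a minimal sketch of Cronbach's
alpha (the function name is mine), assuming the item scores for color, aroma, taste, and
NegRep are the columns of a cases-by-items NumPy array:

```python
import numpy as np

def cronbach_alpha(items):
    """Cronbach's alpha for an n-cases x k-items array of item scores."""
    items = np.asarray(items, dtype=float)
    k = items.shape[1]
    item_variances = items.var(axis=0, ddof=1)      # variance of each item
    total_variance = items.sum(axis=1).var(ddof=1)  # variance of the summed scale
    return (k / (k - 1)) * (1 - item_variances.sum() / total_variance)

# Given the raw AQ item scores, cronbach_alpha(aq_items) should reproduce the
# .886 reported above (aq_items is a hypothetical array name).
```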
Each loading is classified as Positively Salient (Cattell used a criterion of > .10;
I'll use a higher cut, > .30), Negatively Salient (< -.30), or neither (HyperPlane). One
then constructs a third-order square [PS, HP, NS] matrix comparing Group 1 with Group
2. I'll abbreviate the contents of this table using these cell indices:
                   Group 1
               PS     HP     NS
Group 2   PS   11     12     13
          HP   21     22     23
          NS   31     32     33
The loading of X1 on F1 is PS for both groups, so it is counted in cell 11. Ditto
for X2. The loading of X3 on F1 is HP in both groups, so it is counted in cell 22. Ditto
for X4. The loading of X5 on F1 is NS in Group 1 but PS in Group 2, so it is counted in
cell 13.
Thus, the table for comparing Factor 1 in Group 1 with Factor 1 in Group 2 with
frequency counts inserted in the cells looks like this:
                   Group 1
               PS     HP     NS
Group 2   PS    2      0      1
          HP    0      2      0
          NS    0      0      0
The 1 in the upper right corner reflects the difference in the two patterns with
respect to X5. Counts in the main diagonal, especially in the upper left and the lower
right, indicate similarity of structure; counts off the main diagonal, especially in the upper
right or lower left, indicate dissimilarity.
Cattell's s is computed from these counts this way (the numbers here are cell indices):

s = (11 + 33 - 13 - 31) / [11 + 33 + 13 + 31 + .5(12 + 21 + 23 + 32)]

For our data,

s = (2 + 0 - 1 - 0) / [2 + 0 + 1 + 0 + .5(0 + 0 + 0 + 0)] = 1/3 = .33
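Here is the same bookkeeping as a short sketch (function name mine), using the ±.30
salience cut and the hypothetical Factor 1 loadings for the two groups that appear in the
Pearson r example below:

```python
import numpy as np

def cattell_s(load1, load2, cut=0.30):
    """Cattell's salient similarity index s for two vectors of loadings.

    Loadings > cut are Positively Salient (+1), loadings < -cut are Negatively
    Salient (-1), and everything in between is in the hyperplane (0).
    """
    c1 = np.where(np.asarray(load1) > cut, 1, np.where(np.asarray(load1) < -cut, -1, 0))
    c2 = np.where(np.asarray(load2) > cut, 1, np.where(np.asarray(load2) < -cut, -1, 0))
    count = lambda a, b: int(np.sum((c1 == a) & (c2 == b)))
    same = count(1, 1) + count(-1, -1)                 # cells 11 and 33
    opposite = count(1, -1) + count(-1, 1)             # cells 13 and 31
    hybrid = count(1, 0) + count(0, 1) + count(-1, 0) + count(0, -1)  # mixed cells
    return (same - opposite) / (same + opposite + 0.5 * hybrid)

group1 = [.90, .63, .15, -.09, -.74]   # hypothetical F1 loadings, Group 1
group2 = [.45, .65, .27, -.15, .95]    # hypothetical F1 loadings, Group 2
print(cattell_s(group1, group2))       # 0.33, as computed above
```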
Cattell et al. (Educ. & Psych. Measurement, 1969, 29, 781-792) provide tables to
convert s to an approximate significance level, P, for testing the null hypothesis that the
two factors being compared (one from population 1, one from population 2) are not
related to one another. [I have these tables, in Tabachnick & Fidell, 1989, pages 717 &
718, if you need them.] These tables require you to compute the percentage of
hyperplane counts (60, 70, 80, or 90) and to have at least 10 variables (the table has
rows for 10, 20, 30, 40, 50, 60, 80, & 100 variables). We have only 5 variables, and a
hyperplane percentage of only 40%, so we can't use the table. If we had 10 variables
and a hyperplane percentage of 60%, P = .138 for s = .26 and P = .02 for s = .51.
Under those conditions our s of .33 would have a P of about .10, not low enough to
reject the null hypothesis (if alpha = .05) and conclude that the two factors are related
(similar). In other words, we would be left with the null hypothesis that Factor 1 is not
the same in population 1 as population 2.
It is not always easy to decide which pairs of factors to compare. One does not
always compare Factor 1 in Group 1 with Factor 1 in Group 2, and 2 in 1 with 2 in 2, etc.
Factor 1 in Group 1 may look more like Factor 2 in Group 2 than it does like Factor 1 in
Group 2, so one would compare 1 in 1 with 2 in 2. Remember that factors are ordered
from highest to lowest SSL, and sampling error alone may cause inversions in the
orders of factors with similar SSLs. For our hypothetical data, comparing 1 in 1 with 1
in 2 makes sense, since F1 has high loadings on X1 and X2 in both groups. But what
factor in Group 2 would we choose to compare with F2 in Group 1? The structures are so
different that a simple eyeball test tells us that there is no factor in Group 2 similar to F2
in Group 1.
One may also use a simple Pearson r to compare two factors. Just correlate
the loadings on the factor in Group 1 with the loadings on the possibly similar factor in
Group 2. For 1 in 1 compared with 1 in 2, the Group 1 (Group 2) loadings are .90 (.45),
.63 (.65), .15 (.27), -.09 (-.15), and -.74 (.95). The r is -0.19, indicating little similarity
between the two factors.
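You can verify that r with a couple of lines (NumPy assumed):

```python
import numpy as np

g1 = [.90, .63, .15, -.09, -.74]   # Factor 1 loadings in Group 1
g2 = [.45, .65, .27, -.15, .95]    # Factor 1 loadings in Group 2
print(round(np.corrcoef(g1, g2)[0, 1], 2))   # -0.19
```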
The Pearson r can detect not only differences in two factors' patterns of loadings,
but also differences in the relative magnitudes of those loadings. One should beware
that with factors having a large number of small loadings, those small loadings could
cause the r to be large (if they are similar between factors) even if the factors had
dissimilar loadings on the more important variables.
Cross-Correlated Factor Scores. Compute factor scoring coefficients for Group
1 and, separately, for Group 2. Then for each case compute the factor score using the
scoring coefficients from the group in which it is located and also compute it using the
scoring coefficients from the other group. Correlate these two sets of factor scores
(Same Group and Other Group). A high correlation between these two sets of factor
scores should indicate similarity of the two factors between groups. Of course, this
method and the other two could be used with random halves of one sample to assess
the stability of the solution or with different random samples from the same population at
different times to get something like a test-retest measure of stability across samples
and times.
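A sketch of that bookkeeping (names mine), assuming you have one group's standardized
data and both groups' scoring coefficient matrices:

```python
import numpy as np

def cross_correlated_scores(Z, W_own, W_other):
    """Correlate same-group and other-group factor scores, factor by factor.

    Z        -- n x p standardized data for the cases in one group
    W_own    -- p x m scoring coefficients estimated in that group
    W_other  -- p x m scoring coefficients estimated in the other group
    """
    same = Z @ W_own
    other = Z @ W_other
    return [np.corrcoef(same[:, j], other[:, j])[0, 1] for j in range(same.shape[1])]
```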
RMS, root mean square. For each variable square the difference between the
loading in the one group and that in the other group. Find the mean of these differences
and then the square root of that mean. If there is a perfect match between the two
groups' loadings, RMS = 0. The maximum value of RMS, 2, would result when all of
the loadings are one or minus one, with those in the one group opposite in sign to those
in the other group.
CC, coefficient of congruence. Multiply each loading in the one group by the
corresponding loading in the other group. Sum these products and then divide by the
square root of (the sum of squared loadings for the one group times the sum of squared
loadings for the other group).
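Both indices are simple to compute from two vectors of loadings; here is a sketch,
reusing the hypothetical Factor 1 loadings from above:

```python
import numpy as np

def rms_difference(load1, load2):
    """Root mean square difference between two vectors of loadings."""
    d = np.asarray(load1, dtype=float) - np.asarray(load2, dtype=float)
    return np.sqrt(np.mean(d ** 2))

def congruence(load1, load2):
    """Coefficient of congruence between two vectors of loadings."""
    a, b = np.asarray(load1, dtype=float), np.asarray(load2, dtype=float)
    return np.sum(a * b) / np.sqrt(np.sum(a ** 2) * np.sum(b ** 2))

g1 = [.90, .63, .15, -.09, -.74]
g2 = [.45, .65, .27, -.15, .95]
print(rms_difference(g1, g2), congruence(g1, g2))   # about .78 and .10
```

For these loadings the RMS is large (about .78 of a possible 2) and the congruence is
near zero, both pointing to dissimilar factors, consistent with the Pearson r above.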
See Factorial Invariance of the Occupational Work Ethic Inventory -- an example
of the use of multiple techniques to compare factor structures.
High communalities and a high degree of overdetermination [each factor having at least
three or four high loadings and simple structure (few, nonoverlapping factors)] each
increase your chances of faithfully reproducing the population factor pattern.
Strengths in one area can compensate for weaknesses in another area.
When communalities are high (> .6), you should be in good shape even with N
well below 100.
With communalities moderate (about .5) and the factors well-determined, you
should have 100 to 200 subjects.
With communalities low (< .5) but high overdetermination of factors (not many
factors, each with 6 or 7 high loadings), you probably need well over 100
subjects.
With low communalities and only 3 or 4 high loadings on each, you probably
need over 300 subjects.
With low communalities and poorly determined factors, you will need well over
500 subjects.
Of course, when planning your research you do not know for sure how good the
communalities will be nor how well determined your factors will be, so maybe the best
simple advice, for an a priori rule of thumb, is "the more subjects, the better."
MacCallum's advice to researchers is to try to keep the number of variables and factors
small and to select variables (write items) so as to assure moderate to high communalities.
Closing Comments
Please note that this has been an introductory lesson that has not addressed
many of the less common techniques available. For example, I have not discussed
Alpha Extraction, which extracts factors with the goal of maximizing alpha (reliability)
coefficients of the Extracted Factors, or Maximum-Likelihood Extraction, or several
other extraction methods.
I should remind you of the necessity of investigating (maybe even deleting)
outlying observations. Subjects' factor scores may be inspected to find observations
that are outliers with respect to the solution [very large absolute value of a factor score].
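For example, if the saved factor scores are in a pandas data frame (SPSS names the saved
columns FAC1_1 and FAC2_1, as above), a quick way to flag suspicious cases is something
like this sketch; the ±3 cutoff is just one common choice, not a rule from this lesson:

```python
import pandas as pd

def flag_factor_score_outliers(df, cols=("FAC1_1", "FAC2_1"), cutoff=3.0):
    """Return the cases whose saved factor scores are unusually far from zero."""
    extreme = (df[list(cols)].abs() > cutoff).any(axis=1)
    return df[extreme]
```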