0% found this document useful (0 votes)

31 views

Chapter 3 Normalized Principal Components Analysis

This document discusses normalized principal components analysis (PCA). It describes how PCA can be used to: 1) Reduce the dimensionality of data by finding the best visualization planes using orthogonal projections. 2) Group together homogeneous individuals and identify outliers. 3) Analyze relationships between variables. The document outlines the key steps of PCA, including data standardization, identifying principal axes and components, analyzing variable and individual scatterplots, assessing inertia to determine the optimal number of principal axes, and interpreting the results.

Uploaded by

sarra

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views

Chapter 3 Normalized Principal Components Analysis

Uploaded by

sarra

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Chapter 3: Normalized Principal Components Analysis

1. Initial data
Columns are quantitative variables (turnover, rate, weight ...)
Rows represent statistical individuals (basic units such as human beings, countries, years…)

2. Example of data
3. Objectives from a technical point of view.
 Reduce the number of dimension of the data by looking for the best planes visualization and this by applying
orthogonal projections of the data.
 Group together homogeneous individuals and identify exceptional individuals.
 Analyze the relationships between the variables.

Case Studies
Study the perception of a brand by the consumer.
Study the evolution of the financial situation of a company over time.
Compare several brands on the market.

Data standardization: centering and reducing the data

4. Individuals scatter plot analysis

Identification of the principal axes.
Principal axes:
 Principal axes ∆ 1 , ∆ 2 , . . ., k are identified by looking for eigenvalue of the eigenvectors of the correlation matrix
R=t ZDZ . ¿ is the weight matrix).
 In a next step, we sort the eigenvalue in a decreasing order1 ≫❑2> …>❑p . We denote by U the ( pXp) matrix of
eigenvectors u j organized in columns.

Principal components:
 The coordinates, over the new axes formed by the eigenvectors, of a
given individuals are given by the scalar products:
 Any couple of columns of the matrix U form a factors map.
Factors map and projection quality
Absolute contribution: The absolute contribution of a given point i to the projection inertia over the axis α is :
α 2
pi ( C i )
ACTR ( i ,α )=
❑α

Relative contribution or cosine square:

The relative contribution of a given point i over the axis is:
2
( Cαi )
RCTR ( i , α )=cos ( z i , z^ )= 2
2 α
i
‖z i‖
α
Where ^z i is the orthogonal projection of Zi over the axisα .

Remarks
C 1 Is the variable that gives the best description of the data dispersion.
The best plane data visualization is given by the factors map formed by the two axes C 1∧C 2 ¿ .
The variables C α ( α =1 , … , p ) are orthogonal (not correlated)
The variables C α ( α =1 , … , p )are a linear combination of Z j variables, so C α is also standardized.
For all α ≤ p :Var ( Cα ) =❑α

5. Variable scatterplot analysis

()
j
Z1
j
The coordinates of the j th variable point: Z = Z 2
j
…
j
Z n

The eigenvectors (v 1 , v 2 , ...) defining the principal axes of the second scatterplot are given by the transition formula:
1
vα= Zu
√ α α
❑
The new factor coordinates of each variable point j over the axis are given by
α j
S j = √ ❑α uα

Projection quality of the variables points

Relative contribution (or cosine square) of the projection of the variable point Z jaccording to the axis α is given by the
cosine square of the angle formed the two vectors Z j and its projection ^z j , α F α :
α 2
CTR ( j, α )=cos ² (Z , ^z )=( S j )
j j,α

The communality according to the first factor map is:

1 2 2 2
Com( j ,(1 ,2))=( S j ) + ( S j )

The projections ^z j of the variables Z j the principal maps lie in a circle of center O and radius 1: this circle is named
correlation circle.
6. Inertia and the choice of the number of principal axes
Inertia and the choice the number of axes to retain
The total inertia of the initial scatterplot of individual is equal to:
p
I =∑ ❑i =p
i=1

The overall quality of the representation of the scatterplot on the main sub-space formed by ( u1 , u2 ,… , uq ) . The
proportion of the inertia absorbed by this subspace measures it. It is worth:
..+..+¿ q
❑1+ ¿
p
So, the rate of inertia absorbed by the first factors map (or principal factor map) is:
❑1+❑2
p
Number of axes to retain: inertia criterion
The criterion usually employed to measure PCA quality is the percentage of total inertia explained by the first chosen k
axes. It is defined by:
..+ ..+ ¿k ..+ ..+ ¿k
Rat e k =❑1 + p
=❑1 + ¿¿
p
∑ ❑i
i=1

This rate defines the explanatory power of the k first axis (or factors): it represents the part of total variance taken into
account by this k axis. However, its appreciation must take into account the number of variables and the number of
individuals. For example, an inertia rate relative to an axis of 10 % can be an important value if the we have 100
variables and low if it has only 7 variables.

Number of axes to retain: criterion

It consists in keeping, in a normalized PCA, only those axes whose eigenvalue is greater than 1 (i.e. average inertia).

7. Interpretation of variables factor maps

1 Variables to keep: We keep only the variables well represented on the factor map (i.e. variables close to the
correlation circle).

2 Variable-axis: variables strongly correlated with a factor will contribute to the definition of this axis.

3 Variable-variable: the proximity of projections of 2 variables indicates a strong positive correlation between them.
item 2 Diametrically opposite projections indicate a negative correlation between them.

4 Nearly orthogonal directions indicate a weak linear correlation.

Rotation
To ease the interpretation task, it may be convenient, once the number of factors determined, to rotate the axes. The
rotation (the varimax method, …) allows to get closer to a simple structure:
 One component is strongly correlated with some variables and little correlated with the others.
 A variable is correlated with a single component. In this case, the information restored by the factorial plane remains
the same but that restored by the axes changes.

8. Interpretation of the individual factors map

Cheat Sheet - Exam 3
No ratings yet
Cheat Sheet - Exam 3
20 pages
Linear Programming
No ratings yet
Linear Programming
24 pages
Unit4 TJ
No ratings yet
Unit4 TJ
39 pages
Lecture 1: Revision of Vector Analysis: 1.1.1 Vectors and Scalars
No ratings yet
Lecture 1: Revision of Vector Analysis: 1.1.1 Vectors and Scalars
7 pages
UNIT-2 - Differential Calculus - Cluster C
No ratings yet
UNIT-2 - Differential Calculus - Cluster C
39 pages
Principal Components Analysis (PCA) : 2.1 Outline of Technique
No ratings yet
Principal Components Analysis (PCA) : 2.1 Outline of Technique
21 pages
343-1280-2-PB
No ratings yet
343-1280-2-PB
8 pages
h03 Essential Math
No ratings yet
h03 Essential Math
13 pages
An1 Derivat - Ro BE 2 71 Mathematical Aspects
No ratings yet
An1 Derivat - Ro BE 2 71 Mathematical Aspects
8 pages
Chapter1 Mathtype Copyright
No ratings yet
Chapter1 Mathtype Copyright
67 pages
UNIT-2 Differential Calculus Cluster C
No ratings yet
UNIT-2 Differential Calculus Cluster C
38 pages
Interpall
No ratings yet
Interpall
4 pages
Pot 078
No ratings yet
Pot 078
7 pages
Uantum Mechanics A L A Irac T S - G Q
No ratings yet
Uantum Mechanics A L A Irac T S - G Q
10 pages
Shape Funct
100% (1)
Shape Funct
46 pages
MA111 Lec8 D3D4
No ratings yet
MA111 Lec8 D3D4
33 pages
Coupled-chaotic-Colpitts-oscillators-Identical-and-mismatched-cases_2006_Nonlinear-Dynamics
No ratings yet
Coupled-chaotic-Colpitts-oscillators-Identical-and-mismatched-cases_2006_Nonlinear-Dynamics
8 pages
Lecture-9 D.K Analysis-1
No ratings yet
Lecture-9 D.K Analysis-1
36 pages
Electromagnetics Unit 1 EFT
No ratings yet
Electromagnetics Unit 1 EFT
141 pages
Applied Physics R20 - Unit-1
No ratings yet
Applied Physics R20 - Unit-1
28 pages
Physical Chemistry Exam 1 Key
No ratings yet
Physical Chemistry Exam 1 Key
8 pages
A Brief Survey of Differential Geometry: Adrian Down August 29, 2006
No ratings yet
A Brief Survey of Differential Geometry: Adrian Down August 29, 2006
5 pages
Mathcad Functions
No ratings yet
Mathcad Functions
33 pages
applied-physics-r20-unit-1
No ratings yet
applied-physics-r20-unit-1
29 pages
Angular_Momentum
No ratings yet
Angular_Momentum
12 pages
10 11648 J Mcs 20160102 11
No ratings yet
10 11648 J Mcs 20160102 11
8 pages
linear algebra
No ratings yet
linear algebra
11 pages
Analytical Perturbative Approach To Periodic Orbits in The Homogeneous Quartic Oscillator Potential
No ratings yet
Analytical Perturbative Approach To Periodic Orbits in The Homogeneous Quartic Oscillator Potential
16 pages
VCV Week3
No ratings yet
VCV Week3
4 pages
The Fuzzy Approach to Assessment of ANOVA Results
No ratings yet
The Fuzzy Approach to Assessment of ANOVA Results
9 pages
Degrees of Freedom PDF
No ratings yet
Degrees of Freedom PDF
7 pages
Lecture7 D2 PDF
No ratings yet
Lecture7 D2 PDF
25 pages
Gate Aerospace 2011 Solution
No ratings yet
Gate Aerospace 2011 Solution
33 pages
Lecture 2: Review of Vector Calculus: Instructor: Dr. Gleb V. Tcheslavski Contact: Office Hours: Class Web Site
No ratings yet
Lecture 2: Review of Vector Calculus: Instructor: Dr. Gleb V. Tcheslavski Contact: Office Hours: Class Web Site
30 pages
LectureSeries 01 SHM
No ratings yet
LectureSeries 01 SHM
42 pages
Diss 6 10
No ratings yet
Diss 6 10
5 pages
Lecture6 Orthogonality Dot Product
No ratings yet
Lecture6 Orthogonality Dot Product
5 pages
6 Electromagnetic Fields and Waves
No ratings yet
6 Electromagnetic Fields and Waves
25 pages
Khyvbjlk
No ratings yet
Khyvbjlk
1 page
Exercise 2 Data Analysis and Visualization Computer Graphics - Projection and Shading
No ratings yet
Exercise 2 Data Analysis and Visualization Computer Graphics - Projection and Shading
2 pages
Lecture 7 - 2D Element - Triangular Element (7 Sep 2021
No ratings yet
Lecture 7 - 2D Element - Triangular Element (7 Sep 2021
30 pages
A New Approach For Smarandache Curves
No ratings yet
A New Approach For Smarandache Curves
11 pages
Theorist's Toolkit Lecture 8: High Dimensional Geometry and Geometric Random Walks
No ratings yet
Theorist's Toolkit Lecture 8: High Dimensional Geometry and Geometric Random Walks
8 pages
K. F. Riley M. P. Hobson Student Solution Manual For Mathematical Methods For Physics and Engineering Third Edition 2006 Cambridge University Press
No ratings yet
K. F. Riley M. P. Hobson Student Solution Manual For Mathematical Methods For Physics and Engineering Third Edition 2006 Cambridge University Press
15 pages
Mathematics - IJMCAR - On Eccentric Connectivity Index - A. JAYENTHI
No ratings yet
Mathematics - IJMCAR - On Eccentric Connectivity Index - A. JAYENTHI
6 pages
Chapter-5: Proposed Split Filtering Work
No ratings yet
Chapter-5: Proposed Split Filtering Work
10 pages
FEMM 2.1
No ratings yet
FEMM 2.1
17 pages
Ch1 Lorentz Group & Lorentz Invariant
No ratings yet
Ch1 Lorentz Group & Lorentz Invariant
32 pages
Chapter 3 One-D Problems (1)
No ratings yet
Chapter 3 One-D Problems (1)
43 pages
Module 4 MAXWELL EM For EEE Students
No ratings yet
Module 4 MAXWELL EM For EEE Students
23 pages
3.3 The Smith Chart
No ratings yet
3.3 The Smith Chart
6 pages
1staticfields SN WB Part1
No ratings yet
1staticfields SN WB Part1
7 pages
O Level Additional Maths Notes
100% (1)
O Level Additional Maths Notes
8 pages
Satellite Systems - Design, Modeling, Simulation and Analysis
No ratings yet
Satellite Systems - Design, Modeling, Simulation and Analysis
226 pages
Faw Formulae
No ratings yet
Faw Formulae
7 pages
Geostatistics Formula Sheet
No ratings yet
Geostatistics Formula Sheet
2 pages
Vector Calculus
No ratings yet
Vector Calculus
37 pages
Statistics Theory Notes
No ratings yet
Statistics Theory Notes
6 pages
Exercises of Basic Analytical Geometry
From Everand
Exercises of Basic Analytical Geometry
Simone Malacrida
No ratings yet
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Application of Derivatives Tangents and Normals (Calculus) Mathematics E-Book For Public Exams
From Everand
Application of Derivatives Tangents and Normals (Calculus) Mathematics E-Book For Public Exams
Mohmmad Khaja Shareef
5/5 (1)
Shortcuts to College Calculus Refreshment Kit
From Everand
Shortcuts to College Calculus Refreshment Kit
Juan Acevedo
No ratings yet
Mathematica - Fitting Penner's Skin Scattering
No ratings yet
Mathematica - Fitting Penner's Skin Scattering
8 pages
Special Cases in Simplex Method
No ratings yet
Special Cases in Simplex Method
18 pages
Chapter 3 Fourier Transform
No ratings yet
Chapter 3 Fourier Transform
10 pages
Mte 04
0% (1)
Mte 04
5 pages
Uh Econ 607 Notes
No ratings yet
Uh Econ 607 Notes
255 pages
Linear - Equations - Worksheet 2 CW
No ratings yet
Linear - Equations - Worksheet 2 CW
2 pages
VI.2. Schwarz's Lemma
No ratings yet
VI.2. Schwarz's Lemma
4 pages
maths-class-xii-chapter-12-linear-programming-practice-paper-13-answers
No ratings yet
maths-class-xii-chapter-12-linear-programming-practice-paper-13-answers
12 pages
SMCS
No ratings yet
SMCS
7 pages
Solution of A Nonhomogeneous Equation
No ratings yet
Solution of A Nonhomogeneous Equation
3 pages
Truncated SVD For Image Compression
No ratings yet
Truncated SVD For Image Compression
10 pages
Calculus 2 Practice Exam #2
No ratings yet
Calculus 2 Practice Exam #2
9 pages
Econometrics Sample Questions
No ratings yet
Econometrics Sample Questions
2 pages
Boundary Vlaue Problem Module 4 With Solutions Page 1-20
No ratings yet
Boundary Vlaue Problem Module 4 With Solutions Page 1-20
20 pages
Cat 2 CCP
No ratings yet
Cat 2 CCP
8 pages
Haphazardly Sampled Data Processing Algorithm Using Lomb Welch Periodogram
No ratings yet
Haphazardly Sampled Data Processing Algorithm Using Lomb Welch Periodogram
10 pages
TIFR Pamphlet On Algebraic Number Theory
No ratings yet
TIFR Pamphlet On Algebraic Number Theory
92 pages
Statistics Probability Midterm Cheat Sheet
0% (1)
Statistics Probability Midterm Cheat Sheet
5 pages
Frequency Domain Specifications: Ali Karimpour Apr 2009
No ratings yet
Frequency Domain Specifications: Ali Karimpour Apr 2009
129 pages
Simplifying Algebraic Expressions
No ratings yet
Simplifying Algebraic Expressions
3 pages
Simplexmethod-lec.-2 (1)(2)(original)
No ratings yet
Simplexmethod-lec.-2 (1)(2)(original)
21 pages
Problems: TABLE 10.2.1 Values of The Error e
No ratings yet
Problems: TABLE 10.2.1 Values of The Error e
10 pages
Computer Graphics Using Opengl, 3 Edition F. S. Hill, Jr. and S. Kelley
No ratings yet
Computer Graphics Using Opengl, 3 Edition F. S. Hill, Jr. and S. Kelley
29 pages
Module - 1
No ratings yet
Module - 1
63 pages
Factor I Sing Exercises 132
No ratings yet
Factor I Sing Exercises 132
4 pages
Module 5 Genmath
100% (1)
Module 5 Genmath
23 pages
The Adaptive Cross-Approximation Technique For The 3-D Boundary-Element Method PDF
No ratings yet
The Adaptive Cross-Approximation Technique For The 3-D Boundary-Element Method PDF
4 pages
Gosford 2023 4U Trials & Solutions
No ratings yet
Gosford 2023 4U Trials & Solutions
25 pages

Chapter 3 Normalized Principal Components Analysis

Uploaded by

Chapter 3 Normalized Principal Components Analysis

Uploaded by

Chapter 3: Normalized Principal Components Analysis

Data standardization: centering and reducing the data

4. Individuals scatter plot analysis

Relative contribution or cosine square:

5. Variable scatterplot analysis

Projection quality of the variables points

The communality according to the first factor map is:

Number of axes to retain: criterion

7. Interpretation of variables factor maps

4 Nearly orthogonal directions indicate a weak linear correlation.

8. Interpretation of the individual factors map

You might also like