The Multivariate Normal Distribution
Why should we consider the multivariate normal distribution? It might seem that applied problems are so complex that the distribution would be of interest only from a mathematical perspective. There are several reasons to consider it:
1. It is mathematically tractable for a large number of problems, so progress toward answers to statistical questions can be made, even if only approximately.
2. Because it is tractable for so many problems, it provides insight into techniques based upon other distributions or even non-parametric techniques. For this reason, it often serves as a benchmark against which other methods are judged.
3. For some problems it serves as a reasonable model of the data. In other instances, transformations can be applied to the responses so that they conform more closely to multivariate normality.
4. The sampling distributions of many (multivariate) statistics are normal, regardless of the parent distribution (Multivariate Central Limit Theorems). Thus, for large sample sizes, we may be able to use results from the multivariate normal distribution to answer our statistical questions, even when the parent distribution is not multivariate normal.
Consider first the univariate normal distribution with parameters $\mu$ (the mean) and $\sigma^2$ (the variance) for the random variable $x$,
$$f(x) = \frac{1}{\sqrt{2\pi\sigma^2}}\, e^{-\frac{1}{2}\frac{(x-\mu)^2}{\sigma^2}} \qquad (1)$$
for $-\infty < x < \infty$, $-\infty < \mu < \infty$, and $\sigma^2 > 0$.
Now rewrite the exponent $(x-\mu)^2/\sigma^2$ in the linear algebra form $(x-\mu)(\sigma^2)^{-1}(x-\mu)$. This formulation matches the generalized or Mahalanobis squared distance $(x-\mu)'\Sigma^{-1}(x-\mu)$, where both $x$ and $\mu$ are vectors. The multivariate normal distribution can be derived by substituting the Mahalanobis squared distance into the univariate formula and normalizing the distribution so that its total probability is 1. This yields
$$f(x) = \frac{1}{(2\pi)^{p/2}\,|\Sigma|^{1/2}}\, e^{-\frac{1}{2}(x-\mu)'\Sigma^{-1}(x-\mu)} \qquad (2)$$
For the bivariate case ($p = 2$), the exponent expands to
$$(x-\mu)'\Sigma^{-1}(x-\mu) = \frac{1}{1-\rho_{12}^2}\left[\left(\frac{x_1-\mu_1}{\sqrt{\sigma_{11}}}\right)^2 + \left(\frac{x_2-\mu_2}{\sqrt{\sigma_{22}}}\right)^2 - 2\rho_{12}\,\frac{x_1-\mu_1}{\sqrt{\sigma_{11}}}\,\frac{x_2-\mu_2}{\sqrt{\sigma_{22}}}\right]. \qquad (3)$$
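As a numerical check on equation (2), the following Python sketch evaluates the density directly from the Mahalanobis squared distance and compares it with scipy.stats.multivariate_normal; the parameter values are arbitrary, chosen only for illustration.

```python
import numpy as np
from scipy.stats import multivariate_normal

# Illustrative bivariate parameters (p = 2); the values are arbitrary.
mu = np.array([1.0, 2.0])
Sigma = np.array([[2.0, 0.8],
                  [0.8, 1.0]])
x = np.array([1.5, 2.5])

# Equation (2) built directly from the Mahalanobis squared distance.
p = len(mu)
diff = x - mu
d2 = diff @ np.linalg.inv(Sigma) @ diff     # (x - mu)' Sigma^{-1} (x - mu)
fx = np.exp(-0.5 * d2) / ((2 * np.pi) ** (p / 2) * np.sqrt(np.linalg.det(Sigma)))

# Cross-check against SciPy's implementation of the same density.
print(fx, multivariate_normal(mean=mu, cov=Sigma).pdf(x))
```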
Multivariate Normal Properties
Let X ∼ Np (µ, Σ) be p-variate multivariate normal with mean µ and variance-covariance matrix
Σ, where
$$X = \begin{pmatrix} X_1 \\ X_2 \\ \vdots \\ X_p \end{pmatrix}, \qquad \mu = \begin{pmatrix} \mu_1 \\ \mu_2 \\ \vdots \\ \mu_p \end{pmatrix}, \qquad \Sigma = \begin{pmatrix} \sigma_{11} & \sigma_{12} & \cdots & \sigma_{1p} \\ \sigma_{21} & \sigma_{22} & \cdots & \sigma_{2p} \\ \vdots & \vdots & \ddots & \vdots \\ \sigma_{p1} & \sigma_{p2} & \cdots & \sigma_{pp} \end{pmatrix}.$$
1. The solid ellipsoid of all $x$ such that $(x-\mu)'\Sigma^{-1}(x-\mu) \le \chi^2_p(\alpha)$ contains $(1-\alpha)100\%$ of the probability in the distribution, where $\chi^2_p(\alpha)$ is the upper $(100\alpha)$th percentile of the chi-square distribution with $p$ degrees of freedom. This also implies that contours delineating regions of constant probability about $\mu$ are given by $(x-\mu)'\Sigma^{-1}(x-\mu) = \chi^2_p(\alpha)$.
2. The semiaxes of the ellipsoid containing $(1-\alpha)100\%$ of the probability are given by the eigenvalues ($\lambda_i$) and eigenvectors ($e_i$) of $\Sigma$, such that the semiaxes are $\pm c\sqrt{\lambda_i}\, e_i$, where $c^2 = \chi^2_p(\alpha)$ (see the sketch following this list).
3. Let $C$ be a $p \times q$ matrix of constants of rank $q$; then
$$C'X \sim N_q(C'\mu,\; C'\Sigma C).$$
4. Partition $X$ into subvectors $X_1$ ($p \times 1$) and $X_2$ ($q \times 1$), and partition $\mu$ and $\Sigma$ conformably, so that
$$X = \begin{pmatrix} X_1 \\ X_2 \end{pmatrix} \sim N_{p+q}\!\left(\begin{pmatrix} \mu_1 \\ \mu_2 \end{pmatrix}, \begin{pmatrix} \Sigma_{11} & \Sigma_{12} \\ \Sigma_{21} & \Sigma_{22} \end{pmatrix}\right).$$
5. If two variates of the multivariate normal, say $X_1$ and $X_2$, are uncorrelated, so that $\rho_{12} = 0$ (and hence $\sigma_{12} = 0$), then $X_1$ and $X_2$ are independent. This property does not hold in general for other distributions. However, it is always true that if two variates are independent, then they are uncorrelated, no matter what their joint distribution is.
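A minimal sketch of property 2, assuming an arbitrary illustrative $\Sigma$ and $\alpha = 0.05$: the eigendecomposition of $\Sigma$ gives the directions $e_i$ and lengths $c\sqrt{\lambda_i}$ of the semiaxes, with $c^2 = \chi^2_p(\alpha)$ taken from the upper chi-square percentile.

```python
import numpy as np
from scipy.stats import chi2

# Illustrative covariance matrix and alpha level; the values are arbitrary.
Sigma = np.array([[2.0, 0.8],
                  [0.8, 1.0]])
alpha = 0.05
p = Sigma.shape[0]

# chi^2_p(alpha) is the upper (100*alpha)th percentile of the chi-square with p df.
c = np.sqrt(chi2.ppf(1 - alpha, df=p))

# eigh returns eigenvalues in ascending order and eigenvectors as columns.
lam, E = np.linalg.eigh(Sigma)

# Semiaxes of the (1 - alpha)100% ellipsoid: +/- c * sqrt(lambda_i) * e_i.
for i in range(p):
    print("semiaxis", i + 1, ":", c * np.sqrt(lam[i]) * E[:, i])
```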
Sampling Distribution Properties

1. Let
$$\bar{X} = \begin{pmatrix} \bar{X}_1 \\ \bar{X}_2 \\ \vdots \\ \bar{X}_p \end{pmatrix}$$
be the vector of sample means from a sample of size $n$ from the multivariate normal distribution for $X$; then
$$\bar{X} \sim N_p\!\left(\mu, \tfrac{1}{n}\Sigma\right).$$
2. Let S be the sample variance-covariance matrix computed from a sample of size n from the
multivariate normal distribution for X, then
$$(n-1)S \sim W_{n-1}(\Sigma),$$
the Wishart distribution with (n − 1) degrees of freedom.
3. The Wishart density for $S$ does not exist when $n \le p$. Further, $S$ must be positive definite ($\lambda_i > 0$ for all $i = 1, 2, \ldots, p$) for the density to exist.
4. X̄ and S are stochastically independent.
5. Let $(n-1)S \sim W_{n-1}(\Sigma)$; then
$$(n-1)\,C'SC \sim W_{n-1}(C'\Sigma C).$$
6. Let $A_1 = (n_1-1)S_1 \sim W_{n_1-1}(\Sigma)$ and $A_2 = (n_2-1)S_2 \sim W_{n_2-1}(\Sigma)$, where $S_1$ and $S_2$ are independent estimates of $\Sigma$; then
$$A_1 + A_2 \sim W_{n_1+n_2-2}(\Sigma)$$
and
$$\frac{1}{n_1+n_2-2}\,(A_1 + A_2)$$
is a "pooled" estimate of $\Sigma$.
7. Let $X_1, X_2, \ldots, X_n$ be a simple random sample of size $n$, where $X_i \sim N_p(\mu, \Sigma)$; then, approximately for $(n-p)$ large,
$$n(\bar{X}-\mu)'S^{-1}(\bar{X}-\mu) \sim \chi^2_p.$$
A central limit theorem says that for very large n − p we can relax the requirement that the
Xi be multivariate normal. Further, for n − p large, an approximate (1 − α)100% confidence
region for µ is given by the set of all µ such that
$$n(\bar{X}-\mu)'S^{-1}(\bar{X}-\mu) \le \chi^2_p(\alpha).$$
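Property 7 can be checked by simulation. The sketch below (all parameter values arbitrary) repeatedly samples from $N_p(\mu, \Sigma)$, computes $n(\bar{X}-\mu)'S^{-1}(\bar{X}-\mu)$, and estimates how often it falls below the $\chi^2_p(\alpha)$ cutoff; the coverage should be close to $1-\alpha$ for $n-p$ large.

```python
import numpy as np
from scipy.stats import chi2

rng = np.random.default_rng(0)

# Illustrative parameters; the values are arbitrary.
mu = np.array([0.0, 1.0, 2.0])
Sigma = np.array([[2.0, 0.5, 0.3],
                  [0.5, 1.0, 0.2],
                  [0.3, 0.2, 1.5]])
p, n, reps, alpha = len(mu), 100, 2000, 0.05
cutoff = chi2.ppf(1 - alpha, df=p)          # chi^2_p(alpha), the upper percentile

covered = 0
for _ in range(reps):
    X = rng.multivariate_normal(mu, Sigma, size=n)
    xbar = X.mean(axis=0)
    S = np.cov(X, rowvar=False)             # sample covariance matrix, divisor n - 1
    d = xbar - mu
    T2 = n * d @ np.linalg.inv(S) @ d       # n (xbar - mu)' S^{-1} (xbar - mu)
    covered += T2 <= cutoff

# For n - p large, the coverage should be close to 1 - alpha = 0.95.
print(covered / reps)
```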
Assessing Multivariate Normality
1. All marginal distributions must be normal. Check the normality of each variable. If a variable
does not conform to the normal distribution, then the set of variables cannot be multivariate
normal.
Steps for the q-q normal distribution plot:
(a) Order the observations from smallest to largest (X(1) ≤ X(2) ≤ . . . ≤ X(n) ). These
are the order statistics for this random variable and they estimate the quantiles of the
distribution from which they were sampled. The quantile is the value at which a certain
proportion of the distribution is less than or equal to that value.
(b) Estimate the proportion of the distribution that should be less than or equal to the value
of each order statistic. One such estimate is
$$\frac{i - 1/2}{n},$$
where $i$ is the rank of each observation.
(c) Compute the expected quantiles from the normal distribution as
$$q_i = \Phi^{-1}\!\left(\frac{i-1/2}{n}\right),$$
where Φ−1 is the inverse of the standard normal cumulative distribution function.
(d) Plot the observed quantiles, $X_{(i)}$, versus the expected quantiles, $q_i$, and check the plot for linearity. If the observed quantiles correspond to a normal distribution, the points will fall on a straight line; if not, reject (multivariate) normality. Note that you should have a minimum sample size of 20 to begin to have confidence in the plot. (A code sketch for this plot appears at the end of this section.)
2. All pairs of variables must be bivariate normal. Produce scatter plots of all pairs of variables.
Density regions should correspond roughly to elliptical patterns with linear relationships
among pairs of variables.
3. Linear combinations of the variables are normal. Check any meaningful linear combinations (sums, differences) for normality. Further, check the principal components (the linear combinations corresponding to the eigenvectors of $\Sigma$) for normality, and check pairs of linear combinations and principal components for bivariate normality. Rejection of normality or bivariate normality for linear combinations also rejects multivariate normality.
4. Squared distances about the population mean vector are distributed as chi-square with p
degrees of freedom. Estimate the population mean vector with the sample mean vector, and
estimate the population covariance matrix with the sample covariance matrix. Compute the
squared distances of each observation to the sample mean vector and check to see that they
are chi-square distributed.
Steps for the q-q chi-square distribution plot:

(a) Compute the squared distance of each observation from the sample mean vector, $d_i^2 = (x_i - \bar{x})'S^{-1}(x_i - \bar{x})$.

(b) Order the squared distances from smallest to largest, $d_{(1)}^2 \le d_{(2)}^2 \le \ldots \le d_{(n)}^2$.

(c) Compute the expected quantiles from the chi-square distribution with $p$ degrees of freedom, taking the quantile corresponding to probability $(i - 1/2)/n$ for the $i$th ordered distance.

(d) Plot the ordered squared distances versus the expected quantiles and check the plot for linearity, as with the q-q normal plot (a code sketch for both plots follows below).

These steps can be repeated for subsets of the variables and linear combinations. This may be useful in identifying whether "problems" of multivariate normality are associated with a set of variables or a single variable.
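The following Python sketch implements both q-q plots described above; the data are simulated here purely for illustration, and any $n \times p$ data matrix could be substituted.

```python
import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import norm, chi2

rng = np.random.default_rng(0)

# Simulated illustrative data: any (n x p) data matrix X could be used instead.
mu = np.array([1.0, 2.0])
Sigma = np.array([[2.0, 0.8],
                  [0.8, 1.0]])
n = 50
X = rng.multivariate_normal(mu, Sigma, size=n)
p = X.shape[1]
probs = (np.arange(1, n + 1) - 0.5) / n     # (i - 1/2)/n for i = 1, ..., n

# q-q normal plot for one variable: order statistics vs. Phi^{-1}((i - 1/2)/n).
plt.figure()
plt.scatter(norm.ppf(probs), np.sort(X[:, 0]))
plt.xlabel("expected normal quantiles")
plt.ylabel("observed order statistics")

# q-q chi-square plot: ordered squared distances vs. chi-square(p) quantiles.
xbar = X.mean(axis=0)
Sinv = np.linalg.inv(np.cov(X, rowvar=False))
diff = X - xbar
d2 = np.einsum("ij,jk,ik->i", diff, Sinv, diff)   # (x_i - xbar)' S^{-1} (x_i - xbar)
plt.figure()
plt.scatter(chi2.ppf(probs, df=p), np.sort(d2))
plt.xlabel("expected chi-square quantiles")
plt.ylabel("ordered squared distances")
plt.show()
```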
If the data do not conform well to multivariate normality, then:

1. check for outliers or errors in the data. If outliers are identified, then use methods for dealing
with outliers to determine their impact on the analysis. Alternative robust methods may also
be available. Remember that an outlier may be the most informative observation in the data
set as it is “different” from all the others.
2. consider transformations of one or more of the variables. A variable may, for example, follow
the lognormal distribution, so a logarithm transformation would be in order. Note that
transformations also affect the variable’s associations with the other variables.
3. consider robust or alternative multivariate methods if available. Some techniques are much less sensitive to outliers or the distribution of the data than others. See, for example, Silverman, B.W., 1986, Density Estimation for Statistics and Data Analysis, Chapman & Hall, New York, 175pp.
4. consider basing inference on results using resampling methods (Monte Carlo, bootstrap, or
permutation methods). See, for example, Manly, B.F.J., 1997, Randomization, Bootstrap and
Monte Carlo Methods in Biology, second edition, Chapman & Hall, New York, 399pp.
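As one concrete illustration of the resampling idea, here is a generic percentile-bootstrap sketch for the mean vector. It is not the specific procedure of the references above; the function name, defaults, and data are invented for this example.

```python
import numpy as np

rng = np.random.default_rng(0)

def bootstrap_mean_ci(X, B=2000, alpha=0.05):
    """Percentile-bootstrap intervals for each component of the mean vector.

    A generic sketch: resample rows of X with replacement and take the
    empirical alpha/2 and 1 - alpha/2 quantiles of the resampled means.
    """
    n = X.shape[0]
    boot_means = np.empty((B, X.shape[1]))
    for b in range(B):
        idx = rng.integers(0, n, size=n)    # resample observations with replacement
        boot_means[b] = X[idx].mean(axis=0)
    lo = np.quantile(boot_means, alpha / 2, axis=0)
    hi = np.quantile(boot_means, 1 - alpha / 2, axis=0)
    return lo, hi

# Illustrative use on skewed (lognormal) data, where normal-theory regions are suspect.
X = rng.lognormal(mean=0.0, sigma=1.0, size=(100, 3))
print(bootstrap_mean_ci(X))
```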