Statistical Tests of The Estimated Variance Factor
Statistical Tests of The Estimated Variance Factor
of
the Estimated Variance Factor
in
by
ABSTRACT
Similarities and differences between the implementation of testing the estimated variance
factor in different Least-Squares adjustment packages are presented. A review of the
characteristics of the Chi-Square (χ2) and Fisher (F) distributions indicate that both
distributions have similar shapes for low degrees of freedom and their shape approaches a
Normal distribution as the degrees of freedom increase. The F distribution becomes a χ2
distribution when the second variance is associated with the reference variance of the
whole population. Different formulations of testing the estimated variance factor from a
2-side and a 1-side test interval are derived and illustrated to provide background for their
implementation in the different software packages.
Results obtained from the different packages indicate the same outcome and equivalent
statistics especially for the large degree of freedom data set example despite the fact that
the χ2 and F distributions are mathematically different and that StarNet and GeoLab
packages use a 2-side χ2 test formulation compared with a 1-side F test formulation using
the B-method for the MOVE3 package.
One of the quantities being tested in a Least-Squares adjustment of survey data is the
estimated variance factor ( ). The estimated variance factor is a unitless quantity
corresponding to the weighted sum of the square of residuals divided by the degrees of
freedom which is given by the following expression:
The statistical test of the estimated variance factor consists of verifying if its value ( ) is
within an interval computed from a χ2 or Fisher F distribution based on the degree of
freedom at a given error probability level.
The statistical testing of the estimated variance factor can be seen as way to confirm the
initial assumptions made about the measurements errors to be mostly random,
uncorrelated and coming from a unit variance population and by how much it deviates
from these premises.
Before examining the statistical testing of the estimated variance factor, let’s review the
characteristics of the χ2and F distributions.
2
It has the following probability density function (pdf):
The Chi-Square distribution is a special case of the Gamma distribution. It can be used to
make statistical inference about the data.
Its shape has the following characteristics for different degrees of freedom:
As the degree of freedom increases (ν > 10), its shape approaches a Normal Distribution
with mean and variance of ν and 2ν respectively such as χ2 (ν = large) ---> N(ν, 2ν) based on
the Central Limit theorem.
Φ(x) = Γ((ν1+ν2)/2) . (ν1/ν2) (ν1/2) . x(ν1/2 - 1) for x > 0, ν1 > 0 and ν2 > 0
Fν1, ν2 Γ(ν1/2) Γ(ν2/2) . (1 + x ν1/ ν2)(ν1+ν2)/2
where Γ(ν1/2) = 0 ∫∞ ℮-t . t (ν1/2 – 1) dt
Γ(ν2/2) = 0 ∫∞ ℮-t . t (ν2/2 – 1) dt
Γ((ν1+ν2)/2) = 0 ∫∞ ℮-t. t ((ν1+ν2)/2–1) dt
Its shape has the following characteristics for different degrees of freedom:
- Skew (non-symmetrical) shape for small degrees of freedom (ν1 and ν2 <= 10)
- A value of 0 at x = 0
- A maximum value between 0 and +∞
- 2 points of inflection each side of a maximum value
- Positive x-axis as asymptote
As both degrees of freedom increase (ν1 and ν2 > 10), its shape approaches a Normal
Distribution such as F (ν1, ν2 = large) ---> N(ν2/(ν2 - 2), 2ν22(ν1 + ν2 -2)/(ν1(ν2 - 2)2(ν2 - 4)))
for ν1 > 2 and ν2 > 4 .
Moreover, the F distribution becomes a χ2 distribution when the degree of freedom of the
second variance approaches infinity ν2 ---> ∞ such that χ2ν2 /ν2 = 1, thus F ν, ∞ = χ2 ν /ν .
The statistical test of the Estimated Variance Factor consists of verifying if its value ( )
is within a confidence interval computed from χ2 or F distribution percentile values based
on a given degree of freedom at a given error probability level.
The Chi-Square test of the estimated variance factor is usually conducted by verifying if
its value is within an interval defined by the upper and lower limits obtained from 2
percentile values of a Chi-Square distribution for a given degree of freedom (n-u) at a
given error probability (α in %) divided by the adjustment degree of freedom.
4
The 2-side Chi-Square statistical test interval can be expressed as:
Probability that [ χ2(n-u), (1 - α/2) /(n-u) < < χ2(n-u), α/2 /(n-u) ] = (1 – α) % for σ02 = 1
Probability that [(n-u) . / χ2(n-u), α/2 < (σ02 = 1) < (n-u) . / χ2(n-u), (1 - α/2)] = (1 – α) %
where is the estimated variance factor and σ02 is the reference observation variance
χ (n-u), (1 - α/2) and χ2(n-u), (α/2) are the percentiles obtained from a χ2 distribution for
2
The 2-Side Chi-Square test of the estimated variance factor can be illustrated as follow:
The F test of the Estimated variance factor is usually conducted by verifying if its value
is within an interval defined by the upper limit obtained from the percentile value of
an F distribution for a given degree of freedom (n-u) for the first variance and +∞ for the
second variance assumed to be the reference variance of the whole population at a given
error probability (α in %).
5
Since the estimated variance factor is expected to be greater than the reference variance of
the whole population > 1.0, it is not necessary to set a lower limit to the F test in these
conditions.
The 1-Side F test of the estimated variance factor can be illustrated as follow:
Following are some numerical examples showing the implementations of the statistical
tests conducted on the estimated variance factor using the χ2 and the F distributions from
different Least-Squares adjustment packages used in Surveying and Geodesy.
Minimally constrained adjustments using 1 known control station from GNSS baseline
vectors are presented for small and large degree of freedom data sets processed under the
same conditions in three adjustment packages namely: StarNet, MOVE3 and GeoLab.
6
4.1 Small Degree Of Freedom Data Set:
The example for the small degree of freedom data set consists of adjusting 3 control
points from 5 single occupied GPS baseline vectors based on 1 known control station as
shown in Figure 1.
Excerpts from StarNet adjustment report: Excerpts from MOVE3 adjustment report:
7
Excerpts from GeoLab adjustment report:
-----------------------------------------------------------------
| S T A T I S T I C S S U M M A R Y
-----------------------------------------------------------------
| Residual Critical Value Type | Tau Max
| Residual Critical Value | 2.3015
| Number of Flagged Residuals | 0
| Convergence Criterion | 0.0010
| Final Iteration Counter Value | 2
| Confidence Level Used | 95.0000
| Estimated Variance Factor | 7.9398
| Number of Degrees of Freedom | 6
|
| Chi-Square Test on the Variance Factor:
| 3.2968e+00 < 1.0000 < 3.8418e+01 ?
| (7.9398x6/χ26,97.5%)--^ ^--(7.9398x6/χ26,2.5%)
2
| for χ 6,97.5% = 14.45 THE TEST FAILS for χ26,2.5% = 1.24
-----------------------------------------------------------------
The example for the large degree of freedom data set consists of adjusting 51 stations
from 4896 GPS baseline vectors occupied several times based on 1 known control station
as shown in Figure 2.
8
Excerpts from StarNet adjustment report: Excerpts from MOVE3 adjustment report:
Statistical tests of the estimated variance factor have been described in terms of the χ2 and
F distributions used in the calculation of the test interval limits for implementations used
in three different Least-Squares adjustment packages.
Review of the characteristics of the χ2 and F distributions indicates that as the degrees of
freedom increase, both distributions approach a Normal distribution as per the Central
Limit theorem of statistics.
9
Moreover, the F distribution converts to a χ2 distribution when the second degree of
freedom becomes very large which is used to test the estimated variance factor with
respect to the whole population variance.
Different formulations of testing the estimated variance factor from a 2-side and a 1-side
test interval are illustrated to provide background for their implementation in the different
software packages.
Excerpts from StarNet adjustment reports are presented in terms of the square-root
(positive) values of the estimated variance factor and the associated 2-side χ2 distribution
statistics over the degree of freedom whereas excerpts from GeoLab χ2 tests are presented
as 2-side χ2 tests in terms of the initial observation unit variance of 1.0 with the weighted
sum of the residuals over the χ2 statistics.
The MOVE3 adjustment package reports the statistical tests of the estimated variance
factor from a 1-side test interval (upper limit) obtained from an F distribution having its
second degree of freedom set to ∞ for the second variance describing the reference unit
variance of the whole population with additional statistics based on the power β (Beta) of
the test set at 70% and α multi-dimensional values computed by the B-method (Baarda)
of testing which slightly push the upper limits inside the interval.
Results obtained from the different adjustment packages indicate the same outcome and
equivalent statistics especially for the large degree of freedom example despite the fact
that the χ2and F distributions are mathematically different and that StarNet and GeoLab
use a 2-side χ2 test formulation compared with a 1-side F test formulation for the MOVE3
package.
Statistical test of the estimated variance factor should always be examined in conjunction
with the degree of freedom, standardized residuals, measurement noise, mathematical and
stochastic models used in the adjustment.
6.0 References:
D.Wells and E. Krakiwsky (1971): The Method of Least-Squares. Lectures Notes 18,
Department of Survey Engineering, University of New Brunswick