MW Physics
MW Physics
Mathematical Principles of
Theoretical Physics
(理论物理的数学原理)
Printed in Beijing
ISBN 978-7-03-045289-4
Chapter 1
General Introduction
Physics is an important part of science of Nature, and is one of the oldest science disci-
plines. It intersects with many other disciplines of science such as mathematics and chem-
istry.
Great progresses have been made in physics since the second half of the 19th century.
The Maxwell equation, the Einstein special and general relativity and quantum mechanics
have become cornerstones of modern physics. Nowadays physics faces new challenges. A
partial list of most important and challenging ones is given as follows.
Dark matter and dark energy are two great mysteries in physics. Their gravitational
effects are observed and are not accounted for in the Einstein gravitational field equations,
and in the Newtonian gravitational laws.
These are certainly most fundamental questions about our Universe. The current dom-
inant thinking is that the Universe was originated from the Big-Bang. However, there are
many unsolved mysteries associated with the Big-Bang theory, such as the horizon problem,
the cosmic microwave radiation problem, and the flatness problem.
Black holes are fascinating objects in our Universe. However, there are a lot of confu-
sions about black holes, even its very definition.
5. Quark Confinement: Why has there never been observed free quarks?
2 Chapter 1 General Introduction
There are 12 fundamental subatomic particles, including six leptons and six quarks. This
is a mystery for not being able to observe free quarks and gluons.
Each particle has its own antiparticle. It is clear that there are far more particles in this
Universe than anti-particles. What is the reason? This is another mystery, which is also
related to the formation and origin of our Universe.
We know that the Newton and the Coulomb formulas are basic force formulas for grav-
itational force and for electromagnetic force. One longstanding problem is to derive similar
force formulas for the weak and the strong interactions, which are responsible for holding
subatomic particles together and for various decays.
The main focus of this book is on the symbiotic interplay between theoretical physics
and advanced mathematics. Throughout the entire history of science, the searching for
mathematical representations of the laws of Nature is built upon the believe that the Nature
speaks the language of Mathematics. The Newton’s universal law of gravitation and laws of
mechanics are clearly among the most important discoveries of the mankind based on the
interplay between mathematics and natural sciences. This viewpoint is vividly revealed in
Newton’s introduction to the third and final volumes of his great Principia Mathematica: “I
now demonstrate the frame of the system of the world.”
1.2 Law of Gravity, Dark Matter and Dark Energy 3
It was, however, to the credit of Albert Einstein who envisioned that the laws of Nature
are dictated by a few fundamental mathematical principles. Inspired by the Albert Einstein’s
vision, our general view of Nature is synthesized in two guiding principles, Principles 2.1
& 2.2, which can be recapitulated as follows:
Nature speaks the language of Mathematics: the laws of Nature 1) are repre-
sented by mathematical equations, 2) are dictated by a few fundamental prin-
ciples, and 3) always take the simplest and aesthetic forms.
Newtonian gravity
Newton’s universal law of gravity states that the gravitational force between two massive
objects with m and M is given by
GmM
F =− 2 , (1.2.1)
r
which is an empirical law.
Einstein’s General theory of relativity
One of the greatest discovery in the history of science is Albert Einstein’s general the-
ory of relativity (Einstein, 1915, 1916). He derives the law of gravity, his gravitational
field equations by postulating two revolutionary fundamental principles: the principle of
equivalence (PE) and the principle of general relativity (PGR):
1 8π G
Rµν − gµν R = − 4 Tµν . (1.2.3)
2 c
This is the most profound theory of science. The PGR is a symmetry principle, and
the law of gravity, represented as a set of differential equations (1.2.3), is dictated by this
profound and simple looking symmetry principle. The connection to the Newtonian gravi-
tational law (1.2.1) is achieved through the following Schwarzschild solution of the Einstein
field equations in the exterior of a ball of spherically symmetric matter field with mass M:
! " ! "
2 2MG 2 2 2MG −1 2
ds = − 1 − 2 c dt + 1 − 2 dr + r2 d θ 2 + r2 sin2 θ d ϕ 2 . (1.2.4)
c r c r
Here the temporal component of the metric and the gravitational force of the ball exerted on
an object of mass mass m are
! " ! "
2MG 2 GMm
g00 = − 1 − 2 = − 1+ 2ψ , F = −m∇ψ = − 2 . (1.2.5)
c r c r
The div-free condition, ∇ µ Xµν = 0, imposed on the variation element represents energy-
momentum conservation. We call this variational principle, the principle of interaction
dynamics (PID), which will be discussed hereafter again.
Using PID (1.2.6), 1 we then derive the new gravitational field equations (Ma and Wang,
2014e):
1 8π G
Rµν − gµν R = − 4 Tµν − ∇µ ∇ν φ , (1.2.7)
# 2 c $
8 π G
∇µ Tµν + ∇µ ∇ν φ = 0. (1.2.8)
c4
In summary, we have derived new law of gravity based solely on first principles, the PE, the
PGR, and the constraint variational principle (1.2.6):
We remark that it is the duality and both attractive and repulsive behavior of gravity that
maintain the stability of the large scale structure of our Universe.
Modified Newtonian formula from first principles (Ma and Wang, 2014e)
Consider a central matter field with total mass M and with spherical symmetry. We
derive an approximate gravitational force formula:
# $
1 k0
F = mMG − 2 − + k1r , k0 = 4 × 10−18km−1 , k1 = 10−57 km−3 . (1.2.9)
r r
1 The new field equations (1.2.7) can also be equivalently derived using the orthogonal decomposition theo-
c4
Tµν = T!µν − ∇ µ Φν .
8π G
See Chapter 4 and (Ma and Wang, 2014e) for details.
6 Chapter 1 General Introduction
Here the first term represents the Newton gravitation, the attracting second term stands for
dark matter and the repelling third term is the dark energy. We note that the dark matter
property of the gravity and the approximate gravitational interaction formula are consistent
with the MOND theory proposed by (Milgrom, 1983); see also (Milgrom, 2014) and the
references therein. In particular, our modified new formula is derived from first principles.
Dark matter and dark energy: a property of gravity (Ma and Wang, 2014e)
We have shown that it is the duality between the attracting gravitational field {g µν } and
the repulsive dual field {Φ µ = ∇µ φ } in (1.2.7), and their nonlinear interaction that gives
rise to gravity, and in particular the gravitational effect of dark energy and dark matter.
Also, we show in (Hernandez, Ma and Wang, 2015) that consider the gravitational field
outside of a ball of centrally symmetric matter field. There exist precisely two physical
parameters dictating the two-dimensional stable manifold of asymptotically flat space-time
geometry, such that, as the distance to the center of the ball of the matter field increases,
gravity behaves as Newtonian gravity, then additional attraction due to the curvature of
space (dark matter effect), and repulsive (dark energy effect). This also clearly demonstrates
that both dark matter and dark energy are just a property of gravity.
Of course, one can consider dark matter and dark energy as the energy carried by the
gravitons and the dual gravitons, to addressed in the unified field theory later in this book.
Symmetry principles
We have shown that for gravity, the basic symmetry principle is the Einstein PGR, which
dictates the Einstein-Hilbert action, and induces the gravitational field equations (1.2.7)
using PID.
Quantum mechanics provides a mathematical description about the Nature in the molecule,
the atomic and subatomic levels. Modern theory and experimental evidence have suggested
1.3 First Principles of Four Fundamental Interactions 7
that the electromagnetic, the weak and the strong interactions obey the gauge symmetry.
In fact, these symmetries and the Lorentz symmetry, together with the simplicity of laws
of Nature dictate the Lagrangian actions for the electromagnetic, the weak and the strong
interactions:
(a) The principle of general relativity dictates the action for gravity, the Einstein-
Hilbert action.
(b) The principle of Lorentz invariance and the principle of gauge invari-
ance, together with the simplicity principle of laws of Nature, dictate the
Lagrangian actions for the electromagnetic, the weak and the strong in-
teractions.
Here the N wave functions: Ψ = (ψ1 , · · · , ψN )T represent the N particle fields, the N 2 − 1
gauge fields Gaµ represent the interacting potentials between these N particles, and {τa | a =
1, · · · , N 2 − 1} is the set of representation generators of SU(N). The gauge symmetry is then
stated as follows:
Principle of Gauge Invariance. The electromagnetic, the weak, and the strong
interactions obey gauge invariance. Namely, the Dirac equations involved in
the three interactions are gauge covariant and the actions of the interaction
fields are gauge invariant under the gauge transformations (1.3.1).
The field equations involving the gauge fields Gaµ are determined by the corresponding
Yang-Mills action, is uniquely determined by both the gauge invariance and the Lorentz
invariance, together with simplicity of laws of nature:
# $
' ( ! 1 ) mc *
LY M Ψ, {Gaµ } = − Gab gµα gνβ Gaµν Gbαβ + Ψ iγ µ Dµ − Ψ dx, (1.3.2)
M 4 h̄
8 Chapter 1 General Introduction
which is invariant under both the Lorentz and gauge transformations (1.2.3). Here Ψ =
1
Ψ† γ 0 , Ψ† = (Ψ∗ )T is the transpose conjugate of Ψ, Gab = tr(τa τb† ), λab c are the structure
2
constants of {τa | a = 1, · · · , N 2 − 1}, Dµ is the covariant derivative and Gaµν stands for the
curvature tensor associated with D µ :
Dµ = ∂µ + igGkµ τk ,
(1.3.3)
Gaµν = ∂µ Gaν − ∂ν Gaµ + gλbc
a b c
G µ Gν .
1) For the four fundamental interactions, the Lagrangian actions are given
by
!
√
L(g, A, ψ ) = L (g µν , A, ψ ) −gdx, (1.3.4)
M
We now illustrate that PID is also valid for both the weak and strong interactions, re-
placing the classical PLD.
First, PID applied to the Yang-Mills action (1.3.2) takes the following form:
1.3 First Principles of Four Fundamental Interactions 9
+ ,
Gab ∂ ν Gbν µ − gλcd
b αβ c
g Gα µ Gdβ − gΨγµ τa Ψ = (∂µ + αb Gbµ )φa , (1.3.5)
(iγ µ Dµ − m)Ψ = 0, (1.3.6)
where the Dirac equations (1.3.6) are the gauge covariant, and are the dynamic equations
for fermions participating the interaction. The right hand side of (1.3.5) is due to PID, with
the operator
∇A = ∂µ + αb Gbµ
taken in such a way that αb Gbµ is PRI-covariant under the representation transformations
(1.3.8) below.
One important consequence of the PID SU(N) theory is the natural introduction of the
scalar fields φa , reminiscent of the Higgs field in the standard model in particle physics.
Second, for the Yang-Mills action for an SU(N) gauge theory, the resulting Yang-Mills
equations are the Euler-Lagrange equations of the Yang-Mills action:
+ ,
Gab ∂ ν Gbν µ − gλcd
b αβ c
g Gα µ Gdβ − gΨγµ τa Ψ = 0, (1.3.7)
Fourth, one of the most challenging problems for strong interaction is the quark confine
ment-no free quarks have been observed. One hopes to solve this mystery with the quantum
chromodynamics (QCD) based on the classical SU(3) gauge theory. Unfortunately, as we
shall see later that the classical SU(3) Yang-Mills equations produces only repulsive force,
and it is the dual fields in the PID gauge field equations (1.3.5) that give rise to the needed
attraction for the binding quarks together forming hadrons.
Hence experimental evidence of quark confinement, as well as many other properties
derived from the PID strong interaction model, clearly demonstrates the validity of PID for
strong interactions.
Fifth, from the mathematical point of view, the Einstein field equations are in general
non well-posed, as illustrated by a simple example in Section 4.2.2. In addition, for the
classical Yang-Mills equations, the gauge-fixing problem will also pose issues on the well-
posedness of the Yang-Mills field equations; see Section 4.3.5. The issue is caused by the
fact that there are more equations than the number of unknowns in the system. PID induced
model brings in additional unknowns to the equations, and resolves this problem.
Recently we have observed that there is a freedom to choose the set of generators for
representing elements in SU(N). In other words, basic logic dictates that the SU(N) gauge
theory should be invariant under the following representation transformations of the gener-
ator bases:
τ&a = xba τb , (1.3.8)
where X = (xba ) are non-degenerate (N 2 − 1)-th order matrices. Then we can define natu-
rally SU(N) tensors under the transformations (1.3.8). It is clear then that θ a , Gaµ , and the
structure constants λabc are all SU(N)-tensors. In addition, G = 1 Tr(τ τ † ) is a symmetric
ab a b
2
positive definite 2nd-order covariant SU(N)-tensor, which can be regarded as a Riemannian
metric on SU(N). Consequently we have arrived at the following principle of representation
invariance, first discovered by the authors (Ma and Wang, 2014h):
PRI (Ma and Wang, 2014h) For the SU(N) gauge theory, under the repre-
sentation transformations (1.3.8),
It is clear that PRI is a basic logic requirement for an SU(N) gauge theory, and has
profound physical implications.
1.3 First Principles of Four Fundamental Interactions 11
First, as indicated in (Ma and Wang, 2013a, 2014h) and in Chapter 4, the field model
based on PID appears to be the only model which obeys PRI. In fact, based on PRI, for
the gauge interacting fields A µ and {Wµa }3a=1, corresponding to two different gauge groups
U(1) for electromagnetism and SU(2) for the weak interaction, the following combination
α Aµ + β Wµ3 (1.3.9)
is prohibited. The reason is that A µ is an U(1)-tensor with tensor, and Wµ3 is simply the
third component of an SU(2)-tensor. The above combination violates PRI. This point of
view clearly shows that the classical electroweak theory violates PRI, so does the standard
model. The difficulty comes from the artificial way of introducing the Higgs field. The PID
based approach for introducing Higgs fields by the authors appears to be the only model
obeying PRI.
Another important consequence of PRI is that for the term αb Gbµ in the right-hand side
of the PID gauge field equations (1.3.5), both {αb | b = 1, · · · , N 2 − 1} and {Gbµ | b =
1, · · · , N 2 − 1} are SU(N) tensors under the representation transformations (1.3.8).
The coefficients αb represent the portions distributed to the gauge potentials by the
charge, represented by the coupling constant g. Then it is clear that
In the field equations (1.3.5) and (1.3.6) of the SU(N) gauge theory for an
fundamental interaction,
(a) the coupling constant g represents the interaction charge, playing the
same role as the electric charge e in the U(1) abelian gauge theory for
quantum electrodynamics (QED);
(b) the potential
def
Gµ = αb Gbµ (1.3.10)
One great vision of Albert Einstein is his principle of equivalence, which, in the math-
ematical terms, says that the space-time is a 4-dimensional (4D) Riemannian manifold
{M , g µν } with the metric g µν representing the gravitational potential. In other words,
gravity is manifested as the curved effect of the space-time manifold {M , g µν }. In essence,
gravity is manifested by the gravitational fields {g µν , ∇µ φ }, determined by the gravitational
field equations (1.2.7) and (1.2.8) together with the matter distribution {Tµν }.
The gauge theory provides a field theory for describing the electromagnetic, the weak
and the strong interactions. The geometry of the SU(N) gauge theory is determined by
1) the complex bundle 1 M ⊗ p (C4 )N for the wave functions Ψ = (ψ1 , · · · , ψN )T , repre-
senting the N particles,
2) the gauge interacting fields {Gaµ | a = 1, · · · , N 2 − 1}, and their dual fields {φ a | a =
1, · · · , N 2 − 1}, and
3) the gauge field equations (1.3.5) coupled with the Dirac equations (1.3.6).
In other words, the geometry of the complex bundle M ⊗ p (C4 )N , dictated by the gauge
field equations (1.3.5) together with the matter equations (1.3.6), manifests the underlying
interaction.
Consequently, it is natural for us to postulate the Geometric Interaction Mechanism 4.1
for all four fundamental interactions:
Geometric Interaction Mechanism(Ma and Wang, 2014d)
1) (Einstein, 1915) The gravitational force is the curved effect of the time-
space; and
2) the electromagnetic, weak, strong interactions are the twisted effects of
the underlying complex vector bundles M ⊗ p Cn .
We note that Yukawa’s viewpoint, entirely different from Einstein’s, is that the other
three fundamental forces—the electromagnetism, the weak and the strong interactions–take
place through exchanging intermediate bosons such as photons for the electromagnetic in-
teraction, the W± and Z intermediate vector bosons for the weak interaction, and the gluons
for the strong interaction.
It is worth mentioning that the Yukawa Mechanism is oriented toward to computing the
transition probability for particle decays and scattering, and the above Geometric Interac-
tion Mechanism is oriented toward to establishing fundamental laws, such as interaction
potentials, of the four interactions.
1 Throughout this book, we use the notation ⊗ to denote ”gluing a vector space to each point of a manifold”
p
to form a vector bundle. For example, !
M ⊗ p Cn = {p} × Cn
p∈M
is a vector bundle with base manifold M and fiber complex vector space C n .
1.4 Symmetry and Symmetry-Breaking 13
breaking is inevitably needed for the unification. The PID-induced gauge symmetry-breaking,
by the authors (Ma and Wang, 2015a, 2014h,d), offers a symmetry-breaking mechanism
based only on the first principle; see also Chapter4 for details.
Second, for a multi-particle and multi-level system, its action is dictated by a set of
SU(N1 ), · · · , SU(Nm ) gauge symmetries, and the governing field equations will break some
of these gauge symmetries; see Chapter 6 .
Third, in astrophysical fluid dynamics, one difficulty we encounter is that the Newtonian
Second Law for fluid motion and the diffusion law for heat conduction are not compatible
with the principle of general relativity. Also, there are no basic principles and rules for com-
bining relativistic systems and the Galilean systems together to form a consistent system.
The distinction between relativistic and Galilean systems gives rise to an obstacle for estab-
lishing a consistent model of astrophysical dynamics. This difficulty can be circumvented
by using the above mentioned symmetry-breaking principle, where we have to chose the
coordinate system
xµ = (x0 , x), x0 = ct and x = (x1 , x2 , x3 ),
Here gi j (1 ! i, j ! 3) are the spatial metric, and ψ represents the gravitational potential.
The resulting system breaks the symmetry of general coordinate transformations, and we
call such symmetry-breaking as relativistic-symmetry breaking.
1) the general relativity and the gauge symmetries dictate the Lagrangian;
1.5 Unified Field Theory Based on PID and PRI 15
2) the coupling of the four interactions is achieved through PID, PRI and
PSB in the unified field equations, which obey the PGR and PRI, but break
spontaneously the gauge symmetry; and
3) the unified field model can be easily decoupled to study individual inter-
action, when the other interactions are negligible.
Hereafter we address briefly the main ingredients of the unified field theory.
Lagrangian action
Following the simplicity principle of laws of Nature as stated in Principle 2.2, the three
basic symmetries—the Einstein general relativity, the Lorentz invariance and the gauge
invariance—uniquely determine the interaction fields and their Lagrangian actions for the
four interactions:
• The Lagrangian action for gravity is the Einstein-Hilbert functional given by (1.2.2);
• The field describing electromagnetic interaction is the U(1) gauge field {A µ }, repre-
senting the electromagnetic potential, and the Lagrangian action density is
1
LEM = − Aµν Aµν + ψ e (iγ µ Dµ − me )ψ e , (1.5.1)
4
in which the first term stands for the scalar curvature of the vector bundle M ⊗ p C4 .
The covariant derivative and the field strength are given by
• For the weak interaction, the SU(2) gauge fields {Wµa |a = 1, 2, 3} are the interacting
fields, and the SU(2) Lagrangian action density LW for the weak interaction is the
standard Yang-Mills action density as given by (1.3.2).
• The SU(3) gauge action density LS for the strong interaction is also in the standard
Yang-Mills form given by (1.3.2), and the strong interaction fields are the SU(3)
gauge fields {Skµ | 1 ! k ! 8}, representing the 8 gluon fields.
It is clear that the action coupling the four fundamental interactions is the natural com-
bination of the Einstein-Hilbert functional, the standard U(1), SU(2), SU(3) gauge actions
for the electromagnetic, weak and strong interactions:
) * ! √
L {g µν }, Aµ , {Wµa }, {Skµ } = [LEH + LEM + LW + LS ] −gdx, (1.5.2)
M
which obeys all the symmetric principles, including principle of general relativity, the
Lorentz invariance, the U(1) × SU(2) × SU(3) gauge invariance and PRI.
16 Chapter 1 General Introduction
1 8π G + ,
Rµν − gµν R + 4 Tµν = ∇µ + α 0 Aµ + αb1Wµb + αk2 Skµ φνG , (1.5.3)
2 c + ,
µ
∂ (∂µ Aν − ∂ν Aµ ) − eJν = ∇ν + β 0 Aν + βb1Wνb + βk2Sνk φ e , (1.5.4)
+ ,
w
Gab ∂ µ Wµνb
− gw λcdb αβ c
g Wαν Wβd − gw Jν a (1.5.5)
# $
1 2
= ∇ν + γ Aν + γb Wν + γk Sν − mw xν φaw ,
0 1 b 2 k
4
+ ,
j j αβ c
Gksj ∂ µ Sµν − gs Λcd g Sαν Sβd − gs Qν k (1.5.6)
# $
1
= ∇ν + δ 0 Aν + δb1Wνb + δk2 Sνk − m2s xν φks ,
4
(iγ µ Dµ − m)Ψ = 0, (1.5.7)
where Ψ = (ψ e , ψ w , ψ s )T stands for the wave functions for all fermions, participating re-
spectively the electromagnetic, the weak and the strong interactions, and the current densi-
ties are defined by
Jν = ψ e γν ψ e , Jν a = ψ w γν σa ψ w , Qν k = ψ s γν τk ψ s . (1.5.8)
Equations (1.5.7) are the gauge covariant Dirac equations for all fermions participating
the four fundamental interactions. The left-hand sides of the field equations (1.5.3)-(1.5.6)
are the same as the classical Einstein equations and the standard U(1) × SU(2) × SU(3)
gauge field equations. The right-hand sides of are due to PID, and couple the four funda-
mental interactions.
Duality
In the field equations (1.5.3)-(1.5.6), there exists a natural duality between the inter-
action fields (g µν , Aµ ,Wµa , Skµ ) and their corresponding dual fields (φ µG , φ e , φaw , φks ). This
duality can be viewed as the duality between mediators and the duality between interacting
forces, summarized as follows:
These are longstanding problems, and the new field theory for strong interaction based
on PID and PRI completely solves these open problems. We now present some basic in-
gredients of this new development, and we refer the interested readers to (Ma and Wang,
2014c) and Section 4.5 for details.
The strong interaction field equations decoupled from the unified field model are
+ , # $
j j 1
Gksj ∂ µ Sµν − gs Λcd gαβ Sαν
c
Sβd − gs Qν k = ∂ν + δk2 Sνk − m2s xν φks , (1.6.1)
4
+ ) * ,
iγ µ ∂µ + igs Skµ τk − mq ψ = 0. (1.6.2)
1. Gluons and dual scalar gluons. For the strong interaction, the field equations induce
a duality between the eight spin-1 massless gluons, described by the SU(3) gauge fields
{Skµ | k = 1, · · · , 8}, and the eight spin-0 dual gluons, described by the dual fields {φ ks | a =
1, · · · , 8}. As we shall see from the explianation of quark confinements gluons and their
dual gluons are confined in hadrons.
2. Attracting and repulsive behavior of strong force. As indicated before and from the
strong interaction potentials derived hereafter, strong interaction can display both attracting
and repulsive behavior. The repulsive behavior is due mainly to the spin-1 gluon fields, and
the attraction is caused by the spin-0 dual gluon fields.
def def
Sµ = δk2 Skµ = (S0 , S1 , S2 , S3 ), Φ = S0 . (1.6.3)
1.6 Theory of Strong Interactions 19
The temporal component S0 , denoted by Φ, gives rise to the strong force between two
elementary particles carrying strong charges:
From the field equations (1.6.1) and (1.6.2), we derive in Section 4.5.2 that for a particle
with N strong charges and radius ρ , its strong interaction potential are given by (4.5.39) and
recast here convenience:
# $
1 A
Φ = gs (ρ ) − (1 + kr)e −kr
,
r ρ
! "3 (1.6.5)
ρw
gs (ρ ) = N gs ,
ρ
where gs is the strong charge of w∗ -weakton, the ρw is the radius of the w∗ -weakton, A is a
dimensionless constant depending on the particle, and 1/k is the radius of strong attraction
of particles. Phenomenologically, we can take
⎧ −18
⎪
⎪ 10 cm for w∗ − weaktons,
⎪
⎪
1 ⎨ 10
−16
cm for quarks,
= (1.6.6)
k ⎪ 10
⎪ −13
cm for neutreons,
⎪
⎪
⎩ −7
10 cm for atom/molecule,
and the resulting layered formulas of strong interaction potentials are called the w ∗ -weakton
potential Φ0 , the quark potential Φq , the nucleon/hadron potential Φn and the atom/molecule
potential Φa .
Quark confinement
Quark confinement is a challenging problem in physics. Quark model was confirmed by
many experiments. However, no any single quark is found ever. This fact suggests that the
quarks were permanently bound inside a hadron, which is called the quark confinement. Up
to now, no other theories can successfully describe the quark confinement phenomena. The
direct reason is that all current theories for interactions fail to provide a successful strong
interaction potential to explain the various level strong interactions.
With the strong interaction potential (1.6.5), we can calculate the strong interaction
bound energy E for two same type of particles:
# $
1 A
E = g2s (ρ ) − (1 + kr)e−kr . (1.6.7)
r ρ
The quark confinement can be well explained from the viewpoint of the strong quark
bound energy Eq and the nucleon bound energy En . In fact, by (1.6.7) we can show that
This clearly shows that the quarks is confined in hadrons, and no free quarks can be found.
Asymptotic freedom was discovered and described by (Gross and Wilczek, 1973; Politzer,
1973). Using the strong interaction potential (1.6.5), we can clearly demonstrate the asymp-
totic freedom property and the short-ranged nature of the strong interaction.
1. Higgs fields from first principles. The right-hand side of (1.7.1) is due to PID, leading
naturally to the introduction of three scalar dual fields. The left-hand side of (1.7.1) repre-
sents the intermediate vector bosons W ± and Z, and the dual fields represent two charged
Higgs H ± (to be discovered) and the neutral Higgs H 0 , with the later being discovered by
LHC in 2012.
It is worth mentioning that the right-hand side of (1.7.1), involving the Higgs fields, can
not be generated by directly adding certain terms in the Lagrangian action, as in the case for
the new gravitational field equations derived in (Ma and Wang, 2014e).
2. Duality. We establish a natural duality between weak gauge fields {Wµa }, representing
the W ± and Z intermediate vector bosons, and three bosonic scalar fields φaw , representing
both two charged and one neutral Higgs particles H ± , H 0 .
4. Weak charge and weak potential. By PRI, the SU(2) gauge coupling constant g w
plays the role of weak charge, responsible for the weak interaction.
1.8 New Theory of Black Holes 21
Also, PRI induces an important SU(2) constant vector {γb1 }. The components of this
vector represent the portions distributed to the gauge potentials Wµa by the weak charge gw .
Hence the (total) weak interaction potential is given by the following PRI representation
invariant
Wµ = γa1Wµa = (W0 ,W1 ,W2 ,W3 ), (1.7.3)
and the weak charge potential and weak force are as
In this section, we present the black hole theorem proved in Section 7.3, and clarify
some of the confusions in the literature. This black hole theorem leads to important insights
to many problems in astrophysics and cosmology, which will be addressed in details in
Chapter 7.
Blackhole Theorem (Ma and Wang, 2014a) Assume the validity of the Einstein
theory of general relativity, then the following assertions hold true:
1) black holes are closed: matters can neither enter nor leave their interiors,
2) black holes are innate: they are neither born to explosion of cosmic ob-
jects, nor born to gravitational collapsing, and
3) black holes are filled and incompressible, and if the matter field is non-
homogeneously distributed in a black hole, then there must be sub-blackholes
in the interior of the black hole.
This theorem leads to drastically different views on the structure and formation of our
Universe, as well as the mechanism of supernovae explosion and the active galactic nucleus
(AGN) jets. Here we only make a few remarks, and refer interested readers to Section 7.3
for the detailed proof.
1. An intuitive observation. One important part of the theorem is that all black holes are
closed: matters can neither enter nor leave their interiors. Classical view was that nothing
can get out of blackholes, but matters can fall into blackholes. We show that nothing can
get inside the blackhole either.
To understand this result better, let’s consider the implication of the classical theory
that matters can fall inside a blackhole. Take as an example the supermassive black hole
at the center of our galaxy, the Milky Way. By the classical theory, this blackhole would
continuously gobble matters nearby, such as the cosmic microwave background (CMB). As
the Schwarzschild radius of the black hole r = Rs is proportional to the mass, then the radius
Rs would increase in cubic rate, as the mass M is proportional to the volume. Then it would
be easy to see that the black hole will consume the entire Milky Way, and eventually the
entire Universe. However, observational evidence demonstrates otherwise, and supports our
result in the blackhole theorem.
metric at r = Rs
! "−1
2 u 2 2 r2
ds = −e c dt + 1 − 2 dr2 + r2 d θ 2 + r2 sin2 θ d ϕ 2 , (1.8.3)
Rs
is a true physical singularity, and defines the black hole boundary.
3. Geometric realization of a black hole. As described in Section 4.1 in (Ma and Wang,
2014a) and in Section 7.3.1, the geometrical realization of a black hole, dictated by the
Schwarzschild and TOV metrics, clearly manifests that the real world in the black hole is a
hemisphere with radius Rs embedded in R4 , and at the singularity r = Rs , the tangent space
of the black hole is perpendicular to the coordinate space R3 .
This geometric realization clearly demonstrates that the disk in the realization space R 3
is equivalent to the real world in the black hole. If the outside observer observes that nothing
gets inside the black hole, then nothing will get inside the black hole in the reality as well.
Basic assumptions
(a) the Einstein theory of general relativity, and
(b) the cosmological principle.
With Assumption (a) above, we have our blackhole theorem, Theorem 7.15, at our dis-
posal. Then we can draw a number of important conclusions on the structure of our Uni-
verse.
1. Let E and M be the total energy and mass of the Universe:
where Ψ is the energy of all interaction fields. The total mass M dictates the Schwarzschild
radius Rs .
24 Chapter 1 General Introduction
If our Universe were born to the Big-Bang, assuming at the initial stage, all energy is
concentrated in a ball with radius R0 < Rs , by the theory of black holes, then the energy
contained in BR0 must generate a black hole in R3 with fixed radius Rs as defined by (1.8.2).
If we assume that at certain stage, the Universe were contained in ball of a radius R with
R0 < R < Rs , then we can prove that the Universe must contain a sub-black hole with radius
r given by
1
R
r= R.
Rs
Based on this property, the expansion of the Universe, with increasing R to R s , will give rise
to an infinite sequence of black holes with one embedded to another. Apparently, this sce-
nario is clearly against the observations of our Universe, and demonstrates that our Universe
cannot be originated from a Big-Bang.
2. By the cosmological principle, given the energy density ρ 0 > 0 of the Universe, based
on the Schwarzschild radius, the Universe will always be bounded in black hole, which is
an open ball of radius:
2
3c2
Rs = .
8π Gρ0
This immediately shows that there is no unbounded universe. Consequently, since a black
hole is unable to expand and shrink, we arrive immediately from the above analysis that our
Universe must be static, and not expanding.
Notice that the isotropy requirement in the cosmological principle excludes the globular
open universe scenario. Consequently, we have shown that our Universe must be a closed
3D sphere S3 .
In summary, we have proved two theorems, Theorems 7.27-7.28, on the geometry and
structure of our Universe, which have been discovered and proved in (Ma and Wang, 2014a,
Theorems 6.2 & 6.3):
It is clear that this theorem changes drastically our view on the geometry and the origin
of the Universe. Inevitably, a number of important questions need to be addressed for this
scenario of our Universe. Hereafter we examine a few most important problems.
Redshift problem
The natural and important question that one has to answer is the consistency with astro-
nomical observations, including the cosmic edge, the flatness, the horizon, the redshift, and
the cosmic microwave background (CMB) problems. These problems can now be easily
understood based on the structure of the Universe and the blackhole theorem we derived.
Hereafter we focus only on the redshift and the CMB problems.
The most fundamental problem is the redshift problem. Observations clearly show that
light coming from a remote galaxy is redshifted, and the farther away the galaxy is, the
larger the redshift. In modern astronomy and cosmology, it is customary to characterize the
redshift by a dimensionless quantity z in the formula
λobserv
1+z = , (1.9.2)
λemit
where λobserv and λemit represent the observed and emitting wavelenths.
There are three sources of redshifts: the Doppler effect, the cosmological redshift, and
the gravitational redshift. If the Universe is not considered as a black hole, then the gravi-
tational redshift and the cosmological redshift are both too small to be significant. Hence,
modern astronomers have to think that the large port of the redshift is due to the Doppler
effect.
However, due to black hole properties of our Universe, the black hole and cosmological
redshifts cannot be ignored. Due to the horizon of the sphere, for an arbitrary point in the
spherical Universe, its opposite hemisphere relative to the point is regarded as a black hole.
Hence, g00 can be approximatively taken as the Schwarzschild solution for distant objects
as follows
! "
Rs
−g00 (r) = α (r) 1 − , α (0) = 2, α (Rs ) = 1, α ′ (r) < 0,
r
&
where &r = 2Rs − r for 0 ! r < Rs is the distance from the light source to the opposite radial
point, and r is the distance from the light source to the point. Then we derive the following
redshift formula, which is consistent with the observed redshifts:
√
1 2Rs − r
1+z = 2 ! " = 3 for 0 < r < Rs . (1.9.3)
Rs α (r)(Rs − r)
α (r) 1 −
r
&
26 Chapter 1 General Introduction
CMB problem
In 1965, two physicists A. Penzias and R. Wilson discovered the low-temperature cos-
mic microwave background (CMB) radiation, which fills the Universe, and it has been re-
garded as the smoking gun for the Big-Bang theory. However, based on the unique scenario
of our Universe we derived, it is the most natural thing that there exists a CMB, because
the Universe has always been there as a black-body, and CMB is a result of blackbody
equilibrium radiation.
Conclusion 4) in the above results for our Universe shows that the observable cosmic
mass M, and the total mass Mtotal which includes both M and the non-observable mass
caused by space curvature energy, enjoy the following relation:
The difference Mtotal − M can be regarded as the dark matter. Astronomical observations
have shown that the measurable mass M is about one fifth of total mass Mtotal .
Also, the static Universe has to possess a negative pressure to balance the gravitational
attracting force. The negative pressure is actually the effect of the gravitational repelling
force, attributed to dark energy.
Equivalently, the above interpretation of dark matter and dark energy is consistent with
the theory based on the new PID gravitational field equations discovered by (Ma and Wang,
2014e) and addressed in detail in Chapter 4. It is clear now that gravity can display both at-
tractive and repulsive effect, caused by the duality between the attracting gravitational field
{g µν } and the repulsive dual vector field {Φ µ }, together with their nonlinear interactions
governed by the field equations. Consequently, dark energy and dark matter phenomena are
simply a property of gravity. The detailed account of this relation is addressed in Section
7.6, based in part on (Hernandez, Ma and Wang, 2015).
PID-cosmological model
One of the main motivations for the introduction of the Big-Bang theory and for the
expanding universe is that the Friedamann solution of the Einstein gravitational field equa-
tions demonstrated that the Einstein theory must produce a variable size universe; see Con-
clusions of Friedmann Cosmology 7.23. Also, it is classical that bringing the cosmological
constant Λ into the Einstein field equations will lead to an unstable static Universe.
We have demonstrated that the right cosmological model should be derived from the new
gravitational field equations (1.2.7), taking into consideration the presence of dark matter
and dark energy. In this case, based on the cosmological principle, the the metric of a
1.10 Supernovae Explosion and AGN Jets 27
where R = R(t) is the cosmic radius. We deduce then from (1.2.7) the following PID-
cosmological model, with ϕ = φ ′′ :
! "
4π G 3p ϕ
R′′ = − ρ+ 2 + R,
3 c 8π G
1 (1.9.6)
(R′ )2 = (8π Gρ + ϕ )R2 − c2 ,
3
3R′ 24π G R′
ϕ′ + ϕ =− 2 p,
R c R
supplemented with the equation of state:
p = f (ρ , ϕ ). (1.9.7)
mGr0 r1 β T0 − T1
Re = , (1.10.1)
κν r1 − r0
where T0 and T1 are the temperatures at the bottom and top of an annular shell region r0 <
r < r1 ; see e.g. (7.1.74) for other notations used here.
Based on our theory of black holes, including in particular the incompressibility and
closedness of black holes, the relativistic effect is described by the δ -factor:
2mG
δ= . (1.10.2)
c2 r0
28 Chapter 1 General Introduction
As far as we know, there is still no good model for a multi-particle system, which takes
also into consideration the dynamic interactions between particles. The main obstacle for
establishing a field theory for an interacting multi-particle system is the lack of basic prin-
ciples to describe the dynamic interactions of the particles.
Basic postulates for interacting multi-particle systems
To seek the needed principles, we proceed with three observations.
1. One natural outcome of the field theory of four interactions developed recently by
the authors and addressed in the previous sections is that the coupling constants for the
U(1) × SU(2) × SU(3) gauge theory play the role of the three charges e, g w and gs for
electromagnetism, the weak and the strong interaction. These charges generate interacting
fields among the interacting particles.
Now we consider an N-particle system with each particle carrying an interaction charge
g. Let this be a fermionic system, and the Dirac spinors be given by
Ψ = (ψ1 , · · · , ψN )T ,
where G = (Giµj ) is an Hermitian matrix, representing the interacting potentials between the
N-particles generated by the interaction charge g.
Now let
{τ0 , τ1 , · · · , τK | K = N 2 − 1}
be a basis of the set of all Hermitian matrices, where τ0 = I is the identity, and τa (1 ! a !
ij
N 2 − 1) are the traceless Hermitian matrices. Then the Hermitian matrix G = (G µ ) and the
differential operator D µ in (1.11.1) can be expressed as
G = G0µ I + Gaµ τa ,
Dµ = ∂µ + igG0µ + igGaµ τa .
2. The energy contributions of the N particles are indistinguishable, which implies the
SU(N) gauge invariance. Hence (1.11.3) are exactly the Dirac equations in the form of
SU(N) gauge fields {Gaµ | 1 ! a ! N 2 − 1} with a given external interaction field G0µ .
30 Chapter 1 General Introduction
3. The SU(N) gauge theory of N particles must obey PRI. Consequently there exists a
constant SU(N) tensor
αaN = (α1N , · · · , αNN ),
Gµ = αaN Ga (1.11.4)
is independent of the SU(N) representation τa , and is the interaction field which can be
experimentally observed.
With these three observations, it is natural for us to introduce the following postulate,
which is also presented as Postulates 6.25-6.27 in Chapter 6:
1) the Lagrangian action for an N-particle system satisfy the SU(N) gauge
invariance;
2) gGaµ represent the interaction potentials between the particles; and
3) for an N-particle system, only the interaction field G µ in (1.11.4) can be
measured, and is the interaction field under which this system interacts
with other external systems.
Multi-particle systems are layered, and with the above postulates and basic symmetry
principles, we are able to determine in a unique fashion field equations for different multi-
particle systems.
For example, given an N-particle system consisting of N fermions with given charge g,
the SU(N) gauge symmetry dictates uniquely the Lagrangian density, given in two parts:
1) the sector of SU(N) gauge fields LG , and 2) the Dirac sector of particle fields LD ,
as described earlier in the SU(N) gauge theory. The combined action is 1) SU(N) gauge
invariant, 2) representation invariant (PRI), and 3) Lorentz invariant. The field equations are
then derived by using PID:
+ , # $
ν b b αβ c d k2 0
Gab ∂ Gν µ − gλcd g Gα µ Gβ − gΨγµ τa Ψ = ∂µ − xµ + gGµ + gGµ φa , (1.11.5)
4
4 5
iγ µ ∂µ + igG0µ + igGaµ τa Ψ − MΨ = 0, (1.11.6)
for 1 ! a ! N 2 − 1, where G0µ is the interaction field of external systems, and G µ = αa Gaµ .
It is important to note that coupling to the external fields is achieved by the terms involving
G0µ .
1.12 Weakton Model of Elementary Particles 31
His point was that, from an aesthetic point of view, the left hand side of the
equation which describes the gravitational field is based on a beautiful geo-
metrical principle, whereas the right hand side, which describes everything
else,... looks arbitrary and ugly.
... [today] Since gauge fields are based on a beautiful geometrical principle,
one may shift them to the left hand side of Einstein s equation. What is left
on the right are the matter fields which act as the source for the gauge fields ...
Can one geometrize the matter fields and shift everything to the left?
Basically, one needs to geometrize the energy-momentum tensor Tµν appearing in the
Einstein field equations. For example, for multi-particle system under gravity and elec-
tromagnetism, using the basic postulates as outlined above, a unified field model can be
naturally derived so that the energy-momentum tensor Tµν is derived from first principles
and is geometrized.
The above vision of Einstein and Nambu is achieved by using the postulate for interact-
ing multi-particle systems, the PID, the PRI, as well as the principle of symmetry-breaking
(PSB):
1) fundamental symmetries of Nature dictate the actions for both the subsys-
tems as well as the global system,
2) the subsystems are coupled through PID, PRI and PSB.
We believe that this is the essence of physical modeling and a unique route for unifica-
tion.
Great achievements and insights have been made for last 100 years on the understand-
ing of the structure of subatomic particles and on their interactions; see among many others
(Halzen and Martin, 1984; Griffiths, 2008; Kane, 1987; Quigg, 2013). However, there are
still many longstanding open questions and challenges. Here are a few fundamental ques-
tions which are certainly related to the deepest secrets of our Universe. One such problem
is that why leptons do not participate in strong interactions.
The starting point of the study is the puzzling decay and reaction behavior of subatomic
particles. For example, the electron radiations and the electron-positron annihilation into
photons or quark-antiquark pair clearly shows that there must be interior structure of elec-
trons, and the constituents of an electron contribute to the making of photon or the quark in
the hadrons formed in the process. In fact, all sub-atomic decays and reactions show clearly
the following conclusion:
This conclusion motivates us to propose a model for sub-lepton, sub-quark, and sub-
mediators. It is clear that any such model should obey four basic requirements:
4) Weakton confinement.
The model should lead to consistency of masses for both elementary particles, which
we call weaktons, and composite particles (the quarks, leptons and mediators). Since the
mediators, the photon γ and the eight gluons are all massless, a natural requirement is that
w∗ , w1 , w2 , νe , νµ , ντ ,
(1.12.1)
w , w1 , w2 , ν e , ν µ , ν τ ,
∗
where νe , νµ , ντ are the three generation neutrinos, and w∗ , w1 , w2 are three new particles,
which we call w-weaktons. These weaktons in (1.12.1) are endowed with the quantum
1.12 Weakton Model of Elementary Particles 33
numbers: electric charge Qe , weak charge Qw , strong charge Qs , weak color charge Qc ,
baron number B, lepton numbers Le , Lµ , Lτ , spin J, and mass m. The quantum numbers of
weaktons are listed in Table 1.1.
e = νe w1 w2 , µ = νµ w1 w2 , τ = ντ w1 w2 ,
∗
u = w w1 w1 , c = w∗ w2 w2 , t = w∗ w2 w2 , (1.12.2)
∗ ∗ ∗
d = w w1 w2 , s = w w1 w2 , b = w w1 w2 ,
where c,t and d, s, b are distinguished by the spin arrangements; see (5.3.20).
The weakton constituents of the mediators and their dual mediators are given by
taking into consideration of neutrino oscillations. When examining decays and reactions of
sub-atomic particles, it is apparent for us to predict the existence of this mediator.
With the above weakton constituents of charged leptons, quarks and mediators, we can
then verify the four basic requirements as mentioned earlier, based in part on the weak
interaction potentials.
1. Mass generation. One important conclusion of the aforementioned weakton model is
that all particles—both matter particles and mediators—are made up of massless weaktons.
A fundamental question is how the mass of a massive composite particle is generated. In
fact, based on the Einstein formulas:
2 2
d⃗ v2 ⃗ v2 E
P = 1 − 2 F, m = 1− 2 2, (1.12.5)
dt c c c
we observe that a particle with an intrinsic energy E has zero mass m = 0 if it moves in the
speed of light v = c, and possess nonzero mass if it moves with a velocity v < c. Hence by
this mass generation mechanism, for a composite particle, the constituent massless weak-
tons can decelerate by the weak force, yielding a massive particle.
In principle, when calculating the mass of the composite particle, one should also con-
sider the bounding and repelling energies of the weaktons, each of which can be very large.
Fortunately, the constituent weaktons are moving in the “asymptotically-free” shell region
of weak interactions as indicated by the weak interaction potential/force formulas, so that
the bounding and repelling contributions to the mass are mostly canceled out. Namely,
the mass of a composite particle is due mainly to the dynamic behavior of the constituent
weaktons.
2. Consistency and removal of unrealistic compositions. The consistency can be easily
checked. Also a few simple quantum rules can be devised so that unrealistic combinations
of weaktons are easily excluded.
3. The weakton confinement is simply the direct consequence of the weak interaction
potential/force formulas.
Mechanism of decays
Remarkably, the weakton model offers a perfect explanation for all sub-atomic decays.
In particular, all decays are achieved by 1) exchanging weaktons and consequently exchang-
ing newly formed quarks, producing new composite particles, and 2) separating the new
composite particles by weak and/or strong forces.
One aspect of this decay mechanism is that we know now the precise constituents of
particles involved in all decays/reactions both before and after the reaction. It is therefore
believed that the new decay mechanism provides clear new insights for both experimental
and theoretical studies.
Chapter 2
Fundamental Principles of Physics
The purposes of this chapter are 1) to provide an intuitive introduction to fundamental prin-
ciples of Nature, and 2) to explain how the laws of Nature are derived based on these princi-
ples. The main focus is on the symbiotic interplay between advanced mathematics and the
laws of Nature. For this purpose, we start with a brief overview on the perspective and the
physical significance of a few fundamental principles.
Section 2.1 provides a basic intuitive introduction to fundamental principles, symme-
tries, the geometric interaction mechanism and the symmetry-breaking principle. We start
with two guiding principles of theoretical physics, which can be synthesized as: the laws
of Nature are represented by mathematical equations, are dictated by a few fundamental
principles, and always take the simplest and aesthetic forms.
The geometric interaction mechanism was originally motivated by Albert Einstein’s vi-
sion revealed in his principle of equivalence, and was first postulated in (Ma and Wang,
2014d).
Symmetry plays a fundamental role in physics. Many, if not all, physical systems obey
certain symmetry. For this reason, many fundamental principles in physics address the
underlying symmetries of physical systems.
However, a crucial component of the unification of four fundamental interactions as
well as the modeling of multi-level physical systems is the symmetry-breaking mechanism.
Consequently, we postulated in (Ma and Wang, 2014a) and in Section 2.1.7 a principle
of symmetry-breaking 2.14, which states that for a system coupling different levels of
physical laws, part of these symmetries of the subsystems must be broken.
Section 2.2 addresses essentials of the classical Lorentz invariance, and its applications
to the derivation of Schrödinger equation, the Klein-Gordon equation, the Weyl equations
and the Dirac equations in relativistic quantum mechanics.
Section 2.3 presents a brief introduction to the Einstein theory of general relativity,
focusing on Einstein’s two basic principles, the principle of equivalence and the principle
of general relativity, and on the derivation of the Einstein gravitational field equations.
A brief introduction to both the U(1) abelian and the SU(N) non-abelian gauge theories
are presented in Section 2.4. In particular, the U(1) abelian gauge theory is defined on the
36 Chapter 2 Fundamental Principles of Physics
complex spinor bundle M 4 ⊗ p C4 . The SU(N) non-abelian gauge theory, also called the
Yang-Mills theory, is the connection on the complex spinor bundle M 4 ⊗ p (C4 )N . Note that
the presentation here leads clearly to PRI, to be introduced in Chapter 4.
Section 2.5 introduces the principle of Lagrangian dynamics (PLD) and its many appli-
cations such as in classical mechanics and electrodynamics. The Noether Theorem, Theo-
rem 2.38, is also proved.
Section 2.6 is on Hamiltonian dynamics and its connections to the Lagrangian dynamics.
First Principle of Physics 2.1 Physical systems obey laws and principles of Nature:
2) These laws and principles are universal, and the universality is reflected by symme-
tries in physics; and
3) Physical symmetries dictate the precise forms of the mathematical models, which are
often presented in the forms of ordinary or partial differential equations.
In addition, the following principle provides the essence of fundamental physics, and is
supported by known experimental facts.
Essence of Physics 2.2 The essence of physics can be drawn in the following two
important ingredients:
2) The laws of Nature always take the simplest and aesthetic forms.
First Principle 2.1 and Essence of Physics 2.2 serve as guiding principles to understand
and explore the physical world of our Universe and the laws of Nature.
F = −mMΦ(r),
Φ(r1 ) r2 v21
= . (2.1.2)
Φ(r2 ) r1 v22
By the Kepler’s third law, the periods T1 and T2 of the two planets satisfy
T12 r3
2
= 13 , Ti vi = 2π ri for i = 1, 2,
T2 r2
where G is a constant, called the gravitational constant. Thus, by the Kepler’s third law, one
can postulate naturally the Newton’s gravitational law (2.1.1). In other words, the Newton’s
gravitational law can be regarded as a phenomenological theory.
Albert Einstein was the first who tried to deduce basic physical laws from the first prin-
ciples. For example, the Einstein theory of general relativity and the Einstein gravitational
field equations:
1 8π G
Rµν − gµν R = − 4 Tµν , (2.1.4)
2 c
are derived based on the following three basic principles:
The Schwarzschild solution of the Einstein gravitational field equations (2.1.4) offers a nat-
ural link between the field equations and the Newtonian gravitational law (2.1.1).
Fundamental first principles refer to the laws of Nature that cannot be derived from other
more basic laws.
Based on Essence of Physics 2.2, if we would like to better understand theoretical
physics, it is crucial to know and find out all fundamental laws. In this subsection, we
shall list the known and important principles in various physical fields, most of which will
be introduced in later chapters.
We start with the introduction of Principle of Lagrangian Dynamics (PLD), which is of
special importance.
In classical mechanics, we know the least action principle. For a mechanical system
with the position and velocity variables
let the kinetic energy T and the potential energy V be functions of the position and velocity
variables in (2.1.5). Then the states of the system are the extremum points of the functional
! t1
L(x) = L (x, ẋ)dt, (2.1.6)
t0
is called the Lagrange density, and the functional (2.1.6) is called the Lagrange action. The
extremum points x satisfy the variational equation, called the Euler-Lagrange equations of
(2.1.6):
δ L(x) = 0, (2.1.8)
where δ L is the variational derivative operator of L. The equation (2.1.8) can be equivalently
expressed as
d ∂L ∂L
− = 0.
dt ∂ ẋ ∂x
The most important point is that the least action principle can be generalized to all
physical systems describing motions, as well as to fundamental interactions of Nature. The
generalization in a motion system is called the PLD, and in an interaction system is called
the Principle of Interaction Dynamics (PID). PID will be introduced in detail In Chapter 4
of this book, and PLD is stated as follows.
Principle of Lagrangian Dynamics 2.3 For a physical motion system, there are
functions
u = (u1 , · · · , un ), (2.1.9)
which describe the states of this system, and there exists a functional of u, given by
!
L(u) = L (u, Du, · · · , Dm u)dx, (2.1.10)
Ω
where Ω is the domain of u, and Dk u is the k-th derivative of u for any 0 ! k ! m. Then the
state functions of this system are the extremum points of (2.1.10). Namely the state functions
satisfy the variational equation of (2.1.10):
δ L(u) = 0. (2.1.11)
The functional L is called the Lagrange action, and L is called the Lagrange density.
By PLD, to derive dynamical equations for a physical system, it suffices to find the
corresponding Lagrange action. In the next subsection, we demonstrate that symmetries in
physics dictate the precise forms of the Lagrange actions.
A list of known important physical principles and laws of Nature in various subfields of
physics is given as follows.
• Principle of Symmetry-Breaking
2) Classical mechanics:
3) Quantum physics:
4) Statistical physics:
• Le Châterlier Principle.
5) Nonlinear sciences:
Remark 2.4 Because various conservation laws can be deduced by PHD, PLD and
symmetries based on the Noether theorem, these laws are not listed in the fundamental
principles.
Remark 2.5 Mathematical models form the skeleton of theoretical physics, and most,
if not all, mathematical models (differential equations) in theoretic physics can be derived
based on the principles listed above. One of the ultimate goals of this book is to derive
physical models based on a first principles.
2.1 Essence of Physics 41
2.1.4 Symmetry
Symmetry plays an important role in physics. We start with intuitively examining how
symmetry works, keeping in mind the relation between equations and laws of physics in
First Principle 2.1, which can be simply recast as
The laws of Nature on the left hand side of (2.1.12) are often beyond words, and are
best expressed by differential equations. It is this characteristic, together with the Noether
theorem, that illustrates the importance of symmetry in physics, as illustrated below:
Invariance Covariance Symmetry
⇓ ⇓ ⇓
Form of Equation Space Structure Conservation Law
1) the explicit form of differential equations governing the underlying physical system,
2) the space-time structure of the Universe, and the mechanism of fundamental interac-
tions of Nature, and
We now give a simple example to demonstrate how symmetry determines the explicit
form of equations. We begin with two basic implications of a symmetry:
a) Fundamental laws of Nature are universal, and their validity is independent of the
space, time, and directions of experiments and observations, and
b) By (2.1.12), the universality of laws of Nature implies that the differential equations
representing them are covariant. Equivalently the Lagrange actions are invariant un-
der the corresponding coordinate transformations.
As an example, consider a physical system obeying PLD 2.3. For simplicity, we assume
that the Lagrange action of this fictitious system is finite-dimensional. Namely, let
F : Rn → R 1
be a finite dimensional function regarded as the action of the physical system, and we need
to determine the explicit form of this function
Consider the case where the system satisfies the following invariance principle, called the
principle of rotational invariance.
42 Chapter 2 Fundamental Principles of Physics
Principle 2.6 (Rotational Invariance) The laws that the system (2.1.13) obeys are the
same under all orthogonal coordinate systems.
are invariant, where y ∈ Rn is given, and the invariant function F(x) must be of the form:
F(x) = F(|x|, x · y)
which are applicable to different physical fields. Here are the five currently known important
symmetries and their corresponding ingredients:
1) Galileo Invariance:
2) Lorentz Invariance:
4) Gauge Invariance:
5) Representation Invariance:
Principle 2.7 (Galilean Invariance) Mechanical laws are invariant under the following
transformations:
1) Galilean transformation
t = t,
& x& = x + vt for x ∈ R3 , (2.1.16)
t = t + t0 ,
& x& = Ax + b for x ∈ R3 , (2.1.17)
By (2.1.12) we know that the invariance of physical laws is equivalent to the covariance
of the corresponding differential equations. We now demonstrate that the Newton’s Second
Law obeys the Galileo Invariance Principle 2.7.
The Newton’s Second Law reads as
ma = F, (2.1.18)
and the corresponding differential equation describing the motion of a particle is given by
d 2x
m = F. (2.1.19)
dt 2
When we investigate the motion in other reference frame (&
x, &
t ) with the transformation
t = t + t0 ,
& x& = Ax + vt + b, (2.1.20)
by (2.1.20) we derive
d 2 x& d 2x
= A . (2.1.21)
d&t2 dt 2
On the other hand, experiments show that the expressions F& in (&
x, &
t ) and F in (x,t) of force
satisfy the relation:
F& = AF. (2.1.22)
Hence, by (2.1.19) we deduce from (2.1.21) and (2.1.22) that
d 2 x& &
m = F. (2.1.23)
d&t2
It is clear that both forms of (2.1.19) and (2.1.23) are the same under the transformation
(2.1.20), i.e. under the transformations (2.1.16) and (2.1.17).
Mathematically, the transformations corresponding to (2.1.16) and (2.1.17) are the Galileo
group and SO(3), and the functions satisfying (2.1.21) and (2.1.22) under the transformation
(2.1.20) are called first-order Cartesian tensors, which are also the usual vectors in R 3 .
Also, this example demonstrates that invariance principles must be characterized by
corresponding tensors. In the later sections, we shall give precise definitions of various
types of tensors.
Remark 2.8 In classical mechanics, the laws were first discovered phenomenolog-
ically, and then were known to obey the Galileo Principle of Invariance. It is Einstein’s
vision that laws of physics can be derived from symmetries.
2.1 Essence of Physics 45
and the Pythagorean theorem is in general not true in a curved space. Mathematically, a
space M being flat indicates that one can choose properly a coordinate system so that the
metric
g i j = δi j ,
and otherwise, M will be curved.
Regarding to the laws of Nature on our Universe M , physical states are described by
functions u = (u1 , · · · , un ) defined on M :
which are solutions of differential equations associated with the laws of the underlying
physical system:
δ L(Du) = 0, (2.1.28)
where D is a derivative operator, and L is the Lagrange action. Consider two transformations
for the two physical systems (2.1.26) and (2.1.27):
x& = T x for (2.1.26), (2.1.29)
Principle 2.10 (General Relativity) Laws of Physics are the same under all coordinate
systems. Namely, equations (2.1.28) are covariant and equivalently the action L is invariant
under all coordinate transformations (2.1.29).
where Γ depends on the metrics gi j , A is a gauge field, representing the interaction potential,
and g is the coupling constant, representing the interaction charge.
The derivatives defined in (2.1.31) and (2.1.32) are called connections respectively on
M and on the complex vector bundle M ⊗ p Cn . For the connections, we have the following
theorem, providing a mathematical basis for our interaction mechanism.
Theorem 2.12
Consequently, by Principle 2.10 the presence of the gravitational field implies that
the space-time manifold is curved, and, by Principle 2.11, the presence of the electromag-
netic, the weak and strong interactions indicates that the complex vector bundle M ⊗ p Cn
is twisted:
Principle 2.10 ⇒ gi j ̸= δi j ⇒ M is curved,
(2.1.33)
Principle 2.11 ⇒ A ̸= 0 ⇒ M ⊗ p Cn is twisted.
This analogy, together with Einstein’s vision on gravity as the curved effect of space-
time manifold, it is natural for us to postulate the following mechanism for all four interac-
tions.
Geometric Interaction Mechanism 2.13 The gravitational force is the curved effect
of the time-space, and the electromagnetic, weak, strong interactions are the twisted effects
of the underlying complex vector bundles M ⊗ p Cn .
As mentioned earlier, traditionally one adopts Yukawa’s viewpoint that forces of the
interactions of Nature are caused by exchanging the field mediators. Namely, the gravitation
is due to exchanging gravitons, the electromagnetic force is due to exchanging photons, the
weak force is due to exchanging the intermediate vector bosons, and the strong force is due
to exchanging gluons. The point of view we are taking is the Interaction Mechanism 2.13,
following Einstein’s version.
Astrophysics is the only field that involving all the fields in (2.1.34) and (2.1.35). Con-
sequently, one needs to couple the basic laws in (2.1.35) to model astrophysical dynamics.
One difficulty we encounter now is that the Newtonian Second Law for fluid motion
and the diffusion law for heat conduction are not compatible with the principle of general
relativity. Also, there are no basic principles and rules for combining relativistic systems and
the Galilean systems together to form a consistent system. The reason is that in a Galilean
system, time and space are independent, and physical fields are 3-dimensional; while in a
relativistic system, time and space are related, and physical fields are 4-dimensional.
The distinction between relativistic and Galilean systems gives rise to an obstacle for
establishing a consistent model of astrophysical dynamics, coupling all the physical systems
in (2.1.34) and (2.1.35).
In the unified field theory based on PID, the key ingredient for coupling gravity with
the other three fundamental interactions is achieved through spontaneous gauge symme-
try breaking. Here we propose that the coupling between the relativistic and the Galilean
systems through relativistic-symmetry breaking.
In fact, the model given by (7.1.75)-(7.1.76) follows from this symmetry-breaking prin-
ciple, where we have to chose the coordinate system
With this metric (2.1.36)-(2.1.37), we can establish the fluid and heat equations as in (7.1.78).
It is then clear that by fixing the coordinate system to ensure that the metric is in the form
(2.1.36)-(2.1.37), the system breaks the symmetry of general coordinate transformations,
and we call such symmetry-breaking as relativistic-symmetry breaking.
We believe the symmetry-breaking is a general phenomena when we deal with a physical
system coupling different subsystems in different levels. The unified field theory for the four
fundamental interactions is a special case, which couples the general relativity, the Lorentz
and the gauge symmetries. Namely, the symmetry of general relativity needs to be linked to
both the Lorentz invariance and the gauge invariance in two aspects as follows:
2) The gauge-symmetry breaks in the gravitational field equations coupling the other
interactions:
1 8π G
Rµν − gµν R = − 4 Tµν + Dµ Φν , (2.1.38)
2 c
where
k1 k2 k3
Dµ = ∇µ + eAµ + gwWµ + gs Sµ , (2.1.39)
h̄c h̄c h̄c
∇µ is the covariant derivative, ki (1 ! i ! 3) are parameters, A µ ,Wµ , S µ are the total
electromagnetic, weak and strong interaction potentials. It is the terms A µ ,Wµ , S µ in
(2.1.39) that break the gauge symmetry of (2.1.38).
2) for a system coupling different levels of physical laws, part of these symmetries must
be broken.
This principle of symmetry-breaking holds the key component for us to establish the
PID unified field theory for four fundamental interactions in Chapter 4, the field equations
for multi-particle systems in Chapter 6, and the astrophysical fluid dynamical equations in
Chapter 7, resolving a number of important physical problems.
In 1905, Albert Einstein introduced the special theory of relativity, based on the follow-
ing two first principles: the principle of special relativity and the principle of invariance of
the speed of light.
50 Chapter 2 Fundamental Principles of Physics
Principle 2.15 (Special Relativity) Physical laws are covariant in all inertial systems.
Principle 2.16 (Invariance of speed of light) The vacuum speed of light is a universal
constant.
A few remarks are now in order. First, a system of reference is always needed to describe
the nature, and an inertial system is a system on which a freely moving object moves with
constant velocity (Landau and Lifshitz, 1975). Second, it is clear that under the Galilean in-
variance, the speed of light changes in different inertial systems. Consequently, the Galilean
invariance and the Lorentz invariance are incompatible. Third, the invariance of the vacuum
speed of light was verified by the Michelson and Morley experiment. Einstein discovered
his special theory of relativity by postulating, based on Principles 2.15-2.16, the following
principle of Lorentz invariance.
Principle 2.17 (Lorentz Invariance) Physical laws are invariant under Lorentz trans-
formations.
With this principle, the special theory of relativity was developed in two directions:
a) the introduction of relativistic mechanics, replacing the classical mechanics, and b) the
development of relativistic quantum mechanics.
Figure 2.1
We now introduce typical Lorentz transformations 1 . Let (x,t) and (& x, &
t ) be two inertial
systems which are in relativistic motion with a constant velocity v, as shown in Figure 2.1,
where v is in the x1 -axis direction. Then the Lorentz transformation for the two inertial
systems is given by
6 7
x 1 − vt t − x1 v/c2
x1 , x&2 , x&3 ) = 3
(& , x2 , x3 , t=3
& , (2.2.3)
1 − v2/c2 1 − v2/c2
The vacuum speed of light is invariant under the above Lorentz transformation (2.2.3).
x1 /d&
To see this, notice that the speed of light is c& = d& t measured in the x&-coordinate system,
1
and is c = dx /dt measured in the x-coordinate system. Consequently,
x1
d& dx1 − vdt 8 dt − dx1 v/c2 dx1 /dt − v c−v
c& = =3 3 = 1 2
= = c.
dt
& 2
1 − v /c 2 2
1 − v /c 2 1 − (dx /dt)(v/c ) 1 − v/c
where c is the speed of light, and M 4 is endowed with the Riemannian metric:
⎛ ⎞
−1 0 0 0
⎜ 0 1 0 0⎟
(g µν ) = ⎜
⎝ 0 0 1 0⎠ .
⎟ (2.2.5)
0 0 0 1
The doublet {M 4 , g µν } given by (2.2.4) and (2.2.5) is called the Minkowski space, and the
metric (2.2.5) can be equivalently expressed in the form
ds2 = g µν dxµ dxν = −c2 dt 2 + dx2 = −c2 dt 2 + (dx1 )2 + (dx2 )2 + (dx3 )2 . (2.2.6)
where β = ν /c. Both transformations (2.2.3) and (2.2.7) are the same in form, but the math-
ematical implication is changed. Here, (2.2.7) represents the coordinate transformation for
the Minkowski space M 4 . With Einstein’s summation convention, (2.2.7) is often denoted
by
µ
x&µ = Lν xν , (2.2.8)
where
⎛ ⎞
1 β
3 −3 0 0
⎜ 1 −β2 1−β2 ⎟
⎜ ⎟
µ ⎜ β 1 ⎟
(Lν ) = ⎜
⎜− 31 − β 2 3 0 0⎟
⎟ (2.2.9)
⎜ 1−β2 ⎟
⎝ 0 0 1 0⎠
0 0 0 1
µ µ
is the Lorentz matrix, and its inverse (lν ) = (Lν )−1 is given by
⎛ ⎞
1 β
3 3 0 0
⎜ 1−β2 1−β2 ⎟
⎜ ⎟
µ ⎜ β 1 ⎟
(lν ) = ⎜
⎜ 31 − β 2 3 0 0⎟
⎟. (2.2.10)
⎜ 1−β2 ⎟
⎝ 0 0 1 0⎠
0 0 0 1
An important property of the Minkowski space is that its Riemannian metric is invariant
under the Lorentz transformation (2.2.7).
Theorem 2.18 The Minkowski metric (2.2.5) or (2.2.6) is invariant under the co-
ordinate transformation (2.2.7). Namely the metric (& xµ } is the same as that in
g µν ) in {&
µ
{x }:
g µν ) = (g µν ).
(& (2.2.11)
xµ } is also expressed as
In other words, ds2 in {&
The proof of Theorem 2.18 needs to use the properties of tensors. In fact, the Minkowski
metric (2.2.5) is a second-order covariant Lorentz tensor, i.e. under the transformation
(2.2.8), g µν transforms as
g µν ) = (lβα )(g µν )(lβα )T ,
(& (2.2.12)
µ µ
where (lµ ) is the inverse of (Lν ) given by (2.2.10). Then, a direct computation we can get
(2.2.11) from (2.2.12).
2.2 Lorentz Invariance 53
where
v u w u+v
β= , γ= , α= , w= ,
c c c 1 + uv/c2
where w is the velocity composed by u and v by the theory of special relativity.
Thanks to Theorem 2.18, we can define Lorentz transformation group as all linear trans-
formations of the Minkowski space M 4 , that preserve the Minkowski metric; see (2.2.12):
µ µ
LG = {(Lν ) : M 4 → M 4 | (g µν ) = (Lαβ )(g µν )(Lαβ )T , det(Lν ) = 1}. (2.2.13)
Elements of LG are also called Lorentz matrices, and relativistic physics is referred to the
invariance of action under the Lorentz group. The invariance of the Minkowski metric
implies the Lorentz invariance of physical laws.
µ
In this definition, we require det(Lν ) = 1, and such transformations are often called
proper Lorentz transformation in physics literatures. The parity transformation and time
reversal are two special linear transformations of the Minkowski space, which are not ele-
ments in LG as defined in (2.2.13) and are often dealt with separately:
⎛ ⎞ ⎛ ⎞
1 0 0 0 −1 0 0 0
⎜0 −1 0 0⎟ ⎜ 0 1 0 0⎟
P=⎜ ⎝0 0 −1 0 ⎠ ,
⎟ T =⎜⎝ 0 0 1 0⎠ .
⎟ (2.2.14)
0 0 0 −1 0 0 0 1
Lorentz Tensors
We now define Lorentz tensors, also called 4-dimensional (4-D) tensors, corresponding
to the Lorentz invariance.
54 Chapter 2 Fundamental Principles of Physics
Definition 2.19 (Lorentz Tensors) The following quantities are called Lorentz tensors,
or 4-dimensional tensors:
T = {Tµ1 ···µk }, µ1 , · · · , µk = 0, 1, 2, 3,
is called a k-th order covariant tensor, if under the Lorentz transformation (2.2.13),
the components of T transform as
ν
T&µ1 ···µk = lµν11 · · · lµkk Tν1 ···νk ,
µ
where (lµν ) = (Lν )−1 is the inverse transformation of the Lorentz transformation
µ
(Lν ) ∈ LG.
2) A tensor
T = {T µ1 ··· µk }, µ1 , · · · , µk = 0, 1, 2, 3,
µ
is called a k-th order contra-variant tensor, if under the Lorentz transformation (L ν ) ∈
LG, the components of T change as
µ µ
T& µ1 ···µk = Lν11 · · · Lνkk T ν1 ···νk .
3) A function
µ ··· µ
T = {Tν11···νs r }, r + s = k,
µ
is called a k-th order (r, s)-type tensor, if under the Lorentz transformation (L ν ) ∈ LG,
µ ··· µ µ µ β ···β
T&ν11···νs r = lνα11 · · · lναss Lβ 1 · · · Lβrr Tα11···αsr .
1
In physics, the most important tensors are first- and second-order tensors. The following
is a list of commonly encountered 4-D tensors, and we always use (g µν ) = (g µν )−1 with
(g µν ) being the Minkowski metric given by (2.2.5):
1) Position vectors:
xµ = (x0 , x1 , x2 , x3 ), x0 = ct,
xµ = (x0 , x1 , x2 , x3 ) = g µν xν = (−x0 , x1 , x2 , x3 ). (2.2.15)
Aµ = (A0 , A1 , A2 , A3 ),
Aµ = (A0 , A1 , A2 , A3 ) = g µν Aν , (2.2.16)
Aµ = (A0 , A1 , A2 , A3 ), Bµ = (B0 , B1 , B2 , B3 )
A µ B µ = A0 B0 + A1 B1 + A2 B2 + A3 B3 (2.2.20)
is a Lorentz invariant. In fact, under the Lorentz transformation (2.2.8), A µ and Bµ satisfy
&µ = Lνµ Aν , µ
A B&µ = lν Bν .
It follows that
&µ B&µ = Lαµ l µ Aα Bβ .
A (2.2.21)
β
Since (lµν ) = (Lνµ )−1 is the inverse matrix of (Lνµ ), then
µ
Lαµ lβ = δβα .
1) The contractions
Aµ1 ···µk Bµ1 ···µk , (2.2.22)
is a Lorentz invariant, and so are the following contractions:
µ
2) Let Aν be a (1,1) Lorentz tensor, then the contraction
µ
Aµ = A00 + A11 + A22 + A33 (2.2.24)
is a Lorentz invariant.
Theorem 2.20 provides a number of typical Lorentz invariants through contractions, and
in fact, all Lagrange actions in relativistic physics are combinations of the Lorentz invariants
in (2.2.22)-(2.2.25).
u µ = (u0 , u1 , u2 , u3 ),
1 vk /c (2.2.26)
u0 = 3 , uk = 3 for 1 ! k ! 3.
1 − v2/c2 1 − v2/c2
2.2 Lorentz Invariance 57
du µ
aµ = = (a0 , a1 , a2 , a3 ),
ds 6 7
0 1 d 1
a = 3 3 , (2.2.27)
c 1 − v2/c2 dt 1 − v2/c2
6 7
k 1 d vk
a = 3 3 for 1 ! k ! 3.
c2 1 − v2/c2 dt 1 − v2/c2
Fourth, by (2.2.28), we derive the most important formula in relativistic, called the Ein-
stein energy-momentum relation:
E 2 = c 2 P 2 + m2 c4 . (2.2.29)
Remark 2.21 In 1905, Albert Einstein conjectured that a static object with mass m
have energy E, and satisfy the relation, called the Einstein formula:
E = mc2 . (2.2.30)
Then, it follows from (2.2.31) that the energy-momentum (E, cP) of a moving object with
velocity v is taken in the form of (2.2.28), which implies that the relation (2.2.29) holds true.
Hence, the energy-momentum relation (2.2.29) is based on postulating (2.2.30), which was
verified by numerous experiments.
Fifth, in classical mechanics, the Newtonian second law takes the form
d
P = F, P = mv is the momentum.
dt
In the relativistic mechanics, the 4-D force is
! "
dE dP
Fµ = ,c ,
ds ds
58 Chapter 2 Fundamental Principles of Physics
where P = (P1 , P2 , P3 ) is as in (2.2.28). Thus, it follows that the relativistic motion law is
given by
?
d k mvk
P = 1 − v2/c2 F k , Pk = 3 for k = 1, 2, 3. (2.2.32)
dt 1 − v2/c2
2.2.5 Lorentz invariance of electromagnetism
The Maxwell equations for electromagnetic fields are
1 ∂H
curl E = − , (2.2.33)
c ∂t
div H = 0, (2.2.34)
1 ∂ E 4π
curl H = + J, (2.2.35)
c ∂t c
div E = 4πρ , (2.2.36)
where E, H are the electric and magnetic fields, and J is the current density, and ρ the charge
density.
To show the Lorentz invariance of the Maxwell equations (2.2.33)-(2.2.36), we need to
express them in the form of the 4-D electromagnetic potential and current density.
The electromagnetic potential and current density are briefly introduced in (2.2.16) and
(2.2.17):
Aµ = (A0 , A1 , A2 , A3 ),
(2.2.37)
Jµ = (J0 , J1 , J2 , J3 ), J0 = −cρ .
Using the Lorentz tensor operator
! "
∂ ∂ ∂ ∂ ∂
∂µ = = , 1, 2, 3 ,
∂ xµ ∂x ∂x ∂x ∂x
0
∂µ J µ = 0, (2.2.43)
H(l1 , · · · , lm ) = 0, (2.2.49)
H(L1 , · · · , Lm )ψ = 0 (2.2.50)
may give a model describing this system provided the operator H(L 1 , · · · , Lm ) in (2.2.49) is
Hermitian.
Remark 2.24 If the operator H(L1 , · · · , Lm ) in (2.2.49) is irreducible, then (2.2.49)
must describe the system.
Based on Postulates 2.22 and 2.23, we derive three basic equations of relativistic quan-
tum mechanics: the Klein-Gordon equation describing the bosons, the Weyl equations de-
scribing massless free fermions, and the Dirac equations describing massive fermions.
1. Klein-Gordon equation. By the Einstein energy m omentum relation for energy E,
momentum P and the rest mass m:
E 2 − c2⃗P2 − m2c4 = 0,
2.2 Lorentz Invariance 61
and by (2.2.45), we derive from Postulate 2.23 an relativistic equation, called the Klein-
Gordon (KG) equation: #
1 ∂2 ) mc *2 $
2
− ∇ + ψ = 0. (2.2.51)
c2 ∂ t 2 h̄
The KG equation describes a free boson with spin J = 0.
When an electromagnetic field presents, the operators in (2.2.45) and (2.2.46) are rewrit-
ten as1
∂ e
E = ih̄ − eA0 , ⃗P = −ih̄∇ − ⃗A,
∂t c (2.2.52)
P0 = ih̄(⃗σ · ⃗D), P1 = −ih̄(⃗α · ⃗D),
where A µ = (A0 , ⃗A) is the electromagnetic potential, and
⃗D = ∇ + ie ⃗A.
h̄c
Thus, as electromagnetic field presents, the Klein-Gordon equation (2.2.51) is expressed as
! ) mc *2 "
µ
Dµ D − ψ = 0, (2.2.53)
h̄
where e
Dµ = ∂µ + i
Aµ , (2.2.54)
h̄c
is a 4-dimensional vector operator. The expression (2.2.53) is clearly Lorentz invariant.
2. Weyl equations. Based on the de Broglie relation
E = h̄ω , P = h̄/λ , c = ωλ ,
we obtain
E = cP. (2.2.55)
The relation (2.2.55) is valid to a massless free fermion, e.g. as neutrinos. Inserting E and
P0 in (2.2.45) and (2.2.46) into (2.2.55), we obtain that
∂ψ
= c(⃗σ · ∇)ψ , (2.2.56)
∂t
which are called the Weyl equations describing the free massless neutrinos. Here ψ =
(ψ1 , ψ2 )T is a two component Weyl spinor.
3. Dirac equations. For a massive fermion, the de Broglie matter-wave duality relation
can be generalized in the form
Then we have
E = cP ± mc2 . (2.2.57)
Inserting the operators E and P1 in (2.2.45) and (2.2.46) into (2.2.57), and taking the mass
operator as ! " ! "
I 0 1 0
±mc2 = mc2 α0 , α0 = , I= ,
0 −I 0 1
we derive the following Dirac equations:
∂ψ
ih̄ = −ih̄c(⃗α · ∇)ψ + mc2α0 ψ , (2.2.58)
∂t
where ψ = (ψ1 , ψ2 , ψ3 , ψ4 )T is a four-component Dirac spinor, and ⃗α = (α1 , α2 , α3 ) is as
defined by (2.2.48).
Usually, we multiply both sides of (2.2.58) by the matrix α0 , and denote
γ µ = (γ 0 , γ 1 , γ 2 , γ 3 ), (2.2.59)
where
! " ! "
I 0 0 σk
γ 0 = α0 = , γ k = α0 αk = for 1 ! k ! 3.
0 −I − σk 0
The matrices γ µ are the Dirac matrices. Then the Dirac equations (2.2.58) are in the form
) mc *
iγ µ ∂µ − ψ = 0. (2.2.60)
h̄
When we consider the case under an electromagnetic field, we need to replace ∂ µ in
(2.2.60) by D µ given by (2.2.54).
Remark 2.25 The reason why the scalar-valued momentum operators P are taken in
the form of (2.2.46) is due to the following Einstein energy-momentum formulas:
It follows that
R−1 γ µ Lνµ R = γ ν ,
which is equivalent to
µ
R−1 γ µ R = Lν γ ν for µ = 0, 1, 2, 3. (2.2.65)
1 v/c
cosh θ = 3 , sinh θ = 3 .
1 − v2/c2 1 − v2/c2
In retrospect, Albert Einstein must have followed the spirit of this principle for his dis-
covery of the general theory of relativity. As mentioned earlier, coordinate systems, also
called reference systems, are just an indispensable tool to express the laws of physics in the
form of differential equations. Consequently the validity of laws of physics is independent
of coordinate systems. Hence Einstein proposed the following principle of general relativity.
Principle 2.26(General Relativity) Laws of physics are the same under all coordinate
systems, both inertial and non-inertial. In other words, the models describing the laws of
physics are invariant under general coordinate transformations.
• general tensors,
Hence in a general non-inertial system, the metric ds2 takes the form
where {g µν } is a Riemannian metric different from the Minkowski metric given by (2.2.5).
In mathematics, the space M endowed with the metric (2.3.4), denoted by {M , g µν },
is called a Riemannian space, or a Riemannian manifold, and (2.3.4) or {g µν } is the Rie-
mannian metric.
In Theorem 2.12, we know that the Minkowski space M 4 is flat, and the Riemannian
space {M , g µν } is curved provided that the metric g µν is not the same as the Minkowski
metric in any coordinate systems.
66 Chapter 2 Fundamental Principles of Physics
According to the theory of Riemannian geometry, the motion equations (2.3.5) in the Minkowski
space are in the form
d 2 xk
=0 for k = 1, 2, 3,
ds2
which are the motion equations of special relativity, and (2.3.5) in the Riemannian space
become
d 2 xk dxµ dxν
2
+ Γkµν =0 for k = 1, 2, 3. (2.3.6)
ds ds ds
where Γαµν is the Levi-Civita connection, given by
! "
1 ∂ g µβ ∂ gνβ ∂ g µν
Γαµν = gαβ + − , (2.3.7)
2 ∂ xν ∂ xµ ∂ xα
With the principle of equivalence, the Riemann metric is regarded as the effects caused
only by the gravitation. Furthermore, by the classical gravitational theory, the acting force
! "
∂ϕ ∂ϕ ∂ϕ
F = −∇ϕ = − , , , (2.3.8)
∂ x1 ∂ x2 ∂ x3
where ϕ is the gravitational potential. In comparing the force (2.3.8) with (2.3.7) which
contains first-order derivative terms ∂α gµν , the connections (2.3.7) also play a role of acting
forces in (2.3.6). Hence we need to regard the Riemann metric {g µν } as the gravitational
potential.
Thus, the principle of general relativity requires that the underlying space-time manifold
be a 4-dimensional Riemann space {M , g µν }, and the principle of equivalence defines the
Riemann metric g µν as the gravitational potential.
In conclusion, by Principles 2.26 and 2.27, we derive the following crucial physical
conclusion of the general theory of relativity, and the second crucial physical conclusion is
the Einstein field equations for the gravitational potential:
Physical Conclusion 2.28 (General Theory of Relativity) The physical space of our
Universe is a 4-dimensional Riemannian space {M , g µν }, and the Riemann metric g µν
represents the gravitational potential. The physical laws associated with gravitation are
covariant under the general coordinate transformations in {M , g µν }.
The principle of equivalence have received many supports by experiments. In fact, the
principle is based on the fact that the inertial mass is the same as the gravitational mass. By
the Newton second law,
F = am,
where m is called the inertial mass, denoted by mI , and the gravitational law provides
Gm1 m2
F= ,
r2
where m1 and m2 are called the gravitational masses, and denoted by mg .
Theoretically both masses mI and mg are different physical quantities, and we cannot a
prior claim that they are the same. It must be verified by experiments. Newton was the first
man to check them, resulting
mg
= 1 + o(10−3),
mI
i.e., mg = mI in the error of 10−3 . In 1890, Eötvös obtained the precision to 10−8, and in
1964, Dicke reached at 10−11 .
We remark that if the matrices in (2.3.10) and (2.3.11) are not continuous, then the
transformation (2.3.8) are not permitted.
T = {T ji11···
···ir
js (x)} for x ∈ M .
T&ji11···
···ir i1 ir l1 ls k1 ···kr
js = ak1 · · · akr b j1 · · · b js Tl1 ···ls , (2.3.12)
then T is called a (r, s) type of kth-order general tensor with k = r + s, where a ij and bm
l are
as in (2.3.10) and (2.3.11). The (r, 0) type tensors are called contra-variant tensors, and the
(0, s) type tensors are called covariant tensors.
On a Riemannian space {M , gi j }, the metric {gi j } and its inverse {gi j } are second-order
covariant and contra-variant symmetric tensors, i.e. they satisfy
gi j = g ji , gi j = g ji ,
(2.3.13)
g&i j = bli bkj glk , g&i j = ail akj glk ,
tr[ABA−1] = tr B, (2.3.16)
which can be shown by the fact that the eigenvalues λ 1 , · · · , λn of B are the same as those of
ABA−1 . Indeed we have
tr B = λ1 + · · · + λn = tr[ABA−1].
Alk11···l s k1 ···kr
···kr Bl1 ···ls , (2.3.17)
where the invariance holds true as well if the indices are exchanged. For example, A i j B ji is
also invariant.
By the invariant form (2.3.17), the metric form ds2 of a Riemannian space is a general
invariant, i.e.
ds2 = gi j dxi dx j = g&i j d&
xi d&
x j.
∂k = ∂ /∂ xk for 1 ! k ! n. (2.3.18)
&k = ak Al .
A (2.3.19)
l
∂ A&k ∂ Al ∂ xi ∂ akl ∂ xi l ∂ Al ∂ a k
=akl i + i A = akl bij i + li bij Al , (2.3.20)
∂ x&j ∂ x ∂ x&j ∂ x ∂ x&j ∂x ∂x
where bij = ∂ xi /∂ x&j are as in (2.3.11). In view of Definition 2.29, we infer from (2.3.20)
@ iA
∂A
that the usual derivative operators (2.3.18) are not tensor operators. Namely, is not
∂xj
∂ akl i l
a tensor, and the transformation formula (2.3.20) contains an extra term: bA.
∂ xi j
To solve this problem, as shown in (2.1.31), we need to add a term Γ to the derivative
operator ∂k , resulting a new derivative operator ∇:
∂ Ak
∇ j Ak = + Γkij Ai ,
∂xj
such that ∇ = {∇ j } is a tensor operator. Namely, {∇ j Ak } is a (1,1) type tensor, and trans-
forms as ! l "
k ∂ A&k &k &i k i ∂A
&
∇ jA =
& + Γi j A = al b j + Γri A = akl bij ∇i Al .
l r
(2.3.21)
∂ x&j ∂ xi
By (2.3.20), it follows from (2.3.21) that
k
Γ &i = ak bij Γlri Ar − bij ∂ al Al .
&kij A (2.3.22)
l
∂ xi
By (2.3.22) we deduce the transformation rule for Γ as
k
&kij = ak bri bsj Γlrs − bri bsj ∂ ar .
Γ (2.3.23)
l
∂ xs
Fortunately, for a Riemannian space {M , gi j }, there exists a set of functions
Γ = {Γkij }, (2.3.24)
called the Levi-Civita connection, which satisfies the transformation given by (2.3.23), and
are given by ! "
1 ∂ gil ∂ g jl ∂ gi j
Γkij = gkl + − , (2.3.25)
2 ∂xj ∂ xi ∂ xl
2.3 Einstein’s Theory of General Relativity 71
∂u
∇k u = for a scalar field u,
∂ xk
∂uj j
∇k u j = k + Γkl ul for a vector field {u j },
∂x
∂uj
∇k u j = k − Γlk j ul for a covector field {u j },
∂x
∂ uij11···i r
··· js
∇k uij11···i r
··· js = − Γlk j1 uil1j···i r
2 ··· js
− · · · − Γlk js uij11···i r
··· js−1 l (2.3.26)
∂ xk
i ···i l
+ Γikl1 ulij12······ijsr + · · · + Γiklr u j11 ··· s−1
js .
Consequently, physical laws obeying Principle 2.26 of Einstein general relativity must
be in a form of covariant partial differential equations:
L(u, ∇u, · · · , ∇m u) = 0.
Now, what remains is to establish the field equations governing the Riemann metric
{gi j } (the gravitational potential), which will be introduced in the next two subsections.
L = L (g µν , · · · , ∂ m gµν ). (2.3.27)
However, it is known that all covariant derivatives of the Riemann metric are zero:
∇g µν = 0, ∇g µν = 0. (2.3.28)
Hence, we are not able to directly use the terms ∇g µν , · · · , ∇m g to construct the density
(2.3.27), and have to look for invariants depending on ∂ g µν , · · · , ∂ m gµν (m $ 1) in a differ-
ent fashion.
72 Chapter 2 Fundamental Principles of Physics
The Ricci curvature tensor provides a natural way for us to find the Lagrange action.
For a covector field A = {Ak } we have
∂ β β
∇µ ∇ν Aα = (∇ν Aα ) − Γµν (∇β Aα ) − Γµα (∇ν Aβ ).
∂ xµ
Note that
∂ Aα γ
∇ν Aα = − Γνα Aγ .
∂ xν
Then we can deduce that
β
[∇µ ∇ν − ∇ν ∇µ ]Aα = Rα µν Aβ , (2.3.29)
where
β
β ∂ Γα µ ∂ Γβαν γ β γ β
Rα µν = − + Γα µ Γγν − Γαγ Γγ µ . (2.3.30)
∂ xν ∂ xµ
The tensor in (2.3.30) is called the curvature tensor, and its self-contraction given by
R = g µν Rµν . (2.3.32)
Einstein derived the field equations (2.3.38) in 1915 using his great physical intuition.
According to the classical gravity theory, the Newton potential ϕ satisfies the Laplace equa-
tion
∆ϕ = 4π Gρ , (2.3.39)
where ρ is the mass density. Einstein thinks the gravitational field equations of general
relativity should be in the form
where Tµν represents the energy-momentum tensor, which are provided by physical obser-
vations, and G µν represents gravitational potential which are the generalization of ∆ϕ in
(2.3.39). Hence G µν contains the derivatives of g µν up to the second order. In Riemannian
geometry, only the curvature tensors satisfy the needed properties. Thus G µν must be in the
form
Gµν = Rµν + λ1gµν R + λ2gµν .
74 Chapter 2 Fundamental Principles of Physics
∇µ Tµν = 0,
∇µ Gµν = 0. (2.3.41)
By the Bianchi identity, G µν satisfying (2.3.41) are uniquely determined in the form up to
a constant λ ,
1
Gµν = Rµν − gµν R + λ g µν , (2.3.42)
2
where λ g µν are divergence-free due to (2.3.28). Furthermore, Einstein determined the con-
stant β in (2.3.40) by comparing with (2.3.39), and β is given by
8π G
β =− .
c4
Thus, by (2.3.40) and (2.4.42), Einstein deduced the field equations in the general form as
follows
1 8π G
Rµν − gµν R + λ g µν = − 4 Tµν , (2.3.43)
2 c
and the constant λ is usually called the cosmological constant. Today, a large number of
physical experiments manifest that λ = 0. However, the recent discovery of the acceleration
of our universe leads to some physicists to think that λ ̸= 0.
In Chapter 7, based on the unified field theory developed by the authors, we present a
theory of dark matter and dark energy, which clearly explains the phenomena of dark matter
and dark energy, and shows that the constant λ should be zero, i.e. λ = 0.
Gauge symmetry is one of fundamental invariance principles of physics, and determines the
Lagrange actions of the electromagnetic, the weak and the strong interactions. In order to
show the origin of gauge theory, we first introduce the U(1) gauge invariance of electro-
magnetic fields.
Fermions under an electromagnetic field with potential A µ obey the Dirac equations:
mc
iγ µ Dµ ψ − ψ = 0, (2.4.1)
h̄
Dµ = (∂µ + ieAµ ), (2.4.2)
2.4 Gauge Invariance 75
4π µ
∂ν F µν = J , (2.4.3)
c
! "
µα νβ ∂ Aα ∂ Aβ
F µν =g g − α . (2.4.4)
∂ xβ ∂x
It is easy to see that the system of equations (2.4.1)-(2.4.4) is invariant under the follow-
ing transformation:
& = eiθ ψ ,
ψ A&µ = Aµ − 1 ∂µ θ . (2.4.5)
e
Since eiθ ∈ U(1), (2.4.5) is called a U(1)-gauge transformation.
Now, we consider the gauge invariance from another point of view. Let ψ be a Dirac
spinor describing a fermion:
ψ : M 4 → M 4 ⊗ p C4 .
Experimentally, we cannot observe the phase angles of ψ . Namely, under a phase rotation
transformation
& = eiθ ψ , θ = θ (xµ ),
ψ (2.4.6)
D & = eiθ Dµ ψ .
&µ ψ (2.4.7)
&µ = G µ − 1 ∂ µ θ .
G (2.4.10)
g
&µ ψ
D & µ )(eiθ ψ ) = eiθ (∂µ + igGµ )ψ = eiθ Dµ ψ .
& =(∂µ + igG
76 Chapter 2 Fundamental Principles of Physics
Hence, the derivative operator defined by (2.4.9) is covariant under the gauge transformation
(2.4.6) and (2.4.10):
& = eiθ ψ ,
ψ G& = Gµ − 1 ∂µ θ . (2.4.11)
g
In view of (2.4.11) and (2.4.5), the field G µ and the coupling constant g in (2.4.9) are
the electromagnetic potential A µ and electric charge e:
Gµ = Aµ the electromagnetic potential,
g=e the electric charge.
The above process illustrates that the electromagnetic potential A µ and the electric
charge e are the outcomes from the U(1) gauge invariance. In other words, from the phase
angle symmetry of particles (U(1) gauge symmetry) we deduce the following conclusions:
U(1) gauge symmetry ⇒ U(1) gauge field A µ ,
U(1) gauge field A µ = electromagnetic potential, (2.4.12)
U(1) gauge coupling constant g = electric charge.
Recall that in the Einstein general theory of relativity, the principle of general relativity
not only leads to the equivalence of gravitational potential to the Riemannian metric g µν ,
but also determines the field action—the Einstein-Hilbert action (2.3.35). Now, by the U(1)
gauge invariance, we are able to deduce both the electromagnetic potential A µ as (2.4.12)
and the U(1) gauge action as follows.
In fact, in the same fashion as used in (2.3.29) for gravity, by the covariance of (2.4.7)
we derive the covariant field Fµν as
4 5
Dµ Dν − Dν Dµ ψ = ieFµν ψ , (2.4.13)
where Fµν is given by (2.4.4). The left-hand side of (2.4.13) is covariant under the gauge
transformation (2.4.5), and so does Fµν . Consequently the contraction
F = Fµν F µν , (2.4.14)
ψ : M 4 → M 4 ⊗ p C4 .
2.4 Gauge Invariance 77
The tensor Fµν in (2.4.13) is the curvature tensor of the bundle M 4 ⊗ p C4 , and F = Fµν F µν
in (2.4.15) is the scalar curvature. In addition, the current energy A µ J µ in (2.4.15) corre-
sponds to the energy-momentum g µν Sµν in (2.3.35).
In the same fashion as the electromagnetism, we can derive the gauge theories for both
the weak and the strong interactions from the SU(2) and SU(3) gauge invariances. In the
next subsections we shall introduce the mathematical theory of SU(N) gauge fields and the
principle of gauge invariance in physics.
Ψ = (ψ 1 , · · · , ψ N )T ,
where each ψ k (1 ! k ! N) is a 4-component Dirac spinor. We seek for a set of gauge fields
Gkµ such that the Dirac equations
+ cm ,
iγ µ Dµ − Ψ=0 (2.4.16)
h̄
are invariant under the SU(N) gauge transformation
k (x)τ
Ψ
& = ΩΨ, ∀Ω = eiθ k ∈ SU(N), (2.4.17)
Dµ = ∂µ + igGkµ τk , (2.4.18)
where Ω† = (ΩT )∗ is the complex conjugate of the transport of Ω. Note that an exponent
eiA of a matrix iA is an N × N complex matrix satisfying
†
(eiA )† = e−iA .
78 Chapter 2 Fundamental Principles of Physics
It follows that
† †
(eiA )† (eiA ) = e−iA eiA = ei(A−A ) .
Hence we see that
In addition, we have
det eA = etrA , (2.4.22)
which will be proved at the end of this subsection.
Based on (2.4.20)-(2.4.22), it is not difficult to understand the representation of SU(N)
in a neighborhood of the unit matrix, stated in the following theorem.
Theorem 2.31 (SU(N) Representation) The matrices Ω in a neighborhood U ⊂
SU(N) of the unit matrix can be expressed as
Furthermore, the tangent space Te SU(N) of SU(N) at the unit matrix e = I is a K-dimensional
linear space (K = N 2 − 1), generated by K linear independent traceless Hermitian matrices
τk (1 ! k ! K):
By (2.4.27) we have
⎛ ⎞k ⎛ ⎞
∞ ∞
λ1 ∗ eλ 1 ∗
1 1⎜ .. ⎟ ⎜ .. ⎟
BeA B−1 = ∑ k! (BAB−1)k = ∑ k! ⎝ . ⎠ =⎝ . ⎠.
k=0 k=0
0 λN 0 eλN
Hence we get
det eA = det (BeA B−1 ) = eλ1 +···+λN . (2.4.28)
It is known that
tr A = λ1 + · · · + λN .
Thus, the formula (2.4.22) follows from (2.4.28).
To ensure that (2.4.30)-(2.4.32) are covariant under the transformation (2.4.33), we need
to determine
(1) the transformation form of the gauge fields G µ , and
(2) the action for the gauge fields G µ .
First, consider (1). The covariance of Dirac equations (2.4.30) is equivalent to the co-
variance of the derivative D µ Ψ as given by
&µ Ψ
D & = ΩDµ Ψ ∀Ω as in (2.4.33). (2.4.34)
& = ΩmΩ−1 .
m (2.4.36)
In summary, the transformations (2.4.33), (2.4.35) and (2.4.36) constitute the SU(N)
gauge transformations defined as
aτ
Ψ
& = ΩΨ ∀Ω = eiθ a
∈ SU(N),
&aµ τa = Gaµ Ωτa Ω−1 + i (∂µ Ω)Ω−1 ,
G (2.4.37)
g
& = ΩmΩ−1 .
m
Usually, (2.4.37) is taken as an infinitesimal transformation as follows
We now derive in the same fashion the SU(N) action, called the Yang-Mills action.
By (2.4.34),
& ν Ψ)
& µ (D
D & =D& µ (ΩDν Ψ) = Ω(Dµ Dν Ψ).
It follows that + , 4 5
D &ν Ψ
&µ , D & = Ω Dµ , Dν Ψ. (2.4.39)
i4 5 i i
Dµ , Dν = (∂µ + igGaµ τa )(∂ν + igGaν τa ) − (∂ν + igGaν τa )(∂µ + igGaµ τa ). (2.4.40)
g g g
Notice that
∂µ ∂ν = ∂ν ∂µ , ∂ν (Gaµ τa ) = ∂ν Gaβ τa + Gaβ τa ∂ν .
Therefore we define
a
Fµν = ∂µ Gaν − ∂ν Gaµ + gλbc
a b c
G µ Gν . (2.4.42)
As µ , ν are the indices of 4-D tensors, a Lorentz invariant can be constructed using the
following contraction
a
F&µν τa F&µν b τb† = F&µν
a &µν b
F τa τb† , (2.4.44)
where F µν b = g µα gνβ Fαβ . By (2.4.43) we have
a
F&µν τa F&µν b τb† =Fµν
a
(Ωτa Ω−1 )F µν b (Ωτb Ω−1 )† (2.4.45)
a
=Fµν F µν b Ωτa τb† Ω−1 †
(by Ω = Ω ).
−1
1
where Gab = tr(τa τb† ).
2
The equality (2.4.47) shows that
a
F = Gab Fµν F µν b = Gab g µα gνβ Fµν
a b
Fαβ (2.4.48)
is invariant under both the Lorentz transformations and the SU(N) gauge transformations,
a
where Fµν are given by (2.4.42).
The function (2.4.48) is a unique form which is both Lorentz and the SU(N) gauge
invariant, and contains up to first-order derivatives of the gauge fields G aµ . Hence, F in
(2.4.48) is a unique candidate to be the Lagrange density.
In Section 3.5, we have shown that {Gab} in (2.4.48) is a Riemannian metric of SU(N),
and
1 1 c d
Gab = tr(τa τb† ) = λ λ , (2.4.49)
2 4N ad cb
where λab c are the structure constants of SU(N).
In the classical SU(N) gauge theory, the SU(N) generator τk (1 ! k ! K) are taken to
be Hermitian and traceless, and satisfy
1
tr (τa τb† ) = δab .
2
In this case, the Lagrange density F in (2.4.48) becomes
a
F = Fµν F µν a .
2.4 Gauge Invariance 83
The Lagrange action of an SU(N) gauge theory is usually taken in the following form,
called the Yang-Mills action:
! # ) $
1 a µν a µ cm *
LY M = − Fµν F + Ψ iγ Dµ − Ψ dx, (2.4.50)
M4 4 h̄
where Ψ = Ψ† γ 0 ,
a
Fµν = ∂µ Gaν − ∂ν Gaµ + gλbc
a b c
G µ Gν ,
(2.4.51)
Dµ = ∂µ + igGaµ τa .
The second term on the right-hand side of (2.4.50) is the action for the Dirac equations
(2.4.30)-(2.4.32).
Remark 2.33 The Standard Model in particle physics is currently a prevailing theory
describing all, except the gravity, fundamental interactions. It consists of the Glashow-
Weinberg-Salam (GWS) electroweak theory, the transition theory of weak interaction decay,
the quark model, and the Quantum Chromodynamics (QCD). Based on the Standard Model,
the strong interaction is described by an SU(3) gauge theory, and the electromagnetic and
weak interactions are unified in an action of U(1) × SU(2) gauge fields, combing with the
Higgs mechanism and the Yukawa coupling.
Remark 2.34 All up-to-date experiments illustrate that the electromagnetic, weak,
strong (EWS) interactions obey Principle of Gauge Invariance 2.32.
Remark 2.35 All current theories about interactions, concluding the Standard Model
and the String Theory, have a remarkable drawback: they cannot provide a set of acceptable
field equations for the weak and strong interactions from which we can deduce the weak and
strong interaction potentials. It is known that, the variational equations of the Yang-Mills
action (2.4.50)-(2.4.51) are in the form
∂ µ Fµν
a a αβ b
− gλbc g Fαν Gcβ − gJνa = 0,
) cm * (2.4.52)
iγ µ Dµ − Ψ = 0,
h̄
where
Jνa = Ψγν τa Ψ, γν = gνβ γ β .
84 Chapter 2 Fundamental Principles of Physics
However, it appears that we are not able to derive from (2.4.52) any weak and strong in-
teraction potentials in agreement with experiments. An important reason is that the SU(N)
gauge theory has N 2 − 1 fields:
and we don’t know which or what combination of the potentials in (2.4.53) gives rise to an
interaction force formula.
Remark 2.36 The authors have developed a new unified field theory for the four fun-
damental interactions recently , based only on the following fundamental principles: the
Einstein Principle of General Relativity, the Principle of Lorentz Invariance, the Principle
of Gauge Invariance, and the two newly developed principles by the authors: the Principle
of Interaction Dynamics (PID) and the Principle of Representation Invariance (PRI). The
unified field theory will be presented in detail in Chapter 4, and we shall see that the theory
provides sound explanations and resolutions to the following problems:
3) quark confinement,
4) asymptotically freedom,
1. Newtonian dynamics. The motion equations governing the N planets are the Newto-
nian second law:
d 2 xk
mk 2 = Fk for 1 ! k ! N, (2.5.1)
dt
2.5 Principle of Lagrangian Dynamics (PLD) 85
where Fk is the gravitational force acting on the k-th planet by the other planets, which can
be expressed as
mk m j G
Fk = − ∑ 3
(x j − xk ), (2.5.2)
j̸=k j − xk |
|x
where G is the gravitational constant.
2. Lagrangian dynamics. Based on the least action principle, the Lagrange density L
of the N-body system is
L = T − V,
where T is the total kinetic energy, and V is the potential energy:
! "
1 N dxk 2 mi m j G
T = ∑ mk , V =− ∑ , (2.5.3)
2 k=1 dt i, j=1,i̸= j 2|x i − x j|
where k is a constant.
The Lagrange action for the elastic wave is written as
! t1 ! t1 ! # $
1 2 1 2
L= (T − V )dt = ρ u̇ − k|∇u| − F(x, u) dxdt. (2.5.11)
t0 t0 Ω 2 2
δ L = 0,
∂ 2u
ρ − k∆u = f (x, u) for x ∈ Ω. (2.5.12)
∂ t2
where
∂ F(x, z)
f (x, z) = .
∂z
The equation (2.5.12) is the usual wave equation describing elastic vibration in a continuous
medium.
Electrodynamics consists of two parts: the Maxwell field equations and the motion of
charged particles, each described by the related Lagrange actions.
In classical Maxwell theory, we usually take the electric field E, the magnetic field H,
the current density J⃗ and the electric charge density ρ as state functions, because these
2.5 Principle of Lagrangian Dynamics (PLD) 87
physical quantities are observable. However, the fields E and H are not fundamental physi-
cal quantities, and the basic fields describing electromagnetism are the 4-D electromagnetic
potential A µ and the current density Jµ :
Aµ = (A0 , A1 , A2 , A3 ),
(2.5.13)
Jµ = (J0 , J1 , J2 , J3 ).
Hence the Lagrange density for the Maxwell theory should be constructed with the fields in
(2.5.13).
Based on the Lorentz Invariance and the U(1) Gauge Invariance, the action of (2.5.13)
is uniquely determined and is given in the form (2.4.15):
! T!
L= L (Aµ , Jµ )dxdt, (2.5.14)
0 Ω
where
1 1
L = Fµν F µν + Aµ J µ Fµν as in (2.4.4). (2.5.15)
16π c
We now derive the variational equation, also called the Euler-Lagrange equation, of
(2.5.14)-(2.5.15). Since Jµ is an applied external field, we only take variation with respect
to the field A µ .
It is known that δ L is a 4-dimensional field
δ L = (δ L0 , δ L1 , δ L2 , δ L3 ),
&µ with A
and satisfies that for any A &µ |∂ Q = 0, we have
T
!
d %%
(δ L)µ A
&µ dxdt = % L(Aµ + λ A
&µ ), (2.5.16)
QT d λ λ =0
L = L1 + L2 ,
!
1 µ a νβ
L1 = g g Fµν Fαβ dxdt, Fµν = ∂ν Aµ − ∂µ Aν ,
QT 16 π
!
1
L2 = Aµ J µ dxdt.
QT c
It is clear that
! !
d %% &µ ) = 1 d &µ )J µ dxdt = 1
% L2 (Aµ + λ A (Aµ + λ A Jµ A
&µ dxdt.
d λ λ =0 c QT d λ c QT
1 µα νβ ∂ Fαβ 1 ∂ F µν
δ L1 = − g g = − . (2.5.18)
4π ∂ xν 4 π ∂ xν
Hence it follows from (2.5.17) and (2.5.18) that
1 ∂ F µν 1 µ
δL = − + J ,
4 π ∂ xν c
which implies that the equation δ L = 0 takes the form:
∂ F µν 4π µ
ν
= J . (2.5.19)
∂x c
This is the second pair of the Maxwell equations (2.2.35) and (2.2.36). Since the first pair
of the Maxwell equations (2.2.33) and (2.2.34) are direct consequence of (2.4.4) and
1 ∂ ⃗A
H = curl ⃗A, E = ∇A0 − , ⃗A = (A1 , A2 , A3 ), (2.5.20)
c ∂t
the action (2.5.14)-(2.5.15) completely determines the Maxwell equations (2.2.33)-(2.2.36).
2.5 Principle of Lagrangian Dynamics (PLD) 89
dP e d⃗A
+ = ∇L ,
dt c dt (2.5.25)
mv
P= 3 is the momentum.
1 − v2/c2
By (2.5.23), we have
e
∇L = ∇⃗A · v + e∇A0.
c
90 Chapter 2 Fundamental Principles of Physics
Thanks to
∇⃗A · v =∇(⃗A · v) (by ∂ v = 0)
=(⃗A · ∇)v + (v · ∇)⃗A + v × curl ⃗A + ⃗Acurl v
=(v · ∇)⃗A + v × curl ⃗A,
the equation (2.5.25) reads
dP e d⃗A e e
+ = (v · ∇)⃗A + v × curl ⃗A + e∇A0 . (2.5.26)
dt c dt c c
It is known that
d⃗A ∂ ⃗A ∂ ⃗A dxk ∂ ⃗A
= + k = + (v · ∇)⃗A.
dt ∂ t ∂ x dt ∂t
Hence (2.5.26) is rewritten as
dP e ∂ ⃗A e
=− + e∇A0 + v × curl ⃗A. (2.5.27)
dt c ∂t c
Then, by (2.5.20), the equation (2.5.27) takes the form
dP e
= eE + v × curl ⃗A.
dt c
Thus, we have deduced (2.5.21) from the action (2.5.23).
Now, we deduce the Einstein energy-momentum formula for the 4-dimensional energy-
momentum under an electromagnetic field. Corresponding to the least action principle of
classical mechanics, the momentum P and energy E of a charged particle are given by
∂L ∂L
P= , E = v· −L. (2.5.28)
∂v ∂v
By (2.5.23), we refer from (2.5.28) that
mv e
P= 3 + ⃗A, ⃗A the magnetic potential,
2
1 − v /c 2 c
(2.5.29)
mc2
E=3 + eA0 , A0 the electric potential.
1 − v2/c2
It follows from (2.5.29) that
) e *2
(E − eA0)2 = c2 ⃗P − ⃗A + m2 c4 , (2.5.30)
c
which is the Einstein energy-momentum relation under with an electromagnetic field.
It is the formula (2.5.30) that makes us to take the energy and momentum operators in
quantum mechanics in the following form:
∂ c
ih̄− eA0, −ih̄∇ − ⃗A, (2.5.31)
∂t e
which were also given in (2.2.52). In particular, the particular form of (2.5.31) leads to the
origin of gauge theory.
2.5 Principle of Lagrangian Dynamics (PLD) 91
ψ
& ∗ (0, x) = ψ
& ∗ (T, x) = 0 ∀x ∈ R3 ,
ψ
& →0
∗ as |x| → ∞.
Since ψ
& ∗ is arbitrary, we derive that
∂ψ h̄2
δ L = ih̄ + ∆ψ − V ψ .
∂t 2m
Hence we have
δL ∂ψ h̄2
=0 ⇔ ih̄ = − ∆ψ + V (x)ψ . (2.5.34)
δ ψ∗ ∂t 2m
In other words, (2.5.34) and (2.5.35) are equivalent, and are exactly the Schrödinger equa-
tion.
The field equations governing the spin-0 bosons are the Klein-Gordon equations:
1 ∂ 2ψ ) mc *2
− ∆ψ + ψ = 0. (2.5.36)
c ∂t
2 2 h̄
It is easy to introduce the Lagrange action for (2.5.36):
! # ) mc *2 $
√
L= ∇µ ψ ∇µ ψ ∗ + |ψ |2 −gdx, (2.5.37)
M4 h̄
1) Physics are established based on a few universal principles, which provides a solid
foundation for physics;
3) Lagrange actions contain more physical information than the differential equations.
In fact, a conservation law of a physical system can be derived from the invariance
of the Lagrange action under the associated symmetric transformation.
The correspondence between symmetries and conservation laws are revealed by the
Noether theorem, to be introduced below. For this purpose, we need to introduce some
related concepts on group action and symmetry.
We begin with a simple example. A circle with radius r is described by
x2 + y 2 = r 2 . (2.5.43)
Figure 2.2
We can clearly see the symmetry of the circle shown in Figure 2.2—the graph is the same
from whatever the direction we look at it. This phenomenon is expressed in mathematics as
the invariance of the equation (2.5.43) under the following coordinate transformation
! ′" ! "! "
x cos θ sin θ x
= . (2.5.44)
y′ − sin θ cos θ y
2.5 Principle of Lagrangian Dynamics (PLD) 95
Namely, in the coordinate system (x′ , y′ ), the equation describing the circle is invariant and
still takes form:
x′2 + y′2 = r2 . (2.5.45)
Now, we discuss the symmetry from the viewpoint of action functional and group action.
The key feature is then characterized by the Noether theorem. Let F : R 2 → R1 be a function
(a finite dimensional functional) defined by
All the transformation matrices in (2.5.44) constitute a group, denoted by SO(2), called the
orthogonal group:
@ % ! " A
% cos θ sin θ
SO(2) = Aθ %% Aθ = , θ ∈ R1 . (2.5.47)
− sin θ cos θ
The symmetry here is the invariance of the function (2.5.46) under the SO(2) group action:
The generalization to the Lagrangian dynamics is what we shall introduce in this subsection.
Au ∈ X ∀u ∈ X and A ∈ G,
(2.5.48)
A(Bu) = (AB)u ∀u ∈ X, A, B ∈ G.
F(Au) = F(u) ∀A ∈ G.
The following theorem is the well-known Noether theorem, which provides a correspon-
dence between symmetries and conservation laws in the Lagrangian system (2.5.49).
96 Chapter 2 Fundamental Principles of Physics
Proof of Theorem 2.38 Let (u, ut ) be a solution of the variational equation of L given
by (2.5.49): ! "
d δ L (u, u̇) δ
= L (u, u̇). (2.5.53)
dt δ u̇ δu
By the invariance (2.5.50), (Aλ u, Aλ u̇) are also solutions of (2.5.53). Then it follows from
(2.5.50) that
∂ L (Aλ u, Aλ u̇) D δ L dΦ E D δ L d Φ̇ E
0= = , + , , (2.5.54)
∂λ ∂ Φ dλ δ Φ̇ d λ
where Φ = Aλ u. Since (Aλ u, Aλ u̇) satisfies (2.5.53), we have
! "
δ d δ L (Aλ u, Aλ u̇)
L (Aλ u, Aλ u̇) = . (2.5.55)
δΦ dt δ Φ̇
Inserting (2.5.55) in (2.5.54) we deduce that for any Φ = A λ u and any λ ∈ R1 ,
D d ! δ L " dΦ E D ∂ L d ! dΦ " E d D δ L dΦ E
0= , + , = , ,
dt δ Φ̇ dλ ∂ Φ̇ dt d λ dt δ Φ̇ d λ
which implies that
d d D δ L dΦ E%%
I(u, u̇) = , % = 0,
dt dt δ Φ̇ d λ λ =0
where Aλ u = u if λ = 0. The proof is complete.
We now give two examples to show how to apply the Noether theorem to a specific
physical problem.
Example 2.40 Consider an N-body motion, such as a system of N planets. Let m k
and xk be the mass and the coordinates of the k-th body. The Lagrange density is
1 N
L = ∑ mk |ẋk |2 − ∑ V (|xi − x j |).
2 k=1
(2.5.56)
i̸= j
2.5 Principle of Lagrangian Dynamics (PLD) 97
G = {Aλ | Aλ : R3 → R3 , Aλ x = x + λ⃗r},
where ⃗r is a given vector. It clear that (2.5.56) is invariant under the transformation of G.
By
! "
δL ∂L ∂L
= ,··· , = (m1 ẋ1 , · · · , mN ẋN ),
δ ẋ ∂ ẋ1 ∂ ẋN
d %% d %%
% Aλ x = % (x + λ⃗r) = (⃗r, · · · ,⃗r ).
d λ λ =0 d λ λ =0 F GH I
N
is conserved, and is the total momentum in the direction ⃗r. Thus we have shown that the
translation invariance corresponds to momentum conservation.
Example 2.41 Consider the Schrödinger equation, the action is given by (2.5.33), and
! # ! "$
1 h̄2
L (ψ , ψ̇ ) = −ih̄ψ ∗ ψ̇ + |∇ψ |2 + V |ψ |2 dx. (2.5.57)
R3 2 2m
Let G be the phase rotation group
G = {Aλ = eiλ | λ ∈ R1 }.
ψ → Aλ ψ = eiλ ψ .
We see that
δL d
= −ih̄ψ ∗ , A ψ| = iψ .
δ ψ̇ d λ λ λ =0
Thus, the quantity (2.5.51) reads as
DδL d E !
I(ψ , ψ̇ ) = , Aλ ψ = h̄|ψ |2 dx.
δ ψ̇ d λ R3
Here, the conservation (2.5.59) corresponds to the invariance of phase rotation of the wave
function ψ .
98 Chapter 2 Fundamental Principles of Physics
Remark 2.42 In classical mechanics, the energy conservation can be deduced from
the time translation invariance
t →t +λ for λ ∈ R1 . (2.5.60)
∂H ∂H ∂L
dH = dp+ dq − dt, (2.5.61)
∂p ∂q ∂t
where H = H(q, p,t) is the total energy, L is the Lagrangian, and q, p satisfy the Hamilton
equations
dq ∂ H dp ∂H
= , =− . (2.5.62)
dt ∂p dt ∂q
If L = L (q, q̇,t) is invariant under the translation (2.5.60), then L does not explicitly
contain time t, i.e.
∂L
= 0.
∂t
Then it follows from (2.5.61) and (2.5.62) that
dH ∂L
=− = 0.
dt ∂t
Hence the energy conservation is deduced using the time translation invariance.
We end this section with some relations of symmetries and conservation laws:
In classical mechanics, the principle of Hamiltonian dynamics (PHD) consists of the fol-
lowing three main ingredients:
1) for an isolated (conserved) mechanical system, its states are described by a set of state
variables given by
q1 , · · · , qN , p1 , · · · , pN ; (2.6.1)
2.6 Principle of Hamiltonian Dynamics (PHD) 99
The equivalences in (2.6.9) only manifest that the three principles have an intrinsic relation.
In particular, by (2.6.4)-(2.6.7), the variables q, q̇ of PLD and the variables q, p of PHD have
the relation:
δL δL ∂L
p= (by = = mq̇), (2.6.10)
δ q̇ δ q̇ ∂ q̇
and the Lagrange density L (q, q̇) and the Hamilton energy H(q, p) are related by
In this example, the two relations (2.6.10) and (2.6.11) are obtained directly by their
physical meanings, and by which we can deduce one by another.
In fact, the relations (2.6.10) and (2.6.11) are also valid in [Link] discuss this prob-
lem as follows.
Consider a mechanical system. For PLD, the state variables are positions q k and veloci-
ties q̇k :
q1 , · · · , qN and q̇1 , · · · , q̇N , (2.6.12)
q1 , · · · , qN and p 1 , · · · , pN , (2.6.14)
The two systems (2.6.12)-(2.6.13) and (2.6.14)-(2.6.15) of PLD and PHD satisfy the
following relations, which implies the equivalence of PLD and PHD in classical mechanics.
Dynamical Relation 2.43 (PLD and PHD) For the two systems of PLD and PHD in
classical mechanics, the following conclusions hold true:
1) The two sets (2.6.12) and (2.6.14) of variables satisfy the following relation:
∂ L (q, q̇)
pk = for 1 ! k ! N. (2.6.16)
∂ q̇k
2) The two functions (2.6.13) and (2.6.15) satisfy the following relation, which is usually
called the Legendre transformation:
Then, inserting (2.6.18) in the left-hand side of (2.6.17) we deduce the expression of the
Hamilton energy:
H(q, p) = pk fk (q, p) − L (q, f (q, p)), (2.6.19)
which gives rise to the Hamiltonian system (2.6.3). In other words, we can derive the
Hamiltonian dynamics from the Lagrangian dynamics by the relations (2.6.16) and (2.6.17).
Conversely, if we know the PHD system (2.6.14) and (2.6.15), then it follows from
(2.6.16) and (2.6.17) that the Lagrange density L satisfies
! "
∂L ∂L
H x, − yk + L (x, y) = 0. (2.6.20)
∂y ∂ yk
Theoretically we can solve the differential equation (2.6.20), and obtain the solution L =
L (x, y). Let x = q and y = q̇, then we deduce the expression for the Lagrange density from
the Hamilton function H(q, p).
Remark 2.44 We remark that both PLD and PHD are independent to each other.
However, the PLD dynamical system (2.1.5)-(2.1.8) and the PHD dynamical system (2.6.1)-
(2.6.3) are equivalent in mechanics by (2.6.16) and (2.6.17). But we shall see later that the
relations (2.6.16) and (2.6.17) are not valid in electromagnetism where PLD and PHD still
hold true.
In addition, the differential dH is given by
∂H ∂H ∂H
dH = dqk + d pk + dt. (2.6.21)
∂ qk ∂ pk ∂t
By (2.6.17),
∂L ∂L
dH = q̇k d pk − dqk − dt. (2.6.22)
∂ qk ∂t
It follows from (2.6.21) and (2.6.22) that
∂H
q̇k = , (2.6.23)
∂ pk
∂H ∂L
=− , (2.6.24)
∂ qk ∂ qk
∂H ∂L
=− . (2.6.25)
∂t ∂t
From (2.6.21) and (2.6.25) we deduce (2.5.61). Then by (2.6.16) and the Lagrange equation
(2.1.8), we have ! "
d pk d δL ∂L
= = .
dt dt δ q̇k ∂ qk
102 Chapter 2 Fundamental Principles of Physics
d
H(q(t), p(t)) = 0, (2.6.27)
dt
where (q, p) are the solutions of the Hamiltonian system.
Now, we introduce the PHD.
Principle 2.45 (Hamiltonian Dynamics) For any conservative physical system, there
are two sets of state functions
provided that the system is described by continuous fields. Moreover the state functions u
and v satisfy the equations
∂u δH
=α ,
∂t δv (2.6.31)
∂v δH
= −α ,
∂t δu
where α is a constant.
In general, the energy density (2.6.29) for a continuous field system depends on u, v only
up to the first-order derivatives Du and Dv:
Then the Hamilton equations (2.6.31) can be expressed in the following form:
# ! " $
∂ uk ∂H ∂H
= α −∂ j + ,
∂t ∂ ζ jk ∂ vk
# ! " $ (2.6.33)
∂ vk ∂H ∂H
= α ∂j − ,
∂t ∂ ξ jk ∂ uk
where ξ jk , ζ jk are variables corresponding to ∂ j uk and ∂ j vk .
Hereafter, we always assume that H is in the form (2.6.32). The following theorem
shows that the total energy H of the Hamiltonian system (2.6.31) (or (2.6.33)) is conserved.
Theorem 2.46 (Energy Conservation) Let (u, v) be the solutions of (2.6.33), and H
does not explicitly contain time t. Then the energy H(u, v) is conserved, i.e. H(u, v) satisfies
(2.6.27) with q = u and p = v, if Ω = R3 . In addition, if Ω ̸= R3 , then we have
! # $
dH ∂H ∂uj ∂H ∂vj
= + ni ds, (2.6.34)
dt ∂ Ω ∂ ξi j ∂ t ∂ ζi j ∂ t
where n = (n1 , n2 , n3 ) is the unit outward normal at ∂ Ω.
Proof For (2.6.32), we have
! # ! " ! "$
dH ∂ H ∂ uk ∂ H ∂uj ∂ H ∂ vk ∂ H ∂vj
= + ∂i + + ∂i dx. (2.6.35)
dt Ω ∂ uk ∂ t ∂ ξi j ∂t ∂ vk ∂ t ∂ ζi j ∂t
By the Gauss formula,
! ! " ! ! ! "
∂H ∂uj ∂H ∂uj ∂H ∂uj
∂i dx = ni ds − ∂i dx,
Ω ∂ ξi j ∂t ∂ Ω ∂ ξi j ∂ t Ω ∂ ξi j ∂t
! " ! ! ! "
∂H ∂vj ∂H ∂vj ∂H ∂vj
∂i dx = ni ds − ∂i dx.
∂ ζi j ∂t ∂ Ω ∂ ζi j ∂ t Ω ∂ ζi j ∂ t
Hence (2.6.35) is rewritten as
! #! ! "" ! ! "" $
d ∂H ∂H ∂ uk ∂H ∂H ∂ vk
H= − ∂i + − ∂i dx (2.6.36)
dt Ω ∂ uk ∂ ξik ∂t ∂ vk ∂ ζik ∂t
! # $
∂H ∂uj ∂H ∂vj
+ + ni ds.
∂ Ω ∂ ξi j ∂ t ∂ ζi j ∂ t
Since (u, v) is a solution of (2.6.33), then (2.6.36) becomes
! # $
dH ∂H ∂uj ∂H ∂vj
= + ni ds,
dt ∂ Ω ∂ ξi j ∂ t ∂ ζi j ∂ t
and (2.6.34) follows.
If Ω = R3 (i.e. ∂ Ω = ∅) or u = v = 0 on ∂ Ω, then
d
H(u, v) = 0 for (u, v) satisfy (2.6.33).
dt
The proof is complete.
104 Chapter 2 Fundamental Principles of Physics
u = E = (E1 , E2 , E3 ),
(2.6.41)
v = A = (A1 , A2 , A3 ),
where E is the electric field, and A is the magnetic potential. The Hamilton energy H is
given by !
H= H (E, A)dx,
Ω
(2.6.42)
1 1 1
H = (E 2 + |curl A|2 ) + ∇ϕ · E − J · A,
8π 4π c
where ϕ is the electric potential, and J is the current. Using
DδH E d %%
, E& = % H(E + λ E,
& A),
δE d λ λ =0
DδH E %
,A & = d %% H(E, A + λ A),
&
δA d λ λ =0
we can compute the derivatives of (2.6.42) as follows:
δH 1 δH 1 1
= (E + ∇ϕ ), = curl2 A − J.
δE 4π δA 4π c
Thus, the Hamilton equations (2.6.31) are in the form:
1 ∂E δH 4π
= 4π = curl2 A − J,
c ∂t δA c (2.6.43)
1 ∂A δH
= −4π = −E − ∇ϕ ,
c ∂t δE
which are the classical Maxwell equations (2.2.33) and (2.2.35).
where
1 ∂A
E =− − ∇A0 , Aµ = (A0 , A1 , A2 , A3 ).
c ∂t
106 Chapter 2 Fundamental Principles of Physics
A1 , A2 , A3 and E 1 , E2 , E3 .
It is clear that
1 1 1
E(−Ȧ) − LEM = H (A, E) − ϕ J 0 ,
4π 4π c
and J 0 = ρ . Hence, the relation (2.6.17) does not hold true in general.
Now, we consider the energy conservation. When H contains ∇ϕ and J, which depend
on t, the Hamilton H = H (A, E,t) contains explicitly t. It implies that the Maxwell fields
have energy exchange with other charged particles. Then H is not conserved.
As there is no charged particles in Ω, then
(ρ , J) = 0 in Ω.
In this case, ! ! !
∇ϕ · Edx = − ϕ div Edx = − 4πϕρ dx = 0.
Ω Ω Ω
1 ∂A
P= curl A ×
4π ∂t
Note that
∂A
= −cE, curl A = B (B the magnetic field),
∂t
Then, by (2.6.34) we have that
! ! ) c *
dH
=− P · nds = − E × B · nds.
dt ∂Ω ∂ Ω 4π
The field
c
P=
E ×B
4π
is the Poynting vector, represents the energy flux density of an electromagnetic field.
2.6 Principle of Hamiltonian Dynamics (PHD) 107
ψ = (ψ1 , · · · , ψN )T , N $ 1,
and ψk are as
ψk = ψk1 + iψk2 for 1 ! k ! N. (2.6.46)
In view of PHD for a quantum system, the conjugate fields are taken as real and imagi-
nary parts of the wave functions in (2.6.46):
Let H = H(ψ ) be the Hamilton energy. Then the Hamilton equations for the quantum
system are as follows:
∂ ψk1 δ
α = H,
∂t δ ψk2
for 1 ! k ! N, (2.6.48)
∂ ψ2 δ
α k = − 1 H,
∂t δ ψk
where α is a constant.
We now introduce quantum Hamiltonian systems for the Schrödinger equation, the Weyl
equation, the Dirac equations, the Klein-Gordon equation, and the BEC equation.
1. Schrödinger equation:
∂ψ h̄2
ih̄ = − ∆ψ + V (x)ψ , (2.6.49)
∂t 2m
where ψ = ψ 1 + iψ 2 . The Hamilton energy of (2.6.49) is given by
! # 2 $
1 h̄
H(ψ ) = |∇ψ |2 + V (x)|ψ |2 dx.
2 Ω 2m
It is easy to derive that
δ h̄2
H = − ∆ψ 1 + V ψ 1 ,
δ ψ1 2m
δ h̄2
H = − ∆ψ 2 + V ψ 2 .
δψ 2 2m
Hence, the Hamiltonian system is
∂ ψ1 h̄2
h̄ = − ∆ψ 2 + V ψ 2 ,
∂t 2m (2.6.50)
∂ ψ2 h̄2
h̄ = ∆ψ 2 − V ψ 2 .
∂t 2m
108 Chapter 2 Fundamental Principles of Physics
1 ∂ ψk1 δ
= H,
c ∂t δ ψk2
for k = 1, 2,
1 ∂ ψk2 δ
=− H,
c ∂t δ ψk1
∂ψ
ih̄ = −ih̄c(⃗α · ∇)ψ + mc2α0 ψ , (2.6.54)
∂t
where ⃗α , α0 are as in (2.2.58), ψ = (ψ1 , ψ2 , ψ3 , ψ4 )T , and
It is clear that the expansions of (2.6.54) in terms of real and imaginary parts are in the form
∂ ψk1 δ
h̄ =− H,
∂t δ ψk2
for 1 ! k ! 4,
∂ ψk2 δ
h̄ = H,
∂t δ ψk1
where H is given by (2.6.55).
4. Klein-Gordon equation:
1 ∂ 2ψ ) mc *2
− ∆ ψ + ψ = 0. (2.6.56)
c2 ∂ t 2 h̄
The conjugate fields are
∂ψ
ψ1 = ψ, ψ2 = .
∂t
The Hamilton energy is
! # ) mc *2 $
1
H= |ψ 2 |2 + |∇ψ 1 |2 + |ψ 1 |2 dx. (2.6.57)
2 Ω h̄
Then (2.6.56) can be equivalently written as
1 ∂ ψ1 δ
= H,
c ∂t δ ψ2
1 ∂ ψ2 δ
=− H.
c ∂t δ ψ1
∂ψ h̄2
ih̄ = − ∆ψ + V (|ψ |2 )ψ , (2.6.58)
∂t 2m
where V (|ψ |2 ) is a function of |ψ |2 , and ψ = ψ 1 + iψ 2 . The Hamilton energy of (2.6.58) is
given by
! # 2 $
h̄ 2 2
H= |∇ψ | + G(|ψ | ) dx,
Ω 4m
! (2.6.59)
1 z
G(z) = V (s)ds.
2 0
For (2.6.59), the equation (2.6.58) can be rewritten as
∂ ψ1 δ
h̄ = H,
∂t δ ψ2
∂ ψ2 δ
h̄ =− H.
∂t δ ψ1
110 Chapter 2 Fundamental Principles of Physics
The examples given above show that PHD holds true in general in quantum physics.
In particular, we now show that the relations (2.6.16) and (2.6.17) also valid for quantum
Hamiltonian systems.
In fact, the Lagrange action of a quantum system with the Hamilton energy H(ψ ) is
given by
! T!
L= L (ψ , ψ̇ )dxdt,
0 R3 (2.6.60)
L = ih̄ψ ψ̇ − H (ψ , Dψ ),
∗
where H reads !
H= H (ψ , Dψ )dx.
R3
When taking the variation, the functional
!
&
L= iψ ∗ ψ̇ dxdt
QT
!
= i(ψ 1 − iψ 2 )(ψ̇ 1 + iψ̇ 2 )dxdt
QT
! # $
i ∂ ∂
= |ψ |2 − (ψ 1 ψ 2 ) + 2ψ 2ψ̇1 dxdt
QT 2 ∂ t ∂t
! T!
= 2ψ 2 ψ̇ 1 dxdt,
0 R3
L = 2h̄ψ 2 ψ̇ 1 − H . (2.6.61)
The aim of this chapter is to provide mathematical foundations for the remaining part of the
book on the unified field theory, elementary particles, quantum physics, astrophysics and
cosmology.
As addressed in Chapter 2, Nature speaks the language of mathematics. Thanks to
Einstein’s principle of equivalence and the geometric interaction mechanism, the space-time
is a 4D Riemannian manifold M , and physical fields are regarded as functions on vector
bundles over the base manifold M . For example, the Riemannian metric g µν , representing
the gravitational potential, is a function on the second-order cotangent bundle, g µν : M →
T ∗ M ⊗ T ∗ M ; the electromagnetic field A µ is a function on the cotangent bundle A µ : M →
T ∗ M ; the SU(2) gauge fields for the weak interaction is a function {Wµa } : M → (T ∗ M )3 ;
and the SU(3) gauge fields for the strong interaction is a function {S kµ } : M → (T ∗ M )8 .
Also, the Dirac spinor field is defined a complex bundle: Ψ : M → M ⊗ p C4 .
The three most fundamental symmetries of Nature are the principle of Lorentz invari-
ance, the principle of general relativity, and the principle of gauge invariance. They lead
to transformations in either the base space-time manifold M , or the corresponding vector
bundles.
In summary, we have the following
1) The space-time is a 4D Riemannian manifold, and all physical fields are
functions on vector bundles over the space-time manifold M ⊗ p E N ; and
2) Fundamental symmetries are invariances of physical field equations un-
der the underlying transformations on the corresponding vector bundles.
The basic concepts in Section 3.1 include Riemannian manifolds such as the space-time
manifold, vector bundles, tensor fields, connections and linear transformations on vector
bundles. It is particularly important that we identify all physical fields as functions on
proper vector bundles defined on the space-time manifold, and symmetries correspond then
to linear transformations on the corresponding vector bundles.
Section 3.2 is on basic functional analysis and partial differential equations on mani-
folds, which are needed for rigorous proofs of theorems and concepts used in later chapters
of the book. The readers may consult other references such as (Ma, 2011).
112 Chapter 3 Mathematical Foundations
Section 3.3 proves the orthogonal decomposition theorem for tensors on Riemannian
manifolds, which forms the mathematical foundation for the principle of interaction dy-
namics (PID), postulated by (Ma and Wang, 2014e, 2015a). Basically, PID takes the vari-
ation of the Lagrangian action L(u) under the generalized energy-momentum conservation
constraints, which we call divA -free constraints with A being the gauge potentials:
d !! def
⟨δ L(u0 ), X⟩ = ! L(u0 + λ X) = 0, ∀ divA X = divX − X · A = 0. (3.0.1)
d λ λ =0
Here divA X = 0 represents a generalized energy-momentum conservation. The study of the
constraint variation (3.0.1) requires the decomposition of all tensor fields into the space of
divA -free fields and its orthogonal complements. Such a decomposition is reminiscent to
the classical Helmholtz decomposition of a vector into the sum of an irrotational (curl-free)
vector field and a solenoidal (divergence-free) vector field. In particular, we show in this
chapter that (see Theorem 3.17):
$
where x3 = R2 − (x1 )2 − (x2 )2 , and R is the radius. The metric of this sphere (3.1.3) is
given by
% &2
2 1 2 ∂ x3 1 ∂ x3 2
2 2
ds =(dx ) + (dx ) + dx + 2 dx (3.1.4)
∂ x1 ∂x
=(1 + ϕ )(dx1 )2 + 2ϕ dx1 dx2 + (1 + ϕ )(dx2)2 ,
114 Chapter 3 Mathematical Foundations
is not constant. In fact, for the plane shown by Figure 3.1 (a), we can find a coordinate
transformation % 1& % 1 & % 1&
x' a1 a12 x
= ,
x'2 a21 a22 x2
x1 , x'2 ), the metric (3.1.2) is expressed in the diagonal
such that in the coordinate system ('
form:
ds2 = (d' x1 )2 + (d'
x2 )2 . (3.1.7)
In other words, the metric is gi j = δi j . However, it is impossible to achieve this for the
metric (3.1.4) of the sphere.
The conclusion in this example holds true as well for all Riemannian manifolds. Namely,
for a Riemannian manifold {M , gi j }, M is flat if and only if there is a coordinate system x,
such that the metric {gi j } can be expressed as gi j = δi j under the x-coordinate system.
The following are a few general properties of an n-dimensional Riemannian manifold
{M , gi j }.
1. The Riemannian metric
ds2 = gi j dxi dx j (3.1.8)
is invariant. In fact, under a coordinate transformation
Hence we have
⎛ ⎞
x1
d'
⎜ .⎟
xi d'
g'i j d' x j =(d'
x1 , · · · , d'
xn )(' ⎝ ..⎠
gi j ) ⎜ ⎟ (3.1.10)
xn
d'
⎛ ⎞
dx1
=(dx1 , · · · , dxn )(aki )T (bki )T (gkl )(blj )(alj ) ⎝ ... ⎠
⎜ ⎟
dxn
⎛ 1
⎞
dx
1 n ⎜ .. ⎟
=(dx , · · · , dx )(gi j ) ⎝ . ⎠
dxn
=gi j dxi dx j .
The invariance of ds as shown in (3.1.10) implies that the length L in (3.1.12) is independent
of the coordinate systems.
3. The volume of a bounded domain U ⊂ M is invariant. The reason why {g i j } is called
a metric is that such quantities as the length, area, volume and angle are all determined by
the metric {gi j }.
Given a Riemannian manifold {M , gi j }, let U ⊂ M be a bounded domain. Then the
volume of U is written as !
V = Ω(x)dx. (3.1.13)
U
where the volume element Ωdx reads
√
Ω(x)dx = −gdx, g = det(gi j ). (3.1.14)
116 Chapter 3 Mathematical Foundations
where {⃗e1 , · · · ,⃗en+1 } is an orthogonal basis of Rn+1 . Then the volume element Ωdx is
!0 1!
! ∂⃗r ∂⃗r !!
Ωdx = !! , · · · , dx.
∂ x1 ∂ xn !
∂⃗r ∂⃗r
By gi j = · , the norm of the vector [∂1⃗r, · · · , ∂n⃗r] is
∂ xi ∂ x j
√
|[∂1⃗r, · · · , ∂n⃗r]| = −g, g = det (gi j ).
d' x1 ∧ · · · ∧ d'
x = d' xn (3.1.15)
% 1 & % n &
∂ϕ 1 ∂ ϕ1 n ∂ϕ 1 ∂ ϕn n
= dx + · · · + dx ∧ · · · ∧ dx + · · · + dx
∂ x1 ∂ xn ∂ x1 ∂ xn
% i&
∂ϕ
= det dx1 ∧ · · · ∧ dxn .
∂xj
On the other hand,
% & % &T
∂ ψi ∂ ψi
gi j ) =
(' (gi j ) , ψ = ϕ −1 . (3.1.16)
∂yj ∂yj
Hence % i &−1 .
.
∂ϕ
det('
gi j ) = det det(gi j ). (3.1.17)
∂xj
We deduce from (3.1.15) and (3.1.17) that
. .
det('
gi j )d'
x = det(gi j )dx.
Namely, both the volume and the volume element in (3.1.13)-(3.1.14) are invariant.
4. The metric {gi j } gives rise to an inner product structure on the tangent space of a
Riemann manifold M , and defines the angle between two tangent vectors.
3.1 Basic Concepts 117
Let p ∈ M be a given point, and Tp M be the tangent space at p ∈ M. For two vectors
X,Y ∈ Tp M ,
X = {X 1, · · · , X n }, Y = {Y 1 , · · · ,Y n },
It is clear that the inner product (3.1.18) is an invariant. The angle between X and Y is
defined as .
⟨X,Y ⟩
cos θ = , |Z| = gi j Z i Z j , for Z = X,Y.
|X||Y |
def "
M ⊗p EN = {p} × E pN , (3.1.19)
p∈M
Example 3.1 The motion of the air or seawater can be ideally considered as a fluid
motion on a two-dimensional sphere S 2 , and the velocity field u is defined on the tangent
planes T S2 :
u : S2 → T S2 with u(p) ∈ Tp S2 , ∀p ∈ S2 .
Example 3.2 The electromagnetic interaction takes place on the 4D space-time mani-
fold M , and the electromagnetic field A µ is defined on the tangent space:
Aµ : M → T M with Aµ (p) ∈ Tp M , ∀p ∈ M ,
Example 3.3 An electron moves in the space-time manifold M , and the field de-
scribing the electron state is the Dirac spinor ψ , which is defined on a 4D complex space
C4 :
Examples 3.1-3.3 clearly illustrate that all physical fields are defined on a vector bundle
on M . Namely, a physical field F is a mapping:
The physical field F takes its values in the N-dimensional linear space E N , and can be
written as N components,
F = (F1 , · · · , FN )T . (3.1.21)
Considering invariance under certain symmetry, the transformation group always acts on
the bundle space E N , and induces a corresponding transformation for the field F in (3.1.21).
Also, in order to ensure the covariance of the field equations for F, a differential operator
D acting F must also be covariant, leading to the introduction of connections on the vector
bundle M ⊗ p E N . This process is shown as follows:
1. Symmetric group G act on E N :
G p : E pN → E pN . (3.1.22)
3.1 Basic Concepts 119
F → Tp F. (3.1.23)
Dµ = ∂µ + Γµ . (3.1.24)
Finally, we say that the vector bundle M ⊗ p E N is geometrically trivial if and only if the
connections Γ µ in (3.1.24) are zero, i.e. Γ µ = 0 on M . It implies that the transformations
(3.1.22)-(3.1.23) determines whether M ⊗ p E pN is geometrically trivial, and if G p , Tp in
(3.1.22) and (3.1.23) are independent of p ∈ M , then M ⊗ p E N is geometrically trivial,
and otherwise it’s not. This viewpoint is important for the unified field theory introduced
in Chapter 4, because it implies that the geometry of M ⊗ p E N is determined by symmetry
principles.
The other reason to adopt vector bundles as the mathematical framework to describe
physical fields F is that the types of F can be directly reflected by the bundle space E N .
Hereafter we list a few useful physical fields:
Aµ : M → T M ,
Aµ : M → T ∗ M ,
Trk M = T M ⊗ · · · ⊗ T M ⊗ T ∗ M ⊗ · · · ⊗ T ∗ M , (3.1.25)
4 56 7 4 56 7
k r
Ψ : M → M ⊗ p C4 .
Ψ : M → M ⊗ p (C4 )N .
g µν : M → T20 M ,
The fields given by 1)-6) above include all types of physical fields, and the associated
vector bundles are physically significant.
In classical theories of interactions, the physical fields and the associated bundle spaces
are given as follows:
1) Gravity:
g µν : M → T ∗ M ⊗ T ∗ M . (3.1.26)
Aµ : M → T ∗ M , Ψ : M → M ⊗ p C4 . (3.1.27)
In the unified field theory to be introduced in Chapter 4, each of the four interactions
(3.1.26)-(3.1.29) possesses corresponding dual fields as follows:
The corresponding vector bundles for the dual fields are as follows
φµG : M → T ∗ M ,
φ E : M → M ⊗ p R1 ,
(3.1.30)
φaw : M → M ⊗ p R3 ,
φks : M → M ⊗ p R8 .
3.1 Basic Concepts 121
The fields (3.1.26)-(3.1.30) are all physical fields in the unified field theory. The dual
fields (3.1.30) are introduced only in the unified field equations using PID, but they do not
appear in the actions of the unified field theory.
F : M → M ⊗p EN . (3.1.31)
Let G be a transformation group acting on the fiber space E N of the bundle. This group
action induces naturally a transformation on the bundle M ⊗ p E N as follows:
X → gX, ∀X ∈ E pN , g ∈ G, p ∈ M. (3.1.32)
F(p) ∈ E pN , ∀p ∈ M .
F → gF, ∀g ∈ G. (3.1.33)
Tp M : ei = ∂ /∂ xi for 1 ! i ! n,
i i
(3.1.38)
Tp∗ M : e = dx for 1 ! i ! n.
A : Tp M → Tp M , A = (aij ),
(3.1.40)
B : Tp∗ M → Tp∗ M , B = (bij ) = (AT )−1 ,
then the field T of (3.1.36) (i.e. (3.1.39)) will transform in the same fashion as (3.1.35).
The above two ways to define general tensors are equivalent. However, in the second
fashion we replace the coordinate transformation (3.1.39) by (3.1.40). This approach is very
convenient to uniformly treat the transformations in the unified field theory.
Hereafter we list a few typical linear transformations of fiber spaces for various physical
fields.
1. Lorentz transformations. A (k, r) type Lorentz field is a mapping
F : M → M ⊗p EN , (3.1.41)
E pN = Tp M ⊗ · · · ⊗ Tp M ⊗ Tp∗ M ⊗ · · · ⊗ Tp∗ M .
4 56 7 4 56 7
k r
L : Tp∗ M → Tp∗ M ,
' L = (LT )−1 ,
' (3.1.43)
3.1 Basic Concepts 123
This leads to a natural linear transformation for the fiber space E pN , which induces a trans-
formation for the field F in (3.1.41).
2. SU(N) gauge transformation. A set of N Dirac spinor fields Ψ are referred to the
mapping:
Ψ : M → M ⊗ p (C4 )N . (3.1.44)
When we take the bundle space transformation
Ψ → ΩΨ for Ω ∈ SU(N).
Gaµ (1 ! a ! N 2 − 1) and Ψ = (ψ 1 , · · · , ψ N )T ,
Gaµ : M → T M .
F : M → M ⊗p EN (3.1.47)
undergoes a transformation:
F' = Tp F, p ∈ M, (3.1.48)
if the bundle space E pN undergoes a linear transformation
Tp : E pN → E pN . (3.1.49)
Physical symmetry principles amount to saying that the action (3.1.50) is invariant under
the transformation (3.1.48). This requires that the derivative DF in (3.1.50) be covariant.
Namely, for (3.1.48) we have
' F' = NDF,
D N is a matrix depends on Tp . (3.1.51)
Dµ = ∂µ (∂µ = ∂ /∂ xµ ).
which violates the covariance of (3.1.51), because it has a superfluous term ∂'µ Tp F in the
right-hand side of (3.1.52). Hence, the derivative operator D µ must be in the form
Dµ = ∂µ + Γµ , (3.1.53)
such that (3.1.51) holds true. The field Γ µ is the connection of the vector bundle M ⊗ p E N
under the transformation (3.1.49).
To make explicit of Γ' µ , we assume that ∂'µ and ∂µ have the following relation
Thanks to (3.1.51),
It follows that
In other words, under the linear transformation (3.1.49), the system transforms as follows
' F' = (ATp )DF,
D
(3.1.56)
' = ATp ΓTp−1 − A(∂ Tp )Tp−1 .
Γ
2. Connection for SU(N) group. For the SU(N) group action, (3.1.49) is
Hence, relations (3.1.56) for the SU(N) gauge fields Gaµ are written as
'Ψ
D ' = ΩΨ, Ψ : M → M ⊗ p (C4 )N ,
'aµ τa = ΩGaµ τa Ω−1 − 1 ∂ ΩΩ−1 ,
G Ω ∈ SU(N).
ig
F : M → Trk M , (3.1.57)
k k
Tp = A p ⊗ · · · ⊗ A p ⊗ A−1 ⊗ · · · ⊗ A−1
p : Tr M → Tr M , (3.1.59)
4 56 7 4p 56 7
k r
which can be equivalently expressed a K × K matrix with K = n k+r , and ⊗ is the tensor
product of matrices defined by (3.1.67); see Remark 3.5. In this case, the matrix A of
(3.1.54)-(3.1.55) is precisely the A p in (3.1.58). Hence, by (3.1.56) we have
' F' = (A ⊗ T )DF,
D (3.1.60)
D j F i = ∂ j F i + Γijl F l , (3.1.61)
where Γijl are connections defined on T M . As M is a Riemannian manifold, {Γijk } are the
Levi-Civita connection as given by (2.3.25). It follows from (3.1.56) that the connection of
(3.1.61) transforms as
' = A ⊗ A ⊗ (AT)−1 Γ − A ⊗ ∂ A ⊗ A−1,
Γ (3.1.62)
Dk Fi Gi + Fi Dk Gi = ∂k Fi Gi + Fi∂k Gi . (3.1.65)
6. Derivative on Trk M . For two vector fields Ak and Bk , their tensor product A ⊗ B =
{Ai B j } is a second order tensor. By (3.1.63) we have
Dk (A ⊗ B) =Dk (Ai B j ) = Dk Ai B j + Ai Dk B j
j
=(∂k Ai + Γikl Al )B j + Ai (∂k B j + Γkl Bl )
=∂k (Ai B j ) + Γikl Al B j + Γklj Ai Bl .
3.2 Analysis on Riemannian Manifolds 127
Replacing A ⊗ B by T i j , we obtain
Dk T i j = ∂k T i j + Γikl T l j + Γklj T il .
Remark 3.5 We have encountered tensor products in (3.1.59) and (3.1.62). A further
explanation of this considered is now in order. Let A and B be two matrices with orders n
and m respectively. Then A ⊗ B is a matrix of order N = nm, defined by
⎛ ⎞ ⎛ ⎞
a11 B · · · a1n B ai j b11 · · · ai j b1m
A ⊗ B = ⎝ ... .. ⎟ , ai j B = ⎝ ... .. ⎟ .
⎜ ⎜
. ⎠ . ⎠ (3.1.67)
an1 B ··· ann B ai j bm1 ··· ai j bmm
i ···i
In (3.1.60) and (3.1.62), the components of DF = {Dk Fj11··· jkr } and Γ = {Γkij } are arranged
to be in two vectorial forms.
By Functional Analysis, the spaces L p (M ⊗ p E N ) are Banach spaces, and for 1 < p < ∞,
L p (M ⊗ p E N ) are reflective and separable. The dual spaces of L p (M ⊗ p E N ) (1 < p < ∞)
are
L p (M ⊗ p E N )∗ = Lq (M ⊗ p (E N )∗ ), 1 1
+ = 1,
q N ∗ p
L (M ⊗ p E ) = L (M ⊗ p (E ) ), N ∗ p q
The space L∞ (M ⊗ p E N ) is a Banach space, but not reflective and separable. The space
L∞ (M ⊗ p E N ) is the dual space of L1 (M ⊗ p E N ), i.e.
L1 (M ⊗ p E N )∗ = L∞ (M ⊗ p (E N )∗ ).
ϕ = ∂ ku j. (3.2.3)
The definition (3.2.2)-(3.2.3) for weak derivatives is very abstract. In the following, we
discuss the distinction between continuity and weak differentiability in an intuitive fashion.
3.2 Analysis on Riemannian Manifolds 129
It is clear that
|v(x) − v(y)|
[v]Lip = sup .
x,y∈M , x̸=y |x − y|
Here C0,α (M ⊗ p E N ) are the Hölder spaces. Moreover we have the following inequalities
for the norms:
np
||u||Lq ! C||u||W 1,p for 1 ! q ! , if n > p,
n− p
||u|| q ! C||u|| 1,p for 1 ! q < ∞, if n = p, (3.2.12)
L W
||u||C0,α ! C||u||W 1,p for α = 1 − n/p, if n < p,
where C > 0 are constants depending on n, p and M.
Remark 3.8 When M is non-compact, the conclusions (3.2.11) and (3.2.12) are also
valid only for q satisfying
np
p!q! if n > p,
n− p
p!q<∞ if n = p.
132 Chapter 3 Mathematical Foundations
Based on weakly differentiability properties (3.2.6) and (3.2.7), essence of Theorem 3.7
can be seen from embeddings (3.2.11) using the following function:
|x|α
u(x) = , x ∈ Rn , (3.2.13)
| ln |x||β
where β > 1 is given, and the exponent α < 1 is to reflect the critical embedding index q ∗
in (3.2.11).
The derivatives of u given by (3.2.13) are as follows
∇u = (∂1 u, · · · , ∂n u),
% &
α β (3.2.14)
∂i u = − |x|α −2 xi .
| ln |x||β | ln |x||β +1
dx = rn−1 drds,
|x|α n
u= β
∈ Lq (BR ) ⇔ α " − . (3.2.18)
| ln |x|| q
and as p ! n, then from (3.2.17) and (3.2.18), at the critical embedding exponent q ∗ we have
n n
α = 1− , α =− .
p q∗
It follows that ⎧
⎨ = np for n > p,
q ∗ n− p
⎩
<∞ for n = p.
Thus, we deduce that
⎧ np
⎪
⎪
⎪ u ∈ Lq (BR ) for 1 ! q ! , if n > p,
⎪
⎨ n− p
u ∈ W 1,p (BR ) ⇒ u ∈ Lq (BR ) for 1 ! q < ∞, if n = p, (3.2.19)
⎪
⎪
⎪ n
⎩ u ∈ C0,α (BR )
⎪ for α = 1 − , if p > n.
p
∇k = gkl ∇l , ∇k = gkl ∇l ,
∇k = (D1 , · · · , Dn ),
(3.2.21)
D j the covariant derivative operators.
i ···i ···ik+1
div T = Dil T j11··· jlr , and
i ···i
(3.2.23)
jl
div T = D T j11··· jkl ··· jk+1
∂ uk
div u = Dk uk = + Γkk j u j .
∂ xk
By the Levi-Civita connections (2.3.25), the contraction
√
1 ∂ gkl 1 ∂g 1 ∂ −g
Γkk j = gkl = = √ ,
2 ∂xj 2g ∂ x j −g ∂ x j
∂ ∂
Dk Dk = .
∂ xk ∂ xk
However, the Laplace operators Dk Dk defined on a Riemannian manifold are usually very
complex.
By (3.2.20) and (3.2.22), we have
∇ div
W 2,p (Trk M ) →k W 1,p (Tr+1
k
M ) → L p (Trk M ),
∇k div
W 2,p (Trk M ) → W 1,p (Trk+1 M ) → L p (Trk M ).
div · ∇ = Dk Dk = gkl Dk Dl .
u = (u1 , · · · , un ),
∆ = d δ + δ d,
where d is the differential operator, and δ the Hodge operator, which are defined on the
spaces of all differential forms. For a scalar field, ∆ is the same as div · ∇, i.e.
(X,Y ) = gi j X i X j , ∀X,Y ∈ T M .
(X ∗ ,Y ∗ ) = gi j Xi∗Y j∗ , ∀X ∗ ,Y ∗ ∈ T ∗ M .
Trk M = T · · ⊗ T M7 ⊗ T
4 M ⊗ ·56
∗
· · ⊗ T ∗ M7 .
4 M ⊗ ·56
k r
i ···i
(u, v) = gi1 j1 · · · gik jk gl1 s1 · · · glr sr ul11 ···lrk vsj11···s
··· jr
r
, ∀u, v ∈ Trk M . (3.2.33)
where ∆ = div · ∇.
In the following, we give the Gauss formula on Trk M , which are crucial for the orthog-
onal decomposition theory in the next section.
Theorem 3.11 (Gauss Formula) For any u ∈ H 1 (Trk M ) and v ∈ H 1 (Trk+1 M ) (or
v ∈ H 1 (Tr+1
k M )),
! !
√ √
(∇u, v) −gdx = − (u, divv) −gdx. (3.2.36)
M M
138 Chapter 3 Mathematical Foundations
Remark 3.12 The motivation to generalize the Gauss formulas (3.2.36) to the opera-
tors DA and divA is to develop a new unified field theory for the fundamental interactions.
The vector fields A in (3.2.38) and (3.2.39) represent gauge fields in the interaction field
equations, and lead to a mass generation mechanism based on a first principle, called PID,
different from the famous Higgs mechanism.
where Di are the covariant derivative operators, the unknown function u : M → Trk M is a
(k, r) tensor field, g : M → Trk M and f : M → Trk+1 M (or f : M → Tr+1k M ) are given.
The following existence theorem is a classical result, which is a corollary of the well-
known Fredholm Alternative Theorem.
Two remarks are now in order. First, the solutions φ ̸= 0 of (3.2.42) are the eigenfunc-
tions of the Laplace operator div · ∇ = Di Di corresponding to the zero eigenvalue λ = 0, and
the condition (3.2.41) represents that div f + g is orthogonal to all eigenfunctions for λ = 0
of div · ∇.
Second, by the L p -estimate theorem, if gi j ∈ W m+2,p , f ∈ W m+1,p , and g ∈ W m,p , then
u ∈ W m,p (Trk M ). Therefore, by the Sobolev Embedding Theorem 3.7, if g i j , f , g are C∞ ,
then the solution u is also C ∞ .
Linear hyperbolic equations
Let {M , g µν } be a Minkowski manifold, i.e., its metric g µν can be written in some
coordinate system as % &
−1 0
(g µν ) = , G = (gi j ) (3.2.43)
0 G
where G is an (n − 1) × (n − 1) positive definite symmetric matrix. Then, the Laplace
operator
∂2
Dµ Dµ = g µν Dµ Dν = − 2 + gi j Di D j
∂t
is a hyperbolic operator.
140 Chapter 3 Mathematical Foundations
∂ 2u
− + gi j Di D j u = div f , (3.2.45)
∂ t2
with the periodic condition
Theorem 3.15 Let M = S1 × M Ais a Minkowski manifold with metric (3.2.43), and
∂ M = ∅. Assume that g µν and f are C∞ , then the problem (3.2.45)-(3.2.46) has a C ∞
A
solution u.
Remark 3.16 If M = R1 × M Awith metric (3.2.43) and ∂ M ' = ∅, then the problem
(3.2.45)-(3.2.46) is replaced by the following initial value problem:
∂ 2u
− + gi j Di D j u = div f ,
∂ t2
(3.2.47)
u(0) = ϕ ,
ut (0) = ψ .
The same existence theorem holds true as well for problem (3.2.47).
In this section, we shall show that all tensor fields as given by (3.3.1) can be decomposed
into the form (3.3.2) satisfying (3.3.3).
In order to understand the problem well, we first introduce some classical results: the
Helmholtz decomposition and the Leray decomposition on M = R n .
then there exist a function φ ∈ H 1 (R3 ) and a vector field A ∈ H 1 (T R3 ), such that u can be
decomposed as
u = ∇φ + curl A,
!
(3.3.4)
∇φ · curl Adx = 0.
R3
Note that div(curl A) = 0. Hence, the Helmholtz decomposition is an important initial result
on orthogonal decompositions.
u = ∇φ + v,
v · n|∂ Ω = 0, div v = 0, φ ∈ H 1 (Ω), (3.3.5)
!
∇φ · vdx = 0.
Ω
which means that div v = 0. Thus we obtain the orthogonal decomposition u = ∇φ + v with
div v = 0.
142 Chapter 3 Mathematical Foundations
divA h = 0, ∇A h = 0.
In particular, the subspace of all harmonic tensor fields in L2 (Trk M ) is of finite di-
mensional:
H(Trk M ) = {h ∈ L2 (Trk M )| ∇A h = 0, divA h = 0}, and
(3.3.11)
dim H(Trk M ) < ∞.
Remark 3.18 The above orthogonal decomposition theorem implies that L 2 (Trk M )
can be decomposed into
then u ∈ L2D (Trk M ). It is the reason why we define a divA -free field by (3.3.8).
∆ = divA · ∇A . (3.3.14)
Without loss of generality, we only consider the case where divA u ∈ E' = Trk−1 M . It is
clear that if (3.3.13) has a solution ϕ ∈ H 1 (E),
' then by (3.3.14), the following vector field
must be divA -free
v = u − ∇Aϕ ∈ L2 (E). (3.3.15)
Moreover, by (3.3.8), we have
!
√
⟨v, ∇A ψ ⟩L2 = (v, ∇A ψ ) −gdx = 0, ∀∇A ψ ∈ L2 (Trk M ). (3.3.16)
M
Obviously, if φ satisfies
∆φ = 0, (3.3.18)
where ∆ is as in (3.3.14), then, by (3.2.39),
! !
√ √
(∆φ , φ ) −gdx = − (∇A φ , ∇A φ ) −gdx = 0.
M M
By Theorem 3.14, we derive that the equation (3.3.13) has a unique weak solution ϕ ∈
H 1 (E).
'
For Minkowski manifolds, by Theorem 3.15, the equation (3.3.13) also has a solution.
Thus Assertion (1) is proved.
S TEP 2. P ROOF OF A SSERTION (2). Based on Assertion (1), we have
where
∆u
' = P∆u, (3.3.20)
∂2
∆ = divA · ∇A = gkl + B, (3.3.21)
∂ xk ∂ xl
where B is a lower-order differential operator. Since M is compact, the Sobolev embeddings
B : H 2 (M , RN ) → L2 (M , RN )
is a linear compact operator. Therefore the operator in (3.3.21) is a linear completely con-
tinuous field
∆ : H 2 (E) → L2 (E),
which implies that the operator of (3.3.20) is also a linear completely continuous field
By the spectrum theorem of completely continuous fields (Ma and Wang, 2005), the space
It follows that
' ⇔ ∇A u = 0,
u∈H
' is the same as the harmonic space H of (3.3.11), i.e. H
which implies that H ' = H. Thus we
have
u ∈ L2 (T M ) or u ∈ L2 (T ∗ M ),
u i j = Di ϕ j + v i j , Di vi j = 0, (3.3.23)
j
ui j = D j ψi + wi j , D wi j = 0. (3.3.24)
It is easy to see that if ui j ̸= u ji , then (3.3.23) and (3.3.24) can be two different decomposi-
tion of ui j . Namely
The reason is that the two differential equations generating the two decompositions (3.3.23)
and (3.3.24) as
Di Di ϕ j = Di ui j and Di Di ψ j = Di u ji (3.3.25)
are different because Di ui j ̸= Di u ji .
However for a symmetric tensor field ui j = u ji , as
Di ui j = Di u ji ,
the two equations in (3.3.25) are the same. By the uniqueness of solutions of (3.3.25), the
two solutions ϕ j and ψ j are the same:
ϕi = ψi for 1 ! i ! n.
u i j = Di ϕ j + v i j , Di vi j = 0, (3.3.26)
j
ui j = D j ϕi + wi j , D wi j = 0. (3.3.27)
Theorem 3.20 Let u ∈ L2 (T20 M ) be symmetric, i.e. ui j = u ji , and the first Betti number
β1 (M ) = 0 for M . Then the following assertions hold true:
3.3 Orthogonal Decomposition for Tensor Fields 147
u i j = v i j + Di D j ϕ ,
(3.3.28)
vi j = v i j , Di vi j = 0.
3) u can be orthogonally decomposed in the form (3.3.28) if and only if the follow-
ing differential equations have a solution ϕ ∈ H 2 (M ), and ϕ is the scalar field in
(3.3.28):
∂ ∂ϕ
∆ϕ + Rki k = −D j u ji for 1 ! i ! n, (3.3.29)
∂ xi ∂x
where Rki = gk j Ri j and Ri j are the Ricci curvature tensors, and ∆ is the Laplace
operator for scalar fields as defined by (3.2.28).
Di ϕ j = D j ϕi . (3.3.30)
Note that
∂ϕj
Di ϕ j = − Γkij ϕk ,
∂ xi
and Γkij = Γkji . We infer then from (3.3.30) that
∂ϕj ∂ ϕi
= . (3.3.31)
∂x i ∂xj
By assumption, the 1-dimensional homology of M is zero,
H1 (M ) = 0,
and by the de Rham theorem (Ma, 2010), it follows that all closed 1-forms are complete
differentials, i.e. for any
ω = ψi dxi , d ω = 0,
there is a scalar function ψ such that
ψi = ∂ ψ / ∂ x i for 1 ! i ! n.
ω = ϕk dxk
∂ϕ
ϕk = for 1 ! k ! n.
∂ xk
Assertion (2) is proved.
Now we prove Assertion (3). Taking the divergence on both sides of (3.3.26), we obtain
that
Di Di ϕ j = D i u i j . (3.3.32)
' ϕ j = Di Di ϕ j + Rk ϕk ,
−∆ (3.3.33)
j
dω = 0 ⇔ ϕi = ∂ ϕ / ∂ x i ,
(3.3.34)
δ ω = ∆ϕ ⇔ ϕi = ∂ ϕ / ∂ x i .
Here ∇ is the gradient operator, and ∆ the Laplace operator as in (3.2.28). It follows from
(3.3.34) that
'ω = (δ d + d δ )ω = d δ ω ⇔ ϕi = ∂ ϕ /∂ xi ,
∆
and
∂
dδ ω = (∆ϕ )dxi .
∂ xi
Hence we deduce from (3.3.33) that
∂ ∂ϕ
Di D j ϕ j = − (∆ϕ ) − Rkj k ⇔ ϕi = ∂ ϕ /∂ xi . (3.3.35)
∂x j ∂x
Inserting (3.3.32) in (3.3.35) we obtain that the equations
∂ ∂ϕ
(∆ϕ ) + Rkj k = −Di ui j
∂xj ∂x
Remark 3.21 The conclusions of Theorem 3.20 are also valid for second-order contra-
variant symmetric tensors u = {ui j }, and the decomposition is given as follows:
ui j = vi j + gik g jl Dk Dl ϕ
Di vi j = 0, vi j = v ji , ϕ ∈ H 2 (M ).
3.3 Orthogonal Decomposition for Tensor Fields 149
u : M → Trk M (3.3.36)
u = ∇A ϕ + v, (3.3.37)
!
√
divA v = 0, v · n|∂ M = 0, (∇A ϕ , v) −gdx = 0,
M
v = u − ∇A ϕ (3.3.40)
v · n|∂ M = 0. (3.3.41)
Then it follows from (3.3.40) and (3.3.41) that the tensor field u in (3.3.36) can be orthogo-
nally decomposed into the form of (3.3.37). The proof is complete.
Here M Ais a closed Riemannian manifold, and G = (gi j ) is the Riemann metric of M A.
In view of the Minkowski metric (3.3.43), we see that the operator divA · ∇A is a hyper-
bolic differential operator expressed as
% &2
∂
divA · ∇A = − + A0 + gi j DAi DA j . (3.3.44)
∂t
Now a tensor field u ∈ L2 (Trk M ) has an orthogonal composition if the following hyper-
bolic equation
divA · ∇A ϕ = divA u, in M (3.3.45)
has a weak solution ϕ ∈ H 1 (Trk−1 M ) in the following sense:
! !
√ √
(DA ϕ , DA ψ ) −gdx = (u, DA ψ ) −gdx, ∀ψ ∈ H 1 (Trk−1 M ). (3.3.46)
M M
if and only if equation (3.3.45) has a weak solution ϕ ∈ H 1 (T k−1 M ) in the sense of (3.3.46).
F : X → R1 . (3.4.1)
In the classical variational principle we know that the minimal point u of F in (3.4.2) is
a solution of the variational equation of F:
δ F(u) = 0, (3.4.3)
3.4 Variations with divA -Free Constraints 151
δ F : X → X ∗. (3.4.4)
⟨·, ·⟩ : X × X ∗ → R.
d f (0) d !!
=0⇒ ! F(u + λ v) = 0, ∀v ∈ X.
dλ d λ λ =0
It follows then by (3.4.5) that
⟨δ F(u), v⟩ = 0, ∀v ∈ X,
We see that
! 0 1
d d 1 2
F(u + λ v) = |∇u + λ ∇v| + f (u + λ v) dx
dλ dλ Rn 2
! 8 9
= (∇u + λ ∇v) · ∇v + f ′(u + λ v)v dx.
Rn
152 Chapter 3 Mathematical Foundations
Hence we have
!
d !! 8 9
! F(u + λ v) = ∇u · ∇v + f ′(u)v dx
d λ λ =0 n
!R 8 9
= −div (∇u) + f ′ (u) vdx. (by(3.2.36))
Rn
we deduce that
! !
δ F(u)vdx = (−∆u + f ′(u))vdx, ∀v ∈ H 1 (Rn ).
Rn Rn
In Section 2.4.3, we have seen that the functional (3.4.7) is the scalar curvature part in
the Yang-Mills action (2.4.50).
Referring to derivative operators of functionals for the electromagnetic potential de-
duced in Subsection 2.5.3, we now derive the derivative operator for the Yang-Mills func-
tional (3.4.7):
!
d !! ' =− 1 a d !
!
! F(G + λ G) gµα gνβ Fαβ ! F a (G + λ G)dx
'
d λ λ =0 2 M d λ λ =0 µν
!
B C
'a ∂ G 'aµ
1 µν a ∂ Gν
=− F − ν dx
2 M ∂ xµ ∂x
!
1
− F µν a gλ abc (Gbµ G
'cν + G
'bµ Gcν )dx (by(3.4.8))
2 M
B a C
!
µα νβ
∂ Fαβ c cba b
= g g − gFαβ λ Gµ G 'aν dx.
M ∂ xµ
3.4 Variations with divA -Free Constraints 153
δ F = ∂ α Fαβ
a
− ggα µ λcb
a c
Fαβ Gbµ , β = 0, 1, 2, 3, (3.4.9)
a = λ cba .
where λcb
For the general form of Yang-Mills functional given by
! 0 1
1 a µν b
F= − Gab Fµν F dx, (3.4.10)
M 4
where (Gab ) is the Riemann metric on SU(N) given by (2.4.49). The derivative operator of
F in (3.4.10) is as follows
δ F = Gab ∂ α Fαβ
b
− ggα µ Gbc λda
c b
Fαβ Gdµ . (3.4.11)
j
In addition, by gik gk j = δi , we obtain
d !!
! (gik + λ g'ik )(gk j + λ g'k j ) = 0.
d λ λ =0
It follows that
g'i j = −gik g jl g'kl . (3.4.19)
Thus, (3.4.18) is rewritten as
d !!
! det (gi j + λ g'i j ) = −ggi j g'i j . (3.4.20)
d λ λ =0
For the Einstein-Hilbert functional (3.4.12), we have
d !!
! F(gi j + λ g'i j )
d λ 0λ =0
!
√ 1 d
= Ri j g'i j −g − √ R det (gi j + λ g'i j )
M 2 −g d λ
1 !
d √ !
+ gi j Ri j (gi j + t g'i j ) −g dx!
dλ λ =0
!
1 √
= (Ri j − gi j R)' gi j −gdx
M 2
!
d !! √
+ gi j ! Ri j (gi j + λ g'i j ) −gdx (by (3.4.20)).
M d λ λ =0
∂ Γkki ∂ Γkij
Ri j = − k + Γlik Γklj − Γli j Γklk . (3.4.22)
∂xj ∂x
By the Riemannian Geometry, for each point x0 ∈ M there exists a coordinate system
under which
Γkij (x0 ) = 0, ∀1 ! i, j, k ! n. (3.4.23)
It is known that the covariant derivatives of gi j and gi j are zero, i.e. Dgi j = 0 and Dgi j = 0.
Hence we infer from (3.4.33) that
∂ gi j (x0 ) ∂ gi j (x0 )
= 0, = 0, ∀1 ! i, j, k ! n.
∂ xk ∂ xk
By (3.4.22), at x0 , we have
% &
ij ij ∂ k ∂ k ∂ D ik l ij k
E
g δ Ri j =g δ Γ ik − δ Γ i j = g δ Γ il − g δ Γ i j . (3.4.24)
∂xj ∂ xk ∂ xk
∂ uk
g i j δ Ri j = = div u, at x0 ∈ M. (3.4.25)
∂ xk
where
uk = gik δ Γlil − gi j δ Γkij
is a vector field. Since (3.4.25) is independent of the coordinate systems, in a general
coordinate system the relation (3.4.25) becomes
1 ∂ √
gi j δ Ri j = div u = √ ( −guk ), at x0 ∈ M . (3.4.26)
−g ∂ xk
As x0 ∈ M is arbitrary, the formula (3.4.26) holds true on M . Hence we have
! !
√ √
gi j δ Ri j −gdx = div u −gdx.
M M
Thus we derive (3.4.21), and the derivative operator of the Einstein-Hilbert functional (3.4.12)
is as given by (3.4.15).
156 Chapter 3 Mathematical Foundations
G : M → T20 M = T ∗ M ⊗ T ∗ M ,
The space W m,2 (M , g) is a metric space, but not a Banach space. However, it is a subspace
of the direct sum of two Sobolev spaces W m,2 (T20 M ) and W m,2 (T02 M ):
F : W m,2 (M , g) → R (3.4.28)
is called the functional of Riemannian metric. In general, the functional (3.4.28) can be
expressed as
!
√
F(gi j ) = f (gi j , · · · , ∂ m gi j ) −gdx. (3.4.29)
M
1
gi j = × the cofactor of gi j . (3.4.30)
g
Therefore, F(gi j ) in (3.4.29) also depends on gi j , i.e. putting (3.4.30) in (3.4.29) we get
!
√
F(gi j ) = f'(gi j , · · · , ∂ m gi j ) −gdx. (3.4.31)
M
3.4 Variations with divA -Free Constraints 157
We note that although W m,2 (M , g) is not a linear space, but for a given element gi j ∈
W m,2 (M , g) and any symmetric tensor fields Xi j , X i j , there is a number λ0 > 0 such that
With (3.4.32), we can define the following derivative operators of the functional F:
where W −m,2 (E) is the dual space of W m,2 (E), and δ∗ F, δ ∗ F are defined by
d !!
⟨δ∗ F(gi j ), X⟩ = ! F(gi j + λ Xi j ), (3.4.33)
d λ λ =0
d !!
⟨δ ∗ F(gi j ), X⟩ = ! F(gi j + λ X i j ). (3.4.34)
d λ λ =0
For any give metric gi j ∈ W m,2 (M , g), the value of δ∗ F and δ ∗ F at gi j are second-order
contra-variant and covariant tensor fields:
δ∗ F(gi j ) : M → T M × T M ,
(3.4.35)
δ ∗ F(gi j ) : M → T ∗ M × T ∗ M .
Theorem 3.24 Let F be the functionals defined by (3.4.29) and (3.4.31). Then the
following assertions hold true:
1) For any gi j ∈ W m,2 (M , g), δ∗ F(gi j ) and δ ∗ F(gi j ) are symmetric tensor fields.
j
Proof We only need to verify Assertion (3). In view of g ik gk j = δi , we have the
variational relation
δ (gik gk j ) = gik δ gk j + gk j δ gik = 0.
It implies that
δ gkl = −gki gl j δ gi j . (3.4.36)
158 Chapter 3 Mathematical Foundations
λ Xi j = δ gi j , λ X i j = δ gi j , λ ̸= 0 small.
⟨(δ∗ F)kl , δ gkl ⟩ = −⟨(δ∗ F)kl , gki gl j δ gi j ⟩ = −⟨gki gl j (δ∗ F)kl , δ gi j ⟩ = ⟨(δ ∗ F)i j , δ gi j ⟩.
Hence we have
(δ ∗ F)i j = −gki gl j (δ∗ F)kl .
Thus Assertion (3) follows and the proof is complete.
We are now in position to consider the variation with divA -free constraints. We know
that an extremum point gi j of a metric functional is a solution of the equation
δ F(gi j ) = 0, (3.4.37)
Note that the solution gi j of (3.4.37) in the usual sense should satisfy
Thus, due to the symmetry of δ F(gi j ), the solution gi j of (3.4.37) satisfying (3.4.38) must
also satisfy (3.4.39). Hence the solutions of (3.4.37) in the sense of (3.4.38) are the solutions
in the usual sense.
However, if we consider the variations of F under the divA -free constraint, then the
extremum points of F are not solutions of (3.4.37) in the usual sense. Motived by physical
considerations, we now introduce variations with divA -free constraints.
3.4 Variations with divA -Free Constraints 159
d !!
⟨δ F(u0 ), X⟩ = ! F(u0 + λ X) = 0, ∀divA X = 0, (3.4.40)
d λ λ =0
where divA is as defined in (3.2.38).
In particular, if F is a functional of Riemannian metrics, and the solution u 0 = gi j is a
Riemannian metric, then the differential operator DA in divA X in (3.4.40) is given by
DA = D + A, D = ∂ + Γ, (3.4.41)
δ F(gi j ) = DΦ + A ⊗ Φ, (3.4.42)
Proof of Theorems 3.26 and 3.27 First we prove Theorem 3.26. By (3.4.40), the
extremum points {gi j } of F with the divA -free constraint satisfy
!
√
δ F(gi j ) · X −gdx = 0, ∀X ∈ L2 (T02 M ) with divA X = 0.
M
It implies that
δ F(gi j )⊥L2D (T20 M ) = {v ∈ L2 (T20 M )| divA v = 0}. (3.4.44)
By Theorem 3.17, L2 (T20 M ) can be orthogonally decomposed into
δ F(gi j ) ∈ G2 (T20 M ),
160 Chapter 3 Mathematical Foundations
L2 (T M ) = L2D (T M ) ⊕ G2 (T M ),
L2D (T M ) = {v ∈ L2 (T M )| divA v = 0},
G2 (T M ) = {DA ϕ | ϕ ∈ H 1 (M )}.
δ F(u) ∈ G2 (T M ).
Theorem 3.28 (Scalar Potential Theorem) Assume that the first Betti number of M is
zero, i.e. β1 (M ) = 0. Let F be a functional of Riemannian metrics. Then there is a scalar
field ϕ ∈ H 2 (M ) such that the extremum points {gi j } of F with divergence-free constraint
satisfy the equation
(δ F(gi j ))kl = Dk Dl ϕ . (3.4.46)
Dk Φl = Dl Φk . (3.4.48)
3.4 Variations with divA -Free Constraints 161
In addition, by
∂ Φl
Dk Φl = − Γklj Φ j ,
∂ xk
and Γklj = Γlkj , it follows from (3.4.48) that
∂ Φl ∂ Φk
= . (3.4.49)
∂x k ∂ xl
By assumption, the first Betti number of M is zero, i.e. the first homology of M is zero:
H1 (M ) = 0. It follows from the de Rham theorem that if
% &
k ∂ Φk ∂ Φl
d(Φk dx ) = − k dxl ∧ dxk = 0,
∂ xl ∂x
then there exists a scalar function ϕ such that
∂ϕ k
dϕ = dx = Φk dxk .
∂ xk
Thus, we infer from (3.4.49) that
∂ϕ
Φk = for some ϕ ∈ H 2 (M ).
∂ xk
Therefore, we derive (3.4.46) from (3.4.47), and the proof is complete.
where α j (1 ! j ! N) are constants, Rkl = gk j R jl and R jl are the Ricci curvature tensors.
Proof Since the first Betti number β1 (M ) = N(̸= 0), by the de Rham theorem, there
are N closed 1-forms
they constitute a basis for the 1-dimensional de Rham homology Hd1 (M ). Hence ω j (1 !
j ! N) are not exact, and satisfy that
B C
∂ ψkj ∂ ψlj
dω j = − k dxl ∧ dxk = 0,
∂ xl ∂x
162 Chapter 3 Mathematical Foundations
Dl Φk = Dk Φl (3.4.53)
(δ F(gi j ))lk = Dl Φk ,
where Φk satisfy (3.4.53) and (3.4.54), which are the equations given by (3.4.50).
On the other hand, by the Hodge decomposition theorem, the 1-forms in (3.4.52) are
harmonic, i.e.
d ω j = 0, δ ω j = 0 for 1 ! j ! N.
It follows that the covector fields ψ j (1 ! j ! N) in (3.4.52) satisfy
∆ψ j = 0 for 1 ! j ! N, (3.4.55)
For the fields (3.5.1), a gauge invariant functional is the Yang-Mills action:
!
LY M = [LG + LD ] dx, (3.5.2)
where LG and LD are the gauge field section and Dirac spinor section, and are written as
1 a µν a
LG = − Fµν F ,
4 (3.5.3)
a
Fµν = ∂µ Aaν − ∂ν Aaµ + gλbc
a b c
Aµ Aµ ,
and
LD = Ψ(iγ µ Dµ − m)Ψ,
(3.5.4)
Dµ = ∂µ + igAaµ τa ,
where τa (1 ! a ! K = N 2 − 1) are the generators of SU(N).
The actions (3.5.2)-(3.5.4) are invariant under both the Lorentz transformation and the
SU(N) gauge transformation as follows
Ψ
' = ΩΨ,
(3.5.5)
'aµ τa = ΩAaµ τa Ω−1 + i (∂µ Ω)Ω−1 ,
A
g
and Ω ⊂ SU(N) can be expressed as
aτ
Ω = eiθ a
, τa is in (3.5.4).
For the gauge theory, a vary basic and important problem is that in the gauge transfor-
mation (3.5.5) the generators of SU(N) given by
{τa | 1 ! a ! K}, (3.5.6)
have infinite numbers of families, and each family of (3.5.6) corresponds to a set of gauge
fields:
{τa | 1 ! a ! K} ↔ {Aaµ | 1 ! a ! K}. (3.5.7)
Now, we assume that the generators of (3.5.6) undergo a linear transformation as follows
τ'b = xab τa , (3.5.8)
where (xab ) is a K-th order complex matrix. Then the corresponding gauge field A aµ in (3.5.7)
has to change. Namely, under the transformation (3.5.8) A aµ will transform as
'aµ = ya Abµ ,
A (3.5.9)
b
ψ : U → Rn
and akl = xkl + iykl ∈ C (1 ! k, l ! N). Thus the matrix A can be regarded as a point p A in
2
R2N :
2
pA = (x11 , y11 , · · · , x1N , y1N , · · · , xN1 , yN1 , · · · , xNN , yNN ) ∈ R2N . (3.5.10)
2
Therefore, we have that SU(N) can be regarded as a subspace of R 2N .
By A† A = I and det A = 1, the entries akl (1 ! k, l ! N) of A satisfy
which are N 2 + 1 equations, as constraints for the point pA in (3.5.10). Hence SU(N) can
2
be regarded as subspace of R2N has dimension N 2 − 1.
Mathematically SU(N) is a manifold. In fact, at any point p A of (3.5.10), each equation
2
of (3.5.11) represents a hypersurface near pA in R2N :
Now we derive some properties of tangent vectors τ on TA SU(N). To this end, let
γ (t) ⊂ SU(N) be a curve passing through the point A ∈ SU(N) with τA ∈ TA SU(N) as its
tangent vector at A, as shown in Figure 3.2. Let γ (0) = A. Then the curve γ (t) satisfies the
following equation
d γ (t)
= τt , τt ∈ T SU(N) with τA = τ |t=0 ,
dt (3.5.12)
γ (0) = A.
166 Chapter 3 Mathematical Foundations
γ (t) = A + t τA ⊂ SU(N).
It follows that
(A + t τA)† (A + t τA) = I. (3.5.13)
As A† A = I and t is infinitesimal, we deduce from (3.5.13) that
A† τA + τA† A = 0. (3.5.14)
Hence, (3.5.14) is the condition for an N-th order complex matrix τ ∈ TA SU(N): A† τ is
anti-Hermitian. Namely,
Note that
†τ
A + t τA = A(I + tA†τA ) = AetA A , (3.5.16)
for infinitesimal t. If we replace τ by iτ , then (3.5.15) can be expressed as
Tr(A† τ ) = 0
τ1 , · · · , τK , K = N 2 − 1, (3.5.20)
3.5 SU(N) Representation Invariance 167
Since the expression (3.5.21) is independent of the choice of the generator bases of SU(N),
we have ⎛ ⎞
τ1
θ'a τ'a = (θ 1 , · · · , θ K )Y T X ⎝ ... ⎠ = θ a τa ,
⎜ ⎟
τK
which requires that
Y = (X −1 )T . (3.5.24)
Thus, we see that (θ 1 , · · · , θ K ) is a first order contra-variant tensor defined on Te SU(N).
We are now ready to define more general SU(N) tensors.
Definition 3.30 (SU(N) Tensors) Let T be given as
T = {Tba11···b
···ai
j
| 1 ! ak , bl ! K = N 2 − 1}.
We say that T is a (i, j)-type of SU(N) tensor, under the generator basis transformation as
(3.5.22), we have
d
T'ba11···b
···ai
j
= yac11 · · · yacii xdb11 · · · xb jj Tdc11···d
···ci
j
,
where (xab ) and (yab ) are as in (3.5.22) and (3.5.23).
Based on Definition 3.30, it is easy to see that the SU(N) gauge fields (A 1µ , · · · , Aaµ ) is a
contra-variant SU(N) tensor. In other words, under (3.5.22), (A 1µ , · · · , Aaµ ) transforms as
'aµ = ya Abµ ,
A Y = (yab ) as in (3.5.24). (3.5.25)
b
168 Chapter 3 Mathematical Foundations
This can be seen from the fact that the operator Aaµ τa in the differential operator D µ in
(3.5.4) is independent of generator bases τa of SU(N).
a of SU(N) constitute a (1,2)-type of SU(N)
We now verify that the structure constants λbc
a,
tensor. By the definition of λbc
By (3.5.22),
f
[τ'b , τ'c ] = xab xdc [τa , τd ] = ixab xdc λad τf ,
and by definition
a' a d
[τ'b , τ'c ] = i'
λbc τa = i'
λbc xa τd .
Then it follows that
a f
'
λbc = xb xgc yad λ fdg , (3.5.26)
which means that a}
{λbc is a (1,2)-type SU(N) tensor.
Next, we introduce two second-order covariant SU(N) tensors G ab and gab , and later we
shall prove that they are equivalent.
Let A ∈ SU(N). Then the tangent space TA SU(N) is given by (3.5.17). Let
ω1 , · · · , ωK ∈ TA SU(N), (3.5.27)
1) SU(N) tensor Gab (A). By the basis (3.5.27) we can get a 2-covariant SU(N) tensor
defined by
1
Gab (A) = tr(ωa ωb† ), A ∈ SU(N), (3.5.28)
2
where ωa (1 ! a ! K) are as in (3.5.27).
2) SU(N) tensor gab . The structure constants λbc a generated by the generators ω in
a
(3.5.27) satisfy
a
[ωb , ωc ] = iλbc ωa A † , ∀A ∈ SU(N) (3.5.29)
then we can define another 2-covariant tensor gab on TA SU(N) by the structure con-
a as follows
stants λbc
1 c d
gab = λ λ . (3.5.30)
4N ad cb
The following theorem shows that both Gab and gab are symmetric second-order SU(N)
tensors.
Theorem 3.31 The fields Gab (A) and gab given by (3.5.28) and (3.5.30) are 2-order
symmetric SU(N) tensors, and for any A ∈ SU(N), the generator basis (3.5.27) satisfies
a are the structure constants of SU(N) independent of A.
(3.5.29), where λbc
3.5 SU(N) Representation Invariance 169
Now we verify the symmetry of Gab . By (3.5.17) the basis (ω1 , · · · , ωK ) of TA SU(N)
satisfy that
A† ωa = (A† ωa )† , tr(A† ωa ) = 0, A† = A−1 , (3.5.31)
Thus we have
1 1 1
Gab = tr(ωa ωb† ) = tr(Aτa τb† A† ) = tr(τa τb† ). (3.5.33)
2 2 2
Thanks to (3.5.32), (τ1 , · · · , τK ) ∈ Te SU(N). Hence we have
A† [ωa , ωb ]A = iλab
c †
A ωc .
Theorem 3.32 For the 2-order SU(N) tensors gab and Gab , the following assertions
hold true:
2) For each point A ∈ SU(N), there is a coordinate system in where G ab (A) = gab . There-
fore Gab (A) is a Riemannian metric on SU(N).
gab ) = X(gab)X T ,
(' X = (xab ) as in (3.5.22). (3.5.35)
1 c d
Hence, if for a given basis {τa | 1 ! a ! K} of Te SU(N) we can verify that gab = λ λ
4N ad cb
is positive definition, then by (3.5.35) we derive Assertion 1). In the following, we proceed
first for SU(2) and SU(3), then for the general SU(N).
For SU(2), we take the Pauli matrices
% & % & % &
0 1 0 −i 1 0
σ1 = , σ2 = , σ3 = (3.5.36)
1 0 i 0 0 −1
c of (3.5.36) are as follows
as the generator basis of SU(2). The structure constants λ ab
⎧
⎪ 1 if (abc) is even,
⎨
c
λab = 2εabc , εabc = − 1 if (abc) is odd, (3.5.37)
⎪
⎩
0 if otherwise.
Now, we prove Assertion 2). For each A ∈ SU(N) we take the matrices
1
tr(τa τb† ) = δab .
2
It follows from (3.5.43) that with the basis (3.5.40) of TA SU(N),
LY M = LG + LD ,
1
LG = − Gab Fµνa
F µν b , (3.5.44)
4
8 9
LD = Ψ iγ µ (∂µ + igAaµ τa ) − m Ψ,
It is clear that the action density (3.5.44) is invariant under the transformation of TA SU(N):
In fact, the following are three terms in (3.5.44), which involve contractions of SU(N)
tensors:
a
Gab Fµν F µν b , Aaµ τa , λbc
a b c
A µ Aν .
Remark 3.33 The purely mathematical logic requires the introduction of the modified
Yang-Mills action (3.5.44) and the SU(N) tensors. There is a profound physical significance.
This invariance dictates that mixing different gauge potentials from different gauge groups
will often lead to the violation this simple principle. As we shall see, the new invariance
theory is very important and crucial in the unified field model presented in the next chapter,
where this principle is called the Principle of Representation Invariance (PRI).
h̄2 2 Ze2
− ∇ ϕ− ϕ = λ ϕ.
2m0 r
Since the orbital electrons are bound in the interior of the atom, the following condition
holds true:
ϕ =0 for |x| > r0 ,
where r0 is the radius of an atom. Thus, if ignoring the electromagnetic interactions be-
tween orbital electrons, then the bound energy of an electron is a negative eigenvalue of the
following elliptic boundary problem
h̄2 2 Ze2
− ∇ ϕ− ϕ = λϕ for x ∈ Br0 ,
2m0 r (3.6.4)
ϕ =0 for x ∈ ∂ Br0 ,
− D2 ψ + A ψ = λ ψ for x ∈ Ω,
(3.6.5)
ψ =0 for x ∈ ∂ Ω,
N = dim Eλ0
3.6 Spectral Theory of Differential Operators 175
is called the multiplicity of λ0 . Physically, N is also called the degeneracy provided N > 1.
Usually, we count the multiplicity N of λ0 as N eigenvalues, i.e., we denote
λ1 = · · · = λ N = λ0 .
Based on the physical background, we mainly concern the negative eigenvalues. How-
ever, for our purpose the following classical spectral theorem is very important.
Theorem 3.34 (Spectral Theorem of Elliptic Operators) Let the matrices in (3.6.7)
are Hermitian, and the functions Ai j , Bkij ∈ L∞ (Ω). The the following assertions hold true:
1) All eigenvalues of (3.6.5) are real with finite multiplicities, and form an infinite con-
sequence as follows:
−∞ < λ1 ! λ2 ! · · · ! λk ! · · · , λk → ∞ as k → ∞.
Remark 3.35 For the energy levels of subatomic particles introduced in Chapter 5,
we are mainly interested in the negative eigenvalues of (3.6.5) and in the estimates of the
number N in (3.6.8).
Theorem 3.34 is a corollary of the classical Lagrange multiplier theorem. We recall the
variational principle with constraint. Let H be a linear normed space, and F and G are two
functionals on H:
F, G : H → R.
Let Γ ⊂ H be the set
Γ = {u ∈ H| G(u) = 1}.
If u0 ∈ Γ is a minimum point of F with constraint on Γ:
δ F(ψ ) = λ δ G(ψ ),
and F, G are as in (3.6.10) and (3.6.11), which is as the variational equation (3.6.9) with the
constraint on Γ.
Hence, the eigenvalues λk (k = 1, 2, · · · ) of (3.6.5) can be expressed in the following
forms
λ1 = min F(ψ ),
ψ ∈Γ
(3.6.12)
λk = min F(ψ ),
ψ ∈Γ,ψ ∈Hk−1
⊥
Theorem 3.36 For the eigenvalue problem (3.6.5), the following assertions hold true:
3.6 Spectral Theory of Differential Operators 177
where D is as in (3.6.6).
Proof Assertion 1) follows directly from the following classical formula for the first
eigenvalue λ1 of (3.6.5):
! 8
1 9
λ1 = min (Dψ )† (Dψ ) + ψ † Aψ dx.
ψ ∈H0 (Ω,C ) ||ψ ||L2
1 m Ω
We now prove Assertion 2) by contradiction. Assume that it is not true, then K > N. By
Theorem 3.34, the K functions ψ j in (3.6.15) can be expended as
N ∞
ψ j = ∑ α ji ei + ∑ β jl ϕl for 1 ! j ! K, (3.6.16)
i=1 l=1
where ⎛ ⎞
α11 ··· α1N
⎜ .. .. ⎟
α =⎝ . . ⎠ with αi j as in (3.6.16).
αK1 ··· αKN
Thus, under the transformation P,
B C
ψ
ψ
'=P ∈ EK, ψ = (ψ1 , · · · , ψN )T , (3.6.18)
0
where E K is as in (3.6.15).
However, by (3.6.14) and (3.6.15), the first term ψ
'1 in (3.6.18) can be expressed in the
form
∞
ψ
'1 = ∑ θl ϕl ∈ E K . (3.6.19)
l=1
178 Chapter 3 Mathematical Foundations
Note that ϕl are the eigenfunctions corresponding to the nonnegative eigenvalues of (3.6.5).
Hence we have
! ! ∞
Ω
'1 )† (Dψ
[(Dψ '1† Aψ
'1 ) + ψ '1 ]dx =
Ω
'1† (−D2 ψ
ψ '1 + Aψ
'1 )dx = ∑ |θl |2 λl > 0. (3.6.20)
l=1
Here λl " 0 are the nonnegative eigenvalues of (3.6.5). Hence we derive, from (3.6.19) and
(3.6.20), a contradiction with the assumption in Assertion (2). The proof of the theorem is
complete.
− ∇2 ψ + V (x)ψ = λ ψ for x ∈ Br ,
(3.6.21)
ψ =0 for x ∈ ∂ Br ,
The main result in this section is the following theorem, which provides a relation be-
tween N, θ and r, where N is the number of negative eigenvalues of (3.6.21). Let λ 1 be the
first eigenvalue of the equation
− ∆e = λ e for x ∈ Ω,
(3.6.25)
e=0 for x ∈ ∂ Ω,
Lemma 3.37(H. Weyl) Let λN be the N-th eigenvalue of the m-th order elliptic oper-
ator
(−1)m ∆m e = λ e for x ∈ Ω ⊂ Rn ,
(3.6.26)
Dk e|∂ Ω = 0 for 0 ! k ! m − 1,
then λN has the asymptotical relation
λN ∼ λ1 N 2m/n , (3.6.27)
provided that θ r2+α /λ1 ≫ 1 is sufficiently large, where r and θ are as in (3.6.21) and
(3.6.24), and λ1 is the first eigenvalue of (3.6.25).
Br = {y = rx | x ∈ B1 }.
Based on Assertion (2) in Theorem 3.36, we need to find N linear independent functions
ϕn ∈ H01 (B1 ) (1 ! n ! N) satisfying
! 8 9
|∇ϕ |2 + r2+α V0 (x)ϕ 2 dx < 0, (3.6.31)
B1
180 Chapter 3 Mathematical Foundations
and
λN < θ r2+α ! λN+1 . (3.6.32)
Therefore we have
N
||ϕ ||L2 = ∑ αn2 = 1. (3.6.33)
n=1
It follows from Theorem 3.36 that there are at least N negative eigenvalues for (3.6.30).
When θ r2+α ≫ 1 is sufficiently large, the relation (3.6.32) implies that
λN ≃ θ r2+α . (3.6.34)
3.6 Spectral Theory of Differential Operators 181
Hence the relation (3.6.28) follows from (3.6.34) and (3.6.35). The proof is complete.
Remark 3.39 In Section 6.4.6, we shall see that for particles with mass m, the param-
eters in (3.6.28) are
2Zme2 1
V (x) = − , m the mass of electron.
h̄2 r
The parameters in (3.6.28) for this system are
2Zme2
θ= . (3.6.37)
h̄2
According to physical parameters, it is known that
mc 1 e2 1
r0 × = × 103, = . (3.6.38)
h̄ 4 h̄c 137
182 Chapter 3 Mathematical Foundations
Remark 3.41 The equation (3.6.40) is essentially an eigenvalue problem of the first
order differential operator:
ih̄c(⃗σ · ⃗D) + gA0,
which is called the Weyl operator. In addition, the operator (⃗σ · ⃗D)2 in (3.6.40) is elliptic
and can be written as
g
(⃗σ · ⃗D)2 = D2 − ⃗σ · curl⃗A, (3.6.42)
h̄c
and ⃗A = (A1 , A2 , A3 ) as in (3.6.41). The ellipticity of (3.6.42) g in (3.6.41)-(3.6.42) repre-
sents the weak or strong interaction charge, and A µ = (A0 , A1 , A2 , A3 ) the weak or strong
interaction potential.
Note also that ⃗A = (A1 , A2 , A3 ) stands for the magnetic component of the weak or strong
interaction. Hence, in (3.6.42), the term
g⃗σ · curl⃗A
represents magnetic energy generated by the weak or strong interactions, which is an im-
portant byproduct of the unified field theory based on PID and PRI.
3.6 Spectral Theory of Differential Operators 183
Definition 3.42 A real number λ and a two-component wave function ϕ ∈ H01 (Ω, C2 )
are called the eigenvalue and eigenfunction of (3.6.40), if (λ , ϕ ) satisfies (3.6.40) and
! F G
ϕ † i(⃗σ · ⃗D)ϕ dx > 0. (3.6.43)
Ω
The physical significance of (3.6.43) is that the kinetic energy E of the eigenstate ϕ is
positive: E > 0.
The following theorem ensures the mathematical rationality of the eigenvalue problem
of the Weyl operators.
Theorem 3.43 (Spectral Theorem of Weyl Operators) For the eigenvalue problem
(3.6.40), the following assertions hold true:
1) The eigenvalues of (3.6.40) are real and discrete, with finite multiplicities, and satisfy
−∞ < λ1 ! · · · ! λk ! · · · , λk → ∞ as k → ∞.
−∞ < λ1 ! · · · ! λN < 0.
4) Equations (3.6.40) have negative eigenvalues if and only if there exists a function
ϕ ∈ H01 (Ω, C2 ) satisfying (3.6.43) such that
! 0 1
ig
h̄c|(⃗σ · ⃗D)ϕ |2 + ϕ † {(⃗σ · ⃗D), A0 }ϕ < 0.
Ω 2
By (3.6.42), the operator L2 = −(⃗σ · ⃗D)2 is elliptic. Hence F has the following lower bound
on S: > !! ?
!
S = ψ ∈ H01 (Ω, C2 )!! ψ † Lψ dx = 1 ,
Ω
namely
min F(ψ ) > −∞.
ψ ∈S
Based on the Lagrange multiplier theorem of constraint minimization, the first eigenvalue
λ1 and the first eigenfunction ϕ1 ∈ S satisfy
In addition, if
λ1 ! · · · ! λ m
are the first m eigenvalues with eigenfunctions ψk , 1 ! k ! m, then the (m + 1)-th eigenvalue
λm+1 and eigenfunction ψm+1 satisfy
In the following, we consider the estimates of the number of negative eigenvalues for
the Weyl operators. If the interaction potential A µ takes approximatively the following
βN ∼ β1 N 1/n . (3.6.50)
Here the exponent is 1/n, since (3.6.47) is an 2m-th order elliptic equation with m = 1/2.
Hence we deduce, from (3.6.48) and (3.6.50), the estimates of the number N of negative
eigenvalues of (3.6.47) as % &n
K
N≃ , (3.6.51)
β1
where β1 is the first eigenvalue of (3.6.49).
Remark 3.44 For the mediators such as the photon and gluons, the number of energy
levels is given by N in (3.6.51), which can be estimated as
% &3
A ρ1 g2w
N= ,
β1 ρ h̄c
Once again, the goal of this book is to derive experimentally verifiable laws of Nature based
on a few fundamental mathematical principles. The aims of this chapter are are as follows:
• to address the basic principles for the unified field theory coupling the four funda-
mental interactions,
This chapter is based entirely on the recent work of the authors (Ma and Wang, 2015a,
2014h,c, 2013a, 2014e). The key ingredients of the unified field theory include the follow-
ing.
First, we have established two new principles, the principle of interaction dynamics
(PID) and the principle of representation invariance (PRI). PID was first discovered by (Ma
and Wang, 2014e, 2015a). It requires that for the four fundamental interactions, the variation
be taken under the energy-momentum conservation constraints. The validity of PID for the
four fundamental interactions of Nature has been demonstrated through strong experimental
and observation supports. For gravity, PID is induced by the presence of the dark matter and
dark energy phenomena. PID is also required by the Higgs field and the quark confinement,
as we explained in Chapter 1.
PRI, originally discovered by (Ma and Wang, 2014h), states that the SU(N) gauge theory
should be invariant, under the representation transformations of the generator bases. PRI is
simply a logic requirement for the SU(N) theory.
Second, with PRI, the unification through a large symmetry group appears to be not
feasible. Then we have demonstrated that the two first principles, PID and PRI, together
with the principle of symmetry-breaking, offer an entirely different route for the unification:
1) the general relativity and the gauge symmetries dictate the Lagrangian;
and
4.1 Principles of Unified Field Theory 187
2) the coupling of the four interactions is achieved through PID and PRI
in the unified field equations, which obey the PGR and PRI, but break
spontaneously the gauge symmetry.
In this chapter, there are four sections. Section 4.1 presents the general view of the
unified field theory, the geometry of unified fields, gauge-symmetry breaking, PID, and
PRI. Section 4.2 presents the experimental and observational physical supports for PID.
The mathematical reason from the well-posedness point of view is also given. Section 4.3
introduces the unified field equations coupling the four fundamental interactions based on
PID. Section 4.4 presents the natural duality between the gauge fields and their dual fields,
as well as the decoupling of the unified field equations to the field equations for individual
interaction when the other interactions are negligible.
1) the gravitational force, generated by the mass charge M, which is responsible to all
macroscopic motions;
2) the electromagnetic force, generated by the electric charge e, which holds the atoms
and molecules together;
3) the strong force, generated by the strong charge g s , which mainly acts at three levels:
quarks and gluons, hadrons, and nucleons;
4) the weak force, generated by the weak charge g w , which provides the binding energy
to hold the mediators, the leptons and quarks together.
The most crucial ingredient of each interaction is its corresponding interaction potential
Φ and charge g. The relation between the corresponding force F, its associated potential Φ
and its charge g is as follows:
F = −g∇Φ, (4.1.1)
where ∇ is the three-dimensional spatial gradient operator. The charge g for each interaction
is given as follows:
We know now that the four interactions are dictated respectively by the following sym-
metry principles:
The last three interactions also obey the Lorentz invariance. As a natural consequence, the
three charges e, gw , gs in (4.1.2) are the coupling constants of the U(1), SU(2), SU(3) gauge
fields.
Following the simplicity principle of laws of Nature as stated in Principle 2.2, the three
basic symmetries—the Einstein general relativity, the Lorentz invariance and the gauge
invariance—uniquely determine the interaction fields and their Lagrangian actions for the
four interactions:
1. Gravity. The gravitational fields are the Riemannian metric defined on the space-time
manifold M :
ds2 = g µν dxµ dxν , (4.1.4)
and then second-order tensor {g µν } stands for the gravitational potential. The Lagrangian
action for the metric (4.1.4) is the Einstein-Hilbert functional
8π G
LEH = R + S, (4.1.5)
c4
where R stands for the scalar curvature of the tangent bundle T M of M .
2. Electromagnetism. The field describing electromagnetic interaction is the U(1) gauge
field
Aµ = (A0 , A1 , A2 , A3 ),
representing the electromagnetic potential, and the Lagrangian action is
1
LEM = − Aµν Aµν , (4.1.6)
4
which stands for the scalar curvature of the vector bundle M ⊗ p C4 , with
Aµν = ∂µ Aν − ∂ν Aµ .
4. Strong interaction. The strong fields are the SU(3) gauge fields
The Yukawa Interaction Mechanism, briefly mentioned in Section 2.1.6 and restated
below, is the main reason why the weak interaction is described by an SU(2) gauge theory
and the strong interaction is described by an SU(3) gauge theory.
One great vision of Albert Einstein is his principle of equivalence, which says that grav-
ity is manifested as the curved effect of the space-time manifold {M , g µν }. Based on the
recent work by the authors (Ma and Wang, 2015a, 2014h,d), the Einstein vision leads us
to postulate the Geometric Interaction Mechanism 2.13, which is restated here for conve-
nience:
Geometric Interaction Mechanism 4.1 The gravitational force is the curved effect of
the time-space, and the electromagnetic, weak, strong interactions are the twisted effects of
the underlying complex vector bundles M ⊗ p Cn .
Yukawa’s viewpoint, entirely different from Einstein’s, is that the other three funda-
mental forces—the electromagnetism, the weak and the strong interactions——take place
through exchanging intermediate bosons:
Yukawa Interaction Mechanism 4.2 The four fundamental interactions of Nature are
mediated by exchanging interaction field particles, called the mediators. The gravitational
force is mediated by the graviton, the electromagnetic force is mediated by the photon, the
strong interaction is mediated by the gluons, and the weak interaction is mediated by the
intermediate vector bosons W ± and Z.
190 Chapter 4 Unified Field Theory of Four Fundamental Interactions
It is the Yukawa mechanism that leads to the SU(2) and SU(3) gauge theories respec-
tively for the weak and the strong interactions. In fact, the three mediators W ± and Z for
the weak interaction are regarded as the SU(2) gauge fields Wµa (1 ! a ! 3), and the eight
gluons for the strong interaction are considered as the SU(3) gauge fields S kµ (1 ! k ! 8).
Of course, the three color quantum numbers for the quarks are an important evidence for
choosing the SU(3) gauge theory to describe the strong interaction.
The two interaction mechanisms lead to two entirely different directions to develop the
unified field theory. The need for quantization for all current theories for the four interac-
tions is based on the Yukawa Interaction Mechanism. The new unified field theory in this
article is based on the Geometric Interaction Mechanism, which focuses directly on the four
interaction forces as in (4.1.1), and does not involve a quantization process.
A radical difference for these two mechanisms is that the Yukawa Mechanism is ori-
ented toward to computing the transition probability for the particle decays and scatterings,
and the Geometric Interaction Mechanism is oriented toward to fundamental laws, such as
interaction potentials, of the four interactions.
Einstein’s unification
The aim of a unified field theory is to establish a set of field equations coupling the four
fundamental interactions. Albert Einstein was the first person who attempted to establish a
unified field theory. The basic philosophy of his unification is that all fundamental forces
of Nature should be dictated by one large symmetry group, which can degenerate into a
sub-symmetry for each interaction:
In essence, with the Einstein unification, under the large symmetry, the four fundamental
forces can be regarded as one fundamental force.
Recall that there are four fundamental interactions of Nature: gravitational, electromag-
netic, strong, and weak, whose fields and actions:
and an action
L = L (G1 , · · · , GN ). (4.1.11)
The basic requirements for such a unification are as follows: Under certain conditions,
1) the large symmetry degenerates into sub-symmetries: the general invariance, the
Lorentz invariance, and the U(1), SU(2), SU(3) invariance;
2) the field G of (4.1.10) is then decomposed into the fields of the four fundamental
interactions:
degenerate
(G1 , · · · , GN ) −→ g µν , Aµ ,Wµa , Skµ , (4.1.12)
3) the action (4.1.11) also becomes the simple sum of these actions given in (4.1.5)-
(4.1.8):
L (G1 , · · · , GN ) → LEH + LEM + LW + LS . (4.1.13)
For almost a century, a great deal of effort has been made to find the unified field model
based on the above mentioned approach. However, all efforts in this aspect are not suc-
cessful. In fact, among other reasons, this route of unification violates the principle of
representation invariance (PRI), discovered in (Ma and Wang, 2014h); see also remaining
part of this chapter for details.
Moreover, the basic principles and the field equations from all attempted unified field
theories based on (4.1.9) are often too complex, and violate the simplicity principle of
physics. Most importantly, despite of many attempts, the current theories following (4.1.9)
fail to provide solutions to the following longstanding problems and challenges:
2) quark confinement,
3) asymptotical freedom,
Unification through the Field Equations Based on PID, PRI and PSB (4.1.15)
Also, the route of unification (4.1.14) and (4.1.15) is readily applied to multi-particle
interacting systems, and gives rise to a first dynamic interacting model for multi-particle
systems; see Chapter 6 for details.
Finally, we present a diagram to illustrate the framework of the unified field theory,
based on (4.1.14) and (4.1.15). We note that quantization is used mainly for deriving tran-
sition probability from the field equations for each interaction.
1) The symmetric principles, such as the Einstein general relativity, the Lorentz invari-
ance, and the gauge invariance, determine that the fields reflecting geometries of M
are the Riemannian metric {g µν } and the gauge fields {Gaµ }. In addition, the sym-
metric principles also determine the Lagrangian actions of g µν and Gaµ ;
3) The solutions g µν and Gaµ of the field equations determine the geometries of M and
the vector bundles.
The geometry of the unified fields refers to the geometries of M , determined by the
following known physical symmetry principles:
We shall introduce the two principles PID and PRI in Section 4.1.5.
The fields determined by the symmetries in (4.1.16) are given by
The Lagrange action for the geometry of the unified fields is given by
!
√
L= [LEH + LEM + LW + LS + LD ] −gdx (4.1.17)
M
where LEH , LEM , LW and LS are the Lagrangian actions for the four interactions defined
by (4.1.5)–(4.1.8), and the action for the Dirac spinor fields is given by
Here
Ψ = (ψ e , ψ w , ψ s ),
m = (me , mw , ms ),
and
ψ e : M → M ⊗ p C4 1-component Dirac spinor,
w 4 2
ψ : M → M ⊗ p (C ) 2-component Dirac spinors, (4.1.19)
ψ s : M → M ⊗ p (C4 )3 3-component Dirac spinors.
Dµ ψ e = (∂µ + ieAµ )ψ e ,
Dµ ψ w = (∂µ + igwWµa σa )ψ w , (4.1.20)
s
Dµ ψ = (∂µ + igsSkµ τk )ψ s .
The geometry of unified fields consists of 1) the field functions and 2) the Lagrangian
action (4.1.17), which are invariant under the following seven transformations:
µ T µ
1) the general linear transformation Q p = (aν ) : Tp M → Tp M with Q −1
p = (bν ) , for
any p ∈ M :
g µν ) = Q p (g µν )Q Tp ,
('
'µ = aνµ Aν ,
A
W'µa = aνµ Wνa for1 ! a ! 3, (4.1.21)
S'kµ = aνµ Sνk for1 ! k ! 8,
µ
γ'µ = bν γ ν , ∂'µ = aνµ ∂ν ,
L = (lµν ) : Tp M → Tp M , L is independent of p ∈ M,
g µν ) = L(g µν )LT ,
(' 'µ = lµν Aν ,
A
W'µa = lµν Wνa for 1 ! a ! 3,
(4.1.22)
S'kµ = lµν Sνk for 1 ! k ! 8,
Ψ' = RL Ψ, RL is the spinor transformation matrix,
∂'µ = l ν ∂ν ,
µ
Ω = eiθ : C4 → C4 ∈ U(1),
D E % 1
&
(4.1.23)
e iθ e
ψ' , Aµ = e ψ , Aµ − ∂µ θ ,
'
e
Remark 4.3 Here we adopt the linear transformations of the bundle spaces instead
of the coordinate transformations in the base manifold M . In this case, the two transfor-
mations (4.1.21) and (4.1.22) are compatible. Otherwise, we have to introduce the Vierbein
tensors to overcome the incompatibility between the Lorentz transformation and the general
coordinate transformation.
represent the universality of physical laws— the validity of laws of Nature is independent of
the coordinate systems expressing them. Consequently, the symmetries in (4.1.30) cannot
be broken at both levels of (4.1.28) and (4.1.29).
The physical implication of the gauge symmetry, however, is different at the two levels:
(1) the gauge invariance of the Lagrangian action, (4.1.28), says that the energy contri-
butions of particles in a physical system are indistinguishable; and
(2) the gauge invariance of the variational equations, (4.1.29), means that the particles
involved in the interaction are indistinguishable.
It is clear that the first aspect (1) above is universally true, while the second aspect (2)
is not universally true. In other words, the Lagrange actions obey the gauge invariance,
but the corresponding variational equations break the gauge symmetry. This suggests us to
postulate the following principle of gauge symmetry breaking for interactions described by
the gauge theory.
1) The gauge symmetry holds true only for the Lagrangian actions for the electromag-
netic, week and strong interactions; and
2) the field equations of these interactions spontaneously break the gauge symmetry.
4.1 Principles of Unified Field Theory 197
The principle of gauge symmetry breaking can be regarded as part of the spontaneous
symmetry breaking, which is a phenomenon appearing in various physical fields. In 2008,
the Nobel Prize in Physics was awarded to Y. Nambu for the discovery of the mechanism
of spontaneous symmetry breaking in subatomic physics. In 2013, F. Englert and P. Higgs
were awarded the Nobel Prize for the theoretical discovery of a mechanism that contributes
to our understanding of the origin of mass of subatomic particles.
This phenomenon was discovered in superconductivity by Ginzburg-Landau in 1951,
and the mechanism of spontaneous symmetry breaking in particle physics was first pro-
posed by Y. Nambu in 1960; see (Nambu, 1960; Nambu and Jona-Lasinio, 1961a,b). The
Higgs mechanism, introduced in (Higgs, 1964; Englert and Brout, 1964; Guralnik, Hagen
and Kibble, 1964), is an artificial method based on the Nambo-Jona-Lasinio spontaneous
symmetry breaking, leading to the mass generation for the vector bosons of the weak inter-
action.
PID discovered by the authors, to be stated in detail in the next section, provides a new
mechanism for gauge symmetry breaking and mass generation. The difference between the
PID and the Higgs mechanisms is that the first one is a natural sequence of the first principle,
and the second is to add artificially a Higgs field in the Lagrangian action. Also, the PID
mechanism obeys PRI, and the Higgs mechanism violates PRI. symmetry-breaking!PID-
induced
∇A u = ∇u + u ⊗ A,
divA u = div u − A · u,
where A is a vector field and here stands for a gauge field, ∇ and div are the usual gradient
and divergent covariant differential operators. Let F = F(u) be a functional of a tensor field
u. A tensor u0 is called an extremum point of F with the divA -free constraint, if u0 satisfies
the equation
!
d !! √
! F(u0 + λ X) = δ F(u0 ) · X −gdx = 0, ∀X with divA X = 0. (4.1.31)
d λ λ =0 M
2) The action (4.1.32) satisfy the invariance of general relativity, Lorentz invariance,
gauge invariance and the gauge representation invariance;
3) The states (g, A, ψ ) are the extremum points of (4.1.32) with the divA -free constraint
(4.1.31).
Based on PID and Theorems 3.26 and 3.27, the field equations with respect to the action
(4.1.32) are given in the form
δ
L(g, A, ψ ) = (∇µ + αb Abµ )Φν , (4.1.33)
δ g µν
δ
L(g, A, ψ ) = (∇µ + βba Abµ )ϕ a , (4.1.34)
δ Aaµ
δ
L(g, A, ψ ) = 0 (4.1.35)
δψ
where Aaµ = (Aa0 , Aa1 , Aa2 , Aa3 ) are the gauge vector fields for the electromagnetic, the weak
and strong interactions, Φν = (Φ0 , Φ1 , Φ2 , Φ3 ) in (4.1.33) is a vector field induced by grav-
itational interaction, ϕ a is the scalar fields generated from the gauge fields Aaµ , and αb , βba
are coupling parameters.
Consider the action (4.1.32) as the natural combination of the actions for all four inter-
actions, as given in (4.1.17)-(4.1.20):
L = LEH + LEM + LW + LS + LD .
Then (4.1.33)-(4.1.35) provide the unified field equations coupling all interactions. More-
over, we see from (4.1.33)-(4.1.35) that there are too many coupling parameters which need
to be determined. Fortunately, this problem can be satisfactorily resolved, leading also to
the discovery of PRI (Ma and Wang, 2014h). Meanwhile, we remark that it is the gauge
fields Aaµ appearing on the right-hand sides of (4.1.33) and (4.1.34) that break the gauge
symmetry, leading to the mass generation of the vector bosons for the weak interaction
sector.
We are now in position to introduce the principle of representation invariance (PRI). We
end this section by recalling the principle of representation invariance (PRI) first postulated
4.1 Principles of Unified Field Theory 199
in (Ma and Wang, 2014h). We proceed with the SU(N) representation. In a neighborhood
U ⊂ SU(N) of the unit matrix, a matrix Ω ∈ U can be written as
aτ
Ω = eiθ a
,
where
τa = {τ1 , · · · , τK } ⊂ Te SU(N), K = N 2 − 1, (4.1.36)
is a basis of generators of the tangent space Te SU(N); see Section 3.5 for the mathematical
theory. An SU(N) representation transformation is a linear transformation of the basis in
(4.1.36) as
τ'a = xba τb , (4.1.37)
where X = (xba ) is a nondegenerate complex matrix.
Mathematical logic dictates that a physically sound gauge theory should be invariant
under the SU(N) representation transformation (4.1.37). Consequently, the following prin-
ciple of representation invariance (PRI) must be universally valid and was first postulated in
(Ma and Wang, 2014h).
Both PID and PRI are very important. As far as we know, it appears that the only unified
field model, which obeys not only PRI but also the principle of gauge symmetry breaking,
Principle 4.4, is the unified field model based on PID introduced in this chapter. From this
model, we can derive not only the same physical conclusions as those from the standard
model, but also many new results and predictions, leading to the solution of a number of
longstanding open questions in physics, including the 10 problems mentioned in Section
4.1.2.
Fourth, the electroweak interactions is a U(1) × SU(2) gauge theory coupled with the
Higgs mechanism. An unavoidable feature for the Higgs mechanism is that the gauge fields
with different symmetry groups are combined linearly into terms in the corresponding gauge
field equations. For example, in the Weinberg-Salam electroweak gauge equations with
U(1) × SU(2) symmetry breaking, there are such linearly combinations as
2) the non-existence of solutions for the classical Einstein gravitational field equations
in general situations,
where Tµν is the usual energy-momentum tensor of visible matter. By the Bianchi identity,
the left-hand side of (4.2.1) is divergence-free, i.e.
1
∇µ (Rµν − gµν R) = 0. (4.2.2)
2
It implies that the usual energy-momentum tensor satisfies
∇µ Tµν = 0. (4.2.3)
However, due to the presence of dark matter and dark energy, the energy-momentum
tensor of visible matter Tµν may no longer be conserved, i.e. (4.2.3) is not true. Hence we
have
∇µ Tµν ̸= 0,
Hence, by (4.2.2) and (4.2.5) the gravitational field equations (4.2.1) should be in the form
1 8π G
Rµν − gµν R = − 4 T'µν . (4.2.6)
2 c
By (4.2.4) we have
c4
T'µν = Tµν + ∇µ Φν ,
8π G
which stands for all energy and momentum including the visible and the invisible matter
and energy, and which, by (4.2.5), is conserved. Thus, the equations (4.2.6) are rewritten as
1 8π G
Rµν − gµν R = − 4 Tµν − ∇µ Φν , (4.2.7)
2 c
The equations (4.2.7) are just the variational equations of LEH with the div-free constraint
as (4.1.33). Namely, (4.2.7) are the gravitational field equations obeying PID.
We remark that the term ∇ µ Φν in (4.2.7) has no variational structure, and cannot be
derived by modifying the Einstein-Hilbert functional. Hence, (4.2.7) are just the variational
equations due to PID.
202 Chapter 4 Unified Field Theory of Four Fundamental Interactions
x'µ = aνµ xν , 0 ! ν ! 3,
NEQ = 10,
(4.2.8)
NUF = 6,
where NEQ is the number of independent equations in (4.2.1), and NUF is the number of
independent unknown functions.
Consequently, the Einstein field equations (4.2.1) have no solutions in the general case.
Some readers may think that the Bianchi identity
% &
1 8π G
∇µ Rµν − gµν R + 4 Tµν = 0, (4.2.9)
2 c
reduce the number NEQ to six: NEQ = 6. But we note that (4.2.9) generates also four new
equations
∇µ Tµν = 0 for 0 ! ν ! 3,
because there are unknown functions g µν in the covariant derivative operators ∇ µ ; see
(3.1.66) or (2.3.26). Hence, the Einstein field equations (4.2.1) should be in the form
1 8π G
Rµν − gµν R = − 4 Tµν ,
2 c (4.2.10)
∇µ Tµν = 0.
Thus, the fact (4.2.8) still holds for (4.2.10).
Now we note the gravitational field equations (4.2.7) derived from PID, where there
are four additional new unknown functions Φν (0 ! ν ! 3). In this case, the numbers of
independent unknown functions and equations are the same.
In the following, we give an example to show the non well-posedness of the classical
Einstein field equations.
4.2 Physical Supports to PID 203
It is known that the metric of central gravitational field takes the form
where (ct, r, θ , ϕ ) is the spherical coordinate system. The metric g µν in (4.2.11) can be
expressed in the form
g00 = −eu (u = u(r)),
v
g11 = e (v = v(r)),
2
g22 = r , (4.2.12)
2 2
g33 = r sin θ ,
gµν = 0 for µ ̸= ν .
Consider the influence of cosmic microwave background (CMB) radiation, the energy-
momentum tensor can be approximatively written as
% &
−g00ρ 0
Tµν = , (4.2.13)
0 0
8π G 1
Rµν = − (Tµν − gµν T ),
c4 2 (4.2.15)
∇µ Tµν = 0,
and by (4.2.13),
T = g µν Tµν = g00 T00 = −ρ .
Thus, the Einstein field equations for the spherically symmetric gravitation fields (4.2.15)
204 Chapter 4 Unified Field Theory of Four Fundamental Interactions
R00 = 0,
which is a contradiction to the first equation of (4.2.16). Therefore the equations (4.2.16)
have no solutions.
However, if we consider this example by using the field equations derived from PID,
then the problem must have a solutions; see the theory of dark matter and dark energy in
Chapter 7.
where ψ = ψ † γ 0 .
4.2 Physical Supports to PID 205
#Φ = g(Φ). (4.2.20)
The physical significances of the fermionic and bosonic fields ψ and Φ are as follows:
In particular, in the classical Yang-Mills theory, the SU(N) gauge fields A aµ = (Aa0 , Aa1 ,
Aa2 , Aa3 )
(1 ! a ! N 2 − 1) satisfy the following field equations:
∂ µ Fµν
a
= o(Aµ ), a
Fµν = ∂µ Aaν − ∂ν Aaµ + gλbc
a b c
A µ Aν , (4.2.21)
∂ µ Fµν
a
= −#Aaν − ∂ν (∂ µ Aaµ ) + o(A).
Thus, the gauge field equations (4.2.21) are reduced to the bosonic field equations (4.2.20).
In other words, the gauge fields Aaµ (1 ! a ! N 2 − 1) satisfying (4.2.21) represent N 2 − 1
massless bosons with spin-1 because each Aaµ is a vector field.
We are now in position to introduce the Higgs mechanism. Physical experiments show
that the weak interacting fields should be SU(2) gauge fields with masses, representing 3
massive bosonic particles. However, as mentioned in (4.2.21), the gauge fields satisfying
the SU(2) Yang-Mills theory are 3(= N 2 − 1) massless bosons. To overcome this difficulty,
(Higgs, 1964; Englert and Brout, 1964; Guralnik, Hagen and Kibble, 1964) suggested to
add a scalar field φ into the Yang-Mills action (4.2.22) to create masses. In fact, we cannot
add a massive term mAaµ Aµ a into the Yang-Mills action (4.2.22); otherwise this action will
206 Chapter 4 Unified Field Theory of Four Fundamental Interactions
violate the gauge symmetry. But the Higgs mechanism can ensure the gauge invariance for
the Yang-Mills action, and spontaneously break the gauge symmetry in field equations at a
ground state of the Higgs field φ .
For clearly revealing the essence of the Higgs mechanism, we only take one gauge field
(there are four gauge fields in the GWS theory). In this case, the Yang-Mills action density
is in the form
1
LY M = − gµα gνβ (∂µ Aν − ∂ν Aµ )(∂α Aβ − ∂β Aα ) + ψ (iγ µ Dµ − m)ψ , (4.2.23)
4
where g µν is the Minkowski metric, and
It is clear that the action (4.2.23) is invariant under the following U(1) gauge transformation
1
ψ → eiθ ψ , Aµ → Aµ − ∂µ θ . (4.2.25)
g
The variation equations of (4.2.23) are
#Aµ + ∂µ (∂ ν Aν ) + gJµ = 0,
(iγ µ Dµ − m)ψ = 0, (4.2.26)
µ
Jµ = ψγ ψ ,
which are invariant under the gauge transformation (4.2.25). It is clear that the bosonic
particle A µ in (4.2.26) is massless.
To generate mass for A µ , we add a Higgs sector LH to the Yang-Mills action (4.2.23):
1 1
LH = − gµν (Dµ φ )† (Dµ φ ) + (φ † φ − ρ 2 )2 ,
2 4
Dµ φ = (∂µ + igAµ )φ , (4.2.27)
1
(ψ , φ ) → (eiθ ψ , eiθ φ ), Aµ → Aµ − ∂µ θ . (4.2.30)
g
Equations (4.2.29) are still massless. However, we note that (0, 0, ρ ) is a solution of
(4.2.29), which represents a ground state in physics, i.e. a vacuum state. Consider a trans-
lation for Φ = (A, ψ , φ ) at Φ0 = (0, 0, ρ ) as
Φ=Φ
' + Φ0 , Φ 'ψ
' = (A, ' , φ'),
where
i
J'µ (φ ) = (φ'(Dµ φ')† − φ'† Dµ φ').
2
We see that A 'µ attains its mass m = gρ in (4.2.31), but equations (4.2.31) break the
invariance for the gauge transformation (4.2.30). The process that masses are created by
the spontaneous gauge-symmetry breaking is called the Higgs mechanism. Meanwhile, the
√
field φ' in (4.2.31), called the Higgs boson, is also obtain its mass m = 2ρ .
In the following, we show that PID provides a new mechanism for generating masses,
drastly different from the Higgs mechanism.
In view of (4.1.34) and (4.1.35), based on PID, the variational equations of the Yang-
Mills action (4.2.23) with the divA -free constraint are in the form
0 1
1 D mc E2
∂ ν (∂ν Aµ − ∂µ Aν ) − gJµ = ∂µ − xµ + λ Aµ φ ,
4 h̄ (4.2.32)
µ
(iγ Dµ − m f )ψ = 0,
1 D mc E2
where φ is a scalar field. The term − xµ is the mass potential of φ , and is also
4 h̄
regarded as the interacting length of φ . If φ has a nonzero ground state φ 0 = ρ , then for the
translation
φ = φ' + ρ , Aµ = A 'µ , ψ = ψ ',
D m c E2 h̄ $
0
where = λ ρ . Thus the mass m0 = λ ρ is generated in (4.2.33) as the Yang-
h̄ c
Mills action takes the divA -free constraint variation. Moreover, when we take divergence
on both sides of (4.2.33), and by
∂ µ ∂ ν (∂ν A
'µ − ∂µ A
'ν ) = 0, ∂ µ J'µ = 0,
c e2 h̄es
curl2 A = − s |ψ |2 A − i (ψ ∗ ∇ψ − ψ ∇ψ ∗ ). (4.2.37)
4π ms c ms
Let
c e2s h̄es ∗
J= curl2 A, Js = |ψ |2 A − i (ψ ∇ψ − ψ ∇ψ ∗).
4π ms c ms
Physically, J is the total current in Ω, and Js is the superconducting current. Since Ω is a
medium conductor, J contains two types of currents as
J = Js + σ E,
4.3 Unified Field Model Based on PID and PRI 209
1 ∂A
E =− − ∇Φ = −∇Φ,
c ∂t
and Φ is the electric potential. Since At = 0, the superconducting current equations should
be taken as
1 σ e2 ih̄es ∗
curl2 A = − ∇Φ − s 2 |ψ |2 A − (ψ ∇ψ − ψ ∇ψ ∗ ). (4.2.38)
4π c ms c ms c
Since (4.2.37) is the expression of (4.2.36), then the equation (4.2.38) can be written in
the abstract form
δG σ
= − ∇Φ. (4.2.39)
δA c
In addition, for conductivity, the gauge fixing is given by
div A = 0, A · n|∂ Ω = 0,
Here
8π G
LEH = R + S,
c4
1
LEM = − Aµν Aµν + ψ e (iγ µ Dµ − m)ψ e ,
4
(4.3.2)
1 w a bµν
LW = − GabWµν W + ψ w (iγ µ Dµ − ml )ψ w ,
4
1
LS = − Gkls Skµν S µν l + ψ s (iγ µ Dµ − mq )ψ s ,
4
where R is the scalar curvature of the space-time Riemannian manifold (M , g µν ) with
Minkowski type metric, S is the energy-momentum density, G ab w and G s are the SU(2)
kl
and SU(3) metrics as defined by (3.5.28), ψ e , ψ w and ψ s are the Dirac spinors for fermions
participating in the electromagnetic, weak, strong interactions, and
Aµν = ∂µ Aν − ∂ν Aµ ,
a
Wµν = ∂µ Wνa − ∂ν Wµa + gwλbc
a
WµbWνc , (4.3.3)
Skµν = ∂µ Sνk − ∂ν Skµ + gsΛklr Slµ Sνr .
Here Aµ is the electromagnetic potential, Wµa (1 ! a ! 3) are the SU(2) gauge fields for the
weak interaction, Skµ (1 ! k ! 8) are the SU(3) gauge fields for the strong interaction, g w
and gs are the weak and strong charges, and
Dµ ψ e = (∂µ + ieAµ )ψ e ,
Dµ ψ w = (∂µ + igwWµa σa )ψ w , (4.3.4)
s
Dµ ψ = (∂µ + igsSkµ τk )ψ s ,
where σa (1 ! a ! 3) and τk (1 ! k ! 8) are the generators of SU(2) and SU(3).
Remark 4.8 For a vector field Xµ and an antisymmetric tensor field Fµν , we have
∇µ Xν − ∇ν Xµ = ∂µ Xν − ∂ν Xµ ,
∇µ Fµν = ∂ µ Fµν ,
where ∇µ is the Levi-Civita covariant derivative. Hence, the tensor fields in (4.3.3) and the
action (4.3.1) obey both the Einstein general relativity and the Lorentz invariance simulta-
neously under the transformations (4.1.21)-(4.1.22).
Remark 4.9 In the standard model, the wave functions ψ w and ψ s in LW and LS are
as follows % &
l
ψw = 1 the left-hand lepton pairs,
l2 L
⎛ ⎞ (4.3.5)
q1
s
ψ = ⎝q2 ⎠ qi the quark with ith color.
q3
4.3 Unified Field Model Based on PID and PRI 211
The reason why ψ w and ψ s are taken in the form (4.3.5) is that the standard model, in partic-
ular the GWS electroweak theory, is oriented toward to computing the transition probability
for the decay and scattering. In the PID field theory, it is unnecessary to take ψ w and ψ s
as in (4.3.5), because this model is oriented toward to interaction potentials and the basic
mechanism.
Remark 4.10 According to the standard model, the field particles corresponding to
electromagnetic, weak, and strong interactions are described by U(1), SU(2), SU(3) gauge
fields. Hence we take the U(1) × SU(2) × SU(3) Yang-Mills action together with L EH
as the action. However, if only consider the field theory for an N-particle system with N1
electric, N2 weak, N3 strong charges, then the action sectors in (4.3.1) should be taken as
LEH = R,
LEM = SU(N1 ) Yang-Mills action,
LW = SU(N2 ) Yang-Mills action, and
LS = SU(N3 ) Yang-Mills action.
In Section 6.5, we shall discuss the unified field theory for multi-particle systems.
We are now in position to establish unified field equations obeying PRI and PSB. By
PID, the unified field model (4.1.33)-(4.1.34) are derived as the variational equation of the
action (4.1.17) under the divA -constraint
Here it is required that the gradient operator ∇A corresponding to divA are PRI covariant.
The gradient operators in different sectors are given as follows:
where
mw , ms , α 0 , β 0 , γ 0 , δ 0 are scalar parameters,
αa1 , βa1 , γa1 , δa1 are first-order SU(2) tensors, (4.3.7)
αk2 , βk2 , γk2 , δk2 are first-order SU(3) tensors.
212 Chapter 4 Unified Field Theory of Four Fundamental Interactions
δL
= Dgµ φνg ,
δ g µν
δL
= Deµ φ e ,
δ Aµ
(4.3.8)
δL
= Dwµ φaw ,
δ Wµa
δL
= Dsµ φks ,
δ Skµ
g
where φν is a vector field, and φ e , φ w , φ s are scalar fields.
With the PID equations (4.3.8), the PRI covariant unified field equations are then given
as follows:1
1 8π G
Rµν − gµν R = − 4 Tµν + Dgµ φνg , (4.3.9)
2 c
∂ µ (∂µ Aν − ∂ν Aµ ) − eJν = Deν φ e , (4.3.10)
F G
w
Gab ∂ µ Wµνb b αβ c
− gw λcd g Wαν Wβd − gwJν a = Dνw φaw , (4.3.11)
F G
j j αβ c
Gksj ∂ µ Sµν − gsΛcd g Sαν Sβd − gsQν k = Dνs φks , (4.3.12)
Jν = ψ e γν ψ e ,
Jν a = ψ w γ ν σa ψ w ,
Qν k = ψ s γν τk ψ s ,
(4.3.16)
δS c4 αβ w a b
Tµν = + g (GabWα µ Wβ ν + Gkls Sαk µ Sβl ν + Aα µ Aβ ν )
δ g µν 16π G
c4
− gµν (LEM + LW + LS ).
16π G
Remark 4.11 It is clear that the action (4.3.1)-(4.3.4) for the unified field model is
invariant under the U(1) × SU(2) × SU(3) gauge transformation as follows
D E % 1
&
e iθ e
ψ' , Aµ = e ψ , Aµ − ∂µ θ ,
'
e
D E % i
&
a
w 'a w a
ψ , Wµ σa = U ψ ,Wµ U σaU + ∂µ UU
' −1 −1
, U = eiθ σa ,
gw
D E % k i
&
k
(4.3.17)
iϕ τk s k
' s 'k
ψ , S τk = e ψ , Sµ Ωτk Ω + ∂µ ΩΩ
−1 −1
, Ω = eiϕ τk ,
gs
aσ aσ
' l = eiθ
m a
ml e−iθ a
,
iϕ k τ −iϕ k τ
m
'q = e k mq e k .
However, the equations (4.3.9)-(4.3.15) are not invariant under the transformation (4.3.17)
g g
due to the terms D µ φν , Deν φ e , Dwν φaw , Dsν φks on the right-hand sides of (4.3.9)-(4.3.12) con-
taining the gauge fields A µ ,Wµa and Skµ .
Hence, the unified field model based on PID and PRI satisfies the spontaneous gauge-
symmetry breaking as stated in Principle 4.4 and PRI.
There are a number of to-be-determined coupling parameters in the general form of the
unified field equations (4.3.9)-(4.3.15), and the SU(2) and SU(3) generators σ a and τk are
taken arbitrarily. With PRI we are able to substantially reduce the number of these to-be-
determined parameters in the unified model to two SU(2) and SU(3) tensors
w
Gab = δab , Gkls = δkl .
Hence we usually take the Pauli matrices σa and the Gell-Mann matrices λk as the SU(2)
and SU(3) generators.
For convenience, we first introduce dimensions of related physical quantities. Let E
represent energy, L be the length and t be the time. Then we have
$ √
(Aµ ,Wµa , Skµ ) : E/L, (e, gw , gs ) : EL,
√
E
(Jµ , Jµ a , Qµ k ) : 1/L3 , e w s
(φ , φa , φk ) : √ ,
LL
(h̄, c) : (Et, L/t), mc/h̄ : 1/L.
214 Chapter 4 Unified Field Theory of Four Fundamental Interactions
where mH and mπ represent the masses of φ w and φ s , and all the parameters (α , β , γ , δ )
on the right hand side of (4.3.20) with different super and sub indices are dimensionless
constants.
With (4.3.20) at our disposal, the unified field equations (4.3.9)-(4.3.15) can be simplified
in the form
0 1
1 8π G eα e gw αaw a gs αks k g
Rµν − gµν R = − 4 Tµν + ∇µ + Aµ + Wµ + S φ , (4.3.21)
2 c h̄c h̄c h̄c µ ν
F e gw gs G
∂ ν Aν µ − eJν = ∂µ + β e Aµ + βawWµa + βks Skµ φ e , (4.3.22)
h̄c h̄c h̄c
gw a αβ b c
∂ ν Wνaµ − εbc g Wα µ Wβ − gwJµa (4.3.23)
h̄c
0 1
e gw gs 1 D mH c E 2
= ∂µ + γ e Aµ + γbwWµb + γks Skµ − xµ φwa ,
h̄c h̄c h̄c 4 h̄
ν k gs k αβ i j k
∂ Sν µ − f i j g Sα µ Sβ − gs Q µ (4.3.24)
h̄c
0 1
e e gw w b gs s l 1 D mπ c E 2
= ∂µ + δ Aµ + δb Wµ + δl Sµ − xµ φsk ,
h̄c h̄c h̄c 4 h̄
(iγ µ Dµ − m)Ψ = 0, (4.3.25)
4.3 Unified Field Model Based on PID and PRI 215
where Ψ = (ψ e , ψ w , ψ s ), and
Aµν = ∂µ Aν − ∂ν Aµ ,
a gw a b c
Wµν = ∂µ Wνa − ∂ν Wµa + ε W W , (4.3.26)
h̄c bc µ ν
gs
Skµν = ∂µ Sνk − ∂ν Skµ + fikj Siµ Sνj .
h̄c
Equations (4.3.21)-(4.3.25) need to be supplemented with coupled gauge equations to
compensate the new dual fields (φ e , φwa , φsk ). In different physical situations, the coupled
gauge equations may be different.
From the field theoretical point of view instead of the field particle point of view, the
coefficients in (4.3.21)-(4.3.24) should be
and
(α1s , · · · , α8s ) = α s (ρ1 , · · · , ρ8 ),
(β1s , · · · , β8s ) = β s (ρ1 , · · · , ρ8 ),
(4.3.28)
(γ1s , · · · , γ8s ) = γ s (ρ1 , · · · , ρ8 ),
(δ1s , · · · , δ8s ) = γ s (ρ1 , · · · , ρ8 ),
with the unit modules:
.
|ω | = ω12 + ω22 + ω32 = 1,
.
|ρ | = ρ12 + · · · + ρ82 = 1,
using the Pauli matrices σa and the Gell-Mann matrices λk as the generators for SU(2) and
SU(3) respectively.
The two SU(2) and SU(3) tensors in (4.3.27) and (4.3.28),
are very important, by which we can obtain SU(2) and SU(3) representation invariant gauge
fields:
Wµ = ωaWµa , S µ = ρk Skµ . (4.3.30)
which represent respectively the weak and the strong interaction potentials.
216 Chapter 4 Unified Field Theory of Four Fundamental Interactions
In view of (4.3.27)-(4.3.30), the unified field equations for the four fundamental forces
are written as
0 1
1 8π G eα e gw α w gs α s
Rµν − gµν R + 4 Tµν = ∇µ + Aµ + Wµ + Sµ φνg , (4.3.31)
2 c h̄c h̄c h̄c
0 1
ν eβ e gw β w gs β s
∂ Aν µ − eJµ = ∂µ + Aµ + Wµ + Sµ φ e , (4.3.32)
h̄c h̄c h̄c
gw a αβ b c
∂ ν Wνaµ − ε g Wα µ Wβ − gw Jµa (4.3.33)
h̄c bc
0 1
1 eγ e gw γ w gs γ s
= ∂µ − kw2 xµ + Aµ + Wµ + Sµ φwa ,
4 h̄c h̄c h̄c
ν k gs k αβ i j k
∂ Sν µ − f i j g Sα µ Sβ − gs Q µ (4.3.34)
h̄c
0 1
1 2 eδ e gw δ w gs δ s
= ∂ µ − ks x µ + Aµ + Wµ + Sµ φsk ,
4 h̄c h̄c h̄c
(iγ µ Dµ − m)Ψ = 0. (4.3.35)
Aµ = (A0 , A1 , A2 , A3 ) (4.3.36)
They are SU(N) tensors with N = 2, 3, and have N 2 − 1 components. These components will
change under the transformation of SU(N) generators. Thanks to PRI, the N 2 − 1 (N = 2, 3)
4.3 Unified Field Model Based on PID and PRI 217
gauge fields in (4.3.40) can be combined into two vector fields as in (4.3.30):
Wµ = ωaWµa = (W0 ,W1 ,W2 ,W3 ),
(4.3.41)
S µ = ρk Skµ = (S0 , S1 , S2 , S3 ),
which have the same role as (4.3.36)-(4.3.39) for the electro-magnetic U(1) gauge fields.
In the same spirit as the electromagnetic fields, for the two fields given by (4.3.41), we
have
W0 = the weak force potential,
(4.3.42)
⃗ = the weak magnetic potential, W
W ⃗ = (W1 ,W2 ,W3 ),
and
S0 = the strong force potential,
(4.3.43)
⃗S = the strong magnetic potential, ⃗S = (S1 , S2 , S3 ).
In addition, the weak and strong charges gw and gs are
weak charge gw = SU(2) gauge coupling constant,
(4.3.44)
strong charge gs = SU(3) gauge coupling constant.
The weak and strong forces are given by
weak force: Fw = −gw ∇W0 ,
gw (4.3.45)
⃗,
weak magnetic force: Fwm = ⃗v × curl W
c
and
strong force : Fs = −gs ∇S0 ,
gs (4.3.46)
strong magnetic force : Fsm = ⃗v × curl ⃗S.
c
Remark 4.12 It is the PRI that provides a physical approach to combine the N 2 − 1
components of the SU(N) gauge fields into the forms (4.3.41)-(4.3.46) for the interacting
forces. With this physical interpretation, the electromagnetic, weak and strong interactions
can be regarded as a unified force, separated by three different interaction charges: the
electric charge e, weak charge gw , strong charge gs .
' = eiθ ψ ,
ψ 'µ = Aµ − 1 ∂µ θ .
A
e
It implies that A'µ , ψ
' are also solutions. Hence, the equations (4.3.47)-(4.3.48) have in-
finitely many solutions. However, in these solutions only one is physical. To find the phys-
ical solution, one has to provide a supplementary equation, called gauge-fixing equation,
F(Aµ ) = 0, (4.3.49)
such that the system (4.3.47)-(4.3.49) has a unique physical solution. Observe that
∂ µ ∂ ν (∂ν Aµ − ∂µ Aν ) = 0,
∂ µ ψγµ ψ = 0 (by (4.3.48)).
Only three equations in (4.3.47) are independent. Therefore the number of independent
equations in (4.3.47)-(4.3.49) is NEQ = 8, the same as the number of unknowns. Hence,
(4.3.47)-(4.3.49) are well-posed. Usually physical gauge-fixing equation (4.3.47) takes one
of the forms:
∂ A1 ∂ A2 ∂ A3
Coulomb gauge : + + 3 = 0,
∂ x1 ∂ x2 ∂x
Lorentz gauge : ∂ µ Aµ = 0,
Axial gauge : A3 = 0,
Temporal gauge : A0 = 0.
However, for the SU(N) gauge theory by PLD, the gauge-fixing problem is in general
not well-posed. In fact, the SU(N) gauge field equations are given by
where
Fνaµ = ∂ν Gaµ − ∂µ Gaν + gλbc
a b c
G µ Gν .
The equations (4.3.50)-(4.3.51) are invariant under the SU(N) gauge transformations
ψ
a
' = eiθ τa ψ , 'aµ τa = Gaµ eiθ b τb τa e−iθ b τb − 1 ∂µ θ a τa .
G (4.3.52)
g
Hence, if there is a solution for (4.3.50)-(4.3.51), then there are infinitely many solutions.
Thus, we have to supplement N 2 − 1 gauge-fixing equations in order to get a unique physical
solution:
Fa (Gµ ) = 0 for 1 ! a ! N 2 − 1. (4.3.53)
4.4 Duality and Decoupling of Interaction Fields 219
The reason why take N 2 − 1 equations in (4.3.53) are that there are N 2 − 1 free functions θ a
in (4.3.52).
Now, the gauge-fixing problem (4.3.50)-(4.3.51) with (4.3.53) is not well-posed either,
because the number of independent equations of (4.3.50) are 4(N 2 − 1) due to
∂ µ (ψγµ τa ψ ) ̸= 0.
4.4.1 Duality
In the unified field equations (4.3.21)-(4.3.24), there exists a natural duality between the
interaction fields (g µν , Aµ ,Wµa , Skµ ) and their corresponding dual fields (φ µg , φ e , φaw , φks ) :
g µν ↔ φµg ,
Aµ ↔ φ e,
(4.4.1)
Wµa ↔ φwa for 1 ! a ! 3,
Skµ ↔ φsk for 1 ! k ! 8.
Thanks to PRI, the SU(2) gauge fields Wµa (1 ! a ! 3) and the SU(3) gauge fields S kµ (1 !
k ! 8) are symmetric in their indices a = 1, 2, 3 and k = 1, · · · , 8 respectively. Therefore, the
220 Chapter 4 Unified Field Theory of Four Fundamental Interactions
corresponding relation (4.4.1) can be also considered as the following dual relation
gµν ↔ φµg ,
Aµ ↔ φ e,
(4.4.2)
{Wµa } ↔ {φwa },
{Skµ } ↔ {φsk }.
The duality relation (4.4.1) can be regarded as the correspondence between field particles
for each interaction, and the relation (4.4.2) is the duality of interacting forces. We now
address these two different dualities.
1. Duality of field particles. In the duality relation (4.4.1), if the tensor fields on the
left-hand side are of k-th order, then their dual tensor fields on the right-hand side are of
(k − 1)-th order. Physically, this amounts to saying that if a mediator for an interaction has
spin-k, then the dual mediator for the dual field has spin-(k − 1). Hence, (4.4.1) leads to the
following important physical conclusion:
The neutral Higgs H 0 (the adjoint particle of Z) had been discovered experimentally. We
remark that the duality (4.4.3) can also be derived using the weakton model (Ma and Wang,
2015b), which is also presented in the next chapter.
2. Duality of interacting forces. The correspondence (4.4.2) provides a dual relation
between the attracting and repelling forces. In fact, from the interaction potentials we find
that the even-spin fields yield attracting forces, and the odd-spin fields yield repelling forces.
Duality of Interaction Forces 4.14 Each interaction generates both attracting and
repelling forces. Moreover, for each pair of dual fields, the even-spin field generates an
attracting force, and the odd-spin field generates a repelling force.
4.4 Duality and Decoupling of Interaction Fields 221
Φ = g µν ∇µ ∇ν ϕ . (4.4.12)
The first term in the right-hand side of (4.4.11) is the Newton’s gravitational force, and the
second term (4.4.12) represents the repelling force generated by the dual field ϕ , and the
third term % 2 &
c 1 dϕ
− +
MG r dr
represents the force due to the nonlinear coupling of g µν and its dual ϕ . Formula (4.4.11)
can be approximatively written as
% &
1 k0
F = mMG − 2 − + k1 r ,
r r (4.4.13)
−18 −1 −57 −3
k0 = 4 × 10 km , k1 = 10 km .
The formula (4.4.13) shows that a central gravitational field with mass M has an attract-
ing force −k0 /r in addition to the Newtonian gravitational force. This explains the dark
matter phenomenon. Also there is a repelling force k1 r, which explains the dark energy
phenomenon; see (Ma and Wang, 2014e) for details.
∂ µ Jµ = 0. (4.4.16)
4.4 Duality and Decoupling of Interaction Fields 223
Equations (4.4.14) and (4.4.15) are the modified QED model. Taking divergence on
both sides of (4.4.14), by (4.4.16) and
∂ µ ∂ ν (∂ν Aµ − ∂µ Aν ) = 0,
The following are the two perspectives of the duality for electromagnetism.
1. Photon and dual scalar photon. If we view the field equations (4.4.17) and (4.4.18)
as describing the field particles, then we have
Jµ = 0, β = 0, in (4.4.17)-(4.4.18).
#Aµ + ∂µ (∂ ν Aν ) = 0, (4.4.24)
and the dual photon, also called the scalar photon, is described by the following equation
#φ e = 0. (4.4.25)
By (4.4.24) and (4.4.25) we deduce the following basic properties for photons and scalar
photons:
photon: J = 1, m = 0, Qe = 0,
(4.4.26)
scalar photon: J = 0, m = 0, Qe = 0.
Only the radially symmetric solutions of (4.4.29) and (4.4.30) are physical. The radial
solutions of (4.4.30) are constants
φ e = φ0 the constants.
Thus the equations (4.4.29) and (4.4.30), in the spherical coordinate system, are reduced as
follows
1 d 2d
(r )ϕ − kϕ = −4π eδ (r), (4.4.31)
r2 dr dr
4.4 Duality and Decoupling of Interaction Fields 225
e
where k = β φ0 . The solutions of (4.4.31) are expressed as
h̄c
e √
ϕ = e− kr . (4.4.32)
r
The parameter k is to be determined by experiments.
We discuss the solutions (4.4.32) in the following three cases.
1) Case k = 0. The solution (4.4.32) in this case is reduced to the Coulomb potential
(4.4.28);
2) Case k > 0. This solution is similar to the Yukawa potential for the strong interac-
tions of nucleons.
∂ µ ∂ ν Skµν = 0 for 1 ! k ! 8,
we deduce the following dual field equations for the strong interaction:
0% & 1
gs k l 1 m2π c2 gs
µ k
∂ ∂µ φs + ∂ µ
δ S − xµ φsk = −gs ∂ µ Qkµ − fikj gαβ ∂ µ (Sαi µ Sβj ).
h̄c l µ 4 h̄2 h̄c
(4.4.37)
226 Chapter 4 Unified Field Theory of Four Fundamental Interactions
Fsk (S µ , φs , ψ ) = 0, 1 ! k ! 8. (4.4.38)
Gluons and scalar gluons are described by equations (4.4.34) and (4.4.37) respectively,
which are nonlinear. In fact, gk and gk0 are confined in hadrons.
2. Strong force. The strong interaction forces are governed by the field equations
(4.3.31)-(4.3.35). The decoupled field equations are given by
0 1
ν k gs k αβ i j k 1 2 gs δ
∂ S ν µ − f i j g S α µ S β − g s Q µ = ∂ µ − ks x µ + Sµ φsk , (4.4.39)
h̄c 4 h̄c
1 gs δ µ
∂ µ ∂µ φsk − k2 φsk + ks2 xµ ∂ µ φsk + ∂ (S µ φsk ) (4.4.40)
4 h̄c
g s
= −gs ∂ µ Qkµ − fikj gαβ ∂ µ (Sαi µ Sβj ),
F G h̄c
gs mc
iγ µ ∂µ + i Slµ τl ψ − ψ = 0, (4.4.41)
h̄c h̄
for 1 ! k ! 8, where δ is a parameter, and S µ is as in (4.3.41).
In the next section, we shall deduce the layeredformulas of strong interaction potentials
from the equations (4.4.39)-(4.4.41) with gauge equations (4.4.38).
Remark 4.16 We need to explain the physical significance of the parameters k s and
δ . Usually, ks and δ are regarded as masses of the field particles. However, since (4.4.39)-
(4.4.41) are the field equations for the interaction forces, the parameters k s and δ are no
longer viewed as masses. In fact, k −1 represents the range of attracting force for the strong
% &−1
gs φs0
interaction, and δ is the range of the repelling force, where φs0 is a ground state
h̄c
of φs .
4.4 Duality and Decoupling of Interaction Fields 227
1. Duality between W ± , Z Bosons and Higgs Bosons H ± , H 0 . The three massive vec-
tor bosons, denoted by W ± , Z 0 , has been discovered experimentally. The field equations
(4.4.42) give rise to a natural duality:
Z0 ↔ H 0, W ± ↔ H ±, (4.4.47)
where H 0 , H ± are three dual scalar bosons, called the Higgs particles. The neutral Higgs H 0 ,
discovered by LHC in 2012, and the charged Higgs H ± , to be discovered experimentally.
In Section 4.6, we shall introduce the dual bosons (4.4.47) by applying the field equa-
tions (4.4.42)-(4.4.45) with gauge equations (4.4.46).
2. Weak force. If we consider the weak interaction force, we have to use the equations
decoupled from (4.3.31)-(4.3.35):
0 1
ν a gw a αβ b c 1 2 gw
∂ Wν µ − εbc g Wα µ Wβ − gw Jµ = ∂µ − kw xµ + γ Wµ φwa ,
a
(4.4.48)
h̄c 4 h̄c
gw 1
∂ µ ∂µ φwa − k2 φw2 + γ∂ µ (Wµ φwa ) − k2 xµ ∂ µ φwa (4.4.49)
h̄c 4
228 Chapter 4 Unified Field Theory of Four Fundamental Interactions
gw a αβ µ b c
= −gw ∂ µ Jµa − ε g ∂ (Wα µ Wβ ),
h̄c bc
gw mc
iγ µ (∂µ + i Wµa σa )ψ − ψ = 0, (4.4.50)
h̄c h̄
where γ , kw are constants, Wµ is as in (4.3.41).
In Section 4.6, we shall deduce the layered formulas of weak interaction potentials by
applying the equations (4.4.48)-(4.4.50).
Remark 4.17 The duality of four fundamental interactions is very important, and is a
direct consequence of PID. With this duality, and with the PRI invariant potentials Wµ and
Sµ given by (4.3.41), we obtain explanations for a number of physical problems such as the
dark matter and dark energy phenomena, the weak and strong forces and potential formulas,
the quark confinement, the asymptotic freedom, the strong potentials of nucleons etc. Also
this study leads to the needed foundation for the weakton model of elementary particles and
the energy level theory of subatomic particles, and gives rise to a mechanism for subatomic
decays and scatterings.
F = −gs ∇Φ0 .
However, the strong interactions are layered, i.e. the strong forces only act on the same
level of particles, such as the quark level, the hadron level. Hence, the strong interaction
potentials are also layered. In fact, in the next subsection we shall show that for a particle
with N strong charges gs of the elementary particles, its strong interaction potential is given
by 0 1
1 A
Φs = Ngs (ρ ) − (1 + kr)e −kr
,
r ρ
% &3 (4.5.2)
ρw
gs (ρ ) = gs ,
ρ
4.5 Strong Interaction Potentials 229
where ρw is the radius of the elementary particle (i.e. the w∗ weakton), ρ is the particle
radius, k > 0 is a constant with k−1 being the strong interaction attraction radius of this
particle, and A is the strong interaction constant, which depends on the type of particles.
Thus, the strong force between such two particles is
In this subsection, we shall deduce formula (4.5.4) for the w∗ -weaktons from the strong
interaction field equations (4.4.39)-(4.4.41). Taking inner product of the field equation
(4.4.39) and (4.4.40) with ρk = (ρ1 , · · · , ρ8 ), we derive that
0 1
ν gs αβ i j 1 2 gs δ
∂ S ν µ − λ i j g S α µ S β − g s Q µ = ∂ µ − k0 x µ + Sµ φs , (4.5.6)
h̄c 4 h̄c
% &
1 ∂2 1 gs
− ∆ φs + k02 φs + k02 xµ ∂ µ φs = gs ∂ µ Qµ + ∂ µ (λi j gαβ Sαi µ Sβj − δ S µ φs ),
c ∂t
2 2 4 h̄c
(4.5.7)
where
gs
φs = ρk φsk , Qµ = ρk Qkµ , λi j = ρk λikj , S µν = ∂µ Sν − ∂ν Sµ + λi j Siµ Sνj .
h̄c
Based on the superposition property of the strong potential for strong charges, Φ s = S0
and φs obey a linear relationship. Namely, the time-component µ = 0 equation of (4.5.6)
and the equation (4.5.7) should be linear. In other words, we have to take eight gauge fixing
equations as in (4.4.38) such that they contain the following two equations:
F G
λi j ∂ ν (Sνi S0j ) − gαβ Sαi o Sβj + δ S0 φs = 0,
F G (4.5.8)
∂ µ λi j gαβ Sαβ i S j − δ S φ = 0,
β µ s
xµ ∂ µ φs = 0, ∂ µ Sµ = 0, (4.5.9)
230 Chapter 4 Unified Field Theory of Four Fundamental Interactions
∂ S0 ∂ φs
= 0, = 0. (4.5.10)
∂t ∂t
With the gauge fixing equations (4.5.8)-(4.5.9) and the static assumption (4.5.10), we
derive from (4.5.6) and (4.5.7) that
1
−∆Φs = gs Q − k02 cτφs , (4.5.11)
4
−∆φs + k0 φs = gs ∂ µ Qµ ,
2
(4.5.12)
∂ µ Qµ = ρk ∂µ ψγ µ τk ψ + ρk ψγ µ τk ∂µ ψ .
Step 2. Solution of (4.5.11). The quantity gs Q = −gs Q0 is the strong charge density of
a w∗ -weakton, and without loss of generality, we assume that
Q = β δ (r), (4.5.19)
and β > 0 is a constant, inversely proportional to the volume of the w∗ -weakton. Hence
(4.5.11) can be rewritten as
gs A 1 −k0 r
− ∆Φs = gs β δ (r) + e , (4.5.20)
ρw r
k02 cτκ 1
A= with physical dimension .
4 L
Assume Φs = Φs (r) is radially symmetric, then (4.5.20) becomes
% &
1 d 2 d gs A 1 −k0 r
− 2 r Φs = gs β δ (r) + e ,
r dr dr ρw r
Inserting ϕ in (4.5.22) and comparing the coefficients of r k , then we obtain the following
relations
1
α1 = k0 α0 + ,
2
1 2 1
α2 = k0 α0 + k0 ,
2 3
..
. (4.5.24)
2k0
αk = αk−1 − k02 αk−2 , ∀k " 2.
k+1
where α0 is a free parameter with dimension L. Hence
r
ϕ (r) = α0 (1 + k0r + + o(r)),
2α0
and often it is sufficient to take a first-order approximation.
Step 3. Strong interaction potential of w∗ -weakton. The formula (4.5.21) with (4.5.23)-
(4.5.24) provides an accurate strong interaction potential for the w∗ -weaktons:
0 1
1 A0 r
Φ0 = gs β − ϕ'(r)e −k 0 , (4.5.25)
r ρw
where A0 = Aα0 /β is a dimensionless parameter, and ϕ'(r) is
r
ϕ' = 1 + k0r + + o(r),
2α0
which is a dimensionless function, and
Since gs β is to be determined for the elementary particle, we can take g s β as the strong
charge of the w∗ -weakton, still denoted by gs . In addition, it is sufficient to take approxima-
tively ϕ' = 1 + k0r. Then the formula (4.5.25) is rewritten as
0 1
1 A0 −k0 r
Φ0 = gs − (1 + k0r)e .
r ρw
It is the strong interaction potential given in (4.5.4).
4.5 Strong Interaction Potentials 233
q = (q1 , q2 , q3 ) (4.5.28)
where q1 , q2 , q3 are the wave functions of the three quarks constituting a nucleon, and
% &
igs
Dµ q = h̄c∂µ + Skµ τk q. (4.5.29)
h̄c
The action L defined by (4.5.27) is SU(3) gauge invariant. Physically this means that the
contribution of each quark to the strong interaction potential energy is indistinguishable.
With PID, the corresponding field equations for the action (4.5.27) are
% &
gs k2
∂ ν Sνk µ + λikj gαβ Sαi µ Sβj − gs Qkµ = ∂µ − 1 xµ φnk , (4.5.30)
h̄c 4
D gs E mc
iγ µ ∂µ + i Skµ τk q − q = 0, (4.5.31)
h̄c h̄
where the parameter k1 is defined by
1
r1 = = 10−13 cm, (4.5.32)
k1
which is the strong attraction radius of nucleons, and
Qkµ = qγµ τ k q (τ k = τk ).
Let the strong interaction nucleon potential Φkn and its dual potential φn be defined by
Φn = ρk S0k , φn = ρk φnk .
In the same spirit as for deriving the weakton potential equations (4.5.11) and (4.5.12), for
Φn and φn we deduce that
1
− ∆Φn = gs Qn − k12 cτ1 φn ,
4 (4.5.33)
− ∆φn + k12 φn = gs ∂ µ Qµ ,
234 Chapter 4 Unified Field Theory of Four Fundamental Interactions
Qµ = ρk Qkµ = (Q0 , Q1 , Q2 , Q3 )
represents the quark current density inside a nucleon. Similar to (4.5.15) and (4.5.19), for
Qn and ∂ µ Qµ we have
κn
∂ µ Qµ = − δ (r), Qn = βn δ (r), (4.5.34)
ρn
where ρn is the radius of a nucleon, βn and κn are constants, inversely proportional to the
volume of nucleons. Hence, in the same fashion as in deducing (4.5.25), from (4.5.33) and
(4.5.34) we derive the following strong nucleon potential as
0 1
1 An −k1 r
Φn = βn gs − ϕ (r)e , (4.5.35)
r ρn
where ϕ (r) is as
r
ϕ (r) = 1 + k1r + + o(r), α0 as in (4.5.24), (4.5.36)
2α0
and An is a dimensionless parameter:
κn k12 cτ1
An = .
4βn
Note that βn is inversely proportional to the volume Vn of a nucleon. Hence we have
% &3
βn NV0 ρw
= =N (N = 3), (4.5.37)
β Vn ρn
where N is the number of strong charges in a nucleon, β is the parameter as in (4.5.25)
and (4.5.26), and V0 the volume of a w∗ -weakton. Since gs β in (4.5.25) is taken as gs , by
(4.5.37) the formula (4.5.35) is expressed as
0 1
1 An
Φn = gs (ρn ) − ϕ (r)e−kn r ,
r ρn
% &3 (4.5.38)
ρw
gs (ρn ) = 3 gs ,
ρn
and ϕ (r) is as in (4.5.36).
In summary, for a particle with N strong charges and radius ρ , its strong interaction
potential can be written as
0 1
1 A
Φ = gs (ρ ) − ϕ (r)e −kr
,
r ρ
% &3 (4.5.39)
ρw
gs (ρ ) = N gs ,
ρ
4.5 Strong Interaction Potentials 235
quark confinement phenomena. The direct reason is that all current theories for interactions
fail to provide a successful strong interaction potential to explain the various level strong
interactions.
Now, we can derive the quark confinement by using the layered strong interaction po-
tentials (4.5.41).
The strong interaction bound energy E for two particles is given by (4.5.43). In particu-
lar, for two same particles we have
0 1
2 1 A
E = gs (ρ ) − (1 + kr)e −kr
,
r ρ
% &3 (4.5.45)
ρw
gs (ρ ) = gs .
ρ
The quark confinement can be well explained from the viewpoint of the strong quark
bound energy Eq and the nucleon bound energy En . In fact, by (4.5.45) we have
% &7
Eq Aq ρn
$ . (4.5.46)
En An ρq
Eq = 1020En . (4.5.48)
En ∼ 10−2 GeV.
which is at the Planck level. This clearly shows that the quarks is confined in hadrons, and
no free quarks can be found.
Remark 4.19 The magnitude order (4.5.42) and (4.5.47) are not accurate, but they are
enough precisely to explain the important physical problems as the quark confinement and
asymptotic freedom etc.
4.5 Strong Interaction Potentials 237
The strong interaction potentials provide also a natural explanation for the asymptotic free-
dom phenomena. To this end, we need to introduce the asymptotic freedom in two per-
sptives: the deep inelastic scattering experiments, and the QCD theory for the coupling
constant of quark potentials.
e− + p → e− + p. (4.5.49)
The collision is an inelastic scattering if the particles are changed after the collision. For
example the usual e-p inelastic scatterings are as follows
e− + p → e− + π + + n,
(4.5.50)
e− + p → e− + π 0 + p.
2. QCD theory for asymptotic freedom. The notion of asymptotic freedom was first
introduced by (Gross and Wilczek, 1973; Politzer, 1973), who were awarded the Nobel Prize
in 2004. By using the renormalization group, they derived the strong interaction coupling
constant of quarks as follows
4π
αs = % & , (4.5.51)
2
11 − n f ln(q /λ )
2 2
3
where λ is a parameter, n f = 6 is the flavor number of quarks, and q2 is the transfer mo-
mentum by the incidence electron.
In QCD, the coupling constant αs stands for the strength of the strong interactions be-
tween the quarks in a proton. It is clear that as the transfer momentum q 2 tends to infinitely
large, αs tends to zero:
αs → 0 as q2 → ∞. (4.5.52)
It means that the greater the incidence electron kinetics is, the more freedom the quarks
exhibit. The asymptotic freedom is based on the fact (4.5.51)-(4.5.52).
3. Strong interaction potential for asymptotic freedom. The strong interaction potentials
provide also a natural explanation for the asymptotic freedom phenomena. By (4.5.41) and
(4.5.44), the strong force between quarks is as
0 1
dΦq 1 Aq
F = −gs (ρq ) = g2s (ρq ) 2 − k12 re−k1 r ,
dr r ρq
which implies that there are two radii r 1 and r2 such that
This indicates that there is a free shell region inside a proton with radius R ≃ r 1 , such that
the three quarks are free in this shell region.
When a lower energy electron collides with the proton, the electromagnetic force cause
the electron moving away, leading to the elastic scattering as given by (4.5.49). However,
when a high speed electron collides with a proton, it can run into the inside of this proton,
interacting with one of the quarks. Since the quark was in a free shell region with no force
4.5 Strong Interaction Potentials 239
acting upon it, this particular quark will behavior as a free quark. As it moves into the
attracting region of the proton, the quark confinement will hold this quark, which, at the
same time, will collide with gluons, exchanging quarks, leading to the inelastic scatterings
as given in (4.5.50). This explains the asymptotic freedom.
Remark 4.20 The two phenomena of the quark confinement and the asymptotic free-
dom are consistent. On the one hand, the quarks in a proton are held tightly together by the
strong interaction to cause the quark confinement, and on the other hand, quarks look as if
they are free. The consistency is the direct consequence of the quark potential formula in
(4.5.41).
The conflicting characteristics of the strong interactions demonstrated in (4.5.54) and (4.5.55)
can hardly be explained by any existing theory. However, the layered strong interaction po-
tentials in (4.5.41) derived based on PID and PRI lead to a natural explanation of these
characteristics, as well as explanation of the quark confinement and asymptotic freedom in
previous subsections. In this subsection, we shall show that the strong interaction nucleon
potential (4.5.38), which can be regarded as a modified Yukawa potential, fits experiments.
Meanwhile, we also determine some parameters for the strong interaction potentials, in-
cluding the strong charge gs .
1. Experimental results. Experiments showed that the strong nucleon force has the
following properties:
• Nucleon force is of short-ranged, with the radius of the force range: r ∼ 2 × 10 −13 cm.
• Nucleon force has a repelling center and an attracting region: it is repulsive for r <
1 1
× 10−13 cm, is attractive for × 10−13 < r < 2 × 10−13 cm, and diminishes for
2 2
r > 2 × 10−13. Namely, experimentally, the nucleon force behaves as
⎧
⎪ 1 −13
⎨ > 0 for 0 < r < r = 2 × 10 cm,
⎪
⎪
F exp < 0 for r < r < 2 × 10−13 cm, (4.5.56)
⎪
⎪
⎪
⎩
$ 0 for r > 2 × 10−13 cm.
240 Chapter 4 Unified Field Theory of Four Fundamental Interactions
More precisely, the experimental data for the strong interaction nucleon potential can
be schematically shown in Figure 4.2; see (Weisskopf, 1972).
2. Yukawa potential. Based on the classical strong interaction theory, the potential
holding nucleons to form an atomic nucleus is the Yukawa potential
g 1
ΦY = − e−kn r , r1 = = 10−13 cm, (4.5.57)
r kn
where g is the Yukawa strong charge, and
g2 = 1 ∼ 10 h̄c. (4.5.58)
4. Parameters An and g2s . First, we use the experimental data in (4.5.56) to determine
the parameter An in (4.5.62). By (4.5.56) we know that
1
F exp = 0, at r = × 10−13 cm.
2
Hence, let
1
Fn = 0, at r = × 10−13 cm.
2
Then, it follows from (4.5.62) and (4.5.63) that
which leads to
% &6
2 ρn ρn
g2s = g2 .
9 (An kn r12 − eρn) ρw
4. The largest attraction force of Fn is achieved at r = 1.5r1 , and the force is given by
0 1
8e−1/2 g2 2 1
Fmax = − 3e − . (4.5.71)
8 − e1/2 r12 9
4.5 Strong Interaction Potentials 243
5. By (4.5.68), the attractive and repulsive regions of the strong nucleon force Fn are as
follows ⎧
⎪ > 0 for 0 < r < r1 /2,
⎨
Fn < 0 for r1 /2 < r < 9r1 , (4.5.72)
⎪
⎩
> 0 for 9r1 < r.
6. Based on (4.5.67), the theoretical curve of the strong interaction potential energy
for two nucleons is as shown in Figure 4.4, where we see that the theoretical result is in
agreement with the experimental curve shown in Figure 4.2.
Hence at the atomic and molecular scale, the ratio between strong repulsive force and the
electromagnetic force is
% &6
Fa ρw
= Ns2 g2s /Ne2 e2 , (4.5.76)
Fe ρa
where Ns is the number of strong charge, and Ne is the number of the electric charge. Note
that each nucleon has three strong charges, and the protons are almost the same as neutrons.
Therefore, we assume that
Ns = 6Ne .
In view of (4.5.70), the ratio (4.5.76) becomes
% &6
Fa ρn 2
= 4β 2 g2 /e2 , β2 = √ $ 0.2.
Fe ρa 8 e−e
g2 1 1
2
is in the range of ∼ .
e 137 13.7
Then, by ρn = 10−16 cm and (4.5.74), we derive that
>
Fa 10−50 at the atom level,
∼
Fe 10−56 at the molecular level.
Taking inner products of (4.4.48) and (4.4.49) with ωa respectively, we derive the field
equations of the two dual weak interaction potentials Φw and φw as follows
gw 1 gw
∂ ν Wν µ − κab gαβ Wαaµ Wβb − gwJµw = (∂µ − k2 xµ + γ Wµ )φw , (4.6.3)
h̄c 4 h̄c
0 1
1 ∂2
− ∆ φw + k02 φw − gw ∂ µ Jµw (4.6.4)
c2 ∂ t 2
gw F G 1
= ∂ µ κab gαβ Wαaµ Wβb − γ Wµ φw − k02 xµ ∂ µ φw ,
h̄c 4
c ω , W is as in (4.6.1), φ = ω φ a , and
where κab = εab c µ w a w
gw
Jµw = ωa ψγµ σa ψ , Wµν = ∂µ Wν − ∂ν Wµ + κabWµaWνb . (4.6.5)
h̄c
Experiments showed that the SU(2) gauge fields Wµa for weak interaction field particles
possess masses. In addition, the dual Higgs fields φwa of Wµa also have masses. In (4.6.3),
there is no massive term of Wµ . However, we see that on the right-hand side of (4.6.3) there
is a term
gw
γφwWµ , (4.6.6)
h̄c
which is spontaneously generated by PID, and breaks the gauge-symmetry. Namely, (4.6.6)
will vary under the following SU(2) gauge transformation
1
Wµa → Wµa − εbc
a b c
θ Wµ − ∂µ θ a
gw
We shall show that it is the spontaneous symmetry breaking term (4.6.6) that generates mass
from the ground state of φw .
It is clear that the following state
is a solution of (4.6.3) and (4.6.4), which is a ground state of φwa . Let a0 = φ0a ωa , which is a
constant. Take the transformation
φw → φw + a0 , Wµa → Wµa , ψ → ψ.
$
where k1 = gw γ a0 /h̄c represents mass.
Thus, (4.6.7) and (4.6.8) have masses as
where mH and mW are the masses of Higgs and W ± bosons. Physical experiments measured
the values of mH and mW as
By (4.4.46), equations (4.6.7) and (4.6.8) need to add three gauge fixing equations.
Based on the superposition property of the weak charge forces, the dual potentials W0 and
φw should satisfy linear equations, i.e. the time-component µ = 0 equation of (4.6.7) and
the equation (4.6.8) should be linear. Therefore, we have to take the three gauge fixing
equations in the following forms
F G h̄c a0 k02
κab gαβ Wαa0Wβb − ∂ µ (WµaW0b ) + γ W0 φw − x0 = 0,
gw 4
gw µ F G k2 (4.6.11)
∂ κab gαβ Wαaµ Wβb − γ Wµ φw − 0 xµ ∂ µ φw − k02 a0 = 0,
h̄c 4
∂ µ Wµ = 0,
and with the static conditions
∂ ∂
Φw = 0, φw = 0, (Φw = W0 ). (4.6.12)
∂t ∂t
With the equations (4.6.11) and the static conditions (4.6.12), the time-component µ = 0
equation of (4.6.7) and its dual equation (4.6.8) become
1
− ∆Φw + k12 Φw = gw Qw − k02 cτφw , (4.6.13)
4
− ∆φw + k02 φw = gw ∂ µ Jµw , (4.6.14)
where cτ is the wave length of φw , Qw = −J0w , and Jµw is as in (4.6.5). The two dual equations
(4.6.13) and (4.6.14) for the weak interaction potentials Φw and φw are coupled with the
Dirac equations (4.4.50), written as
D gw E mc
iγ µ ∂µ + i Wµa σa ψ − ψ = 0. (4.6.15)
h̄c h̄
In addition, by (4.6.9) and (4.6.10) we can determine values of the parameters k 0 and k1 as
follows
k0 = 2k1 , k1 = 1016 cm−1 . (4.6.16)
The parameters 1/k0 and 1/k1 represent the attracting and repulsive radii of weak interaction
forces.
In the next subsection, we shall apply the equations (4.6.13)-(4.6.15) to derive the lay-
ered formulas of weak interaction potentials.
4.6 Weak Interaction Theory 247
2gw
∂ µ Jµw = − ⃗ µ ) · J⃗µ ,
ω ×W
(⃗ (4.6.21)
h̄c
248 Chapter 4 Unified Field Theory of Four Fundamental Interactions
µ µ µ µ
and J⃗µ = (J1 , J2 , J3 ). The weak current density Ja is as
where Bρw ⊂ Rn is the ball with radius ρw . Similar to the case (4.5.14) for the strong
interaction, the average value (4.6.22) is
⃗ µ = ⃗ζµ /ρw ,
W ⃗ζµ = (ζ 1 , ζ 2 , ζ 3 ).
µ µ µ
Qw = β δ (r),
and β is a scaling factor. We can take proper unit for gs such that β = 1. Thus, putting
Qw = δ (r) in (4.6.26) we derive that
gw B 1 −k0 r
− ∆Φw + k12 Φw = gw δ (r) + e . (4.6.27)
ρw r
4.6 Weak Interaction Theory 249
Let the solution of (4.6.27) be radially symmetric, then the equation (4.6.27) can be equiv-
alently written as
% &
1 d 2 d gw B 1 −k0 r
− 2 r Φw + k12 Φw = gw δ (r) + e . (4.6.28)
r dr dr ρw r
The solution of (4.6.28) can be expressed as
gw −k1 r gw B
Φw = e − ϕ (r)e−k0 r , (4.6.29)
r ρw
where ϕ (r) satisfies the equation
% & % &
1 2k0 2 2 1
ϕ +2
′′
− k0 ϕ −
′
+ k1 − k0 ϕ = . (4.6.30)
r r r
Let ϕ be in the form
∞
ϕ= ∑ βn r n (the dimension of ϕ is L).
n=0
ϕ = β0 (1 + k0r).
3. Repulsive condition. For the two particles as above, if their weak interaction constant
B satisfies the inequality
1 k 4B 2 −kr 1
+ " k re , ∀0 < r ! ,
r2 r ρ k
or equivalently B satisfies
e
B ! ρ k (e = 2.718, k = 1016 cm−1 ), (4.6.34)
2
then the weak force between these two particles is always repulsive.
It follows from this conclusion that for the neutrinos:
(ν1 , ν2 , ν3 ) = (νe , νµ , ντ ),
(ν 1 , ν 2 , ν 3 ) = (ν e , ν µ , ν τ ),
4.6 Weak Interaction Theory 251
8 D mW c E2
G2w = √ Gf , (4.6.35)
2 h̄
and G f is given by
D m c E2
p
G f = 10−5h̄c/, (4.6.36)
h̄
where mW and m p are masses of W ± bosons and protons. By the gauge theory, Gw is also the
coupling constant of SU(2) gauge fields. Therefore we can regard G w as the weak charge
of nucleons, i.e.
Gw = gw (ρn ), ρn the nucleon radius.
In addition, it is known that % &3
ρw
gw (ρn ) = 9 gs .
ρn
Hence, we deduce from (4.6.35) and (4.6.36) that
% &6 % &2
ρw 8 mw
81 g2w = √ × 10−5h̄c.
ρn 2 mp
In a comparison with (4.5.66) and (4.6.37), we find that the strong charge g s and the
weak charge gw have the same magnitude order. Direct computation shows that
g2w gw
≃ 0.35 equivalently ≃ 0.6.
g2s gs
We know that the interaction field equations are oriented toward two directions: i) to
derive interaction forces, and ii) to describe the field particles and derive the particle transi-
tion probability. The PID-PRI weak interaction field equations describing field particles are
given by (4.4.42)-(4.4.45). Here, for convenience, we write them again as follows:
% &
gw a αβ b c 1 D mH c E 2 gw
∂ ν Wνaµ − εbc g Wα µ Wβ − gw Jµa = ∂µ − xµ + γbWµb φ a , (4.6.38)
h̄c 4 h̄ h̄c
D m c E2 g
H w
− ∂ µ ∂µ φ a + φ a − γb ∂ µ (Wµb φ a ) (4.6.39)
h̄ h̄c
gw a αβ µ b c 1 D mH c E2
= εbc g ∂ (Wα µ Wβ ) + gw∂ µ Jµa − xµ ∂ µ φ a ,
h̄c 4 h̄
where mH is the Higgs particle mass, and Wµa (1 ! a ! 3) describe the intermediate vector
bosons as follows
W ± : Wµ1 ± iWµ2 ,
(4.6.40)
Z: Wµ3 ,
H ± : φ 1 ± iφ 2 ,
(4.6.41)
H0 : φ 3.
To match (4.6.40) and (4.6.41), we take the SU(2) generator transformation as follows
⎛ ⎞ ⎛ ⎞⎛ ⎞
σ'1 1 i 0 σ1
⎝σ'2 ⎠ = √1 ⎝1 −i 0 ⎠ ⎝σ2 ⎠ .
2 0 0 √
σ'3 2 σ3
D E
(W' µ1 , W 'µ3 ) = (Wµ± , Zµ ) = √1 (Wµ1 ± iWµ2 ),Wµ3 ,
'µ2 , W
2
D 1 E
(φ' , φ' , φ' ) = (φ , φ ) = √ (φ 1 ± iφ 2 ), φ 3 .
1 2 3 ± 0
2
The equations (4.6.38) and (4.6.39) obey PRI, and under the above transformation they
4.6 Weak Interaction Theory 253
becomes
igw αβ ±
∂ ν Wν±µ ± g (W )α µ Zβ − Zα µ Wβ± ) − gwJµ± (4.6.43)
0 h̄c 1
2 1 D mH c E2
= ∂µ + kW Wµ± + kZ2 Zµ − xµ φ ± ,
4 h̄
ig w αβ
∂ ν Zν µ − g (Wα+µ Wβ− − Wα−µ Wβ+ ) − gwJµ0 (4.6.44)
0 h̄c 1
2 2 1 D mH c E2
= ∂µ + kW Wµ + kZ Zµ −
±
xµ φ 0 ,
4 h̄
Dm cE gw
H
#H ± + H ± − ∂ µ (γ'bW ' µb H ± ) (4.6.45)
h̄ h̄c
gw αβ µ ± ' b ' c 1 D mH c E 2
= g ∂ (' εbcWα µ Wβ ) + gw ∂ µ Jµ± − xµ ∂ µ H ± ,
h̄c 4 h̄
D m c E2 gw
H
#H 0 + H 0 − ∂ µ (γ'bW ' µb H 0 ) (4.6.46)
h̄ h̄c
gw αβ µ 0 ' b ' c 1 D mH c E 2
= g ∂ (' εbcWα µ Wβ ) + gw ∂ µ Jµ0 − xµ ∂ µ H 0 ,
h̄c 4 h̄
1
where H ± = √ (φ 1 ± iφ 2 ), H 0 = φ 3 in (4.6.45) and (4.6.46), and
2
1
Jµ± = √ (Jµ1 ± iJµ2 ) the charged current,
2
Jµ0 = Jµ3 the neutral current,
igw (4.6.47)
Wν±µ = ∂ν Wµ± − ∂µ Wν± ± (Zµ Wν± − Zν Wµ± ),
h̄c
igw + −
Zν µ = ∂ν Zµ − ∂µ Zν + (W W − Wν+Wµ− ),
h̄c µ ν
and
2 gw γ1 gw γ3
kW =√ , kZ2 = , (4.6.48)
2h̄c h̄c
which are derived by the following transformation
⎛ 2⎞ ⎛ ⎞⎛ ⎞
kW 1 i 0 γ1
g
2 ⎠ = √ w ⎝1 −i ⎠ ⎝γ2 ⎠ ,
⎝kW
√0
k2 2h̄c 0 0 2 γ3
Z
First, we note that these equations are nonlinear, and consequently, no free weak inter-
action field particles appear.
254 Chapter 4 Unified Field Theory of Four Fundamental Interactions
Second, there are two important solutions of (4.6.43)-(4.6.44), dictating two different
weak interaction procedures.
The first solution sets
Wµ± = 0, φ 0 = 1. (4.6.49)
1 D mH c E 2
#Zµ + kz2 Zµ = −gw Jµ0 + xµ . (4.6.50)
4 h̄
The second solution takes
Zµ = 0, φ ± = 1. (4.6.51)
1 D mH c E 2
#Wµ± + kw2 Wµ± = −gw Jµ± + xµ . (4.6.52)
4 h̄
We are now ready to obtain the following physical conclusions for the weak interaction
field particles.
W ± ↔ H ±, Z ↔ H 0.
2. Lack of freedom for field particles. Due to the nonlinearity of (4.6.43)-(4.6.46), the
weak interaction field particles W ± , Z, H ± , H 0 are not free bosons.
h̄ h̄
mW = kW , mZ = k Z ,
c c
and kW and kZ are as in (4.6.48).
4. Basic properties of field particles. From (4.6.43)-(4.6.46) we can obtain some basic
information for the bosons as follows:
xµ ∂ µ φ a = 0 for 1 ! a ! 3,
Wµa = 0 for 1 ! a ! 3,
where νL and lL are the left-hand neutrino and lepton, lR is the right-hand lepton, (φ + , φ 0 )
are scalar fields with electric charge (1,0).
256 Chapter 4 Unified Field Theory of Four Fundamental Interactions
Dirac equations:
iγ µ (∂µ + ig2Bµ )R − Gl ϕ lL = 0,
% & % &
g2 g1 ν 0 (4.6.62)
iγ µ (∂µ + i Bµ − i Wµa σa ) L − Gl R = 0.
2 2 lL ϕ
Here
Jµa = Lγµ σa L 1 ! a ! 3,
JµL = ν L γµ νL + l L γµ lL ,
JµR = Rγµ R, γµ = g µα γ α .
5. Masses are generated at the ground states. It is clear that the following state
ϕ = a, Wµa = 0, Bµ = 0, L = 0, R = 0,
258 Chapter 4 Unified Field Theory of Four Fundamental Interactions
and √
m1 c g 1 a m2 c g 2 a m3 c g1 g2 a
=√ , =√ , = √ .
h̄ 2 h̄ 2 h̄ 2
6. The masses mW and mZ can be derived from (4.6.63) and (4.6.64) as follows. Ac-
cording to the IVB theory for particle transition, the W ± particles are characterized as
W ± : Wµ1 ± iWµ2 .
which requires the following transformation of SU(2) generators from the Pauli matrices
σa : ⎛ ⎞ ⎛ ⎞⎛ ⎞
σ'1 1 i 0 σ1
⎝σ'2 ⎠ = √1 ⎝0 −i 0 ⎠ ⎝σ2 ⎠ .
2 0 √
σ'3 0 2 σ3
4.6 Weak Interaction Theory 259
In addition, we also need to obtain the mass mZ of Z boson by diagonalizing the massive
matrix (4.6.64). It is easy to see that
⎛ 2 ⎞
mW 0 0 0
c2 ⎜ 0 mW 2 0 0⎟
UMU † = 2 ⎜ ⎝ 2
⎟,
⎠
h̄ 0 0 m Z 0
0 0 0 0
⎛ 1 i ⎞
√ √ 0 0 (4.6.66)
⎜ 2 2 ⎟
⎜ ⎟
⎜ 1 i ⎟ g1 g2
U =⎜ ⎜ √2 − √2 0 0⎟⎟, α= , β= ,
⎜ ⎟ |g| |g|
⎝ 0 0 α β⎠
0 0 −β α
.
where |g| = g21 + g22 , and
c2 mW 2 a2 2 c 2 mZ a2 2
= g , = |g| . (4.6.67)
h̄2 2 1 h̄2 2
7. The field equations governing W ± and Z bosons are obtained from the equations
(4.6.63) under the following transformation
⎛ +⎞ ⎛ 1⎞
Wµ Wµ
⎜Wµ− ⎟ ⎜Wµ2 ⎟
⎜ ⎟ ⎜ ⎟
⎝ Zµ ⎠ = U ⎝W 3 ⎠ for U as in (4.6.66). (4.6.68)
µ
Aµ Bµ
In this case, the equations (4.6.63) become
⎛ D E2 ⎞
∂ ν W + − mW c W + ⎛g ⎞
⎜ νµ µ ⎟ 1 +
⎜ h̄ √
⎟ ⎜ 2 ⎟ J µ
⎜ D E ⎟
⎜∂ ν W − − mW c 2 W − ⎟ ⎜ ⎜√g1 − ⎟ ⎟
⎟ = ⎜ 2 Jµ ⎟
⎜ νµ µ ⎟
⎜ h̄ ⎜
⎟ + higher order terms. (4.6.69)
⎜ D m c E2 ⎟ ⎜
⎜ ν Z ⎟ ⎝ |g|J 0 ⎟
⎜ ∂ Zν µ − Zµ ⎟ µ ⎠
⎝ h̄ ⎠ em
−eJµ
∂ ν Aν µ
where e = g1 g2 /|g|, and
1
Jµ± = (Jµ1 ± iJµ2 ),
2
1
Jµ0 = (g2 J 3 − g22 JµL − 2g22JµR ),
2|g|2 1 µ
1
Jµem = (Jµ3 + JµL + 2JµR).
2
260 Chapter 4 Unified Field Theory of Four Fundamental Interactions
8. The Higgs field equation governed H 0 boson is given by (4.6.61), and at the ground
state ϕ = a which can be written as
D m c E2 1
H
∂ µ ∂µ ϕ − ϕ = (ϕ + a)(g21WµaW µ a + g22Bµ Bµ − 2g1g2Wµ3 Bµ ) (4.6.71)
h̄ 4
+ λ ϕ (ϕ 2 + 2aϕ ) + Gl (l L R + RlL ),
and we don’t know which of these potentials plays the role of weak interaction potential. In
fact, with the mixed fields
Wµ± , Zµ , Aµ ,
it is even more difficult to determine the weak force.
If we combine the classical electroweak theory with PRI, and take
as the weak interaction potential, then from the field equations (4.6.60), we can only deduce
the weak force potential in the following form:
g1 −kr g1 −kr 1
Φw = e or Φw = − e , k = √ ag1 .
r r 2
4.6 Weak Interaction Theory 261
It implies that the weak force is only repulsive or attractive, which is not consistent with
experiments.
2. Violation of PRI. In the classical electroweak theory, a key ingredient is the linear
combinations of Wµ3 and Bµ as given by (4.6.70). By PRI,
their field equations (4.6.69) must vary under general SU(2) generator transformations as
follows
σ'a = xba σb . (4.6.74)
3. Decoupling obstacle. The classical electroweak theory has a difficulty for decoupling
the electromagnetic and the weak interactions. In reality, electromagnetism and weak inter-
action often are independent to each other. Hence, as a unified theory for both interactions,
one should be able to decouple the model to study individual interactions. However, the
classical electroweak theory manifests a radical decoupling obstacle.
In fact, it is natural to require that under the condition
Wµ± = 0, Zµ = 0, (4.6.75)
the WS field equations (4.6.69) should return to the U(1) gauge invariant Maxwell equa-
tions. But we see that
Aµ = cos θw Bµ − sin θwWµ3 ,
where B µ is a U(1) gauge field, and Wµ3 is a component of SU(2) gauge field. Therefore,
Aµ is not independent of SU(2) gauge transformation. In particular, the condition (4.6.75)
means
Wµ1 = 0, Wµ2 = 0, Wµ3 = − tgθw Bµ . (4.6.76)
It implies that under a transformation (4.6.74), a nonzero weak interaction can be generated
from a zero weak interaction field of (4.6.75)-(4.6.76):
' µ± ̸= 0,
W Z'µ ̸= 0 as ya3 ̸= 0 (1 ! a ! 3),
'µ = 0
A as y33 = cot θw .
This conclusion motivates us to propose a weakton model for sub-lepton, sub-quark, and
sub-mediators (Ma and Wang, 2015b), based on the new field theory and the new insights
for the weak and strong interactions presented in the last chapter.
One important theoretical basic for the weakton model is the field theory developed in
the last chapter. In particular, the weak and strong charges are responsible for the weak
264 Chapter 5 Elementary Particles
and strong interactions. The confinement property demonstrated by the weak and strong
interaction potentials give rise to the needed confinement for the weakton constituents of
the composite particles.
Remarkably, the weakton model offers a perfect explanation for all sub-atomic decays.
In particular, all decays are achieved by 1) exchanging weaktons and consequently exchang-
ing newly formed quarks, producing new composite particles, and 2) separating the new
composite particles by weak and/or strong forces.
One aspect of this decay mechanism is that now we know the precise constituents of
particles involved in all decays/reactions both before and after the reaction. It is therefore
believed that the new decay mechanism provides clear new insights for both experimental
and theoretical studies.
This chapter is organized as follows. Sections 5.1 and 5.2 recall the basic knowledge of
particle physics and the quark model. Sections 5.3 and 5.4 are based entirely on (Ma and
Wang, 2015b), with the weakton model introduced in Section 5.3, and with the mechanism
of subatomic decays and electron radiations presented in Section 5.4. The last section,
Section 5.5 studies the color algebra associated with the color quantum number, leading
to detailed structure on mediator clouds around subatomic particles. Section 5.5 is based
entirely on (Ma and Wang, 2014b).
where e, µ , τ are the electron, the muon, the tau, and νe , νµ , ντ are the e-neutrino, the µ -
neutrino, the τ -neutrino. Together with their antiparticles, there are total 12 leptons:
5.1 Basic Knowledge of Particle Physics 265
particles: (e− , νe ), (µ − , νµ ), (τ − , ντ ),
antiparticles: (e+ , ν e ), (µ + , ν µ ), (τ + , ν τ ).
2. Quarks. Based on the standard model, there are three generations of quarks consisting
of 6 particles and 6 antiparticles, which participate all four fundamental interactions:
quarks: (u, d), (c, s), (t, b),
antiquarks: (u, d), (c, s), (t, b),
where u, d, c, s,t, b are the up quark, down quark, charm quark, strange quark, top quark,
and bottom quark.
The quark model asserts that three quarks are bounded together to form a baryon, and
a pair of quark and antiquark are bounded to form a meson. As mentioned in Section
4.5.3, quarks are confined in hadrons, and no free quarks have been found in Nature. This
phenomena is called the quark confinement, which has been very well explained by the
layered formulas of strong interactions in the last chapter.
3. Mediators. The standard model shows that each interaction is associated with a class
of field particles, called mediators. Hence, there are four classes of mediators:
Gravitation: graviton G,
Electromagnetism: photon γ ,
Weak interaction: vector and Higgs bosons W ± , Z, H ± , H 0 ,
Strong interaction: gluons gk (1 ! k ! 8).
By the unified field theory introduced in the last chapter, there exists a natural duality
between the interacting fields {g µν , Aµ ,Wµa , Skµ } and their dual fields {φ µg , φ e , φwa , φsk }:
gµν ↔ φµg ,
Aµ ↔ φ e ,
Wµa ↔ φwa , 1 ! a ! 3,
Skµ ↔ φsk , 1 ! k ! 8.
This duality leads to four classes of new dual particle fields, called the dual mediators:
tensor graviton G ↔ vector graviton g,
vector photon γ ↔ scalar photon γ0 ,
±
charged vector bosons W ↔ charged Higgs bosons H ± , (5.1.1)
0
neutral vector boson Z ↔ neutral Higgs boson H ,
k
vector gluons g ↔ scalar gluons gk0 , 1 ! k ! 8,
where the upper index k in gk and gk0 represents the color index of gluons.
266 Chapter 5 Elementary Particles
The mediator duality (5.1.1) is derived from the unified field theory. However, the dual-
ity (5.1.1) can also be naturally deduced from the weakton model postulated in Section 5.3.
Namely, the mediator duality (5.1.1) is derived from two very different perspectives in two
different theories. This can hardly be a coincident.
4. Hadrons. Hadrons are classified into two types: baryons and mesons. Baryons are
fermions and mesons are bosons, which are all made up of quarks:
Baryons = qi q j qk , mesons = qi q j ,
where qk = {u, d, c, s,t, b}. The quark constituents of main hadrons are listed as follows:
1
• Baryons with J = :
2
√
p (uud), n (udd), Λ (s(du − ud)/ 2),
√
Σ+ (uus), Σ− (dds), Σ0 (s(du + ud)/ 2),
Ξ0 (uss), Ξ− (dss).
3
• Baryons with J = :
2
• Mesons with J = 0:
√
π + (ud), π − (ud), π 0 ((uu − dd)/ 2),
K + (us), K − (us), K 0 (ds),
0 √
K (ds), η ((uu + dd − 2ss)/ 6).
• Mesons with J = 1:
√
ρ + (ud), ρ − (ud), ρ 0 ((uu − dd)/ 2),
K ∗+ (us), K ∗− (us), K ∗0 (ds),
∗0
K (ds), ψ (cc), γ (bb).
parameters. There are numerous types of particles, which obey various conservation laws.
Each conservation law is characterized by one parameter. These physical parameters are
called quantum numbers.
The main quantum numbers of particles are:
mass m, electric charge Q, lifetime τ ,
spin J, lepton number L, baryon number B,
(5.1.2)
parity π , isospin (I, I3 ), strange number S,
G-parity, hypercharge Y.
In addition, there are some new quantum numbers, including the weak charge Q w , strong
charge Qs , and colour index k, which play an important role in in the weakton model intro-
duced in Section 5.3.
We now briefly introduce the quantum numbers listed in (5.1.2).
1. Mass. This is a quantity to characterize the inertia, and also plays the role of gravita-
tional charge. In the macro-world, masses are continuous in distribution, however in quan-
tum world masses are discrete. The same particles have the same masses, but the different
massive particles have a gap between their masses.
In non-quantum physics, the masses are additive, i.e. if a body A consists of two bodies
B and C, then the mass mA of A is the sum of masses mB and mC of B and C:
A = B + C ⇒ mA = mB + mC . (5.1.3)
Q = ±ne (n = 0, 1, 2, · · · ).
A1 + · · · + A n → B1 + · · · + B N ,
we have
n N
∑ QA k = ∑ QB k ,
k=1 k=1
1 In the weakton model of elementary particles, w∗ carries 2/3 electric charge, and w and w carry −1/3
1 2
and −2/3 electric charges respectively.
268 Chapter 5 Elementary Particles
where QAk and QBk are the electric charges of particles Ak and Bk
3. Lifetime of particles. Due to the decay property for most particles, the lifetime
becomes a quantum number. Except the long lifetime particles: the electron e, proton p,
neutrino v, photon γ , gluon gk , graviton G, all other particles undergo decays.
Usual particles with decay have very short lifetime, and in general their lifetimes τ do
not exceed 10−5s:
τ < 10−5s on an average.
The neutron n is special, and has a longer lifetime:
τ = 885.7s on an average.
Figure 5.1 (a) right-hand spin(J >0), and (b) left-hand spin(J < 0).
only take three values L = ±1, 0. For all non-lepton particles, their lepton number is zero,
and
⎧
⎪ 1 for e− , νe ,
⎨
Le = − 1 for e+ , ν e ,
⎪
⎩
0 others,
⎧
⎪ 1 for µ − , νµ ,
⎨
Lµ = − 1 for µ + , ν µ ,
⎪
⎩
0 others,
⎧
⎪ 1 for τ − , ντ ,
⎨
Lτ = − 1 for τ + , ν τ ,
⎪
⎩
0 others.
6. Baryon number B. The hadrons are classified into baryons and mesons. Baryons are
fermions and mesons are bosons. Also, baryons have an additive and conservative quantum
number: the baryon number B, defined by
⎧
⎪ 1 for a baryon,
⎨
B = −1 for an antibaryon,
⎪
⎩
0 for all other particles.
7. Parity π . Due to the Noether Theorem 2.38, each symmetry is associated with a
conservation law. The parity is a conservative quantum number corresponding to the spatial
reflection symmetry:
x → −x (x ∈ R3 ).
The weak interaction violates parity; see (Lee and Yang, 1956; Wu, Ambler, Hayward,
Hoppes and Hudson, 1957). The parities π of most particles are obtained by two methods:
the experimental and the artificial means.
The parity is a multiplication quantum number. For example, for an N particles system:
A = A1 + · · · + AN ,
where l is the sum of orbital quantum numbers of all particles, and π Ak is the parity of Ak
particle.
The parity conservation is defined as follows. For a particle reaction:
A1 + · · · + AN −→ B1 + · · · + BK , (5.1.5)
In Subsection 5.1.4, we shall introduce the violation of parity conservation in the weak
interaction in detail.
8. Isospin (I, I3 ). The isospin was first presented in 1932 by Heisenberg, and was used to
describe that the strong interaction between protons and neutrons, independently of electric
charges. Later, along with the development of particle physics, it was discovered that the
isospin is a good quantum number for all hadrons.
Isospin has two components (I, I3 ), where I is called the isospin and I3 is the third
component of isospin. In a many-particle system, the isospin I obeys the vectorial additive
rule and I3 obeys the usual additive rule. We shall introduce the isospin again in Subsection
5.1.4.
9. Strange number S. In 1947, C. D. Rochester and C. C. Butler first discovered the
strange particle K 0 , a neutral meson, from cosmic rays, and later many strange particles
such as K ± , Λ, Σ± , Ξ− , Ξ0 , Ω− were found. They are named the ”strange” because these
particles possess a kind quantum number called the strange number.
The strange number S is an additive quantum number, which is conserved only in the
electromagnetic and strong interactions, and takes the integral values:
S = 0, 1, 2, 3, · · · .
Y = S + B,
where S and B are the strange number and the baryon number.
11. G-parity. G parity is a conservative quantity only for strong interactions under the
G-transformation
Gψ = Ĉeiπ I2 ψ ,
where Ĉ is the transformation of particles and antiparticles:
n → p + e− + ν e . (5.1.6)
1) Decays are caused by the three fundamental forces: the electromagnetic, weak, strong
interactions, and, therefore, are mainly classified into two types: weak interaction
decays and strong interaction decays;
2) Decays always take place from higher masses toward to lower masses. For example
in the β -decay (5.1.6), the masses of particles on the right-hand side are smaller than
the mass of the neutron n.
3) Both strong and weak interaction decays have to obey some basic conservation laws,
as shown in the next subsection.
Remark 5.1 The following transition
p → n + e+ + νe (5.1.7)
is often called β -decay as well. In fact, in Nature the process (5.1.7) cannot spontaneously
take place, and it always occurs under certain energetic excitation. Hence, (5.1.7) is an
excited scattering.
p + γ → n + e+ + νe .
272 Chapter 5 Elementary Particles
A1 + · · · + A N → B1 + · · · + B K , (5.1.8)
where An (1 ! n ! N) are the initial particles, and Bk (1 ! k ! K) are the final particles.
There are two forms of scatterings: the elastic and non-elastic scatterings, both of which
are caused by the three interactions: the electromagnetic, the weak and the strong interac-
tions. Collision is the most important experimental method to detect new particles and new
phenomena. The scatterings (5.1.8) satisfy various conservation relations.
Here are a few important scatterings in the history of quantum physics:
Compton scattering: γ + e − → γ + e− ,
Pair annihilation: e + + e− → γ + γ ,
Pair creation: γ + γ → e + + e− ,
Moller scattering: e − + e− → e− + e− ,
Bhahba scattering: e − + e+ → e− + e+ .
Deep inelastic scattering : e − + p → e− + p + π 0 .
3. Radiations. Radiations include the electromagnetic radiation and the gluon radiation.
The first one is that electrons emit photons:
e− → e − + γ , (5.1.9)
q → q + gk , (5.1.10)
1) As an electron changes its velocity, it emits photons. This radiation is called the
bremsstrahlung. The radiation energy in unit time can be expressed by the formula:
2 e2 2
W= a with a being the average acceleration.
3 c3
In summary, there are three types of particle transitions: decays, scatterings, and ra-
diations. They are the main dynamic behavior for micro-particles, and reveal the interior
5.1 Basic Knowledge of Particle Physics 273
structure of particles, and provide crucial information about the three interactions associated
with the electromagnetic, the weak, and the strong forces.
Here we list some principal decay forms:
• Lepton decays:
µ − → e− + ν e + ν µ ,
µ + → e+ + νe + ν µ ,
τ − → e− + ν e + ντ ,
τ − → µ − + ν µ + ντ , (5.1.11)
τ → π + ντ ,
− −
τ − → ρ − + ντ ,
τ − → K − + ντ .
• Quark decays:
d → u + e− + ν e ,
s → u + e− + ν e ,
(5.1.12)
s → d + γ,
c → d + s + u.
• Mediator decays:
W + → e+ + νe , µ + + νµ , τ + + ντ ,
W − → e− + ν e , µ− + νµ, τ− + ντ , (5.1.13)
Z → e+ + e− , µ + + µ −, τ + + τ − .
• Baryon decays:
n → p + e− + ν e ,
Λ → p + π −, n + π 0,
Σ+ → p + π 0, n + π +,
Σ− → n + π −,
(5.1.14)
Σ0 → Λ + γ ,
Ξ− → Λ + π − ,
∆++ → p + π +,
∆+ → p + π 0 ,
274 Chapter 5 Elementary Particles
∆− → n + π − ,
∆0 → n + π 0 ,
Σ∗± → Σ± + π 0,
Σ∗0 → Σ0 + π 0,
Ξ∗− → Ξ− + π 0,
Ξ∗0 → Ξ0 + π 0.
• Meson decays:
π + → µ + + νµ ,
π− → µ− + νµ ,
π 0 → 2γ ,
K + → µ + + νµ , π + + π 0, π + + π + + π −,
K− → µ − + ν µ , π − + π 0, π − + π − + π +,
K 0 → π + + e− + ν e , π + + π −, π + + π − + π 0,
(5.1.15)
η → 2γ , π + + π − + π 0,
ρ ± → π ± + π 0,
ρ 0 → π + + π −,
K ∗± → K ± + π 0,
K ∗0 → K 0 + π 0,
ω → π0 + γ, π + + π − + π 0.
A1 + · · · + A N → B1 + · · · + B K . (5.1.16)
They are driven by the three fundamental forces: the electromagnetic, the weak, and the
strong interactions. Particle transitions have to obey certain basic conservation laws in
the sense that for certain conservative quantum number q, its total values on both sides
of (5.1.16) are the same:
qA = qB . (5.1.17)
In addition, some conservation laws may not be valid for all interactions. For example,
parity is valid only in the strong and the electromagnetic interactions, and is violated in the
weak interaction.
5.1 Basic Knowledge of Particle Physics 275
Pk = PkA1 + PkA2 , 1 ! k ! 3,
Angular momentum conservation law is also a universal conservation law, and for (5.1.16),
N K
∑ J⃗An = ∑ J⃗Bk .
n=1 k=1
4. Other universal conservation laws. In particle physics, the following quantum num-
bers are additive and conservative in all interactions: the electric charge Q e , the lepton
numbers Le , Lµ , Lτ , and the baryon number B.
Table 5.1 lists the conservation or non-conservation properties of quantum numbers for
the three interactions.
Remark 5.2 CP combines charge conjugation C and parity P, and CPT combines CP
and time reversal T .
276 Chapter 5 Elementary Particles
Here, we list the basic data for leptons, quarks, and hadrons in Tables 5.2-5.5. The units are:
mass in MeV/c2 , lifetime in seconds, and electric charge in the unit of proton charge.
Note that the quantum number Qw in Table 5.2 represents the weak charge number. Here
the values of Qw are based on the weakton model in Section 5.3.
! "
1
Table 5.2 Leptons spin J =
2
Leptons m Qe Qw Le Lµ Lτ τ
e− 0.51 −1 3 1 0 0 ∞
e+ 0.51 1 3 −1 0 0
µ− 105.7 −1 3 0 1 0 2.2 × 10−6
µ+ 105.7 1 3 0 −1 0
τ− 1777 −1 3 0 0 1 2.9 × 10−13
τ+ 1777 1 3 0 0 −1
ν+ 0 0 1 1 0 0 ∞
νe 0 0 1 −1 0 0 ∞
νµ 0 0 1 0 1 0 ∞
νµ 0 0 1 0 −1 0 ∞
ντ 0 0 1 0 0 1 ∞
ντ 0 0 1 0 0 −1 ∞
5.2 Quark Model 277
1
Also, all neutrinos possess left-hand spin with J = − , and antineutrinos possess right-
2
1
hand spin with J = . Namely, neutrinos move at the speed of light.
2
In addition, the data of W ± , Z, H 0 are given as follows.
them into groups based on certain common properties. The Eightfold Way can be regarded
as a successful classification for hadrons. In fact, the Eightfold Way, together with the
irreducible representations of SU(3), has played a navigation role for the introduction of the
quark model.
The Eightfold Way was introduced by Gell-Mann and Ne’eman independently in 1961.
This scheme arranges the baryons and mesons into certain geometric patterns, according
to their charge, strangeness, hypercharge and isospin I3 . These geometric patterns include
hexagons and triangles, and all particles put in each diagram are considered as a class.
The first group of hadrons consists of the eight lightest baryons:
n, p, Σ− , Σ0 , Σ+ , Λ, Ξ− , Ξ0 , (5.2.1)
which fit into a hexagonal array with two particles Σ0 and Λ at the center; see Figure 5.2.
The eight baryons (5.2.1) with J = 1/2 are known as the baryon octet. Note that particles
in Figure 5.2 range with charges lie along the downward-sloping diagonal line: Q = 1 for p
and Σ+ , Q = 0 for n, Λ, Σ0 and Ξ0 , and Q = −1 for Σ− and Ξ− . Horizontal lines associate
particles of the same strangeness S = 0 and hyper-charge Y = 1 for the neutron and proton,
S = −1 and Y = 0 for Σ− , Σ0 , Λ and Σ+ , S = −2 and Y = −1 for Ξ− and Ξ0 . The third
component I3 of isospin indicates particles in column.
5.2 Quark Model 279
In the same fashion as the baryon octet (5.2.1), this group (5.2.2) fits into a hexagonal array
with π 0 and η at the center; see Figure 5.3.
Figure 5.3
280 Chapter 5 Elementary Particles
Figure 5.4
Figure 5.5
5.2 Quark Model 281
Remark 5.3 The Eightfold Way based on the patterns given by Figures 5.2-5.5 pro-
vides a beautiful classification for hadrons. However, the most important point is that the
Eightfold way and the irreducible representation of SU(N) also provide key clues to dis-
cover the quark model for the hadron structure.
H : G → M, (5.2.5)
If all block matrices Hk (G) (1 ! k ! n) in (5.2.8) cannot be split into smaller pieces any-
more, then the direct sum of all sub-representations Hk (G) as
H1 (G) ⊕ · · · ⊕ Hn (G)
282 Chapter 5 Elementary Particles
g : CN → CN . (5.2.10)
It is known that for each linear operator g ∈ G as defined in (5.2.10), there is a unique matrix
U ∈ SU(N) such that
g(ψ ) = U ψ , ∀ψ ∈ CN . (5.2.11)
Hence, relation (5.2.11) provides a correspondence
H : G → SU(N), (5.2.12)
SU(N) = H(G)
{e1 , · · · , eN } ⊂ CN
N
Then the conjugate space C of CN can be written as
. / 0
N /
N
C = ∑ βk ek //βk ∈ C .
k=1
U : CN → CN , (5.2.14)
and its physical significance, which will be discussed in more detail in the next two subsec-
tions. Here we just give a brief introduction to the irreducible representation of (5.2.20).
A representation of (5.2.20) is a mapping defined as
N · · ⊗ N4 ⊗ N
1 ⊗ ·23 · · ⊗ N4 = m1 ⊕ · · · ⊕ mK ,
1 ⊗ ·23 (5.2.24)
k1 k2
where m j is the order of H j (U). In Subsection 5.2.4 we shall give the computational method
of (5.2.23) (or (5.2.24)) by the Young tableau.
5.2 Quark Model 285
ψ1 , · · · , ψ N , (5.2.25)
2. Each matrix U ∈ SU(N) and each U ∈ SU(N) represent the transformations of phys-
ical states of particles (5.2.25) and antiparticles (5.2.26) as follows
N N
∑ zk ψk → ∑ 5zk ψk ,
j=1 j=1
(5.2.27)
N N
∑ yk ψ k → ∑ y5k ψ k ,
j=1 j=1
where ⎛ ⎞ ⎛ ⎞ ⎛ ⎞ ⎛ ⎞
z1
5 z1 y51 y1
⎜ .. ⎟ ⎜ .. ⎟ , ⎜ .. ⎟ ⎜ .. ⎟ .
⎝ . ⎠=U⎝ . ⎠ ⎝ . ⎠=U⎝ . ⎠ (5.2.28)
zN
5 zN y5N yN
3. The tensor product of fundamental particles (5.2.25) and (5.2.26) are denoted by
which represents state transformation of composite particles, similar to the state transfor-
mation (5.2.27)-(5.2.28) for a single particle system.
6. We recall the irreducible representation of SU(N) given by (5.2.23) and (5.2.24).
The irreducible representation implies that there is a decomposition in the N k composite
particles (5.2.29), which are classified into K groups as
j j
where each group G j contain m j composite particles Ψ1 , · · · , Ψm j , as in (5.2.30), such that
the space (5.2.31) can be decomposed into the direct sum of the K subspaces of span G j as
k N k2
CN 1 ⊗ C = E1 ⊕ · · · ⊕ EK , (5.2.34)
where
E j = span G j = span{Ψ1j , · · · , Ψmj j }, 1 ! j ! K, (5.2.35)
and then, the linear transformations H(U) in (5.2.32) are also decomposed into the direct
sum for all U ∈ SU(N) as in (5.2.23). Namely, under the decomposition (5.2.33) of basis of
k N k2
CN 1 ⊗ C , the representations
In other words, the subspace E j of (5.2.35) is invariant for the linear transformation (5.2.36)-
(5.2.37).
7. Sakata’s explanation of irreducible representation of SU(N). Now, we can deduce
the following physical conclusions from the discussions in above steps 1-6.
Physical Explanation 5.5 Let (5.2.29) be a family of composite particles as given by
(5.2.30). The irreducible representation (5.2.36), which usually is expressed as
N ⊗ · · · ⊗ N ⊗ N ⊗ · · · ⊗ N = m1 ⊕ · · · ⊕ m K ,
1 23 4 1 23 4
k1 k2
means that
• the composite particle system (5.2.29) can be classified into K groups of particles:
j
G j = {Ψ1 , · · · , Ψmj j }, 1 ! j ! K;
• each group G j has m j particles, such that under the state transformation (5.2.27)-
(5.2.28) of fundamental particles:
N N
U : CN → CN , U : C →C (U ∈ SU(N)),
We now examine the Physical Explanation 5.5 from the mathematical viewpoint. The
k N k2
Nk (k = k1 + k2 ) elements of (5.2.29)-(5.2.30) form a basis of CN 1 ⊗ C . We denote these
elements as
E = {Ψ1 , · · · , ΨN k }, (5.2.38)
with each Ψ j as in (5.2.30). The irreducible representation (5.2.23):
H(U) : X → X (X as in (5.2.22))
X j = span G j (1 ! j ! K)
is an invariant subspace of the mapping H(U) for all U ∈ SU(N). In particular, the block
matrix H j (U) is the restriction of H(U) on X j (1 ! j ! K):
U : CN → CN , U ∈ SU(N),
then the subspaces X j will undergo themselves a linear transformation in the fashion as
given by (5.2.40).
Thus, from the mathematical viewpoint, the Physical Explanation 5.5 is sound, and
provides the mathematical foundation for the quark model.
5.2 Quark Model 289
N · · ⊗ N4 ⊗ N
1 ⊗ ·23 · · ⊗ N4 = m1 ⊕ · · · ⊕ mK .
1 ⊗ ·23 (5.2.41)
k1 k2
A very effective method to compute (5.2.41) is the Young tableaux, which uses square
diagrams to deduce these numbers m j (1 ! j ! K). The method is divided in two steps.
The first step is to obtain the rule to group together square diagrams, and to obtain the
irreducible representation (5.2.41) in the form
N · · ⊗ N4 ⊗ N
1 ⊗ ·23 · · ⊗ N4 = 1 ⊕ 2 ⊕ · · · ⊕ K .
1 ⊗ ·23 (5.2.42)
k1 k2
The second step provides the rule and method to compute the dimensional m j from the j-th
square diagrams j on the right-hand side of (5.2.42):
1. Rule to group the square diagram in (5.2.42). First of all, in the Young tableau we
use a square to stand for a fundamental representation N of SU(N), and use a column of
N − 1 squares to stand for the conjugate representation N in the right-hand side of (5.2.42):
⎫
⎬
N= , N= N − 1.
⎭
3⊗3⊗3= ⊗ ⊗ .
N · · ⊗ N4 ⊗ N
1 ⊗ ·23 · · ⊗ N4 = ⊗ · · · ⊗ ⊗
1 ⊗ ·23 ⊗ ···⊗ (5.2.44)
1 23 4
k1 k2 k1
1 23 4
k2
Now, we give rules to group the square diagrams of the right-hand side of (5.2.42) for
different k1 and k2 .
290 Chapter 5 Elementary Particles
N ⊗N = ⊗ .
⊗ α = α ⊕ = ⊕ . (5.2.45)
α
N ⊗N⊗N = ⊗ ⊗ (5.2.47)
; <
= ⊗ ⊗
! "
= ⊕ ⊗ (by (5.2.45))
= ⊗ ⊕ ⊗
β
β
= α ⊕ ⊕ ⊕ .
α
⎡ ⎫ ⎤
⎪ ⎫
⎪
⎪ ⎪
⎢ ⎬ ⎬ ⎥
⎢ ⎥
=⎢ N+ N − 1⎥ ⊗ α
⎣ ⎪
⎪ ⎪
⎭ ⎦
⎪
⎭
⎫
⎫ (5.2.48)
⎪
⎪
⎪
⎬ ⎪
⎬
= α ⊗ N⊕ β ⊗ N −1
⎪
⎪ ⎪
⎭
⎪
⎭
⎫ ⎫ ⎫ ⎫
α ⎪ ⎪
⎪ ⎪
⎪ β ⎪
⎪
⎪
⎪ ⎪
⎪ ⎪
⎪ ⎪
⎪
⎬ ⎬ β ⎬ ⎬
= N⊕ N⊕ N −1⊕ N − 1.
⎪
⎪ ⎪
⎪ ⎪
⎪ ⎪
⎪
⎪
⎭ ⎪
⎪ ⎪
⎪ ⎪
⎪
β ⎭ ⎭ ⎭
We know that the representations of SU(N) for particles and antiparticles are conjugate
relation, i.e. if
N ⊗ ···⊗ N ⊗N ⊗ ···⊗ N (5.2.49)
1 23 4 1 23 4
k1 k2
represents the antiparticle system. Due to the symmetry of particles and antiparticles,
(4.2.49) and (4.2.50) have the same irreducible representations. In addition each composite
particle is composed of N particles with N ! 3. Hence it is enough to give the four cases
1)-4) above for the physical purpose.
2. Computation of (5.2.43). In the right-hand side of the Young tableaux (4.2.45)-
(4.2.48), the number of square diagrams is the K as in (4.2.42), and each of which represents
a dimension m j (1 ! j ! K) of an irreducible representation. The rule to compute m j is as
follows.
For example, for the following square diagram
(5.2.51)
(5.2.55)
where the data in a square equals to the number of all squares on its right-hand side and
below it adding one. For example, for the square marked 6, there are 3 squares on its right-
hand side, and 2 square below it. Hence the number k in this blank is
k = 3 + 2 + 1 = 6.
βN = 6 × 4 × 3 × 1 × 4 × 2 × 1 × 1. (5.2.56)
Thus, by (4.2.54) and (4.2.56) we can get the value of (4.2.52). In the following, we
give a few examples to show how to use the Young tableau to compute the irreducible
representations of SU(N).
which are the most important cases in particle physics. By (5.2.46) and (5.2.47), the Young
tableaux of (5.2.57) are as follows
3⊗3 = ⊗ = ⊕ (5.2.58)
3⊗3⊗3= ⊗ ⊗ = ⊕ ⊕ ⊕ (5.2.59)
5.2 Quark Model 293
According to (5.2.52) and the methods to compute αN and βN (N = 3), we infer from
(5.2.58) and (5.2.59) that
αN = 3 × 2 × 1, βN = 3 × 2 × 1, m = 1, for
αN = 3 × 4 × 2, βN = 3 × 1 × 1, m = 8, for
αN = 3 × 4 × 5, βN = 3 × 2 × 1, m = 10, for
Consequently, we derive, from (5.2.58) and (5.2.59), the following irreducible representa-
tions of (5.2.57):
3 ⊗ 3 = 1 ⊕ 8,
(5.2.60)
3 ⊗ 3 ⊗ 3 = 1 ⊕ 8 ⊕ 8 ⊕ 10.
Example 5.7 The other important cases in physics are the two irreducible representa-
tions for N = 4:
4 ⊗ 4 and 4 ⊗ 4 ⊗ 4.
Their Young tableaux are given by
⊗ = ⊕
⊗ ⊗ = ⊕ ⊕ ⊕
m = 1, for
m = 15, for
m = 4, for
3 ⊗ 3 = 8 ⊕ 1, (5.2.62)
Sakata presented a model for hadron structure, called the Sakata model. This was an early
precursor to the quark model, and also resulted in the physical implications of irreducible
representations as stated by Physical Explanation 5.5.
Sakata model proposed three particles
p, n, Λ (5.2.63)
as the fundamental particles for all strong interacting particles. In his scheme, Sakata used
the three particles in (5.2.63) as a basis of SU(3), such that each hadron consists of a funda-
mental particle and an antiparticle as
It is a coincidence that the eight particles just constitute an eight multiple state of hadrons,
and can be illustrated by the Eightfold Way; see Figure 5.3 and Figure 5.6.
Figure 5.6
5.2 Quark Model 295
Table 5.6
Sakataons I I3 Q S B
p 1/2 +1/2 +1 0 +1
n 1/2 −1/2 0 0 +1
Λ 0 0 0 −1 +1
In addition, from the quantum numbers of the three particles(5.2.63), we can deduce the
quantum numbers for the eight mesons in(5.2.65); see Table 5.6 and 5.7.
In Table 5.7, J is the spin and π is the parity. Hence J π = 0− represents that spin J = 0
and parity π = −1.
The Sakata model explains the mesons well. However, it encounters difficulties if we
use this model to describe baryons. In fact, if we use the three sakataons to form a baryon,
then the baryon number B = 3 for a baryon. Hence, we have to use two sakataons and one
anti-sakataon to form a baryon to ensure B = 1. Unfortunately those combinations do not
agree with experiments.
Table 5.7
mesons I I3 Q S Jπ m(MeV )
pn π+ 1 1 1 0 0− ∼ 140
−pn π− 1 −1 −1 0 0− ∼ 140
nn − pp
√ π0 1 0 0 0 0− ∼ 140
2
1 1
pΛ K+ 1 1 0− ∼ 495
2 2
1 1
nΛ K0 − 0 1 0− ∼ 495
2 2
0 1 1
Λn K 0 −1 0− ∼ 495
2 2
− 1 1
Λp K − −1 −1 0− ∼ 495
2 2
nn + pp − 2ΛΛ
√ η 0 0 0 0 0− ∼ 548
6
nn + pp + ΛΛ
√ η′ 0 0 0 0 0− ∼ 548
3
3 ⊗ 3 = 8 ⊕ 1,
3 ⊗ 3 ⊗ 3 = 10 ⊕ 8 ⊕ 8 ⊕ 1,
mesons = 3 ⊗ 3 = 8 ⊕ 1,
(5.2.66)
baryons = 3 ⊗ 3 ⊗ 3 = 10 ⊕ 8 ⊕ 8 ⊕ 1.
According to the Physical Explanation 5.5, we infer immediately, from (5.2.66), that the
hadrons in the Eightfold Way are composed of three fundamental particles, denoted by
q1 , q2 , q3 (5.2.67)
mesons = qi q j for 1 ! i, j ! 3,
(5.2.68)
baryons = qi q j qk for 1 ! i, j, k ! 3.
Physicists Gell-Mann and Zweig did this work independently in 1964, and presented
the celebrated Quark Model. The three fundamental particles (5.2.67) were termed the up,
down and strange quarks by Gell-Mann, denoted by
q1 = u, q2 = d, q3 = s.
2) The success of (p, n, Λ) for describing mesons suggests that for the strange number
S and the isospin (I, I3 ), u and p, d and n, s and Λ should be the same respectively.
Hence we have
! "
1 1
u : (S, I, I3 ) = 0, , ,
2 2
! "
1 1
d : (S, I, I3 ) = 0, , − ,
2 2
s : (S, I, I3 ) = (−1, 0, 0).
3) Since the baryon numbers of all baryons are B = 1, by the constituents (5.2.68) of
baryons, it is natural that
1 1 1
u: B= , d: B= , s: B= .
3 3 3
4) For all hadrons, the following formula, well known as the Gell-Mann-Nishijima rela-
tion, holds true:
B S
Q = I3 + + . (5.2.70)
2 2
This relation should also be valid for quarks. Hence, we deduce from (5.2.70) that
the electric charges of quarks are
2 1 1
u: Q= , d: Q=− , S: Q=− .
3 3 3
The data derived in 1)-4) above are collected in Table 5.3. According to the quantum
numbers of hadrons and quarks, we can determine the quark constituents of all hadrons.
For example, for uud its quantum numbers are derived from those of u and d as follows
1 1 1
uud : B = 1, Q = 1, S = 0, J = , I = , I3 = , (5.2.71)
2 2 2
which dictate that uud is the proton:
uud = p.
The quark constituents of the main hadrons are listed in Subsection 5.1.1.
In 1974, Ting and Richter discovered independently J/ψ particle, which implies the
existence of a new quark, named as the c quark. Thus, the quark family then was extended
to four members:
u, d, s, c.
As fundamental particles of SU(4), the irreducible representations (5.2.61), written as
4 ⊗ 4 = 15 ⊕ 1,
(5.2.72)
4 ⊗ 4 ⊗ 4 = 20 ⊕ 20 ⊕ 20 ⊕ 4
298 Chapter 5 Elementary Particles
suggest that the meson groups with 8 particles should be extended to 15 particles, and the
baryons be extended to 20 particles in a group. The extended group of mesons with 15
particles has been verified, which is given by
0
π ± , π 0 , K ± , K 0 , K , η , η ′ , D0 , D0 , D± , D±
s .
However, the 20 baryons corresponding to the irreducible representation (5.2.72) have not
been discovered now.
Up to now, the quark family has six members:
u, d, s, c, b,t.
e− → e − + γ ,
µ − → e− + ν e + ν µ , (5.3.1)
τ → µ + ν µ + ντ .
− −
2) Quark decay:
d → u + e− + ν e ,
s → d + γ, (5.3.2)
c → d + s + u.
3) Mediator decay:
2 γ → e + + e− ,
W ± → l ± + ν l± , (5.3.3)
+ −
Z → l +l ,
5.3 Weakton Model of Elementary Particles 299
All leptons, quarks and mediators are currently regarded as elementary particles. How-
ever, the decays in (5.3.1)-(5.3.3) show that these particles must have an interior structure,
and consequently they should be considered as composite particles rather than elementary
particles:
V is the potential energy, α k (1 ! k ! 3) are the matrices as given by (2.2.48), and α 0 is the
matrix as ! " ! "
I 0 1 0
α0 = , I= .
0 −I 0 1
By the conservation laws in quantum mechanics, if an Hermitian operator L commutes with
H in (5.3.5):
LH = HL,
then the physical quantity L is conserved.
Consider the total angular momentum J⃗ of a particle as
J⃗ = ⃗L + s⃗S,
Figure 5.7 (a) Two particles A, B rotate around the center 0, amd (b) three particles A, B, C rotate
around the center 0.
Usually, we regard m as a static mass which is fixed, and energy is a function of velocity v.
Now, taking an opposite viewpoint, we regard energy E as fixed, mass m as a function
of velocity v, and the relation (5.3.8) is rewritten as
D
v2 E
m= 1− . (5.3.9)
c2 c2
Thus, (5.3.9) means that a particle with an intrinsic energy E has zero mass m = 0 if it
moves at the speed of light v = c, and will possess nonzero mass if it moves with a velocity
v < c.
All particles including photons can only travel at the speed sufficiently close to the speed
of light. Based on this viewpoint, we can think that if a particle moving at the speed of light
(approximately) is decelerated by an interaction force ⃗F, obeying
D
d ⃗P v2 ⃗
= 1− F,
dt c2
then this massless particle will generate mass at the instant. In particular, by this mass
generation mechanism, several massless particles can yield a massive particle if they are
bound in a small ball, and rotate at velocities less than the speed of light.
Interaction charges
In the unified field model introduced in the last chapter, we derived that both weak and
strong interactions possess charges, as for gravity and electromagnetism; see Section 4.3.4:
1) Electric charge Qe , weak charge Qw , strong charge Qs are conservative. The energy
is a conserved quantity, but the mass M is not a conserved quantity due to the mass
generation mechanism as mentioned in (5.3.9).
302 Chapter 5 Elementary Particles
2) There is no interacting force between two particles without common charges. For
example, if a particle A possesses no strong charge, then there is no strong interacting
force between A and any other particles.
3) Only the electric charge e can take both positive and negative values, and other
charges take only nonnegative values.
where ρw , ρq , ρg , ρn , ρa are the radii of weakton, quark, gluon, nucleon, atom/molecule, and
gs is the strong charge of w∗ -weakton. By (4.5.70), gs can be expressed as
! "6
1 ρn
g2s = β 2 g2 , (5.3.13)
9 ρw
√ −1/4
where β = 2e /(8 − e1/2)1/2 and g is the Yukawa charge.
5.3 Weakton Model of Elementary Particles 303
The weak interaction potential for a particle with radius ρ and with N weak charges g w
is given by
E F
1 B
Φw = gw (ρ )e−kr − (1 + 2kr)e−kr ,
r ρ
! "3 (5.3.14)
ρw
gw (ρ ) = N gw ,
ρ
1
where k = = 1016 cm−1 , and r0 = 10−16 cm.
r0
For the weak interaction, there are about four levels: weaktons, mediators, quarks and
charged leptons. By (5.3.14), the layered formulas for these four level particles are given by
E F
1 Bw
Φw0 = gw e −kr
− (1 + 2kr)e −kr
,
r ρw
! "3 E F
ρw 1 Bm
Φwm = 2 gw e−kr − (1 + 2kr)e−kr ,
ρm r ρm
! "3 E F (5.3.15)
w ρw −kr 1 Bq
Φq = 3 gw e − (1 + 2kr)e −kr
,
ρq r ρq
! "3 E F
w ρw −kr 1 Bl
Φl = 3 gw e − (1 + 2kr)e −kr
.
ρl r ρl
Duality of mediators
Based on the unified field theory, there exists a natural duality between the mediators:
We shall see that the prediction (5.3.17) are in perfect agreement with the consequences of
the weakton model.
304 Chapter 5 Elementary Particles
1) The interior structure of charged leptons, quarks and mediators demonstrated by the
decays, scatterings and radiations, as as shown in (5.3.1)-(5.3.3);
2) The new quantum numbers of weak charge gw and strong charge gs introduced in
(5.3.10);
4) The weakton confinement theory given by the layered formulas of weak interaction
potentials (5.3.15)-(5.3.16); and
The weaktons consist of the following 6 elementary particles and their antiparticles:
w∗ , w1 , w2 , νe , νµ , ντ ,
(5.3.18)
w , w1 , w2 , ν e , ν µ , ν τ ,
∗
where νe , νµ , ντ are the three flavor neutrinos, and w∗ , w1 , w2 are three new elementary
particles, which we call w-weaktons.
These weaktons in (5.3.18) are endowed with the quantum numbers: electric charge Q e ,
weak charge Qw , strong charge Qs , weak color charge Qc , baryon number B, lepton numbers
Le , Lµ , Lτ , spin J, and mass m. The quantum numbers of weaktons are listed in Table 5.8.
Remark 5.9 For the weaktons and antiweaktons, the quantum numbers Q e , Qc ,
B, Le , Lµ , Lτ have opposite signs, and Qw , Qs , m have the same values. The neutrinos νe , νµ , ντ
5.3 Weakton Model of Elementary Particles 305
1
possess left-hand helicity with spin J = − , and the antineutrinos ν e , ν µ , ν τ possess right-
2
1
hand helicity with spin J = .
2
Remark 5.10 The weak color charge Qc is a new quantum number introduced for the
weaktons only, which will be used to rule out some unrealistic combinations of weaktons.
Remark 5.11 Since the fundamental composite particles as quarks only contain one
w∗ -weakton, there is no strong interaction between the constituent weaktons of a composite
particle, except the gluons. Therefore, for the weaktons (5.3.18), there is no need to intro-
duce the classical strong interaction quantum numbers as strange number S, isospin (I, I3 )
and parity π etc.
Remark 5.12 It is known that the quark model is based on the irreducible representa-
tions of SU(3) as
meson = 3 ⊗ 3 = 8 ⊕ 1,
baryon = 3 ⊗ 3 ⊗ 3 = 10 ⊕ 8 ⊕ 8 ⊕ 1.
However, the weakton model is based on the aforementioned theories and observational
facts 1)-5), different from the quark model.
and
d = w∗ w1 w2 (#↓, $↑),
s = w∗ w1 w2 (↑$, ↓#), (5.3.21)
∗
b = w w1 w2 (↑↓↑, ↓↑↓).
These arrangements (5.3.20) and (5.3.21) are speculative, and the true results will have to
determined by experiments.
which not only yields new mediators with spin J = 0, but also gives the same conclusions
as in (5.3.17).
Thus, based on (5.3.23), the weakton model also leads to the dual mediators as follows:
3. The ν -mediator. By the weak interaction potential formula Φwm in (5.3.15), the neu-
trino pairs
νe ν e , νµ ν µ , ντ ν τ (↓↑) (5.3.26)
should be bounded by the weak interacting force to form a new mediator, although they
have not been discovered. The three pairs in (5.3.26) may be indistinguishable. Hence, they
will be regarded as a particle, i.e. their linear combination
ν = ∑ αl νl ν l (↓↑), ∑ αl = 1, (5.3.27)
l l
Since the weaktons are assumed to be massless and no free w-weaktons are found, we have
to explain: i) the w-weakton confinement, and ii) the mass generation mechanism for the
massive composite particles, including the charged leptons e, µ , τ , the quarks u, d, s, c,t, b,
and the bosons W ± , Z, H ± , H 0 .
1. Weakton confinement. The weak interaction potentials (5.3.15) and the weak charge
formula (5.3.16) can help us to understand why no free w∗ , w1 , w2 are found, while single
neutrinos νe , νµ , ντ can be detected.
In fact, by (5.3.15) the weak interaction potential reads
E F
−kr 1 B
Φw = gs e − (1 + 2kr)e −kr
,
r ρw
The bound energy to hold particles together is negative. Hence for weaktons, their weak
interaction bound energy E is the negative part of gs Φw , i.e.
B 2
E =− g (1 + 2kr)e−2kr . (5.3.28)
ρw s
308 Chapter 5 Elementary Particles
By (5.3.16),
! "6
ρn
g2w = 0.63 × !c. (5.3.29)
ρw
It is estimated that
ρn
= 104 ∼ 106 .
ρw
Therefore, the bound energy E given by (5.3.28) and (5.3.29) is very large provided the
weak interaction constant B > 0.
Thus, by the sufficiently large bound energy, the weaktons can form triplets confined in
the interior of charged leptons and quarks as (5.3.19), and doublets confined in mediators
as (5.3.24)-(5.3.26). They cannot be opened unless the exchange of weaktons between the
composite particles.
The free neutrinos νe , νµ , ντ and antineutrinos ν e , ν µ , ν τ can be found in Nature. The
reason is that in the weakton exchange process there appear pairs of different types of neu-
trinos such as νe and ν µ , and between which the governing weak force is repelling because
the constant B in (5.3.28) is non-positive, i.e.
2. Massless mediators. For the mass problem, we know that the mediators:
have no masses. To explain this, we note that these particles in (5.3.31) consist of pairs as
w1 w1 , w2 w2 , w∗ w∗ , νl ν l . (5.3.32)
The weakton pairs in (5.3.32) are bound in a circle with radius R 0 as shown in Figure 5.8.
Since the interacting force on each weakton pair is in the direction of their connecting line,
they rotate around the center 0 without resistance. As ⃗F = 0 in the moving direction, by the
relativistic motion law: D
d⃗ v2
P = 1 − 2 ⃗F, (5.3.33)
dt c
Figure 5.8
5.3 Weakton Model of Elementary Particles 309
the massless weaktons rotate at the speed of light. Hence, the composite particles formed
by the weakton pairs in (5.3.32) have no rest mass.
3. Mass generation. For the massive particles
e, µ , τ , u, d, s, c, t, b, (5.3.34)
by (5.3.19), they are made up of weakton triplets with different electric charges. Hence the
weakton triplets are not arranged in an equilateral triangle as shown in Figure 5.7 (b), and in
fact are arranged in an irregular triangle as shown in Figure 5.9. Consequently, the weakton
triplets rotate with nonzero interacting forces ⃗F ̸= 0 from the weak and electromagnetic
interactions. By (5.3.33), the weaktons in the triplets at a speed less than the speed of light
due to the resistance force. Thus, by the mass generating mechanism introduced in Section
5.3.2, the weaktons become massive. Hence, the particles in (5.3.34) are massive.
Figure 5.9
4. Massive mediators. Finally, we need to explain the masses for the massive mediators:
W ± , Z, H ± , H 0 , (5.3.35)
w1 w2 , w1 w2 , w1 w1 , w2 w2 . (5.3.36)
Actually, in the weakton exchange theory in the next section, we can see that the particles
in (5.3.36) are in some transition states in the weakton exchange procedure. At the moment
of exchange, the weaktons in (5.3.36) are at a speed v with v < c. Hence, the particles in
(5.3.35) are massive, and their life-times are very short (τ ≃ 10 −25s).
In the last subsection, we solved the free weakton problem and the mass problem. In
this subsection, we propose a few rules to resolve the remainning problems.
1. Weak color neutral rule: All composite particles by weaktons must be weak color
neutral.
Based on this rule, many combinations of weaktons are ruled out. For example, it is
clear that there are no particles corresponding to the following www and ww combinations,
because they all violate the weak color neutral rule:
νe w1 w2 , w∗ w2 w2 , w∗ w1 w2 , etc., νe w1 , w∗ w1 , w∗ w2 etc.
w∗ νi , νi ν j , νi ν k (i ̸= k),
are not observed in Nature, and are ruled out by this rule.
3. L + Qe = 0, |B + Qe | ! 1 for L, B ̸= 0.
νi w1 w1 , νi w2 w2 , ν i w1 w2 , w∗ w∗ etc (5.3.37)
and as
ν w1 w2 (↑↑↑, ↓↓↓). (5.3.39)
The cases (5.3.38) are excluded by the Angular Momentum Rule 5.8. The reasons for
this exclusion are two-fold. First, the composite particles in (5.3.38) carry one strong charge,
and consequently, will be confined in a small ball by the strong interaction potential as the
quarks confined in hadrons, as shown in Figure 5.7 (b). Second, due to the uncertainty
principle, the bounding particles will rotate, at high speed with almost zero moment of
1
force, which must be excluded for composite particles with J ̸= based on the angular
2
momentum rule.
5.3 Weakton Model of Elementary Particles 311
The exclusion for (5.3.39) is based on the observation that by the left-hand helicity of
1 1
neutrinos with spin J = − , one of w1 and w2 must be in the state with J = + to combine
2 2
with ν j (1 ! j ! 3), i.e. in the manner as
In summary, under the above rules 1-4, only the weakton constituents in (5.3.19) and
(5.3.24)-(5.3.26) are allowed.
5. Eight quantum states of gluons. It is known that the gluons have eight quantum states
gk : g1 , · · · , g8 .
In (5.3.24) and (5.3.25), the vector and scalar gluons have the forms
bb = rr = gg = w ( white).
which form the bases of SU(3) and SU(3). By the irreducible representation:
3 × 3 = 8 ⊕ 1,
where (w∗ w∗ )w is a linear combination of w∗b w∗b , w∗r w∗r , w∗g w∗g . Namely, the gluons in (5.3.41)
are the antigluons of these in (5.3.40).
In summary, the quantum rules presented above can be simply expressed in the follow-
ing formulas for quantum numbers Qc , Qe , L and B:
weak color neutral: Qc = 0,
mutual exclusion of leptons and baryons: BL = 0, Li L j = 0 (i ̸= j),
(5.3.42)
relation of leptons and charges: L + Qe = 0 (L ̸= 0),
relation of baryons and charges: |B + Qe| ! 1 (B ̸= 0).
312 Chapter 5 Elementary Particles
These relations are summed up from realistic particle data. Based on (5.3.42), together with
the mass generation mechanism, the Angular Momentum Rule 5.8 and the layered formulas
of weak and strong interaction potentials, all of the most basic problems in the weakton
model have a reasonable explanation.
We conclude that all particle decays are caused by weakton exchange. The exchanges occur
between composite particles as mediators, charged leptons, and quarks.
1. Weakton exchange in mediators. First we consider one of the most important decay
processes in particle physics, the electron-positron pair creation and annihilation:
2 γ → e + + e− ,
(5.4.1)
e+ + e − → 2 γ .
In fact, the reaction formulas in (5.4.1) are not complete, and the correct formulas should
be as follows
2 γ + ν % e + + e− , (5.4.2)
γ = cos θw w1 w1 − sin θw w2 w2 ,
ν = α1 νe ν e + α2 νµ ν µ + α3 ντ ν τ ,
which imply that the probability of the photon γ at the state w1 w1 is cos2 θw and at the state
−w1 w2 is sin2 θw , and the probability of ν at νi ν i is αi2 . Namely, for photon, the densities
of the w1 w1 (#) and −w2 w2 ($) particle states are cos2 θw and sin2 θw , and the densities of
the states νe ν e , νµ ν µ , ντ ν τ are α12 , α22 , α32 . Hence the formula (5.4.2) can be written as
It is then clear to see from (5.4.3) that the weakton constituents w1 , w1 , w2 , w2 , νe , ν e can
regroup due to the weak interaction, and we call this process weakton exchange. The mech-
anism of this exchanging process can be explained using the layered formulas in (5.3.15) of
weak interaction potentials.
The layered potential formulas in (5.3.15) indicate that each composite particle has an
exchange radius R, which satisfies
ρ < R < ρ1 , (5.4.4)
5.4 Mechanisms of Subatomic Decays and Electron Radiations 313
where ρ is the radius of this particle and ρ1 is its repelling radius of weak force. If two
composite particles A and B are in a distance less than their common exchange radius,
then the weaktons in A and B are attracting to each other by their weak interacting forces
generated by Φw0 in (5.3.15), given by
E F
w −kr 1 Bw
Φ0 = gw e − (1 + 2kr)e −kr
. (5.4.5)
r ρw
There is a probability for these weaktons in A and B to recombine and form new particles.
Then, after the new particles have been formed, in the exchange radius R, the weak interact-
ing force between them is governed by the potentials of the new particles, and is repelling,
driving the newly formed particles apart.
For example, in Figure 5.10 we can clearly see how the weaktons in (5.4.3) undergo the
exchange process. When the randomly moving photons and ν -mediators, i.e. w 1 w1 , w2 w2
and νe ν e come into their exchange balls, they recombine to form an electron ν e w1 w2 and
a positron ν e w1 w2 under the weak interaction attracting forces generated by the weakton
potentials (5.4.5). Then, the weak interacting force between e − = νe w1 w2 and e+ = ν e w1 w2
governed by Φwl in (5.3.15) is repelling, and pushes them apart, leading to the decay process
(5.4.2)
We remark here that in this range the weak repelling force between e + and e− is much
stronger than the Coulomb attracting force. In fact, by (5.3.29), g 2w ∼ 1030 h̄c and the electric
1
charge square e2 = h̄c. Hence the weak repelling force in Figure 5.10 is (3g s )2 /r2 , much
137
stronger than e2 /r2 .
Figure 5.10
2. Weakton exchanges between leptons and mediators. The µ -decay reaction formula
is given by
µ − → e− + ν e + ν µ . (5.4.6)
314 Chapter 5 Elementary Particles
µ − + ν → e− + ν e + ν µ ,
νµ w1 w2 + νe ν e → νe w1 w2 + ν e + νµ . (5.4.7)
By the exclusive rule Le Lµ = 0, the two particles ν µ and ν e cannot be combined to form a
particle. Hence, ν e and νµ appear as independent particles, leading to the exchange of ν µ
and νe as in (5.4.7).
3. Weakton exchanges between quarks and mediators. The d-quark decay in (5.3.2) is
written as
d → u + e− + ν e . (5.4.8)
d + γ + ν → u + e− + ν e ,
w∗ w1 w2 + w1 w1 + νe ν e → w∗ w1 w1 + νe w1 w2 + ν e . (5.4.9)
In (5.4.9), the weakton pair w2 and w1 is exchanged, and νe is captured by the new doublet
w1 w2 , which is the vector boson W − , to form an electron νe w1 w2 .
A1 + · · · + A N → B1 + · · · + B K , (5.4.10)
the number of each weakton type is invariant. Namely, for any type of weakton w,
5 its
number is conserved in (5.4.10):
NwA! = NwB! ,
5.4 Mechanisms of Subatomic Decays and Electron Radiations 315
e− " µ − + ν µ + νe ,
e− " τ − + ν τ + νe , (5.4.11)
µ " τ + ν τ + νµ .
− −
νe w1 w2 + νµ ν µ → νµ w1 w2 + ν µ + νe ,
νe w1 w2 + ντ ν τ → ντ w1 w2 + ν τ + νe , (5.4.12)
νµ w1 w2 + ντ ν τ → ντ w1 w2 + ν τ + νµ .
From the viewpoints of quantum rules given by (5.3.42) and weakton number conservation,
the decays in (5.4.12) are allowed. However, due to the mass relations
me < m µ < m τ ,
these weakton exchange processes (5.4.12) violate the energy conservation. Therefore these
decays cannot occur. However, if ν -mediators have high energy to hit electrons, then the
reactions (5.4.12) may occur.
In a weakton exchange process, the energy conservation law can be explicitly expressed
as follows:
Energy Rule 5.13 The composite particles with the lower masses cannot undergo
weakton exchanges with massless mediators in lower energy to decay into the composite
particles with higher masses.
316 Chapter 5 Elementary Particles
We remark that the Energy Rule 5.13 is sharper than the energy conservation law. In
fact, the reaction formulas from lower masses to higher masses do not imply that the energy
conservation must be violated. Hence, it is possible that there exist more basic unknown
physical laws under the Energy Rule 5.13.
4. Other conservative quantum numbers. From the invariance of weakton numbers, we
deduce immediately the following conserved quantum numbers:
With the weakton model, all decays are carried out by exchanging weaktons. Hence
decay types can be fully classified into three types: the weak type, the strong type, and
the mixed type, based on the type of forces acting on the final particles after the weakton
exchange process.
For example, the reactions
ν µ + e− → µ − + νe ,
n → p + e− + ν e , (5.4.13)
0
π → γ + γ.
Λ → p+ + π − ( i.e. Λ + gk + 3γ → p+ + π − + γ ) (5.4.15)
is a mixed decay.
In view of (5.4.13)-(5.4.15), the final particles contain at most one hadron in a weak
decay, contain no leptons and no mediators in a strong decay, and contain at least two
hadrons and a lepton or a mediator in a mixed decay.
5.4 Mechanisms of Subatomic Decays and Electron Radiations 317
In summary, we give the criteria for decay types based on the particles appearing in the
final step of the decay process:
weak decay: at most one hadron,
strong decay: no leptons and no mediators, (5.4.16)
mixed decay: others.
Remark 5.14 The new classification is easy to understand, and the criterion (5.4.16)
provides a convenient method for us to distinguish the different types of decays and scat-
terings. In fact, this classification truly reflects the roles of interactions in particle reaction
processes.
Figure 5.11
318 Chapter 5 Elementary Particles
Replacing the Feynman diagram, we describe the scattering (5.4.18) by using Figure
5.12. It is clear that the scattering (5.4.18) is achieved by exchanging weaktons ν µ and νe .
n → p + e− + ν e . (5.4.19)
n = udd, p = uud,
d → u + e− + ν e ,
gk + γ0 + γ → u + u,
gk0 + γ0 + γ0 → d + d.
5.4 Mechanisms of Subatomic Decays and Electron Radiations 319
In (5.4.21), w∗ and w∗ in a gluon are captured by a scalar photon γ0 and a photon γ to create
a pair u and u. In (5.4.21), w1 and w2 in the two scalar photons γ0 are exchanged to form
a pair of charged Higgs H + and H − , then H + and H − capture w∗ and w∗ respectively to
create a pair of d and d.
4. Lepton decays. The lepton decays
µ − + ν → e− + ν e + ν µ ,
τ − + ν → µ − + ν µ + ντ ,
∆++ → p + π +.
The exchange mechanism of (5.4.26) was discussed in (5.4.22), which is a weak interaction,
and the quark exchange (5.4.27) is both weak and strong interaction types. But, the final
particles p and π + are driven apart by the strong hadron repelling force.
After the weakton exchange to yield d-quark pair in (5.4.26), the mechanism of the
quark exchange and decay in (5.4.27) can be interpreted as follows:
1) When the quark pair dd is formed in the exchange radius R of ∆ ++ , the strong and
weak interactions between the quarks d, d and u, u, u in ∆++ are governed by Φsq in
(5.3.12) and Φwq in (5.3.15), which are attracting and recombine these quarks to form
two new hadrons p and π + , as shown in Figure 5.13(a); and
2) the two newly formed hadrons p and π + are controlled by the strong interaction
potential Φn for hadrons, which is repulsive in the exchange radius R and pushes
them apart; see Figure 5.13(b).
Figure 5.13
2. D0 -decay. Let us discuss the D0 -decay, which is considered as the weak interacting
type in the classical theory, because it violates the strange number conservation. But in our
classification it belongs to strong type of interactions. The D0 -decay is written as
D0 → K − + π + .
c + gk + 2γ → s + u + d,
and
w1 w2 (W − ) + w1 w1 (γ ) + (w∗ w∗ )(gk ) → w∗ w1 w1 (u) + w∗ w1 w2 (d) (5.4.30)
It is clear that both exchanges (5.4.29) and (5.4.30) belong to weak interactions. However,
the final particles in the D0 -decay are K − and π + , which are separated by the strong hadron
repelling force. The principle for the quark exchange and hadron separation in the D 0 -decay
process is the same as shown in Figure 5.13.
Remark 5.15 In the mechanism of subatomic particle decays and scatterings, the
layered properties of the weak and strong interacting forces play a crucial role. Namely the
attracting and repelling radii of weak and strong forces for a particle depend on its radius ρ
and interaction constants A and B or equivalently the parameters
A B
in (5.3.11) and in (5.3.14). (5.4.31)
ρ ρ
These parameters determine the attracting and repelling regions of the particle. Hence, in the
exchange radius of particles, the weak and strong forces are attracting between weaktons
and quarks, and are repelling between the final particles in decays and scatterings. The
reason is that the parameters in (5.4.31) are different at various levels of subatomic particles.
Mixed decays
Λ → p + π −.
Λ + gk + 2γ + γ0 → p + π − + γ0 . (5.4.32)
322 Chapter 5 Elementary Particles
gk + γ + γ0 → u + u, (5.4.33)
s + γ → d + γ0 (i.e. uds + γ → udd + γ0 ), (5.4.34)
udd(n) + uu → uud(p) + ud(π ). −
(5.4.35)
The process (5.4.33) was described by (5.4.21), the quark exchange process (5.4.35) is clear,
and (5.4.34) is the conversion from s quark to d quark, described by
The weakton constituents of an electron are νe w1 w2 , which rotate as shown in Figure 5.9.
Noting the electric charges and weak charges of νe , w1 , w2 are given by
1 2
electric charge: Qνe = 0, Qwe 1 = − , Qwe 2 = − ,
3 3
weak charge: Qνw = 1, Qww1 = 1, Qww2 = 1,
In addition, the weak force formula between the naked electron and massless mediators
γ , γ0 , gk , gk0 , ν , is given by
G H
∂ 1 −kr B5
Fw = − gw (ρm )gw (ρe ) e − (1 + 2kr)e−2kr , (5.4.37)
∂r r ρ5
G H
−kr 1 1 4B5 r −kr
=gw (ρm )gw (ρe )e + − e ,
r2 rr0 ρ5 r02
where k = 1/r0 = 1016 cm−1 , gw (ρm ) and gw (ρe ) are the weak charges of mediators and the
naked electron, expressed as
! "3 ! "3
ρw ρw
gw (ρm ) = 2 gw , gw (ρe ) = 3 gw ,
ρm ρe
5 ρ5 is a parameter determined by the naked electron and mediators.
and B/
By the weak force formula (5.4.37), there is an attracting shell region of weak interaction
between naked electron and mediators
as shown in Figure 5.13, with small weak force, where ρ j ( j = 1, 2) are the zero points of
(5.4.37). Namely, they satisfy that
! "
−kρ j 3 ρ5 ρj
e ρj = 1+ , for 1 ! j ! 2.
4B5 r0
In the region (5.4.38), the weak force is attracting, and outside this region the weak force is
repelling: .
< 0 for ρ1 < r < ρ2 ,
Fw (5.4.39)
> 0 for r < ρ1 and ρ2 < r.
Since the mediators γ , γ0 , gk , gk0 and ν contain two weak charges Qw = 2gw , by (5.4.39)
they are attached to the electron in the attracting shell region (5.4.38), forming a cloud of
mediators. The irregular triangle distribution of the weaktons νe , w1 , w2 generate a small
moment of force on the mediators. Meanwhile there also exist weak forces between them.
Therefore the bosons will rotate at a speed less than the speed of light, and generate a small
mass attached to the naked electron νe w1 w2 .
Figure 5.15 (a) The naked electron is accelerated or decelerated in electromagnetic field; amd (b)
the mediators (photons) fly away from the attracting shell region under a perturbation of moment of
force.
These quarks have the same spin arrangements and energy levels. The phenomenon that
there are three identical fermions at one quantum state violates the Pauli exclusion principle.
Thus, there exist two possibilities:
2) there is a new quantum number, such that the three same flavour of quarks in the
baryons of (5.5.1) possess different values for the new quantum number. Namely,
they are not identical, and the Pauli exclusion principle is still valid.
5.5 Structure of Mediator Clouds Around Subatomic Particles 325
To solve this problem, O. W. Greenberg presented the color quantum number in 1964.
According to the constituents of (5.5.1), Greenberg thought that each flavor of quarks has
three colors:
red r, green g, blue b, (5.5.2)
Based on the quantum numbers (5.5.2) and (5.5.3), each flavor of quarks is endowed with
three different colors
qr , qg , qb , qr , qg , qb .
Since hadrons are color neutral, the color quantum number should obey the following mul-
tiplication rule:
rr = w, gg = w, bb = w,
(5.5.5)
rgb = w, r g b = w, w = white color,
and the multiplication is commutative.
The color quantum number has attained many experimental supports. Quantum Chro-
modynamics (QCD) for the strong interaction is based on this theory. In fact, it is natural to
think that the three color quantum states of each flavour of quarks
q = (qr , qg , qb ) (5.5.6)
are indistinguishable in the strong interaction. This gives rise to the SU(3) gauge theory of
QCD, i.e. the QCD action is the Yang-Milli functional of SU(3) gauge fields, given by
1
LQCD = − Skµν S µν k + q(iγ µ Dµ − m)q, (5.5.7)
4
where q is the quark triplet as in (5.5.6), m is the mass of the quark,
j
Skµν = ∂µ Sνk − ∂ν Skµ + gs fikj Siµ Sν ,
(5.5.8)
Dµ = ∂µ + igs Skµ λk ,
and λk (1 ! k ! 8) are the generators of SU(3). In QCD, λk are taken as the Gell-Mann
matrices as defined in (3.5.38).
326 Chapter 5 Elementary Particles
5.5.2 Gluons
In the last subsection we saw that the three color quantum numbers defined in (5.5.2)-(5.5.3)
lead to the QCD action (5.5.7) of SU(3) gauge fields. In the following, we introduce the
gluons in a few steps.
It was known that the SU(3) gauge theory for the strong interaction is oriented toward
two directions: 1) describing the field particles, i.e. the strong interaction mediators, and 2)
providing strong interaction potentials. In Section 4.5, we have discussed problem 2). Here,
we consider problem 1).
Experiments showed that there are eight field particles for the strong interaction, called
gluons, which are massless and electric neutral, denoted by
gk , 1 ! k ! 8. (5.5.9)
The QCD theory shows that the gluons are vector bosons with spin J = 1.
Based on both the unified field theory and the weakton model, corresponding to the
vector gluons (5.5.9) there are eight dual field particles, which are scalar bosons with spin
J = 0, denoted by
gk0 , 1 ! k ! 8. (5.5.10)
In addition, by PID, it follows from (5.5.7)-(5.5.8) that the field equations describing the
eight vector gluons (5.5.9) read
gs k αβ i j gs 1
∂ ν Skµν − f g Sα µ Sβ − gsQkµ = (∂µ + αl Slµ − α0 xµ )φsk , (5.5.11)
h̄c i j h̄c 4
for 1 ! k ! 8, and the field equations describing the eight scalar gluons (5.5.10) are
E! " F
gs 1 gs
µ k
∂ ∂µ φs + ∂ µ
αl Sµ − α0 xµ φsk = −gs ∂ µ Qkµ − fikj gαβ ∂ µ (Sαi µ Sβj ), (5.5.12)
l
h̄c 4 h̄c
Qkµ = qγµ λ k q (λ k = λk , γµ = g µν γ ν ).
The field equations (5.5.11) and (5.5.12) are as in (4.4.34) and (4.4.37).
From (5.5.11) and (5.5.12) we can deduce the following theoretical conclusions for the
gluons (5.5.9) and dual gluons (5.5.10):
1) The field functions Wµk (1 ! k ! 8) describing the gluons (5.5.9) are vector fields,
and, therefore, gk in (5.5.9) are vector bosons with spin J = 1;
5.5 Structure of Mediator Clouds Around Subatomic Particles 327
2) The field function φsk (1 ! k ! 8) describing the gluons (5.5.10) are scalar fields, and
therefore gk0 in (5.5.10) are scalar bosons with J = 0;
3) The field equations (5.5.11) and (5.5.12) are nonlinear. Consequently, the gluons g k
and gk0 are not in free states, and in their bound states the masses of g k and gk0 may
appear;
4) In the bound states, the masses can be generated by the spontaneous symmetry break-
ing in (5.5.11) and (5.5.12), and the mass terms for g k and gk0 are as follows
gs 5k l
gk : αl φs Sµ (φ5lk are the ground states),
h̄c
gk0 : α0 Φks ;
Remark 5.16 Gluons and photons are massless only in their free states. When they
are confined in mediator clouds around electrons and quarks, they may attain masses. In
this case, massive photons are described by the PID electromagnetic equations as follows:
Vector photons:
! "
e 1
∂ ν (∂ν Aµ − ∂µ Aν ) − eJµ = ∂µ + β Aµ − kxµ φ . (5.5.13)
h̄c 4
Scalar photons:
e 1
∂ µ ∂ µ φ − kφ + β ∂ µ (Aµ φ ) − kxµ ∂ µ φ = 0. (5.5.14)
h̄c 4
The masses of bound photons are created in (5.5.13) and (5.5.14) by the spontaneous gauge-
symmetry breaking.
Color charge
In the QCD theory, color quantum numbers are regarded as color charges. Namely, each
quark carries one of the color charges:
Each gluon is considered to carry a pair of a color charge and an anti-color charge in QCD.
Thus, the three pair of color charges and anti-color charges in (5.5.15) constitute fundamen-
tal bases of SU(3) and SU(3), and the gluons gk (1 ! k ! 8) form an octet of the irreducible
representation
SU(3) ⊗ SU(3) = 8 ⊕ 1,
328 Chapter 5 Elementary Particles
We remark that the conclusions (5.5.16) and (5.5.17) of QCD for gluons are completely
consistent with those in the weakton model in (5.3.40) and (5.3.41). However, the QCD
version is based on the color charges of (5.5.15), and the version of weakton model is based
on the color quantum number of the w∗ -weakton:
Gluon radiation
Based on the Standard Model, quarks can emit and absorb gluons in the same fashion as
electrons emitting and absorbing photons. For example, a red u-quark u r emits an rg gluon
grg , then ur becomes a ug quark, as shown in Figure 5.16.
Figure 5.16
ur → ug + grg , (5.5.18)
qr → gbg + qX , X =?.
5.5 Structure of Mediator Clouds Around Subatomic Particles 329
In addition, the lack of color algebra also leads to another serious problem. It is known
that hadrons are color neutral:
c1 = r, c2 = g, c3 = b, c1 = r, c2 = g, c3 = b, (5.5.20)
Hence the color indices of all non-neutral gluons cannot be defined. Thus, if a proton p
contains three quarks and N gluons (N ̸= 0):
p = uci + uc j + dck + ∑ nk gk , ∑ nk = N,
k k
rgb = w, r g b = w, (5.5.22)
rr = gg = bb = w. (5.5.23)
Basic physical considerations imply that the product operation of color indices is com-
mutative and associative:
rg = b, rb = g, bg = r,
(5.5.24)
r g = b, rb = g, bg = r.
wr = r, wg = g, wb = b, wr = r, wg = g, wb = b.
rr = r, gg = g, bb = b,
(5.5.25)
r r = r, g g = g, b b = b.
r(gb) = b, (5.5.26)
which leads to inconsistency, no matter what color we assign for gb. For example, if we
assign gb = r, then we derive from (5.5.26) that rr = b, which is inconsistent with rr = r in
(5.4.25). If we assume gb = b, then rb = b, which, by multiplying by b, leads to r = bb = b,
a contradiction again.
The above inconsistency demonstrates that in addition to the six basic color indices
r, g, b, r, g, b,
Only two of these added color indices are independent, and in fact, we can derive from
(5.5.24) that
rg = br = gb, gr = rb = bg.
y = rg = br = gb,
(5.5.27)
y = gr = rb = bg.
Definition 5.17 A color algebra is defined by the following three basic ingredients:
5.5 Structure of Mediator Clouds Around Subatomic Particles 331
1) The generators of color algebra consists of quarks and gluons, which possesses nine
color indices as
r, g, b, y, r, g, b, y, w,
which form a finite commutative group. Here y and y are given by (5.5.27), and the
group product operation is defined by
yr = b, yg = r, yb = g, yy = y,
yr = g, yg = b, yb = r, y y = y.
2) Color algebra is an algebra with quarks and gluons as generators with integer coef-
ficients, and its space is given by
. / 0
22 /
P = ∑ nk ek // nk ∈ Z ,
k=1
where ek (1 ! k ! 19) are 18 colored quarks and 4 colored gluons, and −e k represent
anti-quarks and anti-gluons.
22
3) The color index of ω = ∑ nk ek ∈ P is defined by
k=1
22
Indc (ω ) = ∏ ck k ,
n
k=1
n
where ck is the color index of ek and ck k = w if nk = 0.
Remark 5.18 Each element ω = Σnk ek ∈ P represents a particle system, and nk is the
difference between the number of particles with color index c k and the number of antiparti-
cles with color index ck . In particular, particles with colors r, g, b must be quarks, particles
with colors r, g, b must be anti-quarks, and particles with colors y, y must be gluons.
332 Chapter 5 Elementary Particles
Remark 5.19 In (5.5.27), rg, br, gb are all yellow. Consequently the gluons g rg , gbr , ggb
have the same color. However, they represent different quantum states. In particular, in the
weakton model,
grg = w∗r w∗g , gbr = w∗b w∗r , ggb = w∗g w∗b ,
which represent different quantum states.
where Indc (gk ) is the color index of the gluon gk . The color neutral law requires that
Indc (p) = w, which does not necessarily lead to c1 c2 c3 = w. For example, for the following
constituents of p:
p = ur + ur + dg + 2grg + grb ,
we have
In summary, the hadron color quantum numbers based on color algebra is very different
from the classical QCD theory.
For a baryon B with its constituents given by
3 3
B = ∑ qci + ∑ (nk gk + mk gk ) + K1 g4 + K2 g4 ,
i=1 k=1
where gk and gk (1 ! k ! 4) are as in (5.5.16) and (5.5.17), its quantum number distribution
satisfies the following color index formula:
and N1 , N2 are as
3 3
N1 = ∑ nk , N2 = ∑ mk . (5.5.29)
k=1 k=1
5.5 Structure of Mediator Clouds Around Subatomic Particles 333
Consider the transformation of a quark qc with color c to another quart qc3 after emitting
a gluon gc1 c2 :
qc → gc1 c2 + qc3 ,
c3 = cc1 c2 .
Also, for the transformation of a quark qc to another quark qc4 after absorbing a gluon gc1 c2 :
qc + gc1 c2 → qc4 ,
c4 = cc1 c2 .
{Gc , PN , Indc },
which consists of
3) a homomorphism
Indc : PN → Gc ,
such that for each element w = ∑Nk=1 nk ek ∈ PN , we can define a color index for w by
N
Indc (w) = ∏ ( Indc (ek ))nk ,
k=1
quark : q = w∗ ww,
gluon: gk = w∗ w∗ .
The only weakton that has colors is w∗ , which has three colors:
Definition 5.20 The w∗ -color algebra is the triplet {Gc , P3 , Indc }, which is defined as
follows:
1) the color group is
Gc = {r, g, b, y, r, g, b, y, w},
with group operation given in Definition 5.17;
2) the Z-modular algebra P3 given by
. / 0
/
P3 = ∑ nk w∗k // nk ∈ Z ;
k=r,g,b
The w∗ -color algebra given by Definition 5.20 is based on the weakton model, which is
much simpler than the QCD color algebra introduced in the last subsection, and is readily
used to study the structure of subatomic particles.
Consider a particle system ω , which consists of nk quarks qk , nk antiquarks qk , m1 gluons
g1 = grg , m2 gluons g2 = gbr , m3 gluons g3 = ggb , m4 color-neutral gluons g4 , and mk gluons
gk (1 ! k ! 4):
4
ω= ∑ (nk qk + nk qk ) + ∑ (mi gi + mi gi ). (5.5.34)
k=r,g,b i=1
Xω = ∑ Nk w∗k , (5.5.35)
k=r,g,b
where
Nr = (nr − nr ) + (m1 − m2 ) − (m1 − m2 ),
Ng = (ng − ng ) + (m3 − m1 ) − (m3 − m1 ), (5.5.36)
Nb = (nb − nb ) + (m2 − m3 ) − (m2 − m3 ),
and, consequently, the color index for ω is defined by
Theorem 5.21 The following assertions hold true for the w∗ -color algebra:
ω = q+π
with π as given by (5.5.38), (5.5.40)
ω = q+π
Indc (ω ) = r, g, b,
(5.5.41)
Indc (ω ) = r, g, b,
M = q + q+ π meson system,
(5.5.42)
B = q+q+q+π baryon system,
First, in particle physics, all basic and important particle systems are given by the parti-
cle systems in Assertions 1)-3) in this theorem. System (5.5.38) represents a gluon system
attached to quarks and hadrons, (5.5.40) represents a cloud system of gluons around a quark
or an anti-quark, and (5.5.42) represents a cloud system of gluons around a hadron (a meson
or a baryon).
5.5 Structure of Mediator Clouds Around Subatomic Particles 337
indicate that through exchange of weaktons, a cloud system (5.5.38) of gluons can become
a system consisting of white gluons and the same number of yellow and anti-yellow gluons.
Proof of Theorem 5.21 We proceed in the following three steps.
where
Mr = (m1 − m2 ) − (m1 − m2 ),
Mg = (m3 − m1) − (m3 − m1 ),
Mb = (m2 − m3) − (m2 − m3 ).
m
Consequently, using C −m = C , we have
Notice that y2 = y, y2 = y. Then the equality (5.5.39) follows from (5.5.44), and Assertion
1) is proved.
Step 2. For Assertion 2), with the above argument, for the particle system (5.5.40) it is
easy to see that
Indc (ω ) = Indc (q) Indc (π ) = Indc (q)yM ,
(5.5.45)
Indc (ω ) = Indc (q) Indc (π ) = Indc (q)yM .
Due to the facts that
Indc (q) = r, g, b, Indc (q) = r, g, b,
and by the multiplication rule given in Definition 5.17, the conclusion (5.5.41) follows from
(5.5.45).
Step 3. With the same arguments as above, for the meson and baryon system (5.5.42),
we can derive that
Indc (M) = ci c j yM , Indc (B) = ci c j ck yM , (5.5.46)
338 Chapter 5 Elementary Particles
ci c j = w, y, y, ci c j ck = ci cl = w, y, y.
According to the layered formulas (5.4.37) of weak forces between naked electron and
mediators, the radius r of a mediator cloud is approximately given by
ρ5 2
r3 ≃ r0 , r0 = 10−16 cm, (5.5.48)
4B5
and ρ5/B5 depends on the type of mediators. In fact, for gluons and naked electron, the
parameter B5 ! 0, therefore electrons have no gluon clouds.
It is reasonable to think that the parameter ρ5/B5 for photon and ν -mediator is different
ρ5γ ρ5ν
̸= .
B5γ B5ν
Therefore, by (5.5.48) the radius rγ of the photon layer is different from the radius rν of the
ν -layer, i.e.
rγ ̸= rν . (5.5.49)
5.5 Structure of Mediator Clouds Around Subatomic Particles 339
Due the weak repelling forces between γ and ν , only the exterior layer of cloud can
emit mediators. Hence, the bremsstrahlung implies that the photon cloud should be in the
exterior layer. Namely (5.5.49) should be written as
rγ > rν . (5.5.50)
N ! 4π r2 /r02 , (5.5.51)
where r is the radius of the layer as in (5.5.48). The inequality (5.5.51) means that all
particles on an r-sphere are arranged with distance r0 .
Based on (5.5.51), if r = kr0 , then N ! 4π k2 .
3. Spin number of vector photons. An electron refers to the system consisting of the
naked electron νe w1 w2 and its mediator cloud as shown in (5.5.47). The photon layer con-
1
tains both vector and scalar photons. As the total spin of an electron is J = , both the
2
number NJ=1 of vector photons with spin J = 1 and the number NJ=−1 of photons with spin
J = −1 should be the same, i.e. the vector photons in an electron satisfy
The condition (5.5.52) implies that the emitting and absorbing of vector photons always
in pairs with one spin J = 1 and another spin J = −1. Scalar photons and ν -mediators have
spin J = 0.
In fact, by the Angular Momentum Rule 6.14, only bosons with J = 0 can rotate around
a center without the presence of force moment. Hence the photons and ν -mediators in the
mediator cloud may all be scalar particles.
Mediator clouds of quarks
Different from the weakton constituents of charged leptons, the weakton constituents of
a quark include a w∗ -weakton with a strong charge, leading to different structure of mediator
clouds.
1. As a quark curries three weak charges 3gw and a strong charge gs , the mediator cloud
of a quark possesses three layers:
gluon layer: vector and scalar gluons,
photon layer: vector and scalar photons, (5.5.53)
ν -layer: ν -mediators.
340 Chapter 5 Elementary Particles
rg ̸= rγ , rγ ̸= rν , rν ̸= rg . (5.5.54)
Similar to the case for an electron, the particle number in each layer of (5.5.53) satisfies
a relation as (5.5.51).
1
3. The total spin of a quark system is also J = . Therefore the vector photons and
2
vector gluons in the mediator cloud of a quark have to obey the spin number rules as
5J=1
where NJ=1 and NJ=−1 are the numbers of photons with spins J = 1 and J = −1, and N
5J=−1 are that of gluons with spins J = 1 and J = −1.
and N
We remark that two scalar mediators can be transformed into a pair of vector particles
with J = 1 and J = −1 respectively.
Indc (q) = r, g, b,
(5.5.56)
Indc (q) = r, g, b.
5. The gluons in the cloud layers of quarks are confined in a hadron. However, gluons
between quarks in a hadron can be exchanged. The gluon exchange process between quarks
in a hadron is called gluon radiation. In fact, a quark can emit gluons which will be absorbed
by other quarks in the hadron.
Equivalently,
q1 + q2 → q3 + q4 .
which lead to
Indc (q1 + q2) = Indc (q3 + q4). (5.5.57)
5.5 Structure of Mediator Clouds Around Subatomic Particles 341
The relations (5.5.53)-(5.5.57) provide the basic properties of mediator clouds of quarks.
Further discussion will be given in the next chapter.
Gluon clouds of hadrons
Different from electrons and quarks, there is no photon and ν -mediator cloud layer
around a hadron, due to the fact that the radius of a naked hadron ρ H is greater than the
range of the weak interaction:
ρH " 10−16 cm.
Namely, a hadron can only have a strong attraction to vector and scalar gluons, forming a
gluon cloud with radius about the same as the radius of a hadron:
Also, hadrons are colorless. For a baryon B and a meson M given by (5.5.42), we have that
Consequently, if the naked hadron is white, then the gluon cloud π is white as well: Indc (π ) =
w.
Free gluons
If there exist free gluons, then they must be white color:
6.1 Introduction
Quantum physics is the study of the behavior of matter and energy at molecular, atomic,
nuclear, and sub-atomic levels.
Quantum physics was initiated and developed in the first half of the 20th century, fol-
lowing the pioneering work of (Planck, 1901) on blackbody radiation, of (Einstein, 1905)
on photons and energy quanta, of Niels Bohr on structure of atoms, of (de Broglie, 1924) on
matter-wave duality. Quantum physics and general relativity have become the two corner-
stones of modern physics. We refer interested readers to (Sokolov, Loskutov and Ternov,
1966; Greiner, 2000; Sakurai, 1994), among many others, for the basics and history of
quantum physics.
Our recent work on the field theory of the four fundamental interactions and on the
weakton model of elementary particles has lead to new insights to a few issues and chal-
lenges in quantum physics, including in particular 1) the basic laws for interacting multi-
particle quantum systems, 2) energy levels of sub-atomic particles, and 3) solar neutrino
problem.
Field theory for interacting multi-particle systems
Based PID and PRI, we know now that the fundamental interactions are due the cor-
responding charges of the particles involved, with the mass charge for gravitational effect,
the electric charge for electromagnetism, the strong and the weak charges for the strong and
weak interactions respectively. The geometric interaction mechanism implies that the dy-
namic matter distribution of the charged particles changes the geometry of the space-time
manifold as well as the corresponding vector-bundles, leading to the dynamic interaction
laws of the system.
Also, due to PRI, for a multi-particle system consisting of subsystems of different levels,
the coupling can only be achieved through PRI and the principle of symmetry-breaking.
The above new insights from the unified field theory in Chapter 4 give rise to the fol-
lowing
6.1 Introduction 343
1) the Lagrangian action for an N-particle system satisfy the SU(N) gauge
invariance;
2) gGaµ represent the interaction potentials between the particles.
3) for an N-particle system, only the interaction field G µ can be measured,
and is the interaction field under which this system interacts with other
external systems.
With this postulate, field equations for a given multi-particle system can be naturally
established using PID and PRI. In particular, one can achieve the unification so that the
matter field can be geometrized as hoped by Einstein and Nambu, as stated explicitly in as
stated in Nambu’s Nobel lecture (Nambu, 2008).
Neutrino was first proposed by Wolfgang Pauli in 1930 in order to guarantee the energy
and momentum conservation for β -decay. In the current standard model of particle physics,
there are three flavors of neutrinos: the electron neutrino νe , the tau neutrino ντ and the mu
neutrino νµ . The solar neutrino problem is referred to the discrepancy of the number of
electron neutrinos arriving from the Sun are between one third and one half of the number
predicted by the Standard Solar Model, and was first discovered by (Davis, Harmer and
Hoffman, 1968).
The current dominant theory to resolve the solar neutrino problem is the neutrino os-
cillation theory, which are based on three basic assumptions: 1) the neutrinos are massive,
and, consequently, are described by the Dirac equations, 2) the three flavors of neutrinos
νe , νµ , ντ are not the eigenstates of the Hamiltonian, and 3), instead, the three neutrinos are
some linear combinations of three distinct eigenstates of the Hamiltonian. However, the
massive neutrino assumption gives rise two serious problems. First, it is in conflict with
the known fact that the neutrinos violate the parity symmetry. Second, the handedness of
neutrinos implies their velocity being at the speed of light.
To resolve these difficulties encountered by the classical theory, we argue that there
is no physical principle that requires that neutrino must have mass to ensure oscillation.
The Weyl equations were introduced by H. Weyl in 1929 to describe massless spin- 12 free
particles (Weyl, 1929), which is now considered as the basic dynamic equations of neutrino
(Landau, 1957; Lee and Yang, 1957; Salam, 1957); see also (Greiner, 2000). One important
property of the Weyl equations is that they violate the parity invariance. Hence by using the
Weyl equations, we are able to introduce a massless neutrino oscillation model. With this
massless model, we not only deduce the same oscillation mechanism, but also resolve the
above two serious problems encountered in the massive neutrino oscillation model.
344 Chapter 6 Quantum Physics
Despite of the success of neutrino oscillation models and certain level of experimental
support, the physical principles behind the neutrino oscillation are still entirely unknown.
Recently, the authors developed a phenomenological model of elementary particles, called
the weakton model (Ma and Wang, 2015b). The ν mediator in the weakton model leads to
an alternate explanation to the solar neutrino problem. When the solar electron neutrinos
collide with anti electron neutrinos in the atmosphere, which are abundant due to the β -
decay of neutrons, they can form ν mediators, causing the loss of electron neutrinos. Note
that ν mediator can also have the following elastic scattering
ν + e− −→ ν + e− .
Also ν participates only the weak interaction similar to the neutrinos, and consequently
possesses similar behavior as neutrinos. Consequently, the new mechanism proposed here
does not violate the existing experiments (SNO and KamLAND).
Energy levels of sub-atomic particles
The classical atomic energy level theory demonstrates that there are finite number of
energy levels for an atom given by En = E0 + λn , n = 1, · · · , N, where λn are the nega-
tive eigenvalues of the Schrödinger operator, representing the bound energies of the atom,
holding the orbital electrons, due to the electromagnetism.
The concept of energy levels for atoms can certainly be generalized to subatomic parti-
cles. The key ingredients and the main results are given as follows.
1
1. The constituents of subatomic particles are spin- fermions, which are bound to-
2
gether by either weak or strong interactions. Hence the starting point of the study is the
layered weak and strong potentials as presented earlier, which play the similar role as the
Coulomb potential for the electromagnetic force which bounds the orbital electrons moving
around the nucleons.
2. The dynamic equations of massless particles are the Weyl equations, and the dynamic
equations for massive particles are the Dirac equations. The bound energies of all subatomic
particles are the negative eigenvalues of the corresponding Dirac and Weyl operators, and
the bound states are the corresponding eigenfunctions.
1
The Weyl equations were introduced by H. Weyl in 1929 to describe massless spin-
2
free particles (Weyl, 1929), which is now considered as the basic dynamic equations of
neutrino (Landau, 1957; Lee and Yang, 1957; Salam, 1957); see also (Greiner, 2000).
3. With bound state equations for both massless and massive particles, we derive the
corresponding spectral equations for the bound states. We show that the energy levels of
each subatomic particle are finite and discrete:
0 < E1 < · · · < EN < ∞,
6.2 Foundations of Quantum Physics 345
and each energy level En corresponds to a negative eigenvalue λn of the related eigenvalue
problem. Physically, λn represents the bound energy of the particle, and are related to the
energy level En with the following relation:
Here E0 is the intrinsic potential energy of the constituents of a subatomic particle such as
the weaktons.
4. One important consequence of the above derived energy level theory is that there are
both upper and lower bounds of the energy levels for all sub-atomic particles, and the largest
and smallest energy levels are given by
In particular, it follows from the energy level theory that the frequencies of mediators
such as photons and gluons are also discrete and finite, and are given by ω n = En /h̄ (n =
1, · · · , N). In the Planck classical quantum assumption that the energy is discrete for a fixed
frequency, and the frequency is continuous. Our results are different in two aspects. One is
that the energy levels have an upper bound. Two is that the frequencies are also discrete and
finite.
Outline of Chapter 6
Section 6.2 presents the basic postulate and facts from quantum physics from the Hamil-
tonian dynamics and Lagrangian dynamics points of view. We prove also a new angular mo-
mentum rule, which is useful for describing the weaktons constituents of charged leptons
and quarks.
Section 6.3 presents new alternative approaches for solar neutrino problem, and is based
on the recent work of authors (Ma and Wang, 2014f).
Section 6.4 introduces energy levels of subatomic particles, and is based on (Ma and
Wang, 2014g).
The field theory and basic postulates for interacting multi-particle systems are estab-
lished in Section 6.5, which is based entirely on (Ma and Wang, 2014d).
The main components of quantum physics include quantum mechanics and quantum field
theory, which are based on the following basic postulates.
346 Chapter 6 Quantum Physics
Postulate 6.1 A quantum system consists of some micro-particles, which are described
by a set of complex value functions ψ = (ψ1 , · · · , ψN )T , called wave functions. In other
words, each quantum system is identified by a set of wave functions ψ :
Postulate 6.2 For a single particle system described by a wave function ψ , its modular
square
|ψ (x,t)|2
represents the probability density of the particle being observed at point x ∈ R 3 and at time
t. Hence, ψ satisfies that
!
|ψ |2 dx = 1.
R3
L̂ψλ = λ ψλ ,
and the eigenfunction ψλ is the state function in which the physical quantity L takes value
λ . In particular, the Hermitian operators corresponding to position x, momentum p and
energy E are given by
Postulate 6.4 For a quantum system ψ and a physical Hermitian operator L̂, ψ can
be expanded as !
ψ = ∑ αk ψk + αλ ψλ d λ , (6.2.3)
R(L̂1 , · · · , L̂N )ψ = 0,
where L̂k are the Hermitian operators corresponding to lk (1 ! k ! N), provided that
R(L̂1 , · · · , L̂N ) is a Hermitian.
Two remarks are in order. First, in Subsection 2.2.5, Postulates 6.3and 6.5 are introduced
as Basic Postulates 2.22 and 2.23.
Second, in addition to the three basic Hermitian operators given by (6.2.2), the other
Hermitian operators often used in quantum physics are as follows:
where s is the spin, ⃗σ and ⃗α are as in (2.2.47) and (2.2.48), and K̂, V̂ , M̂ are the kinetic
energy, potential energy, mass operators.
In addition to these Hermitians in (6.2.2) and (6.2.5), the following 5 types of particle
current operators are very important in quantum field theory:
scalar current: ρ = ψ †ψ ,
pseudo-scalar current: P = ψ † γ 5 ψ ,
vector current: V µ = ψγ µ ψ , (6.2.7)
µ µ 5
axial vector current : A = ψγ γ ψ ,
tensor current: T µν = ψσ µν ψ ,
348 Chapter 6 Quantum Physics
where ψ = ψ † γ 0 , and γ µ (0 ! µ ! 3) and γ 5 are the Dirac matrices, which are expressed in
the forms:
! " ! "
I 0 0 σk
γ0 = , γk = for 1 ! k ! 3,
0 −I −σ k 0
! " (6.2.8)
5 0 1 2 3 0 I
γ = iγ γ γ γ = .
I 0
Remark 6.6 The particle currents defined in (6.2.7) are very important in the transition
theory of particle decays and scatterings. In fact, the general form of particle currents is
written as
JAB = ψ A γψB + h · c ( Hermitian conjugate),
where γ is a current operator in (6.2.7), ψA and ψB are wave functions of particles A and B.
For example, for the β -decay
n → p + e− + ν e ,
by the Fermi theory, the transition amplitude is
Gf
M = √ (ψ e γ µ ψνe )(ψ p γµ ψn ) + h · c,
2
where G f is the Fermi constant, and
γµ = g µν γ ν = (−γ 0 , γ 1 , γ 2 , γ 3 ).
By the Hermitian operators in (6.2.2), this relation leads to the Schrödinger equation, written
as
∂ψ h̄2
ih̄ = − ∆ψ + V (x)ψ , (6.2.10)
∂t 2m
which is clearly is non-relativistic.
2. Klein-Gordon equation. The relativistic energy-momentum relation is given by
E 2 − c 2 p 2 = m2 c4 .
From this relation we can immediately derive the following Klein-Gordon equation:
! "
1 ∂2 mc
− 2 2 + ∆ ψ − ( )2 ψ = 0. (6.2.11)
c ∂t h̄
In the 4-dimensional vectorial form, the equation (6.2.11) is expressed as
I mc J2
∂ µ ∂µ ψ − ψ = 0,
h̄
which is clearly Lorentz invariant.
3. Weyl equations. For a massless particle, the de Broglie matter-wave relation gives
E = h̄ω , p0 = h̄/λ , c = ωλ ,
E = cp0 , (6.2.12)
where ⃗α = (α1 , α2 , α3 ),
! " ! "
I 0 0 σk
α0 = , αk = , k = 1, 2, 3,
0 −I σk 0
∂ψ
ih̄ = −ih̄c(⃗α · ∇)ψ + mc2α0 ψ , (6.2.15)
∂t
where ψ = (ψ1 , ψ2 , ψ3 , ψ4 )T is the Dirac spinor.
Multiplying both sides of (6.2.15) by the matrix α0 , then the Dirac equation is rewritten
in the usual form I mc J
iγ µ ∂µ − ψ = 0, (6.2.16)
h̄
where γ µ = (γ 0 , γ 1 , γ 2 , γ 3 ) is the Dirac matrices as in (6.2.8), with γ 0 = α0 , γ k = α0 αk (1 !
k ! 3).
Remark 6.7 Based on the spinor theory, the Weyl equation (6.2.13) and the Dirac
equations (6.2.16) are Lorentz invariant (see Section 2.2.6) and space rotation invariant. In
addition, the Dirac equations are invariant under the space reflection
x5 = −x, t = t.
5 (6.2.17)
In fact, under the reflection transformation (6.2.17), the Dirac spinor ψ transforms as
5 = γ 0ψ
ψ ( or ψ = γ 0 ψ
5 ). (6.2.18)
γ 0 γ 0 = I, γ 0 γ k = −γ k for 1 ! k ! 3.
Remark 6.8 Because the Weyl spinor has two-components, under the reflection trans-
formation (6.2.17), it is invariant:
Hence the Weyl equation is not invariant under the space reflection (6.2.17), which leads to
violation of parity conservation for decays and scatterings involving neutrinos. The viola-
tion of parity conservation was discovered by (Lee and Yang, 1956).
6.2 Foundations of Quantum Physics 351
δ
L = 0. (6.2.19)
δψ
Hence, we only need to give the Lagrangian action for each of (6.2.9)
1. Schrödinger systems. For the Schrödinger equation (6.2.10), its action takes the form
! " ! "
1 ∂ ψ ∗ ∂ ψ∗ 1 h̄2
Ls = ih̄ ψ − ψ − |∇ψ |2 + V |ψ |2 . (6.2.20)
2 ∂t ∂t 2 2m
2. Klein-Gordon systems. The action for the Klein-Gordon equation (6.2.11) is as fol-
lows
1 1 I mc J2
LKG = ∂ µ ψ ∗ ∂µ ψ + |ψ |. (6.2.21)
2 2 h̄
3) Weyl systems. The action for the Weyl equation (6.2.13) reads
Lw = ψ † σ µ ∂ µ ψ , (6.2.22)
where σ 0 = −I, (σ 1 , σ 2 , σ 3 ) = ⃗σ .
In Section 2.6.4, we derived all quantum dynamic equations from the Principle of Hamil-
tonian Dynamics (PHD). PHD amounts to saying that each conservation system is charac-
terized by a set of conjugate fields
ψk , ϕk , 1 ! k ! N, (6.2.24)
352 Chapter 6 Quantum Physics
h̄2
Ĥ = − ∆ + V(x) for Schrödinger
2m
Ĥ = ih̄c(⃗σ · ∇) for Weyl spinor fields, (6.2.29)
Here the conjugate fields are the real and imaginary parts of ψ = ψ 1 + iψ 2 .
By applying the PHD, from the Hamiltonian energies in (6.2.30) we can derive the
dynamic equations of the three systems, which are equivalent to the form (6.2.27); see
(2.6.49)-(2.6.55).
6.2 Foundations of Quantum Physics 353
However the Klein-Gordon system is different. In fact, we can write the equation
(6.2.11) in the form ! " ! "
∂ ψ ψ
ih̄ = Ĥ , (6.2.31)
∂t ϕ ϕ
where Ĥ is given by
! "
0 1
Ĥ = . (6.2.32)
−h̄2 c2 ∆ + m2c4 0
However, it is clear that the operator Ĥ of (6.2.32) is not Hermitian, and therefore Ĥ is not
a Hamiltonian. Consequently the quantity
! K L
⟨Φ|Ĥ|Φ⟩ = ψ ∗ ϕ + ϕ ∗ (−h̄2 c2 ∆ψ + m2 c4 ψ ) dx
R3
is also not a physical quantity because it is not a real number in general. In other words,
under the theoretic frame based on Postulate 6.5 and PLD, the Klein-Gordon equation can
not be regarded as a model to describe a conservation quantum system.
With PHD, we can, however, show that the Klein-Gordon equation is a model for a con-
served system. As seen in (2.6.57), we take a pair conjugate fields (ψ , ϕ ) and the Hamilto-
nian energy
! E F
1 2 2 2 m2 c 4 2
H= ϕ + c |∇ψ | + 2 |ψ | dx. (6.2.33)
2 R3 h̄
Then the Klein-Gordon equation (6.2.11) can be rewritten as
⎛ ⎞
δ
⎜ δψ H 0
! " ! "
∂ ψ ⎟ ψ
= J⎜ ⎟ , (6.2.34)
∂t ϕ ⎝ δ ⎠ ϕ
0 H
δϕ
and an associated Hamiltonian energy H = H(Ψ, Φ), such that the dynamic equations of
this system are in the form
! " ! "
∂ Ψ 0 I
= J Ĥ(Ψ, Φ), J= , (6.2.37)
∂t Φ −I 0
then the conjugate fields constitute complex valued wave functions ψ k +iϕk (or ϕk +iψk ), 1 !
k ! K, for this system.
The Heisenberg uncertainty relation and Pauli exclusion principle are two important quan-
tum physical laws.
Uncertainty principle
Uncertainty Principle 6.12 In a quantum system, the position x and momentum p, the
time t and energy E satisfy the uncertainty relations given by
1 1
∆x∆p " h̄, ∆t∆E " h̄, (6.2.38)
2 2
where ∆A represents a measuring error of A value. Namely, (6.2.38) implies that x and p,t
and E can not be precisely observed at the same moment.
6.2 Foundations of Quantum Physics 355
The relations (6.2.58) can be deduced from Postulates 6.3 and 6.4. The average values
of position and momentum are
!
⟨x⟩ = ψ ∗ xψ dx,
!
∂ψ
⟨p⟩ = − ψ ∗ ih̄ dx.
∂x
In statistics, the error to an average value is expressed by the squared deviation. Namely,
the errors to ⟨x⟩ and ⟨p⟩ are
!
⟨(∆x)2 ⟩ = ψ ∗ (x − ⟨x⟩)2 ψ dx = ⟨x2 ⟩ − ⟨x⟩2 ,
!
⟨(∆p)2 ⟩ = ψ ∗ (px − ⟨px ⟩)2 ψ dx = ⟨p2x ⟩ − ⟨px ⟩2 .
I(α ) = Aα 2 − Bα + C,
where
!
A= x2 |ψ |2 dx = ⟨(∆x)2 ⟩ (by (6.2.39)),
! !
∂
B=− x |ψ |2 dx = |ψ |2 dx = 1,
∂x
! ! ! "
∂ ψ∗ ∂ ψ 1 ∂ 2 1
C= dx = 2 ψ ∗ −ih̄ ψ dx = 2 ⟨(∆p)2 ⟩ ( by (6.2.39)).
∂x ∂x h̄ ∂x h̄
(6.2.41)
It is clear that A, B,C > 0, and by (6.2.40),
2) Most particles are unstable, and their lifetime τ is very short. In addition, the energy
distribution of each particle is in a range, called the energy width Γ. By the uncertainty
relation, τ and Γ satisfy that
τ Γ " h̄/2.
This relation is very important in experiments, because the width Γ is observable
which determines the lifetime of a particle by the uncertainty relation τ ≃ h̄/2Γ.
3) Uncertainty relations (6.2.38) also imply that the energy and momentum conserva-
tions may be violated in a small scale of time and space. Both conservations are only
the averaged results in larger scale ranges of time and space.
In Section 5.3.2 we introduced the Angular Momentum Rule, which was first proved in
1
(Ma and Wang, 2015b). It says that only the fermions with spin J = can rotate around a
2
central force field. In fact, this rule can be generalized to scalar bosons, i.e. particles with
spin J = 0. In this subsection we shall discuss the rule in more details. To this end, we first
introduce conservation laws of quantum systems based on Principle 6.10.
Conservation laws based on quantum Hamiltonian dynamics (QHD)
Let H be the Hamiltonian energy of a conservative quantum system, which can be de-
scribed by the following Hamiltonian equations:
∂Ψ
= ĤΦ (Ψ, Φ),
∂t
(6.2.43)
∂Φ
= −ĤΨ (Ψ, Φ),
∂t
where
δH δH
ĤΦ = , ĤΨ = .
δΦ δΨ
Let L be an observable physical quantity with the corresponding Hermitian operator L̂
for the conjugate fields (Ψ, Φ)T of (6.2.43), and L̂ is expressed as
! "
L̂11 L̂12
L̂ = , L̂T12 = L̂∗21 .
L̂21 L̂22
It is clear that the quantity L of (6.2.44) is conserved if for the solution (Ψ, Φ) T of
(6.2.43) we have
dL
= 0,
dt
which is equivalent to
! M
ĤΦ† L̂11 Ψ + Ψ†L̂11 ĤΦ − ĤΨ† L̂22 Φ − Φ† L̂22 ĤΨ (6.2.45)
We remark here that if the QHD is described by a complex valued wave function:
ψ = Ψ + iΦ,
358 Chapter 6 Quantum Physics
L̂Ĥ − Ĥ L̂ = 0. (6.2.48)
The formulas (6.2.46)-(6.2.48) are the conservation laws of the classical quantum mechan-
ics.
Hence, the conservation laws in (6.2.45) are the generalization to the classical quantum
mechanics, and are applicable to all conservative quantum systems, including the Klein-
Gordon systems and nonlinear systems.
From the conservation laws (6.2.45) and (6.2.48) we can deduce the following Angular
Momentum Rule.
1
Angular Momentum Rule 6.14 Only the fermions with spin J = and the bosons
2
with J = 0can rotate around a center with zero moment of force. The particles with J ̸=
1
0, will move on a straight line unless there is a nonzero moment of force present.
2
In the following we give a mathematical derivation of the Angular Momentum Rule.
1. Fermions. Consider fermions which obey the Dirac equations as (6.2.46) with the
Hamiltonian
Ĥ = −ih̄c(α k ∂k ) + mc2 α 0 + V (r), (6.2.49)
where V is the potential energy of a central field, and α 0 , α k (1 ! k ! 3) are the Dirac
matrices ! " ! "
I 0 0 σk
α0 = , αk = for 1 ! k ! 3, (6.2.50)
0 −I σk 0
Jˆ = L̂ + sŜ,
6.2 Foundations of Quantum Physics 359
Notice that
x2 ∂2 − ∂2 x2 = x3 ∂3 − ∂3 x3 = −1.
Hence we get
Ĥ L̂1 − L̂1 Ĥ = h̄2 c(α 3 ∂2 − α 2 ∂3 ). (6.2.53)
Similarly we have
Ĥ L̂2 − L̂2 Ĥ = h̄2 c(α 1 ∂3 − α 3 ∂1 ),
(6.2.54)
Ĥ L̂3 − L̂3 Ĥ = h̄2 c(α 2 ∂1 − α 1 ∂2 ).
On the other hand, we infer from (6.2.49) and (6.2.52) that
M N
Ĥ Ŝ j − Ŝ j Ĥ = −ih̄2 cγ 5 ∂k (σ k σ j − σ j σ k ) = −ih̄2 cγ 5 (2iεk jl σ l )∂k = 2h̄2 cεk jl α l ∂k ,
Ĥ = cα 3 p3 , L̂ = 0.
Thus, by the conservation law (6.2.48), the assertion of Angular Momentum Rule for fermions
follows from (6.2.56) and (6.2.57).
2. Bosons. Now, consider bosons which bey the Klein-Gordon equation in the form
(6.2.43). It is known that the spins J of bosons depend on the types of Klein-Gordon fields
(Ψ, Φ): ⎧
⎪
⎪ a scalar field ⇒ J = 0,
⎪
⎪
⎪
⎪ a 4-vector field ⇒ J = 1,
! " ⎪ ⎨
Ψ
= a 2nd-order tensor field ⇒ J = 2, (6.2.58)
Φ ⎪
⎪
⎪
⎪ a real valued field ⇒ neutral bosons,
⎪
⎪
⎪
⎩
a complex valued field ⇒ charged bosons.
For the Klein-Gordon fields (Ψ, Φ)T , the Hamiltonian for a central force field is given
by
! E F
1 2 2 1 2 4
2 2
H= |Φ| + c |∇Ψ| + 2 (m c + V (r))|Ψ| dx (6.2.59)
2 h̄
The Hamiltonian energy operator Ĥ of (6.2.59) is given by
! " E F
ĤΨ 0 1
Ĥ = , ĤΦ = Φ, ĤΨ = −c2 ∆ + 2 (m2 c4 + V ) Ψ. (6.2.60)
0 ĤΦ h̄
The angular momentum operator Jˆ is
! "
L̂ 0
Jˆ = + sh̄σ̂ , σ̂ = (σ 1 , σ 2 , σ 3 ). (6.2.61)
0 L̂
Then by
! !
Φ† L̂Ψdx = − Ψ∗ L̂† Φdx,
! !
ĤΨ† L̂Φdx = − Φ† L̂ĤΨ dx,
6.3 Solar Neutrino Problem 361
However, it is clear that Ĥ and Jˆ in (6.2.60) and (6.2.61) don’t satisfy (6.2.45) for spin s ̸= 0.
Hence the quantum rule of angular momentum for bosons holds true.
Remark 6.15 The Angular Momentum Rule is very useful in the weakton model and
the theory of mediator cloud structure of charged leptons and quarks, which explain why all
stable fermions with mediator clouds are at spin J = 1/2.
The solar neutrino problem is known as that the number of electron neutrinos arriving from
the Sun are between one third and one half of the number predicted by the Standard Solar
Model. This important discovery was made in 1968 by R. Davis, D. S. Harmer and K. C.
Hoffmann (Davis, Harmer and Hoffman, 1968).
To understand this problem clearly, we begin with a brief introduction to the Standard
Solar Model, following (Griffiths, 2008).
In the nineteenth century, most physicist believed that the source of the Sun’s energy was
gravity. However, based on this assumption, Rayleigh showed that the maximum possible
age of the Sun was substantially shorter than the age of the earth estimated by geologists.
At the end of the nineteenth century, Bacquerel and Curies discovered radioactivity, and
they noted that radioactive substances release a large amounts of heat. This suggested that
nuclear fission, not gravity, might be the source of the Sun’s energy, and it could allow
for a much longer lifetime of the Sun. But, the crucial problem for this solar model was
that there were no heavier radioactive elements such as uranium or radium present in the
Sun, and from the atomic spectrum, it was known that the Sun is made almost entirely of
hydrogen.
Up to 1920, F. W. Aston gave a series of precise measurements of atomic masses. It
was found that four hydrogen atoms are more weight slight than one atom of helium-4.
This implied that the fusion of four hydrogens to form a 4 He would be more favorable, and
would release a substantial amounts of energy. A. Eddington proposed that the source of
the Sun’s energy is the nuclear fusion, and in essence he was correct.
In 1938, H. Bethe in collaboration with C. Critchfield had come up with a series of
subsequent nuclear reactions, which was known as the proton-proton p − p chain. The
p − p cycle well describes the reaction processes in the Sun, and consists of the following
four steps:
362 Chapter 6 Quantum Physics
8
Be∗ −→ 4 He + 4 He.
In the p − p chain, it all starts out as hydrogen (proton), and it all ends up as 4 He plus some
electrons, positrons, photons and neutrinos. Because neutrinos interact so weakly, they are
the unique products in the p − p reactions reaching the earth’s surface.
In the p − p chain there are five reactions to yield neutrinos:
p + p −→ 2 H + e+ + νe , (6.3.1)
2
p + p + e −→ H + νe ,
−
(6.3.2)
3
He + p −→ 4 He + e+ + νe , (≃ 0%), (6.3.3)
7 7
Be + e −→ Li + νe ,
−
(6.3.4)
8 8
Be −→ Be + e + νe . ∗ +
(6.3.5)
But the problem is that the detection of the neutrinos have an effect threshold which will
lead to a nearly vanishing response to all neutrinos of lower energy. The energy spectras of
neutrinos in the five reactions are
Em ≃ 0.4 MeV for (6.3.1),
Em ≃ 1.44 MeV for (6.3.2),
Em ≃ 18 MeV for (6.3.3), (6.3.6)
Em ≃ 0.9 MeV for (6.3.4),
Em ≃ 14 MeV for (6.3.5),
6.3 Solar Neutrino Problem 363
where Em is the maximum energy of neutrinos, and the energy flux are
Homestake experiments
The experimental search for solar neutrinos has been undertaken since 1965 by R. Davis
and collaborators in the Homestake goldmine in South Dakota. Since the neutrinos cannot
be directly detected by instruments, it is only by the reactions
νe + X −→ Y + e−
to detect the outgoing products that counte the neutrinos. The Homestake experiments take
νe + 37 Cl −→ 37 Ar + e− . (6.3.8)
Ec = 5.8MeV.
Thus, by (6.3.6) only these neutrinos from both reactions (6.3.3) and (6.3.5) can be ob-
served, which occur at a frequency of 0.015%. Theoretic computation showed that the
expected counting rate of solar neutrinos is at
In 1968, R. Davis et al (Davis, Harmer and Hoffman, 1968) reported the experimental
results, their measuring rate is
the experimental value (6.3.10) is only about one third of the theoretically expected value
(6.3.9). It gave rise to the famous solar neutrino problem.
364 Chapter 6 Quantum Physics
Super-K experiment
In 2001, the Super-Kamiokande collaboration presented its results on solar neutrinos.
Unlike the Homestake experiment, Super-K uses water as the detector. The process is elastic
neutrino-electron scattering:
νx + e −→ νx + e,
where νx is one of the three flavors of neutrinos. This reaction is sensitive to µ and τ neu-
trinos as well as e-neutrinos, but the detection efficiency is 6.5 times greater for e-neutrinos
than for the other two kinds. The outgoing electron is detected by the Cherenkov radiation
it emits in water. They observed the rate at
The Super-Kamiokande detector is located in the Mozumi Mine near Kamioka section of
the city of Hida, Japan.
Sudbury Neutrino Observatory (SNO)
Meanwhile, in the summer of 2001 the SNO collaboration reported their observation
results. They obtained
r = 35% of the predicted value.
The SNO used heavy water (2 H2 O) instead of ordinary water (H2 O), and the SNO detection
method is based on the following reactions:
νe + 2 H −→ p + p + e−, (6.3.11)
νx + 2 H −→ p + n + νx, (6.3.12)
νx + e −→ νx + e .
− −
(6.3.13)
SNO detects electrons e− , but not τ − and µ − , as there is not enough energy in the solar
electron-neutrino such that the transformed tau and mu neutrino can excite neutrons in 2 H
to produce either τ − or µ − .
KamLAND
The loss of reactor electron anti-neutrino ν e is verified by the KamLAND experiment.
A potential alternative experiment
It is known that the following reaction
νµ + n −→ µ − + p (6.3.14)
By the energy spectrum (6.3.6), the maximum energy of solar neutrinos is about 14 ∼
18 MeV, which is much smaller than m µ c2 . Hence, assuming oscillation does occur for
solar neutrinos, the reaction
νµ + 2 H −→ µ − + p + p
νµ + n + γ −→ µ − + p.
νµ + 2 H + γ −→ µ − + p + p (6.3.15)
would occur if
Eνµ + Eγ > 106 MeV. (6.3.16)
Hence one may use high energy photons to hit the heavy water to create the situation in
(6.3.16), so that the reaction (6.3.15) may take place. From (6.3.15), we can detect the µ −
particle to test the neutrino oscillation.
Alternatively, by the µ -decay:
µ − → e− + ν e + ν µ ,
we may measure the electrons to see if there are more electrons than the normal case to test
the existence of mu-neutrinos.
In order to explain the solar neutrino problem, in 1968 B. Pontecorvo (Pontecorvo, 1957,
1968) introduced the neutrino oscillation mechanism, which amounts to saying that the
neutrinos can change their flavors, i.e. an electron neutrino may transform into a muon or a
tau neutrino. According to this theory, a large amount of electron neutrinos ν e from the Sun
have changed into the ν µ or ντ , leading the discrepancy of solar electron neutrinos. This
neutrino oscillation mechanism is based on the following assumptions:
• The neutrinos are massive, and, consequently, are described by the Dirac equations.
• The three types of neutrinos νe , νµ , ντ are not the eigenstates of the Hamiltonian (i.e.
the Dirac operator)
Ĥ = −ih̄c(⃗α · ∇) + mc2α0 . (6.3.17)
366 Chapter 6 Quantum Physics
Ĥ ν j = λ j ν j for 1 ! j ! 3, (6.3.18)
Remark 6.16 The formulas (6.3.17)-(6.3.19) constitute the current model of neutrino
oscillation, which requires the neutrinos being massive. However, the massive neutrino
assumption gives rise two serious problems. First, it is in conflict with the known fact that
the neutrinos violate the parity symmetry. Second, the handedness of neutrinos implies their
velocity being at the speed of light.
In fact, by using the Weyl equations as the neutrino oscillation model we can also deduce
the same conclusions and solve the two mentioned problems. Moreover, the ν mediator
introduced by the authors in (Ma and Wang, 2015b) leads to an alternate explanation to the
solar neutrino problem.
Under the above three hypotheses (6.3.17)-(6.3.19), the oscillation between ν e , νµ and
ντ are given in the following fashion. For simplicity we only consider two kinds neutrinos
νe , νµ , i.e. ντ = 0. In this case, (6.3.19) becomes
ν1 = cos θ νµ − sin θ νe ,
(6.3.20)
ν2 = sin θ νµ + cos θ νe .
∂ νk
ih̄ = λk νk for k = 1, 2.
∂t
The solutions of these equations read
νk = νk (0)e−iλkt/h̄ , k = 1, 2. (6.3.21)
νe (0) = 1, νµ (0) = 0.
ci j = cos θi j , si j = sin θi j ,
The matrix A of (6.3.26) is a unitary matrix: A† = A−1 . Therefore, (6.3.19) can be also
rewritten as ⎛ ⎞ ⎛ ⎞
ν1 νe
⎝ν2 ⎠ = A† ⎝νµ ⎠ .
ν3 ντ
Neutrino masses
As masses are much less than kinetic energy c|p|, by the Einstein triangular relation of
energy-momentum
E 2 = p 2 c2 + m2 c4 ,
we obtain an approximate relation:
1 m2 c 3
E ≃ |p|c + .
2 |p|
The eigenvalues λk of (6.3.18) and E satisfy
1 m2k c3
λk = Ek ≃ |p|c + for k = 1, 2, 3. (6.3.27)
2 |p|
Then we have
(m2i − m2j )
λi − λ j = Ei − E j ≃ c4 , E ≃ |p|c. (6.3.28)
2E
By (6.3.28), if we can measure the energy difference λ i − λ j , then we get the mass square
difference of νi and ν j :
∆i j = m2i − m2j .
There are three mass square differences for ν1 , ν2 , ν3 :
∆21 = m22 − m21 , ∆32 = m23 − m22 , ∆31 = m23 − m21 , (6.3.29)
where Ai j are the matrix elements of A, A∗i j are the complex conjugates of Ai j . The masses
me , mµ , mτ of νe , νµ , ντ are as follows
which leads to
m2e + m2µ + m2τ = ∆32 + 2∆21 + 3m21, (6.3.33)
νe + e −→ νe + e.
This phenomenon was also observed and expanded by S. Mikheyev and A. Smirnov (Mikheev
and Smirnov, 1986), and is now called the MSW effect.
The MSW effect can be reflected in the neutrino oscillation model. We recall the oscil-
lation model without MSW effect expressed as
νk = ϕk (x)e−iλk t/h̄ ,
[−ih̄c(⃗α · ∇) + mc2α0 ]ϕk = λk ϕk k = 1, 2, 3,
⎛ ⎞ ⎛ ⎞ (6.3.34)
νe ν1
⎝νµ ⎠ = A ⎝ν2 ⎠ , A is as in (6.3.26).
ντ ν3
To consider the MSW effect, we have to add weak interaction potentials in the Hamilto-
nian operator Ĥ for neutrinos νe , νµ , ντ . The weak potential energy is as given in (4.6.32):
E F
1 B
Vν = gs (ρν )ρs (ρe )Ne e−kr − (1 + 2kr)e−kr , (6.3.35)
r ρ
370 Chapter 6 Quantum Physics
where ρν , ρe are the radii of neutrinos and electron, gs is the weak charge, and Nw is the
weak charge density. Namely the Hamiltonian with MSW effect for (νe , νµ , ντ ) is
⎛ ⎞ ⎛ ⎞⎛ ⎞
νe Ĥ + Ve 0 0 νe
Hˆ ⎝νµ ⎠ = ⎝ 0 Ĥ + Vµ 0 ⎠ ⎝ν µ ⎠ , (6.3.36)
ντ 0 0 Ĥ + Vτ ντ
where Hˆ is as in (6.3.36).
The equation (6.3.38) is the neutrino oscillation model with the MSW effect, where the
eigenvalues βk and eigenstates νk (1 ! k ! 3) are different from that of (6.3.34). In fact, the
MSW effect is just the weak interaction effect.
1. Parity problem. It is known that all weak interaction decays and scatterings involving
neutrinos violate the parity symmetry, discovered by Lee and Yang in 1956 and experimen-
tally verified by C. Wu (Lee and Yang, 1956; Wu, Ambler, Hayward, Hoppes and Hudson,
1957). It means that the neutrinos are parity non-conserved. Hence it requires that under
the space reflective transformation
x −→ −x, (6.3.39)
the equations governing neutrinos should violate the reflective invariance. Based on Re-
marks 6.7 and 6.8, the Dirac equations are invariant under the reflection (6.3.39), while
the Weyl equations are not invariant. Hence, the massive neutrino oscillation model is in
conflict with the violation of parity symmetry.
2. Handedness and speed of neutrinos. Experiments showed that all neutrinos possess
1
only the left-handed spin J = − , and anti-neutrinos possess the right-handed spin J =
2
6.3 Solar Neutrino Problem 371
1
. It implies that the velocity of free neutrinos must be at the speed of light, which is a
2
contradiction with massive neutrino assumption.
In fact, the handedness is allowed only for massless particles. Otherwise, there exist two
coordinate systems A and B satisfying
vA < v p < v B ,
where vA , vB and v p are the velocities of A, B and the particle. When we look at the particle
ν from A and B, the spins would be reversed. Therefore, all massive particles must have
both left-handed and right-handed spins.
In addition, all experiments measuring neutrino velocity had found no violation to the
speed of light.
3. Infinite number of eigenvalues and eigenstates. The neutrino oscillation theory faces
the problem of the existing of infinite number of eigenvalues. In the massive model (6.3.34),
the wave functions are the Dirac spinors
ϕ = (ϕ 1 , ϕ 2 , ϕ 3 , ϕ 4 )T .
For free neutrinos moving on a straight line, ϕ depends only on z. Thus the eigenvalue
equations in (6.3.34) become
! " ! 1" ! 1"
d ϕ3 2 ϕ ϕ
− ih̄cσ3 + mc =λ ,
dz ϕ 4 ϕ2 ϕ2
! 1" ! 3" ! 3" (6.3.40)
d ϕ 2 ϕ ϕ
− ih̄cσ3 − mc =λ ,
dz ϕ 2 ϕ4 ϕ4
where !"
1 0
σ3 = . (6.3.41)
0 −1
The equations (6.3.40) possess infinite number of eigenvalues
O
4π 2 n2 h̄2 c2
λ = m2 c4 + , ∀l > 0, n = 0, 1, 2, · · · , (6.3.42)
l2
and each eigenvalue has two eigenstates
⎛' ⎞
1 + mc2/λ
ei2π nz/l ⎜' 0 ⎟
ϕ1 = √ 3/2 ⎜ ⎝
⎟,
2l 1 − mc /λ ⎠
2
0
⎛ ⎞ (6.3.43)
' 0
ei2π nz/l ⎜ 1 + mc2/λ ⎟
ϕ2 = √ 3/2 ⎜ ⎟.
2l ⎝ ' 0 ⎠
− 1 − mc2/λ
372 Chapter 6 Quantum Physics
The problem is that which eigenvalues and eigenstates in (6.3.42) and (6.3.43) are the ones
in the neutrino oscillation model (6.3.34), and why only three of (6.3.42)-(6.3.43) stand for
the flavors of neutrinos.
The Weyl equations (6.2.13) can replace the Dirac equations to describe the neutrino
oscillation, which we call massless neutrino oscillation model, expressed as follows
νk = ϕk (x)e−iλk t/h̄ ,
ih̄c(⃗σ · ∇)ϕk = λk ϕk for k = 1, 2, 3,
⎛ ⎞ ⎛ ⎞ (6.3.44)
νe ν1
⎝νµ ⎠ = A ⎝ν2 ⎠ , A is as in (6.3.26),
ντ ν3
λ i − λ j = ωi − ω j , (6.3.45)
The eigenvalues of (6.3.44) at x-axis and z-axis are all the same as in (6.3.47), and the
eigenstates at the x and z axes are
! 1 " ! −ikx " ! " ! −ikz " ! "
ϕ e ϕ1 e 0
= −ikx and = or . (6.3.49)
ϕ2 e ϕ2 0 eikz
6.3 Solar Neutrino Problem 373
ν = αe νe ν e + αµ νµ ν µ + ατ ντ ν τ , (6.3.50)
where αe2 + αµ2 + αµ2 = 1. The values αe2 , αµ2 , ατ2 represent the ratio of the neutrinos νe , νµ
and ντ in our Universe
In view of (6.3.50), we see the reaction
νe + ν e −→ ν (νe ν e ),
νµ + ν µ −→ ν (νµ ν µ ), (6.3.51)
ντ + ν τ −→ ν (ντ ν τ ),
which are generated by the weak interaction attracting force, as demonstrated in the weak
charge potentials
E ! " F
−r/r0 1 Bi 2r −r/r0
Φi = gw e − 1+ e for 1 ! i ! 3, (6.3.52)
r ρν r0
where r0 = 10−16 cm, B1 , B2 , B3 > 0 are the weak interaction constants for νe , νµ , ντ respec-
tively, and ρν is the neutrino radius.
The formula (6.3.52) defines attractive radii Ri for the neutrinos and antineutrinos of the
same flavors. Namely, when νi and ν i are in the radius Ri , the reaction (6.3.51) may occur:
d
Φi (Ri ) = 0. (6.3.55)
dr
Thus we can give a non-oscillation mechanism of neutrinos to explain the solar neutrino
problem. Namely, due to the β -decay, there are large amounts of electronic anti-neutrinos
374 Chapter 6 Quantum Physics
ν e around the earth, which generate the reaction (6.3.51) with νe , leading to the discrepancy
of the solar neutrino.
In addition, there are three axis eigenvalues of the Weyl equations given by (6.3.48)
and (6.3.49). We believe that they are the three flavors of neutrinos ν e , νµ , ντ . Namely, the
following three wave functions
! " ! "
sin ky cos ky
ψ1 = c 1 + c2 ,
− cos ky sin ky
! −ikx "
e
ψ2 = −ikx , (6.3.56)
e
! −ikz " ! "
e 0
ψ3 = c 3 + c4 ikz
0 e
represent the three flavors of neutrinos. In fact, massive particles in field equations are
distinguished by different masses, and flavors of neutrinos by different axis eigenstates.
Also, neutrinos have significant scattering effect of the weak interaction. In fact, they
can have weak interactions with all subatomic particles. This can certainly cause deficits of
neutrinos, and may be one of the main reasons for the loss of solar neutrinos.
1) each subatomic particle consisting of two or three weaktons or quarks bound by weak
or strong interaction, and
w∗ , w1 , w2 , νe , νµ , ντ ,
w∗ , w1 , w2 , ν e , ν µ , ν τ .
Each weakton carries one unit of weak charge gw , and both w∗ and w∗ are only weaktons
carrying a strong charge gs .
2. Six classes of subatomic particles. There are six types of subatomic particles:
charged leptons, quarks, barons, mesons, intermediate bosons, mediators, whose members
are listed as follows.
6.4 Energy Levels of Subatomic Particles 375
1) Charged leptons:
e± , µ ± , τ ± .
2) Quarks:
u, d, s, c, b, t,
u, d, s, c, b, t.
3) Baryons:
p± , n, Λ, Σ± , Σ0 , ∆++ , ∆± , ∆0 , Ξ± , Ξ0 , etc.
4) Mesons:
π ± , π 0 , K ± , K 0 , η , ρ ± , ρ 0 , K ∗± , K ∗0 , etc.
5) Intermediate bosons:
W ±, Z0 , H ±, H 0 .
6) Mediators:
γ , γ0 , gk , gk0 , ν .
e− = νe w1 w2 , µ − = νµ w1 w2 , τ − = ντ w1 w2 .
e+ = ν e w1 w2 , µ + = ν µ w1 w2 , τ + = ν τ w1 w2 .
u = w∗ w1 w1 , c = w∗ w2 w2 , t = w∗ w2 w2 ,
d = w∗ w1 w2 , s = w∗ w1 w2 , b = w∗ w1 w2 .
Baron = qqq.
4. Bound forces of subatomic particles. The main forces holding weaktons and quarks
to form subatomic particles are the weak and strong interactions, and their force sources are
from the interaction charges, i.e.
weak charge gw , strong charge gs .
The 4-dimensional interaction potentials are
weak potential Wµ = ωaWµa = (W0 ,W1 ,W2 ,W3 ),
(6.4.1)
strong potential S µ = ρk Skµ = (S0 , S1 , S2 , S3 ),
The acting forces are
weak force =− gw ∇W0 ,
strong force =− gs∇S0 ,
(6.4.2)
weak magnetism =− gw curl W ⃗,
strong magnetism=− gscurl⃗S,
where W⃗ = (W1 ,W2 ,W3 ), ⃗S = (S1 , S2 , S3 ).
Since each weakton carries one weak charge gw , and w∗ , w∗ carry one strong charge gs ,
by the constituents of subatomic particles, the main bound energy for various particles takes
charged leptons and quarks: weak interaction,
hadrons (barons and mesons) : strong interaction,
intermediate bosons: weak interaction, (6.4.3)
vector and scalar gluons: weak and strong interactions,
other mediators: weak interaction.
Remark 6.17 We need to explain that although each quark has three weak charges, due
to the weak force range r ! 10−16 cm and the distance r > 10−16 cm between the quarks in a
hadron, the main bound force of hadrons is strong interaction, and the weak forces between
the quarks an be ignored.
where W0 is the weak charge potential of a composite particle with N weak charges g w and
with radius ρ , and ρw is the weakton radius, B the weak interaction constants of this particle,
r0 = 10−16 cm.
Based on (6.4.4), the weak potential energy generated by two particles with N1 and N2
weak charges and with radii ρ1 , ρ2 is expressed as follows
E ! " F
−r/r0 1 B12 2r −r/r0
Vw = N1 N2 gw (ρ1 )gw (ρ2 )e − 1+ e , (6.4.5)
r ρ12 r0
where gw (ρ1 ) and gw (ρ2 ) are as in (6.4.4), and B12 /ρ12 depends on the types of the two
particles.
The basic weak charge gw satisfies the relation (4.6.37), i.e.
! "6
ρn
g2w = 5 × 10−3 h̄c. (6.4.6)
ρw
where gs (ρ1 ) and gs (ρ2 ) are as in (6.4.7), and A12 /ρ12 depends on the types of the two
particles.
The basic strong charge gs satisfies the relation (4.5.66), i.e.
! "6
ρn
g2s = 2 × 10 −2
g2 (g2 ≃ 1h̄c). (6.4.9)
ρw
weaktons in mediators are massless. The spectral equations for both massive and massless
bound states are very different. in the following we shall discuss them respectively.
The subatomic particles consist of two or three fermions, their wave functions are the
Dirac spinors
Ψ1 , · · · , ΨN , N = 2 or 3. (6.4.10)
As N = 2 or 3, for each particle its bound energy can be approximatively regarded as the
superposition of the remaining N − 1 particles. Thus the bound potential for each fermion
takes the form
⎧
⎪ (N − 1)gwWµ for weak interaction,
⎨
gAµ = (N − 1)gsSµ for strong interaction, (6.4.11)
⎪
⎩
(N − 1)(gwWµ + gsSµ ) for weak and strong interactions
We now derive spectral equations for massive bound states from (6.4.14). Let the solu-
tions of (6.4.14) be in the form
2 )t/h̄
Ψk = e−i(λ +mk c ψ k.
1
λ − gA0 = mk ν 2 .
2
Now, we need to give the expression of (⃗σ · ⃗D)2 . To this end, note that the Pauli matrices
satisfy
(σ k )2 = 1, σ k σ j = −σ j σ k = iεljk σ l .
Hence we obtain Q R2
3
(⃗σ · ⃗D) =2
∑σ k
Dk = D2 + i⃗σ · (⃗D × ⃗D). (6.4.19)
k=1
380 Chapter 6 Quantum Physics
g⃗
with ⃗D = ∇ + i h̄c A, we derive that
M N
⃗D × ⃗D = i g ∇ × ⃗A + ⃗A × ∇ .
h̄c
Note that as an operator we have
∇ × ⃗A = curl ⃗A − ⃗A × ∇.
Hence we get
⃗D × ⃗D = i g curl⃗A.
h̄c
Thus, (6.4.19) is written as
g
(⃗σ · ⃗D)2 = D2 − ⃗σ · curl⃗A. (6.4.20)
h̄c
By (6.4.20), the spectral equation (6.4.18) is in the form
E F! k " ! k " ! k "
h̄2 2 ψ1 ψ1 ψ1
− D + gA0 +⃗ µ k · curl⃗
A = λ , (6.4.21)
2mk ψ2k ψ2k ψ2k
Since the fermions are bound in the interior Ω of subatomic particle, ψ k (1 ! k ! N) are zero
outside Ω. Therefore the equation (6.4.21) are supplemented with the Dirichlet boundary
conditions:
(ψ1k , ψ2k )|∂ Ω = 0 (1 ! k ! 3), (6.4.23)
where Ω ⊂ R3 is a bounded domain.
The boundary value problem (6.4.21)-(6.4.23) is the model for the energy level theory
of massive subatomic particles.
Massless bound states
In order to obtain the spectral equations for massless subatomic particle, we have to
derive their wave equations, which are based on the basic quantum mechanics principle: the
Postulate 5.5.
We first recall the Weyl equation
∂ψ
= c(⃗σ · ∇)ψ , (6.4.24)
∂t
which describes massless and free fermions. The Weyl equation (6.4.24) is derived from the
de Broglie relation
E = cp ( see (6.2.12)) (6.4.25)
6.4 Energy Levels of Subatomic Particles 381
with
∂
, p̂ = ih̄(⃗σ · ∇).
Ê = ih̄ (6.4.26)
∂t
As consider the massless bound states in weak and strong interactions, the Hermitian
operators in (6.4.26) are replaced by
∂
Ê = ih̄ − gA0, p̂ = ih̄(⃗σ · ⃗D), (6.4.27)
∂t
where ⃗D = (D1 , D2 , D3 ) is as in (6.4.22).
In Section 6.4.1, we knew that the mediators such as photons and gluons consist of two
massless weaktons, which are bound in a small ball Br by the weak and strong interactions.
Hence the Weyl spinor ψ of each weakton is restricted in a small ball B r , i.e.
ψ = 0, ∀x ̸∈ Br ,
ψ |∂ Br = 0. (6.4.28)
However, in mathematics the boundary problem for the Weyl equations generated by
(6.4.25) and (6.4.27) given by
p̂Ê = Ê p̂.
p̂A0 ̸= A0 p̂.
Hence, in order to ensure p̂Ê being Hermitian, we replace p̂A0 by 12 ( p̂A0 + A0 p̂), i.e. take
∂ g
− ( p̂A0 + A0 p̂), p̂ = ih̄c(⃗σ · ⃗D).
p̂Ê = ih̄ p̂ (6.4.31)
∂t 2
Then by Postulate 6.5, from (6.4.30) and (6.4.31) we derive the boundary problem of
massless system in the following form
∂ψ ig
(⃗σ · ⃗D) = c(⃗σ · ⃗D)2 ψ − {(⃗σ · ⃗D), A0 }ψ ,
∂t 2h̄ (6.4.32)
ψ |∂ Ω = 0,
382 Chapter 6 Quantum Physics
ig
−h̄c(⃗σ · ⃗D)2 ϕ + {(⃗σ · ⃗D), A0 }ϕ = iλ (⃗σ · ⃗D)ϕ ,
2
and by (6.4.20) which can be rewritten as
! " ! "
ϕ1 ig ϕ1
[−h̄cD2 + g⃗σ · curl⃗A] + {(⃗σ · ⃗D), A0 }
ϕ2 2 ϕ2
! 1 "
ϕ
= iλ (⃗σ · ⃗D) , (6.4.33)
ϕ2
(ϕ 1 , ϕ 2 )|∂ Ω = 0,
The eigenvalue equations (6.4.33)-(6.4.34) are taken as the model for the energy levels
of massless bound states. The mathematical theory (Theorem 2.42) established in Subsec-
tion 3.6.5 laid a solid foundation for the energy level theory provided by (6.4.33).
Remark 6.18 In the equations (6.4.21)-(6.4.22) and (6.4.33) for bound states, we see
that there are terms
In (6.4.35), the physical quantity ⃗µ = h̄g⃗σ /2m represents magnetic moment, and the term
in (6.4.36) is magnetic force generated by the spin coupled with ether weak or strong in-
teraction. In other words, in the same spirit as the electric charge e producing magnetism,
the weak and strong charges gw , gs can also produce similar effects, which we also call
magnetism.
Indeed, all three interactions: electromagnetic, weak, strong interactions, enjoy a com-
mon property that moving charges yield magnetism, mainly due to the fact that they are all
gauge fields.
6.4 Energy Levels of Subatomic Particles 383
Here ψ1k and ψ2k represent the left-hand and right-hand states. The bound states are due to
the weak interaction, and the potential in (6.4.11) takes the form
⃗ ).
gAµ = 2gwWµ = (2gwW0 , 2gwW
By (6.4.21)-(6.4.23), the spectral equations for charged leptons and quarks are as follows
! "
h̄2 2gw ⃗ 2 j ⃗ )ψ j
− ∇+i W ψ + 2(gww0 +⃗µ j · curlW
2m j h̄c
=λψ j in ρw < |x| < ρ , 1! j!3 (6.4.37)
1 2 3
ψ = (ψ , ψ , ψ ) = 0 at |x| = ρw , ρ ,
where ρw is the weakton radius, ρ the attracting radius of weak interaction, W0 is given by
(6.4.4) with N = 1, gw (ρ ) = gw , and
h̄gw
⃗µ j = ⃗σ is the weak magnetic moment.
2m j
We are in position now to derive a few results on the energy levels for charged leptons
and quarks based on (6.4.37).
1. Bound states and energy levels. We know that the negative eigenvalues and eigen-
functions of (6.4.37) correspond to the bound energy and bound states. Let
−∞ < λ1 ! · · · ! λN < 0
λk is independent of j (1 ! j ! 3), i.e. each weakton has the same bound energy λ k but in
different bound state ψkj .
In the right-hand side of (6.4.39), the first term stands for the kinetic energy, the second
term for the weak magnetic energy, and the third term for the weak potential energy, the
potential energy in (6.4.39) is negative. Hence, the bound energy can be written as
In addition, the energy distributions of charged leptons and quarks are discrete and finite:
and N is the number of negative eigenvalues. Each energy level E k can be expressed as
3. Parameters of electrons. In all charged leptons and quarks, only the electrons are long
life-time and observable. Hence the physical parameters of electrons are important. By the
spectral equation (6.4.37) we can derive some information for electronic parameters.
To this end, we recall the weak interaction potential for the weaktons, which is written
as E ! " F
1 Bw 2r −r/r0 −r/r0
W0 = gw − 1+ e e , (6.4.41)
r ρw r0
where Bw is the constant for weaktons.
Assume that the masses of three weaktons are the same. We ignore the magnetism, i.e.
⃗ = 0. Then (6.4.37) is reduced in the form
let W
h̄2
− ∆ψ + 2gwW0 ψ = λ ψ for ρw < |x| < ρ , (6.4.42)
2m
ψ = 0, for |x| = ρw , ρ .
We shall apply (6.4.41) and (6.4.42) to derive some basic parameters and their relations
for the electrons.
6.4 Energy Levels of Subatomic Particles 385
Let λe and ψe be the spectrum and bound state of an electron, which satisfy (6.4.42). It
follows from (6.4.41) and (6.4.42) that
! "
1 2 2 1 κe
λe = mv + 2gw − (6.4.43)
2 re ρw
1 2
where mv is the kinetic energy of each weakton in electron re the radius of the naked
2
electron, κe is the bound parameter of electron, and they are expressed as
!
1 −r/r0
re = e |ψe |2 dx,
Bρ r
! ! "
2r −2r/r0
κe = Bw 1+ e |ψe |2 dx,
Bρ r0
I m v J2 !
w
= |∇ψe |2 dx.
h̄ Bρ
These three parameters are related with the energy levels of an electron, i.e. with the λ k
and ψk . However, the most important case is the lowest energy level state. We shall use the
spherical coordinate to disscuss the first eigenvalue λ1 of (6.4.42). Let the first eigenfunction
ψe be in the form
ψe = ϕ0 (r)Y (θ , ϕ ).
Then ϕ0 and Y satisfy
! "
h̄2 1 d 2 d
− r ϕ0 + 2gsW0 ϕ0 + βr2k ϕ0 = λe ϕ0 ,
2m r2 dr dr (6.4.44)
ϕ0 (ρw ) = ϕ0 (ρ ) = 0,
and E ! " F
1 ∂ ∂ 1 ∂2
sin θ + 2 Yk = βkYk , (6.4.45)
sin θ ∂ θ ∂θ sin θ ∂ ϕ 2
where βk = k(k + 1), k = 0, 1, · · · .
Because λe is the minimal eigenvalue, it implies that βk = β0 = 0 in (6.4.44). The
eigenfunction Y0 of (6.4.45) is given by
1
Y0 = √ .
4π
Thus ψe is as follows
1
ψe = √ ϕ0 (r),
4π
and λe , ϕ0 are the first eigenvalue and eigenfucntion of the following equation
h̄2 1 d ; 2 d <
− r dr ϕ0 + 2gwW0 ϕ0 = λ1 ϕ0 (6.4.46)
2m r2 dr
ϕ0 (ρw ) = ϕ0 (ρ ) = 0.
386 Chapter 6 Quantum Physics
q = w∗ ww and q = w∗ ww.
Hence, each quark possesses one strong charge g s and three weak charges 3gw . It looks as
if the bound energy of baryons is provided by both weak and strong interactions. However,
since the weak interaction is short-ranged, i.e.
where ρ0 is the quark radius, ρ1 the strong attracting radius, S µ = (S0 , ⃗S) as in (6.4.1) is 4-
dimensional strong potential, and ⃗µk = h̄gs⃗σ /2mk the strong magnetic moment. ψ k satisfy
the nomalization
!
|ψ k |2 dx = 1, in Ω = {x ∈ R3 | ρ0 < |x| < ρ1 }.
Ω
6.4 Energy Levels of Subatomic Particles 387
Mediators are massless bosons, which consists of a weakton and its anti-particle. From
the viewpoint of bound energy, gluons are generated by both weak and strong interactions,
and others are bound only by weak interaction. Hence, the spectral equations of gluons are
different from those the other mediators.
g = w∗ w∗ . (6.4.54)
Based onthe weakton model, w∗ and w∗ contain a weak charge gw and a strong charge gs .
By (6.4.6) and (6.4.9),
g2w
= 0.25,
g2s
i.e. gw and gs have the same order. Therefore, the interactions for the gluon are both weak
and strong forces, i.e. the 4-dimensional potential A µ = (A0 , ⃗A) is as
In (6.4.54), we only need to consider the bound state for a single weakton. Then the
spectral equation is provided by (6.4.33)-(6.4.34), and for (6.4.55) which is written as
2. Photons and ν -mediators. The other mediators such as photons and ν -mediator
consist of a pair of weakton and anti-weakton:
γ = α1 w1 w1 + α2 w2 w2 ,
(6.4.57)
ν = αe νe ν e + αµ νµ ν µ + ατ ντ ν τ .
The weaktons in (6.4.57) only contain a weak charge g w , hence the bound energy of γ and
ν is given by the weak force, i.e.
⃗ ).
gAµ = gwWµ = (gwW0 , gwW
6.4 Energy Levels of Subatomic Particles 389
⃗D = ∇ + i gwW
⃗.
h̄c
3. Energy levels of mediators. By the spectral theory of the Weyl operator established
in Subsection 2.6.5, the negative eigenvalues of (6.4.56) and (6.4.57) are finite, i.e.
−∞ < λ1 ! · · · ! λN < 0,
which stand for bound energy of mediators. It shows that the energy levels of each kind
mediator are finite
0 < E1 ! · · · ! EN . (6.4.59)
Each energy level Ek can be expressed as
Ek = E0 + λ k (1 ! k ! N), (6.4.60)
where the number N of energy levels depends on the particle type. Each subatomic particle
lies in an energy state of Ek (1 ! k ! N) in (6.4.63). In traditional conception, the energy
distribution of all particles is infinite and continuous, i.e. a particle can lie in any state of
energy E with 0 < E < ∞. Hence the energy level theory established here arrive at a very
different conclution:
Physical Conclusion 6.21 The energy distribution of subatomic particles is finite and
discrete.
Usually, we cannot observe the discreteness of energy because the number N of energy
levels is so large that the average difference of adjacent energy level:
EN − E1
∆Ek = Ek+1 − Ek ≃ ≃ 0,
N
is very small. In the following discussion we shall show this point.
To this end, we first give the estimates of the number N of energy levels for various
types of subatomic particles.
1. Energy level number of electrons and quarks. To consider the approximatively com-
putation of number N of energy levels, we always ignore the magnetic effects. In this case,
the spectral equations for charged leptons and quarks are given by (6.4.42), and which are
written as
h̄2
− ∆ψ + 2gwW0 ψ = λ ψ , in ρw < |x| < ρ , (6.4.64)
2mw
ψ = 0, on |x| = ρw , ρ ,
where mw is the masses of weaktons in leptons and quarks, ρ is the attracting radius of weak
interaction.
For the weak interaction potential W0 , we approximatively take
g w Bw
W0 = − .
ρw
Then take the dimensional transformation
x → ρx (ρ as in (6.4.64)).
ρw ≪ ρ .
4mw Bw ρ 2 2
−∆ψ − gw ψ = λ ψ for |x| < 1,
h̄2 ρw (6.4.65)
ψ =0 on |x| = 1.
6.4 Energy Levels of Subatomic Particles 391
It is clear that for the equation (6.4.65), the parameters r, α , θ as in (3.6.28) of Theorem
2.37 are as follows
4mw Bw ρ 2 2
r = 1, α = 0, θ = gw
h̄2 ρw
Thus the number N of the energy levels of charged leptons and quarks is approximatively
given by
E F3
4 Bw ρ 2 mw c g2w 2
N= , (6.4.66)
λ1 ρw h̄ h̄c
where λ1 is the first eigenvalue of −∆ in unit ball B1 ⊂ R3 .
By the de Broglie relation,
mw c 1
∼ ,
h̄ ̸λ
where λ
̸ is the wave length of weaktons, i.e. λ
̸ = nrm (n = 1, 2, · · · ), rm the mediator radius.
We take ̸λ = rm ≃ ρ , and
4 Bw ρ mw c
≃ 1, ≃ 102 , ρ ≃ 1.
λ1 ρw h̄
5 = N · N1 ,
N
where N is as in (6.4.67) and N1 the energy level number of mediator cloud around the
charged leptons and quarks.
2. Energy level number of hadrons. We only consider the case of baryons, and the case
of mesons is similar. For the baryons, spectral equation (6.4.48) can be approximatively
reduced in the form
4mq Aq ρ12 2
−∆ψ − gs ψ = λ ψ , for |x| < 1,
h̄2 ρq (6.4.68)
ψ = 0, on |x| = 1.
where mq is the quark mass, ρq the quark radius, ρ1 the strong attracting radius, and Aq the
strong interaction constant of quarks.
392 Chapter 6 Quantum Physics
By Theorem 3.38, from (6.4.68) we can get the estimates of energy level number N of
baryons as follows
E F3
4 ρ12 Aq mq c g2s 2
N= , (6.4.69)
λ1 ρq h̄ h̄c
where λ1 is as in (6.4.66). By (6.4.69) and (6.4.9) we can get the same estimate as in
(6.4.67).
3. Energy level number of mediators. Likewise, we only consider the energy level
number of photons. In this case, the spectral equation (6.4.58) can be reduced as
! "
Bw ρ r 2
−∆ϕ = i λ + gw (⃗σ · ∇)ϕ , for |x| < 1,
h̄cρw (6.4.70)
ϕ =0 on |x| = 1,
where ργ is the photon radius.
By (6.4.70) we can see that the parameter K as in (3.6.51) takes
Bw ργ g2w
K= .
ρw h̄c
Thus, by (3.6.51) the energy level number N of photons is given by
! "3 E F3
K 1 Bw ργ g2w
N= = , (6.4.71)
β1 β1 ρw h̄c
where β1 is as in (3.6.51), and β1 ∼ o(1).
From the physical significance, ργ is approximatively the weak attracting radius of
weaktons, and ρw /Bw = ρ satisfying
P
> 0 for 0 < r < ρ ,
Fw
< 0 for ρ < r < ργ .
Hence, it is natural to think that
Bw ρ γ ργ
= = 102 ∼ 104 .
ρw ρ
Thus, by (6.4.71) and (6.4.6) we get hat
! "18
ρn
N∼ " 1090. (6.4.72)
ρw
Remark 6.22 In the estimates (6.4.66), (6.4.69) and (6.4.71), the energy level numbers
N for various subatomic particles are counting the multiplicities of eigenvalues. However,
the numbers N have the same order as the real energy level numbers. It is because that the
multiple eigenvalues are unstable, under the perturbation of electromagnetism together with
weak and strong magnetism, most multiple eigenvalues become the simple eigenvalues.
6.4 Energy Levels of Subatomic Particles 393
It is clear that the largest and smallest energy levels are given by
Emax − Emin = λN − λ1 .
Since |λ1 | ≫ |λN |, the average energy level gradient (for two adjacent energy levels) is
approximatively given by
Emax − Emin |λ1 |
∆E = ≃ . (6.4.73)
N N
The first eigenvalue λ1 of (6.4.70) is
! "
h̄c
λ1 ≃ −K the unit is .
ργ
h̄c Bw ργ g2w
∆E = β13 K −2 , K= .
ργ ρw h̄c
Remark 6.23 The physical conclusion that all particles have finite and discrete energy
distribution is of very important significance for the quantum field theory. It implies that
the infinity appearing in the field quantization does not exist, and the renormalization theory
needs to be reconsidered.
394 Chapter 6 Quantum Physics
ψ = ψ (t, x1 , · · · , xk ) (6.5.2)
be the wave function describing the N-particle system (6.5.1). Then, the classical theory for
(6.5.1) is provided by the Schrödinger equation
∂ψ N
h̄2
ih̄ =−∑ ∆k ψ + ∑ V (x j , xk )ψ , (6.5.3)
∂t k=1 2mk j̸=k
• It is non-relativistic model;
• The model does not involve the vector potentials ⃗A of the interactions between parti-
cles.
• The model is decoupled with interaction fields, i.e. the interaction fields in the model
are treated as given functions.
6.5 Field Theory of Multi-Particle Systems 395
We note that in general multi-particle systems are layered, and may consist of numerous
sub-systems. In particular, we know that the weak and strong interactions are also layered.
Hence, here we consider the same level systems, i.e. the systems which consist of identical
particles or sub-systems possessing the same level of interactions.
For multi-particle systems with N same level subsystems Ak (1 ! k ! N), the energy
contributions of Ak are indistinguishable. Hence, the Lagrangian actions for the N-particle
systems satisfy SU(N) gauge invariance. Thus we propose the following basic postulate:
Postulate 6.25 An N-particle system obeys the SU(N) gauge invariance, i.e. the
Lagrangian action of this system is invariant under the SU(N) gauge transformation
⎛ ⎞ ⎛ ⎞
ψ
51 ψ1
⎜ .. ⎟ ⎜ .. ⎟
⎝ . ⎠ = Ω ⎝ . ⎠ , Ω ∈ SU(N), (6.5.5)
ψ
5N ψN
where ψ1 , · · · , ψN are the wave functions of the N particles.
396 Chapter 6 Quantum Physics
We now need to explain the physical significance of the SU(N) gauge fields induced by
Postulate 6.25.
Let each particle of the N-particle system carry an interaction charge g (for example
a weak charge g = gw ). Then, there are interactions present between the N particles. By
the SU(N) gauge theory, the gauge invariant 4-dimensional energy-momentum operator is
given by
Dµ = ∂µ + igGaµ τa for 1 ! a ! N 2 − 1, (6.5.6)
and the interaction energy generated by the N particles is
.
Ψ(iγ µ Dµ Ψ) for fermions,
E= (6.5.7)
|Dµ Ψ|2 for bosons,
where Ψ = (ψ1 , · · · , ψN )T , and Dµ is as in (6.5.6). From (6.5.6) and (6.5.7) we obtain the
physical explanation to the SU(N) gauge fields Gaµ , stated in the following postulate:
Postulate 6.26 For a system of N-particles in the same level with each particle carrying
an interaction charge g, the N particles induce dynamic interactions between them, and the
SU(N) gauge fields
gGaµ for 1 ! a ! N 2 − 1 (6.5.8)
stand for the interaction potentials between the N particles.
The N particles induce dynamic interactions between them in terms of the SU(N) gauge
fields (6.5.8). These interaction fields cannot be measured experimentally because they
depend on the choice of generator representation τa of SU(N). By the SU(N) geometric
theory in Section 3.5, there is a constant SU(N) tensor
Gµ = αaN Ga (6.5.10)
is independent of the SU(N) representation τa . The field (6.5.10) is the interaction field
which can be experimentally observed. Thus we propose the following basic postulate.
Postulate 6.27 For an N-particle system, only the interaction field given by (6.5.10)
can be measured, and is the interaction field under which this system interacts with other
external systems.
The main motivation to introduce Postulates 6.25 and 6.26 are as follows. Consider an
N-particle system with each particle carrying an interaction charge g. Let this be a fermionic
system, and the Dirac spinors be given by
Ψ = (ψ1 , · · · , ψN )T .
By Postulates 6.3 and 6.5, the Dirac equations for this system can be expressed in the general
form
iγ µ Dµ Ψ + MΨ = 0, (6.5.11)
where G = (Giµj ) is an Hermitian matrix, representing the interaction potentials between the
N particles generated by the interaction charge g.
Notice that the space consisting of all Hermitian matrices
τ0 , τ1 , · · · , τK with K = N 2 − 1, (6.5.13)
The equations (6.5.11) with (6.5.14) are just the Dirac equations in the form of SU(N)
gauge fields {Gaµ | 1 ! a ! N 2 − 1} with a given external interaction field G0µ . Thus, based
on Postulate 6.24, the gauge invariance of an N-particle system and the expressions (6.5.11)
and (6.5.14) of the N fermionic particle field equations dictate Postulates 6.25 and 6.26.
The derivation here indicates that Postulates 6.25 and 6.26 can be considered as the
consequence of 1) the gauge invariance stated in Postulate 6.24, and 2) the existence of
interactions between particles as stated in (6.5.12), which can be considered as an axiom.
398 Chapter 6 Quantum Physics
Fermionic systems
Consider N fermions at the same level with interaction charge g, the wave functions
(Dirac spinors) are given by
where LG is the sector of the SU(N) gauge fields, and LD is the Dirac sector of particle
fields:
1
LG = − Gab gµα gνβ Gaν µ Gbαβ ,
4h̄c
E ! " F (6.5.18)
ig ig c
LD = Ψ iγ µ ∂µ + G0µ + Gaµ τa − M Ψ,
h̄c h̄c h̄
where Gaµ (1 ! a ! N 2 −1) are the SU(N) gauge fields representing the interactions between
the N particles, τa (1 ! a ! N 2 − 1) are the generators of SU(N), and
1
Gab = Tr(τa τb† ),
2
g a b c
Gaµν = ∂µ Gaν − ∂ν Gaµ + λ G G .
h̄c bc µ ν
According to PID and PLD, for the action (6.5.17) the field equations are given by
δL
= Dµ φa by PID,
δ Gaµ
(6.5.19)
δL
=0 by PLD,
δΨ
where Dµ is the PID gradient operator given by
! "
1 1 gα gβ 0
Dµ = ∂ µ − k2 x µ + Gµ + Gµ ,
h̄c 4 h̄c h̄c
6.5 Field Theory of Multi-Particle Systems 399
Gµ is as in (6.5.10), α and k are parameters, k −1 stands for the range of attracting force of
I gα J−1
the interaction, and is the range of the repelling force.
h̄c
Thus, by (6.5.18) and (6.5.19) we derive the field equations of the N-particle system
(6.5.15)-(6.5.16) as follows
M g b αβ c d N
Gab ∂ ν Gbν µ − λcd g Gα µ Gβ − gΨγµ τa Ψ (6.5.20)
h̄c
E F
1 gα gβ 0
= ∂ µ − k2 x µ + Gµ + G φa for 1 ! a ! N 2 − 1,
4 h̄c h̄c µ
⎛ ⎞ ⎛ ⎞
E F ψ1 ψ1
ig ig ⎜ . ⎟ c ⎜ ⎟
⎟ − M ⎜ .. ⎟ = 0,
iγ µ ∂µ + G0µ + Gaµ τa ⎜ .
.
⎝ ⎠ h̄ ⎝ ⎠ . (6.5.21)
h̄c h̄c
ψN ψN
where γµ = g µν γ ν , and G0µ is the interaction field of external systems. It is by this field G0µ
that we can couple external sub-systems to the model (6.5.20)-(6.5.21).
Remark 6.29 In the field equations of multi-particle systems there is a gauge fixing
problem. In fact, we know that the action (6.5.17)-(6.5.18) is invariant under the gauge
transformation
I J ! "
Ψ,
5 G5aµ τa = eiθ a τa Ψ, Gaµ eiθ b τb τa e−iθ b τb − 1 ∂µ θ b τb . (6.5.22)
g
then (Ψ,
5 G 5aµ ) is a solution of (6.5.23) as well. In (6.5.22) we see that G
5aµ have N 2 − 1 free
functions
θ a (x) with 1 ! a ! N 2 − 1. (6.5.24)
1) there might exist some unknown fundamental principles, which can provide the all or
some of the N 2 − 1 gauge fixing equations; and
2) there might be no general physical principles to determine the gauge fixing equations,
and these equations will be determined by underlying physical system.
400 Chapter 6 Quantum Physics
Bosonic systems
Φ = (ϕ1 , · · · , ϕN )T ,
1 1 I c J2
LKG = |Dµ Φ|2 + |MΦ|2
2 2 h̄
ig ig
Dµ = ∂µ + G0µ + Gaµ τa .
h̄c h̄c
Then, the PID equations of (6.5.25) are as follows
M g b αβ c d N ig K L
Gab ∂ ν Gbν µ − λcd g Gα µ Gβ + (Dµ Φ)† (τa Φ) − (τa Φ)† (Dµ Φ) (6.5.26)
E h̄c 2 F
1 2 g g
= ∂µ − k xµ + α Gµ + β G0µ φa for 1 ! a ! N 2 − 1,
4 h̄c h̄c
⎛ ⎞ ⎛ ⎞
ϕ1 I c J2 ϕ1
µ ⎜ .. ⎟ 2 ⎜ .. ⎟
D Dµ ⎝ . ⎠ − M ⎝ . ⎠ = 0. (6.5.27)
h̄
ϕN ϕN
Mixed systems
The interaction fields of this system are SU(N1 )× SU(N2 ) gauge fields, SU(N1 ) gauge fields
are for fermions, and SU(N2 ) for bosons:
where LG1 and LG2 are the sectors of SU(N1 ) and SU(N2 ) gauge fields as given in (6.5.18)
with N = N1 and N = N2 respectively.
Define the two total gauge fields of SU(N1 ) and SU(N2 ), as defined by (6.5.9)-(6.5.10):
Another remark is that the gauge actions LG1 and LG2 in (6.5.28) obey the gauge invari-
ance, but sectors LD and LKG break the gauge symmetry, due to the coupling of different
level physical systems. Namely, the Principle of Symmetry-Breaking 2.14 holds true here.
In addition, the field equations (6.5.30) and (6.5.31) spontaneously break the gauge symme-
5µ on the right-sides of the field equations.
try, due essentially to the fields G µ and G
Layered systems
Without loss of generality, we assume A and B are the fermion systems. Thus the action
of this layered system is
!
G H
K K
L= LAG + ∑ LB j G + LAD + ∑ LB j D dx, (6.5.35)
j=1 j=1
where
where G0µ is the external field. The corresponding PID field equations of the layered multi-
particle system (6.5.34) follow from (6.5.35) and (6.5.36), and here we omit the details.
6.5 Field Theory of Multi-Particle Systems 403
Remark 6.30 Postulate 6.27 is essentially another expression of PRI, which is very
crucial to couple all sub-systems together to form a complete set of field equations for a
given multi-particle system. In particular, this approach is natural and unique to derive
models for multi-particle systems, satisfying all fundamental principles of (6.5.4), the Prin-
ciple of Symmetry-Breaking 2.14, and the gauge symmetry breaking principle (Principle
4.4). It is also a unique way to establish a unified field theory coupling the gravity and other
interactions in various levels of multi-particle systems. In the next subsection we discuss
this topic.
In Chapter 4, we have discussed the unified field theory, in which we consider two aspects:
1) the interaction field particles, and 2) the interaction potentials. Hence, it restricted the
unified field model to be the theory based on
where N1 , · · · , NK are the particle numbers of various sub-systems and layered systems.
The two types of unified field models based on (6.5.37) and (6.5.38) are mutually com-
plementary. They have different roles in revealing the essences of interactions and particle
dynamic behaviors.
In this subsection, we shall establish the unified field model of multi-particle systems
based on (6.5.38), which matches the vision of Einstein and Nambu. In his Nobel lecture
(Nambu, 2008), Nambu stated that
Einstein used to express dissatisfaction with his famous equation of gravity
Gµν = 8π Tµν
His point was that, from an aesthetic point of view, the left hand side of the
equation which describes the gravitational field is based on a beautiful geo-
metrical principle, whereas the right hand side, which describes everything
else, . . . looks arbitrary and ugly.
... [today] Since gauge fields are based on a beautiful geometrical prin-
ciple, one may shift them to the left hand side of Einstein s equation. What
is left on the right are the matter fields which act as the source for the gauge
fields ... Can one geometrize the matter fields and shift everything to the left?
404 Chapter 6 Quantum Physics
The gravity will be considered only in systems possessing huge amounts of particles,
which we call gravitational systems. Many gravitational systems have very complicated
structures. But they are composites of some simple systems. Here we only discuss two
cases.
where R is the scalar curvature, G is the gravitational constant, g = det(g µν ), LAN1 and LAN2
are the sectors of SU(N1 ) and SU(N2 ) gauge fields for the electromagnetic interaction
1
LAN1 = − Gab gµα gνβ Aaµν Abαβ 1 ! a, b ! N12 − 1,
4
1
LAN2 = − G5kl gµα gνβ A 5kµν A
5l
αβ 1 ! k, l ! N22 − 1,
4 (6.5.40)
n1 e a b c
Aaµν = ∂µ Aaν − ∂ν Aaµ + λ A A n1 ∈ Z,
h̄c bc µ ν
5kµν = ∂µ A
A 5kµ + n2 e 5
5kν − ∂ν A λkA 5j
5i A n2 ∈ Z,
h̄c i j µ ν
where M1 and M2 are the masses, ∇µ is the covariant derivative, and A0µ is the external
electromagnetic field.
6.5 Field Theory of Multi-Particle Systems 405
Based on PID and PLD, the field equations of (6.5.39) are given by
δ c4 G g
L= D φ , (PID)
δ g µν 8π G µ ν
δ
L = DAµ φa , (PID)
δ Aaµ
δ ! (6.5.42)
L = DAµ φ5k , (PID)
δ Aµ
5 k
δ
L = 0, (PLD)
δΨ
δ
L = 0, (PLD)
δΦ
where
n1 e n2 e 5
DG
µ = ∇µ + Aµ + Aµ ,
h̄c h̄c
1 n1 e n2 e 5
DAµ = ∂µ − k12 xµ + α Aµ + α
5 Aµ , (6.5.43)
4 h̄c h̄c
! 1 n1 e n2 e 5 5
DAµ = ∂µ − k22 xµ + β Aµ + β Aµ .
4 h̄c h̄c
Here Aµ = αaN1 Aaµ and A 5µ = α N2 A
k
5kµ are the total electromagnetic fields generated by the
fermion system and the boson system.
By (6.5.39)-(6.5.41), the equations (6.5.42)-(6.5.43) are written as
1 8π G I n1 e n2 e 5 J g
Rµν − gµν R = − 4 Tµν + ∇µ + Aµ + Aµ φν , (6.5.44)
M 2 c
n1 e b αβ c d N h̄c h̄c
Gab ∂ ν Abν µ − λ g Aα µ Aβ − n1 eΨγµ τa Ψ (6.5.45)
E h̄c cd F
1 n1 e n2 e 5
= ∂µ − k12 xµ + α Aµ + α
5 Aµ φa ,
4 h̄c h̄c
M N i
5lν µ − n2 e 5
K L
G5kl ∂ ν A λilj gαβ A5iα µ A5βj + n2 e (Dµ Φ)† (τ5k Φ) − (τ5k Φ)† (Dµ Φ) , (6.5.46)
E h̄c 2
F
1 n1 e n2 e 5 5 5
= ∂µ − k22 xµ + β Aµ + β Aµ φk ,
4 h̄c h̄c
E F
in 1 e in 1 e c
iγ µ ∂µ + A0µ + Aaµ τa Ψ − M1 Ψ = 0, (6.5.47)
h̄c h̄c h̄
I c J2
gµν Dµ Dν Φ − M22 Φ = 0, (6.5.48)
h̄
where the energy-momentum tensor Tµν in (6.5.44) is
1 1
Tµν = − gµν (LAN1 + LAN2 + h̄cLD + h̄cLKG ) + (Dµ Φ)† (Dν Φ) (6.5.49)
2 2
1 αβ a b 1 5 αβ 5k 5l
− Gab g Aµα Aνβ − Gkl g Aµα Aνβ .
4 4
406 Chapter 6 Quantum Physics
The energy-momentum tensor Tµν contains the masses M1 , M2 , the kinetic energy and elec-
tromagnetic energy.
It is clear that both sides of the field equations (6.5.44)-(6.5.48) are all generated by
the fundamental principles. It is the view presented by Einstein and Nambu and shared by
many physicists that the Nature obeys simple beautiful laws based on a few first physical
principles. In other words, the energy-momentum tensor Tµν is now derived from first
principles and is geometrized as Einstein and Nambu hoped.
Systems with four interactions
The above systems with gravity and electromagnetism in general describe the bodies in
lower energy density. For the systems in higher energy density, we have to also consider
the weak and strong interactions. The interactions are layered as shown below, which were
derived in (Ma and Wang, 2015b, 2014g):
The layered systems and sub-systems above determine the action of the system with
four interactions as follows:
!
c4 √
L= R −gdx + actions of all levels, (6.5.50)
8π G
6.5 Field Theory of Multi-Particle Systems 407
and the action of each layered level is as given by the manner as used in (6.5.35)-(6.5.36).
Hence, the unified field model of a multi-particle system is completely determined by
the layered structure of this system, as given by (6.5.50). It is very natural that a rationale
unified field theory must couple the matter fields and interaction fields together.
Remark 6.31 Once again we emphasize that, using PRI contractions as given by
(6.5.10) and proper gauge fixing equations, from the unified field model (6.5.50) coupling
matter fields for multi-particle system, we can easily deduce that the total electromagnetic
field Aµ obtained from (6.5.50) satisfies the U(1) electromagnetic gauge field equations,
and derive the weak and strong interaction potentials as given in (Ma and Wang, 2015a,
2014h).
Classical quantum mechanics is essentially a subject to deal with single particle systems.
Hence, the hydrogen spectrum theory was perfect under the framework of the Dirac equa-
tions. But, for general atoms the spectrum theory was defective due to lack of precise field
models of multi-particle systems.
In this subsection, we shall apply the field model of multi-particle systems to establish
the spectrum equations for general atoms.
1. Classical theory of atomic shell structure. We recall that an atom with atomic number
Z has energy spectrum
Z 2 me4
En = − 2 2 , n = 1, 2, · · · . (6.5.51)
n 2h̄
If we ignore the interactions between electrons, the orbital electrons of this atom have the
idealized discrete energies (6.5.51). The integers n in (6.5.51) are known as principal quan-
tum number, which characterizes the electron energy levels and orbital shell order:
n: 1 2 3 4 5 6 7
(6.5.52)
shell symbol: K L M N O P Q.
Each orbital electron is in some shell of (6.5.52) and possesses the following four quantum
numbers:
For each given shell n, there are sub-shells characterized by orbital quantum number l,
whose symbols are:
l: 0 1 2 3 4 ···
(6.5.53)
sub-shell: s p d f g · · · .
By the Pauli exclusion principle, at a give sub-shell nl, there are at most the following
electron numbers
Nnl = Nl = 2(2l + 1) for 0 ! l ! n − 1.
Namely, for the sub-shells s(l = 0), p(l = 1), d(l = 2), f (l = 3), g(l = 4), their maximal
electron numbers are
n−1
Nn = ∑ Nl = 2n2. (6.5.54)
l=0
2. Atomic field equations. Based on the atomic shell structure, the electron system of an
atom consists of shell systems as (6.5.52), which we denote by
Thus, we have two kinds of classifications (6.5.55) and (6.5.56) of sub-systems for
atomic orbital electrons, which lead to two different sets of field equations.
where
1 µα νβ an an
LSU(Kn ) = − g g Aµν Aαβ 1 ! an ! Kn ,
I 4h̄c J
me c
LDn = Ψn iγ µ Dµ − Ψn 1 ! n ! N,
h̄
e (6.5.59)
Aaµν
n
= ∂µ Aaνn − ∂ν Aaµn − λbanncn Abµn Acνn ,
! h̄c "
ie 0 ie an
Dµ Ψn = ∂µ − Aµ − Aµ τan Ψn ,
h̄c h̄c
where Aaµn are the SU(Kn ) gauge fields representing the electromagnetic (EM) potential of
the electrons in Sn , λbanncn are the structure constants of SU(Kn ) such that Gan bn = 21 tr(τan τb†n ) =
δan bn , A0µ is the EM potential generated by the nuclear, g = −e (e > 0) is the charge of an
electron, and me is the electron mass.
The PID gradient operators for SU(K1 ) × · · · × SU(KN ) in (6.5.19) are given by
G H
1 e
h̄c k̸∑
Dnµ =
(k)
∂µ + Aµ for 1 ! n ! N, (6.5.60)
h̄c =n
(k) K a
where A µ = αaKk Aµk is the total EM potential of Sk shell as defined in (6.5.10).
Then by (6.5.58)-(6.5.60), the field equations of (6.5.57) can be written in the following
form
e
∂ ν Aaνnµ + λbanncn gαβ Abαnµ Acβn + eΨγµ τ an Ψn (6.5.61)
G h̄c H
e
h̄c k̸∑
A µ φ an for 1 ! an ! Kn2 − 1, 1 ! n ! N,
(k)
= ∂µ +
=n
E F
µ ie 0 ie an me c
iγ ∂µ − Aµ − Aµ τan Ψn − Ψn = 0. (6.5.62)
h̄c h̄c h̄
B. F IELD EQUATION OF SYSTEM Snl . The precise model of atomic spectrum should
n−1
take (6.5.56) as an N-particle system. Also, Sn = ∑ Snl is again divided into n sub-systems
l=0
Sn : Sn0 , · · · , Snn−1.
Hence, the system Snl has more sub-systems than Sn , i.e. if Sn has N sub-systems, then Snl
1
has N(N + 1) sub-systems.
2
Let Snl have Knl electrons with wave functions:
K
Snl : Ψnl = (ψnl1 , · · · , ψnlnl ), 1 ! n ! N, 0 ! l ! n − 1, (6.5.63)
where LSU(Knl ) and LDnl are similar to that of (6.5.59). Thus, the field equation of the system
(6.5.63) is determined by (6.5.64).
Remark 6.32 The reason why atomic spectrum can be divided into two systems
(6.5.57) and (6.5.63) to be considered is that in the system (6.5.63) the electrons in each
Snl have the same energy, and in (6.5.57) the electons in each S n have the same energy if we
ignore the interaction energy between different l-orbital electrons of S nl . Hence, the system
of Snl is precise and the system of Sn is approximative.
3. Atomic spectrum equations. For simplicity, we only consider the system S n , and for
Snl the case is similar. Since the electrons in each Sn have the same energy λn , the wave
functions in (6.5.57) can take as
It is known that the EM fields Aaµ in atomic shells are independent of time t, i.e. ∂t Aaµ = 0.
Therefore, inserting (6.5.65) into (6.5.61) and (6.5.62) we derive the spectrum equation in
the form
where Φn = (ϕn1 , · · · , ϕnKn )T , Aaµn = (Aa0n , ⃗Aan ),⃗α = (α1 , α2 , α3 ) and α0 are as in (3.1.15),⃗γ =
(γ 1 , γ 2 , γ 3 ) is as in (6.2.8), V = ze/r is the Coulomb potential of the nuclear, ⃗A = (A1 , A2 , A3 )
is the magnetic potential of the nuclear, and
! "
ie ⃗ ie ⃗ an
DΦn = ∇ − A − A τan Φn ,
h̄c h̄c
⃗Abαn = ∂α ⃗Abn − ∇Abαn − e λ bn Acαn ⃗Adn .
h̄c cn dn
The equations (6.5.66)-(6.5.68) need to be complemented with some gauge fixing equa-
tions; see Remark 6.29.
Chapter 7
Astrophysics and Cosmology
The aim of this chapter is to study fundamental issues of astrophysics and cosmology based
on the first principles dictating the law of gravity, the cosmological principle, and the prin-
ciple of symmetry-breaking. The study and the results presented in this chapter are based
on recent papers (Ma and Wang, 2014e,a; Hernandez, Ma and Wang, 2015).
First, we have rigorously proved a basic Blackhole Theorem on the nature and structure
of black holes, Theorem 7.15:
Assume the validity of the Einstein theory of general relativity, then black holes
are closed, innate and incompressible.
This theorem was originally discovered and proved in (Ma and Wang, 2014a, Theorem
4.1). One important part of the theorem is that all black holes are closed: matters can neither
enter nor leave their interiors. Classical view was that nothing can get out of black holes,
but matters can fall into blackholes. We show that nothing can get inside the blackhole
either. This theorem offers a very different views on the geometric structure and the origin
of our Universe, on the formation and stability of stars and galaxies, and on the mechanism
of supernovae and active galactic nucleus (AGN) jets.
In particular, we rigorously show that our Universe is not originated from a Big-Bang,
and is static, under the assumption of the Einstein general relativity and the cosmological
principle. The redshift is then due mainly to the black hole effect, and the CMB is caused
by the blackbody equilibrium radiation.
Also, we show that the dark energy is associated with the the negative pressure, the
effect of the gravitational repelling force, and the dark matter is caused the space curved
energy.
This chapter is organized as follows. Section 7.1 introduces the momentum represen-
tation of astrophysical fluid dynamical models. One important aspect of the models is the
coupling between the gravitational field equations and the fluid dynamics equations, using
the principle of symmetry-breaking. Section 7.4 proves the black hole theorem.
Stellar circulations are studied in Section 7.2, leading to the mechanism of supernovae
explosion. Section 7.3 addresses galactic circulations, and provides the mechanism of AGN
jets.
412 Chapter 7 Astrophysics and Cosmology
Based on the black hole theorem, theorems on the structure and origin of the Universe
are proved in Section 7.5.
Finally, in Section 7.6, we have derived 1) the PID cosmological model of our Universe,
2) the nature of dark energy and dark matter, and 3) the gravitational force formulas.
Here we choose (7.1.10) instead of (7.1.11) to represent the viscous term in (7.1.1). The
reason is that the Laplace-Beltrami operator
gives rise to an additional term gi j R jk uk . In fluid dynamics, the term µ div · ∇u represents
the viscous (frictional) force, and the term gi j R jk uk is the force generated by space curvature
and gravitational interaction. Hence physically, it is more natural to take (7.1.10) instead of
(7.1.11).
Remark 7.2 In the fluid dynamic equations (7.1.9), the symmetry of general relativity
breaks, and the space and time are treated independently.
where (r, θ , ϕ ) is the spherical coordinate system, and u = u(r,t) and v = v(r,t) are functions
of r and t, which are determined by the gravitational field equations.
In the exterior of a ball, Schwarzschild first obtained an exact solution of the Einstein
field equations in 1916, which describes the gravitational fields for the external vacuum
state of a static spherically symmetric matter field.
Let m be the total mass of a centrally symmetric ball. Then the classical Newtonian
gravitational potential of the ball reads
mG
ϕ =− . (7.1.13)
r
Based on the Einstein general theory of relativity, the time-component g 00 of gravitational
potential gµν and the Newton potential ϕ have the following relation
2
g00 = − 1 + 2 ϕ (7.1.14)
c
Hence, by (7.1.13) and (7.1.14) for the ball we have
2mG
g00 = −1 + . (7.1.15)
c2 r
Now we consider the gravitational field equations in the exterior of the ball. In the
vacuum state,
Tµν = 0. (7.1.16)
On the other hand, by R = g µν Rµν , the Einstein gravitational field equations
1 8π G
Rµν − gµν R = − 4 Tµν
2 c
can be equivalently written as
8π G 1
Rµν = − (Tµν − gµν T ), T = gkl Tkl .
c4 2
Thus, by (7.1.16) the Einstein field equations become
Rµν = 0 (7.1.17)
The nonzero components of the metric (7.1.12) g µν are
g00 = −eu , g11 = ev , g22 = r2 , g33 = r2 sin2 θ . (7.1.18)
Since the gravitational resource is static, u and v only depend on r. By (7.1.18), all
nonzero components of the Levi-Civita connection (7.1.5) are given by
1 1
Γ100 = eu−v u0 , Γ111 = v0 , Γ122 = −re−v ,
2 2
1 1
Γ133 = −re−v sin2 θ , Γ010 = u0 , Γ212 = , (7.1.19)
2 r
1 cos θ
Γ233 = − sin θ cos θ , Γ313 = , Γ323 = .
r sin θ
7.1 Astrophysical Fluid Dynamics 415
Rµν = 0, ∀µ 6= ν .
Therefore, the vacuum Einstein field equations (7.1.17) become the following system of
ordinary differential equations
1 00 1 0 1 0
u + u − (v − u0)u0 = 0, (7.1.21)
2 r 4
1 00 1 0 1 0
u − v − (v − u0 )u0 = 0, (7.1.22)
2 r 4
r 0
(u − v0 ) − ev + 1 = 0. (7.1.23)
2
By the Bianchi identity, only two equations of (7.1.22)-(7.1.23) are independent. The
difference of (7.1.21) and (7.1.22) leads to
u0 + v0 = 0,
Namely
d
(re−v ) = 1.
dr
It follows that
b
e−v = 1 − ,
r
where b is a to-be-determined constant.
Then it follows from (7.1.24) that
b
eu = eβ e−v = eβ 1 − .
r
416 Chapter 7 Astrophysics and Cosmology
By scaling time t, we can take eβ = 1. Hence the solution of the Einstein field equations
(7.1.17) is given by
u b
g00 = −e = − 1 − ,
r
−1 (7.1.25)
b
g11 = ev = 1 − .
r
Comparing (7.1.25) with (7.1.15), we can obtain
2mG
b= .
c2
Thus, we get the solution of (7.1.17) as
2mG
g00 = − 1 − 2 ,
c r
2mG −1
g11 = 1 − 2 ,
c r
We have in particular
2mG
e−v = eu = 1 − .
c2 r
which is called the Schwarzschild solution or metric.
TOV metric
The Schwarzschild metric (7.1.26) describes the exterior gravitational fields of a cen-
trally symmetric ball. For the interior gravitational fields, the metric is given by the TOV
solution.
Let m be the mass of a centrally symmetric ball, and R be the radius of this ball. In the
interior of the ball, the variable r satisfies 0 6 r < R. Let the ball be a static liquid sphere
consisting of idealized fluid, an approximation of stars. The energy-momentum tensor of an
idealized fluid is in the form
where p is the pressure, ρ is the density, and u µ is the 4-velocity. For a static fluid, u µ is
given by
1
uµ = √ (1, 0, 0, 0).
−g00
7.1 Astrophysical Fluid Dynamics 417
By the Bianchi identity, only three equations of (7.1.28)-(7.1.31) are independent. Here
we also regard p and ρ as unknown functions. Therefore, for the four unknown functions
u, v, p, ρ , we have to add an equation of state to the system of (7.1.28)-(7.1.31):
ρ = f (p), (7.1.32)
c2 r
M(r) = (1 − e−v). (7.1.34)
2G
Then the equation (7.1.28) can be rewritten as
1 dM
= 4πρ ,
r2 dr
whose solution is given by
Z r
M(r) = 4π r2 ρ dr for 0 < r < R. (7.1.35)
0
By (7.1.35), we see that M(r) is the mass, contained in the ball B r . It follows from (7.1.34)
that
2GM(r)
e−v = 1 − . (7.1.36)
c2 r
Inserting (7.1.36) in (7.1.29) we obtain
1 8π G 3
u0 = pr + 2GM(r) . (7.1.37)
r(c2 r − 2MG) c2
Putting (7.1.37) into (7.1.31) we get
0 p + c2 ρ 8π G 3
p =− pr + 2GM(r) . (7.1.38)
2r(c2 r − 2MG) c2
Thus, it suffices for us to derive the solution p, M and ρ from (7.1.32)-(7.1.34) and
(7.1.38), and then v and u will follow from (7.1.36)-(7.1.37) and (7.1.33).
The equation (7.1.38) is called the TOV equation, which was derived to describe the
structure of neutron stars.
We note that (7.1.36) is the interior metric of a blackhole provided that 2GM(r)/
2
(c r) = 1. Thus the TOV solution (7.1.36) gives a rigorous proof of the following theo-
rem for the existence of black holes.
7.1 Astrophysical Fluid Dynamics 419
Theorem 7.3 If the matter field in a ball BR of radius R is spherically symmetric, and
the mass MR and the radius R satisfy
2GMR
= 1,
c2 R
then the ball must be a blackhole.
ρ = ρ0 a constant.
The functions (7.1.39)-(7.1.41) are the TOV solution. By (7.1.12), the solution (7.1.40) and
(7.1.41) yields the metric
" 1/2 #2
2 3 2Gm 1/2 1 2Gmr2
ds = − 1− 2 − 1− 2 3 c2 dt 2 (7.1.42)
2 c R 2 c R
−1
2Gmr2
+ 1− 2 3 dr2 + r2 (d θ 2 + sin2 θ d ϕ 2 ),
c R
this subsection we discuss the basic differential operators (7.1.2)-(7.1.8) under the spherical
coordinate systems (θ , ϕ , r).
For a 3D sphere M, the Riemannian metric is given by
In (7.1.43) we have
1 1
Γ221 = Γ212 = , Γ233 = − sin θ cos θ , Γ331 = Γ313 = ,
r r
3 3 cos θ 1 r r
Γ32 = Γ32 = , Γ22 = − , Γ33 = − sin2 θ ,
1 (7.1.45)
sin θ α α
1 1 dα k
Γ11 = , Γi j = 0 for others.
2α dr
We deduce from (7.1.45) the explicit form of the Ricci curvature tensor (7.1.4):
1 dα 1 r dα
R11 = − , R22 = − 2 − 1,
α r dr α 2α dr (7.1.46)
R33 = R22 sin2 θ , Ri j = 0 6 j.
∀i =
Based on (7.1.45) and (7.1.46) we can obtain the expressions of the differential operators
(7.1.2)-(7.1.8) as follows:
1 1 ∂ ∂ uϕ 1 ∂ 2 uϕ 1 ∂ 2 ∂ uϕ
∆uϕ = sin θ + + r (7.1.48)
r2 sin θ ∂ θ ∂θ sin2 θ ∂ ϕ 2 α ∂r ∂r
2 cos θ ∂ uϕ 2 cos θ ∂ uθ 2 ∂ ur
+ − 2uϕ + +
sin θ ∂ θ sin3 θ ∂ ϕ r sin2 θ ∂ ϕ
1 ∂ α0 ∂ 2
+ 2 2 (ruϕ ) − (r uϕ ) ,
αr ∂r 2α ∂ r
1 1 ∂ ∂ ur 1 ∂ 2 ur 1 ∂ 2 ∂ ur
∆ur = 2 sin θ + 2 + r (7.1.49)
r sin θ ∂ θ ∂θ sin θ ∂ ϕ 2 α ∂ r ∂r
2 cos θ ∂ uθ ∂ uϕ r ∂ 2 α 0
− 2 ur + r uθ + r +r − ur .
αr sin θ ∂θ ∂ϕ 2 ∂ r 2α
∂ uθ ∂ uθ ∂ uθ 2
uk Dk uθ = u r + uθ + uϕ + uθ ur − sin θ cos θ uϕ2 (7.1.50)
∂r ∂θ ∂ϕ r
∂ uϕ ∂ uϕ ∂ uϕ 2 cos θ 2
uk Dk uϕ = u r + uθ + uϕ + uθ uϕ + uϕ ur , (7.1.51)
∂r ∂θ ∂ϕ sin θ r
∂ ur ∂ ur ∂ ur r 2 α0
uk Dk ur = u r + uθ + uϕ − (uθ + sin2 θ uϕ2 − u2r ). (7.1.52)
∂r ∂θ ∂ϕ α 2r
√ √
4) By (7.1.8) and g = r2 sin θ α , the divergent operator divu is
1 ∂ ∂ uϕ 1 ∂ 2√
divu = (sin θ uθ ) + + 2√ (r α ur ). (7.1.54)
sin θ ∂ θ ∂ϕ r α ∂r
Remark 7.4 The expressions (7.1.47)-(7.1.54) are the differential operators appearing
in the fluid dynamic equations describing the stellar fluids. However, we need to note that
the two components uθ and uϕ are the angular velocities of θ and ϕ , i.e.
dθ dϕ
uθ = , uϕ = .
dt dt
In classical fluid dynamics, the velocity field v = (vθ , vϕ , vr ) is the line velocity. The relation
of u and v is given by
1 1
u θ = vθ , uϕ = vϕ , ur = v r . (7.1.55)
r r sin θ
422 Chapter 7 Astrophysics and Cosmology
Hence, inserting (7.1.55) into (7.1.1) with the expressions (7.1.47)-(7.1.54), we derive
the Navier-Stokes equations in the usual spherical coordinate form as follows
∂ vθ 1 ∂p
+ (u · ∇)vθ = ν ∆vθ − + fθ ,
∂t ρr ∂ θ
∂ vϕ 1 ∂p
+ (u · ∇)vϕ = ν ∆vϕ − + fϕ ,
∂t ρ r sin θ ∂ ϕ (7.1.56)
∂ vr 1 ∂p
+ (u · ∇)vr = ν ∆vr − + fr ,
∂t ρα ∂ r
div v = 0,
where
2 ∂ vr 2 cos θ ∂ vϕ vθ 1 dα ∂
∆vθ = ∆v
e θ+ − − 2 2 − 2 (rvθ ),
r2 ∂ θ r2 sin2 θ ∂ ϕ r sin θ 2 α r dr ∂ r
e ϕ + 2 ∂ vr + 2 cos θ ∂ vθ − vϕ
∆vϕ = ∆v −
1 dα ∂
(rvϕ ),
r2 sin θ ∂ ϕ r2 sin2 θ ∂ ϕ r2 sin2 θ 2α 2 r dr ∂ r
e r − 2 vr + ∂ vθ + cos θ vθ + 1 ∂ vϕ + 1 ∂
∆vr = ∆v
1 dα
v r ,
α r2 ∂θ sin θ sin θ ∂ ϕ 2α ∂ r α dr
(7.1.57)
∆
e is the Laplace operator for scalar fields given by
1 ∂ ∂T 1 ∂ 2T 1 ∂ 2 ∂T α0 ∂ T
∆T = 2
e sin θ + 2 2 + r − , (7.1.58)
r sin θ ∂ θ ∂θ r sin θ ∂ ϕ 2 α r2 ∂ r ∂r 2rα 2 ∂ r
vθ ∂ v θ vϕ ∂ v θ ∂ vθ vθ vr cos θ vϕ2
(v · ∇)vθ = + + vr + − ,
r ∂θ r sin θ ∂ ϕ ∂r r r sin θ
vθ ∂ v ϕ vϕ ∂ v ϕ ∂ vϕ vϕ vr cos θ vϕ vθ
(v · ∇)vϕ = + + vr + + , (7.1.59)
r ∂θ r sin θ ∂ ϕ ∂r r r sin θ
vθ ∂ v r vϕ ∂ v r ∂ vr 1 1 dα 2
(v · ∇)vr = + + vr − (v2 + v2ϕ ) + v ,
r ∂ θ r sin θ ∂ ϕ ∂ r αr θ 2α dr r
and the divergent term div v reads
√
1 ∂ (sin θ vθ ) 1 ∂ vϕ 1 ∂ (r2 α vr )
div v = + + 2√ . (7.1.60)
r sin θ ∂θ r sin θ ∂ ϕ r α ∂r
7.1.4 Momentum representation
The Universe, galaxies and galactic clusters are composed of stars and interstellar nebulae.
Their velocity fields are not continuous. Hence it is not appropriate that we model cosmic
objects using continuous velocity field v(x,t) as in the Navier-Stoks equations or by discrete
position variables xk (t) as in the N-body problem.
7.1 Astrophysical Fluid Dynamics 423
The idea is that we use the momentum density field P(x,t) to replace the velocity field
v(x,t) as the state function of cosmic objects. The main reason is that the momentum density
field P is the energy flux containing the mass, the heat, and all interaction energy flux, and
can be regarded as a continuous field. The aim of this section is to establish the momentum
form of astrophysical fluid dynamics model.
The physical laws governing the dynamics of cosmic objects are as follows
The mathematical expressions of these laws are given respectively in the following:
1. Gravitational field equations.
1 8π G 1 e
Rµν − gµν R = − 4 Tµν + (D µ Φν + D
e ν Φµ ),
2 c 2 (7.1.62)
e µ = Dµ + e Aµ ,
D
h̄c
where Aµ is the electromagnetic potential, the time components of g µν are as
2
g00 = − 1 + 2 ψ , g0k = gk0 = 0 for 1 6 k 6 3,
c
and ψ is the gravitational potential.
2. Fluid dynamic equations. The Newton’s Second Law can be expressed as
dP
= Force, (7.1.63)
dτ
where τ is the proper time given by
√
dτ = −g00 dt, (7.1.64)
dP ∂ P ∂ P dxk ∂P 1
= + k = + (P · ∇)P,
dτ ∂ τ ∂ x dτ ∂τ ρ
424 Chapter 7 Astrophysics and Cosmology
and
ν ∆P + µ ∇(divP) the frictional force,
− ∇p the pressure gradient,
c2
ρ (1 − β T)∇g00 = −ρ (1 − β T)∇ψ the gravitational force.
2
Hence, the momentum form of the fluid dynamic equations (7.1.63) is written as
∂P 1
+ (P · ∇)P = ν ∆P + µ ∇(divP) − ∇p − ρ (1 − β T)∇ψ , (7.1.65)
∂τ ρ
where the differential operators ∆, ∇ and (P · ∇) are with respect to the space metric g i j (1 6
i, j 6 3) determined by (7.1.62), as defined in (7.1.2)-(7.1.8).
3. Heat conduction equation:
∂T 1
+ (P · ∇)T = κ ∆T
e + Q, (7.1.66)
∂τ ρ
where ∆
e is defined as
e = − √1 ∂
∆T
√ ij ∂T
gg ,
g ∂ xi ∂xj
and g = det(gi j ), 1 6 i, j 6 3.
4. Energy-momentum conservation:
∂ρ
+ divP = 0, (7.1.67)
∂τ
where ρ is the energy density:
ρ = mass + electromagnetism + potential + heat.
5. Equation of state:
p = f (ρ , T ). (7.1.68)
Remark 7.5 Both physical laws (7.1.63) and (7.1.67) are the more general form than
the classical ones:
dv
m = Force the Newton’s Second Law,
dτ (7.1.69)
∂m
+ div(mv) = 0 the continuity equation,
∂τ
where m is the mass density. Hence the momentum representation equations (7.1.65)-
(7.1.67) can be applicable in general. The momentum P represents the energy density flux,
consisting essentially of
P = mv + radiation flux+heat flux.
Hence in astrophysics, the momentum density P is a better candidate than the velocity field
v, to serve as the continuous-media type of state function.
7.1 Astrophysical Fluid Dynamics 425
M = {x ∈ R3 | r0 < r < r1 }.
The stellar atmosphere consists of rarefied gas. For example, the solar corona has mass
density about ρm = 10−9 ρ0 where ρ0 is the density of the earth atmosphere. Hence we use
the Schwarzschild solution in (7.1.44) as the metric:
2mG −1 2mG
α (r) = 1 − 2 , r0 > . (7.1.70)
c r c2
where m is the total mass of the star, and the condition r0 > 2mG/c2 ensures that the star is
not a black hole.
The stellar atmospheric model is the momentum form of the astrophysical fluid dynam-
ical equations defined on the spherical shell M :
∂P 1 mGρ
+ (P · ∇)P = ν ∆P + µ ∇(divP) − ∇p − 2 (1 − β T )~k,
∂τ ρ r
∂T 1
+ (P · ∇)T = κ ∆T,
e (7.1.71)
∂τ ρ
∂ρ
+ divP = 0,
∂τ
where P = (Pr , Pθ , Pϕ ) is the momentum density field, T is the temperature, p is the pressure,
ρ is the energy density, ν and µ is the viscosity coefficient, β is the coefficient of thermal
expansion, κ is the thermal diffusivity, α is as in (7.1.70), ∆P, (P · ∇)P, ∆T,
e div P are as in
(7.1.57)-(7.1.60), and
Pθ ∂ T Pϕ ∂ T ∂T
(P · ∇)T = + + Pr . (7.1.72)
r ∂ θ r sin θ ∂ ϕ ∂r
The equations (7.1.71) are supplemented with the boundary conditions:
∂ Pθ ∂ Pϕ
Pr = 0, = 0, =0 at r = r0 , r1 ,
∂r ∂r
T = T0 at r = r0 , (7.1.73)
T = T1 at r = r1 ,
426 Chapter 7 Astrophysics and Cosmology
where T0 and T1 are approximatively taken as constants and satisfy the physical condition
T0 > T1 .
Remark 7.6 First, there are three important parameters: the Rayleigh number Re, the
Prandtl number Pr and the δ -factor δ , which play an important role in astrophysical fluid
dynamics:
mGr0 r1 β T0 − T1 ν 2mG
Re = , Pr = , δ= 2 . (7.1.74)
κν h κ c r0
The δ -factor δ reflects the relativistic effect contained the Laplacian operator.
Remark 7.7 Astronomic observations show that the Sun has three layers of atmo-
spheres: the photosphere, the chromosphere, and the solar corona, where the solar atmo-
spheric convections occur. It manifests that the thermal convection is a universal phe-
nomenon for stellar atmospheres. In the classical fluid dynamics, the Rayleigh number
dictates the Rayleign-Bénard convection. Here, however, both the Rayleigh number Re
and the δ -factor defined by (7.1.74) play an important role in stellar atmospheric convec-
tions.
Remark 7.8 For rotating stars with angular velocity ~Ω, we need add to the right hand
side of (7.1.71) the Coriolis term:
where h i
Tµν = gµα gνβ ε α ε β + pgαβ ,
and ε µ is the 4D energy-momentum vector.
For the fluid component of the system, it is necessary to simplify the model by making
some physically sound approximations.
Approximation Hypothesis 7.9 The metric (7.1.75) and the stationary solutions of
the fluid dynamical equations are radially symmetric.
Under Hypothesis 7.9, the metric (7.1.75) is as in (7.1.12), or is written in the following
form
ds2 = −ψ (r)c2 dt 2 + α (r)dr2 + r2 d θ 2 + r2 sin2 θ d ϕ 2 , (7.1.77)
and the fluid dynamic equations are rewritten as
∂P 1 c2 ρ d ψ
+ (P · ∇)P = ν ∆P + µ ∇(divP) − ∇p − (1 − β (T − T0 ))~k,
∂τ ρ 2α dr
∂T 1 (7.1.78)
+ (P · ∇)T = κ ∆T
e + Q(r),
∂τ ρ
∂ρ
+ divP = 0,
∂τ
where P = (Pr , Pθ , Pϕ ), ∇P, ∆T,
e (P·∇)P, div P are as in (7.1.57)-(7.1.60), ∇p is as in (7.1.56),
~k = (1, 0, 0), and
∂ T Pθ ∂ T Pϕ ∂ T
(P · ∇)T = Pr + + .
∂r r ∂ θ r sin θ ∂ ϕ
The gravitational field equation (7.1.76) for the metric (7.1.77) is radially symmetric,
therefore
Φν = Dν φ , φ = φ (r).
Thus we have
1 1 00 1
D0 D0 φ = ψ 0φ 0, D1 D1 φ = φ − 2 α 0φ 0,
2αψ α 2α
1 0
D2 D2 φ = D 3 D3 φ = φ, D µ Dν φ = 0 for µ 6= ν .
rα
Then, in view of (7.1.28)-(7.1.31), the equation (7.1.76) can be expressed by
1 1 α0 1 8π G 1
− − 2 = − 2 ρ0 + ψ 0φ 0,
α r2 rα r c 2αψ
1 1 ψ0 1 8π G 1 1
+ − 2 = 2 p + φ 00 − 2 α 0 φ 0 , (7.1.79)
α r2 rψ r c α 2α
" #
1 ψ 00 1 ψ 0 2 α 0 ψ 0 1 ψ 0 α 0 16π G 2
− − + − = p + φ 0,
α ψ 2 ψ 2αψ r ψ α c2 rα
428 Chapter 7 Astrophysics and Cosmology
where the pressure p satisfies the stationary equations of (7.1.78) with P = 0 as follows
c2 0
p0 = − ψ ρ [1 − β (T − T0 )] ,
2
(7.1.80)
κ d 2 dT
r = −Q(r).
α r2 dr dr
The functions ψ and α satisfy the boundary conditions (7.1.33), i.e.
2Gm 2Gm −1
ψ (r0 ) = 1 − 2 , α (r0 ) = 1 − 2 . (7.1.81)
c r0 c r0
In addition, for the ordinary differential equations (7.1.79)-(7.1.81), we also need the bound-
1
ary conditions for ψ 0 , φ 0 and T . Since − c2 ψ 0 represents the gravitational force, the condi-
2
tion of ψ 0 at r = r0 is given by
2mG
ψ 0 (r0 ) = 2 2 . (7.1.82)
c r0
Based on the Newton gravitational law, φ 0 is very small in the external sphere; also see (Ma
and Wang, 2014e). Hence we can approximatively take that
φ 0 (r0 ) = 0. (7.1.83)
Finally, it is rational to take the temperature gradient in the boundary condition as follows
∂T
(r0 ) = −A (A > 0). (7.1.84)
∂r
Let the stationary solution of the problem (7.1.79)-(7.1.81) be given by pe, Te, ψ , α , φ 0 .
Make the translation transformation
P → P, p → p + pe, T → T + Te.
7.2 Stars
7.2.1 Basic knowledge
The large scale structure of the Universe consists of mainly the following levers:
Star is the most elementary constituent of the Universe, and we now explore some of their
basic properties.
1. Mass m. The mass of the Sun is m = 2 × 1030kg, and the range of the masses of the
main-sequence stars is about
0.1m ∼ 40m .
A few extreme stars have masses m ' 60m , and the least massive stars have masses around
m ' 0.07m .
2. Radius R. The radius of the Sun is R = 7 × 105 km, and the radii of the main-
sequence stars are
0.3R ∼ 25R .
A neutron star has radius R ' 10km, and a red giant star has R = 10 3R .
3. Surface temperature T . The surface temperature of the Sun is T = 5800◦K, and the
range of surface temperatures of stars in general is
2600◦K ∼ 35000◦K.
5. Parameter relation. Based on the radiation theory of black bodies, the above three
parameters R, T, L enjoy
L = 4πσ R2 T 4 , (7.2.1)
Figure 7.1 The HR diagram in which the luminosity L and the surface temperature T of stars are
plotted.
Each star is plotted as a point in the HR diagram in Figure 7.1. The stars in the band
are the main-sequence stars. The red giants are in the right-top region, and have lower
temperatures and higher brightness, and the white dwarfs are in the left-bottom region, and
have higher temperatures and low brightness.
The main-sequence stars are young and middle aged stars, the red giants are old aged,
and the white dwarfs are dead stars.
7. Variable stars. Most stars shine with almost constant brightness, but a small number
of stars, called variable stars, change periodically their brightness. Since the luminosity
of a star depends on its radius (see (7.2.1)), variable stars are rhythmically expanding and
contracting, pulsating in size and brightness.
7.2 Stars 431
8. Red giants. Stars are held together by gravity. The gravitational force pulling inward
is opposed by a force pushing outward, consisting mainly the thermal pressure in the inte-
rior. Stars maintain their balance of pressure and gravity by the heat energy produced by
burning hydrogen.
Stars less massive than the Sun evolve more slowly and stay on the main-sequence
longer than 1010 years. But more massive stars evolve more quickly, and end their life
becoming white dwarfs and neutron stars (or possibly black holes) as their final fate deter-
mined by their masses.
After a more massive star consumes its central supply of hydrogen, it leaves the main
sequence and enters into the group of red giants in the HR diagram.
When the central nuclear reaction has ceased to generate heat, the interior pressure re-
duces and the star begins to contract, leading to the release of gravitational energy. Hence,
this contraction causes the temperature to rise. Thus, the hydrogen outside the core begins
to burn. The burning shell causes the star to expend. Thus, the star is luminous, large and
cool, and becomes a red giant.
9. White dwarfs. For a star with mass in the range
1m 6 m < 5m , (7.2.3)
when it is in the red-giant phase, its expanding shell is thrown off into space, and the naked
core is all that remains. Contraction ceases, nuclear burning ends, and the core cools down
as a white dwarf. A white dwarf has a size approximatively equal to that of the earth, with
mass m 6 1.44M .
10. Neutron stars and pulsars. A star with mass
5m < m, (7.2.4)
will eventually evolve to a neutron star. A neutron star has a radius of about 10km with mass
m > 3m , and has a high density of nearly 109 ton/cm3 . The neutron star has a magnetic
field of 1012 gauss that is 1012 times stronger than the earth’s magnetic field.
The pulsar is a special neutron star, which emits a pulse-like energy message.
11. Supernovae. When a large massive red giant (m > 5m ) exhausts its nuclear fuels,
it begins to collapse. This contraction will lead to explosion, and the explosive star is called
a supernova. A neutron star is born after a huge explosion of a supernova.
main-sequence stars, white dwarfs and neutron stars are stable stars. The radii of unstable
stars may change; variable stars and expanding red giants are unstable stars. The dynamic
equations governing the two types of stars are different, and will be addressed hereafter
separately.
We note that the fluid dynamic equations (7.1.78) represent the Newton’s second law,
and their left-hand sides are the acceleration and their right-hand sides are the total force.
The total force consists of four parts: the viscous friction, the pressure gradient, the rela-
tivistic effect, and the thermal expansion force, which are given as follows:
• The viscous friction force is caused by the electromagnetic, the weak and the strong
interactions between the particles and the pressure, and is given by
as defined by (7.1.57).
• The thermal expansion force is due to the coupling between the gravity ∇ψ and the
heat Q:
c2 d ψ
FT = 0, 0, βT , (7.2.8)
2α dr
which is the main driving force for generating stellar interior circulations and nebular
matter spurts of red giants.
The two forces (7.2.7) and (7.2.8) are the main driving forces for the stellar motion, and
hereafter we derive their explicit formulas.
1. Formula for thermal expansion force. The thermal expansion force (7.2.8) is radially
symmetric, which is simply written in the r-component form
c2 d ψ
fT = β T.
2α dr
7.2 Stars 433
fT = σ (r)T, (7.2.9)
which has
2νδ 2
fr ∼ Pr → ∞ as δ → 1 (for Pr > 0). (7.2.15)
1−δ
The property (7.2.15) will lead to a huge supernovae explosions as they collapse to the radii
r0 → 2mG/c2 . It is the explosive force (7.2.15) that prevents the formation of black holes;
see Sections 7.2.6 and 7.3.3.
3. Derivation of formula (7.2.14). To deduce (7.2.14) we have to derive the gravitational
potential α . The first equation of (7.1.79) can be rewritten as
dM 2 c2 r 2 ψ 0 φ 0 c2 r 1
= 4π r ρ 0 − , M= 1− ,
dr 4G αψ 2G α
It gives the solution as
Z r 2 0 0 −1
4 c2 r ψφ 2MG
M = π r 3 ρ0 − dr, α = 1− 2 .
3 4G 0 αψ c r
4
By ρ0 = m/ π r03 , and in the nondimensional form (r → r0 r), we get
3
α = (1 − δ r2 − η )−1 for 0 6 r 6 1, (7.2.16)
Thus, the σ -factor (7.2.11) follows from (7.2.16), (7.2.18) and (7.2.20).
5. Thermal Force and (7.2.21). The thermal expansion force acting on the stellar shell
(i.e. at r = r0 ) can be deduced from (7.2.9) and (7.2.11) in the following (nondimensional)
form
c2 r03 β (1 − δ )δ
fT = σ0 T, σ0 = A (A > 0). (7.2.22)
2κ 2
By 0 < δ < 1, we have
The positiveness of σ (r) in (7.2.23) shows that the thermal force f T of (7.2.22) is an
outward expanding force. It is this power that causes the swell and the nebular matter spurt
of a red giant. We also remark that the temperature gradient A on the boundary is maintained
by the heat source Q.
First we briefly recall the dynamic transition theory developed by the authors in (Ma and
Wang, 2013b) and the references therein. Many dissipative systems, both finite and infinite
dimensional, can be written in an abstract operator equation form as follows
du
= Lλ u + G(u, λ ), (7.2.24)
dt
where Lλ is a linear operator, G is a nonlinear operator, and λ is the control parameter.
436 Chapter 7 Astrophysics and Cosmology
Lλ ϕ = β (λ )ϕ .
The following are the main conclusions for the dynamic transition theory; see (Ma and
Wang, 2013b) for details:
• Dynamic transitions of (7.2.24) take place at (u, λ ) = (0, λ1 ) provided that λ1 satisfies
the following principle of exchange of stability (PES):
<0 for λ < λ1 (or λ > λ1 ),
Reβ1 = 0 for λ = λ1 ,
(7.2.25)
>0 for λ > λ1 (or λ < λ1 ),
Reβk (λ1 ) < 0, ∀k > 2.
• Let uλ be the first transition state. Then we can also use the same stratege outlined
above to study the second transition by considering PES for the following linearized
eigenvalue problem
Lλ ϕ + DG(uλ , λ )ϕ = β (2) (λ )ϕ .
The governing fluid component equations are (7.1.85). We first make the nondimen-
sional. Let
where P = (Pθ , Pϕ , Pr ),~k = (0, 0, 1), σ (r) and FG P are as in (7.2.11) and (7.2.14), Pr = ν /κ
is the Prandtl number, and the ∆ is given by
2 ∂ Pr 2 cos θ ∂ Pϕ Pθ
∆Pθ = ∆P
e θ+ − 2 2 − 2 2 ,
r2 ∂ θ r sin θ ∂ ϕ r sin θ
e ϕ + 2 ∂ Pr + 2 cos θ ∂ Pθ − Pϕ ,
∆Pϕ = ∆P (7.2.27)
r2 sin θ ∂ ϕ r2 sin θ ∂ ϕ r2 sin2 θ
e r − 2 Pr + ∂ Pθ + cos θ Pθ + 1 ∂ Pϕ ,
∆Pr = ∆P
α r2 ∂θ sin θ sin θ ∂ ϕ
Based on the dynamic transition theory introduced early in this section, the stellar cir-
culation depends on the following three forces:
1
Pr∆P, FG P, σ T~k, (7.2.28)
κ
where in general Pr∆P prevents/slows-down the circulation.
By (7.2.14) we see that FG P depends on the δ -factor. The Sun’s δ -factor is δ =
10 /2, and in general the δ -factors for stars are of the order:
−5
Hence it is clear that all stars, except black holes, have small δ -factors.
On the other hand, for a small δ -factor, it follows from equations (7.1.79) and (7.1.80)
that α , ψ , φ has the order
α ∼ 1 + δ + η, η , η 0 ∼ δ 2, α0 ∼ δ ,
ψ ∼ rδ , ψ 0 /ψ ∼ δ , φ 0 , φ 00 ∼ δ .
FG P ∼ δ P for δ 1.
438 Chapter 7 Astrophysics and Cosmology
Thus, in view of (7.2.29) we conclude that the relativistic effect FG P is negligible on the
stellar interior motion for all stars except supernovae.
Hence the main driving force for stellar circulations is the thermal expansion force char-
acterized by the σ -factor σ0 in (7.2.22). Due to δ 1, σ0 can be approximately given
by
r2 mGβ
σ0 = 0 2 A, (7.2.30)
κ
which plays the similar role as the Rayleigh number Re in the earth’s atmospheric circu-
lation. The value σ0 of (7.2.30) is large enough to generate thermal convections for main-
sequence stars and red giants.
We remark that the heat source Q is caused not only by nuclear reactions, but also by the
pressure gradient, the density and the gravitational potential energy. Based on the σ -factor
in (7.2.30), we obtain the following physical conclusions:
− Pr∆P + ∇p = λ1 P for x ∈ B1 ⊂ R3 ,
∂ Pθ ∂ Pϕ
Pr , , =0 at r = 1,
∂r ∂r
where ∆P is as in (7.2.27).
For the main-sequence stars, the σ -factors are much larger than the first eigenvalue λ 1
of (7.2.32). For example, the Sun consists of hydrogen, and
With thermal expansion coefficient β in the order β ∼ 10−4/K, the σ -factor of (7.2.30) for
the Sun is about
σ ∼ 1030A [m/K]. (7.2.32)
7.2 Stars 439
Due to nuclear fusion, stars have a constant heat supply, which leads to a higher bound-
ary temperature gradient A. Referring to (7.2.32), we conclude that there are always interior
circulations and thermal motion in main-sequence stars and red giants, which has large σ -
factors.
2. Red giants. The nuclear reaction of a red giant stops in its core, but does take place
in the shell layer, which maintains a larger temperature gradient A on the boundary than
the main-sequence phase. Therefore, the greater σ -factor makes the star to expand, and the
increasing radius r0 raises the σ -factor (7.2.30). The increasingly larger σ -factor provides
a huge power to hurl large quantities of gases into space at very high speed.
3. Neutron stars and pulsars. Neutron stars are different from other stars, which have
bigger δ -factors, higher rotation Ω and lower σ -factor (as the nuclear reaction stops). In-
stead of (7.2.26) the dynamic equations governing neutron stars are
∂P 1 1
+ (P · ∇)P = Pr∆P + FG P − 2~Ω × P − ∇p + σ T~k,
∂τ ρ κ
∂T 1 (7.2.33)
+ (P · ∇)T = ∆T
e + Pr ,
∂τ ρ
divP = 0.
As the nuclear reaction ceases, the temperature gradient A tends to zero as time t → ∞,
and consequently
σ → 0 as t → ∞. (7.2.34)
Based on the dynamic transition theory briefly recalled earlier in this section, we derive
from (7.2.33) and (7.2.34) the following assertions:
• Due to the high rotation Ω, the convection of (7.2.33) for the early neutron star is time
periodic, and its period T is inversely proportional to Ω, and its convection intensity
√
B is proportional to σ − σc , i.e.
C1
period T ' ,
Ω (7.2.35)
√
intensity B ' C2 σ − σc ,
where σc is the critical value of the transition, and c1 , c2 are constants. The properties
of (7.2.35) explain that the early neutron stars are pulsars, and by (7.2.34) their pulsing
√ √
radiation intensities decay at the rate of σ − σc or kA − σc (k is a constant).
440 Chapter 7 Astrophysics and Cosmology
The model describing stellar atmosphere circulation without rotation is given by (7.1.71)-
(7.1.73), and the eigenvalue equations read as
e θ + 2 ∂ Pr − 2 cos θ ∂ Pϕ − Pθ − c0 Pθ
Pr ∆P
r2 ∂ θ r2 sin2 θ ∂ ϕ r2 sin θ r2 ν
δ ∂ 1 ∂ϕ
+ 2 (rPθ ) − = β Pθ ,
2r ∂ r r ∂θ
e ϕ + 2 ∂ Pr + 2 cos θ ∂ Pθ − Pϕ − c0 Pϕ
Pr ∆P
r2 sin θ ∂ ϕ r2 sin2 θ ∂ ϕ r2 sin2 θ r2 ν
δ ∂ 1 ∂p
+ 2 (rPϕ ) − = β Pϕ ,
2r ∂ r r sin θ ∂ ϕ (7.2.36)
e r − 2 (1 − δ )(Pr + ∂ Pθ + cos θ Pθ + 1 ∂ Pϕ ) − c1 Pr
Pr ∆P
r2 ∂θ sin θ sin θ ∂ ϕ r2 v
δ2 1 δ ∂ Pr 1 √ ∂p
+ Pr − + ReT − (1 − δ ) = β Pr ,
2(1 − δ ) r2 2r ∂ r r2 ∂r
√
e + 1 RePr = β T,
∆T
r 2
divP = 0,
∂ Pθ ∂ Pϕ
T = 0, Pr = 0, = = 0 at r = r0 , r0 + 1, (7.2.37)
∂r ∂r
where
1 ∂ ∂f 1 ∂2 f 1−δ ∂ ∂f
∆
ef = sin θ + + 2 r2 ,
r2 sin θ ∂ θ ∂θ r2 sin θ ∂ ϕ 2
2 r ∂r ∂r
As the δ -factor of (7.1.74) is smaller than one, then by (7.2.38), β 1 satisfies the inequality
√
β1 6 −k1 + k2 Re, (7.2.39)
By (7.2.39) we get
β1 → +∞ as m → ∞, or as r0 → ∞.
In addition, it is known that
(k)
β1 → +∞ (k > 2) as β1 → +∞.
Hence based on the dynamic transition theory, we derive the following physical conclu-
sions.
Physical Conclusion 7.10 For the stars with small δ 1, their atmosphere have
thermal convections to occur. In particular for the stars with large mass and radius, the
convections are in turbulent state.
δ2 1
F= Pr (7.2.40)
2(1 − δ ) r2
represents the radially expanding force which is the relativistic gravitational effect. The
force (7.2.40) satisfies
F → +∞ as δ → 1 for Pr > 0.
It implies that the force F in (7.2.40) is explosive as δ → 1. In addition, by (7.2.38) the first
eigenvalue β1 satisfies the following inequality
√ k3
β1 6 −k1 + k2 Re + for δ → 1.
1−δ
Thus we can deduce the following conclusions.
442 Chapter 7 Astrophysics and Cosmology
Physical Conclusion 7.11 Due to the explosive force F of (7.2.40), the stars with
δ w 1(δ = 1 is the black hole) has no the stellar atmospheres, which are erupted into the
outer space. However, as 1 − δ > 0 is small, the stellar atmospheres are in the turbulent
convection state,caused by the relatistic gravitational effect.
4. Let lτ and lr be the convection scales in the horizontal direction and radial direction.
The ratio
lτ
r=
lr
depends on the coefficient ratio of the horizontal components of the momentum (Pθ , Pϕ )
and the radial momentum component Pr in (7.2.36), i.e. the ratio
δ − 1 − c0/ν
η= . (7.2.41)
−2(1 − δ ) − c1/ν + δ 2 /2(1 − δ )
Physical Conclusion 7.12 The convection scale ratio γ depends on δ for δ 1 and
δ → 1, and on the ratio c0 /c1 for δ < 1 and v 1. Moreover, γ possesses the following
properties
o(1) for δ 1 and v large,
γ= (7.2.42)
1 for δ → 1.
1
The sun atmosphere convections satisfy the relation (7.2.42). For the Sun, δ w ×10−5 .
2
The observations show that
.
the photosphere convection r = 1 ∼ 2,
.
the chromosphere convection r = 2 ∼ 3.
5. For the stars with rotation ~Ω, the corresponding eigenvalue equations with the Cori-
olis force are " √ #
f Re~
Pr ∆P − 2 + 2 kT − 2~Ω × P = β P,
r r
√ (7.2.43)
Re
∆T
f+ Pr = β T,
r2
div P = 0,
with the boundary conditions (7.2.37).
In (Ma and Wang, 2013b) we showed that there is a lower bound Ω 0 such that as Ω > Ω0
the first eigenvalue β1 of (7.2.43) is complex in the critical state (7.2.25). Therefore, for the
stars with the bigger rotation their atmosphere convections in the first phase transition are
time periodic.
where ψ = ψ (r,t), α = α (r,t), and R(t) is the scalar factor representing the star radius. For
convenience, we denote
The stars with variable radii are essentially in radial motion. Hence, the horizontal
momentum (Pθ , Pϕ ) is assumed to be zero:
(Pθ , Pϕ ) = 0. (7.2.45)
In the following we develop dynamic models for astronomical objects with variable sizes.
1. Gravitational field equations. We recall the gravitational field equations (Ma and
Wang, 2014e):
8π G 1 1
Rµν = − 4 (Tµν − gµν T ) − (Dµν φ − gµν Φ). (7.2.46)
c 2 2
The nonzero components of the metric (7.2.44) are
where ρ is the energy density, Pr is the radial component of the momentum density. Then
direct computations imply that
1 1
T = gµν Tµν = −ρ + 3p, T00 − g00 T = (ρ + 3p),
2 2
1 1 1 1
T11 − g11 T = ek+v (ρ − p), T22 − g22 T = ek r2 (ρ − p),
2 2 2 2
1 1 1
T33 − g33 T = (T22 − g33 T ) sin2 θ , T10 − g10 T = g00 g11 Pr c.
2 2 2
7.2 Stars 445
∂ 2φ ∂φ
Dµν φ = − Γλµν λ .
∂ x µ ∂ xν ∂x
Let φ = φ (r,t). Then we have
1 1 1
D00 φ = φtt − 2 ut φt − eu−v ur φr ,
c2 2c 2
1 v−u 1
D11 φ = φrr − 2 e (kt + vt )φt − vr φr ,
2c 2
r2 −u
D22 φ = − 2 e kt φt + re−v φr ,
2c
D33 φ = D22 φ sin2 θ ,
1
D10 φ = φrt − (ur φt + φr kt + φr vt ).
2c
Thus, the field equations (7.2.46) are written as
R10 = −D1 D0 φ ,
8π G 1 1
Rkk = − 4 (Tkk − gkk T ) − (Dkk φ − gkk Φ) for k = 0, 1, 2,
c 2 2
which are expressed as
rur 8π Gr r
1+ kt + vt = 2 eu+k+v Pr + crφrt − (ur φt + φr kz + φr vt ), (7.2.47)
2 c 2
3 2 1 2
3ktt + kt + vtt + vt + kt vt − ut (kt + vt ) (7.2.48)
2 2
2 u−k−v 1 1 2
−c e urr + u2r − ur vr + ur
2 2 r
8π G 2 u−k−v 2eu−k
= − 2 (ρ + 3p) − c D00 φ + e D11 φ + 2 D22 φ ,
c r
3 1
ktt + kt2 + vtt + vt2 + 3kt vt − ut (kt + vt ) (7.2.49)
2 2
2 u−k−v 1 2 1 2
−c e urr + ur − ur vr − vr
2 2 r
8π G u 2 u−k−v 2eu−k
= e ( ρ − p) + c (e D 11 φ − D 00 φ − D22 φ ),
c2 r2
3 1 2c2 eu−k−v h v r i
ktt + kt2 + kt (vt − ut ) + 2
e + (kr + vr − ur ) − 1 (7.2.50)
2 2 r 2
8π G u 2 u−k−v
= 2 e (ρ − p) + c (D00 φ − e D11 φ ),
c
446 Chapter 7 Astrophysics and Cosmology
3. Equation of state. We know that the gravitational field equations represent the law of
gravity, which essentially dictates the gravity related unknowns: e u , ev , R = ek/2 , φ .
The laws for describing the matter field are the motion equation (7.2.51), the heat equa-
tion (7.2.52), and the energy conservation equation (7.2.53). To close the system, one needs
to supplement an equation of state given by thermal dynamics, which provides a relation
between temperature T , pressure p, and energy density ρ :
f (T, p, ρ ) = 0, (7.2.54)
which leads to
dMr
+ 4π R3r2 ev/2 Pr = 0,
dt
and (7.2.55) follows.
d
ER = 0.
dt
It follows from (7.2.55) that
On the other hand, the physically sound boundary condition for the star with variable radius
is
∂ Pr
= 0 on r = 1, (7.2.57)
∂r
which means that there is no energy exchange between the star and its exterior. Thus,
(7.2.56) and (7.2.57) imply that there is a shock wave outside the star near the boundary.
Remark 7.13 Formula √ (7.2.55) is very important. In fact, due to the boundary con-
dition (7.1.81) and ev/2 ' 1/ 1 − δ , in the star shell layer, (7.2.55) can be approximately
written as √
1−δ d
ρ Pr = − Mr for R − r > 0 small, (7.2.58)
4π R2 dt
where δ = 2Mr0 G/c2 R. This shows that a collapsing supernova is prohibited to shrink into
a black hole (δ = 1). In fact, the strongest evidence for showing that black holes cannot
be created comes from the relativistic effect of (7.2.14), which provides a huge explosive
power in the star shell layer given by
νδ 2
Pr → ∞ as δ → 1 (Pr 6= 0). (7.2.59)
1−δ
Here Pr is the convective momentum different from the contracting momentum Pr in (7.2.58);
see Section 7.3.3 for details.
Remark 7.14 One difficulty encountered in the classical Einstein gravitational field
equations is that the number of unknowns is less than the number of equations, and con-
sequently the coupling between the field equations and fluid dynamic and heat equations
become troublesome.
448 Chapter 7 Astrophysics and Cosmology
divP = 0.
The first eigenvalue β depends on the δ -factor, and by (7.2.14)
1/2
Prδ 2
β1 ∼ as δ → 1. (7.2.61)
1−δ
Based on the transition criterion (7.2.25), the property (7.2.61) implies that the star has
convection in the shell layer, i.e., the radial circulation momentum flux Pr satisfies
which provides a very riving force, resulting in the supernova explosion, as shown in Figure
7.2.
fr = 0 at r = r0 .
Here r0 is the radius of the blackhole core. Hence, the supernova’s huge explosion preserves
an interior core of smaller radius containing the blackhole core, which yields a neutron star.
In particular, the huge explosion has no imploding force, and will not generate a new black
hole.
The analysis in Steps 1-6 above provides the supernova exploding mechanism, and
clearly provides the power resource of the explosion.
In addition, by (7.2.12) and (7.2.16) we have
Z r 2 0 0
1 1 r ψφ
α= , η (r) = dr for 0 6 r 6 r0 .
1 − δ r2/r02 − η (r) 2r 0 αψ
η 0 (r) = 0.
1
Let η = f . Then
r
450 Chapter 7 Astrophysics and Cosmology
Figure 7.3 The spherical region enclosing a matter field with mass M and radius R satisfying (7.3.1)
is called black hole.
We recall again the Schwarzschild metric in the exterior of a black hole written as
In (7.3.2) we see that at r = Rs , the time interval is zero, and the spatial metric blows up:
√ Rs 1/2
−g00 dt = 1 − dt = 0 at r = Rs , (7.3.3)
r
√ Rs −1/2
g11 dr = 1 − dr = ∞ at r = Rs . (7.3.4)
r
The coordinate system (t, x) with x = (r, θ , ϕ ) represents the projection of the real world to
the coordinate space. Therefore the radial motion speed dr/dt in the projected world differs
p
from the proper speed vr by a factor −g00/g11 , i.e.
dr p
= −g00 /g11vr .
dt
Hence, the singularity (7.3.3) and (7.3.4) means that for an object moving toward to the
boundary of a black hole, its projection speed vanishes:
dr
=0 at r = Rs .
dt
This implies that any object in the exterior of the black hole cannot pass through its boundary
and enter into the interior. In the next subsection we shall rigorously prove that a black hole
is a closed and innate system.
Mathematically, a Riemannian manifold (M , gi j ) is called a geometric realization (i.e.
isometric embedding) in RN , if there exists a one to one mapping
~r : M → RN ,
such that
d~r d~r
gi j = · .
dxi dx j
The geometric realization provides a “visual” diagram of M , the real world of our Universe.
In the following we present the geometric realization of a 3D metric space of a black
hole near its boundary. By (7.3.2), the space metric of a black hole is given by
Rs −1 2 2MG
ds2 = 1 − dr + r2 (d θ 2 + sin2 θ d ϕ 2 ) for r > Rs = 2 . (7.3.5)
r c
In the interior of a black hole, the Riemannian metric near the boundary is given by the
TOV solution (7.1.42), and its space metric is in the form
−1
2 r2
ds = 1 − 2 dr2 + r2 (d θ 2 + sin2 θ d ϕ 2 ) for r < Rs , (7.3.7)
Rs
A geometrical realization of (7.3.7) is
q
~rint = r sin θ cos ϕ , r sin θ sin ϕ , r cos θ , ± Rs − r .
± 2 2 (7.3.8)
The diagrams of (7.3.6) and (7.3.8) are as shown in Figure 7.4, where case (a) is the embed-
ding (
~rext for r > Rs ,
~r+ = +
~rint for r < Rs ,
and case (b) is the embedding
(
−
~rext for r > Rs ,
~r = −
~rint for r < Rs .
The base space marked as R3 in (a) and (b) are taken as the coordinate space (i.e. the
projective space), and the surfaces marked by M represent the real world which are separated
into two closed parts by the spherical surface of radius Rs : the black hole (r < Rs ) and the
exterior world (r > Rs ).
Figure 7.4 M is the real world with the metric (7.3.5) for r > Rs and the metric (7.3.7) for r < Rs ,
and in the base space R3 the coordinate system is taken as spherical coordinates (r, θ , ϕ ).
In particular, the geometric realization of (7.3.7) for a black hole clearly manifests that
the real world in the black hole is a hemisphere with radius Rs embedded in R4 ; see Fig-
ure 7.4(a):
x21 + x22 + x23 + x24 = R2s for 0 6 |x4 | 6 Rs ,
where
q
(x1 , x2 , x3 , x4 ) = r sin θ cos ϕ , r sin θ sin ϕ , r cos θ , ± Rs − r .
2 2
7.3 Black Holes 453
This together with no escaping of particles from the interior of the black hole shows that
lim Pr = 0.
r→R+
s
In other words, there is no energy flux Pr on the Schwarzschild surface, and we have shown
that no external energy can enter into a black hole.
In conclusion, we have shown that black holes are closed: no energy can penetrate the
Schwarzschild surface.
Step 4. Assertion 3) follows by applying conclusion (7.5.55) and the fact that sub black-
holes are incompressible. The theorem is therefore proved.
We remark again that the singularity on the boundary of black holes is essential and can-
not be removed by any differentiable coordinate transformation with differentiable inverse.
The Eddington and Kruskal coordinate transformations are non-differentiable, and are not
valid.
Remark 7.16 The gravitational force F generated by a black hole in its exterior is
given by
mc2 ∂ψ
F= ∇g00 = −mg11 ,
2 ∂r
where ψ is the gravitational potential. By (7.3.2) we have the following gravitational force:
2MG mMG
F = − 1− 2 . (7.3.13)
c r r2
F =0 at r = Rs .
7.3 Black Holes 455
Black holes are a theoretical outcome. Although we cannot see them directly due to their
invisibility, they are, however, strong evidences from many astronomical observations and
theoretic studies.
In the following, we first briefly recall the Chandrasekhar limit of electron degeneracy
pressure and the Oppenheimer limit of neutron degeneracy pressure; then we present new
criterions to classify pure black holes, which do not contain other black holes in their inte-
rior, into two types: the quark and weakton black holes, by using the δ -factor.
1. Electron and neutron degeneracy pressures. Classically we know that there are two
kinds of pressure to resist the gravitational pressure, called the electron degeneracy pres-
sure and the neutron degeneracy pressure. These pressures prevent stars from gravitational
collapsing with the following mass relation:
(
1.4M for electron pressure,
m< (7.3.14)
3M for neutron pressure.
Hence, by (7.3.14), we usually think that a dead star is a white dwarf if its mass m < 1.4M ,
and is a neutron star if its mass m < 3M . However, if the dead star has mass m > 3M ,
then it is regarded as a black hole. Hence the neutron pressure gradient is thought to be
a final defense to prevent a star from collapsing into a black hole. Thus, 3M becomes a
critical mass to determine the possible formation of a black hole.
2. Interaction potential pressure. However, thanks to the strong and weak interaction
potentials established in (Ma and Wang, 2015a, 2014c), there still exist three kinds of po-
tential pressures given by
neutron stars,
quark black holes if they exist, (7.3.16)
weakton black holes if they exist.
We are now in position to discuss these potential pressures. By the theory of elementary
particles, a neutron is made up of three quarks n = uud, and u, d quarks are made up of
three weaktons as u = w∗ w1 w1 , d = w∗ w1 w2 . The three levels of particles possess different
potentials distinguished by their interaction charges:
456 Chapter 7 Astrophysics and Cosmology
ρw 3
neutron charge gn = 3 gs ,
ρn
3
ρw (7.3.17)
quark charge gq = gs ,
ρq
weakton weak charge gw ,
g2
f= .
r2
The force acts on particle’s cross section with area S = π r 2 , which yields the interaction
potential pressure as
f g2
P= = 4. (7.3.18)
S πr
Let each ball Br with radius r contain only one particle. Then the mass density ρ is given
by
3m0
ρ= , (7.3.19)
4π r3
where m0 is the particle mass. By the uncertainty relation, in Br the particle energy ε0 is
h̄
ε0 = ,
2t
and t = r/v, where v is the particle velocity. Replacing v by the speed of light c, we have
h̄c
ε0 = .
2r
By m0 = ε0 /c2 , the density ρ of (7.3.19) is written as
3h̄ 3h̄
ρ= or equivalently r4 = . (7.3.20)
8π cr4 8π cρ
Inserting r4 of (7.3.20) into (7.3.18), we derive the interaction potential pressure P in the
form
8cρ g2
P= . (7.3.21)
3h̄
3. Critical δ -factors. It is known that the central pressure of a star with mass m and
radius r0 can be expressed as
Gm2 2π c2 2mG
PM = = ρδ , δ= , (7.3.22)
r04 3 c 2 r0
7.3 Black Holes 457
g2n ∼ h̄c.
Thus, we take
π
g2n = h̄c, (7.3.25)
4
and (7.3.24) is just the black hole criterion.
If the δ -factor satisfies that
4 g2n 4 g2q
6δ < , (7.3.26)
π h̄c π h̄c
then the neutrons will be crushed to become quarks and gluons. The equality (7.3.25) shows
that the star satisfying (7.3.26) must be a black hole which is composed of quarks and
gluons, and is called quark black hole.
If δ satisfies
4 g2q 4 g2w
6δ < , (7.3.27)
π h̄c π h̄c
then the quarks are crushed into weaktons, and the body is called weakton black hole.
In summary, we infer from (7.3.24), (7.3.26) and (7.3.27) the following conclusions:
a neutron star if δ < δnc and m > 1.4M ,
a body = a quark black hole if δnc < δ < δqc , (7.3.28)
c c
a weakton black hole if δq < δ < δw ,
where
2
4 gj
δ jc = for j = n, q, w.
π h̄c
458 Chapter 7 Astrophysics and Cosmology
5. Upper limit of the radius. Weaktons are elementary particles, which cannot be
crushed. Therefore, there is no star with δ -factor greater than δwc . Thus there exists an
upper limit for the radius rc for astronomical bodies with mass m, determined by
2MG
= δwc .
c 2 rc
Namely, the upper limit of the radius rc reads
π mG h̄c
rc = . (7.3.29)
2c2 g2w
Finally we remark that since black holes cannot be compressed, the δ -factor of any
astronomical object cannot be less than one: δ > 1. Therefore, by (7.3.25) and (7.3.26),
there exist no weakton black holes.
1. Jeans theory on the origin of stars and galaxies. In the beginning of the twentieth
century, J. Jeans presented a general theory for the formation of galaxies and stars. He
thought that the Universe in the beginning was filled with chaotic gas, and various astro-
nomical objects were formed in succession by a process of gas decomposition into bulks of
clouds, consequently forming galaxies, stars, and planets.
According to the Jeans theory, a ball of clouds with homogeneous density ρ can be held
together only if
V + K 6 0, (7.3.30)
where V is the total gravitational potential energy, and K is the total kinetic energy of all
particles. The potential energy V is
Z R
GMr 3GM 2
V =− × 4π r2ρ dr = − , (7.3.31)
0 r 5R
7.3 Black Holes 459
where M is the mass of the cloud, Mr = 4π r3 ρ /3, and R is the radius. The kinetic energy K
is expressed as the sum of thermal kinetic energies of all particles:
3
K = NkT,
2
where N is the particle number, T is the temperature, and k is the Boltzmann constant.
Assume that all particles have the same mass m, then N = M/m, and we have
3M
K= kT. (7.3.32)
2m
Thus, by (7.3.30)-(7.3.32) we obtain that
GM 5
> kT. (7.3.33)
R 2m
The inequality (7.3.33) is called the Jeans condition.
2. Masses of astronomical objects. The Jeans condition (7.3.33) guarantees only the
gaseous clouds being held together, and does not imply that the gas clouds can contract to
form an astronomical object. However, a black hole must attract the nebulae around it to
form a compact body.
We consider the mass relation between an astronomical object and its black hole core.
The mass M of the object is
M = Mb + M1 , (7.3.34)
where Mb is the mass of the black hole, and M1 is the mass of the material attracted by this
black hole. The total binding potential energy V of this object is given by
Z R
GMr
V =− × 4π r2ρ dr, (7.3.35)
Rs r
where R is the object radius, Rs is the radius of the black hole, ρ is the mass density outside
the core, and
Z r
Mr = Mb + 4π r2 ρ dr. (7.3.36)
Rs
Since Rs R, we take Rs = 0 in the integrals (7.3.35) and (7.3.36). We assume that the
density ρ is a constant. Then, it follows from (7.3.35) and (7.3.36) that
Z R
4π 4 Mb R2 4π
V = −4π Gρ Mb r + ρ r dr = −4π Gρ + ρ R5 .
0 3 2 3×5
4
By ρ = M1 / π R3 and Mb = M − M1 , we have
3
460 Chapter 7 Astrophysics and Cosmology
G 3 9
V =− MM1 − M12 . (7.3.37)
R 2 10
The stability of the object requires that −V takes its maximum at some M1 such that
dV /dM1 = 0. Hence we derive from (7.3.37) that
5 1
M1 = M, Mb = M. (7.3.38)
6 6
The relation (7.3.38) means that a black hole with mass Mb can form an astronomical object
with mass M = 6Mb .
3. Relation between radius and temperature. A black hole with mass Mb determines the
mass M of the corresponding astronomical system: M = 6Mb . Then, by the Jeans relation
(7.3.33), the radius R and average temperature T satisfy
2 × 6GmMb
T= (7.3.39)
5kR
where T is expressed as
Z
3
T= τ (x)dx,
4π R3 BR
where BR is the ball of this system, and τ (x) is the temperature distribution. Let τ = τ (r)
depend only on r, then we have
Z R
3
T= r2 τ (r)dr. (7.3.40)
R3 0
4. Solar system. For the Sun, M = 2 × 1030kg and R = 7 × 108m. Hence the mass of the
solar black hole core is about
1
M b = × 1030kg,
3
and the average temperature has an upper limit:
4 6.7 × 10−11m3 /kg · s2 × 1030kg × 1.7 × 10−27kg
T= × ' 108 K.
5 1.4 × 10−24kg · m2 /s2 · K × 7 × 108m
For the earth, M = 6 × 1024kg, R = 6.4 × 106m. Thus we have
5. The radii of the solar and earth’s black hole cores. The radius of solar black hole is
given by
Rs = 500 m,
and the radius of black hole of the earth is as
3
Res = cm.
2
7.4 Galaxies 461
7.4 Galaxies
7.4.1 Introduction
Galaxy is an elementary unit in the large scale structure of our Universe, which is composed
of star clusters, stars, gases and dusts. In the following we introduce this topic.
1. Levels of cosmical structure. There are mainly five levels of cosmic structure: stars,
star clusters, galaxies, clusters of galaxies, and the Universe, and their basic scales are given
inTable 7.1.
Elliptical galaxies have an oval appearance, which are classified into eight grades, de-
noted by E0 , E1 , · · · , E7 , according to their apparent ellipticity e = (a − b)/a, where a is the
major radius and b is the minor radius. The E0 -type of elliptical galaxies are spherical in
shape with e0 = 0, and the E7 -types are of the most ablate appearance, i.e. the apparent
ellipticity ek of Ek is arranged in order
The S-spirals form a sequence of three types: Sa , Sb , Sc . The Sa have large galaxy nuclei
and tightly wound arms, the Sb have moderate galaxy nuclei and less tightly wound arms,
and the Sc have the smallest nuclear bulges and the least wound arms. Our galaxy (the Milky
Way) is type Sb .
About one third of all spiral galaxies are the SB-type. They also consist of three types:
SBa , SBb , SBc , according to the size of the galaxy nuclei and tightness of the spiral arms,
exactly as in S-spiral sequence.
The galaxy classification given above was introduced by E. Hubble, who arranged the
galaxies in an orderly diagram as shown in Figure 7.5.
462 Chapter 7 Astrophysics and Cosmology
Figure 7.5 Hubble classification of galaxies, the ellipticals arranged on the left, and the spirals ar-
ranged on the right in two parallel sequences.
The SO (S-zero) galaxies have the characteristicsof ellipticals and spirals. They are disk
shaped, like spirals, but lack spiral structure, and therefore like flat ellipticals.
The amount of interstellar gas, also called nebula, in galaxies increases from left to right
in the diagram given by Figure 7.2. In ellipticals the amount of gas is very small, and it is
also small in SO galaxies. However, the nebula increases progressively in the spirals from
Sa and SBa to Sc and SBc . in addition, another important distinction between ellipticals
and spirals is that clusters of galaxies contain 80% elliptics, and outside clusters, 80% of
galaxies are spirals.
3. Galaxy properties. Various properties of galaxies are listed in Tables 7.2-7.4.
4. Active galaxies. According to their active extent, galaxies are divided into two types:
normal galaxies and active galaxies. Active galaxies mainly include
7.4 Galaxies 463
Active galaxies are usually characterized by their extraordinary energy emision. The
starburst galaxies emit a large amount of infrared radiation, which is caused by a great
number of newly formed stars. But the three types of galaxies: Seyferts, Quasars and
Radio-galaxies are known as active galaxies because they possess a compact region at their
centers that has a much higher radiation than normal luminosity, which is called the active
galactic nucleus (AGN).
AGN generates and emits immense quantities of energy over a wide range of electro-
magnetic wavelengths. Today, it is widely believed that AGN is an accretion disk generated
by a supermassive black hole. Though we have not completely confirmed evidence to show
the existence of black holes, many astronomical observations strongly suggest their exis-
tence. The main features of active galaxies are listed in Table 7.5.
5. The Milky Way. Our own galaxy is known as the Milky Way, which is of S b type and
has 1011 stars and an enormous quantities of clouds of gases and dusts. Its radius is about
105 ly.
The Milky Way consists of two parts: the disk and the halo. The disk has the radius
of about 5 × 105 ly and the thick of about 5 × 104 ly. The disk is composed of stars and
nebulae, rotating around the center of the galaxy. The Sun is 3 × 10 4 ly from the nucleus, it
moves at 300km/s and has a period of 2 × 108 years.
The halo of the Milky Way is spherically-shaped, centered on the nucleus of the galaxy,
and has the radius of 105 ly. The halo consists of less gas and roughly 120 globular clusters,
each of which has hundreds of thousands of stars and moves in an elliptical orbit around the
nucleus of the galaxy. The halo does not rotate with the disk.
6. Galaxy clusters. Galaxies are not uniformly distributed in the Universe, but aggregate
in clusters of different size. Clusters of galaxies are the largest known astronomical systems
bound by gravitational attraction. They form dense regions in the large scale structure of
our Universe. The clusters are associated with much larger, non-gravitationally bounded
groups, called superclusters.
The great regular clusters of galaxies are spherically-shaped, have roughly thousands
of galaxies, almost all of which are of E and S0 types. The regular clusters of galaxies
464 Chapter 7 Astrophysics and Cosmology
are typically 5 × 106ly in radius, and have no clear outer boundaries. Clusters are locked
as their galaxies held together by mutual gravitational attraction. However, their velocities
are too large, exceeding 103 km/s, for them to remain gravitationally bound by their mutual
attractions. It implies the presence of either an additional invisible mass component, or
an additional attractive force besides the Newtonian gravity. This is the so called dark
matter phenomenon. Astronomical observations manifest that there are large amounts of
intergalactic gas which is very hot between 107 ∼ 108 K. The total mass of the gas is greater
than that of all galaxies in the cluster. The wind of intergalactic gas streaming through these
fast moving galaxies is strong enough to strip away their interstellar gas. This explains
why the E and S0 types of galaxies have less gas because their gas has been swept out by
intergalactic winds.
The other galaxy clusters are irregular. They have various sizes ranged from thousands
of members to a few tens of members. The smaller clusters are also called galaxy groups.
For example, our galaxy is a member of a galaxy group known as the local group which
possesses about 40 galaxies. The irregular clusters lack spherical symmetry in shape and
contain a mixture of all types of galaxies.
where r0 is the radius of galaxy nucleus and r1 the galaxy radius. In the following we
develop models for spiral and elliptical galaxies, and provide their basic consequences on
galactic dynamics.
1. Spiral galaxies. Spiral galaxies are disc-shaped, as shown in Figure 7.6. We model
the galaxy in a disc domain as
D = {x ∈ R2 | r0 < |x| < r1 }, (7.4.1)
for which the spherical coordinates reduce to the polar coordinate system (ϕ , r):
π
(θ , ϕ , r) = ,ϕ,r for 0 6 ϕ 6 2π , r0 < r < r1 . (7.4.2)
2
The metric satisfying the gravitational field equations (7.1.62) of the galaxy nucleus is the
Schwarzschild solution:
2 M0 G
g00 = − 1 + 2 ψ , ψ =− ,
c r
(7.4.3)
δ r0 −1 2M0 G
g11 = α (r) = 1 − , δ= 2 ,
r c r0
where r0 < r < r1 and M0 is the mass of galactic nucleus.
With (7.4.2) and (7.4.3), the 2D fluid equations (7.1.65)-(7.1.68) are written as
∂ Pϕ 1 1 ∂p
+ (P · ∇)Pϕ = ν ∆Pϕ − ,
∂τ ρ r ∂ϕ
∂ Pr 1 1 ∂p Mr G
+ (P · ∇)Pr = ν ∆Pr − − ρ (1 − β T) 2 ,
∂τ ρ α ∂r αr
(7.4.4)
∂T 1
+ (P · ∇)T = κ ∆T
e + Q,
∂τ ρ
∂ρ
+ divP = 0,
∂τ
supplemented with boundary conditions:
Pϕ (r0 ) = ζ0 , Pr (r0 ) = 0, T (r0 ) = T0 ,
(7.4.5)
Pϕ (r1 ) = ζ1 , Pr (r1 ) = 0, T (r1 ) = T1 .
Here α is as in (7.4.3), and Mr is the total mass in the ball Br .
2. Elliptical galaxies. Elliptical galaxies are spherically-shaped, defined in a spherical-
annular domain, as shown in Figure 7.7:
The metric is as in (7.4.3), and the corresponding fluid equations (7.1.65)-(7.1.68) are
in the form:
∂P 1 M0 G
+ (P · ∇)P = ν ∆P − ∇p − ρ (1 − β T) 2 ~k,
∂τ ρ αr
∂T 1
+ (P · ∇)T = κ ∆T e + Q, (7.4.7)
∂τ ρ
∂ρ
+ div P = 0,
∂τ
supplemented with the physically sound conditions:
∂ Pθ ∂ Pϕ
Pr = 0, = 0, = 0 at r = r0 , r1 ,
∂r ∂r (7.4.8)
T (r0 ) = T0 , T (r1 ) = T1 .
Spiral galaxies are divided into two types: normal spirals (S-type) and barred spirals (SB-
type). We are now ready to discuss these two sequences of galaxies by using the spiral
galaxy model (7.4.4)-(7.4.5).
Let the stationary solutions of (7.4.4)-(7.4.5) be independent of ϕ , given by
The heat source is approximatively taken as Q = 0. Then the stationary equations of (7.4.4)-
(7.4.5) are !
00 0 1 δ r 0 00 3 0
eϕ
P
rPeϕ + 2Peϕ − Peϕ − rPeϕ + Peϕ − = 0,
r r 2 2r
∂ pe 1 e2 1
= P − ρ (1 − β T)M
e r G,
∂r rρ ϕ r2 (7.4.14)
!
d e
2 dT
r = 0,
dr dr
Peϕ (r0 ) = ζ0 , Peϕ (r1 ) = ζ1 , Te(r0 ) = T0 , Te(r1 ) = T1 .
The first equation of (7.4.14) is an elliptic boundary value problem, which has a unique
solution Pϕ . Since δ r0 /r is small in the domain (7.4.1), the first equation of (7.4.14) can be
approximated by
2 1
Peϕ00 + Pϕ0 − 2 Peϕ = 0,
r r
which has an analytic solution as
√ √
5−1 5+1
Peϕ = β1 rk1 + β2rk2 , k1 = , k2 = − . (7.4.15)
2 2
By the boundary conditions in (7.4.14), we obtain that
∂ Pϕ 1 Peϕ d Peϕ 1 ∂p
+ (P · ∇)Pϕ = ν ∆Pϕ − ( + )Pr − ,
∂τ ρ r dr r ∂ϕ
∂ Pr 1 2Peϕ ρβ Mr G 1 ∂p
+ (P · ∇)Pr = ν ∆Pr + Pϕ + T− ,
∂τ ρ αr αr 2 α ∂r
(7.4.17)
∂T 1 e + r0 r1 γ Pr − 1 Peϕ ∂ T ,
+ (P · ∇)T = κ ∆T
∂τ ρ ρ r2 ρr ∂ ϕ
div P = 0,
P = 0, T =0 at r = r0 , r1 ,
1 Peϕ d Peϕ 1 ∂p
− ∆Pϕ + ( + )Pr + = λ Pϕ ,
ν r dr rν ∂ ϕ
2Peϕ ρβ Mr G 1 ∂p
− ∆Pr − Pϕ − T+ = λ Pr ,
αν r αν r2 αν ∂ r
(7.4.18)
1 ∂ T r0 r1 γ
− ∆T + Peϕ − Pr = λ T,
ρ r ∂ ϕ κρ r2
div P = 0,
P = 0, T =0 at r = r0 , r1 .
dictate the dynamic behaviors of spiral galaxies, which are determined by the physical pa-
rameters:
Z r1
T0 − T1 2M0 G
ζ 0 , ζ 1 , r0 , r1 , κ , ν , β , γ = , δ = 2 , Mr = M0 + 4π r2 ρ dr. (7.4.20)
r1 − r 0 c r0 r0
Based on the dynamic transition theory in (Ma and Wang, 2013b), we have the following
physical conclusions:
• If the parameters in (7.4.20) make the first eigenvalue λ1 < 0, then the spiral galaxy
is of S0-type.
7.4 Galaxies 469
• If λ1 > 0, then the galaxy is one of the types Sa , Sb , Sc , SBa , SBb , SBc , depending on
the structure of (Pϕ0 , Pr0 ) in (7.4.19).
• Let λ1 > 0 and the first eigenvector (Pϕ0 , Pr0 ) of (7.4.19) have the vortex structure as
shown in Figure 7.8. Then the number of spiral arms of the galaxy is k/2, where k is
the vortex number of (Pϕ0 , Pr0 ). Hence, if k = 2, the galaxy is of the SBc -type.
Figure 7.8 The vortex structure of the first eigenvector (Pϕ0 , Pr0 ).
The reason behind the number of spiral arms being k/2 is as follows. First the number of
vortices in Figure 7.8 is even, and each pair of vortices have reversed orientations. Second, if
the orientation of a vortex matches that of the stationary solution Pϕ (r) of (7.4.14), then the
superposition of Pϕ (r) and Pϕ0 of (7.4.19) gives rise to an arm; otherwise, the counteraction
of Pϕ (r) and Pϕ0 with reversed orientations reduces the energy momentum density, and the
region becomes nearly void.
Remark 7.17 There are three terms in (7.4.18), which may generate the transition of
(7.4.17):
k1 T k2 Pr ρβ Mr G r0 r1 γ
F1 = 0, − 2 , − 2 , k2 = , k2 = ,
r r αν κρ
! !
1 Peϕ d Peϕ 2Peϕ
F2 = + Pr , − Pϕ , 0 ,
ν r dr αν r
1 dα ∂ 1 ∂ 1 dα
F3 = − 2 (rPϕ ), ( Pr ), 0 .
2α r dr ∂ r 2α ∂ r α dr
The term F1 corresponds to the Rayleigh-Bénard convection with the Rayleigh number
β Mr Gr0 r1 γ
R = k 1 k2 = ,
ανκ
the term F2 corresponds to the Taylor rotation which causes the instability of the basic flow
(Pϕ , Pr ) = (Peϕ , 0), and F3 is the relativistic effect which only plays a role in the case where
δ ' 1.
470 Chapter 7 Astrophysics and Cosmology
where (Peϕ , Te, pe, ρ ) is the solution of (7.4.24)-(7.4.25). Then the equations (7.4.7) are rewrit-
ten as
∂ Pθ
Pr = 0, Pϕ = 0, = 0, T =0 at r = Rs , R1 . (7.4.27)
∂r
Ω0 Ω 1 , (7.4.28)
This property leads to the instability of the rotating flow represented by the stationary solu-
tion:
(Pr , Pθ , Pϕ ) = (0, 0, Peϕ ), (7.4.29)
which is similar to the Taylor-Couette flow in a rotating cylinder. The rotating instability can
generate a circulation in the galactic nucleus, as the Taylor vortices in a rotating cylinder,
472 Chapter 7 Astrophysics and Cosmology
as shown in Figure 7.9. The instability is caused by the force F = (Fr , Fθ , Fϕ , T ) in the
equations of (7.4.26) given by
2Peϕ Peϕ ∂ Pr
Fr = Pϕ − ,
ρα r ρ r sin θ ∂ ϕ
2 cos θ Peϕ Peϕ ∂ Pθ
Fθ = Pϕ − ,
ρ r sin θ ρ r sin θ ∂ ϕ
! ! (7.4.30)
1 Peϕ ∂ Peϕ 1 cos θ e ∂ Peϕ Peϕ ∂ Pϕ
Fϕ = − + Pr − Pϕ + Pθ − ,
ρ r ∂r ρ r sin θ ∂θ ρ r sin θ ∂ ϕ
Peϕ ∂ T
T =− .
ρ r sin θ ∂ θ
3. Rayleigh-Bénard instability. Due to the nuclear reaction (fusion and fission) and the
large pressure gradient, the galactic nucleus possesses a very large temperature gradient in
(7.4.25) as
DT = T0 − T1 , (7.4.31)
which yields the following thermal expansion force in (7.4.26), and gives rise to the Rayleigh-
Bénard convection:
Mb G 1 d Te
Fr = β ρ T, T= Pr . (7.4.32)
αr 2 ρ dr
4. Instability due to the gravitational effects. Similar to (7.2.7), there is a radial force in
the term ν ∆ur of the third equation of (7.4.26):
ν ∂ 1 dα
Fr = Pr , (7.4.33)
2α ∂ r α dr
where
α = (1 − Rs/r)−1 , Rs < r < R 1 . (7.4.34)
In (7.4.33) and (7.4.34), we see the term
ν R2s
fr = Pr , (7.4.35)
1 − Rs/r r4
which has the property that
(
+∞ for Pr > 0 at r = Rs ,
fr = (7.4.36)
−∞ for Pr < 0 at r = Rs .
It is the force (7.4.36) that not only causes the instability of the basic flow (7.4.29), but also
generates jets of the galaxy nucleus.
7.4 Galaxies 473
5. Latitudinal circulation. The above three types of forces: the rotating force (7.4.30),
the thermal expansion force (7.4.32), and the gravitational effect (7.4.35), cause the insta-
bility of the basic flow (7.4.29) and lead to the latitudinal circulation of the galactic nucleus,
as shown in Figure 7.9.
6. Jets and accretions. Each circulation cell has an exit as shown in Figure 7.10, where
the circulating gas is pushed up by the radial force (7.4.35)-(7.4.36), and erupts leading to
a jet. The cell has an entrance as shown in Figure 7.10, where the exterior gas is pulled
into the nucleus, is cyclo-accelerated by the force (7.4.35), goes down to the inner boundary
r = Rs , and then is pushed by Fθ of (7.4.30) toward to the exit. Thus the circulation cells
form jets in their exits and accretions in their entrances. In Figure 7.11(a), we see that there
is a jet in the latitudinal circulation with k = 1 cell, and in Figure 7.11(b) there are two jets
in the circulation with k = 2 cells in its south and north poles, and an accretion disk near its
equator.
Figure 7.10
474 Chapter 7 Astrophysics and Cosmology
Figure 7.11 (a) A jet in the latitudinal circulation with k = 1 cell, two jets in the latitudinal circulation
with k = 2 cells.
7. Condition for jet generation. The main power to generate jets comes from the gravi-
tational effect of (7.4.35)-(7.4.36) by the black hole. The radial momentum Pr in (7.4.35) is
the bifurcated solution of (7.4.26), which can be expressed as
Pr = R2s Qr ,
Fr > 0 if fr > fE .
The condition (7.4.39) can be equivalently rewritten as that there is a critical mass Mc
such that the galactic nucleus is active if its mass M is bigger than Mc , i.e. M > Mc . By
(7.4.21), we have
105 M < Mc or 106 M < Mc .
Remark 7.18 The jets shown in Figures 7.9 and 7.10 are column-shaped. If the cell
number k > 3 for the latitudinal circulation of galaxy nucleus, then there are jets which are
disc-shaped. We don’t know if there exist such galaxy nuclei which have the disc-shaped
jets in the Universe. Theoretically, it appears to be possible.
Remark 7.19 Galactic nucleus are made up of plasm. The precise description of AGN
jets requires to take into consideration of the magnetic effect in the modeling. However the
essential mechanism does not change and an explosive magnetic energy as in (7.4.37) will
contribute to the supernovae explosion.
1. The Hubble Law. In 1929, American astronomer Edwin Hubble discovered an ap-
proximatively linear relation between the recession velocity v and the distance R of remote
galaxies, which is now called the Hubble Law:
v = HR, (7.5.1)
where H is called the Hubble constant, depends on time, and its present-time value is
Formula (7.5.1) implies that the farther away the galaxy is from our galaxy, the greater
its velocity is.
2. Expansion of the Universe. An important physical conclusion from the Hubble Law
(7.5.1)-(7.5.2) is that our Universe is expanding.
If we regard our Universe as a 3-dimensional sphere:
Each point on the sphere Sr3 can be regarded as a center, as the radius r increases, all points
on the sphere are moving away from the point. Moreover, the farther away a remote object
476 Chapter 7 Astrophysics and Cosmology
is from the point, the faster the object appears to be moving. For example, for any two
points p1 and p2 on a sphere Sr3 , θ is the angle between p1 and p2 ; see Figure 7.12. As the
radius r varies from r1 to r2 , the distance R between P1 and P2 varies from R1 to R2 :
R1 = θ r 1 → R 2 = θ r 2 .
Figure 7.12
dR
v= = θ ṙ, (R = θ r). (7.5.4)
dt
Assume that ṙ is proportional to r, i.e.
Then the Hubble law (7.5.1) follows from (7.5.4) and (7.5.5).
The Hubble Law (7.5.1) also manifests that the recession velocity is independent of
the direction. Hence, it implies that the Universe is isotropic. This very property gives
rise to a strong restriction on the global structure of the Universe: there are only two types
of geometry for our Universe: either the 3D sphere (7.5.3) or the 3D Euclidian space R 3 .
Namely, we have reached the following physical conclusion.
Physical Conclusion 7.20 The Hubble law allows only two types of topological struc-
ture, either S3 or R3 , as our Universe. Further, the Hubble Law requires that the Universe
is expanding.
3. Big-Bang theory. In 1927, the Belgium cosmologist G. Lemaı̂tre first proposed that
the Universe begins with a big explosion, known as the Big-Bang theory. After the Hubble
law was discovered, in 1931, G. Lemaı̂tre regarded the recessions of remote galaxies as the
7.5 The Universe 477
expanding of the Universe, and he thought that as the time inverses, the early universe must
be in a high temperature and dense state, and the stage is considered as the beginning of the
Big-Bang.
The Big-Bang theory appears to be consistent with the three most important astronom-
ical facts: the Hubble law, the cosmic microwave background, discovered in 1965 by two
physicists A. Penzias and R. Wilson, and the abundance of helium.
4. Age of the Universe. Based on the Big-Bang theory, the age of the Universe is finite.
Let T be the age, then the present-day distance between any two points is
R = vT.
Theorem 7.21 (Topological Structure of the Universe) The global topological struc-
ture of the Universe is either S3 or R3 .
6. Global geometrical structure. Based on the Cosmological Principle 7.21, three physi-
cists A. Friedmann (1922), G. Lemaı̂tre (1927), H. P. Robertson (1935), and a mathemati-
cian A. G. Walker (1936), independently derived the globally geometrical structure of the
Universe.
where R(t) is the scalar factor representing the radius of the Universe, which depends only
on time t, and k = 1, 0, −1 stand for the sign of space scalar curvature in the following
sense:
1
k=1: M = S3 the 3D sphere with scalar curvature = 2 ,
R
k=0: M = R3 the 3D flat Euclidean space, (7.5.9)
1
k = −1 : M = L3 the Lobachevsky space with scalar curvature = − 2 ,
R
where the Lobachevsky space L3 has the same topological structure as R3 .
Historically, Theorem 7.22 was rigorously proved by Robertson and Walker, and was
assumed by Friedmann and Lemaı̂tre to derive the field equations satisfied by R(t).
7. The Newton cosmology. The Newton cosmology is based on the Newton Gravitational
Law. By Cosmological Principle (Roos, 2003), the universe is spherically symmetric. For
any reference point p ∈ M , the motion equation of an object with distance r from p is
d2r GM(r)
=− 2 , (7.5.10)
dt 2 r
where M(r) = 4π r 3 ρ /3, and ρ is the mass density. Thus, (7.5.10) can be rewritten as
follows
4
r00 = − π Grρ . (7.5.11)
3
Make the nondimensional
r = R(t)r0 ,
where R(t) is the scalar factor, which is the same as in the FLRW metric (Ma and Wang,
2014e). Let ρ0 be the density at R = 1. Then we have
ρ = ρ0 /R3 . (7.5.12)
2) Let E0 be the total kinetic energy of the Universe, M is the mass, then we have
3 GM 2
for k = 0,
E0 = 5 R 2 (7.5.22)
2 GM − 1 Mc2 for k = 1,
3π R 2
where the first term represents the total gravitational bound potential energy, and the second
term is the energy resisting curvature tensor.
3) By (7.5.18), Ṙ 6≡ 0, and consequently the universe is dynamic.
4) By (7.5.15), R̈ < 0, the dynamic universe is decelerating.
9. The Lemaı̂tre cosmology. Consider the Einstein gravitational field equations with
the cosmological constant Λ term:
8π G 1
Rµν = − 4
(Tµν − gµν T ) + Λgµν , Λ > 0, (7.5.23)
c 2
then the metric (7.5.8) satisfies the following equations
4π G 3p Λc2
R̈ = − ρ + 2 R+ R, (7.5.24)
3 c 3
2
Ṙ 8π G Λc2 kc2
= ρ+ − 2, (7.5.25)
R 3 3 R
Ṙ p
ρ̇ + 3 ρ + 2 = 0. (7.5.26)
R c
The equations (7.5.24)-(7.5.26) are known as the Lemaı̂tre cosmological model, or the Λ-
cosmological model, which leads to the following conclusions of Λ-cosmology:
Conclusions of Λ-Cosmology 7.24
1) The Universe is temporally open: R → ∞ as t → ∞;
2) There is a critical radius Rc , such that
the universe is decelerating for R < Rc ,
the universe is accelerating for R > Rc ,
4π Gρ0 1/3
where Rc ' ;
Λc2
7.5 The Universe 481
Namely √
Λc2 /3t
R(t)/e = const. as t → ∞;
M = E/c2 . (7.5.30)
At the initial stage, all energy is concentrated in a ball with radius R0 . By the theory of
black holes, the energy contained in the ball generates a black hole in R 3 with radius
2MG
Rs = , (7.5.31)
c2
provided Rs > R0 ; see (7.3.1) and Figure 7.3.
Thus, if the universe is born to a ball, then it is immediately trapped in its own black
hole with the Schwarzschild radius Rs of (7.5.31). The 4D metric inside the black hole of
the static universe is given by
where ψ and α satisfy the equations (7.1.79) and (7.1.80) with boundary conditions:
ψ →ψ
e, α →α
e as r → Rs ,
Basic problems
A static universe is confined in a ball with fixed radius Rs in (7.5.31), and the ball
behaves like a black hole. We need to examine a few basic problems for a static universe,
including the cosmic edge, the flatness, the horizon, the redshift, and the cosmic microwave
background (CMB) radiation problems.
1. The cosmic edge problem. In the ancient Greece, the cosmic-edge riddle was pro-
posed by the philosopher Archytas, a friend of Plato, who used “what happens when a
spear is thrown across the outer boundary of the Universe?” The problem appears to be very
difficult to answer. Hence, for a long time physicists always believe that the Universe is
boundless.
Our theory of black holes presented in Section 7.3 shows that all objects in a globular
universe cannot reach its boundary r = Rs . In particular, an observer in any position of the
globular universe looking toward to the boundary will see no boundary due to the openness
of the ball and the relativistic effect near the Schwarzschild surface. Hence the cosmic-edge
riddle is no longer a problem.
7.5 The Universe 483
2. The flatness problem. In modern cosmology, the flatness problem means that k = 0
in the FLRW metric (7.5.8)-(7.5.9). It is common to think that the flatness of the universe
is equivalent to the fact that the present energy density ρ must be equal to the critical value
given by (7.5.20). In fact, mathematically the flatness means that any geodesic triangle has
the inner angular sum π = 180◦ .
Measurements by the WMAP (Wilson Microwave Anisotropy Probe) spacecraft in the
last ten years indicated that the Universe is nearly flat. The present radius of the Universe is
about
R = 1026m. (7.5.34)
If the Universe is static, then (7.5.34) gives the Schwarzschild radius (7.5.31), from which
it follows that the density ρ of our Universe is just the critical density of (7.5.20):
Thus, we deduce that if the universe is globular, then it is static. In addition, we have shown
that any universe is bounded and confined in a 3D hemisphere of a black hole or in a 3D
sphere as shown in Figure 7.14. Hence as the radius is sufficiently large, the universe is
nearly flat.
3. The horizon problem. The cosmic horizon problem can be simply stated as that all
places in a universe look as the same. It seems as if the static Universe with boundary
violates the horizon problem. However, due to the gravitational lensing effect, the light
bents around a massive object. Hence, the boundary of a globular universe is like a concave
spherical mirror, and all lights reaching close to it will be reflected back, as shown in Figure
7.13. It is this lensing effect that makes the globular universe looks as if everywhere is the
same, and is horizontal. In Figure 7.13, if we are in position x, then we can also see a star
as if it is in position ye, which is actually a virtual image of the star at y.
Figure 7.13 Due to the lensing effect, one at x can also see the star at y as if it is another star at ye.
4. The redshift problem. Observations show that light coming from a remote galaxy is
redshifted, and the farther away the galaxy is, the larger the redshift is. In astronomy, it is
484 Chapter 7 Astrophysics and Cosmology
The gravitational redshift in a black hole are caused by both the gravitational fields of the
emitting object and the black hole.
The first type of redshift, due to the gravitational field, is formulated as
r
2mG
1− 2
c r
1+z = r , (7.5.36)
2mG
1− 2
c r0
where m is the mass of the emitting object, r0 is its radius, and r is the distance between
the object and the observer.
The second type of redshifts, due to the cosmological effect or black hole effect, is
p
−g00(r0 )
1+z = p , (7.5.37)
−g00(r1 )
where g00 is the time-component of the black hole gravitational metric, r0 and r1 are the
positions of the observer and the emitting object (including virtual images).
If a universe is not considered as a black hole, then the gravitational redshift is simply
given by (7.5.36) and is very small for remote objects. Likewise, the cosmological redshift
is also too small to be significant. Hence, astronomers have to think the main portion of the
redshift is due to the Doppler effect:
p
1 + v/c
1+z = p . (7.5.38)
1 − v/c
In addition, Hubble discovered that the redshift has an approximatively linear relation with
the distance:
z ' kR, k is a constant. (7.5.40)
Thus, the Hubble Law (7.5.1) follows from (7.5.39) and (7.5.40). It is the Hubble Law
(7.5.1) that leads to the conclusion that our Universe is expanding.
7.5 The Universe 485
However, if we adopt the view that the globular universe is in a black hole with the
Schwarzschild radius Rs as in (7.5.31), the black hole redshift (7.5.37) cannot be ignored.
By (7.5.32) and (7.5.33), the time-component g00 for the black hole can approximatively
take the TOV solution as r near Rs :
1 r2
g00 = − 1− 2 , for r near Rs .
4 Rs
Hence, the redshift (7.5.37) is as
q
1 − r02/R2s
1+z = q , for r0 , r1 < Rs . (7.5.41)
1 − r12/R2s
It is known that for a remote galaxy, r1 is close to the boundary r = Rs . Therefore by (7.5.41)
we have
z → +∞ as r1 → Rs .
It reflects the redshifts observed from most remote objects. If the object is a virtual image
as shown in Figure 7.13, then its position is the reflection point r1 . Thus, we see that even
if the remote object is not moving, its redshift can still be very large.
5. CMB problem. In 1965, two physicists A. Penzias and R. Wilson discovered the
low-temperature cosmic microwave background (CMB) radiation, which fills our Universe,
and it is ever regarded as the Big-Bang product. However, for a static closed Universe, it
is the most natural thing that there exists a CMB, because the Universe is a black-body and
CMB is a result of black-body radiation.
6. None expanding Universe. As the energy of the Universe is given, the maximal
radius, i.e. the Schwarzschild radius Rs , is determined, and the boundary is invariant. In fact,
a globular universe must fill the ball with the Schwarzschild radius, although the distribution
of the matter in this ball may be slightly non-homogenous. The main reason is that if the
universe has a radius R smaller than Rs , then it must contain at least a sub-black hole with
radius R0 as follows r
R
R0 = R.
Rs
In Section 7.5.4 we shall discuss this topic.
A globular universe must be static. With the same argument, a spherical closed universe
have to be static as well. In this subsection, we are devoted to investigate the spherical
cosmology.
1. Cosmic radius. For a static spherical universe, its radius Rc satisfies that
Ṙc = 0, R̈c = 0.
V = 2π 2 R 3 .
Figure 7.14 (a) A 3D hemisphere, and (b) a 3D sphere piecing the upper and lower hemispheres
together.
For a static universe in a black hole with maximal radius Rc , the equation (7.5.45) becomes
8π G kc2
Ṙc = 0 ⇔ ρ= 2. (7.5.46)
3 Rc
(x1 , x2 , x3 , x4 ) = (Rc sin ψ sin θ cos ϕ , Rc sin ψ sin θ sin ϕ , Rc sin ψ cos θ , Rc cos ψ ). (7.5.47)
8GMtotal
Rc = for k = 1. (7.5.49)
3π c2
Remark 7.26 The mass Mtotal in (7.5.48) contains the energy contributed by the space
curvature, i.e.
Mtotal = M + space curved energy,
488 Chapter 7 Astrophysics and Cosmology
4π 3
M/ R = Mtotal /π 2R3c ,
3 c
we get the relation
3π
Mtotal = M. (7.5.50)
4
With the flat space mass (7.5.50), from (7.5.49) we get the Schwarzschild radius R s = Rc
for the cosmic black hole as follows
Rs = 2GM/c2 .
VΩ
MΩ;total = MΩ , (7.5.51)
|Ω|
where MΩ is the flat space mass in Ω, MΩ;total is the curved space mass, |Ω| is the volume of
flat Ω, Z
√
VΩ = gdx, g = det(gi j ),
Ω
Now, we return to the Friedmann model (7.5.18) with k = 1, which has the same form
as that of the globular dynamic equation (7.5.45), and is of the same maximal radius R c as
that in (7.5.49). Hence, it is natural that a static spherical universe is considered as if there
were two hemispherical black holes attached together. In fact, the static spherical universe
forms an entire black hole as a closed space.
4. Basic problems. Since a static spherical universe is equivalent to two globular uni-
verses to be pieced together along with their boundary, an observer in its hemisphere is as if
one is in a globular universe. Hence, the basic problems – the cosmic edge problem, flatness
problem, horizon problem, and CMB problem– can be explained in the same fashion.
The redshift problem is slightly different, and the gravitational redshift is given by
1
1+z = p , (7.5.52)
−g00 (r)
where r is the distance between the light source and the observer, and g 00 is the time-
component of the gravitational metric.
Due to the horizon of sphere, for an arbitrary point on a spherical universe, its opposite
hemisphere relative to the point plays a similar role as a black hole. Hence, in the redshift
7.5 The Universe 489
formula (7.5.52), g00 can be approximatively taken as the Schwarzschild solution for distant
objects as follows
Rs 2MG
−g00 = 1 − , Rs = , r = 2Rs − r
e for 0 6 r < Rs ,
r
e c2
where M is the cosmic mass of hemisphere, and e r is the distance from the light source to the
opposite radial point, and r is from the light source to the point. Hence, formula (7.5.52)
can be approximatively written as
√
1 2Rs − r
1+z = s = p for 0 < r < Rs . (7.5.53)
Rs α (r)(R s − r)
α (r) 1 −
r
e
where
α (0) = 2, α (Rs ) = 1, α 0 (r) < 0.
We see from (7.5.53) that the redshift z → ∞ as r → Rs , and, consequently, we cannot see
objects in the opposite hemisphere. Intuitively, α (r) represents the gravitational effect of
the matter in the hemisphere of the observer.
To see this, we consider a homogeneous universe with radius R < R s . Then the mass
density ρ is given by
3M
ρ= . (7.5.56)
4π R3
On the other hand, by Theorem 7.3, the condition for a ball B r with radius r to form a black
hole is that the mass Mr in Br satisfies that
Mr c2
= . (7.5.57)
r 2G
By (7.5.56), we have
4π 3 r3
Mr = r ρ = 3 M.
3 R
Then it follows from (7.5.57) that r
R
r= R. (7.5.58)
Rs
Actually, in general for a ball Br in a universe with radius R < Rs , if its mass Mr satisfies
(7.5.57) then it will form a black hole, and its radius r satisfies that
r
R
r6 R.
Rs
In particular, there must exist a black hole whose radius r is as in (7.5.58). Thus, we derive
the conclusion (7.5.55).
Based on (7.5.55) we can deduce that if the Universe were born to a Big-Bang and
continuously expands, then it would contain many black holes with smaller ones being
embedded in the larger ones. In particular, the Universe would contain a huge black hole
whose radius r is almost equal to the cosmic radius Rs . This is not what we observed in our
Universe.
The second difficulty of modern cosmology concerns with the Hubble Law (7.5.1),
which is restated as v = HR, where c/H = Rs . Consider a remote object with mass M0 , then
M0
the observed mass Mobser is given by Mobser = r . Consequently, the corresponding
v2
1− 2
c
gravitational force F to the observer with mass m is
Theorem 7.27 Assume a) the Einstein theory of general relativity, and b) the prin-
ciple of cosmological principle that the universe is homogeneous and isotropic. Then the
following assertions hold true for our Universe:
1) All universes are bounded, are not originated from a Big-Bang, and are static; and
2) The topological structure of our Universe can only be the 3D sphere such that to each
observer, the corresponding equator with the observer at the center of the hemisphere
can be viewed as the black hole horizon.
Theorem 7.28 If we assume only a) the Einstein theory of general relativity, and b’)
the universe is homogeneous. Then all universes can only be either a 3D sphere as given in
Theorem 7.27, or a globular universe, which is a 3D open ball B Rs of radius Rs , forming the
interior of a black hole with Rs as its Schwarzschild radius. In the later case, the Universe
is also static, is not originated from a Big-Bang, and the matter fills the entire Universe.
Also, the following assertions hold true:
1) The cosmic observable mass M and the total mass Mtotal , which includes both M
and the non-observable mass due to the space curvature energy, satisfy the following
relation (
3π M/2 for the spherical structure,
Mtotal = (7.5.59)
3π M/4 for the globular structure.
The difference Mtotal − M can be regarded as the dark matter;
2) The static Universe has to possess a negative pressure to balance the gravitational
attracting force. The negative pressure is actually the effect of the gravitational re-
pelling force, also called dark energy; and
3) Both dark matter and dark energy are a property of gravity, which is reflected in
both space-time curvature, and the attracting and repulsive gravitational forces in
different scales of the Universe. This law of gravity is precisely described by the new
gravitational field equations (4.4.10); see also (Ma and Wang, 2014e).
Mtotal = 4.7M.
This relation also suggest that the spherical universe case fits better the current understand-
ing for our Universe.
Second, due to the horizon of sphere, for an arbitrary point in a spherical universe, its op-
posite hemisphere relative to the point is as if it is a black hole. Hence the main contribution
to the redshifts is from the effect of the black hole, as explicitly given by (7.5.53).
492 Chapter 7 Astrophysics and Cosmology
Third, in modern cosmology, the view of expanding universe was based essentially on
the Friedmann model and the Hubble Law. The observations can accurately measure the
distances and redshifts for some galaxies, which allowed astronomers to get both measured
and theoretical data, and their deviation led to the conclusion that the expanding universe is
accelerating. The misunderstanding comes from the perception that the Doppler redshift is
the main source of redshifts.
v(r) ∼
= a constant for a given galaxy, (7.6.2)
as illustrated typically by Figure 7.15 (a), where the vertical axis represents the velocity
(Km/s), and the horizontal axis is the distance from the galaxy center (extending to the
7.6 Theory of Dark Matter and Dark Energy 493
galaxy radius). However, the calculation from (7.6.1) gives a theoretic curve as shown in
Figure f6.15(b), showing discrepancies between the mass determined from the gravita-
tional effect and the mass calculated from the visible matter. The missing mass suggests the
presence of dark matter in the Universe.
Figure 7.15 (a) Typical galactic rotational curve by Rubin, and (b) theoretic curve based on the
Newtonian gravitational law.
In fact, we have seen in Section 7.5.4 that the dark matter is a space curved energy, or
equivalently a gravitational effect, which is also reflected in the revised gravitational force
formula in which there is an additional attracting force to the classical Newtonian gravity.
2. Dark energy. Dark energy was first proposed in 1990’s, which was based on the
hypotheses that the Universe is expanding.
The High-z Supernova Search Team in 1998 and the Supernova Cosmology Project in
1999 published their precisely measured data of the distances of supernovas and the red-
shifts. The observations indicated that the measured and theoretical data have a deviation,
which was explained, based on the Hubble Law and the Friedmann model, as the accel-
eration of the expanding universe. The accelerating expansion is widely accepted as an
evidence of the existence of dark energy.
However, based on the new cosmology postulated in the last section, the dark energy is
a field energy form of gravitation which balances the gravitational attraction to maintain the
homogeneity and stability of the Universe.
3 1
R00 = Rtt ,
c2 R
1
Rkk = − 2 2 gkk (RRtt00 + 2Rt2 + 2c2 ) for 1 6 k 6 3,
c R
and by Tµν = diag(c2 ρ , g11 p, g22 p, g33 p), we have
1 c2 3p
T00 − g00 T = ρ+ 2 ,
2 2 c
1 c 2 p
Tkk − gkk T = gkk ρ − 2 for1 6 k 6 3,
2 2 c
1 1 3Rt
φ00 − g00 Φ = 2 φtt − φt ,
2 2c R
1 1 Rt
φkk − gkk Φ = 2 gkk φtt + φt for1 6 k 6 3.
2 2c R
we deduce that
3R0 00 3R0 3R0 p
φ +
000
φ = −8π G ρ +
0
ρ+ . (7.6.8)
R R R c2
It is known that the energy density ρ and the cosmic radius R (also called the scale
factor) satisfy the relation:
ρ0
ρ= , ρ0 the density at R = 1. (7.6.9)
R3
Hence, it follows from (7.6.9) that
ρ 0 = −3ρ R0/R.
3R0 00 24π G R0
φ 000 + φ =− 2 p. (7.6.10)
R c R
In addition, making the transformation
ψ
φ 00 = , (7.6.11)
R3
then, from (7.6.5), (7.6.7) and (7.6.9)-(7.6.11) we can deduce that
(R0 )2 φ 0 = 0. (7.6.12)
Denote ϕ = φ 00 , by (7.6.12), the equations (7.6.5), (7.6.7) and (7.6.10) can be rewritten
in the form
4π G 3p ϕ
00
R =− ρ+ 2 + R,
3 c 8π G
1 (7.6.13)
(R0 )2 = (8π Gρ + ϕ )R2 − c2 ,
3
3R0 24π G R0
ϕ0 + ϕ =− 2 p.
R c R
Only two equations in (7.6.13) are independent. However, there are three unknown
functions R, ϕ , p in (7.6.13). Hence, we need to add an additional equation, the equation of
state, as follows:
p = f (ρ , ϕ ). (7.6.14)
Based on Theorem 7.27, the model describing the static Universe is the equation (7.6.14)
together with the stationary equations of (7.6.13), which are equivalent to the form
3p
ϕ = −8π G ρ + 2 ,
c
(7.6.15)
c 4
p=− .
8π GR2
496 Chapter 7 Astrophysics and Cosmology
The equations (7.6.14) and (7.6.15) provide a theoretic basis for the static Universe,
including the dark energy.
Now, we need to determine the explicit expression for the equation (7.6.14) of state. It
is natural to postulate that the equation of state is linear. Hence, (7.6.14) can be written as
c2
p= (α1 ϕ − α2 Gρ ), (7.6.16)
G
where α1 and α2 are nondimensional parameter, which will be determined by the observed
data.
The equations (7.6.15) and (7.6.16) are the PID cosmological model, where the cosmo-
logical significants of R, p, ϕ , ρ are as follows:
In the static cosmology, dark energy is defined in the following manner. Let E ob be the
observed energy, and R be the cosmic radius. We define the observable mass and the total
mass as follows:
Eob
Mob = , (7.6.18)
c2
Rc2
MT = . (7.6.19)
2G
If MT > Mob , then the difference
∆E = ET − Eob (7.6.20)
From the PID cosmological model (7.6.15)-(7.6.17), we see that the dark energy ∆E
in (7.6.20) is essentially due to the dual gravitational potential ϕ . In fact, we infer from
(7.6.15) that
ϕ = 0 ⇔ R = 2Mob G/c2 (i.e. ∆E = 0),
(7.6.21)
ϕ > 0 ⇔ ∆E > 0.
Hence, dark energy is generated by the dual gravitational field. This fact is reflected in the
PID gravitational force formula derived in subsections hereafter.
If we can measure precisely, with astronomical observations, the energy (7.6.18) and the
cosmic radius R (i.e. MT of (7.6.19)), then we can obtain a relation between the parameters
α1 and α2 in (7.6.16). In fact, we deduce from (7.6.15) and (7.6.16) that
βp 1 + 24πα1
ρ+ = 0, β= . (7.6.22)
c2 α2 + 8πα1
As we get
∆M MT − Mob
= =k (k > 0). (7.6.23)
Mob Mob
Then by (7.6.22) and
3Mob c4
ρ= , p=− ,
4π R3 8π GR2
we obtain from (7.6.22) that
3α2 = 24kπα1 + k + 1. (7.6.24)
By the relation (7.6.24) from (7.6.22), we can also derive, in the same fashion as above,
the dark energy formula (7.6.23).
Hence, the PID gravitational field equation for the metric (7.6.25) is given by
1
Rµν − gµν R = −∇µν φ , r > r0 . (7.6.26)
2
where φ = φ (r) is a scalar function of r.
498 Chapter 7 Astrophysics and Cosmology
The formula (7.6.29) provides the precise gravitational interaction force exerted on an
object with mass m in a spherically symmetric gravitation field.
In classical physics, the field functions u and v in (7.6.29) are taken by the Schwarzschild
solution:
u 2GM v 2GM −1
e = 1− 2 , e = 1− 2 , (7.6.30)
c r c r
and φ 0 = φ 00 = 0, which leads to the Newton gravitation.
However, due to the presence of dark matter and dark energy, the field functions u, v, φ
in (7.6.29) should be an approximation of the Schwarzschild solution (7.6.30). Hence we
have
|rφ 0 | 1 for r > r0 . (7.6.31)
Under the condition (7.6.31), formula (7.6.29) can be approximatively expressed as
mc2 u 1 v
F= e − (e − 1) − rφ . 00
(7.6.32)
2 r
7.6.4 Asymptotic repulsion of gravity
In this section, we shall consider the asymptotic properties of gravity, and rigorously prove
that the interaction force given by (7.6.32) is repulsive at very large distance.
To this end, we need to make the following transformation to convert the field equations
(7.6.27) into a first order autonomous system:
r = es ,
w = ev − 1,
x1 (s) = es u0 (es ), (7.6.33)
s
x2 (s) = w(e ),
x3 (s) = es φ 0 (es ).
Step 1. Asymptotic flatness. For a globular matter distribution, its gravitational field
should be asymptotically flat, i.e.
It implies that x = 0 is the uniquely physical equilibrium point of (7.6.34) and the following
holds true:
x(s) → 0 if s → ∞ (i.e. r → ∞). (7.6.36)
(a) The solutions x(s, α ) of (7.6.34)-(7.6.35) are asymptotically flat in the sense of (7.6.36).
Namely, the initial values α are in the stable manifold E s of x = 0, defined by
In fact, by (7.6.31) and (7.6.33) we can see that all Schwarzschild solutions lie on the
line
L = {(x1 , x2 , 0)| x1 = x2 , x1 , x2 > 0}. (7.6.39)
In particular, the line L is on the stable manifold E s of (7.6.37).
Step 3. Stable manifold E s . The equations (7.6.34) can be written as
ẋ = Ax + O(|x|2),
where
0 −1 2
A = 0 −1 0 . (7.6.40)
1 −1 1
The dimension of the stable manifold E s is the number of negative eigenvalues of the matrix
A. It is easy to see that the eigenvalues of A are given by
λ1 = −1, λ2 = −1, λ3 = 2.
dim E s = 2.
7.6 Theory of Dark Matter and Dark Energy 501
Consequently, the initial value α of an asymptotically flat solution has only two independent
components due to α ∈ E s , which is of two dimensional. Namely, we arrive at the following
conclusion.
Physical Conclusion 7.29 In the gravitation formula (7.6.29) there are two free pa-
rameters to be determined by experiments (or by astronomical measurements).
In fact, the two free parameters will be determined by the Rubin rotational curve and the
repulsive property of gravity at large distance.
Step 4. Local expression of E s . In order to derive the asymptotic property of the grav-
itational force F of (7.6.29), we need to derive the local expression of the stable manifold
E s near x = 0. Since the tangent space of E s at x = 0 is spanned by the two eigenvectors
(1, 1, 0)t and (1, −1, −1)t corresponding to the two negative eigenvalues λ1 = λ2 = −1, the
coordinate vector (0, 0, 1) of x3 is not contained in E s . This implies that the stable manifold
can be expressed near x = 0 in the form
x3 = h(x1 , x2 ). (7.6.41)
Inserting the Taylor expansion for (7.6.41) into (7.6.34), and comparing the coefficients,
we derive the following local expression of (7.6.41) of the stable manifold function:
1 1 1 1
h(x1 , x2 ) = − x1 + x2 + x21 − x22 + O(|x|3 ). (7.6.42)
2 2 16 16
Inserting (7.6.41)-(7.6.42) into the first two equations of (7.6.34), we deduce that
1 1 3
x01 = −x1 − x21 − x22 − x1 x2 + O(|x|3 ),
8 8 4 (7.6.43)
1 1
x02 = −x2 + x21 − x22 − x1 x2 + O(|x|3 ).
4 4
The system (7.6.43) is the system (7.6.34) restricted on the stable manifold E s . Hence, its
asymptotic behavior represents that of the interaction force F in (7.6.29).
Step 5. Phase diagram of system (7.6.43). In order to obtain the phase diagram of
(7.6.43) near x = 0, we consider the ratio: x02 /x01 = dx2 /dx1 . Omitting the terms O(|x|3 ), we
infer from (7.6.43) that
1 1
dx2 x2 + x22 + x2 x1 − x21
= 4 4 . (7.6.44)
dx1 1 2 3 1
x1 + x2 + x2 x1 + x21
8 4 8
Let k be the slope of an orbit reaching to x = 0:
x2
k= as (x2 , x1 ) → 0.
x1
502 Chapter 7 Astrophysics and Cosmology
k = ±1, k = 2,
x2 = x 1 , x2 = 2x1 , x2 = −x1 ,
which divide the neighborhood of x = 0 into six invariant regions. It is clear that all physi-
cally meaningful orbits can only be in the following three regions:
1
Ω1 = −x2 < x1 < x2 , x2 > 0 ,
2
1 (7.6.45)
Ω2 = x2 6 x 1 6 x 2 , x2 > 0 ,
2
It is easy to show that the orbits in Ω1 with x1 > 0 will eventually cross the x2 −axis. Thus,
using the three invariant sets in (7.6.45), we obtain the phase diagram of (7.6.43) on x 2 > 0
as shown in Figure 7.16. In this diagram, we see that, the orbits in Ω 2 and Ω3 will not cross
the x2 -axis, but these in Ω1 with x1 > 0 will do.
Step 6. Asymptotic repulsion theorem of gravity. We now derive an asymptotic repulsion
theorem of gravity, based on the phase diagram in Figure 7.16. In fact, by (7.6.28) and
(7.6.29), the gravitational force F reads as
mc2 u 0
F =− e u. (7.6.46)
2
7.6 Theory of Dark Matter and Dark Energy 503
Figure 7.16 Only the orbits on Ω1 with x1 > 0 will eventually cross the x2 -axis, leading to the sign
change of x1 , and to a repelling gravitational force corresponding to dark energy.
It is known that
Hence, by x1 = ru0 (r) and (7.6.46), the phase diagram shows that an orbit in Ω 1 , starting
with x1 > 0, will cross the x2 -axis, and the sign of x1 changes from positive to negative,
leading consequently to a repulsive gravitational force F. Namely, we have obtained the
following theorem.
F → 0 if r → ∞. (7.6.47)
2) If the initial value α in (7.6.35) is near the Schwarzschild solution (7.6.31) with 0 <
α1 < α2 /2, then there exists a sufficiently large r1 such that the gravitational force F
is repulsive for r > r1 :
F > 0 for r > r1 . (7.6.48)
We remark that Theorem 7.30 is valid provided the initial value α is small because
the diagram given by Figure 7.16 is in a neighborhood of x = 0. However, all physically
meaningful central fields satisfy the condition (note that any a black hole is enclosed by a
huge quantity of matter with radius r > 0 2MG/c2 ). In fact, the Schwarzschild initial
values are as
δ 2MG
x1 (r0 ) = x2 (r0 ) = , δ= 2 . (7.6.49)
1−δ c r0
504 Chapter 7 Astrophysics and Cosmology
For example see (7.6.29), where the δ -factors are of the order δ 6 10 −1 , sufficient for the
requirements of Theorem 7.30.
The most important cases are for galaxies and clusters of galaxies. For these two types
of astronomical objects, we have
In fact, the dark energy phenomenon is mainly evident between galaxies and between clus-
ters of galaxies. Hence, (7.6.50) shows that Theorem 7.30 is valid for both central gravita-
tional fields of galaxies and clusters of galaxies. The asymptotic repulsion of gravity plays
the role to stabilize the large scale homogeneous structure of the Universe.
1 ∞
u0 (r) = ∑ ak (r − r0 )k .
r2 k=0
1 ∞
F= ∑ bk r k ,
r2 k=0
b0 = −mMG.
k0 > 0, k1 > 0.
Based on Theorem 7.30, ϕ (r) → 0 as r → ∞, and (7.6.55) can be further simplified as in the
form for r0 < r < r1 ,
1 k0
F = mMG − 2 − + k1 r , (7.6.56)
r r
506 Chapter 7 Astrophysics and Cosmology
where k0 and k1 will be determined by the Rubin rotational curve and the astronomical data
for clusters of galaxies in the next section, where we obtain that
r0 < r < r 1 ,
F(r1 ) = 0.
Both observational evidence on dark energy and Theorem 7.30 show that the distance r 1
exists. The formula (7.6.56) with (7.6.57) clearly displays the layered property of gravity:
attracting at short distance and repelling at large distance.
It is known that
3π
VΩ1 = |Ω|.
4
For VΩ2 , we propose that
2. Dark energy: the dual gravitational potential. The static universe is described by the
stationary solution of (7.6.13)-(7.6.14), which is given by (7.6.15)-(7.6.16). In the solution
a negative pressure presents, which prevents galaxies and clusters of galaxies from gravita-
tional contraction to form a void universe, and maintains the homogeneous distribution of
the Universe. The negative pressure contains two parts:
1 c2
p = − ρ c2 − ϕ (see (7.6.15)), (7.6.61)
3 24π G
where the first term is contributed by the observable energy, and the second term is the dark
energy generated by the dual gravitational potential ϕ ; see also (7.6.21).
By the Blackhole Theorem, Theorem 7.15, black holes are incompressible in their inte-
riors. Hence, in (7.6.61) the negative pressure
1
p = − ρ c2 , (7.6.62)
3
is essentially the incompressible pressure of the black hole generated by the normal energy.
By the cosmology theorem, Theorem 7.27, the Universe is a 3D sphere with a blackhole
radius. However, the CMB and the W MAP measurements manifest that the cosmic radius
R is greater than the blackhole radius of normal energy. By (7.6.21), the deficient energy is
compensated by the dual gravitational potential, i.e. by the second term of (7.6.61).
Based on Theorem 7.30, gravity possesses additional attraction and repelling to the
Newtonian gravity, as shown in the revised gravitational formula:
1 k0
F = mMG − 2 − + k1 r . (7.6.63)
r r
508 Chapter 7 Astrophysics and Cosmology
By using this formula we can explain the dark matter and dark energy phenomena. In
particular, based on the Rubin rotational curve and astronomical data, we can determine an
approximation of the parameters k0 and k1 in (7.6.63).
1. Dark matter: the additional attraction. Let Mr be the total mass in the ball with radius
r of galaxy, and V be the constant galactic rotational velocity. By the force equilibrium, we
infer from (7.6.63) that
V2 1 k0
= Mr G 2 + − k1 r , (7.6.64)
r r r
which implies that
V2 r
Mr = . (7.6.65)
G 1 + k0 r − k1 r 3
The mass distribution (7.6.65) is derived based on both the Rubin rotational curve and the
revised formula (7.6.63). In the following we show that the mass distribution Mr fits the
observed data.
We know that the theoretic rotational curve given by Figure 7.15(b) is derived by using
the observed mass Mob and the Newton formula
mMob G
F =− .
r2
Hence, to show that Mr = Mob , we only need to calculate the rotational curve vr by the
Newton formula from the mass Mr , and to verify that vr is consistent with the theoretic
curve. To this end, we have
v2r Mr G
= 2 ,
r r
which, by (7.6.65), leads to
V
vr = p .
1 − k 0 r − k1 r 2
As k1 k0 1, vr can be approximatively written as
1 1
vr = V 1 − k0 r + k02 r2 . (7.6.66)
2 4
It is clear that the rotational curve described by (7.6.66) is consistent with the theoretic
rotational curve as illustrated by Figure 7.15(b). Therefore, it implies that
Mr = Mob . (7.6.67)
The facts of (7.6.64) and (7.6.67) are strong evidence to show that the revised formula
(7.6.63) is in agreement with the astronomical observations.
7.6 Theory of Dark Matter and Dark Energy 509
The observations show that the constant velocity V in the Rubin rotational curve is about
V = 300km/s. Then we have
V2
= 1024kg/km
G
Based on physical considerations,
V2 1 1
k0 = − = 4 × 10−18Km−1 . (7.6.70)
G Mr1 r1
We can explain the dark matter by the revised formula (7.6.63). As the matter distribu-
tion Mr is in the form
V2 r
Mr = ,
G 1 + k0 r
then the Rubin law holds true. In addition, the revised gravitation produces an excessive
mass Me as
2 2
Me = MT − Mr = V r1 − V r1
,
1
G G 1 + k 0 r1
where M1 = V 2 r1 /G is the Newton theoretic value of the total mass. Hence we have
Me k 0 r1 4 Me
= = or = 5.
MT 1 + k 0 r1 5 Mr1
Namely, the additional mass M e is four time the visible matter Mr = MT − M.e Thus, it gives
1
an explanation for the dark matter.
We remark that the ratio M/M e ob = 5 is essentially the same as in (7.6.60). It shows
that the dark matter is a gravitational effect, reflected in both the space curvature and the
additional gravitational attraction.
2. Dark energy: asymptotic repulsion of gravity. If gravity is always attracting as given
by the Newton formula, then the cosmic homogeneity is unstable. In fact, It is known that
the average mass M and distance for the clusters of galaxies are as
M = 1014M = ∼ 1044Kg
(7.6.71)
= 108 pc ∼
r∼ = 1020 ∼ 1021 Km.
510 Chapter 7 Astrophysics and Cosmology
M 2 G ∼ 29
F =− = 10 N = 1028Kg. (7.6.72)
r2
However, astronomical observations indicate that no gravitational interaction between
clusters of galaxies. The Universe is isotropic, therefore no rotation to balance the huge
force of (7.6.72) in the clusters.
Thus, the new cosmology theorem, Theorem 7.27, suggests that gravity should be
asymptotically repulsive. Theorem 7.30 offers a solid theoretic foundation for the prop-
erty, based on which we derive the simplified gravitational force formula (7.6.63).
Now we consider the constant k1 in (7.6.63). Due to the astronomical fact that no grav-
itational force between clusters of galaxies, we have
By (7.6.71), we take r
2
r= × 1020km. (7.6.73)
5
Then we deduce from (7.6.63) that
k0 1
k1 r − − 2 = 0,
r r
which, by (7.6.70) and (7.6.73), leads to
In summary, we conclude that the dark matter and dark energy are essentially gravita-
tional effect generated by the gravitational potential field g µν , its dual field Φ µ and their
nonlinear interactions. They can be regarded as the gravitational field energy caused by g µν
and Φµ .
Bibliography
Davis, R., Harmer, D. S., and Hoffman, K. C. (1968). Search for neutrinos from the sun. Phys. Rev.
Lett. 20, 1205–1209, 10.1103/Phys. Rev. Lett., 20.1205.
Einstein, A. (1905). Über einen die Erzeugung und Verwandlung des Lichtes betreffenden heuristis-
chen Gesichtspunkt. Annalen der Physik, 322, 132–148, doi:10.1002/andp.19053220607.
Einstein, A. (1915). Die feldgleichungen der gravitation. Sitzungsberichte der Preussischen Akademie
der Wissenschaften zu Berlin, 844–847.
Einstein, A. (1916). The foundation of the general theory of relativity. Annalen der Physik, 354 (7),
769–822.
Englert, F. and Brout, R. (1964). Broken symmetry and the mass of gauge vector mesons. Phys. Rev.
Lett., 13 (9), 321–323.
Greiner, W. (2000). Relativistic Quantum Mechanics: Ware Equation. Berlin, Heidelberg, New York:
Springer.
Gross, D. J. and Wilczek, F. (1973). Ultraviolet behavior of non-abelian gauge theories. Phys. Rev.
Lett., 30, 1343–1346, 10.1103/Phys. Rev. Lett., 30.1343.
Guralnik, G., Hagen, C. R., and Kibble, T. W. B. (1964). Global conservation laws and massless
particles. Phys. Rev. Lett., 13 (20), 585–587.
Halzen, F. and Martin, A. D. (1984). Quarks and Leptons: An Introductory Course in Modern Particle
Physics. New York: John Wiley and Sons.
Harrison, E. (2000). Cosmology, the Science of the Universe. 2nd ed. Cambridge University Press.
Hernandez, M., Ma, T., and Wang, S. (2015). Theory of dark energy and dark matter. To appear.
Higgs, P. W. (1964). Broken symmetries and the masses of gauge bosons. Phys. Rev. Lett., 13, 508–
509.
Kaku, M. (1993). Quantum Field Theory, A Modern Introduction. London: Oxford University Press.
Klein, O. (1938). New Theories in Physics, 77 (Intern. Inst. of Intellectural Co-operation, League of
nations).
Landau, L. (1957). On the conservation laws for weak interactions. Nuclear Physics, 3, 1, 127–131,
doi:[Link]
Landau, L. D. and Lifshitz, E. M. (1975). The Classical Theory of Fields. Oxford: Pergamon Press.
Translated from the Russian by Morton Hamermesh.
Lee, T. D. and Yang, C. N. (1956). Question of parity conservation in weak interactions. Phys. Rev.,
104, 254–258, doi:10.1103/PhysRev.104.254.
Lee, T. D. and Yang, C. N. (1957). Parity nonconservation and a two-component theory of the neutrino.
Phys. Rev., 105, 1671.
Ma, T. (2011). Theory and Methods of Partial Differential Equations. Beijing: Science Press in Chi-
nese.
Ma, T. (2012). Physical World From the Mathematical Point of View: Geometrical Analysis, Gravi-
tational Field, and Relativity. Beijing: Science Press in Chinese.
Ma, T. (2014). Physical World From the Mathematical Point of View: Theory of Elementary Particles
and Unified Fields. Beijing: Science Press in Chinese.
Ma, T. and Wang, S. (2005). Bifurcation theory and applications, World Scientific Series on Nonlinear
Science. Series A: Monographs and Treatises, Vol. 53 (World Scientific Publishing Co. Pte. Ltd.,
Hackensack, NJ),.
Ma, T. and Wang, S. (2013a). Duality theory of weak interaction, IU ISCAM Preprint Series, #1302:
[Link]
Ma, T. and Wang, S. (2013c). Structure and stability of matter, IU ISCAM Preprint Series, #1303:
[Link]
Ma, T. and Wang, S. (2014a). Astrophysical dynamics and cosmology. Journal of Mathematical Study,
47:4, 305–378.
Ma, T. and Wang, S. (2014b). Color algebra in quantum chromodynamics, Isaac Newton Institute
Preprint # NI14001: [Link]
Ma, T. and Wang, S. (2014c). Duality theory of strong interaction. Electronic Journal of Theoret-
ical Physics 11:31, 101–124, iU ISCAM Preprint Series, #1301: [Link]
~iscam/preprint/[Link].
Ma, T. and Wang, S. (2014d). Field theory for multi-particle system, Isaac Newton Institute Preprint
# NI14057: [Link]
Bibliography 513
Ma, T. and Wang, S. (2014e). Gravitational field equations and theory of dark matter and dark energy,
Discrete and Continuous Dynamical Systems, Ser. A 34:2, 335–366, preprint, [Link]
org/abs/1206.5078.
Ma, T. and Wang, S. (2014f). On solar neutrino problem, Isaac Newton Institute Preprint # NI14058:
[Link]
Ma, T. and Wang, S. (2014g). Spectral theory of differential operators and energy levels of subatomic
particles, Isaac Newton Institute Preprint # NI14002: [Link]
preprints/[Link].
Ma, T. and Wang, S. (2014h). Unified field theory and principle of representation invariance. Applied
Mathematics and Optimization 69:3, 359–392, doi:0.1007/s00245-013-9226-0, preprint, http:
//[Link]/abs/1212.4893.
Ma, T. and Wang, S. (2015a). Unified field equations coupling four forces and principle of interaction
dynamics. Discrete and Continuous Dynamical Systems, Ser. A, 35:3, 1103–1138, preprint, http:
//[Link]/abs/1210.0448.
Ma, T. and Wang, S. (2015b). Weakton model of elementary particles and decay mechanisms.
Electronic Journal of Theoretical Physics 12:32, 139–178, iU ISCAM Preprint Series, #1304:
[Link]
Maki, Z., Nakagawa, M., and Sakata, S. (1962). Remarks on the unified model of elementary particles.
Progress of Theoretical Physics 28, 870–880.
Mikheev, S. P. and Smirnov, A. Y. (1986). Resonant amplification of ν oscillation in matter and solar-
neutrino spectroscopy, IL Nuovo Cimento C, 9:1, 17–26.
Milgrom, M. (1983). A modification of the Newtonian dynamics as a possible alternative to the hidden
mass hypothesis. Astrophysical Journal 270, 365–370, doi:10.1086/161130.
Milgrom, M. (2014). MOND laws of galactic dynamics. MNRAS, 437, 2531–2541, doi:10.1093/
mnras/stt2066, eprint 1212.2568.
Nambu, Y. (1960). Quasi-particles and gauge invariance in the theory of superconductivity. Phys. Rev.,
117, 648–663, doi:10.1103/[Link].117.648.
Nambu, Y. (2008). Spontaneous symmetry breaking in particle physics: a case of cross fertilization,
Slides for his Nobel Lecture.
Nambu, Y. and Jona-Lasinio, G. (1961a). Dynamical model of elementary particles based on an anal-
ogy with superconductivity. Phys. Rev. I, 122, 345–358, doi:10.1103/PhysRev.122.345.
Nambu, Y. and Jona-Lasinio, G. (1961b). Dynamical model of elementary particles based on an anal-
ogy with superconductivity. Phys. Rev. II, 124, 246–254, doi:10.1103/PhysRev.124.246.
Perlmutter, S. and et al. (1999). Measurements of ω and λ from 42 high-redshift supernovae. Astro-
phys. J., 517, 565–586.
Planck, M. (1901). Ueber das Gesetz der Energieverteilung im Normalspectrum. Annalen der Physik,
309, 553–563, doi:10.1002/andp.19013090310.
514 Bibliography
Politzer, H. D. (1973). Reliable perturbative results for strong interactions? Phys. Rev. Lett., 30, 1346–
1349, doi:10.1103/PhysRevLett.30.1346.
Pontecorvo, B. (1968). Neutrino experiments and the problem of conservation of leptonic charge.
[Link], 26, 984.
Popławski, N. (2012). Nonsingular, big-bounce cosmology from spinor-torsion coupling. Phys. Rev.
D, 85, 107502, doi:10.1103/PhysRevD.85.107502.
Quigg, C. (2013). Gauge Theories of the Strong, Weak, and Electromagnetic Interactions, 2nd ed.
Princeton Unversity Press.
Riess, A. G. and et al. (1989). Observational evidence from supernovae for an accelerating universe
and a cosmological constant. Astron. J., 116, 1009–1038.
Rubin, V. and Ford, J. W. K. (1970). Rotation of the andromeda nebula from a spectroscopic survey
of emission regions. Astrophysical Journal 159, 379–404.
Sakurai, J. J. (1994). Modern Quantum Mechanics, revised ed. Addison-Wesley Publishing Company,
Inc.
Salam, A. (1957). On parity conservation and neutrino mass, Nuovo Cimento, 5, 299–301.
Sokolov, A. A., Loskutov, Y. M., and Ternov, I. M. (1966). Quantum Mechanics. New York: Holt,
Rinehart & Winston, Inc.
Weisskopf, V. (1972). Physics in the Twentieth Century: Selected Essays. Cambridge, Massachusetts:
MIT Press.
Weyl, H. (1912). Das asymptotische Verteilungsgesetz der Eigenwerte linearer partieller Differential-
gleichungen (mit einer Anwendung auf die Theorie der Hohlraumstrahlung). Math. Ann. 71, 4,
441–479, doi:10.1007/BF01456804.
Wolfenstein, L. (1978). Neutrino oscillations in matter. Phys. Rev. D, 17, 2369–2374, doi:10.1103/
PhysRevD.17.2369.
Wu, C. S., Ambler, E., Hayward, R. W., et al. (1957). Experimental test of parity conservation in beta
decay. Phys. Rev., 105, 1413–1415, doi:10.1103/PhysRev.105.1413.
Yang, C. N. and Mills, R. (1954). Conservation of isotopic spin and isotopic gauge invariance. Phys.
Rev. 96, 191–195.
Zwicky, F. (1937). On the masses of nebulae and of clusters of nebulae. Astrophysical Journal, 86,
217–246.
Index
BEC conservation
Bose-Einstein condensation, 104 of weakton number, 302
Betti number, 141 conservation laws, 89, 264
Bianchi identity, 192, 474 conservative systems
Big-Bang, 457 dynamics of, 96
black hole, 402 control parameter, 418
existence of, 402 cosmic edge problem, 463
black holes, 432 cosmic radius, 466
geometric realization, 432 cosmological constant, 70
Blackhole Theorem, 395, 434 Cosmology Theorem, 470
Bohr atomic model, 166 Coulomb potential, 207
Bose-Einstein condensation covariant derivative, 63, 65, 66
Hamiltonian system for, 104 on a vector bundle, 119
boson, 254 critical δ -factor, 438
bound state, 367 current density, 51
bremsstrahlung, 310 curvature tensor, 67
mechanism of, 310
D
C
dark energy, 23, 24, 192, 471, 473
Chandrasekhar limit, 436 theory of, 476
charged Higgs, 211 dark matter, 23, 24, 192, 471
charged lepton, 366 dark matter and dark energy
mediator cloud, 324 the nature of, 485
weakton constituent, 294 de Broglie relation, 57
CKM matrix, 352 decay, 261, 304
classical electrodynamics, 82 decay type, 303
cluster of galaxies, 412 decoupling
CMB problem, 465 of the unified field theory, 209
color algebra, 316, 317 deep inelastic scattering, 227
color charge, 314 derivative on T M , 121
color index, 315, 317 derivative on T ∗ M , 121
formula for hadron, 318 derivative on Trk M , 122
color quantum number, 311 differential operators, 128
conjugate representation, 272 in spherical coordinates, 403
connection, 119 Dirac equations, 45, 57, 336
for GL(n) group, 121 Hamiltonian system for, 103
for SU(N) group, 120 Dirac matrices, 58, 334
for Lorentz group, 120 Dirac spinor, 58
Index 517
K M
KamLAND, 349
magnetic field, 54
KG equation, 56
magnetic potential, 207
Klein-Gordon equation, 56, 335
main driving force, 414
Hamiltonian system for, 104
pressure gradient, 415
Klein-Gordon fields, 87
relativistic effect, 415
Kruskal coordinate transformation, 434
stellar dynamics, 414
L thermal expansion force, 415
viscous friction, 415
Lagrangian action, 68 main-sequence stars, 413, 420
Lagrangian actions manifold with boundary, 143
in quantum mechanics, 86 mass, 257
Lagrangian dynamics, 80 mass charge, 180
Lagrangian for dynamics of charged par- mass generation, 196, 295
ticles, 84 mass of earth’s black hole core, 442
Laplace operator, 130 mass of solar black hole core, 441
Laplace-Beltrami operator, 131, 396, massive bound state, 362
404 massless bound state, 364
520 Index
Yang-Mills action, 74
Yang-Mills functional, 74
derivative of the, 146
Young tableaux, 278
Yukawa interaction mechanism, 182
Yukawa potential, 229
modified, 229, 230