Origins of Physics Laws and Symmetries
Origins of Physics Laws and Symmetries
Abstract
The laws of physics are constrained so that they select out no preferred coordinate
system or reference frame. This is called the principle of covariance. This principle can be
further generalized to include the coordinates in the abstract space of the functions used
to formulate those laws. This is called global gauge invariance. When this symmetry
applies independently at every point in space-time, it is called local gauge invariance.
These symmetries are almost all that are needed to derive most of the familiar laws the
law of physics, including classical mechanics, the great conservation laws, quantum
mechanics, special and general relativity, and electromagnetism. Those structures that
do not follow directly from coordinate invariance result from spontaneously broken
symmetries.
1.0 Introduction
Most laypeople think of the laws of physics as something like the Ten
Commandments—rules governing the behavior of matter imposed by some great
lawgiver in the sky. However, no stone tablet has ever been found upon which such
laws were either naturally or supernaturally inscribed. On the contrary, the laws of
2
physics are human inventions—mathematical formulas that quantitatively describe the
results of observations and measurements. These formulas are first inferred from and
then tested against observations. If they hold up, they are eventually reformulated as
part of general and universal theories that are derived from a minimum number of
assumed fundamental principles. Very often, a "law" will turn out to be nothing more
than a circular definition, such as Ohm's law which says that the voltage is proportional
to the current in a resistor, where a resistor is defined as a device that obeys Ohm's law.
Since the time of Copernicus and Galileo it has been realized that the laws of
physics should not single out any particular space-time reference frame, although a
distinction between inertial and noninertial frames was maintained in Newtonian
physics. That distinction was removed in 1916 by Einstein who formulated his general
theory of relativity in a covariant way. That is, the form of Einstein's equations is the
same in all reference frames, inertial or noninertial.
As this experience showed, physicists are highly constrained in the way they
may formulate the laws of physics. Not only must they agree with the data, the
equations that are used to describe that data should not be written in such a way as to
specify a privileged coordinate system or reference frame. This principle of covariance
generalizes other notions such as the Copernican and cosmological principles and the
principle of Galilean relativity. The application of this principle is not a matter of choice;
centuries of observations have shown that to do otherwise produces calculations that
disagree with the data in some reference frames.
In 1918, Noether showed that coordinate independence was more than just a
constraint on the mathematical form of physical laws.[1] She proved that some of the
most important physics principles are, in fact, nothing more than tautologies that follow
from space-time coordinate independence: energy conservation arises from time
translation invariance, linear momentum conservation comes from space translation
invariance, and angular momentum conservation is a consequence of space rotation
invariance. These conserved quantities were simply the mathematical generators of the
3
corresponding symmetry transformation.
As the twentieth century progressed, invariance or symmetry principles became
an increasingly dominant idea in physics. Not only were space-time coordinate
symmetries built into theories, the notion of coordinate independence was extended to
the abstract spaces physicists use to represent the other degrees of freedom of systems.
Rotational symmetry was also applied to the space of quantum state vectors, resulting
in derived properties of spin, isospin, charge, baryon number, and other observables
that agreed with measurements.
Charge conservation, for example, was found to follow from the invariance of
the Schrödinger equation to changes in the phase of the complex wave function. And
then, a remarkable discovery was made. It was found that the Schrödinger equation
could be made invariant to a local phase change in the wave function, that is, a change in
phase that varies from point to point in space-time, provided that vector and scalar
potentials were added. The potentials turned out to be exactly those that give the
classical electric and magnetic fields. This local quantum phase symmetry was precisely
related to the local classical gauge symmetry of electrodynamics. Maxwell's equations
were derived from a single principle—local phase invariance.
If we think of the Schrödinger wave function as a "vector" in 2-dimensional
complex space, then changing phase is equivalent to a rotation in that space and phase
invariance, or gauge invariance, is equivalent to rotational invariance. Indeed, the
generator of that transformation is the electric charge whose conservation follows from
global gauge invariance.
In the standard model, the fields associated with the weak and strong nuclear
forces are obtained by extending the idea of gauge symmetry to higher dimensions of
abstract space. There the situation is complicated by the fact that all the symmetries are
not exact at the "low temperatures" of current experimentation. Good thing. The
diversity and complexity of the universe is a result of broken symmetries, without
which we would not be here to do the experiments.
4
Twentieth century physics was also marked by the discovery that symmetries
are often broken. In the 1950s, it was found that weak interactions maximally violated
space reflection symmetry; that is, they were not invariant under the parity operation P
that changes the handedness or chirality of a system. In the 1960s certain rare decays
were found to be noninvariant under the combined operation CP, where C changes a
particle to its antiparticle. The study of the origin of CP violation remains a subject of
considerable experimental and theoretical effort to this date.
In this paper, it will be shown that much of familiar physics can be derived from
the generalized notion of coordinate invariance applied not only in space-time but in
the spaces of other observables and the spaces of the functions that are used to
mathematically describe physical phenomena. In order to make this result accessible to
the greatest number of people, the mathematical level will be limited to that of an
advanced undergraduate student in physics or mathematics. The equations will appear
very familiar—just those found in physics textbooks, and it may appear that the author
is using hindsight to make things come out the way they already are. However, the
reader is asked to look carefully at how those equations are obtained. Certain familiar
principles normally taken as axioms, such as the quantization of angular momentum
and the invariance of the speed of light will be derived from the hypothesized
symmetry principles without additional assumptions.
Assume y = y(q). The state vectors of quantum mechanics are familiar examples of y-
space vectors. We can imagine a set of coordinate axes in y-space. Extending the notion
of covariance to this space we will assume that the following principle holds: the laws of
physics cannot depend on the orientation of the vector y in y-space. This principle is called
gauge symmetry.
2-dimensional vector with coordinates (Re{y }, Im{y }). Let us perform a unitary
transformation on y:
y' = Uy (3.1)
where U† U = 1, so
That is, |y| is invariant to the transformation, as required by gauge symmetry. We can
q = eG (3.6)
U ≈ 1 + ieG (3.7)
.
where G† = G is hermitian and is called the generator of the transformation. Then,
7
y' ≈ y + ie G y (3.8)
Then
Gm = -i∂/∂qm (3.10)
Define
∂
Pm ≡ hG = -ih (3.11)
∂q
m
different from the reciprocal of the units of qm. The transformation operator can then be
† written
∂
U =1+ (3.12)
∂qm
†
which we recognize as the quantum mechanical operator for the x-component of
momentum. Note that this association was not assumed but derived and no connection
with mass and velocity has yet been made. This just happens to be the form of the
generator of a space translation. Similarly, we can take q2 = y, q3 = z and obtain the
generators Py and Pz .
provide a connection with the fully relativistic treatment we will make later, let qo ≡ ict,
where c is, like h, another arbitrary conversion factor. Later we will associate it with the
speed of light in a vacuum and find (not assume) that it is a Lorentz invariant. For now,
†
∂ h ∂
Po = ih =- (3.14)
∂q o c ∂t
†
9
We can then define
∂
H ≡ -iPoc = ih (3.15)
∂t
†
which we recognize as the quantum mechanical Hamiltonian (energy) operator. Note,
again, that this familiar result was not assumed but derived. No connection with the
physical quantity energy has yet been made. This just happens to be the form of the
generator of a time translation.
∂
Hy = ih (4.1)
∂t
†
This is the time-dependent Schrödinger equation of quantum mechanics, where y is
Ê i ˆ
y (t) = y (0)expÁ - Ht ˜ (4.2)
Ë h ¯
so †
Ê i ˆ
U(t) = expÁ - Ht ˜ (4.3)
Ë h ¯
†
10
So, let y be a linear vector and y be its dual. For simplicity, we take our linear
† †
y y =1 (4.4)
† y¢ = U y > (4.5)
† y U †U y = 1 (4.6)
f = Ay (4.7)
†
11
Then,
f Ay = 1 (4.8)
and †
f U † AU y = f ¢ A y = f ¢ f ¢ = 1 (4.9)
We can define †
A' = U† A U (4.10)
and write
f A¢ y = 1 (4.11)
In that case, †
Alternatively,
† define
12
Then, †
when the system is in the state y(0) is, in the Schrödinger picture,
where †
†
13
In the Heisenberg picture we have
and †
where †
Êi ˆ Ê i ˆ
A(t) = U † A(0)U = expÁ Ht ˜ A(0)expÁ - Ht ˜ (4.21)
Ëh ¯ Ë h ¯
Let us †
look further at the time evolution of operators. Suppose we make an
infinitesimal transformation in time t Æ t + dt. Then
i
U(t) = 1 - Ht (4.22)
h
†Ê i ˆ Ê i ˆ i
A(t + dt) = Á 1+ Ht ˜ A(t)Á 1- Ht˜ = A(t) - [ A, H ] (4.23)
Ë h ¯ Ë h ¯ h
Since †
∂A
A(t + dt) = a(t) + dt (4.24)
∂t
†
14
it follows that
∂A i
= - [ A,H ] (4.25)
∂t h
† observable then is
The time rate of change of an
dA ∂A ∂A dq k
= +Â (4.26)
dt ∂t k ≠0 ∂q k dt
or, †
n
dA i ∂A dq k
= - [A, H ] + Â (4.27)
dt h k =1
∂q k dt
†
where the sum excludes the time variable.
Now we move to gauge transformations involving the non-temporal variables
of a system. Consider the case where A = Pj . Then,
dPj n ∂P
i j dq k
dt
=-
h
[ ]
Pj ,H + Â
∂q k dt
(4.28)
k=1
†
Next, let us look at the transformation of these non-temporal variables. Let the variable
†
15
operator is
i
U =1+ Pe (4.29)
h k k
Thus, †
i
y(q k + e k ) = y (q k ) + P e y (q k ) (4.30)
h k k
Suppose we have an
† operator A defined by
A y (q k ) = f (q k ) (4.31)
†
where
Ê i ˆ Ê i ˆ i
A¢ = Á 1+ Pke k ˜ AÁ 1- Pkek ˜ = 1- [ A, Pk ] (4.34)
Ë h ¯ Ë h ¯ h
so, †
∂A i
= - [ A, Pk ] (4.35)
∂q k h
[Pj , Pk ] = 0 (4.36)
for j ≠ k, and so †
∂Pj
=0 (4.37)
∂q k
†
Recall (4.28),
dPk i ∂P dq j
= - [Pk ,H ] + Â k (4.38)
dt h j
∂q j dt
† all zero, so
The summed terms are
17
dPk i
= - [Pk ,H ] (4.39)
dt h
∂q k i
= 1 = - [ q k ,Pk ] (4.40)
∂q k h
or, †
[q k ,Pk ] = ih (4.41)
È h ∂ ˘ h ∂y h ∂
[q k ,Pk ]y = Íqk , i ∂q
˙y = q k
i
- q y = ihy
∂q k i ∂q k k
(4.42)
Î k˚
†
For example,
[x,Px ] = ih (4.43)
Thus, †
dPk ∂H
=- (4.45)
dt ∂q k
A = y Ay (4.46)
Aa =a a (4.47)
†
19
always yield the eigenvalue corresponding to that state.
The symbol a a stands for an operator that projects y onto the a axis.
† † †
† Âa a =1 (4.48)
a
In that case, the state vector of†a system will be the linear combination
y =Â a ay (4.49)
a
y (q) = q y (4.50)
relativistically) of the particles of the system. Momentum-space wave functions are also
† often used.
More generally, the eigenstates q are the basis states of a particular, arbitrary
representation, like the unit vectors i, j, and k of the Cartesian coordinate axes x, y, z.
† †
20
y (q) = Â y †i q i (4.51)
i
A = y † Ay (4.53)
y ¢ = Uy = exp(iq )y (4.54)
= (fx ,$fy ,$fz ), where fx is the angle of rotation about the x-axis, etc., then the generators
of the rotations about these axes will be the angular momentum components
(Lx ,Ly ,$Lz). Rotational invariance about any of these axes will lead to conservation of
†
Specifically, let us consider an infinitesimal rotation of the position vector r = (x, y) by df
And so,
†
dx = -ydf (5.3)
and
22
dy = x df (5.4)
∂f ∂f
f (x + dx, y + dy) = f (x,y) + dx + dy (5.5)
∂x ∂y
∂f ∂f
f (f + df) = f (f ) - ydf + xdf (5.6)
∂x ∂y
= f (f ) + idfGf
†
where
†
Ê ∂ ∂ˆ
G = -i Á x - y ˜ = xPy - yPx = Lz (5.7)
Ë ∂y ∂x ¯
†
the angular momentum about z. Similarly,
and
This result can be generalized as follows. If you have function that depends on a spatial
23
position vector r = (x, y, z), and you rotate that position vector by an angle q about an
where the direction of q is the direction of the axis of rotation. Once again this has the
From the previous commutation rules one can show that the generators Lx , Ly ,
[ J x , J y ] = ihJ z (5.13)
†
24
2
where J = J 2x + J 2y + J 2z is the square of the magnitude of J.
† J z j ,m = mh j ,m (5.15)
used to describe the quantum state of a system. That space has coordinate axes that are
†
Qq =qq (6.1)
Thus, †
25
y = Â ci qi (6.2)
i
q i q j = dij (6.3)
2 2†
so y y = 1 and c i = qi y is the probability for a measurement of Q giving the
Xx = xx (6.5)
often has nothng to do with waves. Since x is usually regarded as a continuous variable,
y-space
† is infinite dimensional. That is, x is not one axis but an infinite number of
axes, one for every real number x. Even if we assume that x is discrete in units of the
Planck length, and space is finite,
† we still have an awfully large number of dimensions.
If the particle is an electron, then y-space may also include the basis states + 12
and - 21 that are the eigenstates of the z-component of spin of the electron. Even
†
though spatial coordinates are more familiar than spins, 2-dimensional spin subspace is
the basis states + 12 and - 21 can be thought of as analogous to the unit vectors i and
j in the more familiar 2-dimensional subspace (x, y). The spin state y is in general a 2-
† † oriented at some arbitrary angle. The basis vectors define two
dimensional vector
possible orientations of the spin angular momentum vector†S in familiar 3-dimensional
space, one along the z-axis and the other opposite. (The choice of z-axis here is arbitrary
conventio). Thus, for example, if S points originally along the z-axis, a rotation of 180o
However, note that a rotation in y-space of only 90o takes the spin state from
† † Ê qˆ
U = expÁ i ˜ I (6.6)
Ë 2¯
More generally,
Ê s • qˆ
U = expÁ i ˜ (6.7)
Ë 2 ¯
† †
We see that U again has the form of a gauge†transformation. The generator of the
gauge transformation in the spin vector subspace of a spin 1/2 particle is the spin
angular momentum operator (in units of h ), S = s/2. We could also have obtained this
result from our previous proof that the gauge transformation for a rotation in 3-space
is †
U = exp(iL• q ) (6.9)
7. Special Relativity
Now we are ready to inject some familiar physics into the mix. It turns out to be most
elegant to do this within the framework of special relativity. But note that, as was the
case for quantum mechanics, the usual starting axioms will not be asserted. Rather they
will be derived from the assumption of gauge invariance.
Let us consider the first four variables (q0 , q1 , q2 ,q3 of our set {q} which we
have arbitrarily set to (x0 , x1 , x2 ,x3 ) = (ict, x, y, z), where t is the time and (x, y, z) are the
spatial coordinates of an event. The constant c is simply a factor that converts units of
time to units of distance. It will turn out to be the invariant speed of light in a vacuum,
but that is not being assumed at this point. Also, the assumption that q0 is an imaginary
number is not necessary; it just makes things easier to work out at this level of
sophistication.
28
Let x' = (x'0 , x'1 , x'2 , x'3 ) be the position of the event in reference frame moving
at a speed v = bc along the z-axis with respect to the reference frame x, where
is shown in many textbooks, the proper distance will be invariant if Lnm is the Lorentz
transformation operator
†
1 0 0 0
0 1 0 0
L nm = (7.2)
0 0 cosy siny
0 0 - siny cosy
where cosy = g, siny = ibg, and g = (1 - b2 )1/2. By writing it this way, we see that the
invariance in 3-space.
The complex angle y is a mathematical artifact of taking the zeroth component
of the 4-vector to be imaginary number and time a real number. We can make y real
Pk p k = pk pk (7.4)
where pk is the eigenvalue of†Pk when the system is in a state given by the eigenvector
pk . Similarly,
† HE =EE (7.5)
Ê E ˆ
p = (p0 , p1 , p2 , p3 ) = Á i , px , py , p z ˜ (7.6)
Ë c ¯
is invariant to rotations in†4-space. The invariant quantity m is called the mass of the
particle. Note that the length of the 4-momentum vector is (in the metric we have
chosen to use)
30
1/2
( pm pm ) = imc (7.8)
dPk ∂H
=- (7.9)
dt ∂q k
†
This definition allows us to connect the operator Pk with the operationally defined
Working with the operationally defined quantities, we can write (using boldface
type for familiar 3-dimensional spatial vectors)
dp • dr = -dEdt (7.10)
which is the rest energy. Next let us look at the particle in another reference frame
(x,$y,$z) in which the particle is moving along the z-axis at a constant speed v. Then, from
the Lorentz transformation, the 3-momentum of the particle in that reference frame
will be
Ê b ˆ
pz = gÁ p¢z + E¢˜ = g (0 + bmc) (7.13)
Ë c ¯
†
We can write this in vector form
p = g mv (7.14)
We note that p Æ mv when v << c, So, we have (finally) derived the well-known the
pc2 p
v= Æ (7.16)
E m
†
when v << c since, in that case, E = mc2 . We can also show that, for all v,
32
Ê 2 ˆ1/2
E = Á pc + m2 c 4 ˜ (7.17)
Ë ¯
†
This is a "free particle" since
dp
F= = -—E = 0 (7.18)
dt
†
More generally we can write
Ê 2 ˆ 1/2 1
T = Á pc + m2 c4 ˜ - mc2 Æ mv2 (7.20)
Ë ¯ 2
†
when v << c, is the kinetic energy, or energy of motion, and V(r) is the potential energy.
The force on the particle is then
F = -—V (7.21)
E = |p|c (7.22)
pc 2 pc 2
v= Æ =c (7.23)
E pc
†
Thus c is the speed of a zero mass particle, sometimes called "the speed of light." Since c
is the same constant in all references frames, the invariance of the speed of light, one of
the axioms of special relativity, is thus seen to follow from 4-space rotational symmetry.
So we have now shown that the generators of translations along the four axes of
space-time are the components of the 4-momentum, which includes energy in the
zeroth component and 3-momentum in the other components. These have their
familiar connections with the quantities of classical physics. Mass is introduced as a
Lorentz invariant quantity that is proportional to the length of the 4-momentum
vector. The conversion factor c is shown to be, as expected, the Lorentz-invariant speed
of light in a vacuum.
8. Classical Mechanics
Except for specific laws of force for gravity and electromagnetism, all of classical
mechanics can now be inferred from the above discussion. Conservation of energy,
linear momentum, and angular momentum follow from global gauge invariance in
space-time. Newton's first and third laws of motion follow from momentum
conservation. Newton's second law basically defines the force on a body as the time
rate of change of momentum,
34
dp
F= (8.1)
dt
†
Above we saw that, for the operators P and H,
dP
= -—H (8.2)
dt
†
The classical observables will correspond to the eigenvalues of these and so
dp
= -—E (8.3)
dt
dp
F- = -—V (8.4)
dt
9. Electromagnetism
In the following sections we will switch to the conventions used in slightly more
advanced physics so that the resulting equations agree with the textbooks at that level.
We have already seen that h and c are arbitrary conversion factors, so we will work in
units where h = c = 1. Furthermore, we will use a non-Euclidean (but still geometrically
†
†
35
flat) metric in defining our 4-vectors:
Ê1 0 0 0 ˆ
Á ˜
n Á 0 -1 0 0˜
hm = (9.1)
Á 0 0 -1 0 ˜
Á ˜
Ë 0 0 0 -1¯
2
pm hnm pn = E 2 - p = m2 (9.2)
This choice of metric has†the advantages of enabling us to directly identify the mass
with the invariant length of the 4-momentum vector and eliminating the need for
imaginary zeroth components.
In quantum mechanics, the state of a free particle is an eigenstate of energy and
momentum. Consider the 4-momentum eigenvalue equation for a spinless particle
(spin can be included, but this is sufficient for present purposes)
i∂m f = pm f (9.3)
† ∂f
where we now use the convention ∂m f ≡ . The quantity f is the eigenfunction
∂xm
f (x) = x pm and can be thought of as having two abstract dimensions, its real and
†
imaginary parts. If we rotate the axis in this space by an angle q we have the gauge
† transformation,
36
position x. This is the type of gauge invariance we have already considered, what we
call global gauge invariance. The generator of the transformation, q, is conserved.
transformation and
[ ( )]
∂m f ¢ = exp[iq (x)] ∂m + i ∂m q f (9.5)
Dm = ∂m + iq Am (9.6)
= exp(iq)Dmf
Pm = -i∂m (9.9)
Pm = -iDm (9.10)
Writing
Pm = Pm + qAm (9.11)
and A is the 3-vector potential. We will further justify this connection below. As already
mentioned, q(x) = -q x(x) and thus q is conserved when x(x) is a constant. Also, note that
Ê 2 ˆ Ê 2 ˆ ∂y
Á P - qA + qV)˜y = Á -ih— - qA + qV)˜y = ih (9.12)
Ë ¯ Ë ¯ ∂t
†
In quantum field theory, the basic quantity from which calculations proceed is
the Lagrangian density. The Klein-Gordon Lagrangian density for a spinless particle of
mass m is
L = - 12 ∂m ∂m f + 21 m2 f 2 (9.13)
L = - 12 DmDm f + 21 m2 f 2 (9.14)
DmDm f + m2 f = 0 (9.15)
Spin 1/2 particles of†mass m are described by the Dirac Lagrangian which
similarly can be made gauge invariant by writing it, using conventional notation,
L = iy g m Dmy - my y (9.16)
ig mDmy - my = 0 (9.17)
1 mn
L=- F Fmn + m2A Am Am (9.18)
16p
where †
The first term in L is gauge invariant while the second is not unless we set mA = 0. This
leads to the deeply important result that particles with spin 1 whose Lagrangians are
locally gauge invariant are necessarily massless. The photon is one such particle.
However, other spin 1 fundamental particles exist with nonzero masses. These masses
result from broken gauge symmetry, as we will briefly discuss below.
In any case, the existence of a vector field Am associated with a massless spin 1
electromagnetic potential.
To see the classical connection, note that
40
A' = A + —x (9.21)
Thus, B may be interpreted as the familiar classical magnetic field 3-vector; the above
equation is Gauss's law of magnetism, one of Maxwell's equations. The zeroth
component of the 4-vector potential,
∂x
A¢o = Ao + (9.24)
∂xo
can be written †
∂x
V¢ = V - (9.25)
∂t
†
41
which implies that the 3-vector
∂A ¢
E ¢ = -—V¢ -
∂t
∂x ∂A ∂—x
= -—V + — - - (9.26)
† ∂t ∂t ∂t
∂A
= -—V - =E
† ∂t
† Furthermore,
is also locally gauge invariant.
Ê ∂A ˆ
— ¥ ÁE + ˜ = -— ¥ —V = 0 (9.27)
Ë ∂t ¯
so †
∂(— ¥ A) ∂B
—¥E=- =- (9.28)
∂t ∂t
U = exp(iq) (10.1)
can be trivially thought of as a1x1 matrix. The set of all such unitary matrices comprises
the transformation group U(1). The generators of the transformation, q, form a set of
Ê1 ˆ
U = expÁ is • q˜ (10.2)
Ë2 ¯
U = exp(iA) (10.3)
we have
43
detU = exp(TrA) (10.4)
È 1 ˘
U = expÍ- igt ¥ q˙ (10.5)
Î 2 ˚
matrices and we use a different symbol just to avoid confusion with spin. While the spin
S = s/2 is a vector in familiar 3-dimensional space, t is a 3-vector in some more abstract
space we will call isospin space. The 3-vector T = t/2 is called the isospin or isotopic spin.
Global gauge invariance under SU(2) implies conservation of isospin. The quarks and
leptons of the standard model have T = 1/2. The quantity g is a constant analogous to
the electric charge that measures the strength of the interaction.
Once again it is important not to confuse isospin space with the 2-dimensional
subspace of the state vectors on which U operates. When the isospin space 3-vector x(x)
depends on the space-time (yet another space) position 4-vector x we once more have a
local gauge transformation. The generators being like angular momenta do not
mutually commute, so the transformation group is non-abelian. This type of non-abelian
gauge theory is called a Yang-Mills theory.
Let us attempt to make this clearer by rewriting U with indices rather than
boldface vector notion:
È 1 ˘
U = expÍ- igt kx k (x)˙ (10.6)
Î 2 ˚
†
44
where the repeated Latin index k is understood as summed from 1 to 3.
Encouraged by our success in obtaining the electromagnetic force from local
U(1) gauge symmetry, let us see what we can get from local SU(2) symmetry.
Following the U(1) lead, we define a covariant derivative
1
D m = ∂m + igt k Wmk (10.7)
2
†
where W mk are three 4-vector potentials analogous to to the electromagnetic 4-vector
potential Am. As before, the introduction of the fields W mk maintains local gauge
†
invariance. Or, we can say that local gauge invariance implies the presence of three 4-
vector potentials W mk . In the standard model,†these are interpreted as the fields of the
weak interaction.
In†quantum field theory, a particle is associated with every field, the so-called
quantum of the field. The spin and parity of the particle, JP , is determined by the
transformation properties of the field. The quantum of a scalar field has JP = 0+ ; a
vector field has JP = 1. For the electromagnetic field described by the potential Am, the
quantum is the photon. Since Am is a vector field, the photon has spin 1. It is a vector
gauge boson.
Similarly, the weak fields W mk will have three spin 1 particles as their
quanta—three vector gauge bosons W-, Wo, and W+ , where the superscripts specify the
electric charges of the particles.
† These can also be viewed as the three eigenstates of a
particle with isospin T = 1.
If the U(1) symmetry of electromagnetism and the SU(2) symmetry of the weak
interaction were perfect, we would see the photon and three W bosons above.
45
However, these symmetries are broken at the "low" energies at which most physical
observations are made, including those at the current highest energy particle
accelerators. This symmetry breaking leads to a mixing of the electromagnetic and
weak forces. Here, briefly, is how this comes about in what is called unified electroweak
theory.
The covariant derivative for electroweak theory (assumed, not derived) is
written
Y t
Dm = ∂m + ig1 Bm + ig2 k W mk (10.8)
2 2
term. The quantity Y is a constant called the hypercharge generator that can take on
different values in different applications, a detail that need not concern us here. The
SU(2) term includes a constant g2 , the vector T = t/2, or isospin, and the vector field
W mk ,$k$= 1,"2,"3.
the quanta of the fields W mk ,"k$=$1,"2, constitute the vector gauge bosons of the
†
electroweak sector of the standard model. Their mixing is also like a rotation, gauge
symmetry being
† broken in this case,
46
Ê Am ˆ Ê cosq W sin qW ˆÊ Bm ˆ
Á ˜ =Á ˜Á ˜ (10.9)
Ë Zm ¯ Ë - sin q W cosq W ¯ÁË Wmo ˜¯
where the rotation angle qw is called the Weinberg (or weak) mixing angle. This parameter
†
is not determined by the theory and must be found from experiment. The current value
of sin2 qw = 0.23115. The constants that determine the strength of the interaction are
photon is massless, the W± and Z bosons have large masses. These masses are shown to
arise from another symmetry-breaking process called the Higgs mechanism. The
symmetry-breaking is apparently spontaneous, that is, not determined by any known
deeper physical principle. Spontaneous symmetry breaking describes a situation, like
the ferromagnet, where the fundamental laws are symmetric and obeyed at higher
energy, but the lowest energy state of the system breaks the symmetry.
Moving beyond the weak interactions and SU(2), we have the strong
interactions and SU(3). In general, for SU(n) there are n2 - 1 dimensions in the subspace.
Let us add the new term to the previous ones that included the electroweak forces
Y t l
Dm = ∂m + ig1 Bm + ig2 k W mk + ig 3 a Gam (10.11)
2 2 2
summed from 1 to 3 in the SU(2) term and the repeated index a is summed from 1 to 8
in the SU(3) term. The la are eight traceless 3x3 matrices analogous to the three Pauli
47
2x2 isospin matrices tk, and the Gam are eight spin 1 fields analogous to the singlet field
Bm and the triplet field W mk . of the electroweak interaction. The gauge bosons in this case
†
are eight gluons. The symmetry is not broken, so they are massless. Global gauge
invariance under
† SU(3) implies the conservation of another quantity called color charge.
While there is much more to the standard model, this should suffice to illustrate
its basis in gauge symmetry and the importance of spontaneous broken symmetry.
dy = 0,
† †
† d 2y
=0 (11.1)
dt 2
Also, †
48
d 2y o
=0 (11.2)
dt 2
Furthermore, †
1 2
(dt )2 = (dt)2 - 2
2 dy = (dt) (11.3)
c
†
Let us work in units where c = 1. We can write the above in 4-vector form,
d 2y a
=0 (11.4)
dt 2
Next, let us consider a coordinate system xa fixed to a second body such as the
earth, or any system that may be in relative acceleration. The equation of motion can be
transformed to that coordinate system as follows:
d Ê ∂y r dx m ˆ
Á ˜=0 (11.5)
dt ÁË ∂x m dt ˜¯
d 2x l dx m dxn
2 + Glmn =0 (11.6)
dt dt dt
†
49
where
2
∂x l ∂ y r
Glmn = (11.7)
∂y r ∂x m ∂xn
d 2x k
= Gkoo = gk (11.8)
dt 2
for k = 1,2,3, where g = (g1 , g2 , g3 ) is the Newtonian field vector ("acceleration due to
†
gravity"). Thus, the Gkoo elements of the affine connection are just the Newtonian
gravitational field components in the limit of low speeds. Additional elements then are
needed to describe gravity at speeds near the speed of light.
The Newtonian field vector for any distribution of mass can be obtained from
the gravitational potential f which is in general a solution of Poisson's equation
— 2f = 4pGr (11.9)
Gm
f (r) = - (11.11)
r
density and energy is the zeroth component of a 4-vector. Let us search for a covariant
quantity to replace density.
Suppose we have a dust cloud in which all the dust particles are moving
slowly, that is, with v << c, in some reference fame. Let the energy density in that
frame be ro . Let Eo be the rest energy of each particle (c = 1) and no be the number per
r o = Eo no (11.12)
r = E n = g Eo g no = g2 ro (11.13)
where g is the Lorentz factor, E = g Eo, and n = g no. To see the latter, note that n =
dN/dV, where dN is the number of particles in the volume element dV, dNo = dN, and
Note that r is not simply the component of a 4-vector because of the factor g2 .
51
Rather it must be made part of a second-rank tensor. We can write
Tmn = r ov m v n (11.14)
Too = r ov o v o = g 2 r o = r (11.15)
comprise energy and momentum flows in various directions: Toi is the energy flow per
unit area in the i-direction, that is, a heat flow; Ti i is the flow of momentum component i
per unit area in their direction, the pressure across the i plane; Ti j is the flow of
momentum component i per unit area in the j-direction, the viscous drag across the j-
plane; Tio is the density of the i component of momentum.
most, a very weak form of Mach's principle that was much earlier proposed by Leibniz.
Leibniz had objected to Newton's notion of an absolute space with respect to which
bodies accelerate and argued that at least another body must be present for space and
time concepts to be useful.[4]
52
In what has become the standard model of general relativity, Einstein related
Gmn to the curvature of space-time in a non-Euclidean geometry. In non-Euclidian
Einstein assumed that Gmn is a function of gmn and its first and second derivatives.
1
Rmn - gmn R + Lg mn = -8pGTmn (11.18)
2
where Rmn and R are contractions of the rank four Riemann curvature tensor. To see the
†
explicit forms of these quantities, consult any textbook on general relativity.
The quantity L is the infamous cosmological constant. It is often reported in the
media and in many books on cosmology that the cosmological constant was a "fudge
factor" Einstein introduced to make things come out the way he wanted. Perhaps that
was his motivation, but the fact is that unless one makes further assumptions, a
cosmological constant is required by Einstein's equations of general relativity and
should be kept in the equations until some principle is found that shows it to be zero.[5]
For many years the measurements of the cosmological constant gave zero within
measuring errors, but in the past two decades Einstein's fudge factor has resurfaced
again in cosmology.
12. Conclusions
The sophisticated reader who at least has glanced at the equations in this paper will
53
recognize them as very familiar. Almost every one can be found in standard textbooks.
What has been attempted here is to show that those equations do not follow from very
unique or very surprising physical properties of the universe. Rather, they arise from
the very simple notion that whatever mathematical "laws" you write down to describe
measurements, your equations cannot depend on the origin or direction of the
coordinate systems you define in the space of those measurements or the space of the
functions used to describe those laws. That is, they cannot reflect any privileged point of
view. Except for the complexities that result from spontaneously broken symmetries,
the laws of physics may be the way they are because they cannot be any other way. Or,
at least they may have come about the simplest way possible. Table 11.1 summarizes
these conclusions.
Acknowledgements
Special thanks to Brent Meeker and other members of the avoid-L discussion group for
their help with this manuscript.
Table 1. The laws and other basic ideas of physics and their origin.
Law/idea of Physics Origin
Conservation of momentum Space translation symmetry
Conservation of angular momentum Space rotation symmetry
Conservation of energy
(First law of thermodynamics) Time translation symmetry
Newton's 1st Law of Motion Conservation of momentum
(space translation symmetry)
Newton's 2nd Law of Motion Definition of force
Newton's 3rd Law of Motion Conservation of momentum
(space translation symmetry)
Second law of thermodynamics Statistical definition of the arrow of time
Special relativity Space-time rotation symmetry
Invariance of speed of light Space-time rotation symmetry
54
General relativity Principle of covariance
Quantum time evolution
(time-dependent Schrödinger equation) Global gauge invariance
Quantum operator differential forms Global gauge invariance
Quantum operator commutation rules Global gauge invariance
Quantization of action Global gauge invariance
Quantization rules for angular momenta Global gauge invariance
Maxwell's equations of electromagnetism Local gauge invariance under U(1)
Quantum Lagrangians for particles in
presence of electromagnetic field Local gauge invariance under U(1)
Conservation of electric charge Global gauge invariance under U(1)
Masslessness of photon Local gauge invariance under U(1)
Conservation of weak isospin Global gauge invariance under SU(2)
Electroweak Lagrangian Mixing of U(1) and S(2) local gauge
symmetries (spontaneous symmetry
breaking)
Conservation of color charge Global gauge invariance under SU(3)
Strong interaction Lagrangian Local gauge invariance under SU(3)
Masslessness of gluon Local gauge invariance under SU(3)
Structure of the vacuum (Higgs particles) Spontaneous symmetry breaking
Doublet structure of quarks and leptons Conservation of weak isospin
(global gauge invariance under SU(2))
Masses of particles Higgs mechanism
(spontaneous symmetry breaking)
55
References
1. E. Noether, Nachr. d. König. Gesellsch. d. Wiss. zu Göttingen, Math-phys. Klasse,
235 (1918), p. 7; English translation M. A. Travel, Transport Theory and Statistical
Physics 1(3), 183 (1971). See also Nina Byers, in Proceedings of the Israel
Mathematical Conference 12 (1999). Online at
[Link] This
contains links to Noether's original paper including an English translation.
4. Jennifer Trusted, Physics and Metaphysics: Theories of Space and Time, Paperback
edition (London and New York: Routledge, 1994), p. 90.