Theoretical Physics Thermodynamics, Electromagnetism, Waves, An
Theoretical Physics Thermodynamics, Electromagnetism, Waves, An
i
c
-
THEORETICAL PHYSICS
This book is in
THE ADDISON-WESLEY SERIES IN PHYSICS
THEORETICAL PHYSICS
Thermodynamics, Electromagnetism,
Waves, and Particles
by
F. WOODBRIDGE CONSTANT
Jarvis Professor of Physics
Trinity College
t
i
PREFACE
physics can one realize the unity of physics and its broad philosophical
aspects, a truly stimulating picture and a liberal education! More ad¬
vanced and specialized study may appropriately follow.
The author wishes to express sincere appreciation to Professor Francis
W. Sears for his careful reading of the manuscript and many helpful
suggestions.
July 1957 F. W. C.
i
CONTENTS
Part I. Thermodynamics
5- 1 Introduction.76
5-2 The state of a gas.76
5-3 Phase space.77
5-4 Fundamental hypotheses.78
5-5 Probability.79
5-6 Boltzmann’s distribution law.81
5-7 Maxwell’s distribution law.85
5-8 Energy distribution.87
5-9 Evaluation of the constants B and /3.87
5-10 Experimental verification of Maxwell’s speed distribution . 90
5-11 Entropy and probability .90
5- 12 The equipartition of energy principle.93
9- 1 Introduction.171
9-2 Ampere’s law of force between current elements.171
9-3 Moving charge equivalent to a current element.173
9-4 The magnetic field. Definition of B.173
9-5 Magnetic induction and force due to a current circuit. . . . 175
9-6 Magnetic forces on current circuits.177
9-7 The circuital form of Ampere’s law.179
9-8 The curl of B.182
9-9 The magnetic scalar potential.185
9-10 The divergence of B.188
9-11 The vector potential. 189
References.350
Index.360
*
' ■
,
Part I
THERMODYNAMICS
Including
KINETIC THEORY
and
STATISTICAL MECHANICS
«
CHAPTER 1
or it might be the air inside a tire jdiat is being inflated. However, we shall
usually deal with systems containing a definite quantity of matter; these
are called closed systems and they are characterized by a constant mass,
although not necessarily by a constant volume. Such systems may ex¬
change energy, but not matter, with the surroundings (i.e., adjoining
systems). Thus 18 kilograms (one kilogram-mole) of water may be frozen,
partly vaporized, or entirely vaporized and still constitute the same closed
system. Henceforth systems referred to will be assumed to be closed
unless otherwise stated.
such as the internal energy, the entropy, etc., each new variable must be
functionally related to any two of the others. One of the problems of
thermodynamics is to find these relations.
I *
Tx + dT T i + 2 aIT
w- =
■1 /
lim (ir) > (1-1)
Pf-> 0 \Pf'
Ts - T i = 100°. (1-2)
Ts
lim (1-3)
Ti p —*o
Ti = 273.15° ± 0.03°,
(1-5)
Ts = 373.15° ± 0.03°.
(b) Method since 1954. The temperature and pressure at which ice,
liquid water, and water vapor coexist in equilibrium is called the triple
point of water. On the temperature scale just defined the temperature Tt
of this triple point was determined to be 273.16°K. Since 1954 the interval
between 0°K and Tt has been arbitrarily set equal to 273.16°. This means
that Eq. (1-2) is replaced by
Tt = 273.16°. (1-6)
£ = lim(4 d-7)
Tt Pt-^o\Pt/
where pt is the pressure of the gas at the temperature Tt and the right
side of Eq. (1-7) is subject to experimental determination.
This currently accepted definition of the Kelvin scale requires only one
fixed point, while method (a) involved two.
8 THERMODYNAMIC VARIABLES [CHAP. 1
t = T + b,
t = T - 273.15. (1-9)
1-6 Work. One way in which a system may exchange energy with its
surroundings is through work. If a force F produces an infinitesimal dis¬
placement ds, then the work done is defined as the scalar product F-ds.
For a finite displacement the work IE is /F*ds.
Let F be a force exerted by a system on its surroundings. If IE = /F-ds
is positive, the system does work on the surroundings, while if IE is negative
the surroundings do work on the system. When a gas in a cylinder expands
by pushing a piston up, IE is positive; when a gas is compressed, IE is
negative. The unit of work is the unit of energy. This unit in the mks
system is the joule, or newton-meter.
Suppose that a system is enclosed by a boundary of any shape, as in
Fig. 1-3, and that the pressure of the system is p. Consider a portion of
the surface of area A, so that the force exerted by the system against this
portion of the boundary is
F = pA.
If this portion moves out a distance ds, the work 5 IE done by the system is
5IE = pA ds.
But A ds — dV, the increase in the volume of the system. Hence we have
8W = p d V. (1-10)
W = fpdV. (1-11)
If we compare the upper and lower paths between a and b in Fig. 1-5,
we see that the area under the curve (the work done) is greater for the
upper path. Thus the work cannot be expressed in terms of the initial
and final states only. It is then impossible to assign a definite value of W
to a given state.
To summarize:
(1) Work (W) around a closed path is not always zero.
(2) Work (W) depends on the path.
(3) Work (W) is not a thermodynamic function.
(4) BW is not an exact differential.
1-7 Heat. Since the time of Joule it has been recognized that when a
system takes in heat it is simply being given energy. In thermodynamics
we consider that a system may exchange energy with its surroundings in
two ways, (1) through work and (2) through heat. Heat then, like work,
will be measured in terms of the energy unit, the joule. In calorimetry
and engineering practice the usage of special heat units, the calorie and the
Btu, is well established, but such additional units are really unnecessary.
The calorie is just a certain fixed number of joules (it is now defined as
4.186 joules), just as the yard is defined as a certain fixed fraction of the
meter. (For contrast compare the case of foreign exchange where the ratio
of, say, the dollar to the franc varies from time to time and place to place.)
A system will be considered to “take in heat” when, as a result of a
difference in temperature between the system and its surroundings,
energy flows into the system. Heat is energy in transit. It will be more
completely defined in connection with the first law of thermodynamics.
Energy given to a system as a result of heat processes will be denoted by
the letter Q.
Let us return to Example 1 of Section 1-4, in which we considered a gas
expanding to twice its original volume at constant temperature. If the
gas expands by rushing into an evacuated vessel, it is found that to main¬
tain its temperature it need gain practically no energy through heat
processes (Q ^ 0). On the other hand, if the gas is allowed to expand
slowly (reversibly) by pushing up a piston, then the gas must be permitted
to gain energy through some thermal process or the temperature of the gas
will fall. The gas may “take in heat” (Q > 0) through conduction by
placing it in contact with surroundings at the temperature which is to be
maintained. Evidently Q, the heat taken in, depends on how the gas is
allowed to expand. If the reversible expansion process is represented by
a curve on a p-V diagram, then Q, like W, depends on the path followed
between two given states. Since the heat taken in depends on how the
system reaches a given state, it is meaningless to speak of the “heat in
the system” at that state. The term “heat” refers to a flow of energy and
not to stored energy.
PROBLEMS 11
Problems
1. Suppose that for a certain gas T' find r - T[. Is T' - T[ = 50° ?
ps = 1.4 atm when p; = 1.0 atm. If 5. Refer to method (b) of defining
further readings of ps and pi at lower the Kelvin temperature scale. Suppose
densities are not taken and if we as¬ that the value of Ft in Eq. (1-6) were
sume that TS/Ti = ps/pi, Ts — Ti = increased by 1%, how would Ts — T<
100°, find Ti and Ts. be affected?
2. If we assume Eq. (1-1) and Eq. 6. How is 0°K defined on the gas
(1-4), but in place of Eq. (1-2) we thermometer scale? Show how this
take Ts t- T i = 180°, then we obtain definition is independent of the fact
the Rankine or “Fahrenheit absolute” that all actual gases liquefy (and
scale, (a) Find T» and Ts on this scale, solidify) before 0°K is reached.
(b) Find absolute zero on the Fahren¬ 7. Compute the work done by a gas
heit scale. when expanding from a volume Va to
3. Suppose that we assume lim^^o a volume U6 if the process is (a)
(pjpi) = 1.366 and that there are isobaric, at constant pressure p, (b)
100° between the ice and steam points, isothermal, at constant temperature T.
but in place of Eq. (1-1) we take Assume the equation of state pV =
T'/T'i = limPi_*0 (p/pd2- (Primes RT, where R is a constant, (c) Repre¬
refer to this new scale of tempera¬ sent the work in (a) and the work in
tures.) Find T\ and T's. (b) graphically on a p-V diagram.
4. (a) Find limPi_>o(p/Pi) at T = 8. (a) Express a pressure of one
323°K (50°C) from Eq. (1-1). atmosphere (76 cm of mercury) in
Evaluate the square of this quantity, n/m2.
that is, limPi_>o(p323/p273)2- (b) Compute the work done due
(b) Set T'/T'i = limPi_»o(p/Pi)2, to expansion when 1 kg of water
as in problem 3. Use the value (volume = 1000 cm3) is converted
of T[ found in problem 3 and your into 1 kg of steam (volume = 1.67 m3)
result in (a) above to find the value of at atmospheric pressure, expressing
T' corresponding to 323°K. For this the answer in both joules and calories.
CHAPTER 2
(2-1)
This is the way we define change in internal energy. But of course the
first law must be more than a definition of AU. What the law further
states (and this is most important) is that AU is the same for all processes
leading from the same initial state to the same final state. In other words,
if the system is taken from a state a, specified by a definite temperature,
pressure, and volume, to another state b by two different paths (on a
p-V diagram), AU is the same for each path. If the two paths are denoted
by the subscripts 1 and 2, respectively, the first law of thermodynamics
may be written as
Uh - Ua = Qx - Wx = Q2 - W2. (2-2)
This gives the first law an experimental basis. While we cannot measure
U directly, we shall see that Q as well as W can be measured, and so we
can verify experimentally that Qi — Wj does equal Q2 W2 in Eq. (2-2).
An immediate consequence of the above is that we can think of there
being a definite internal energy associated with any given state of a system.
If Ua is the internal energy for state a and if, in going from state a to
state b, there is an invariable change AU, then for state b the internal
energy must have the definite value Ub = Ua + AU. This means that U
is a thermodynamic variable. As stated in Section 1—3, in the case of a
chemically homogeneous gas only two thermodynamic variables are
independent; in such a case U must be a function of any two other thermo¬
dynamic variables, say p and V, V and T, or p and T.
If Eq. (2-2) were not true we could take the system from state a to
state b by one path and back to state a by the other, ending with a different
internal energy for state a than at the start. Now if a system could have
its energy changed without any other apparent change, this would amount
to energy being created or destroyed, which contradicts experimental
observation.
When trying to understand the first law of thermodynamics it is helpful
to recall the significance of Newton’s second law of motion (F = ma).
Newton’s law introduced the concept of mass and stated that it is constant
for a given particle; defining the unit of mass arbitrarily, the equation
F — ma may be used to measure force. Similarly, the first law of thermo¬
dynamics introduces the new concept of change in internal energy and
14 THE LAWS OF THERMODYNAMICS [CHAP. 2
states that this change is constant for all paths between two given states,
as stated by Eq. (2-2); this equation may then be used to measure the
quantity Q.
For a process involving a very small change between neighboring states
we may call the change in internal energy, dU, an exact differential,
since U is a thermodynamic variable. Equation (2-2) may then be
written as
dU = 8Q - 8W, (2-3)
that is, the difference between two inexact differentials, 8Q and bW,
equals an exact differential.
Suppose that a system is taken from state a to state b by (1) giving the
system Q units of heat, but not letting any work be done (as when heating
a beaker of water over a burner). Then suppose that one may also bring
the system from state a to state b (2) without giving it heat, but by doing
a certain amount of mechanical work W on the system (see the example
at the end of the last section). In Eq. (2-2) Qi = Q, W\ — 0, Q2 = 0,
W2 = —IE, so that Q — W. The work IE may be measured, and hence
Q may be determined experimentally. This experiment was originally
done by Joule, and when carefully performed we find that 4.186 joules
will raise the temperature of 1 gm of water from 14.5°C to 15.5°C. When
the calorie was defined as the heat required to raise 1 gm of water from
14.5°C to 15.5°C, the above experiment was taken as determining the
mechanical equivalent of heat J to be 4.186 joules per calorie. Now the
calorie is regarded as merely another energy unit defined as 4.186 joules
(just as 1 inch equals 2.54 cm) and the above experiment is taken as a
measurement of what is termed the specific heat capacity c of water at 15°C.
The definition of c is
(2-4)
From what has just been said we see that this statement is equivalent
to the following one, by Clausius:
A third statement of the second law is the following, known as the Kelvin-
Planck law:
2-4 The Carnot cycle. From the theoretical standpoint the most im¬
portant cycle is the Carnot cycle, which consists of four reversible processes,
alternately isothermal and adiabatic, as shown in Fig. 2-1. In this cycle
. i "•
i............■■■..■■■■■■■I
■ . Gas \\
1
Wf
7-li-'-y X V %v
Insulation
m
Tx
(a) (b) (c)
Fig. 2-2. Steps in a Carnot cycle.
Ub - Ua = Q - W.
Flere Ub = Ua for the complete cycle and the net heat taken is
Q = 28Q = Q2-Qi,
so that here
W = Q2 - Qx. (2-5)
W = Q2 Qi = Q2 — Qi-
The net heat taken per cycle from the hot reservoir, AQ2, is then
AQ2 = Q2 Q2 — Q2 Qi T~ Qi — Q2 = Qi — Qi-
AQ1 = Qi Qi = AQ2.
and transfers the heat Q2 — Q2 from a hotter to a cooler body. If Q'2 > Q2,
the second law is not contradicted. However, if Q2 > Q2, then Q'2 — Q2
is negative and we have a device in which each cycle has the sole result of
transferring heat Q2 — Q 2 from the colder to the hotter reservoir, in
violation of the second law. The second law thus requires that
Theorem. The efficiency of a Carnot engine is the greatest possible for any
If E and E' are both Carnot engines, then either one can be run back¬
wards by the other. Following the reasoning above we can show, first,
that we must have Q2 > Q2, and then, when the roles of E and E' are
reversed, we must also have Q2 > Q2. Both conditions cannot be satisfied
unless
Q'2 — Q2,
and the efficiencies W/Q2 and W/Q2 are the same. We have proved the
From these two theorems we may conclude that the efficiency of a Carnot
engine operating between two given temperatures is independent of the
working substance used and is a function only of the temperatures T2
and Tl
Ts — T i — 100°, (2-7)
’Jioo — (2-8)
Qs — Q (2-9)
Qs
(d) Let
Vx Vx _
X = (Ts Ti) 100 (2-10)
”100 vioo
T — Ts — X, (2-11)
Examples. (1) If t]x = O.5?7100, then X = 50 and T is 50° lower than Ts,
i.e., halfway between the boiling and freezing points of water. This is a defini¬
tion of 50°C, and it does not depend on the thermal properties of any particular
substance.
(2) If 7]x = 1.4r7100, then X = 140 and T is 140° lower than Ts. T now corre¬
sponds to —40°C.
(3) The same formulas may be used to define T when Q > Qs. From the
second law, we see that T now cannot be less than Ts, for we must consider
our Carnot engine to be running backwards as a refrigerator, taking in heat Qs
and giving up more heat Q at a higher temperature. t]x will now be negative.
Suppose 7] x = — 0.5t7100, then X = —50 and T is 50° above Ts. This corresponds
to 150°C.
(f) Combining Eqs. (2-8), (2-9), (2-10), and (2-11), we can write
Ts - T _ Qs-Q
(2-12)
Ts - T{ Qs-Qi’
T « Q)
or
T = aQ, (2-13)
Ts = 373.15°,
Q2 — Qi _ T 2 — Ti (2-14)
v - Q2 t2
Example. A Carnot engine operates between the boiling and freezing points
of water at atmospheric pressure. Taking T2 = Ts = 373 and Ti = Ti = 273 ,
we find
100
0.268.
V ~ 373
22 THE LAWS OF THERMODYNAMICS [CHAP. 2
-sp 8Q Q2 Op
(2-15)
2^ t ~ T2 Tx‘
8Q _ jl
(2-16)
T ~ a
for a Carnot cycle. In each step the heat SQ' given to the surroundings by
the system is 8Q' = —8Q. We see that
£f = 0 (2-17)
-v -V
Fig. 2-3. Any reversible cycle may Figure 2-4
be divided into Carnot cycles.
2-7] ENTROPY 23
Equations (2-16) and (2-17) apply to each strip separately and hence to
the whole cycle, since going clockwise around all of the strips is equivalent
to going around the whole cycle. Replacing the summation sign by the
symbol for integration around a closed path, we have
/ (2-18)
Hence
f 8Q_f 5Q_f 8Q
1 T ~ J T~ .IT’
acb bda bea
or
(2-19)
acb adb aeb
Theorem. The integral of bQ/T is the same for any reversible process
leading from a given initial state to a given final state.
^ = dS, (2-20)
where 8Q is the heat taken in during a very small reversible change. Here
S must be a function of the thermodynamic variables and hence is itself
a thermodynamic variable [see paragraph (2) below]. The function S is
called the entropy of the system. It is an extensive quantity. When a
24 THE LAWS OF THERMODYNAMICS [CHAP. 2
(2-21)
This says, in words, that to evaluate the change in entropy of a system between
any two states we must take the system along a reversible path connecting the
states, divide the heat given to the system along an element of the path by the
corresponding Kelvin temperature, and find the integral of this quotient for
the whole path.
We see that while 8Q itself is not an exact differential, there are two ways
of obtaining an exact differential from 8Q. One is to subtract from 8Q
another inexact differential 8W and so obtain (according to the first law)
the exact differential dU. The other method is to divide 8Q by T and ob¬
tain (according to the second law and for a reversible process) the exact
differential dS.
In connection with the definition of entropy, the two following points
should be noted:
(1) We have defined only change in entropy, just as we define change in
potential energy in mechanics and change in the electric potential in elec¬
trostatics. In mechanics it is often convenient to arbitrarily assign the value
of zero to the potential energy at some given point (sea level, floor level,
etc.) and then to compute the potential energy at any other point relative
to the reference point. In this way values may be assigned to the potential
energy at different points. Similarly, here in thermodynamics, the entropy
of a given state of the system may arbitrarily be called zero. Thus in the
case of water, the entropy of the liquid at 0°C and a pressure of one at¬
mosphere is assumed to be zero when we wish to compile a table giving the
entropy of water over a wide range of temperatures and pressures. On the
other hand, Nernst has pointed out that there is justification for regarding
the state of zero entropy as the crystalline state at absolute zero. While
important to physical chemists, the absolute value of the entropy of a
system need not concern us here.
(2) Equation (2-20) may be used to compute changes in entropy for
reversible processes only. If S is to be a thermodynamic function and have
a certain value Sa for the state a and another definite value Sb for the state
b, then Sb — Sa must have the same value no matter how the system
goes from a to b, reversibly or irreversibly. We have seen that this is so
for all reversible processes. We now complete the definition of entropy
by saying that Sb — Sa is taken to have the same value for an irreversible
process as for a reversible one between the same two end states. To com¬
pute Sb — Sa for an irreversible process one of the following methods
may be used:
2-7] ENTROPY 25
i
(a) Devise a reversible process connecting the same two end states and
compute the entropy change for this process, using
56 - = (2-21)
Solution. If such an ideal gas expands freely into an evacuated vessel no heat
is taken in or given out. However, this does not mean that the change in entropy
is zero, for free expansion is an irreversible process. We must replace such a
process with a reversible one. This may be done by imagining the gas to be in a
cylinder with a piston which is allowed to move slowly by gradually reducing
the pressure on its exterior.
In the free expansion process both W and Q are zero. Hence, from Eq. (2-1),
Ub — Ua = 0, or Ub = Ua. Since U is a thermodynamic function of, say, V
and T, Ub must equal Ua for the reversible expansion process as well. In the
reversible process, when the gas does the work 8W pushing the piston out, it
must be given, in order to maintain T (and U) constant, the heat
8Q = SW = p dV.
R In 2.
Sb — Sa = R In 2,
as in Example 1.
Solution. As the first stage of the process we may consider the water to be
heated slowly in a reversible manner, as in Example 2 of Section 1-4. Since T
is not constant, we must integrate dS = 8Q/T. We thus obtain
373
= 75,000 In
273
= 23,600 joules/kg-mole-deg.
AS2 = j,f 8Q
Lu
= ~T
40.7 X 106
373
= 109,100 joules/kg-mole-deg.
ASi = Cp In ^
a
323
= 75,000 X In
273
= 12,670 joules/kg-mole-deg.
AS2 = Cv In ^
J- a
= 75,000 X In —
= —10,850 joules/kg-mole-deg.
A Su = AS + A S', (2-22)
where AS is the entropy change of the system undergoing the process and
A S' is the entropy change of the rest of the universe. The “rest of the
universe” may be considered to consist of two parts: (a) the immediate
surroundings with which the system exchanges energy and (b) the re¬
mainder of the universe, which is not affected during the process. If the
system whose p, V, and T are being observed is isolated from its surround¬
ings, then (b) will include all of the universe except the system. However,
28 THE LAWS OP THERMODYNAMICS [CHAP. 2
The same conclusion holds when the system loses a small amount of heat
to its surroundings, as well as when there is no exchange at all. We have
the
Now let us turn to irreversible processes. All natural processes fall into
this class, since we can never completely eliminate friction or take the time
to perform an infinite number of infinitesimal reversible steps. We saw in
the examples of the last section that when an irreversible process occurred
in an isolated system, the system actually gained entropy. In the case of
a system isolated from its surroundings, the latter are not affected, and
so we have
Qa ~ Qi . T2 - Tj
Q'2 t2 ■
This leads to
Qi , __ T\
Q'2 t2 ’
Qi . Qi
Ti T2 ’
Thus the cold reservoir gains more entropy than the hot reservoir loses
and there is a net gain in entropy for the surroundings of the engine. The
system within the engine returns to its original state during each cycle,
so that AS — 0. Hence
ASu > 0.
This holds for all actual engines, since we cannot build an engine without
some friction. As a result, some of the work W is spent against friction
and thus turned back into heat at a temperature lower than T2, which
results in lowered efficiency.
There are many natural processes in which a system or reservoir gains
internal energy at the expense of energy stored in some available form.
Examples include falling bodies, the release of a stretched film or spring,
the release of a compressed gas, the coming to rest of a pendulum or a golf
ball, the discharge of a battery, and other uses of chemical energy. In
these examples it may be noted that the source gives up available stored
energy, like the P.E. of a stone at the top of a cliff, or the P.E. of a stretched
string, or the K.E. of a moving object. Such energy is quite different from
the internal energy U, for the former is not a function of such thermo¬
dynamic variables as T and V, while U is. This available stored energy
may be used (1) to increase the available stored energy of some other
object, and (2)' to do work against friction. Natural processes always
involve some of (2). Doing work against friction produces a local rise in
temperature and results in a flow of heat into the surroundings, which,
as has been shown, means a net gain in entropy for the universe. Hence
in all the above processes ASu > 0. We thus have the
30 THE LAWS OF THERMODYNAMICS [CHAP. 2
This leads us to the following fourth way of stating the second law:
Solution, (a) Since the T and p of the resistor are constant (its state is un¬
changed), AS = 0.
(b) The water (i.e., the surroundings) takes in the 1200 joules as heat at a
constant temperature of 300°K; hence
AS = "300” = 4 l°ules/deg-
Example 2. Repeat the preceding example for the case where the resistor is
thermally insulated from its surroundings.
Solution. The electrical resistance is like friction to the flow of the electrons
constituting the electric current. Thus the 1200 joules are added to the wire,
and raise its temperature 1200/10 = 120°. A reversible process leading to the
same end result would involve letting the resistor take in 1200 joules as heat,
and for such a process we obtain
/•420
AS / *2 10 dT
'300 T T
420
10 In 3.36 joules/deg,
300
AS' = 0,
Example 3. A bullet whose K.E. is 1200 joules buries itself in the ground.
If the ground temperature is 27°C, find the entropy change of (a) the bullet, (b)
the ground, and (c) the universe.
Solution. This problem is exactly like Example 1 except that the available
stored energy used up is kinetic rather than chemical. We have (a) AS = 0,
(b) AS' = 4 joules/deg, and therefore (c) ASu = 4 joules/deg.
Example 4. An engine working between 127°C and 27°C takes in 1200 joules
of heat from the hot reservoir. Its efficiency is 80% of that of a Carnot engine
operating between these two temperatures. Find (a) W, (b) Qi, and (c) AS„ per
cycle.
Solution, (a) A Carnot engine working between 127°C and 27°C would have
an efficiency of 100/400 = £, or 25%. The actual engine then has an efficiency
of 0.8 X 25% = 20%. Q2 = 1200 joules. Hence
W
0.20,
1200
W = 240 joules.
960
AS' _-™“ +
400 300
= -3.0 + 3.2
= 0.2 joule/deg.
The —3.0 joules/deg is the entropy lost by the hot reservoir and the +3.2
joules/deg is that gained by the cooler reservoir. We thus have
2-9 Entropy and availability of energy. We saw in the last section that
in natural processes there is a gain in the entropy of the universe and a loss
in the stored available energy in the universe. Thus an increase in entropy
is accompanied by a decrease in the energy available for work in the
universe. This energy is not lost or destroyed but, according to kinetic
theory, becomes associated with a random thermal motion of molecules,
atoms, or electrons, and cannot be put to our use. Kelvin first recognized
this 'principle of the degradation of energy. It leads us to a fifth statement of
the second law, as follows:
32 THE LAWS OF THERMODYNAMICS [CHAP. 2
Problems
is decreased 10% (30°), (c) T\ is de¬ tropy change per cycle of each reser¬
creased 50°. voir and that of the universe.
6. A Carnot refrigerator is operated 10. A 1-kg stone at 27°C falls 102
between 250°K and 300°K and requires meters into a lake whose temperature is
250 joules of work per cycle. Find (a) 27°C. Find the entropy change of (a)
the heat absorbed from the low- the stone, (b) the lake, and (c) the
temperature reservoir, (b) the heat universe.
ejected at the higher temperature, (c) 11. Repeat problem 10 for the case
the coefficient of performance. where the stone is lowered reversibly
7. A refrigerator operates between into the lake. Compare the loss in
250°K and 300°K, requiring 250 joules available energy in the two cases.
of work per cycle, but it has only one- 12. Compute the entropy per kilo¬
half the coefficient of performance of gram-mole of ice at —20°C relative to
the Carnot refrigerator in problem 6. that of water at 0°C, using C = 37,500
Find the heat absorbed from the cold joules/kg-mole-deg and, for the heat
box and the heat ejected into the room, to melt ice, L/ = 6 X 106 joules/kg-
per cycle. mole.
8. Compute the net gain in entropy, 13. A kilogram-mole of an ideal gas
ASU) per cycle (a) in problem 6, (b) in is compressed reversibly at constant T
problem 7. to half its original volume. Compute
9. A reversible engine takes in 2000 the change in entropy of (a) the gas,
joules from a reservoir at 400°K; it (b) the surroundings, and (c) the
then gives up Q[ joules to a reservoir at universe.
300°K and Qi joules to a reservoir at 14. Repeat problem 13 for the cycle
200°K, doing 750 joules of work per consisting of a reversible compres¬
cycle, (a) Compute the efficiency and sion to half volume followed by a
compare with that of a Carnot engine free expansion back to the original
working between 400°K and 200°K. volume.
(b) Find Q[ and Qi. (c) Draw the cycle 15. Draw a Carnot cycle on a T-S
on a p-V diagram, (d) Find the en¬ diagram.
CHAPTER 3
THERMODYNAMIC RELATIONS
The various terms used in thermodynamics have been defined and the
fundamental laws have been stated; it remains to express the laws in
convenient mathematical form and to derive certain relations from them.
In this chapter we shall not make an exhaustive survey of such relations,
but rather content ourselves with the derivation of some of the more
important ones. In this way the mathematical technique employed in
thermodynamics will be illustrated.
dU = 8Q - 8W. (3-1)
8Q = T dS, (3-2)
8W = pdV. (3-3)
dU = T dS — p dV. (3-4)
Equation (3-4) is based on and expresses the content of both the laws
of thermodynamics. This is the fundamental equation of thermodynamics,
just as (d/dt)(mv) = F is the fundamental equation of mechanics. Note
that Eq. (3-4) involves only thermodynamic variables and exact differ¬
entials whose values do not depend on whether a process is reversible or
irreversible.
The above three equations all refer to the same functional relationship,
and so the equation of state can be expressed in the general form
F(p, V, T) = 0. (3-8)
RT V
V =
V p
which leads to
Unless 7 is unity,
(?z)
\dp/T
= _Z
p’
(d_Z)
\dT/p
_ zp _ zT
Hence
V V
dV = --dp-Y-dT.
Similarly, we find
= (3-u)
Cy = (3-13)
In the case of solids and liquids there is not a great deal of difference
between Cv and Cv, but for gases Cp is considerably the greater. This is
because at constant pressure a gas expands and does work, this work
requiring the addition of more heat for a given increase in T. The ratio of
specific heats 7 is defined as
7 = C7fi- (3-14)
(3-15)
38 THERMODYNAMIC RELATIONS [CHAP. 3
and call V i and V2 the respective volumes for the two states, we have
/3 = i(«Z)
77l =
) = the coefficient of expansion,
V \dTj„
and
dU 8U
dU = dS dV. (3-17)
,dS/ v \d V/ s
T =
<£),- -’AS).-
Further differentiation yields
dA = dU — d(TS)
= -pdV — SdT.
(3-19)
dG = dU - d(TS) + d(pV)
= -SdT + V dp,
(3-20)
dH = dU + d(pV) = T dS + V dp,
(3-21)
(3-22)
Solution. This example illustrates the care that must be taken, in thermo¬
dynamic problems, to express all quantities in the same system of units. Let us
use the mks system.
2.68 cm of mercury = 0.0268 m of mercury.
Density of mercury = 13.6 gm/cm3 = 0.0136 kg/cm3 = 13,600 kg/m3.
g = 9.8 m/sec2.
(dp/dT)v = (0.0268) X (13,600) X (9.8) = 3570 newtons/m2-deg.
Lv = (9.7 X 106) X (4.186) = 4.06 X 107 joules/kg-mole. Hence
V2 - Vi
3-6] THE GIBBS-HELMHOLTZ EQUATION 41
bW = S de, (3-23)
where S is the electromotive force (work per unit charge) of the cell at
the temperature T of the experiment. In this case we see that 8 takes the
place of p, and e that of V. Equation (3-4), which expresses the combined
first and second laws, now becomes
dU = TdS - 8 de (3-24)
(3-25)
(as) =l(zQ\
\ de)x T V de)t
1_
(3-26)
T
(3-27)
Heat of reaction = Fv
de / t
(3-28)
3-7 Ideal gases. Here we shall define an ideal gas as one which obeys
pV = a, (a constant), (3-29)
3-7] IDEAL GASES 43
U = U(T). (3-30)
From the kinetic theory viewpoint this amounts to neglecting the volume
occupied by the molecules themselves, and to assuming no intermolecular
forces. At low pressures and high temperatures actual gases closely
approach this ideal.
Let us consider p as p(V, T) and write
*= + (l)/r- (3-31)
From Eq. (3-29), V dp + p dV = 0 at constant T, or
(dp\ _ p_
(3-32)
\d V) t V
(eA =(ds\
\dT/v \dV/t
i(sq\
~ T\dV/T
5Q = dU + pdV — + pdV,
so that
(ifi)
\;i V/t
= V-
Hence
(8p\ = P (3-33)
\dTJv T
dp — — ~ydV + ipdT,
dp ,dV_dT
(3-34)
p + V T
44 THERMODYNAMIC RELATIONS [CHAP. 3
In p + In V = In T + In R,
In (pV) = In (RT),
pV = RT, (3-35)
which is the equation of state for an ideal gas. When V refers to a kilogram-
mole, R is the gas constant per kilogram-mole. It is found experimentally
that R has the same value for one kilogram-mole of any ideal gas.
Solution. Changing to mks units, as in the example at the end of Section 3-5,
we have
8310 joules/kg-mole-deg.
V\ ,= _ (dS/dV)p _
&V) s (dS/dp) v
The value of the right side of this equation will not be altered if we multiply
the numerator and denominator by equal quantities, such as those in
Eqs. (3-19) and (3-20), Maxwell’s second and third relations. Using also
the definitions of Cp, Cv, and y given in Eqs. (3-12), (3-13), and (3-14),
we get
(dp) 0dS/dV)p(dV/dT)p(dS/dV)T
\dV/s (dS/dp)v(dp/dT) y(dS/dp)T
(dv\ _ _yp
\dV/s v ’
dp dV
In p = —y In V + In b,
pdV
(3-38)
T
since at constant volume 8W = 0 and all heat given to the gas goes to
increase its internal energy. From Eq. (3-35),
P R
T V
Hence we have
dT_ dV_
dS = Cv R
T V
Since Cy and R are constant, integration between states a and b shows that
-feMft)-
Then we have
r _ T(as'\ dU , fdV\
Lp ~ 1 \dTs dT^V \dT/p
— Cy d- R- (3-41)
Thus we have
J
Sb — Sa = Cv In (3-42)
nte)’
which may be reduced to the form given in Example 1 of Section 3-1.
Joule and Thomson (who later became Lord Kelvin) later improved
on the free-expansion experiment by throttling the expansion so as to
permit a continuous flow of gas from a region of higher pressure p i to one
of slightly lower pressure p2. This was done by separating the two regions
by a plug of porous material, as shown in Fig. 3-1 (a). We see that a
thermometer is placed on each side of the plug to record the respective
temperatures. The device is thermally insulated, and after the experiment
has run for some time a steady state is reached in which no heat flows
from the gas to the wall or from the wall to the gas. Any temperature
difference between T\ and T2 is then due only to the expansion of the gas.
(b)
8W = p2AX2 p\AX\,
48 THERMODYNAMIC RELATIONS [CHAP. 3
dU = -SW = -d(pV).
Hence
(3-43)
where the subscript H means that this equation only holds when dll = 0,
as in this experiment. From Eqs. (3-12) and (3-20), we have
(3-44)
pV = RT,
Since Cp is not zero, (dT/dp)jj must be. Hence an ideal gas would show
no heating or cooling. Conversely, if a gas shows no heating or cooling in
the porous-plug experiment, we have
dV_dT
(p constant),
V ~ T
V oc T at constant pressure.
d (V\ _ TdV - V dT dT
(p constant).
dT \ T/ T2 F4
50 THERMODYNAMIC RELATIONS [CHAP. 3
Y = — 3^ + C (p constant).
I i
Let us use the further experimental fact that as T increases actual gases
obey more and more closely the ideal gas equation
V _ R
T ~ v
When T is large the K/3T3 term is very small, hence we must have
V
and
pV=RT-|j|- (3-45)
Equation (3-45) is the equation of state for most actual gases. Since this
equation is based on experimental data, it is closely obeyed by most gases
over the temperature range covered by the data. From Eq. (3-45) we see
that as p approaches zero {p —> 0) the behavior of an actual gas approaches
that of an ideal gas, as represented by pV = RT and the resulting pro¬
portionality between T and p at constant volume. This is the justification
for the definition of the Kelvin temperature scale given in Section 1-5.
(3) The value of T, the Kelvin temperature, at any fixed point may be
computed by measuring all the other quantities in Eq. (3-44), substituting
these values in the equation, and solving for T. By this method we may
find T{.
3-9 Equations of state. We have already met the following two equa¬
tions relating p, V, and T:
pV = RT (ideal gas)
and
pV = RT — K' ^ (actual gas),
where
K' = — is a constant.
Many other empirical equations of state have been proposed. One that
fits experimental data extremely well over a wide range of p, V, and T is
3-10] PROPERTIES OF A VAN DER WAALS GAS 51
pV = ~V + B)( 1 (3-46)
where
c
A = Aa (l - fj, B = £„ (l - ,
y^3 ’
and A0, ai Bo, b, and c are constants having different values for different
gases. Notice the resemblance of this equation to Eq. (3-45) when
A = B = 0. The more adjustable constants it contains, the better will
an empirical equation fit the facts.
A more systematic way of representing the behavior of an actual gas
is by means of a power series expansion of pV, such as the following:
y-2«p, b « V.
3-10 Properties of a van der Waals gas. Writing the van der Waals
equation in powers of V, we get
F3 - (b + —) V2 + - V - — = 0. (3-49)
V P / V V
52 THERMODYNAMIC RELATIONS [CHAP. 3
F3 - 3FCF2 + 3F2cF - Fc = 0,
ah
3FC = 5 + f 3F2 — —— > vi =
Pc Pc Pc
8a
Pc 2752 ’ Vc 35, Tc (3-50)
2TRb
In Fig. 3-2 are sketched (a) two isothermals above Tc, (b) two isother¬
mals below Tc, and (c) the isothermal for Tc, all as given by Eq. (3-49).
Fig. 3-2. Isothermals for a van der Waals gas and vapor.
3-10] PROPERTIES OF A VAN DER WAALS GAS 53
1
(dT) _ RT - VV + a/V
Lp \dpjH V - «/^2
2a
pV
54 THERMODYNAMIC RELATIONS [CHAP. 3
(neglecting the term a/V2, since it is small compared with p). This means
that (dT/dp)n is positive so that, with dp negative for an expansion, dT
must also be negative. Thus the gas cools.
In a similar manner, when a = 0, b > 0, we get
Example. For hydrogen, the values of the constants a and b in the van der
Waals equation are a — 25 X 103 n-m4/(kg-mole)2, b = 2.66 X 10-2 m3/kg-
mole, and we may take R = 8300 joules/kg-mole-deg.
Let us first estimate the relative magnitudes of the a/V2 and b terms at, say,
p — standard atmospheric pressure and T = 300°K. We have seen that at
one atmosphere p = 1.01 X 105 n/m2. As a first approximation for V, we take
RT 8300 X 300
V 24.7 m /kg-mole,
V 1.01 X 105
a 25 X 103
72
■ 41 n/m2.
610
~2 « P> b «
Pc = = 13 X 105 n/m2,
T c = 36 = 8 X 10 2 m3/kg-mole,
8a 8 X 25 X 103
Tc 33.5°K.
27 Rb 27 X 8300 X 2.66 X IQ-2
The critical pressure is about 13 atm. Hydrogen at 300°K is far above its
critical temperature, which means that at ordinary temperatures (1) it cannot
be liquefied, (2) it will behave very nearly like an ideal gas.
3-11] GIBBS’ PHASE RULE 55
2a 2 X 25 X 103
2.02 X 10 2 m3/kg-mole.
pV ~ 105 X 24.7
Since b = 2.66 X 10 2 m3/kg-mole it is the larger factor and the gas will
warm up.
If we take 2a/pV = 2a/RT and equate this to b, we get the inversion tempera¬
ture Tq, below which the gas cools on expansion. For hydrogen
2a 50 X 103
226°K.
Rb ~ 8300 X 2.66 X IQ-2
3-11 Gibbs’ phase rule. We have seen that for a single substance in
one of its phases (solid, liquid, or vapor) there are only two independent
variables or “degrees of freedom.” When we consider two phases of the
same substance in equilibrium, such as ice and water, or water and water
vapor, we find that at a given pressure the temperature is fixed, that is,
there is but one degree of freedom left. Similarly, it is found that there is
just one temperature and pressure, called the triple point, at which ice,
water, and water vapor can exist together in equilibrium, that is, there are
no degrees of freedom for such a system.
It is an experimental result that for a system consisting of a single
chemical substance
n = 3 — </>, (3-52)
where <j> is the number of phases present in equilibrium, and n is the number
of degrees of freedom. If we add another substance to our system, we may
then have an additional degree of freedom. In a mixture of two gases the
proportion of one gas to the other may be considered as a variable, in ad¬
dition to T and p, giving such a system three degrees of freedom. Dissolv¬
ing a little salt in some water is another example, the salt concentration
constituting a third degree of freedom.
However, when two different substances do not mix, react, or form a
solution, the substances may as well be considered as separate systems
having no relation to each other.
When two substances react chemically to form a third, we cannot regard
each as an independent constituent in a mixture of the three. We designate
56 THERMODYNAMIC RELATIONS [CHAP. 3
n = 2 + c — </>, (3-53)
which is known as Gibbs’ phase rule. This expression reduces to Eq. (3-52)
as a special case when c = 1.
Example 1. A beaker of water in air has a bell jar placed over it. The air
under the jar then becomes saturated with water vapor. If we consider air as a
single substance, since its composition is fixed, we have c = 2 (H2O, air), 4> = 2
(liquid water, air and vapor mixture), and so n = 2. The variables may be the
temperature and total pressure, or T and the amount of air. The latter could be .
varied by connecting the bell jar to a pump.
Example 3. Suppose that some ice is also in equilibrium with the water and
alcohol mixture in Example 2. Now c = 3 as before, but (j) = 3 (solid ice, liquid
mixture, vapors and air), and so n = 2. One variable is p, since the pressure of
the atmosphere changes. The other variable may be the alcohol concentration,
which then determines T.
Problems
1. Solve the van der Waals equation 9. An ideal gas is taken around a
for p as a function of V and T. Carnot cycle (see Fig. 2-1). At the
2. Given that pV + a/V = RT temperature T2 the volume increases
and U = (3/2)RT — a/V, where a is from Va to Vb, then the volume in¬
constant. Find (a) (dU/dT)v and (b) creases adiabatically from Vb to Vc,
(dU/dT)p. decreases isothermally (at temperature
3. (a) Show that the coefficient of T1) to Va, and decreases adiabatically
expansion /3 = l/T for an ideal gas. back to Va- Find the work done by the
(b) Show that for a gas obeying the gas and the heat taken in along each
van der Waals equation step, in terms of the F’s and T’s. Show
that the efficiency is (T2 — TT+TV
= RV\V - b) 10. Prove that, in general, (a)
P RTV3 - 2a(7 - b)2 0BT/dV)s = (Cv - Cp)/fiVCv, (b)
(dT/dp)s = k(Cp - Cv)/0Cp, and (c)
4. (a) Show that the compressibility
(dp/dV)s = y/kV.
k = 1/p for an ideal gas. (b) Show
11. Taking Cv as constant, show that
that for a gas obeying the van der
(a) Ub — Ua = Cv(Tb — Ta) for an
Waals equation
ideal gas, and (b) Ub — Ua —
V2(V - b)2 Cv{Tb — Ta) + a/Va — a/Vb for a
k ~ RTV3 - 2a(V - 6)2' van der Waals gas.
12. Show that for a van der Waals
5. If a wire expands on being heated, gas Sb - Sa = Cv In (Tb/Ta) +
show that if a tension is applied R In (Fb — b)/(Va — b).
adiabatically the temperature falls. 13. Show that for a van der Waals
6. Find T;, the Kelvin temperature gas
of melting ice, when p = 1 atm, given 2a(V - b)2
that an increase in pressure of 1 atm Cp — Cv = R
RTV3
depresses the melting point 0.0073°C,
Vi = 0.0196 m3/kg-mole for ice, V2 = 14. (a) Suppose that with a certain
0.0180 m3/kg-mole for water, and Lf = gas in the porous-plug experiment it is
1.44 X 106 cal/kg-mole. observed that Cp(dT/dp)H — —k,
7. If the electromotive force for a where A; is a positive constant. Find
certain cell is found to be given by the relation between V and T for such
£ = 1.0000 + 0.00015 (t — 15), where a gas. (b) Compare the behavior of
t is the temperature in °C, find the such a gas with that of one whose
heat of reaction at 25°C. equation of state is p(V — b) = RT.
8. An ideal gas undergoes the follow¬ 15. Compute T(dV/dT)p — V for a
ing processes: (a) V = k, (b) pV = k, van der Waals gas. (Neglect second-
(c) pV2 = k, (d) p/V = k, (e) pT =k, order small terms containing a2, ab,
where A; is a constant. If the initial or b2, and for first-order small terms
state is given by Pa = 1 atm, Va = containing a or b consider pV = RT
24.6 m3, Ta = 300°K, and if in the to be a good enough approximation.)
final state pb = 2 atm, find Vb and Tb Show that the inversion temperature
for each process. (below which the gas cools and above
58 THERMODYNAMIC RELATIONS [CHAP. 3
KINETIC THEORY
Mol. wt.
m (4-1)
Ao
and
No
n = (4-2)
V
dn __ dft
n 47t
77
dn = — dft. (4-3)
*As stated in the Example of Section 3-7, the volume V of a kg-mole of any
ideal gas under standard conditions is 22.4 m3, so that Nq/V is of the order of
1025 molecules/m3, or 1019 molecules/cm3.
4-4] CONCEPT OF PRESSURE IN KINETIC THEORY 61
During many such collisions per unit time, the molecules striking a given
area undergo a certain average rate of change of momentum. From New¬
ton’s third law, the wall experiences an equal and opposite rate of change
of momentum, just as though a certain average force were pushing against
the wall. The pressure of the gas against the wall is taken to be this
average force per unit area. By following this line of reasoning we shall
proceed to derive an expression for the pressure in terms of m, n, and v.
Figure 4-2
molecules with directions between 0 and 0 + dd, <t> and 0 + d(j>. Each of
these AN molecules experiences a momentum change AM given by Eq.
(4-4). The contribution of these molecules to the pressure is found by
Figure 4-3
4-4] CONCEPT OP PRESSURE IN KINETIC THEORY 63
AN AM
mnv2 sin 9 cos2 6 dd d<f>.
dA dt 2tt
r2ir /'7r/2
x/2
\rnnv1. (4-7)
dp = \mnvv2 dv.
p = nvv2 dv.
dv,
n
so that we have «
p = \mnv2. (4-8)
In Eq. (4-8) v* means the average value of the squares of the different
speeds, or the mean square speed, while the square root of this quantity is
64 KINETIC THEORY [CHAP. 4
called the root mean square (rms) speed. The rms speed vrm3 must be dis¬
tinguished from the mean or average speed v. Let us put dnv for nv dv,
the number of molecules per unit volume with speeds between v and v + dv.
Then, by definition,
j2 dnv, (4-9)
and the rms speed vTms = Vp2. Note that if we were dealing with only a
few molecules we would replace the above integrals with summations over
the individual molecules.
_ l2 T l2 T 22 + 22 T 22 + 32 + 32 + 42 + 62 + 102
V 10
= ^ = 18.4(m/sec)2,
l~bl~t_2T2-j-2-|-3 + 3T4-|-6Tl0
v = --
10
34
= — = 3.40 m/sec,
which shows that the rms and mean speeds may be quite different.
Let us return to Eq. (4-8). We note that ran, the mass per molecule
times the number of molecules per unit volume, is the mass per unit
volume or density of the gas. If we call the density p, we have
V = hpv2- (4-11)
4-4] CONCEPT OF PRESSURE IN KINETIC THEORY 65
Now we can measure p and p for a gas and use Eq. (4-11) to calculate the
rms speed of its molecules in a given state. We have
vs = V yp/p. (4-13)
Example 2. For most gases we find that when p = 1.01 X 105n/m2 (one
atmosphere) and T = 273°K (0°C), then V = 22.4 m3/kg-mole. Find the
rms speed under these conditions for (a) hydrogen, (b) oxygen, (c) air
(p = 1.29 kg/m3). Compare (c) with vs for air.
" = dt ks/n'8'
I'rms = V 3p/p
= 1840 m/sec.
(b) For oxygen 1 kg-mole = 32 kg, hence its density is 16 times that of
hydrogen. Since rrms varies inversely as the square root of the density, the rms
speed for oxygen must be one-fourth that for hydrogen at the same T and p.
Thus, for oxygen,
1840
Vrms = — /
7— = 460 m/sec.
4
(c) For air we have as an average (the nitrogen molecules have a greater vims
than the oxygen molecules)
3.03 X 105
^rms —
1.29
= 485 m/sec.
The speed of sound in air under standard conditions is found to be 331 m/sec,
which is somewhat smaller than prms, as expected.
1 N0 —z
V = 3 -y mv2,
or
pV = £ Nomv*. (4-14)
(In this chapter K.E. will refer only to translational kinetic energy.)
We are postulating the molecules to be hard elastic spheres with no forces
between them. The internal energy U of such a gas can only be kinetic.
When the gas takes in heat (8Q > 0) at constant volume (5 W = 0), its
internal energy must increase (dU > 0) and so its molecules must move
faster. Experimentally we observe a rise in temperature for any gas when
its internal energy is increased. The average K.E. per molecule, \mv2,
and the temperature T go up and down together. At constant temperature
%mv2 is constant and hence, according to Eq. (4-14), pV is also constant,
and the gas obeys Boyle’s law.
That an ideal gas also obeys Joule’s law follows from the assumption of
no intermolecular forces. What happens if we increase V at constant T?
Moving the molecules farther apart does not increase the internal P.E.
when the molecules do not attract one another. Keeping T constant means
that there is no change in the internal K.E. Hence dU — 0, and U is a
function of T only.
Since the kinetic theory definition of an ideal gas is equivalent to that
in thermodynamics, we can now use the equation of state derived for an
ideal gas, namely,
pV = RT.
4-5] CONCEPT OF TEMPERATURE IN KINETIC THEORY 67
0mv2 = RT,
Thus the average K.E. per molecule is proportional to the absolute tem¬
perature. The kinetic theory concept of temperature is that it is a measure
of the average kinetic or thermal energy of the molecules.
Let us set
k = ~, (4-16)
iV o
If a gas obeys the equation pV — RT, computing yrms from Eq. (4-15)
will lead to the same results as when we used Eq. (4-12). Let us, however,
assume the value of N0, compute k, and find the K.E./molecule.
Solution.
R 8310
(a) k ~ Nh ~ 6.03 X 1026
* " __
(b) imv2 = f kT
= 5.65 X 10 21 joule/molecule.
68 KINETIC THEORY [CHAP. 4
5.65 X 10-21
K.E./molecule
1.6 X 10-19
4-6 Molecular hypotheses for a van der Waals gas. The van der Waals
equation of state differs from that for an ideal gas in two correction terms:
b, which was added by Clausius, and a/V2, which was later included by van
der Waals.
In his correction term, Clausius allowed for the finite size of the mole¬
cules. Let us call the radius of a single molecule r and assume that all the
molecules are alike and are hard elastic spheres. Then in a collision between
two molecules the closest the two centers can get must be 2r, as in Fig.
4-4(a). The center of the unshaded molecule cannot move everywhere
in the containing vessel, for in this particular collision it finds itself ex¬
cluded from a spherical volume of radius 2r about the other molecule, as
shown in Fig. 4-4(b). This excluded volume is (4/3)7r(2r)3 = (32/3)7rr3,
and it is the unavailable volume per pair of molecules; for if the center of
the unshaded molecule is kept out of this volume, this will ensure that the
center of the shaded molecule will, at the same time, be kept at a distance
of at least 2r from the molecule with which it is colliding.
In one kilogram-mole there are
N0/2 pairs of molecules. The total
volume excluded because of the
finite size of the molecules is
-2 r—
6 = ^xf
(a) (b)
4 3
= 4W0 X g TrrL (4-19)
Fig. 4-4. Molecular collision.
4-6] MOLECULAR HYPOTHESES FOR A VAN DER WAALS GAS 69
If we exclude this volume from the actual volume V, then we can consider
the molecules as point masses moving in the reduced volume V — b.
Hence in place of pV = RT, we write
where, according to Eq. (4-19), b is equal to four times the volume of all
the molecules in V.
Van der Waals attempted to allow for possible intermolecular forces in
his correction term. At close distances the internal structure of the mole¬
cules plays a part in determining the equation of state of a gas. If a mole¬
cule consists of positive and negative charges which are not symmetrically
distributed, and if these charges can be caused to shift their relative posi¬
tions in the molecule, then one molecule will have an electric field around
it and this field will act on a neighboring molecule. Since unlike charges
attract, the unlike charges on adjacent faces of two molecules will cause an.
attraction between them.
As a molecule reaches the wall of the container there are no other mole¬
cules ahead of it, but it is attracted to those in the layer immediately be¬
hind. This pull from behind results in the molecule striking the wall with
less momentum than would otherwise be the case, so that the pressure p
which we observe is really less than that for an ideal gas at the same V
and T. Before we can use the equation pV = RT for an actual gas we
must add a term to the observed pressure p. This correction term will be
proportional to the number of molecules per unit area in the surface layer
(on which the other molecules pull back) and to the number of molecules
per unit area in the next layer (these being the molecules doing the pulling).
Both quantities are proportional to N0/V, hence the reduction in pressure
is proportional to 1/E2. Calling the equivalent ideal gas pressure p', we
write for the observed pressure p,
, a
V — V y2 ’
or
P = V+y-2-
3 _b_
167T No
r = 1.38 X 10 10 m.
This value agrees well with those computed by other methods, thus lending
further experimental support to the kinetic theory.
p + dp = p — pg dy,
mN o
p = mn =
y
so that
dp __ _ mg No
(4-23)
p RT V
(4-24)
To integrate the right side of Eq. (4-24) we must know how T depends
on y, or the “lapse rate. ” In the case of the earth’s atmosphere T is found
to vary with altitude (usually decreasing), up to the beginning of the
stratosphere, the bottom of which varies from a distance of 5 miles above
the earth’s poles to 10 miles above the equator. Up to this level the total
decrease in T is about 25%, but above this point T is constant up to much
greater heights.
We shall now, however, consider a simpler problem and assume that T
and g are constant for all values of y. Integration of Eq. (4-24) yields
—(mgNol RT)y
P = Poe (4-25)
This equation is known as the law of atmospheres, and for small differences
in altitude the pressure of the atmosphere is actually found to vary some¬
what in this exponential manner.
Perrin realized that this same formula could be applied to the con¬
centration of beads in his liquid suspension, and with much greater accuracy
than when applied to the atmosphere. First, the value of mg was so much
larger for one of his beads than for a molecule that he needed to vary h
very little in order to get a large variation in c. Second, a uniform tempera¬
ture could be maintained very precisely. Third, the beads were large
enough so that their diameters could be measured and, knowing the
density of the resinous substance, the mass of the bead could be calculated.
By subtracting from this value the mass of the displaced liquid, the effec¬
tive value of the term mg in Eq. (4-26) could be found, and the expression
could then be used to compute N0.
(b) Faraday’s constant and charge on the electron. In electrolysis experi¬
ments it is found that it takes 1000 F coulombs of electric charge to
deposit one kilogram-mole of a univalent substance like silver, where F is
the Faraday constant defined in Section 3-6. Each ion carries an electronic
charge e, while the charge 1000 F is carried by N0 ions. Thus
N0e = 1000 F,
N0 = 1000 F/e.
the same value (within limits of error) for N0. That some of these values
are based on kinetic theory is additional verification of that theory.
U = %RT.
dU= 3
Cv
dT ~ 2
From Eq. (3-41),
Cv = Cv + R = | R.
Hence
Cp 5
7 Cv 3'
This value for y is in good agreement with that found experimentally for
monatomic gases (He, Ne, A, Kr, Xe). Other gases have different values
for 7 and this is attributed to their molecules having other forms of
kinetic energy (rotational, vibrational) in addition to that of translation.
We would expect the molecules of a monatomic gas to have the closest
resemblance to the hard elastic spheres postulated for an ideal gas, and
experimental results support this view.
(b) Transport phenomena. Molecular theory may be used to explain the
following three properties of a gas: viscosity, thermal conductivity, and
diffusion.
Viscous stresses arise when different layers of a flowing gas move with
different speeds. Then, because of thermal motions, some of the molecules
in a faster moving layer drift into an adjacent layer where the forward
velocity is less, transporting forward momentum to the slower moving
layer. Conversely, some molecules of the slower moving layer drift into
the faster moving layer and decrease the momentum of the latter. This
transfer of momentum, which is equivalent to internal friction, can be
computed from kinetic theory considerations.* One interesting result
is the prediction that at constant temperature the coefficient of viscosity
is independent of the pressure, a result which can be verified experi¬
mentally.
Problems
1. Use the value of Avogadro’s num¬ suming (a) that the bullets are em¬
ber to calculate the mass of a molecule bedded in the wall, and (b) that the
of (a) H2, (b) 02. bullets rebound with 40% of their
2. Compute the number of mole¬ original horizontal velocity.
cules per cm3 in a gas at (a) standard 7. Compute i>rm8 for oxygen mole¬
conditions, and (b) at 0°C and a pres¬ cules at (a) 27°C and a pressure of 1
sure of one-millionth of an atmosphere atm, (b) 327°C and 1 atm, and (c)
(a fair vacuum). 327°C and 0.25 atm.
3. Show that Eq. (4-7) may be de¬ 8. Compute the temperature at
rived easily if one assumes that on the which a gas molecule would have a
average one-third of the molecules are mean K.E. of translation of one elec¬
moving parallel to each of the three tron-volt.
rectangular axes X, Y, Z. 9. Show that the number of mole¬
4. Find v and ^rras for a group of 10 cules of gas striking the walls of a con¬
molecules when (a) each has a speed of tainer per unit area, per unit time, per
5 m/sec, (b) five have speeds of 3 m/sec unit solid angle, and in a direction mak¬
and five have speeds of 7 m/sec, (c) ing an angle 6 with the normal, is
their speeds are 1, 2, 3, 4,..., 10 m/sec, nv cos 6/4-jt.
respectively. 10. Show that the total number of
5. Suppose that the speed distribu¬ molecules of a gas striking a unit area
tion for a group of particles is given by of a container wall per unit time, in¬
Nv dv = 106 v dv up to v = 500 m/sec, cluding those coming from all direc¬
but Nv dv = 0 for v > 500 m/sec. tions, is nv/4.
Find (a) the total number of particles, 11. A vessel of volume V contains
(b) v, and (c) vrmB. Plot a graph of Nv no molecules of a gas per unit volume.
VS. V. A small hole of area A is made in the
6. In one minute 240 machine gun container wall, (a) Neglecting leakage
bullets strike a vertical wall. Each back into the vessel from outside, show
bullet has a mass of 0.01 kg and a that the number of the original mole¬
horizontal velocity of 2.5 km/sec. Find cules left inside decreases exponen¬
the average force against the wall, as¬ tially with time, (b) In terms of V, A,
PROBLEMS 75
and v, find the time required for only 15. In a colloidal suspension experi¬
no/e molecules to be left. (Here e is the ment it would be practicable to have
base of natural logarithms.) c = co/2 when y = 2 mm and T =
12. Find the volume and radius of an 300°K. Further, the suspended beads
oxygen molecule, given that b in the should be at least 10~5 m in diameter.
van der Waals equation equals 0.032 Suppose that their density is 800
m3/kg-mole for oxygen. kg/m3. Show that the effective value
13. Taking 29 kg/kg-mole as the of mg must be greatly reduced by sus¬
average molecular weight for air and pension of the beads in a liquid of very
T — 260°K, find (a) the height y at nearly the same density.
which, according to Eq. (4-25), p = 16. Assume that the coefficient of
po/2, and (b) p/po at the top of Mt. viscosity pv is a function of the mass,
Everest (29,000 ft). (Experimental ob¬ radius, and speed of the gas molecules.*
servation indicates that p = po/2 at Take pv = kmxrvvz, where k is a di¬
about 18,000 ft.) mensionless constant. From dimen¬
14. Below the stratosphere we may sional analysis find the powers x, y,
take T = To — ky (k is a constant) to and z, and from this find the depend¬
represent the variation in temperature ence of pv on T.
with height in the earth’s atmosphere.
(a) Integrate Eq. (4-24) for this case.
(b) Taking the observed lapse rate *pv may be briefly defined as the
k = 6°K/km and To = 280°K (aver¬ viscous force per unit area per unit
age sea-level temperature at New velocity gradient. For a more complete
York), compute y when p = po/2. definition see Volume I, pp. 229-230.
CHAPTER 5
to be equivalent. As we pass from these to (c), (d), and (e), the descrip¬
tions become more and more specific, (e) being too much so for practical
purposes. In this chapter we shall be primarily interested in (c), which we
shall call a macroscopic description of a state. However, to find (c) we shall
also have to refer to (d), which will be termed a microscopic description of
a state. Abbreviating, we say that (c) defines a macrostate and (d) a micro¬
state of the system.
Since the molecules of a gas are in constant motion, the system is con¬
tinually changing of its own accord from one microstate to another and,
almost as frequently, from one macrostate to another. However, the more
microstates corresponding to the same macrostate, the more likely the
latter is to occur.
Example. As an analogy, consider two coins, a penny (p) and a dime (d),
that are tossed into the air simultaneously. Call falling “heads” equivalent to
being in cell 1 and “tails” to being in cell 2.
There are three macrostates, as follows: (1) ni = 2, n2 = 0, (2) m = 1,
712 = 1, (3) ni = 0, ri2 = 2, that is, two heads, a head and a tail, and two tails.
Now the first and third macrostates can each occur in only one way, p\di and
P2^2, respectively; there is only one microstate for each of these macrostates.
However, two microstates, p\d,2 and p2di, correspond to the second macrostate.
If the coins are not “loaded,” heads and tails are equally likely to occur and the
second macroscopic state should turn up twice as often as either of the others.
In a large number of throws the percent difference between the predicted and
observed ratios is small.
N = nx + n2 + n3 + • • • = a constant. (5-1)
(d) The total energy is constant. This follows from the principle of con¬
servation of energy. In this chapter we are not considering processes in¬
volving a change from one state to another, but rather we are investigating
the velocity and energy distribution for a given state. If we call the total
energy V and the energy per particle Ex for cell 1, E2 for cell 2, E3 for cell
3, etc., we have
Example. Suppose that there are only two cells and four molecules, a, b, c,
and d. Consider the possible macrostate defined by m = 3, n2 = 1. Figure
5-2 (a) shows the four corresponding microstates. Figure 5-2 (b) shows the same
microstate as the first microstate in Fig. 5-2(a). Thus W = 4. By Eq. (5-3),
80 CLASSICAL STATISTICAL MECHANICS [CHAP. 5
Cell 2 a b c d n2 = 1 a
(a) (b)
Figure 5-2
4!
m = 4, n2 = 0, W = ~ = 1,
ni = 2- n2 = 2, W = JL = 6)
m = l,n2 = 3, W = ■— = 4,
ni = 0, n2 = 4, W = = 1.
The reader should check the above values of IF by listing the different micro¬
states for each case. Note that 0! = 1, for an empty cell presents no alterna¬
tives when distributing molecules. Observe also that the most uniform distribu¬
tion (rii = ri2 = 2) is also the most probable.
In IF = In N! — In rii! — In n2! • • • .
Fortunately our n/s are all large numbers and so we can use a mathe¬
matical approximation known as Stirling’s theorem, namely,
In x! = x In x — x, (5-4)
/ X
In x dx = x In x — x + 1,
which simply states that if some cells gain molecules, they do so at the
expense of other cells that lose the same molecules.
The final condition is SU = 0. We differentiate Eq. (5-2), noting that
the ft/s may change, but not the Ejs (which are characteristic of the cells).
Hence
^iSfti + E25n2 + E35ft3 + • • • = 0. (5-7)
Equations (5-5), (5-6), and (5-7) must all be satisfied at the same time,
and they are independent of one another. The equations may be com¬
bined and yet have their independence specified mathematically if we
multiply each by a separate arbitrary constant and then add them. Since
the right side of the resulting equation is zero, we can divide through
by one of the arbitrary constants. This amounts to multiplying one
equation by unity, a second by the ratio of two arbitrary constants,
and the third by another such ratio. These two ratios are the only two
independent constants introduced. This method is known as Lagrange’s
method of undetermined multipliers. Let us, then, multiply Eq. (5-5) by
unity, Eq. (5-6) by the constant X, and Eq. (5-7) by the constant 0.
Adding these three equations, we get
The Sn/s in Eq. (5-8) must satisfy the two condition equations (5-6)
and (5-7), otherwise the Sn/s may vary arbitrarily. If we consider Sn3,
Sn4, Sn5, ■ • • as independent, Eqs. (5-6) and (5-7) determine Sni and Sn2
in terms of the other Sn/s. Now let us assign to X a value which makes
In 712 X (3E2 = 0.
In this equation the Sri/s are all independent. Since we can make all of
them zero except Sn3, we see that the coefficient of Sn3 must vanish, and
so on for the other terms. Hence W (or In IE) is a maximum, subject to
the stated conditions, for those values of n\, n2, n3, • • • for which
m = Ae~0Ei, (5-10)
Low-energy cells will contain more particles than high-energy cells, with
cells of zero energy being the most populated. This does not mean, how¬
ever, that more molecules have zero energy than any other energy, for we
shall find that there are more cells corresponding to high than to low energies
and the most probable energy is greater than zero. Thus if we take a zone
of Qi cells all corresponding to about the same energy Ei, the number of
particles in this zone, which we shall call Ni, is given by
Ni = Agie~pE\ (5-11)
We shall see that for a small energy range dE, gi is proportional to Ej*.
From Eq. (5-10) we s^e that all cells corresponding to the same energy
but to different positions in the container are equally populated. This is
true for all values of E{. If the potential energy due, say, to a gravita¬
tional field, may be neglected, it follows that Ei is independent of x, y, and
z and thus the molecules are uniformly distributed in coordinate space.
This is what we would expect, and it is a consequence of our hypothesis
that all microstates are equally probable.
N — ni T n2 T- n3 = 10,
If 5n3 - —2, two molecules leave the cell of highest energy for cell(s) of
lower energy. Since these two molecules give up energy, others must gain
energy. Hence molecules in cell 1 gain energy and enter cell 2. Since N and U
remain constant, we must have
Now I 10!
w 5! 3! 2!
2520,
10!
W' 120.
3! 7! 0!
The original distribution is much the more probable one and also bears closer
resemblance to the Boltzmann distribution.
Our problem now is to express gv in terms of v and dv. From the definition
of gv, we have
gv = number of cells in the fth zone
To obtain the volume in phase space of the fth zone we must integrate an
element of volume dx dy dz dvx dvy dvz throughout that zone, or
fffdxdydz = V,
where V is the volume occupied by the gas. This leaves integration with
respect to vx, vy, and vz. As we care only about the resultant speed v
and not about the velocity components, we can proceed as follows.
86 CLASSICAL STATISTICAL MECHANICS [CHAP. 5
Imagine vX) vy, and vz plotted along three mutually perpendicular axes.
About the origin describe a sphere of radius v. The surface of this sphere
will contain all points for which v2 + v2 + vz = 1,2 ■ Now describe a
second sphere of radius v -j- dv, concentric with the first. The volume
between these two spheres will contain all points in velocity space for
which the speed is between v and v + dv. This volume is Airv2 dv. Hence
The volume of our zone in phase space is F(47rv2 dv). Since the volume of
one cell is r, we have
gv = irZ
T
(5~13)
and
Ev = \mv1.
If we substitute these last two expressions back into Eq. (5-12) and let
B = 4ttA V/t, we get
Nvdv = Bv2e~?mv2'2dv, (5-14)
which is Maxwell’s distribution law for molecular speeds. For very small
values of v the exponential term is near unity and Nv is proportional to v2.
For very large values of v the exponential term is the more important one
and Nv is proportional to e~av2. Thus when we plot Nv against v, as in
Fig. 5—5, the curve first rises parabolically, later reaches a maximum, and
finally falls exponentially to zero. The maximum corresponds to the most
probable speed, designated as vp. As the area under the curve to the right
of the maximum is greater than to the left, the mean speed v is greater
than vp and wrms > v > vv. (See problems 7 and 8 at the end of this
chapter.)
5-9 Evaluation of the constants B and /3. We have seen that each
distribution law contains two constants, such as B and /3 in Eq.. (5-14),
which are related to the two undetermined multipliers in Section 5-6.
These multipliers were introduced because of the conditions that the total
number of particles N and total energy U are constant for a given state.
88 CLASSICAL STATISTICAL MECHANICS [CHAP. 5
N = / Nv dv
Jo
2 — pmv /2
= B vze dv.
Jo
n -/3mu 12
Jn — dv.
Then
N_
B = (5-16)
J2
U = / imv2Nv dv
Bm / V*e~emv2i2dv
2 Jo
Nm
© (5-17)
The following values of some of the J’s are taken from standard integral
tables:
Jo — T>X/2-jr/l3m, J1 = ——, J2 =
B = 2NVWmz/2TT,
5-9] EVALUATION OF THE CONSTANTS B AND (8 89
i
U = N$kT).
Let us equate the two expressions for U and solve for /3. We then find that
/?• = (5-18)
kT'
Ni = Agie~Ei/kT, (5-19)
Nv = (4N/V^)(m/2kT)3l2v2e~mv2l2kT
vs. v have been drawn for three different temperatures, Tx < T2 < T3.
The higher the temperature, the more energy the molecules possess and
the more the higher speeds are favored. However, the areas under all
three curves are the same (N constant).
Fig. 5-7. Distribution function at three different temperatures, T3 > T2 > T\.
90 CLASSICAL STATISTICAL MECHANICS [CHAP. 5
Let Wa be the probability or chance that a certain event will occur and
let Wb be the probability of occurrence of another independent event.
Then the probability W that both of these events will occur is W = WaWi).
Suppose, for example, that one person tosses a coin while another draws
a card from an ordinary complete deck. The result of repeated coin tossing
is that heads appear close to half the time, so we say that the chance that
a person will toss heads one particular time is Wa = Similarly, the
5-11] ENTROPY AND PROBABILITY 91
chance of drawing a king from the card deck is Wb = j3. So the prob¬
ability that one person will toss heads while another person draws a king
is 2V) i-e-, both of these events will occur together on the average only
once in twenty-six times.
Let us return to the two independent systems in equilibrium. If Wa
now represents the probability that the first system is in the state of
entropy Sa, while 1T& is the probability that the second system is in the
state of entropy Sb, then the probability that both of these events will
occur together is
W = Wa X IT&. (5-22)
We have seen that the state of equilibrium is both the state of maximum
entropy and of maximum probability, and this suggests a relationship be¬
tween entropy and probability. The simplest relationship consistent with
Eqs. (5-21) and (5-22) is a logarithmic one. Therefore we shall take
S = /c In W, (5-23)
Sa = k In Wa, Sb = k In Wb,
we have
S = Sa + Sb = k In (WaWb) = k In IT,
du = 8Q = TdS. (5-24)
From Eq. (5-2), we have
X) drii = 0
i
K.E./molecule = § kT.
(b) Two degrees of freedom. Suppose that we had molecules which, like
marbles in a flat dish, could move only in a two-dimensional plane. Equa¬
tion (5-19) would still apply, but in deriving the velocity distribution we
would proceed as follows.
In place of the two concentric spheres in velocity space, one of radius v
and the other of radius v + dv, take two concentric circles of radii v and
v -j- dv, respectively, these circles being in the -plane containing all possible
velocities. Then, for the fth zone,
Nv dv = Cve
where C is independent of v.
For such a velocity distribution we have
K.E./molecule =
94 CLASSICAL STATISTICAL MECHANICS [CHAP. 5
V , —mv 12kT
gv = — dv, N v dv = De dv,
rx i 2 —mv2/2kT j
>mv e
J0 \mve dv
and K.E./molecule = ncc
00 _—tod2/ 2kT
So e dv
Jz
\m
Jo
= UT.
(d) / degrees of freedom. The three previous cases all indicate that the
average K.E. per molecule is %kT per degree of freedom. We shall show that
this is indeed so whenever the energy of a molecule is a quadratic function
of the velocity components, as is usually the case. Then each degree of
freedom acquires the same average K.E., namely, fkT.
Suppose, for example, that q is the coordinate associated with one of the
degrees of freedom, vq the corresponding velocity component, and %mv\
the K.E. associated with this velocity component. (Here m is a constant,
such as the mass or moment of inertia of the molecule.) To find the number
of molecules with a given velocity component between vq and vq -f dvq we
must take a zone of cells in a phase space of 2/ dimensions, this zone in¬
cluding all possible values of the coordinates and other velocities, but just
the exact variation dvq of vq. Then Nv dv will be proportional to dvq. For
the energy associated with this sheet of cells we may write
Eq = E -j- \mv\,
where E' is the average energy corresponding to the other degrees of free¬
dom. E is independent of vq, so that when integrating with respect to vq
we may consider e~E'lkT as a constant. We then have for the average
value of \mv\, that is, the average K.E. per molecule associated with the
velocity component vq,
I™ f00 ,.2
2m jo vqe — mvllZkT dv.
1
£ _ J2
~ST^~mvl^kT dvq = tm Jo
= ikT.
5-12] THE EQUIPARTITION OF ENERGY PRINCIPLE 95
Example. Suppose that the molecules of an ideal diatomic gas are rigid, but
free to rotate about either of two mutually perpendicular axes, these axes being
at right angles to the axis of the molecule. Find Cv, Cv, and y for such a gas and
compare with experimental values of y for diatomic gases.
Solution. We have a gas with five degrees of freedom per molecule, three of
translation and two of rotation. From the equipartition of energy principle the
K.E./molecule = - kT,
Zi
and the
K.E./mole = - RT.
Zi
Hence
U = - RT,
dU 5
Cv =
dT 2 ’
Cp = Cv + R — 2
7
&-S-
At 15°C the values of y found experimentally are 1.41 for H2, 1.404 for N2,
1.401 for 02, 1.404 for CO, 1.41 for HC1, all in excellent agreement with the
theoretical value. _
than zero. Such is not the case with the rotational energy. A study of
molecular spectra indicates that the rotational energy of a molecule is
limited to a discrete set of values, say 0, Ex, E2, E3 ■ • • . This is explained
theoretically by the quantum theory (Chapter 15). The successive allowed
values of rotational energy differ by finite amounts. As a typical example
we may take HC1, for which Ex is about 2.5 X 10-3 electron volts (ev).
At the end of Section 4-5 we found that at 0°C the average K.E. of trans¬
lation of a molecule is 3.5 X 10~2 ev; since this latter value is over ten
times E\, it is quite possible that in a collision between two molecules of
HC1 gas at 0°C rotational energy will be excited at the expense of transla¬
tional energy. In other words, relative to the mean translational K.E.,
the allowed rotational energy states are so close together that they may
be regarded as practically continuous, in which case the rotational degrees
of freedom must share in the equipartition of energy.
At very low temperatures, however, the mean translational K.E., which
is proportional to T, may be much less than Ex, the first allowed value of
the rotational energy. In this case the population of the first rotational
state, which is proportional to e~EllkT, must be very small. The rota¬
tional energy then will not contribute to the specific heat.
A nonrigid molecule containing n atoms (here considered as mass points)
must possess 3n degrees of freedom. Three of the degrees of freedom are
associated with the translational K.E. In general, a complex molecule has
three degrees of freedom of rotation (corresponding to rotation about three
mutually perpendicular axes), although for diatomic molecules the number
is only two, as stated in the previous example. This leaves (3n — 6) or
(3n — 5) degrees of freedom associated with the vibrations of nonrigid
molecules. To a first approximation these vibrations are simple harmonic
oscillations. Since a simple harmonic oscillator possesses an average
-potential energy equal to its average kinetic energy, if a vibrational degree
of freedom shares in the thermal energy, the mean total energy associated
with this vibrational degree of freedom must be kT rather than \kT.
According to quantum mechanics the energy of an oscillator is also
limited to a series of discrete values (Chapter 6), and it turns out that for
molecules these energy states are much farther apart than are the rota¬
tional ones referred to earlier. The result is that at room temperature the
population of any vibrational state (other than that of zero energy) is very
small, and so the vibrational degrees of freedom do not contribute to the
specific heat.
It is interesting to observe that throughout physics the internal structure
of particles does not play a role until energies are reached which are
comparable to the energy levels involved in the structure. This is the
reason why the hard-sphere model of a molecule works as well as it does
even though we know it to be a great simplification.
97
i
Problems
1. In tossing a coin, let us consider 11. At 0°C find the number of oxy¬
that “heads” are in cell 1, and “tails” gen molecules per thousand with speeds
are in cell 2. Suppose that ten coins greater than 1000 m/sec.
are tossed, (a) Compute W for each pos¬ 12. Show that the number of gas
sible distribution from all heads to all molecules with ^-components of veloc¬
tails, (b) If the ten coins are tossed at the ity between vx and vx -f- dvx is equal to
same time, what is the percent proba¬ (N/^n){m/2kTyl2e-™xl2kT dvx.
bility that seven or more will land alike? 13. From Eq. (5-15) find, in terms
2. Given three cells and five parti¬ of kT, the most probable energy Ev.
cles. Find W for (a) n\ = 5, ri2 = 14. Zartman and Ko used in their
713 = 0, (b) Ul = 4, 712 = 1, 713 = 0, experiment a beam containing a mix¬
(c) 711 =3, 712 = 2, 713 = o, (d) ture of Bi and Bi2 molecules issuing
711 = 3, 712 = 713 = 1, (e) ni = 712 = from an oven at 850°C. (a) Find ?;rms
2, 713 = 1. for each of the two kinds of molecules,
3. Compute the percent error made (b) Taking the cylinder of Fig. 5-8 to
in taking In N! = N \n N — N when be 10 cm in diameter and rotating at
(a) N = 4, (b) N = 8, (c) N = 16. 3600 rpm, find where, relative to when
4. Refer to the Example at the end the cylinder is at rest, the Bi and Bi2
of Section 5-6. If 5ri2 = 2, find 5ni, molecules with rms speeds would strike
bnz, and W'. the glass plate G.
5. Given 100 particles and 10 cells 15. Consider polyatomic molecules
of equal energy. Find In W for (a) the with three rotational degrees of free¬
most probable distribution, (b) the dom. Find Cv, Cp, and y. List some
least probable distribution. experimental values of y for polyatomic
6. Given 100 particles and 10 cells. gases.
Find the ratio of W for the distribution 16. A nonrigid diatomic molecule
7ll = 712 = • • • = 778 = 10, 719 = 13, possesses six degrees of freedom (3 for
ni0 = 7; to the W for a completely each atom), one being vibrational. As¬
even distribution. suming simple harmonic vibrations,
7. Show that the most probable the K.E. and P.E. associated with the
speed for a molecule is given by vp = vibration are equal. If the vibra¬
y/2kT/m. Find the ratio vv/v rms* tional energy is in thermal equilibrium
8. Show that the mean speed is with that associated with translation
given by v = y/8kT/irm. Find the and rotation (as is actually the case
ratio v/vima. at high temperatures), find Cv, Cp,
9. At 0°C compute vp, v, and vTma for and y.
O2 molecules. 17. Compute the value of Cv for a
10. At 0°C find the number of solid, assuming that the molecules can
oxygen molecules per kg-mole with oscillate only three-dimensionally
speeds between (a) 100 and 101 m/sec, about fixed points. (The result,
(b) 300 and 301 m/sec, (c) 500 and Cv = 3R, is the law of Dulong and
501 m/sec, (d) 700 and 701 m/sec. Petit, which is in good agreement with
Using these values and that of vv from experiment at high temperatures.)
problem 9, plot Nv vs. v.
CHAPTER 6
QUANTUM STATISTICS
very light particles that the postulates of the quantum theory are
most significant. This suggests incorporating the fundamental principle
of quantum mechanics (Heisenberg’s uncertainty principle) into the
statistical theory of electrons and photons. At the same time, when
dealing with such minute particles, it became apparent that all electrons
are alike and indistinguishable, as are photons of the same energy. In fact,
should not this idea even be extended to gas molecules?
The study of the spectra of atoms with several valence electrons led to
two further discoveries, (1) the spin of the electron, and (2) Pauli’s exclu¬
sion principle. It was found that the electron possesses a spin angular
momentum of %(h/2r) and, for a given situation of the electron, two
orientations, opposite to each other, are possible for the spin vector.
To the other quantum numbers that specify the state of an atomic electron
a spin quantum number was added. The Pauli principle asserts that no
two electrons in an atom or molecule can have the same set of quantum
numbers, i.e., that no two electrons in a given system can be in the same
quantum state. Fermi and Dirac recognized that the Pauli principle
should also apply to the free electrons in a metal and they developed
quantum statistics for such particles (see Section 6-7). The so-called
Fermi-Dirac (F-D) statistics were extended later by Sommerfeld, with
highly successful results.
Further experimental evidence has indicated that the F-D statistics
are obeyed by any group of particles for which the individual spin is (a)
the same as for the electron (this includes protons, neutrons, positrons,
and n-mesons), or (b) an odd integral multiple of the electron’s spin. Parti¬
cles whose spins are zero or an even integral multiple of the electron s
spin are not subject to the Pauli principle, and so a group of such particles
must obey a different set of quantum statistics, that known as the Bose-
Einstein (B-E) statistics (see Section 6-4). Photons and x-mesons fall into
this category.
The question of which statistics gas molecules obey is answered by count¬
ing the number of particles (protons, neutrons, and electrons) in the
complex molecule; if this number is odd, the F-D statistics apply, while
if the number is even, the B-E statistics must be used. Actually, the
number is odd in only a few rare cases (e.g., monatomic beryllium vapor),
so we shall henceforth consider only molecules that obey the B-E statistics.
In a complex particle the spins of its components seem to be lined up
parallel or antiparallel, so that an even number of such components, each
with a spin of %(h/2x), gives a resultant spin of 0, 1, 2, • • • times (h/2x),
while an odd number of components gives a resultant spin of 2, 2>
times (h/2x). If we consider this resultant spin to determine the statistics
and then apply the rules of the preceding paragraph, we see that “even”
molecules should obey the B-E and “odd” molecules the F-D statistics.
100 QUANTUM STATISTICS [CHAP. 6
E = nhv, n = 0, 1, 2, 3, • • • (6-1)
(6-3)
In Eq. (6-2) we may express the kinetic energy in terms of the mo¬
mentum px = mx instead of x. We then obtain
or
(6-4)
(6-5)
Fig. 6-1. Phase space orbit of linear Fig. 6-2. Quantized phase paths for
oscillator. linear oscillator.
E = \ kA2 = nhv.
A = \/2nhv/k.
A2 = —=■ (6-6)
7rv mk
px dx = nh, n = 0, 1, 2, 3, • • • . (6-7)
we took our cells to be several times larger than h, they would include more
than one possible energy state. Therefore, for one-dimensional oscillators
the size of the cell should be h. This suggests that for six-dimensional
phase space the size of the unit cell should b eh3. In the last chapter we
assumed all cells to be of the same size, but we did not specify this size.
Now we see that quantum theory tells us what this size should be.
Planck’s ideas were extended by Einstein, who reasoned that since
energy is emitted and absorbed in units of hv, radiation itself must be com¬
posed of energy bundles, called photons or quanta. For light of frequency v
the energy per photon is taken to be hv. This hypothesis found excellent
confirmation in measurements of the photoelectric effect. These experi¬
ments also led to an accurate determination of h, namely,
Equation (6-7) later became the basis of the Bohr theory of the hydro¬
gen atom, where it was generalized to apply to each degree of freedom.
In classical mechanics the laws of physics place no limitation on how
precisely we may specify the position and momentum of a particle. This
position and momentum, and hence the state of the particle, can be
represented by a point in phase space. In quantum mechanics, however,
we find that if a coordinate x is measured to within an uncertainty Ax,
then by the very process of this measurement an uncertainty Apx is introduced
into the corresponding momentum such that Ax Apx ~ h.
The above is a statement of Heisenberg's uncertainty or indeterminacy
principle. It is a generalization of the conclusion reached for the harmonic
oscillator on the basis of Planck’s quantum hypothesis, and it leads to
a confirmation of our postulate that for particles moving in three-dimen¬
sional coordinate space, the size of the unit cell in the corresponding
phase space should be h3. The state of a particle cannot be specified more
precisely than by saying that the point representing its position and
momentum lies somewhere within such a cell.
where the integration is taken to include all the cells in the given zone.
Let gz represent the number of unit cells and Ni the number of particles
in the ith zone. While dv is to be considered small compared with the
range in speed of the particles in the whole container, it is still possible to
consider it large enough for a zone to contain a great many cells and
particles. We may then regard <7; and Ni as large numbers and apply
Stirling’s theorem to gd. and Nil
The numbers N1, N2, N3, etc., which specify the number of particles
in each zone, now define a macrostate. The number of ways in which the Ni
particles of the ith zone can be distributed among its g{ cells will be desig¬
nated as Wi, and the thermodynamic probability of the macrostate will
be defined as
W = Wi ■ W2 ■ W3 ■ • • = IIWi,
i
where the product extends over all zones of phase space.
c b, d a, e
(6-8)
Cell 1 2 3 4
b, d c a, e
(6-10)
Cell 3 2 1 4
which is the same as (6-8), since the same cells contain the same particles.
The cells (with their respective particles) can be interchanged like this in g\
ways, all of which correspond to the same microstate, so that when we
come to counting different arrangements we must divide (6-9) by g!.
Since the particles are now to be regarded as indistinguishable, an
arrangement such as
b c, d a, e
(6-11)
Cell 1 2 3 4
c b, d a, e
(6-12)
3 2 1 4
w N j — 1)!
g\Ni\
(g + Nj - l)!
(6-13)
(g - i)!^<!
When we combine these arrangements with each of those for all other
zones, we get
w = n Wi = n (g(g+-Nii)!tff!
i i
- 1)!
‘
(6-14)
g -\- N j l
2> Ni
8N i = 0. (6-15)
L SNi = 0, (6-16)
£ Ei 8Ni = 0, (6-17)
8Ni = 0.
or
(6-18)
and we see that for any zone the ratio of the number of particles to the
number of cells in the zone is independent of g. If the zones have different
numbers of cells, say <7; in the fth zone, then Ni will be equal to Qi times
the ratio given in Eq. (6-18). We then obtain
Qj
Ni = ex+PEi — 1 ’
(6-19)
/3 ^ ji' (6-20)
A = ^ « !• (6-21)
We shall compute A for a gas at room temperature and show that con¬
dition (6—21) is actually satisfied. Since such a gas is found to obey the
M-B statistics, we may write
is the number of molecules in the fth zone in phase space; Qi, the number of
unit cells in this zone, is equal to the volume of the zone divided by h3.
To find the volume of this zone we must first remember that it includes
all possible values of the coordinates so that (refer to the sextuple integra¬
tion in Section 6-3)
/// dx dy dz = V,
V being the volume of the gas. In momentum space the zone will include
the volume between two spheres, one of radius p and the other of radius
p -f- dp, where
p — mv
is the scalar magnitude of the momentum of any molecule with the speed
v, and
dp = m dv.
= 4:ttvi3v2 dv.
Hence
47rm3FA 2 — mv2/2kT
Nv dv = v e dv, (6-23)
h3
which is the Maxwell volocity distribution law, Eq. (5-14), with the
constant of proportionality
47rm3 V A
B = (6-24)
h3
The only difference between the above expressions and those derived in
Section 5-7 is the presence of the m3 term, which enters because our phase
space is a momentum-coordinate rather tlian a velocity-coordinate one.
The size of the unit cell, h3, appears also, but we shall see that A is pro¬
portional to h3, so that actually Planck’s constant cancels out and its
magnitude here is immaterial.
108 QUANTUM STATISTICS [CHAP. 6
(^m3VA\ 1 /8 fffcaya
N = Nv dv = BJ 2
~ \ h* ) 4 \ m3
or
Nh3
A = (2TmkT)-312. (6-25)
V
A « 2X 10-7,
so that A <5C 1.
Let us now consider the case when A might be much larger than in this
example, so that ex = 1/A is not large enough to allow us to neglect the — 1
term in Eq. (6-18). We see that A increases as (1) T decreases, and (2)
m decreases. The effect of decreasing the temperature is twofold because
of the T ^2 factor and because V decreases with T at constant pressure.
The best gas to use in testing for a departure from the M-B theory is
helium, for it is one of the lightest gases and it can be cooled to very low
temperatures before it condenses. Low-temperature measurements with
helium have shown departures from the M-B theory in the direction in¬
dicated by Eq. (6—18) j the gas is then termed degenerate. Thus we can
regard the M-B statistics as a special case of the B-E statistics, this special
case including gas molecules at all but very low temperatures.
Nv dv — 8irv2 ^ • (6-26)
c3
If we were to assume the equipartition of energy, we would assign an
average K.E. of \kT and an average total energy of kT to each member of
a group of linear oscillators in thermal equilibrium. Each wave is
statistically equivalent to an oscillator, so we would then associate an
energy kT with each mode of vibration in Eq. (6-26). However, realizing
that this would not lead to his empirical law, Planck made the assumption
stated in Section 6-2, that energy variation is limited to a discrete set of
values.
Let us use the Boltzmann distribution law, Eq. (5-10), with /3 = 1 /kT,
and, following Planck, impose the restriction that E can only have one of
the following values: 0, hv, 2hv, • — . The average energy associated
with the frequency v is then
_
E n=0 nhvAe
oo a 7 —7i hv I kT
Ev = hv
e-x(l - e~xr2
= hv
(1 - e~x) —i
= hv
1 — e~x
1
= hv
ex — 1
hv (6-28)
ehvjkT — 1
u, dv = E.N. dv = ■, (6-29)
the law which he had already found empirically and which is known as
Planck’s radiation law.
We shall now follow the method of Bose and show that Planck’s law can
be obtained from quantum statistics without recourse to such classical
concepts as oscillators or standing waves. We shall, however, assume
Einstein’s hypothesis that light is composed of photons such that the
Energy/photon = hv (6-30)
and the
Momentum/photon = hv/c. (6-31)
A = ~= 1 (6-32)
6-6] STATISTICS OF A PHOTON GAS. PLANCK’S RADIATION LAW Ill
1
for a photon gas. In place of Eq. (6-19), we have
N‘ = JSifzri - (6-33)
where again /3 = 1/kT.
We now let Ni be the number of photons in the volume V, and nv dv
the number per unit volume, between the frequencies v and v + dv, or
Ni — Vnv dv.
ffi = v dv-
Finally, the photons of frequencies close to v all have practically the same
energy
Ei = Ev = hv, (6-34)
This is the number of photons per unit volume with frequencies between
v and v + dv. It will be seen that Eq. (6-35) differs from (6-26), which
was based on the classical concept of standing waves and did not contain
the temperature.
The energy density of the radiation for the frequency range v to v + dv
is the product of nv dv and the energy per photon, or
7 \ Ti 8irhv dv /{%
Uv dv = (n„ dv)E, = —ehv/kT _ 1» (b-ob)
W = Wx • W2 ■ W3 ■ • • = UWi.
i
In computing W; the easiest method is to divide the cells into two groups,
group A including all occupied cells (one particle each), and group B all
unoccupied cells (no particles). There will be IV; cells in group A and
g — Ni in group B. Let us number the cells 1, 2, 3, etc. We might have
a distribution such as
Group A Group B
□ □□-
(6-37)
Cell 2 3 6 1 4 5
_j L_
■ y _j
Ni cells (9 — Nf) cells
6-7] FERMI-DIRAC STATISTICS 113
Group A Group B
(6-38)
Cell 6 3 2 1 5 4
is the same as (6-37). The cells in group A may be changed about in Nil
ways and those in group B in (g — Ni)! ways. Hence
Q' (6-39)
Ni\(g - Ni)\ ’
Having found the expression for IT, we proceed as we did with the M-B
and B-E statistics. Variations in the AVs from their values for the state
of equilibrium are taken to be subject to the three conditions (1) 8W -
5 (In W) = 0 because IT is a maximum, (2) 8N = 0 because N is constant,
and (3) bU = 0 because U is constant.
From Eq. (6-40), we have
In IT = £ [g In g - g - Ni In Ni + N<
%
= £ [g In g - Ni In Ni - (g - Ni) In (g - Ni)].
114 QUANTUM STATISTICS [CHAP. 6
If we let Ni vary by SN{, but keep g constant, the three conditions hsted
above become
S(ln W) = E In SNi = 0,
i %
SN = X SN i = 0,
i
8U = X EibNi = o.
i
g - Nj
E Ni
8N i = 0.
g - Nj
X - pEi = 0,
Ni
Ni = 1
g ex+?Ei 1
^i e^+PEi + 1 ’ (6-41)
^ = kf' (6-42)
Nh
A 2~y (ZTrmkT) -3/2 (6-43)
6-8] ELECTRON GAS 115
For an electron gas consisting of the free electrons in a metal, this formula
will lead to a much larger value of A than we computed before because (1)
m is much smaller, and (2) N/V is much greater. The mass of a nitrogen
molecule is 28 X 1840 times that of an electron. In a metal there is ap¬
proximately one free electron per atom, but the density of atoms in a solid
is several thousand times that of the molecules in a gas at standard condi¬
tions. Thus even at room temperature we cannot assume A « 1 for the
free electrons in a metal. Such an electron gas is said to be completely
degenerate, that is, at laboratory temperatures it never obeys classical
statistics.
To picture the distribution for an electron gas, let us start with absolute
zero. As the temperature of an ordinary gas is decreased toward abso¬
lute zero, its molecules crowd more and more into the cells of very low
energy, approaching the condition where they would all be in the cell of
lowest (zero) energy. However, because of Pauli’s principle, electrons can¬
not crowd into one or a few cells. The distribution leading to the lowest
total energy (the one we would expect to be associated with absolute zero)
is that where the cells of lowest energy are all solidly filled, each with one
particle. Enough of these cells from zero energy up to, say, Em must be
occupied to accommodate all of the electrons, as shown in Fig. 6-3(a).
All cells of higher energies (A; > Em) will be empty. Such a distribution
is shown by the solid rectangular curve in Fig. 6-4. Thus an electron gas
possesses a so-called zero-point energy at absolute zero.
Et
Ni
9i
T = 0°K Tx t2>t1
(a) (b) (c)
Fig. 6-4. Fermi-Dirac distribution
Fig. 6-3. Variation in cell distribu¬
tion with temperature, for electrons. function.
Picture a metal near absolute zero being given enough energy to raise
its temperature one degree. Very little of this energy will be transferred
to the electrons in the metal, for the average thermal energy of the mole¬
cules C’kTj is not enough to disturb those electrons which are not near the
116 QUANTUM STATISTICS [CHAP. 6
top of the filled cells. Such electrons would have to jump all the way up
to the unfilled cells above Em. However, a little energy might be absorbed
by electrons in those cells just below the highest one filled, raising them to
nearby unfilled cells, as shown in Fig. 6-3 (b) and (c). As the temperature
rises, more and more of the “knee” of the curve in Fig. 6-4 will be rounded
off, as shown by the dotted curves, which begin to resemble the exponential
shape corresponding to the Boltzmann distribution. However, suppose
that even at room temperature the rounding-off affects only a very slight
portion of the zero-point rectangular curve. This would mean that only a
negligible fraction of the free electrons share in the thermal motion, and so
their contribution to the specific heat would be small. The fact that the
specific heat of a solid at ordinary temperatures does not depend on whether
it possesses free electrons can thus be explained by the F-D, but not by
the M-B, statistics. Detailed calculations show that only at temperatures
of the order of 106 degrees will the electrons packed in the lower occupied
energy states be loosened and so contribute to the specific heat.
The solid packing of the energy states below E{ = Em corresponds math¬
ematically to X < 0 in Eq. (6-41). This would mean that ex < 1,
f^>1. Approximate calculations based on Eq. (6—43), assuming
one free electron per atom, also lead to A ]> 1. If we put
X = -EJkT,
Eq. (6-41) becomes
9i
Ni = (6-44)
e(E~Em)lkT 4 1
- ’
which, we shall see, corresponds to Fig. 6-4. Let N{ refer to just one cell
so that gt 1. If T is small, or 1/kT is large, the exponential term is
practically zero when E{ < Em and nearly infinite when Ei > Em; there¬
fore if we plot Ni vs. Ei, we will obtain a flat curve with a sharp knee at Em
as in Fig. 6-4. ’
Table 6-1
Molecules distinguishable a/
Uncertainty principle V v/
Pauli’s exclusion principle v/
Example. Given two particles to be divided among three cells, two of the
cells being in zone 1 and one in zone 2. Compare, using each of the statistics, the
probability for (a) Ni = 2, N2 = 0 with that for (b) Ni = 1, N2 = 1, and
that for (c) Ni = 0, N2 = 2.
92!_
(1) Fermi-Dirac: W = W\ W2 = .. ,,-:——
Ni\(gi — N1)! ■A/2! (92
— IV2)!
Zone 1 Zone 2
2! 1!
(a) Wa
2!0! ' 0!1!
2!
(b) Wb
Tm
(c) N2 — 2 is impossible, since zone 2 has only one cell and two particles
cannot be in the same cell.
Wc = 0.
Wa : Wb: Wc = 1:2:0.
3! 0!
(a) Wa
1!2! ' 0!0!
2! 1!
(b) Wb
111! ’ 0!1!
1! 2!
(c) Wc
1!0! ’ 0!2!
Wa : Wb: Wc = 3:2:1.
6-9] SUMMARY AND COMPARISON 119
i
(3) Maxwell-Boltzmann: Here we must consider not zones but cells of equal
size. If there are n\ particles in the first cell of zone 1 and m in the second,
N\
W = £
7ii!ri2hV2!
ab
a b
2! 2! 2!
Wa = 4
2!0!0! + 1!1!0! + 0!2!0!
b a
ab
2! ab
Wc = 1
0!0!2!
Wa :Wb-We = 4:4:!.
120 QUANTUM STATISTICS [CHAP. 6
Problems
1. Suppose that with an electron 9. From Eq. (6-36) show that the
microscope the position of a particle energy per unit volume integrated for
with respect to the X-axis is determined all frequencies is proportional to T4.
to within an accuracy of 10-10 m (1 A). Given f™x3dx/(e* - 1) = tt4/15,
Find the velocity range Av x per cell in show that the proportionality constant
phase space for particles of mass (a) is a = 87t5A:4/15c3/i3.
9 X 10-31 kg (electron), (b) 2 X 10~24 10. From Eq. (6-36) find the ex¬
kg (large molecule), (c) 3 X 10-15 kg pression for u\ d\, the energy density
(small oil drop). between the wavelengths X and X -f- dX.
2. Given 4 particles to be divided Differentiate and show that when u\
among 2 zones, one of 3 cells and one is a maximum the product XT' is inde¬
of 2 cells. For B-E statistics find W pendent of T (Wien’s displacement
for each macrostate from Ni = 4, law).
N2 = 0 to N1 = 0, N2 = 4, using 11. Show that the product XT7 in
both the formula and block dia¬ problem 10 is approximately equal to
grams. he/5k.
3. Repeat problem 2 for F-D statis¬ 12. Experimentally we find 7.64 X
tics. 10~16 joule/m3(°K)4 for the constant
4. Show in detail the derivation of a in problem 9, and 2.88 X 10~3 m-°K
Eq. (6-15) and Eq. (6-18). for the product XT of problems 10 and
5. Show that /3 = 1/kT for the (a) 11. From these values determine h
B-E statistics, (b) F-D statistics. and k.
6. Find the temperature at which 13. Use Eq. (6-43) to compute A
A = 0.5 for an ideal gas composed of for an electron gas, taking m = 9 X
helium molecules. (Take p = 1 atm.) 10~31 kg and N/V = 6 X 1028 free
7. Show that if the summations in electrons/m3, T = 300°K.
Eq. (6-27) are replaced by integrations 14. In Eq. (6-44) let gi = 1, T =
and n is considered to vary contin¬
300°K, and Em = 9.0 electron volts
uously from zero to infinity, then (tungsten). Find for (a) =
Ev = JcT.
8.9 ev, (b) Ei =. 9.0 ev, and (c) E{ =
_ 8. Show that Eq. (6-28) reduces to
9.1 ev. Note the sharpness of the knee
X, = kT if hv/kT « 1.
of the distribution curve.
1
i t
Part II
ELECTROMAGNETISM
%
CHAPTER 7
7-2 General outline. The fundamental laws of physics are those which
do not involve the special properties of a particular substance. In electro¬
magnetism such laws must be those that apply to charged bodies situated
in empty space, although the experiments leading to the discovery of these
laws were usually conducted in air. Fortunately, the presence of air makes
very little practical difference; therefore we shall divide our treatment of
electromagnetism into two parts and consider (1) the laws for charged bodies
in vacuo, or in a gas such as air at atmospheric pressure, and (2) the
modifications in these laws when the medium is a liquid or solid possessing
particular electric or magnetic properties. The first part, (1), dealing with
general laws, will receive the most attention; we shall include in the present
chapter a brief statement of these laws, and then we shall devote a separate
chapter to the development and applications of each (Chapters 8-11).
The second part, (2), will comprise Chapter 12, further development of
this more complicated branch of electromagnetism being left to more ad¬
vanced texts.
After briefly sketching the historical development of electromagnetism,
we shall try to present the subject in the most logical manner possible.
The historical growth of electromagnetism was much like putting together
the different sections of a jigsaw puzzle and then finally fitting these
sections together to form the complete picture. Now, of course, we can
start with a look at the picture as a whole and then make a detailed study
of its parts.
123
124 FUNDAMENTAL LAWS OF ELECTROMAGNETISM [CHAP. 7
of a general principle which we call Faraday’s law. From this law we can
compute the electromotive force induced in a circuit when the time rate
of change of the magnetic flux through the circuit is known. Faraday re¬
garded electric and magnetic forces as due to fields, and he pictured these
fields by drawing so-called lines of force to represent the direction and
intensity of a field.
Maxwell, in 1864, brought the theory of electromagnetic fields to its
culmination. Fie summarized, in the form of differential equations, the
laws governing electric and magnetic fields. These electromagnetic equa¬
tions embody the laws of Coulomb, Ampere, and Faraday, and they are
compatible with the continuity 'principle that electric charge can neither
be created nor destroyed. In his attempt to make his equations consistent
with that expressing the continuity principle, Maxwell introduced a term
called the displacement current into one of his equations. This term was
not predicted by the laws of Coulomb, Ampere, and Faraday, but its inclu¬
sion in his theory led Maxwell to the belief that electromagnetic waves
should exist. He computed the speed of such waves to be that of light and
so concluded that light and electromagnetic waves must be the same thing.
The experiments of Hertz on the properties of electromagnetic waves
confirmed Maxwell’s theory.
Since Maxwell’s time the point of view taken by physicists toward elec¬
tromagnetism has changed. First, since isolated magnetic poles have not
been observed, all magnetic effects are attributed to charges in motion,
in line with a suggestion made by Ampere. The magnetism of a piece of
iron is regarded as due to small “Amperian currents” (which actually may
be spinning electrons) in the iron. This viewpoint eliminates the need for
such a concept as the magnetic pole, and as a result it makes Coulomb’s
law for the force between poles unnecessary. We are left with the following
basic principles: (1) the law of force between static charges, (2) the law of
force between moving charges (or currents), (3) electromagnetic induction,
and (4) the displacement current postulate of Maxwell.
Another modification of the nineteenth century viewpoint is due to the
theory of relativity. The basic principle of this theory is that the laws of
physics which one would observe to hold true on, say, a rocket moving
with a high but constant velocity, would be the same as the laws that have
been discovered on or relative to the earth. This means that the laws of
Coulomb, Ampere, Faraday, and Maxwell must be such that from them
alone we cannot detect such a thing as absolute motion; in the above case
of the earth and rocket, one is no more “at rest” than the other. If we
change from the usual laboratory reference system (x, y, z, t) to a coor¬
dinate-time reference system (x'; y', z', t') moving with the rocket, Max¬
well’s equations should have the same form as before, i.e., they should be
invariant, according to the relativity theory. We shall find in Chapter 14
126 FUNDAMENTAL LAWS OF ELECTROMAGNETISM [CHAP. 7
that this is so. The relativity principle plays an important part in the
theory of rapidly moving charged particles. The expressions derived in
relativistic electrodynamics are more general (and more complicated) than
those encountered in the nonrelativistic, or “classical ” theory.
In the past thirty years quantum mechanics has also made its impact on
electromagnetic theory. This subject is treated briefly in the last chapter.
We shall now proceed to develop the theory of electromagnetism as
based on the laws of Coulomb, Ampere, Faraday, and Maxwell.
(7-1)
dt
7-5 Forces and fields. In mechanics, Newton’s laws are the general
ones, whereas Hooke’s law describes a property of a certain class of media.
Newton’s laws of motion are used to determine the accelerations and the
resulting motion of a system once the forces are known. However, physics
must also list and describe the different kinds of forces encountered in
our world. In mechanics we consider gravitational forces (including weight),
which are of a general nature, and forces such as friction, elasticity, and
viscosity, which are associated with specific objects or media. In addition
to such mechanical forces there are forces of a general nature that act only
on charged bodies (at rest or in motion), forces that are termed electro¬
magnetic. Finally, within the nuclei of atoms a third universal type of
force, associated with mesons, is believed to account for the strong forces
holding nuclear particles together.
At present we are concerned with electromagnetic forces. We must de¬
scribe when they occur, how they act, and in what manner they depend
on other physical quantities. For example, we must state what force one
charged body exerts on another and how this force depends on the posi¬
tions, velocities, and charges of the two bodies (as well as on the intervening
medium, if it is not equivalent to a vacuum). Like the law of gravitation
and Hooke’s law, our electromagnetic laws (1) postulate the existence of
new types of forces, and (2) lead to formulas relating the forces to other
quantities.
Forces of friction, elasticity, and viscosity involve contacts between
bodies. Electromagnetic forces, however, resemble gravitational forces
in that they seem to involve “action at a distance. ” Two charged bodies
A and B may interact even when separated by a vacuum, just as the
earth and sun interact gravitationally despite the miles of empty space
between them. We shall find it preferable to replace the action-at-a-
distance concept with the field point of view. We say that surrounding a
charged body A is a field, and that if the other charged body B is placed
in this field, then A’s field acts (exerts a force) on B; similarly, B’s field
acts on A, so that A and B interact mutually, in agreement with Newton’s
third law. This concept of fields, with lines of force to represent them,
may at first seem to be merely a dodge to make action at a distance seem
130 FUNDAMENTAL LAWS OF ELECTROMAGNETISM [CHAP. 7
E (7-2)
defines the electric intensity at the given point. The direction of E is that
of the force on a positive charge located in the field. From this definition
the mksc unit for E is seen to be the newton per coulomb (abbreviated to
n/coul). Since F is found to be proportional to q', this definition makes the
value of E independent of q'.
(b) When a small charge q' moves in a magnetic field with a velocity v',
the force F acting on it is found to be perpendicular to v/ and to depend
on (1) the charge q , (2) the speed v', and (3) the direction of motion.
The force has a maximum value Fmax when the velocity v' lies in a certain
plane, while the force is zero when v' is perpendicular to this plane. The
value of Fmax is found to be proportional to q' and to v'. (We shall see that
these statements are all contained in Ampere’s law.) The vector B rep¬
resenting the magnetic field is defined as follows. The magnitude of B is
taken to be
o F
1 max
B = w (7-3)
and the direction of B is chosen normal to that plane in which v' must lie
m order for the force to be a maximum. This means that when v' is
parallel to B the force will Ipe zero. While the positive and negative
directions of B will be distinguished later, the convention usually followed
is indicated in Fig. 7-1.* B is called the magnetic induction, or the magnetic
flux density. From Eq. (7-3) we see that the mksc unit for B is the newton-
second per coulomb-meter (n-sec/coul-m). However, it is customary to
define a new mksc unit, the weber, as
„ , „ newton-meter-second
1 weber — 1 --,
coulomb
—17
_ 24.0 X 10— _ 1500 n/coul to the west.
1.6 X 10-19
8.0 X 10
B = 5 X 10 5 w/m2.
1.6 X 10-19 X 107
(a) Ampere’s law. As in Fig. 7-2, let I ds and I' ds' be two current ele¬
ments a distance r apart, and let r j be a unit vector directed from the
first current element toward the second. Assume the currents to be steady.
Ampere’s law states that the two current elements will interact, the force
d~F on I' ds' due to / ds being given by
jj■/ ds x (ds x
1 ^2 (7-4)
and the force is attractive if the currents are in the same direction. We can
say that the ampere is that steady current which will produce an inter¬
action of 10~7 newton when present in each of two parallel current ele¬
ments of such dimensions and separation that dsds'/r2 is unity. We see
in this way how Eq. (7-6), and hence Eqs. (7-4) and (7-5) together,
define the ampere.
One coulomb is that quantity of charge which in one second passes any
section of a circuit carrying a steady current of 1 ampere.
(b) Coulomb’s law for charges. As in Fig. 7-3, let q and q' be two point
charges a distance r apart, and let rj be a unit vector directed from q
toward q'. Assume the charges to
be at rest. Coulomb’s law states
that the two charges will interact,
the force F on q' due to the presence
of q being given by
This is a positive constant; in other words, we find that like charges repel.
For two charges, each of one coulomb, one meter apart, the force would
be 9 X 109 newtons, a tremendous force. However, while currents of 1
134 FUNDAMENTAL LAWS OF ELECTROMAGNETISM [CHAP. 7
to 10 coul/sec are not excessive, a static charge on a small pith ball will
measure about 10-8 coulomb. For atomic particles q is of the order of
10-19 coulomb, although r may then be as small as 5 X 10-11 meter
(as in the hydrogen atom). From
dS
surface S (Fig. 7 4) and let /\E • dX be the line integral of the electric
intensity around the periphery A of this surface. Take an element dS
of the surface, in the normal direction. The positive normal is determined
by the way a right-handed screw would advance when turning in the same
sense as that in which the line integral is taken. Now form the surface
integral of the time derivative of the magnetic induction, namely,
JXs (dB/dt) • dS. Then Faraday’s law states that
/*•*=-//“•■»• p-io)
It may be noted that all quantities in Eq. (7-10) have been previously
e ned (see Eqs. 7 2 and 7-3) and yet no new constant appears. The
7-7] SUMMARY 135
rJB-dx
x
= ff*-da’
s
(7'u)
when there is no net current through the surface S. This equation states
that a changing electric field induces a magnetic field. Maxwell’s equation
has been confirmed directly and, as was said earlier, it has received a
great deal of indirect confirmation because it led to the prediction of
electromagnetic waves.
Any experiment based on Maxwell’s postulate may be used to find the
ratio CyC'i- Since we arbitrarily chose C\ to be 10 7 n/amp2, such
measurements serve to evaluate C2 in mksc units. In this way we obtain
the value of 9 X 109 n-m2/coul2 for C2. If we assigned a different value
to Ci, the value of C2 would also change, but the ratio C2/C\ would be the
same. This ratio is the important quantity; its value is close to
— = 9 X 1016m2/sec2.
C1
which is the speed of light in empty space. This is one of our universal
constants. It makes its appearance in electromagnetic theory regardless of
what system of units is chosen. We are encountering here a constant of
Nature that also appears in relativistic mechanics, but not in Newtonian
mechanics.
Problems
8-1 Coulomb’s law of force between charged bodies. This law may be
stated in words as follows:
Bodies with like charges repel and with unlike charges attract one another;
for point charges the force of interaction is proportional to the product of the
charges and inversely proportional to the square of their distance apart.
Expressed mathematically [see Section 7-6(b)], Coulomb’s law states
that the force F on a point charge q' due to the presence of another point
charge q is
(8-1)
E = (8-2)
We have seen that when q and qf are expressed in coulombs (as defined
by Ampere’s law), and when r is in meters and F is measured in newtons,
then for free space (a vacuum) C2 has a measured value close to
9 X 109 n-m2 /coul2. If we had chosen a larger unit in terms of which
to express charge, then, for the same two charges, q and qf would be smaller
numbers; to preserve the equality of C2qq' /r2 to F, C2 would have to have
a larger value (see problem 6 of Chapter 7). Similarly, a smaller unit of
charge would result in a smaller value for C2. We prefer to take the coulomb
as our unit of charge because the units derived from it (ampere, volt, ohm,
farad, etc.) are those used in common practice.
It is found convenient to replace C2 by another constant «o, defined by
1
C2 (8-3)
4tT €q
The reason for introducing the factor 4ir here is to avoid its appearance
later on, particularly when we come to Maxwell’s electromagnetic equa¬
tions. From the numerical value of C2 given above we obtain for e0 a
measured value, in the mksc system, close to
137
138 coulomb’s law. electkostatics [chap. 8
1 1 coul2
e° ~ 4^C ~2 ~ 36x X 109 n-m2 '
1 (8-4)
47reo r2
(8-5)
where q' is the test charge and F the force on it due to the presence of
other charges.
At a distance r from a charge q the force on another charge q' is given
by Eq. (8-4), so that the electric intensity at q' due to q is
1 q_
E = (8-6)
4x6(3 r2 r
where rx is a unit vector directed away from q. Equation (8-6) gives the
field due to a point charge only; it is not a definition of E or a general
expression, such as is Eq. (8-5).
If the electric field is due to the presence of more than one charge, each
exerts its own force on the test charge. Since forces add vectorially, so also
do the fields of the separate charges, and we have for the resultant field
E = 1 V Si r (8-7)
4x6(3 4^ r2
u l l
ll’
where the subscript i refers to any one of the charges. If the charges are
spread over surfaces, or occur as a continuous (macroscopically speaking)
volume distribution, then the summation in Eq. (8-7) is replaced by an
integration. The integration of a vector function must be performed on
one component at a time; this gives us the components of the resultant
vector. Actually, however, we seldom use this method in practice, for we
shall find others that are better. It is assumed that the reader is familiar
with an elementary text in which examples of the application of Eq. (8-7)
are given.*
8-3 Gauss’ law. The electric field of a point charge and the gravitational
field of a mass particle are both central, inverse square fields. Because of
this close analogy, many of the expressions and theorems applying to
electric fields may be found from the corresponding ones derived for
gravitational fields. Charge corresponds to mass, electric intensity to
gravitational field, and l/47re0 to minus the gravitational constant.* The
reader who is familiar with the gravitational expressions may use this
analogy to arrive at the equations which will now be derived directly from
Coulomb’s law.
Consider any surface S which surrounds a volume r in which is located
a point charge q (Fig. 8-1). At any element of area dS let 6 be the angle
between the direction of dS and that of the electric intensity E due to q.
From Fn. fS-fil. we have s
= 47rg,
Fig. 8-1. Gauss’ law.
where dtt is the solid angle subtended by dS at q and 4tt is the complete
solid angle about a point. If more than one point charge is present within
S, we will find that
(8-8)
s
(8-9)
s T
While Eqs. (8-8) and (8-9) state what is generally called Gauss’ law for
electrostatics, this “law” should be considered a theorem derived from
Coulomb’s law, or an alternate form of Coulomb’s law.
*See Yol. I, Sections 5-10 and 5-11, where Gauss’ law is stated in Eq. (5-28).
140 COULOMB S LAW. ELECTROSTATICS [CHAP. 8
With the aid of Gauss’ law we can quickly derive expressions for the
electric field in the neighborhood of charged spheres, cylinders, and
infinite planes, as we shall briefly show.
Illustration 1. Charged sphere. Suppose that a charge q is uniformly
distributed over the surface or volume of a spherical body. We take for
the surface S in Eq. (8-9) a sphere of radius r outside and concentric with
the charged sphere. From symmetry considerations, the vector E must be
constant in magnitude over S and directed radially outward if q is positive,
so that E • dS = EdS. We then have
E = (8-10)
4ir«o r2
Thus, outside the sphere, the field is the same as it would be if the spherical
distribution were replaced by a point charge q at the center. (Compare
the analogous gravitational theorem.)
Illustration 2. Charged cylinder of infinite length. Let the charge per unit
length be constant and call it X. Take the surface S to be cylindrical, con¬
centric with the charged cylinder, and of unit length. Then from Gauss’
law the field outside the charged cylinder at a distance r from the axis is
given by
/ / e0E • dS = e0F(2-7rr) = X,
1
E = (8-11)
2-7reo r
£oE • dS = toEA.
s
eo EA = <jA,
E = —• (8-12)
eo Fig. 8-2. Electric field between
charged plates.
The field is directed as shown in
Fig. 8-2.
Gauss’ law is also employed in elementary texts to show that (1) the
charge on a solid conductor resides entirely on the surface, and (2) if the
conductor is hollow and empty, there can be no net charge on the inner
surface.
(e0 is a constant and can be put before or after V). Since the limits of
integration are the same for each side of this equation, we may equate the
integrands and obtain
V • E = p/eo- (8-13)
142 coulomb’s law. electrostatics [chap. 8
V x E = V x £ E, = £ v x Ei = 0. (8-14)
i i
Therefore, in general, in electrostatics the curl of the field is zero, and the
field is again referred to as irrotational and conservative. Given the value
of p for all points in free space, Eqs. (8-13) and (8-14) define E and so are
equivalent to Coulomb’s law.
™tin mX/
ratio —dW/q ' , that
iny+WhlCh
is, the deCtriC v°tential at b exceeds that at a as the
(8-15)
Let us compute the work per unit charge performed by an agent when a
charge is taken around a closed path in the field, say from the point a
back to a. From Eq. (8-15), we have
dV = -
f E • ds,
/ E • ds = 0,
or the line integral of the electric intensity around any closed path is zero
in*electrostatics. This is just equivalent to the previous statement that
an electrostatic field is irrotational.
144 coulomb’s law. electrostatics [chap. 8
Let the components of E be Ex, Ey, and Ez. Take the case where ds is
in the X-direction, so that E • ds = Ex dz. For this displacement dV =
—Ex dx. Since this represents a change in V when y and 2 are held con¬
stant, we have
u
dx = u.
In a similar manner, we find that
«Z__p p
dy ~ " dz - E‘-
E = — VF, (8-16)
. d d
V = 1
dx Jaj + k5’
and i, j, k are unit vectors in the three principal directions.
Since electric potential has the dimensions of work divided by charge
the mksc unit is the joule/coulomb. This is also called the volt, which is
thus defined as
1 volt = 1 joule/coulomb.
dV nx
E = — — (8-17)
dn
just outside a conductor.
We may use Gauss’ law to find a relation between E at a point just out¬
side a charged conductor and a, the charge per unit area on the conductor
in the neighborhood of the point. Let us refer back to Illustration 3 of
Section 8-3, where we took for the surface S in Gauss’ law a pillbox en¬
closing an area A of the conductor’s surface. We may follow the same pro¬
cedure here. In both cases there is no field inside the conductor, and just
outside the conductor the field is normal to the surface, therefore the re¬
sulting expression will be the same, namely,
a
E = — ni.
eo
dV
a = — e o —— (8-18)
dn
This equation may be used to find the charge density at any point on the
surface of a conductor, once the potential function V is known.
Example 1. The electron volt has been defined (see Section 4-5) as the work
done in taking a particle bearing the electronic charge (1.6 X 10 coul) through
a p.d. of 1 volt. Thus
plate.
146 coulomb’s law. electrostatics [chap. 8
7 = 0 V = 200 V = 100
AV 200
E = —200 volts/m.
Ax 1
F> (-100)
= -f- 50 volts/m
to the right.
From a — eoEn, we see that
2
cra = —200 eo coul/m ,
2
<rc = +50 eo coul/m .
B has a charge density of +200eo coul/m2 on its left side and one of —50eo
coul/m2 on its right side, or a net charge density
07 ax
—ax. V a possible function of y,
dx 2
8-5] ELECTRIC POTENTIAL 147
t
37
6, V = by + a possible function of x.
dy
Hence
V = —\ax2 + by.
= 10 / (ax dx — b dy)
along any path connecting the two points. (The reader should check this.)
(c) From Eq. (8-13), we have
p = eoV • E =
= to a.
(Numerical values may be substituted for eo and a.) There is thus a uniform
positive space charge. This space charge alone does not determine the field, for
the charges on, or potentials of, the surfaces bounding the field must also be
given if the field is to be completely specified.
Example 4. An electric field is represented by the potential function V
b cos 6/r2, where b is a constant and r and 6 are polar coordinates. Find the
polar components of the electric intensity.
dV _ 2b cos 6
Er =
dr r3
1 dV b sin 9
Ee =
r dd r3
This is the field due to an electric dipole, i.e., to two equal and opposite charges
very close together, centered at the origin. We shall develop the expression for
the dipole potential in the following section.
148 coulomb’s law. electrostatics [chap. 8
E = —
dr
Hence
dr
^*2
4x60 r~ + C-
V = -J- (8-19)
47T€o r
This gives the electric potential due to a point charge only and it is not a
definition of or general expression for V.
The value of the potential concept becomes more apparent when we
consider a field due to several charges. Then, since potential is work per
unit charge, which is a scalar quantity, the potential due to a number
of charges is found by algebraic addition, i.e.,
V =
1 gj.}
(8-20)
47re0 4- Vi '
where n is the distance from the charge q:i to the point P(x, y, z) where V
is to be determined. Since V is a scalar quantity, the summation is much
easier to evaluate than in the case of E (Eq. 8—7). Hence it is often simpler
to first compute V and then to find E from E = —VF. This point will
now be illustrated.
Illustration. Potential due to a group of point charges located near the origin.
Suppose that we have a finite number of charges grouped about the origin
and that the distance of any charge q{ from the origin is much smaller
than the distance r from the origin to the point P(x, y, z) at which we
wish to compute V (see Fig. 8-4).
Let the coordinates oFr/,- be we then have for the distance
from qi to P
1
V =
47Te0
8-6] POTENTIAL DUE TO POINT CHARGES: MULTIPOLES 149
i
V = Vi
dfi yj 0 + .2 vj. 0 d + .2 W-
dr,*
Vi
1 d2 (iV
+ 2 W -dtjidrj i
(—)
Vi/Jo
%iVi
+ (8-22)
drudt; r) vrti
Vfi^li Vi.
01
1 1
v=
4we0 Z ^ ^ 1Z + m 2 ?«< + w Z ms*
where l = x/r, m = y/r, n = z/r are the direction cosines of the vector r.
The quantities indicated by the summation signs depend on the distribu¬
tion of the charges only and not on the location of P; these quantities are
properties of the charge distribution just as the total mass, moments of
inertia, and products of inertia are properties of a given distribution of mass
particles.
The first summation, Yli qi, is the total net charge. It is called the
monopole moment of the distribution and it is a scalar quantity.
The next three summation terms are called the three components,
mx, my, mz, of the dipole moment of the distribution; this is a vector with
the components
1 1
V = [lqd\
4tt €q r2
b cos 9
(8-25)
Fig. 8-5. Electric dipole.
where 6 is the angle between r and the X-axis and b is independent of the
coordinates of P. This is the potential function taken in Example 4 of the
last section. From it we easily obtained the field of a dipole.
Similarly, the third bracket contains six summations which are called
the components of the quadrupole moment of the distribution; this is a
8-6] POTENTIAL DUE TO POINT CHARGES: MULTIPOLES 151
i i
E2
myy Q iV i. 'lUyz — mzy £ qmrfi, (8-26)
i
Note that the potential due to the monopole moment varies as 1/r,
that due to the dipole moment as 1/r2, and that due to the quadrupole
moment as 1/r3. The term in 1/r4 would contain the octupole moment, etc.
In Fig. 8-6 are shown some charge distributions illustrating the physical
meaning of the electric moments that we have defined. In (a) only mxx is
not zero, that is, we have a linear quadrupole. In (b) we have a linear
quadrupole plus a monopole (2q). In (c) we have only mxy not zero, or
what is called a square quadrupole. In (d) the distribution has monopole,
dipole, and quadrupole moments.
Si
q — 2q 9
! A J
Linear quadrupole
(a) (b)
q . • 9
T -q
d- -X -x
1 • —q
I—d—~\
i
Square quadrupole
(c) (d)
(8-27)
where r is the distance from the volume element dr (containing the charge
p dr) to P. Even if the point P is within the volume r, so that there is a
charge density at the point of observation (where r = 0), Eq. (8-27) may
still be used. In this case, in the immediate neighborhood of P, we may
take for volume elements concentric spherical shells, so that dr = 47rr2 dr.
With p finite, as assumed, the integrand pdr/r = 4-rrpr dr is also finite,
no matter how small r is.
In a similar manner, the potential due to surface and line distributions
of charge may be computed by evaluating surface and line integrals, re¬
spectively.
The total potential V at a point in an unbounded region may be found
by adding the contributions due to all point-charge and continuous-charge
distributions. In a bounded region, however, the potential at a point P
cannot be calculated unless we also know the potential at every point on
the boundary. For example, within an empty hollow conductor we have a
region containing no charge, even at its boundary (the inner surface of the
conductor), yet, if the conductor is insulated, we may change the potential
at a point within this region by varying the charge on or near the outer
surface of the conductor. Problems involving boundary conditions will
now be considered.
V • vv = 0, (8-29)
is also met in many other branches of theoretical physics, such as the irrota-
tional flow of an incompressible fluid, steady heat flow, etc.; the scalar
function V has a different physical meaning in each case.
A typical electrostatic problem might be stated as follows. Certain con¬
ducting surfaces, each with a given size, shape, and position, are main¬
tained at stated potentials; we wish to find the electric intensity in the
region between these surfaces, and the charge density at any point on a
surface. If the space between the conductors is empty, we solve Laplace’s
equation; if a space charge exists in the region, we must be given the func¬
tion p and solve Poisson’s equation. In either case, the solution of the
partial differential equation (8-28), (or 8-29), will involve constants of
integration which must be determined from the boundary conditions.
These conditions are that V must reduce to the stated potential of a con¬
ductor when the coordinates are given values corresponding to points on
the surface of that conductor. Once the constants of integration have been
determined, we shall have found a particular solution or function V of the
coordinates which satisfies Poisson’s or Laplace’s equation, as the case
may be, and which reduces to the correct values of the potential at all
boundary surfaces. This solution will be mathematically and physically
unique, that is, for a problem such as this, which is completely defined, it
is the only solution. Once V has been found, we may differentiate V to find
the components of the electric intensity at a given point and, with the aid
of Eq. (8-18), the surface charge density on any of the conductors present.
The above theory is all based on Coulomb’s law, from which Poisson’s
equation Eq. (8-28) was derived.
The general solution of Poisson’s or Laplace’s equation is beyond the
scope of this text, but examples illustrating the solution of electrostatic
problems will be given in this and the following two sections of this chapter.
Illustration 1. One-dimensional fields. Consider a field in the X-direction
only, such as that between two parallel conducting plates, one at x = 0
and the other at x = L. Let the first plate be at a potential of V0 volts
and the other at zero potential. Let us find the electric intensity and surface
charge densities when (a) p = 0, (b) p = k, where k is a constant.
Solution. Since this is a one-dimensional problem, V is a function of x
only, and Eq. (8-28) becomes
d2V _ p
(8-30)
dx2 e0
d2V
dx2
= o, (8-31)
[chap. 8
154 coulomb’s law. electrostatics
V = Ax + B.
V = - ^ x + V0. (8-32)
F dV Vo
eo To
1
L
(Note that for the right-hand conductor the outward normal is in the
negative X-direction.)
(b) Letting p = k, Eq. (8-30) becomes
d2V k
(8-33)
dx2 e0
Integration gives
V = - ~ x2 + Ax + B.
Z€0
dV\ kL eoFo;
c?t\ kL cq Lq
eo^o
O’L
°L
—
“
+«0
+‘° U /,.1 " ~ V ~ T
2 L
Note that for the two plates together the net charge per unit area is
—kL, while the total space charge between these unit areas is -\-kL. In
a closed system, that is, one across whose boundary no field passes, the
total charge must be zero, according to Gauss’ law. In the present example
there is no field within either conducting plate, and so the surfaces of the
plates and the space between constitute a closed system with zero net
charge.
Illustration 2. Fields with zonal symmetry. Let us assume that the field
is symmetric about the X-axis and that p — 0. We write Laplace’s
equation in terms of spherical coordinates, omitting the 0-term, since V
is independent of 0. We then have
(8-35)
(8-36)
*gee Vol. I, Section 14-6; also see Margenau and Murphy, The Mathematics of
Physics and Chemistry.
156 coulomb’s law. electrostatics [chap. 8
Bo
and
Each of these equations must be an identity for all values of the coordinate
6. The P’s are functions of 6, so that as d is varied, the P’s will be altered,
each in a different manner. Hence for either equation to be an identity,
each P must have the same coefficient on each side of the equation. Equat¬
ing coefficients of like P’s in each equation, we get
and
All of the constants are zero except A1} and Px = -a3Ai = a3E0, so
for this particular problem Eq. (8-36) becomes
The term E0r cos d represents the potential due to the uniform field and
the term a E0 cos 6/r2 is the potential due to the sphere. The latter term
is of the same form as for a dipole, Eq. (8-25).
8-8] poisson’s and laplace’s equations 157
1
dV 2ad
Er =-r— = Eo COS 9 ~\-r- E0 COS 9,
or rA
dV
o = — e0 \ l = 3e0E0 COS 9.
«.OT/r=a
(8-38)
dx2 dy2
Z
(From symmetry, V cannot depend
on z.) Fig. 8-8. Semi-infinite planes.
158 coulomb’s law. electrostatics [chap. 8
1 d2X _ 1 d2Y
(8-40)
X dx2 Y dy2 '
Since the left side of this equation is a function of x alone and the right
side is a function of y alone, x and y independent variables, each side must
be equal to the same constant, called the constant of separation. We shall
call this constant a; its value will be determined by the boundary conditions
of our particular problem. We may then write the two ordinary differential
equations
d2X d2Y
aX and —aY. (8-41)
dx2 dy2
X = Aie^x + A2e~^ax,
(8-42)
Y = Ble^v + B2e~^v .
Substituting the solutions in Eq. (8-42) back in Eq. (8-39) and combin¬
ing constants, we get
V = Ae^xe^v + A'e^xe~^y
(8-43)
+ Be~raxe^~av + £'e-V^e-V=^
We are now ready to introduce the boundary conditions for our particular
problem, in order to determine the constants. Equation (8-38) contains
second order derivatives in both x and y, so that it was necessary to inte¬
grate twice with respect to each variable. Hence for boundary conditions
we must know the value of V at (a) two values of x for all values of y,
and (b) two values of y for all values of x.
Let us take our vertical planes to be grounded plates at x — 0 and
x = L, and let us take the bottom plane (y = 0) which closes the gap to
be an insulated plate which is held at the constant potential Vo- The
effect of this plane must decrease as y —> oc. Hence we have
The second part of condition (b) tells us that_a must be negative, since
otherwise y/—a would be imaginary and e±v'~aV would oscillate in value
but not approach zero as y approaches infinity. Then, with y/ a real, we
must have A = B = 0. _
Putting a = —P2, where P is real, and i = y/—l, we have
V = A'eipxe~pv + B'e-ipxe~pv.
If we use the identity eix = cos x + i sin x and introduce new constants,
this may be written
717T 1 O Q
sin PL = 0, P = -jj ’ n = 1, 2, 3, • .
Adding the solutions for each value of n in order to obtain a more general
solution, we get
V • VF =-- • (8-28)
«0
1
V = (8-45)
4tT€0
where rq and r2 are the distances to the two charges. At the point P(x, y, z)
g _1_ _1_
V = (8-46)
47T6o \/(x — d)2 + y2 + z2 VW+ d)2 + y2 + 22
We see that this satisfies both boundary conditions. Each term in Eq.
(8-45) or Eq. (8-46) satisfies Laplace’s equation and so V does also.
This must then be a solution of the problem.
For convenience, let the XF-plane include P and the charges. The
components of the field to the right of the plane (behind the plane there
is actually no field) are
dV q I (x ~ d)_(x + d) |
„ dV q (_y__ _y_ . 1
E* = _ ~du ~ 47T60 \[(X - d)2 + y2p/2 [{x + d)2 + y2?12J
(8-47)
162 coulomb’s law. electrostatics [chap. 8
The charge density induced on the plane at a distance y from the origin is
7- 0
~9- • 9
V= 0
9* • —9
In a problem (Fig. 8-11) where the grounded plane is bent to form two
planes at right angles, we must introduce three image charges in order to
satisfy the boundary conditions (V = 0 for both the vertical and horizontal
planes) These image charges again resemble mirror images. Behind the
vertical plane we put an image charge -q as before; below the horizontal
8-9] THE THEORY OF IMAGES 163
plane we put another image charge —q, the same distance from the plane
as the given charge q; finally, a charge of -\-q, the image of the image
charges, must be located, as in Fig. 8-11. The field in the quadrant be¬
tween the planes is that of the four charges.
When two planes meet at any angle 0, a finite number of images is ob¬
tained only when 6 = w/n, where n is a positive integer.
To prove this, we shall first find the potential due only to q and q' and then
show that it satisfies the boundary conditions.
At any point P(r, 6) we have for the potential due to q and q'
V =
r~
4tt€o (-
Vi + r2/
-)>
as in Eq. (8-45). Substituting for r1 and r2 and using Eq. (8-49), we get
V =
1 __g_
€q
VV2 + d2 — 2r d cos 6
a _q_
d Vr2 + (o4/d2) — 2r(a2/d) cos 6 (8-50)
The field obtained is that outside the sphere only (inside it is zero). The
total induced charge is q'. The force of attraction F between the charge
q and the grounded sphere is that between q and its image, or
q2 a 1
(8-51)
47re0 d (d — a2/d)2
2 = x + iy, i = V—1.
This complex function can be broken up into a real part g(x, y) plus an
imaginary part ih(x, y), that is,
dg dh
(8-53)
- %t dy dig
Since / is a function of 4-
dl=idl.
dy dx
dg . dh . dg dh
dy 1 dy 1 dx dx
When we equate first the real and then the imaginary terms on each side
of the last equation, we obtain
dg dh dh dg
dy dx ’ dy dx
Substitution of these relations back in Eq. (8-53) shows that the scalar
product of the gradients is zero. The gradients, and hence the two
corresponding sets of curves, must intersect orthogonally.
We next prove the following
Theorem: The functions g(x, y) and h(x, y) each satisfy Laplace’s equa¬
tion.
-2 d2h
d2g d2h d g _
dy2 dydx dx2 dxdy
Since
d2h d2h
dxdy dydx '
we have
dg + —
n2 _
= 0.
dx2 ^ dy2
d2h d2h
dx2 + dy2
V = 2 kxy. (8-55)
Then the equation 2kxy = B will, as we vary B, give us the equations for
different equipotentials. These will be equilateral hyperbolas, as shown in
Fig. 8-13, and they may be thought of as traces in the XF-plane of equi¬
lateral hyperbolic cylinders extending parallel to the Z-axis.
Now consider the equipotential for which B = 0; this is represented by
xy = 0 or x = 0, y = 0.
This corresponds to the YZ- and XZ-planes. Suppose that we take these
planes to be conducting. Taking V = 0 for these planes simply means
that potentials at all other points are given relative to the potential of these
planes. We may now investigate the field in the quadrant between these
planes, for which x, y, and B are all positive. In order to have a bounded
region, we may think of this field as extending out to another equipotential
surface which could be another conductor at the potential of that surface.
This would be a hyperbolic cylinder, and we can take it as far from the
origin as we wish.
8-10] CONJUGATE FUNCTIONS 1G7
<
dV
= -2t*'
(8-56)
dV
Ey= - -2 kx,
dy
(J =
(d_v) —2 t0kx for the horizontal plane.
€° \dy/v=o
Note that the charge density and field increase with distance from the
corner of the planes outward.
The direction of the field could also be found from the first theorem we
derived, which tells us that the two sets of curves given by Eq. (8-54)
are orthogonal. Since we let 2kxy = B represent our equipotentials,
must represent the lines of force. The lines of force are hyperbolic and, for
the quadrant we are considering, asymptotic to the line x — y, as shown
by the dotted lines in Fig. 8-13. To prove that Eqs. (8-56) and (8-57) both
168 coulomb’s law. electrostatics [chap. 8
lead to the same field, let us write the differential equation for a line of
force. Since an element of a line of force has, by definition, the direction of
the field, the components dx and dy of such an element must satisfy the
equation
dx _ d y
(8-58)
Ex ~ Ey '
dx _ dy
or x dx = y dy ,
y ~ x
2
x a constant,
the positive plate. At this instant the field (force per unit charge) will be
acr/e0 (see Eq. 18-12), so that the work done on da will be
The total work done, or energy supplied to the system, in building up the
charge a will be
2 7 f1
W = — / a da
Co Jo
1 a2 3 4l
2 eo
= hoE%
since E = a/e0. Since the field is uniform, we may picture this energy W
as being stored throughout the region (of volume Z) between unit areas of
the plates. We then have
Problems
case a charge would be drawn from the 15. Two conducting spheres are con¬
outer to the inner surface of a charged centric, the radius of the inner being
spherical conductor. The fact that no a and that of the outer b. One sphere
such charge is found on the inner sur¬ is grounded and the other is at a po¬
face was shown by Cavendish to be a tential Vo■ Find the charge on each
sensitive experimental proof of the when the grounded sphere is (a) the
inverse square law. outer one, (b) the inner one.
C'SyTwo horizontal metal plates are 16. By integration of the charge
1 cm apart in air. An oil drop bearing density a, find the total charge induced
a charge of one electronic unit is bal¬ on a grounded plane when a charge q
anced between the plates when the is placed in front of the plane.
p.d. between them is 800 volts. Find 17. (a) Find the image charges when
thnmass of the drop. a charge q is at equal distances d from
(T9. JAn electric field is represented by two grounded planes meeting at 60°.
E^= ay, Ev = ax, where a = 100 (b) Find the force on the charge q.
volts/m2. Find (a) the potential func¬ 18. A conducting sphere of radius a
tion V, taking V = 0 at the origin, is at a potential Vo. The center of the
(b) the work done by the field when sphere is at a distance d/2 from an in¬
10-8 coul is taken from x — —1 m, finite grounded plane. Neglecting
y = —2 mtoz = 2 m, y = 3 m, and terms smaller than a2/d2, find the
(c) the charge density p at any point. image charges and the charge on the
10. If the electric potential for a sphere.
field is V = ax/y, where a is constant, 19. The centers of two conducting
find Ex, Ey, and the charge density at spheres are a distance d apart. One
any point P{x, y). sphere, of radius a, is at a potential Vo,
11. (By direct application of Eq. the other, of radius b, is grounded.
(8^=20), find the electric potential at the Find the charges on each sphere, neg¬
point P{r, 0) due to an electric dipole lecting terms smaller than (a) a/d and
consisting of a charge -\-q at x = d/2, b/d, (b) a2/d2, ab/d2, and b2/d2.
y = 0 and a charge — q at x = —d/2, 20. Consider the complex function
y = 0. Use polar coordinates and take f(z) = kz = k(x iy). To what elec¬
r^>d. trostatic problem does it lead, if we
12>Find directly the potential at follow the procedure of Section 8-10?
Pfe-tf) due to the quadrupole shown 21. Repeat problem 20, taking/(z) =
in (a) Fig. 8-6(a), and (b) Fig. 8-6(c). kz112.
13. Solve Poisson’s equation, given 22. A sphere of radius a bears a
that the field is in the X-direction and charge q. (a) Find the work done in
P = kx, where k is constant. Take as
charging the sphere, (b) From Eq.
boundary conditions V = 0 at x = 0, (8-59), by integration, find the energy
V = Fq at x — L. stored in the field surrounding the
14. Solve (so far as possible) La¬ sphere.
place’s equation for a region bounded
by a rectangular box. Introduce ap¬
propriate boundary conditions.
CHAPTER 9
171
172 ampere’s law. steady magnetic fields [chap. 9
We assume that 7 and /' are steady currents. The direction of the unit
vector rj is from one current element (say I ds) toward the other (/' 7s'),
on which the force is to be calculated. * When 7s and 7s' are parallel or
antiparallel, the two current elements interact on each other with equal
and opposite forces, and when the currents are in the same direction the
interaction is attractive, as explained in Chapter 7.
It must be admitted that Eq. (9-1) cannot be verified by direct experi¬
mental testing because in practice we deal with complete current circuits,
not with idealized current elements. This objection may be answered as
follows.
We shall see that Eq. (9-1), which can be termed a “point statement”
(since current elements have small spatial dimensions), is more general
than any of the formulas that give the force between whole circuits, since
the latter statements depend on the size and shape of the circuits. Our aim
is to express a fundamental law of nature in its most general form and then
to apply it to special cases. If from our general expression we derive ones
that can be tested experimentally and which, when so tested, are always
in agreement with fact, then our original “law” can be considered to have
the necessary validity. The statement of the law of gravitation in terms
of two idealized point masses and its subsequent extension to finite bodies
is a case in point. A statement such as Eq. (9-1) may be called a shrewd
“guess,” based on hindsight, but after all it is due to such guesses that
theoretical physics has progressed and men like Newton and Planck have
become famous.
We have seen that the factor C\ is arbitrarily chosen, thus determining
the unit of current or charge. The ampere, and hence the coulomb, are
defined by taking
In important expressions later; the factor 4tt will not appear if we use p0
rather than C i.
the direction of the forward velocity being the same as that of ds'. Since
pA ds' = q', we obtain
I'ds' = q'v', (9-5)
which means that the force on a current element I' ds' is the same as it
would be on a charge q', at the same location and moving with a velocity
v', provided that I' ds' = q'v'.
We may now say that for a charge q', moving with the velocity v', the
force due to a current element I ds, according to Ampere’s law, is
dF = to J , V- X (ds X £t) ;
4tt r2
where r is the distance from the current element to q' and rx is a unit
vector in that direction.
ular to (ds x rx), as in Fig. 9-1. Taking v' to be in the plane giving maxi¬
mum dF, we define the magnitude of the magnetic induction at q' as
dB = I ds * ri, (9-8)
4t r2 v
MO t / / ds X Ti
dFn = — la v = gV|dB|
4tt *
Direction of ds X and dB
As stated in Section 7-5, or as may be seen from Eq. (9-7), the mksc
unit for magnetic induction is the n-sec/coul-m, the n/amp-m, or the
kg/coul-sec. If we define
1 weber = 1 newton-meter-second/coulomb,
F = g IqW * / 0-9)
s
for the force on a charge q' moving with the velocity v', due to the circuit
containing the current I.
Since dB is proportional to (ds X ri)/r2, which, according to Eq. (9-9),
is integrable, we may integrate Eq. (9-8) and so get
» = g1 / S
(9“10)
for the magnetic induction due to a current circuit.
Substituting Eq. (9-10) into Eq. (9-9), we obtain
q'v' x B (9-11)
for the magnetic force on a moving charge due to the field B. This is also
referred to as the Lorentz force. Since this equation follows from our defini¬
tion of magnetic induction and is consistent with Ampere’s law, it may be,
and frequently is, taken as defining B.
Illustration 1. Field due to a linear current circuit. Consider a current I
in a long straight wire, the return portion of the circuit being so far away
that its effect can be neglected. Let us compute B at an external point P
whose perpendicular distance from the
axis of the wire is R. Referring to Fig.
9-2, in which 6 is the angle between
direction of the current and that of iq,
we see that
ds X ri ds sin 6
into the paper.
F = ev x B.
e v
m BR (9-13)
d¥ = ? W ds' x • (9-14)
47r J
S
r2
dF = V ds' X B, (9-15)
where B is the magnetic induction at ds' due to the circuit with the current
7. If more than one circuit contributes to the field at ds', we may sum
Eq. (9-15) for each.
Second, to find the force on the I' circuit between, say, the points a
and b, we integrate Eq. (9-15) with respect to s’:
F = 7
F _ no (2II'\
2 X 10~7 (9-17)
l 4tt \ R )
the force being attractive when the currents are in the same direction and
repulsive when the currents are opposed.
Equation (9-17) leads to a simplified definition of the ampere. We see
that when I = T = 1 amp and R = 1 meter, then the force per meter is
2 X 10 7 newton. We could use Eq. (9-17) to measure current, letting
I = I' and measuring R and the force on a movable section of one wire;
this would be an absolute determination of the current, i.e., it would be
independent of any electrical standards. While this experiment can be
performed, it is found in practice that it is more accurate to measure the
force between current-carrying coils, for example with the Kelvin current
balance, deriving the appropriate formula from Eq. (9-16). The instru¬
ments used for such measurements are called electrodynamometers.
Illustration £. Torque on a rectangular current circuit in a uniform mag¬
netic field. Consider a rectangular coil suspended so that its plane is parallel
to the magnetic induction B, as in Fig. 9-3. Let the length of the coil
perpendicular to the field be l and its width w, and suppose that there is
a current /' in the coil.
We use Eq. (9-16) to compute the force on each side of the coil. For the
top and bottom the force is zero, since ds' and B then are parallel and
ds X B = 0. For the side ab, which is perpendicular to B, the force per
turn will be /7£ normal to the plane of the coil. On the opposite side cd the
force will have the same magnitude, but the opposite direction. The
result is a torque L whose magnitude is given by
B • dX =» - ^ d!2. (9-21)
47T
field B is directed away from the loop; on the other side ft will be con¬
sidered to be negative.
To find the line integral of B around a closed path X we let dX be an
element of such a path* and, using Eq. (9-21), obtain
We see that the value of the line integral depends on the total change in
the solid angle ft. Two cases arise, as follows:
(a) The path of integration is not linked with the current circuit. Take,
for example, the path a shown in Fig. 9-5. If we go around this path, ft
alternately increases and decreases, the net change for the whole path be¬
ing zero. Hence
x
'\
- \
\
j Path a
:>{ \
1
q / Path b
(b) The path of integration links the current circuit. Let us follow the
path b in Fig. 9 o. Starting in, or just above, the plane of the current coil
we have initially ft = 2,r (half the solid angle about a point). As we move
up and away from the coil, ft decreases, becoming zero at the point Q.
Passing around to the negative side of the coil, ft decreases further be¬
coming negative. As we approach the plane of the coil and our starting
point, ft approaches the value -2r. Thus for the whole path the net change
m ft is —4tt. Hence &
Hn this and following chapters dX will refer to an element of any path whereas
ds will refer to an element of a conducting circuit. UP ’ wneieas
9-7] THE CIRCUITAL FORM OF AMPERE’S LAW 181
Should the path of integration link more than one current circuit, we would
find
This last statement is sometimes called the circuital form of Ampere’s law.
Take any surface S whose periphery is the path of integration X. Then
the sum of the currents linking the path is the total current passing through
S. In terms of the current density J (the current per unit area), this total
current through S is //s J ■ dS. Hence we obtain
(9-24)
x s
Just as Gauss’ law enabled us to obtain quickly the expression for the
electric intensity in problems involving a high degree of symmetry, so we
find that the circuital form of Ampere’s law offers, in some cases, the
easiest method of finding the expression for B.
Illustration 1. Field due to a linear current circuit. Let us return to the
problem of the very long straight circuit discussed as Illustration 1 of
Section 9-5. We wish to find B at the external point P, which is a distance
R from the circuit. In Eq. (9—23) we take for our path X a circle of radius
R passing through P and with center at the wire, as shown in Fig. 9-6.
From symmetry considerations B must have the same magnitude at all
points on this path. From Ampere’s law we know that B cannot be radial,
but must be circular about the wire, so that B and dX are parallel. Hence
2-kRB = mo 1}
N turns
Since the path A threads the current
N times, we have, from Eq. (9-23),
Bl = hqNI,
(9-25)
B =
B • dl = I (V X B) ■ dS,
(V x B) . dS = //MoJ . dS.
since both surface integrals refer to the same surface, we may equate the
integrands, and obtain
V X B = MoJ. (9-26)
<
point where V x Bis being evaluated), then
J = pv (9-27)
and
V x B = mo pv (9-28)
for a steady flow of charge.
Illustration 1. The magnetic field outside a linear current circuit. We shall
again consider the field around a long straight current-carrying wire. We
saw that outside the wire the field B is directed in circles around the wire.
This might lead one to conclude that the field has a curl, but we shall
show that V x B = 0 outside the wire.
Let the Z-axis be along the wire in the direction of the current, or out
from the page in Fig. 9-8.
From Eq. (9-12) we have for the field outside and at a distance r from
the axis of the wire
Since the field is directed circularly about the wire, as shown, at the point
P(x, y) we have
B* V
~ J
R_mol v_
B r - 2tt r2
By MO I
Bvj n o
B ~ 2tt r2
Bz = 0.
184 ampere’s law. steady magnetic fields [chap. 9
k (dBy _ dBx\
V x B
\dx dy /
dBy
dx
5 (
bx
dx '\X2 + V2)
)
1 - b
x2 -f- y2
2 bx2
{x2 + y2)2 J
dBx by \i . b 2 by2
* (
dy dy \kX2 + V2) X2 _|_ y2 {pc,2 + y2)2
Thus
The same conclusion follows from Eq. (9-26), since outside the conductor
J is everywhere zero.
We have here a field whose curl is zero at all points outside the con¬
ductor. In this region the field does not rotate or turn about itself (it
possesses no local eddies), but the field does circulate about the conductor.
The line integral of B around a closed path is ny0I, where n is the number
of times that the path passes around the conductor. The reason that the
line integral around any closed path is not zero is that V X B is not zero
everywhere, as we shall show in the next illustration.
Illustration 2. The magnetic field inside a linear current circuit*. Take
the same conductor as in the last illustration and call its radius a. Let
us assume that the current density J is constant throughout, so that
V X B = I.
7r a2 (9-29)
Thus within the conductor the field is rotational, i.e., it turns about itself
just as do two adjacent points on a rotating turntable.
Let us see how B depends on r in this case. Applying Eq. (9-23) to a
circular path of radius r (r < a), we have
•It is true that we shall not be computing the field in air or in vacuo, but this
makes no d.fference so long as the conductor is a nonmagnetic one like, say!
9-9] THE MAGNETIC SCALAR POTENTIAL 185
(9-30)
Thus inside the wire B increases with r, while outside B is inversely pro¬
portional to r. For a circular field whose magnitude is proportional to rn,
the curl is zero only when n = —1. Consider the present case where
n = +1. We have
as in Eq. (9-29).
Corresponding to the theorem that circular fields with magnitudes pro¬
portional to rn have zero curl only when n = — 1, there is the theorem
that radial fields with magnitudes proportional to rn have zero divergence
only when n = —2. For the electric field due to and outside of a charged
sphere |E| °c 1/r2 and so V-E = 0; but in general V-E = p/e0 and
V x B = n0J, according to the laws of our physical world.
B • dX = - ™ ■ dl,
or
B = - ^ Vfi = (9-31)
186 ampere’s law. steady magnetic fields [chap. 9
where
Vm = lT n (9-32)
Vm = m = £2 / (Acosd\
47T 4-7T \ r2 /
(9-33)
Be — — 1 dFTO _ hqIA sin 6
T SO 7*3
9-9] THE MAGNETIC SCALAR POTENTIAL 187
= fioI ds x rx
47r r2
This equation tells us that lines representing the direction of the field dB
are circles about the axis of the current element. Such lines, then, form
closed curves. These lines do not start at (diverge from) any point, nor
do they stop at (converge toward) any point. Thus the field dB is source-
free or solenoidal, that is, its divergence is zero. Furthermore, the sum of
any number of such fields will be solenoidal. We shall now prove mathe¬
matically that V • B = 0.
Let a = (uq/4ltt)(I ds) and b = rj/r2. Then dB = a x b. For any
two vectors such as a and b, we may write
V • (a x b) = b ■ (V x a) — a • (V x b). (9-34)
V x b = 0.
Thus the right side of Eq. (9-34) is zero, and since a X b = dB, we
have V • dB = 0. Since
V • B = 0. (9-35)
B = V X A, (9-36)
where A is called the vector potential. We shall now look for the expression
for A at the point P, given the current circuit(s) producing the field B.
First we note that if ds has the components dsx, dsy, and dsz, and
r = (x2 + ij2 + z2)q then
ds x rx
^2
b = s7/v><(t)'
S
circuit are independent of each other, the curl can be taken outside of the
integral sign. We then have
B = V x A,
where
(9-37)
since the curl of the gradient is always zero. Therefore we must not
consider A to be completely determined by Eq. (9-37). A somewhat
analogous situation is met in electrostatics, where we found that we could
express the electric intensity as E = — VV, where V is the electrostatic
potential. While the potential due to a point charge is given by
V = g/4xe0r, it is still possible to add a numerical constant, say C, to V,
since V( V T~ C) = —V V = E. This constant is determined by the
boundary conditions for a given problem.
The vector potential may also be expressed in terms of the current
density J, or in terms of the volume density p and velocity v of the moving
free charges producing the magnetic field at P. Suppose that the current
element I ds has a cross section A and a volume dr = Ads. Then
I ds = JA ds = JA ds = J dr,
A = (9-38)
where the volume integral is taken over all regions containing a current
density. Since J = pv, we also may write
A =
(9-39)
While p and v may vary throughout space, we must make the restriction
that they are independent of time, so that we have a steady field. In the
case of a single charge, the field that it produces at a point P will vary as
9-11] THE VECTOR POTENTIAL 191
the charge moves along; such a field will not be steady. Our expressions
will give the instantaneous field at P to a good approximation if v « c,
where c is the speed of light.
Illustration 1. Vector potential for a straight section of a current circuit.
Referring to Fig. 9-11, in which the Z-axis is taken to coincide with the
part of a current circuit giving a field at P(0, y, 0), we see that here
ds will always be in the Z-direction. Then Ax = Av = 0, ds — dz,
and r = V22 + y2- Consider the contribution to Az at P due to that
segment of wire extending from z — zx to z = z2, namely,
rz2
A = I dz
4tt JZl Vz2 + y2
Let us try to find the field at P due to this straight segment of a current
circuit, using B = V x A, Ax = Ay = 0, and Eq. (9-40) for Az. Since
A depends on y only, we have
B = V x A
y/Vif + y2 y/V z2 + y 2
dAz _ no I • (9-41)
BX
dy 4tt 22 + Vz2 + y2 zl + Vz2 + y2
192 ampere's law. steady magnetic fields [chap. 9
Lim _ZJ/_
Zl~>— oo (zi + Vzf -f y2)Vz\ + y2
= Lim
zi- zx(zf + y2)1'2 + z\ + y2
= Lim _zfitt_ —y
—oo
z1V^1[r+ Mv2/zj) + • • •] y2 —\y2 + y2
2
y
^ __ Mo gy
4ltv r (9-42)
4tt \ r)
TTVV__VXrl
v x r ~ r2 ’ Fig- 9-12. Slowly moving charge.
where rx is a unit vector from the charge to the point P where we are getting
the field. Thus
Moq v x rx
4tt f2 (9-43)
PROBLEMS 193
Problems
center, then find B = —VFm. Check distant R\ from one wire and R2 from
against the answer to problem 7. the other.
20. Show that at a point outside a 22. Find the vector potential A, and
current circuit the vector potential A from it the components of B, at the
satisfies Laplace’s equation. point P(x, y, z) for a small square cir¬
21. Two long straight parallel wires cuit of side 2a and current I, taking the
carry a current I in opposite directions. circuit to be in the XF-plane, with
Find the vector potential A at a point center at the origin, and a <3' r.
CHAPTER 10
10-1 Definition of electromotive force. Let us call the electric fields due
to stationary charges electrostatic fields. We saw in Chapter 8 that if we
imagine an agent to take a small test charge around any closed path X in
such a field, then the total work done by the field on the charge will be zero,
196
I
10-1] DEFINITION OF ELECTROMOTIVE FORCE 197
since
/
x
Es • dT. = 0, (10-1)
(10-2)
This represents the work per charge done by the field; 8 is expressed in
volts.
In general, the electric field E may be partly electrostatic and partly not.
If
E = Es + E',
8 = f E • ds (10-3)
F = qvB (10-4)
E = — = vB in the Z-direction.
q
This effective field exists only in the moving arm ab of length l. From Eq.
(10-3), we have for the emf around the circuit
g = f E • ds = vlB, (10-5)
Since vl is the area swept out per unit time, which we shall call dA/dt, we
may write
(10-6)
Note that if the entire circuit were moving in the F-direction with the
speed v, the galvanometer arm cd would experience an effective E from c
toward d just equal to that in ab and, as a result, the S for the circuit
would be zero. The galvanometer G would then show no deflection
Therefore what we detect in the case where only the arm ab moves is the
relative motion of ab with respect to the rest of the circuit.
Figure 10-2 shows the same circuit as in Fig. 10-1; as before, ab is the
moving arm. The induced current I, which is in the counterclockwise
10-2] MOTIONAL ELECTROMOTIVE FORCE 199
*
sense, is also indicated. Since 8 is the work per unit charge done by the
field as charge flows around the circuit, the time rate of doing work by the
field, or the power P, is
P = 8 % = 8/. (10-7)
at
Since this work is expended only when the conductor is moving across the
magnetic field, the source of the energy must be the agent moving the
wire, as we shall now show.
From Chapter 9, we know that a conductor of length l carrying a cur¬
rent I across a uniform magnetic field B experiences a force which, to
distinguish it from the force F in Eq. (10-4), we shall call F'. From Eq.
(9-16), we have
F' = I ds x B = IIB
J a
Equating the two expressions for P, Eqs. (10-7) and (10-8), we find again
which shows that the derivation of this equation based on Ampere’s law
and the concept of an electric field existing in a conductor that is moving
across a magnetic field are consistent with the conservation of energy
principle. This, then, is the theory of the electric generator, through which
mechanical energy is converted into electrical energy.
200 faraday’s law. electromagnetic induction [chap. 10
F = qv x B, (10-9)
so that
E = v x B (10-10)
8 = J'E • ds,
8 = fv x B • ds = <£ ds • v x B.
ds'
v =
dt
where ds' is the displacement of the element of length ds in the time dt.
We then have
ds x ds'
-I dt
B. (10-12)
10-3] MAGNETIC FLUX 201
Since the area of a parallelogram is the cross product of the base times the
side, the surface dS' swept out by the element ds in the time dt is dS' =
ds x ds'.* Therefore we finally get
8 = (10-13)
8 = (10-13)
From the definition of <h in Eq. (10-14), the flux threading a circuit is given
*In a case such as the Faraday disk generator (problem 3 at end of chapter),
where the circuit has a moving arm and yet a fixed area, dS' does not refer to
area added to, or subtracted from, that of the circuit.
202 faraday’s law. electromagnetic induction [chap. 10
(10-15)
dt
where dS/dt represents the time rate of change of the surface element dS.
However, dS'/dt is the area swept out per unit time by an element ds of
the periphery. To see how dS/dt and dS'/dt are related, let us consider the
situation shown in Fig. 10-4. According to Section 10-1, 8 is counterclock¬
wise. Let us take this as the 'positive sense for integration around the circuit.
It is convenient to relate this positive sense around the circuit to a positive
direction through the circuit. Since we are using only right-handed axes
and rules, we shall take the positive direction for S and dS to be that of
the advance of a right-handed screw rotating in the positive sense. If we
apply this rule to Fig. 10-4, we see that the positive direction for S is that
of B, out from the page. Motion of the moving element ds is such that
dS'
—rr = ds X v
dt
dS
dt dt
for the motional emf. The induced emf is equal to minus the time rate of
change of magnetic flux through the circuit. ,
The example just discussed covers most cases involving motional emf.
The Faraday disk generator, referred to in the footnote at the end of the
last section (see also problem 3 of this chapter), belongs in a little different
category, one involving accelerated (rotational) motion of a conductor;
foi such cases Eq. (10—16) is still valid provided d^/dt is interpreted as the
time late of cutting flux. If accelerated motion is not involved, we must
speak only of the motion of one part of the circuit relative to another
part and not of motion relative to the magnetic field, so that the expression
“cutting flux” has no meaning for unaccelerated motion.
10-4] Faraday’s law 203
__M> _ _ NAB
8 ~ ~ At ~ At
Note that NA is the effective area of the whole circuit comprised by the
coil. Suppose that N = 200, A = 10 cm2 - 1(T3 m2, B = 0.2 w/m2,
and At = 0.1 sec. Then A4> = 4 X 10~2 weber and the average induced
emf would be 0.4 volt. This numerical example brings out an interesting
point, namely, that no new quantity is introduced and defined by Faraday’s
law. While constants of proportionality Cx and C2 were necessary when
stating Ampere’s and Coulomb’s laws, no such constant appears in Fara¬
day’s law. This is because Coulomb’s law was our introduction to a new
branch of physics (electricity) and Ampere’s law served as our starting
point in magnetism, while Faraday’s law states an interconnection between
magnetism and electricity.
In this experiment we usually measure the total charge q passing a point
in the circuit while the coil is turned, rather than S. This may be done
by connecting the coil to a calibrated ballistic galvanometer.* A ballistic
Now assume that the current in the circuit is proportional to the emf
(Ohm’s law), or
7 = I’
where R is a constant called the resistance of the circuit. We shall see (Chap¬
ter 12) that this assumption holds for metallic circuits. Since I = dq/dt,
we have
T. dq d<f>
K -r- = -— •
dt dt
We may integrate each side of this equation with respect to the time, for
the interval during which the coil is turned. Then
_ [j NAB
q~ Jdl = = --R-- (10-20)
(10-21)
s
Note that we write dB/dt rather than dB/dt because B is a function of the
coordinates as well as the time, and we want here only its variation with
time at a fixed point, not its variation due to moving the circuit to where
B has a different value.* We shall henceforth use the dot notation for par¬
tial time derivatives, writing Eq. (10-21) as
8 = — B-dS. (10-22)
To Faraday the emf given by Eq. (10-22) and the motional emf of Eq.
(10-6) were both illustrations of the same physical law, namely, that an
emf is present in a circuit whenever the magnetic flux linking the circuit is
changing. This approach has the advantage of simplification, one of the
aims of theoretical physics, but at the same time it obscures an important
point. In the case of a steady magnetic field (B = 0), an emf can exist
only if a moving conductor is present. If the conductor is removed, does
the emf exist? The answer is no; the nonelectrostatic field in the moving
conductor can be said to exist only when there are moving charges present
on which the magnetic field can exert a Lorentz force. Motional emf was
attributed to an effective electric field, but the existence of such a field
depends on the relative motion between the conductor and the observer.
Such is not the case for the nonelectrostatic field that gives rise to the emf
in Eq. (10-22), for a changing magnetic field gives rise to a nonconservative
electric field, whether or not a conductor is present. This important point
was first recognized by Maxwell. It is here that Faraday s law makes its
contribution to the theory of electromagnetic fields.
So far in this chapter we have con¬
sidered 8 to be the line integral of
the electric field around a closed
conducting path, or circuit, s. If
the emf is due to a changing mag¬
netic field, as in Eq. (10-22), we
may replace the path s around the
circuit by a path X fixed in space, js increasing
as in Fig. lO-o. j?lG ^q_5 gmf arounq fixed path.
S >*
/
x
E • dl (10-23)
is the emf around any path X. We now combine Eqs. (10-22) and (10-23)
to obtain
(10-24)
X s
eS __ e d&
F =
2irR 2tvR dt
F = it(mv)-
so that
d , s e d<t>
- (m«) _ ^ ^
d(mv) = d<f>.
2irR
The condition that the electrons keep in the circular orbit of radius R
is (Eq. 9-13)
mv
eB =
~R
d(mv)
e dB = d(mv) = eR dB.
R
= eR dB,
ZttR
which, when integrated between the limits zero and <t>, and zero and BR,
respectively, gives us, for the flux out to r = R,
<!> = 2itR2Br.
Note that if the field had everywhere the same value BR that it has at
r — the flux through the orbit would be only irR2BK. Hence the field
must be some function B(r) of the distance r from the axis of the tube.
We have
rR
B(r) = 7 Br-
E • dl = - 11 B • dS. (10-24)
X s
When we apply Stokes’ theorem in vector analysis to the left side of this
equation, we get
(V X E) • dS = — JJ B ■ dS.
s s
Since both integrals refer to the same arbitrary surface S, we may equate
the integrands, obtaining
V x E = —B. (10-25)
10-7 Relation between E and the vector potential. We saw that we may
put
B = V x A,
d
B - (V x A) = V x A, (10-26)
dt
or
V x (E + A) = 0. (10-27)
E + A = —V*. (10-28)
8
M = — (10-30)
dli/dt ’
M *—■ = — •
dt dt
Integrating with respect to the time between the limits 0 and and 0
and <F, respectively, we get
MI i = (10-31)
10-8] MUTUAL INDUCTANCE AND SELF-INDUCTANCE 211
Therefore the mutual inductance between the path X and circuit 1 is also
the magnetic flux through the path per unit current in circuit 1.
Usually the path X is taken to coincide with another conducting circuit,
which we shall call circuit 2; then the S in Eq. (10-30) is the induced
emf in circuit 2.
As a special case, we may let the path X coincide with circuit 1, in which
event <t> is the flux per unit current through circuit 1 due to its own current
and M is then called the coefficient of self-inductance, usually designated
by the symbol L. In this case S is the emf induced in circuit 1 as its own
current changes. Calling the induced emf Si, we have
Si
L (10-32)
dli/dt
The name of this unit honors an American physicist, Joseph Henry, who,
while a schoolteacher at Albany Academy, discovered mutual inductance
independently of Faraday.
While M is usually calculated from Eq. (10-31) (in which $ is found from
Ampere’s law), we shall now derive a general expression which will show
that M depends only on the positions of two circuits and that it does not
matter which we call circuit 1 and which circuit 2. Let us recall that
B = V X A (see Eq. 9—37), and substitute for B in Eq. (10—22).
212 faraday’s law. electromagnetic induction [chap. 10
We find that
8 = — V X A • dS.
8 A • dX, (10-33)
where
Mo m 11 ds
A =
4ir J r
and
Mo d/1 ds
(10-34)
47r df r
g _ _ Mo d/i i ds • d7> 1
4?r di J J r
X s
^ = (10-35)
For self-inductance, X and s are the same. While Eq. (10-35) then
appears to be an improper integral (since r = 0 when ds and d\ coincide),
a finite value for L results, except for a circuit of infinitely thin wire (a
physical impossibility).
Illustration. Toroidal solenoid with two windings. Consider the thin
air-core ring solenoid shown in Fig. 10-8. Let R be its mean radius, Ni
the number of turns in one winding, N2 the number of turns in the other
winding, and A the cross-sectional area within either winding. We assume
that each winding is uniform and thin (compared with R).
d _ MqNiIi vqN\Ii
2tvR ~ l ’
where l is the mean circumference. As a result, the flux per turn of the
second winding will be BA, and the total flux threading, or passing through,
circuit 2, which comprises the second winding, will be
$ - N2BA = ^ N1N2AI1.
M = ^ NxN2A. (10-36)
To find the self-inductance of one winding, say the first, we simply replace
N2 hy N i and obtain
Li = ff NjA. (10-37)
Similarly,
L2 = N\A. (10-38)
214 faraday’s law. electromagnetic induction [chap. 10
We may write these expressions in terms of the turns per unit length of
circumference for each winding. If we put
Ni N2
ni n2 =
l
we get
M Li = n0nlA. (10-39)
H0nln2-A, —r — noniA,
l l
M = V%L~2.
In general, when the windings are not thin and directly on top of one
another, M will be smaller, or M < \ZL\L2. The ratio M/\/LiL2 is
called the coefficient of coupling; its value must lie between 0 and 1.
Example. To help grasp the magnitude of the henry unit, take the following
numerical case. For a ring solenoid let IVi = 1000, N2 = 400, l = 40 cm, A =
3 cm2. We then find from Eq. (10-36) that
where the positive sense is the direction of the current. Using Eq. (10-32)
we have
10-9] ENERGY STORED IN A MAGNETIC FIELD 215
W = \L12. (10-40)
L = y N2A.
w =.|| n2ai2.
The magnetic field exists only inside the winding; in this region of volume
lA, where A is the cross section, the magnetic induction is nearly uniform
NI
B = ho -j- ■
R1 2 3/ A
W = • (10-41)
2mo
If we assume a uniform field within the winding and postulate that the
energy W is distributed evenly throughout the volume IA, we have
1 B2
Energy density of magnetic field = - —- • (10-42)
2 no
Problems
Maxwell correlated the expressions for electric and magnetic fields based
on the laws of Coulomb, Ampere, and Faraday. He found that, in general,
these expressions were not compatible with the equation expressing the
continuity principle that electric charge can neither be created nor de¬
stroyed. He saw that the field equations were consistent with the equation
of continuity for a steady state, but not when the charge density at any
point was changing with time. Maxwell found that to remove this incon¬
sistency in the case of a nonsteady state it was only necessary to add to
the equation for V X B another term, one involving the time rate of change
of the electric field E. With the addition of this new term, Maxwell’s
field equations became a concise summary of the laws of electromagnetism.
In the present chapter we shall consider Maxwell’s equations as they
apply to fields due to charges located in vacuo and (to a good approxima¬
tion) in air. These equations may be modified to include fields in media
with special electric or magnetic properties; in Chapter 12 we shall discuss
the macroscopic or nonquantum effects of such media. The great impor¬
tance of Maxwell’s work, however, was that it led him to the development
of the electromagnetic theory of light. We shall conclude this chapter
with the derivation, from Maxwell’s equations, of the wave equations for
fields in empty space.
11-1 The field equations based on the laws of Coulomb, Ampere, and
Faraday. Let us summarize the expressions which we have developed for
the divergence and curl of E and B.
From Coulomb’s law (Chapter 8), we found that
(11-1)
V x E = 0 (11-2)
for an electrostatic field (see Eqs. 8-13 and 8-14). We should realize that
these equations for a static field may not be sufficiently general to hold
true for fields varying with time. In fact, we found from Faraday’s law
that V x E is not zero at a point where the magnetic induction B is chang¬
ing with time, for the correct relation then (see Eq. 10-25) is
V x E = —B. (11-3)
218
11-1] maxwell’s field equations 219
V-B = 0 (11-4)
V•V X E = —V -B.
The left side of this equation vanishes because the divergence of the curl
of any vector is zero, and on the right side we may transpose the order of
the space and time differentiation. We thus find that
— (V • B) = 0, (H-5)
dt v
and since V • B is zero when the currents are steady, it must remain zero
even if the currents are not steady at a later time. All that has been said
about conduction currents applies as well to the so-called displacement
currents, to which we shall come presently. Equation (11-4) is therefore
true generally.
The expression for the curl of B derived from Ampere s law was
V x B = moJ (H-6)
(see Eq. 9-26), but we must remember that the magnetic field referred to
in this equation was assumed to be due to steady currents. We saw that
an electric field may originate from (1) static charges, and (2) changing
220 maxwell’s postulate [chap. 11
magnetic fields. Does this not suggest that there may be two possible
origins for a magnetic field, say (1) steady currents, and (2) changing
electric fields? We shall see that Maxwell answered this question affirma¬
tively. However, for the present we shall simply state that of the four
equations,
V*E = p/e0, V x E = —B,
(11-7)
V-B = 0, V x B = moJ,
the first three may be considered to hold for either a steady or a nonsteady
state, but that the last equation has been proved only for a steady state.
_ S
The rate of change of charge within the region r is the volume integral of
the rate of change of the charge density. From the principle of conserva¬
tion of charge this volume integral must also equal dq/dt, so that
dq
dt
Let us equate these two expressions for dq/dt and use the divergence
theorem of Gauss to transform the surface integral into a volume integral
over the same region r. We then have
-//^•dS = -///*•(*)*-///£*•
s
Since both volume integrals are taken throughout the same arbitrary
volume, we may equate integrands to obtain
_ V.(pv)=§J, (11-8)
V • V X B = moV • J = 0.
V • (pv) = 0.
If we combine this result with Eq. (11-8), we are led to the conclusion that
This, of course, is a true statement for a steady state, and it was for such a
state that Eq. (11-6) was derived. However, we know that, in general, p
may vary with time; hence for such cases Eq. (11-6) cannot be valid.
Maxwell first investigated mathematically how one could alter Eq.
(11-6) so as to make it consistent with the equation of continuity. Taking
the time derivative of Eq. (11-1), we have
222 maxwell’s postulate [chap. 11
Let us introduce a new field vector D which, for air or vacuum, is defined
as
D = e0E, (11-10)
~ = V • D. (11-11)
V • (pv) = V- J = -V D,
or
V ■ (J + D) = 0. (11-12)
Since the divergence of the quantity (J -f D) is always zero, this quantity can
(see Vector Analysis) be put equal to the curl of another vector. Although
Eq. (11-12) alone does not completely define this other vector, in choosing it
Maxwell was guided by the knowledge that the resulting expression must
reduce to Eq. (11-6) for a steady state. Hence he assumed that Eq. (11-6)
must be replaced by the equation
J + D = v X (11-13)
13
H = (11-14)
MO
with his particular postulate was Maxwell able to, derive his theory of
electromagnetic waves. We may consider the experimental observation
of such waves, with the properties predicted, as the experimental basis
for Maxwell’s postulate. Furthermore, we shall show that this postulate
has a reasonable physical interpretation.
JJ(V
s
X H) • dS = JJ
s
(J + D) • dS.
^
x
H ■ dl = JJ
s
(J + D) • dS. (11-16)
mmf = JJ s
(J D) • dS. (11-18)
Equations (11-16) and (11-18) tell us that there are two ways of producing
a magnetic intensity, one with an ordinary current of density J, as observed
by Oersted and postulated by Ampere, and the other by means of a chang¬
ing electric displacement, as postulated by Maxwell. Since D 01 E for
air or a vacuum, we may say that a changing electric field results in a mag¬
netic field. This is the converse of Faraday’s discovery that a changing
magnetic field produces an electric field.
If we take the case where J = 0 everywhere, Eq. (11-16) becomes
Example. Two parallel plates are 1 cm apart. With air between the plates,
the maximum possible intensity is found to be 3 X 106 n/coul or volts/m; this is
called the dielectric strength of air, or the value of E at which the medium becomes
conducting. We may, then, let the p.d. between our plates increase to 30,000
volts. Suppose that this is accomplished in 1 sec. Then E = 3 X 106 n/coul-sec
and
Compare this with possible values for the J term in Eq. (11-15). In a copper
wire of 1 mm2 = 10-6 m2 cross section a current of 10 amp would not be
excessive, giving a value for J of 107 amp/m2, many times greater than the value
computed for D above.
has but one term? The answer is that the term missing in Eq. (11-20)
involves a current density of “magnetic current,” or a flow of magnetic
poles of one sign, and because, as has been pointed out, isolated poles of
one sign and “magnetic currents” due to them are not found in our physical
world, the term analogous to J in Eq. (11-18) and the converse of Ampere’s
law do not exist. We must face the fact that the fundamental role of
electric charges leads to a certain lack of symmetry in our equations.
11-5] maxwell’s displacement current 225
1 -Q--
Switch
I =
I — A cq E,
Thus the value of D between the plates is equal to the current per unit
area of the positive plate. Maxwell pointed out that if one pictured a
current density D between the plates, then, since AD = I, one could
think of the current / as being continuous all the way around the circuit,
including the gap between the plates. With D representing a current density
between the plates, we may then attribute the resulting magnetic field to
maxwell’s postulate [chap. 11
226
V D = P, V X E = —B,
(H-21)
V • B = 0, V X H = J + D,
11-7 The wave equations. To obtain the wave equation for one of the
field vectors we must obtain an equation containing that vector only. Let
us take as our example the wave equation for E in empty space. In Eqs.
(11-21) we set p and J equal to zero, so that Maxwell’s equations take the
symmetric form pSj
V - D = 0, V X E = —B,
(11-22)
V • B = 0, V X H = D.
V X (V x E) = -V x B.
V • D = V • (e0E) = e0V • E = 0,
V X H = V X — = — V X B = D = e0E,
Mo MO
or
V X B = CgMO^-
d2E
V X B — £0MoE — eoMo
dt2 ’
V • VE = e0MoE, (11-24)
the wave equation for E, as can be seen by comparing this equation with
the three-dimensional wave equations in the theory of elasticity,* where
it is shown that the ratio of the coefficient of the V • V term to that of the
d2/dt2 term is (at least for plane waves) equal to the square of the speed.
We conclude that the electric field E is propagated in infinite empty space
with a speed c which is given by
c=-4=- (H-25)
V £0M0
1
eo —
coul2/n-m2.
4tt X 9 X 109
dEx BE^ BE z _
Bx By ' Bz
For the plane waves we are considering, BEy/By and BEZ/Bz are zero,
hence BEX/Bx is also zero. This means that Ex does not vary with x,
as it would for a longitudinal wave in the X-direction. Here Ev and Ez
may depend on x, but they are components of E perpendicular to the
direction of propagation. A similar argument follows for B from the
equation V • B = 0. Hence plane electromagnetic waves are transverse.
The similarity between the predicted properties of electromagnetic
waves and those of light led Maxwell to postulate that light and electro¬
magnetic waves are one and the same thing.
Hertz, about fifteen years after Maxwell published his theory, first
produced electromagnetic waves experimentally; such waves have since
been found to have all the properties predicted by Maxwell. The micro-
waves used in radar are a good example of this type of wave.
BE z
—Bx, (a)
II
1
£
By
BEX BE z
By, (b) (11-27)
Bz Bx
BEy dEx
—Bz. (c)
Bx By
11-8] RELATION BETWEEN E AND H IN A PLANE WAVE 229
Since Ex and Ez are zero, and, Ey can vary only with x and t for plane
waves in the X-direction, we here have Bx = By — 0. This means that
the only component of B that varies with time, and hence is being propa¬
gated as part of the wave, is Bz. Therefore, if the electric vector is in the
F-direction, then the magnetic vector is in the Z-direction. E and B
(or H) are mutually perpendicular and E X B, or E x H, is in the direc¬
tion of propagation of the wave (see Fig. 11-2).
/B = moH
/
Fig. 11-2. Propagation of plane electromagnetic wave.
dH z _ dHy
= Dx, (a)
dy dz
dHx dH 2
= Dy, (b) (11-28)
dz dx
dHy dHx
= Dz- (c)
dx dy
dHz _ dEy
dx €° dt
dEy dH z
dx Mo dt
230 maxwell’s postulate [chap. 11
For plane waves traveling in the -j-X-direction with the speed c, both Ey
and Hz must be functions of (x — ct).* Since
dH g dEy dEy dH z
dx e°C dx ~dx~ ~ M°C ~dx '
These integrate to
Hz = e0CEy, Ey = HocH z.
€o |E |2 = M0|H|2. (11-30)
V • VE = ^ E. (11-31)
c2
a) = 2irv = 27r/period
We see that this satisfies the condition that, at any point, E must repeat
itself every time t increases by 2tt/w, or for each period of the vibration.
This assumes that we are dealing with harmonic waves; but any waveform
that repeats itself with a definite frequency may, by Fourier analysis, be
considered as a sum of harmonic waves.
This equation involves the coordinates only, yet its solution may be very
difficult. For one thing, while V • V4> (where $ is a scalar function) is
simply the divergence of the gradient of 4>, we cannot regard V • VE in
the same way, because E is a vector and VE is a dyadic, or tensor. However,
in cartesian coordinates V • VE = d2E/dx2 + d2E/dy2 + d2E/dz2. Equa¬
tion (11-33) is sometimes called the Helmholtz equation, or the time-
independent wave equation.
If the direction of wave propagation is taken to be the Z-direction, then
it is best to go back to the form of the wave equation containing the time
explicitly and to build the solution as the product of a function of x and y
times a function of (oit — yz), where y is called the propagation constant.
Thus we might write for the X-component of E,
In general, however, y does not equal w/c (that is, co/y ^ c), as the
following illustration will show, yet 2ir/y will always represent the wave¬
length X. Since v = w/2-k, the phase velocity \v will equal co/y; thus the
phase velocity is not always equal to c in space free of charge, although it
does equal c for electromagnetic waves in infinite empty space (without
walls).
Illustration. Rectangular wave¬
guide. Consider the hollow conduc¬
tor or waveguide of rectangular cross
section shown in Fig. 11-3. The
waveguide extends in the Z-direc¬
tion, has a width a in the X-direc-
tion and a height b in the F-di-
rection. We shall show that waves
of wavelength shorter than a cer¬
tain limit may be propagated down
the length of the pipe, i.e., in the
Z-direction.
232 maxwell’s postulate [chap. 11
and similar equations for Ey and Ez. From what has been said, we may
assume that the solutions for Ex, Ey, and Ez will each contain the factor
e~Uat—ye) muitiplied by some function f(x, y). In finding the form of the
latter, we may turn back to the solution of the equation
d2V d2V
dx2 dy2
Ez = 0,
k\ + k\ + t2 = (11-37)
0
where X0 is the wavelength in empty space (with no walls) of waves with
the frequency u/2t. Second, these solutions satisfy the boundary condi¬
tions
Ex = 0 at y = 0, Ey = 0 at x = 0,
Ex = 0 at y = b, Ev = 0 at x — a,
11-10] RELATION BETWEEN VECTOR AND SCALAR POTENTIALS 233
and ’ (11-38)
a b
where m and n are integers.
From Eqs. (11-37) and (11-38) we see that we must have
O
If 7 is imaginary, the eiyz factor will either increase or decrease indefinitely
with z, depending on the sign of y. E cannot increase indefinitely, since
that would involve a corresponding increase in energy. A decrease in E
with z involves attenuation of the waVe. Hence only those waves will be
propagated in the waveguide for which
iHs2+©2-
The largest value that X0 may have is called the cutoff wavelength. For
a > b, this is found by taking m = 1 and n = 0, giving 2a as the cutoff
wavelength.
We have been considering here only waves for which Ez = 0, that is,
transverse electric waves; the various integral values of m and n give us the
so-called TETOn modes for our waveguide. Similarly, we could have taken
Hz — 0 and obtained the transverse magnetic, or TMmre, modes. Further
discussion of waveguides and modes is left to a more specialized treatment
of this subject. However, the presentation just given has shown that with
a waveguide of laboratory dimensions, say 0.5 cm in width, wavelengths
longer than 1 cm cannot be transmitted without attenuation, while some¬
what shorter waves may be propagated if they properly “fit” the wave¬
guide; very much shorter waves, such as those in the optical region, are
easily transmitted with little effect from the walls.
V X (V X A) = MoJ - + A).
We have seen that for empty space the field vectors satisfy a wave equation
such as Eq. (11-24), which means that when p = J = 0 the potentials
from which the field vectors may be derived must also satisfy a wave
equation with the same numerical coefficients. This is possible if
V-A=-e0Mo4r, (11-43)
or
V • A = ~ eoMo'f’-
V • V4 = e0Mo'E
V • VA = cqmoA,
V • V4 = eoPoE — — ? (11-44)
eo
'P q 1
~ 4tT€0 (z2 + y2 + Z2)1/2 ’
• ...
d'k ... d* ...
* “ & w + ai (y) + ^ (2)
q xx + yy + zz
4reo (x2 + y2 + *2)3/2 ‘
A mo _SY_
4-7T (x2 + y2 + z2)l>2 '
so that
V •A =
Mo£
4t G
dx \(a;2 + y2 + z2)112
A ( -y
dy \(x2 + y2 + z2)112.
i(_=i_)
dz \(x2 + y2 + z2)112/
_ mo? xx + yy + zz
— 4tt (x2 + y2 + z2)3'2
and v must be the velocity and r the distance from the charge q to P
at a previous instant, earlier by r/c than the time under consideration. Of
course, r/c varies with the distance from P to each respec tive charge. When
the potentials are computed on this basis they are called retarded 'potentials.
H • (V X E) — E • (V X H) = V • (E X H).
V • (E x H) = —moH • H — e0E • E — E • J,
or
jJJv.(E X K)dr = -
T
~ JJJ
T
\ (e0£2 + y0H2)dr
- JJJe
r
• J dr. (11-46)
Applying the divergence theorem to the left side, this expression becomes
JJ
s
(E X H) • dS = -|///
T
l (e0E2 + n0H2) dr - JJJE-Jdr,
T
(11-47)
where S is the surface surrounding r.
First consider a volume r for which E X H = 0 everywhere on its
surface, so that we have
is the amount of energy per unit area and per unit time crossing a surface
whose normal is in the direction of II. (n is known as the Poynting vector.)
Equation (11-45) is the equation of, continuity for the flow and con¬
servation of electromagnetic energy. It is known as Poynting’s theorem.
We see that for energy to be propagated in an electromagnetic field we
must have both an electric and a magnetic field present, and these fields
must not be parallel. These conditions were satisfied in the case of plane
waves, for which we showed that E X H is a vector in the direction of
wave propagation. A static charge, however, does not produce a magnetic
field, and so in electrostatics there is no Poynting vector. For a charge
moving with constant velocity, the electric and magnetic fields move with
the charge, and the total energy stored in the field remains the same; at a
point in the neighborhood of the charge and not in the direction of motion,
E X H is not zero, but the flow of energy keeps pace with the motion of
the charge and no energy is radiated out into space. Only when a charge
is accelerated does it radiate energy (see Section 11-13).
maxwell's postulate [chap. 11
238
n = EH = J— E2,
\ MO
or since c = l/VeoMo>
n= c6qE2.
V
E =
7 ’
in the direction of the current. We shall show in the next chapter that the
tangential component of E is the same on each side of an uncharged surface,
so that just outside our wire E is the same as it is inside. We saw that the
magnetic induction outside the conductor is directed in circles, according
to the right-hand fist rule of Section 9-5. Just outside the surface of the
wire,
MO I_ I
B = H =
2-k R 2tR
in the tangential direction. Just outside the wire E and H are perpendicular
and
VI
n = E x H =
2-71-Rl
into the wire. This is the rate of flow of energy per unit area. For the
length l the surface area is 2tRI, so that the rate at which energy enters
this section is just VI. But VI is also the rate at which energy is being
11-13] RADIATION FROM AN ACCELERATED CHARGE 239
taken from the source of emf ih the circuit and is being used to drive the
current through the length l of the wire. According to the Poynting view¬
point, this energy, which goes into raising the temperature of the system,
is supplied by the field outside the wire. However, in the source of emf,
say a battery, I and H have the same relative directions, while the direction
of E is reversed, so that n is outward. So, if we wish, we may picture energy
as flowing from the battery out into the field and then back into the wire.
Perhaps in this example a viewpoint built around the analogy between
the electrical circuit and a water system with a pump would have more
physical appeal, but there are other problems, involving electromagnetic
waves and radiation, in which the Poynting vector is a useful concept, as
the next section will illustrate.
Thus the line of force must have a kink LP, as shown. While LP may be
curved, it will be so short that we may treat it as a line segment, if At <$C t.
Therefore we see that Q2L = ct and OP = c(t + At), and Q2LPM is a
line of force.
maxwell’s postulate [chap. 11
240
qa sin 8
H = (11-52)
47r rc
for the radiated energy per unit area and per unit time.
To find the total rate of radiation of the charge during its acceleration,
we must multiply Eq. (11-53) by the area 2xr2 sin 8 dd (the area of the
zone between 6 and 6 -(- dd on a sphere of radius r), and then we must
integrate with respect to 6. Calling this total rate of radiation dW/dt,
we get
PROBLEMS 241
<
2 2
dW gV / ( sin3 ddd
q a
(11-54)
dt Sire0c3 70 6xeoC3
We see from this formula that the rate of radiation by an accelerated charge
increases with the square of its acceleration.
When electrons are stopped at the target of an x-ray tube a rapid de¬
celeration occurs and a part of the kinetic energy of the electron is trans¬
ferred to radiant energy, the rest being added to the internal thermal
energy of the target. (The correct quantitative explanation of this effect
requires the application of quantum mechanics.)
Another example of radiation by an accelerated charge is an oscillating
electric dipole, or an antenna in which the polarity of charge at the two
ends is opposite and continually reversing.
Problems
1. Derive the equation of continuity units, (a) Find the mksc unit for cr.
for the flow of (a) a fluid, (b) heat. (b) If an alternating electric field
Define all terms and state postulates represented by Eoeiat exists within the
assumed. copper, find the ratio of the conduction
2. Derive the wave equation for H to the displacement current density at
in empty space. a frequency of 1 megacycle/sec.
(Ik) (a) Derive the wave equation for 8. For a plane wave in free space,
E in a medium where p = 0, but show that the energy crossing unit area
J = aE, a being a constant (the me¬ in unit time equals the energy con¬
dium obeys Ohm’s law), (b) Taking tained in the volume c (the volume
E = E*e-“*, find the time-independ¬ swept out by unit area of the wave-
ent wave equation for this case. front in unit time).
4. Derive d’Alembert’s equation for 9. Suppose that for a plane wave in
the vector potential A. free space the Poynting vector is 0.06
5. For plane waves the ratio |E|/|H| watt/m2, (a) Find E and H. (b) Find
is called the wave impedance. Show that the energy density of the field. These
the mksc unit for wave impedance is will be effective (rms) values.
volts/amp and find its numerical value 10. Suppose that the wave in prob¬
for free space. lem 9 has a cross section of 105 m2 and
6. Show that for an air-core toroidal that it is emitted as a pulse, the trans¬
solenoid the ra^io of the mmf to the mitter radiating for 10-6 sec. Find
flux is 1/p.oA, where l is the circum¬ (a) the length of the pulse, (b) the total
ference and A is the area of cross sec¬ energy radiated, and (c) the power of
the transmitter during emission.
tion.
7. For copper, we find that J =oE, 11. Suppose that a charge q oscil¬
where a has the value 5.8 X 107 mksc lates about the origin, its displacement
242 maxwell’s postulate [chap. 11
at any time t being given by y — 13. (a) Write the wave equation for
A sin cot. Show that the energy radi¬ a scalar function u in spherical co¬
ated per cycle is W = q2A2co3/6e0c3. ordinates. (b) Show that u =
12. For the waveguide in Section Aei(-at~yr\/r, with y = oo/c, is a solu¬
11-9 the components of E were found tion. (c) What sort of wave does this
to be given by Eqs. (11-36). (a) solution represent?
Show that in these equations Ai = 14. Discuss the Poynting vector and
—Azki/hx. (b) Prove that H has a its physical interpretation in the case
longitudinal component Hz. of a charged bar magnet.
f
CHAPTER 12
We shall postulate, then, that we are dealing with linear media. Unless
stated otherwise, we shall also assume that the medium is homogeneous
(same properties at all points) and isotropic (same properties in all direc¬
tions). Such linear, homogeneous, isotropic media are sometimes called
Class A media; they do not include crystals.
Three linear relationships between electromagnetic quantities will be
treated, namely, (1) that between the current density J and the electric
intensity E, (2) that between the electric displacement D and the electric
intensity E, and (3) that between the magnetic induction B and the
magnetic intensity H.
12-2 Ohm’s law: the relation between J and E. We may always write
J - cE, (12-1)
which is a definition of the electrical conductivity a. In general, a might be
some complicated function of E, or it might vary in an erratic way, like
the ratio of a person’s age to his weight, but henceforth we shall postulate
for the substances under consideration, that
This is one form of Ohm’s law. It is found to hold true for many common
materials, notably metallic conductors and electrolytes.
Suppose that we consider a portion of a conducting rod, or wire, whose
length is l and whose cross section is A. If the field inside the conductor is
longitudinal and of magnitude E, the current I will be
I = JA = *EA. (12-2)
1 = (t) v■ (12~3)
We now define the resistance R of any section of a conductor as
V
R = j- (12-4)
1 ohm = 1 volt/ampere.
12-3] kirchhoff’s laws 245
R = ~^r- (12-5)
aA
12-3 Kirchhoff’s laws. The main reason for introducing these well-
known laws here is to discuss their significance, rather than their applica¬
tions. Consider a network of current circuits containing sources of emf
Si, S2, S3, • • • and resistances R1} R2, R3, • ■ ■ • Call the respective currents
Iu I2j /3, • • • , where the subscript refers to a given branch of the network
(Fig. 12-1). Kirchhoff’s laws state that for steady (direct) currents:
I. At any junction the sum of the currents entering equals the sum of the
currents leaving that junction, and
II. Around any closed loop in the network the sum of the emf s equals the
sum of the products of current and resistance for each conductor in the loop.
The first law is a statement of the continuity principle, or the law of the
conservation of charge. With steady currents, charge cannot accumulate
anywhere and daw I must hold true. In the equation of continuity,
dp
V . (Pv) = V • J = — —>
V•J = 0 (12-6)
E = Es + E'.
From Eq. (12-1), we have
E = i,
which yields
J
Es + E' = -
a
Es -ds + fE'-ds
The first integral on the left is zero, because the electrostatic field is con¬
servative. The second integral is the electromotive force 8 around the
loop, or the algebraic sum of the emf’s of whatever sources are present in
the loop. The integral on the right side may be transformed as follows:
fE • ds = fl dR. (12-7)
m = qd. (12-8)
sultant of the dipole moments in any finite volume must then be in the
direction of E. If there are N dipoles per unit volume and on the average
each has a moment m# in the direction of E, then
P =: NmE. (12-9)
face and let this form part of a volume Ad extending a distance d straight
into the medium, where d is the length of a dipole. Then the electric mo¬
ment of this volume will be (Ad)P. According to Eq. (12-8), the charge q
that must be displaced a distance d from the charge — q in order to produce
the dipole moment (Ad)P = APd is q = AP. On the surface there will
appear an unbalanced positive charge per unit area equal to P. Letting <r'
be the surface charge density due to polarization, we have
P-dS. (12-11)
s
12-5] POLARIZATION CHARGES 249
W:
Q (±3 77—LA
a£3Wo
VI—J
<3 &M
x = 0
From the conservation of charge principle, a net charge q' leaving a volume
that was originally neutral must leave a charge —q' inside the volume,
where
t (12-12)
—Q
If we combine Eqs. (12-11) and (12-12) and apply the divergence theorem
to the surface integral, we have
12-6 Gauss’ law for dielectrics. Suppose that we wish to compute the
electric field at some point in a region that contains both free and polariza¬
tion charges. Two alternatives are open to us. We may (a) treat the polar¬
ization charges on an equal basis with the free charges and consider
the medium to be in other respects like air or a vacuum, or we may (b)
disregard the presence of polarization charges, but treat the medium as
possessing a different permittivity e than that of air or a vacuum (that is,
e ^ <e0). While (a) has the appeal of greater physical reality, it has the
disadvantages that one must first determine the polarization charges and
that the polarization charges further complicate the expressions. In
method (b) the results are simpler and more analogous to those for free
space. Values of e for different media have been measured and listed. We
shall show how method (b) follows from method (a).
Let us apply Gauss’ law to the surface S enclosing the volume r. This
theorem may be extended to media other than free space if we write it as
The inclusion of the p' term takes care of the alteration in the field due to
the presence of a polarizable medium. Equation (12-14) reduces to Eq.
(8-9) for free space. Now for p we substitute — V • P and obtain
Next we apply the divergence theorem to the last term and transpose it,
which yields
D = e0E + P. (12-16)
(12-17)
D = 6 E, (12-18)
this equation defines e, the electric 'permittivity of the medium, and the
statement that for a given medium e is a constant independent of E is our
restricted “law” for Class A dielectrics. This postulate is equivalent to
saying that the polarization is proportional to the applied field, for from
Eqs. (12-16) and (12-18) we have
P = D — €oE = (e £o)E.
When we put Eq. (12-18) into Eq. (12-17), Gauss’ law takes the form
eE • dS P dr, (12-19)
which differs from that for air or a vacuum only in that e replaces e0.
Finally, we define the dielectric constant Ke of the medium as
6
K, (12-20)
60
which is the most convenient form, since values of Ke for different media
are listed in tables. (For air Ke = 1.0006 and for a vacuum Ke is unity.)
Ke is a dimensionless quantity, while e is expressed in terms of the same
unit as <„ (coul2/n-m2), and D in coul/m2, as before
From Eq (12-21) we see that if we compare the field for a given distnbu-
tion of free charges in a dielectric (filling the whole region where there is a
fie'ld) with the field for the same charge distribution in free space, then,
252 ELECTROMAGNETIC PROPERTIES OF MATERIAL MEDIA [CHAP. 12
since the right side of the equation is the same for each case, KeE at a
point in the dielectric must equal E at the same point when the medium
is air or a vacuum. Thus the presence of the dielectric reduces the field
to 1 /Ke of its value without the dielectric.
If we change the left side of Eq. (12-17) back into a volume integral,
using the divergence theorem, and equate integrands, we get
V •D — p, (12-22)
Since in a dielectric the field of a point charge is still central and a func¬
tion of the distance from the charge to the point where the field is com¬
puted, we again have
V X E = 0 (12-24)
Fig. 12-6. Gauss’ law applied to di- Fig. 12-7. Lines of D and E.
electrics.
Gauss’ law in the form of Eq. (12-21) to a surface Si enclosing an area A'
of the positive plate, and extending into the air space between the plates,
we find as before that in the air (Ke =1) the field Ea is
a
Ea
eo
Ed = §!=-• (12-26)
Ke e
Now take the surface S3 to include only an area of the dielectric s left
face, one side of S3 being in air (where the field is opposite to the normal),
and*one side being in the dielectric (where the field is outward). Using
Gauss’ law in the form of Eq. (12-14), we get
— eo Ea + eo Ed — <*'>
t
where a' is the surface charge density due to polarization on this face of the
slab. Substituting for Ea and Ed, we find that
(12-27)
1
Note that Dd = eEd = = e0Ea = Da. In Fig. 12-7 the solid lines
represent E, while the solid plus dotted lines represent D for the case
where Ke — 3.
254 ELECTROMAGNETIC PROPERTIES OF MATERIAL MEDIA [CHAP. 12
The second important case is when there are two conductors in the vicin¬
ity of each other and they are given the charges -\-q and —q, respectively.
Let Vab be the potential difference between the conductors. The capaci¬
tance C of this combination, which is called a capacitor, is defined as
C = (12-28a)
Vab q,
12-9] ENERGY STORED IN AN ELECTRIC FIELD 255
location of the conductors present and the value of Ke for the intervening
medium.
From the definition of C we see that the mksc unit of capacitance is the
coulomb/volt. We again introduce another name for a derived unit and
define
1 farad = 1 coulomb/volt.
This name honors the English physicist Michael Faraday, whose work in
connection with electromagnetic induction was discussed in Chapter 10.
Illustration. Parallel-plate capacitor. Consider the parallel plates de¬
scribed in the illustration at the end of Section 12-7. Let the separation of
the plates be l and the thickness of the dielectric slab be t. Then, as before,
Ea = — ’ Ed — j=-— ,
eo &eto
so that, if the negative plate is grounded, the potential of the other plate
will be
V = /E • <&. = Ea(l - t) + Edt
eo {l ~ + Kl.
q = aA.
r_eo A . (12-29)
I — t A~ t/Ke
If the dielectric completely fills the space between the plates, so that t — l,
then C will be proportional to Ke.
W = qVa da
= \qV. (12-30)
W = %CV2, (12-31)
W = iKee0E2Al,
If we again take the view that the work done in charging the capacitor
goes into energy stored in the field, then, noting that the field is uniform
throughout the volume Al, we may say that
M = NIAtlu (12-34)
'
'
\
b-S
I' = 1 V = 1
II ’
X
(a)
I' = 2 I' = 3
^
o
II
II
-►*>- -***-
0 x
(b)
dMz dMy
J'x oc - — „ J
dy dz
dMx dMz
T' oc-— 7
dz dx
dMy dMx
J'z OC-^ — „ 7
dx dy
or
J' a v X M.
J' = V x M. (12-35)
B • dl — y0 J■dS MO (V x M) • dS.
Next we apply Stokes’ theorem to the last term and transpose, getting
(12-37)
H = — - M, (12-38)
MO
a definition in agreement with that given earlier for free space (where
M = 0). Equation (12-37) now becomes
which is a more general form of Ampere’s law, good for any medium.
However, this equation is not of much practical value if we do not know
how H and B are related. At this point we shall introduce such a relation
and discuss its applicability.
B • dX = n J j J • dS, (12-41)
x s
Km = (12-42)
Mo
j B • dk = Kmpoffj-dS, (12-43)
x s
j> H • dX = HI.
x
The right side of Eq. (12-39) is the total conduction current threading the
path, that is, NI. Hence we have
HI = NI,
H = —■ (12-44)
From this we see that for a toroidal solenoid, or a long straight solenoid, H
is equal to the ampere-turns per meter. We still cannot calculate B if m
262 ELECTROMAGNETIC PROPERTIES OF MATERIAL MEDIA [CHAP. 12
Bl = KmnoNI,
NI
B — Kmn0
l
Bl = mo NI MoJ' • dS,
or
Within the core the field is uniform and the intensity of magnetization M
is constant in magnitude. Then J' = V x M is zero in the core. There¬
fore, the contribution to the left side of Eq. (12-45) must come from the
surface of the core, where the medium changes. We see that a current
circulating in the same sense as I, if Km > 1, or in the opposite sense if
Km < 1, and with the magnitude of (Km — 1) I per turn, will account
for the left-hand term.
It is instructive to compare this illustration with that at the end of Sec¬
tion 12-7. If we have two charged parallel plates, with a slab of dielectric
filling part (and air the rest) of the space between them, and if the free
charge density is a on one plate and — a on the other, the value of D is the
same in the dielectric as in the air; however, the value of E in the dielectric
(for which Ke > 1) is less than in the air. The polarization of the dielectric
produces on each surface of the dielectric slab a polarization charge opposite
in sign to that of the free charge facing it, with the result that the electric
field in the dielectric is weakened. In the magnetic case, on the other
hand, the magnitude of H is NI/l regardless of the medium, but the value
of B is greater in a core for which Km > 1 than in an air core with the same
number of ampere-turns per meter. The magnetization M adds to the
flux of magnetic induction when Km > 1.
Let us compare the values of Km for the three classes of magnetic media
mentioned earlier.
(1) Diamagnetism. We saw in Section 10-4 that when a magnetic field
is applied to a substance, the motions of the atomic electrons are altered
in such a manner as to produce magnetic moments in a sense opposite to
12-12] THE POSTULATE RELATING B TO H 263
whatever the medium. From this it follows that the normal component of
B is the same on each side of a boundary surface, or
The explanation is similar to that for Eq. (12-23), which followed from
the equation V • D = 0.
If we go back to Eq. (12-39) and use Stokes’ theorem to change the line
integral into a surface integral, we get
fj
s
(V x H) • dS = JJ]s’
• dS,
This is the general form of the “point statement” of Ampere’s law. While
it applies to all media, it is valid only for a steady state (no displacement
currents). At a surface where J == 0, we find, in the same way as for
the electric case when V x E = 0, that the tangential component of H
is the same on each side of the surface, or
In Chapter 11, D stood for e0E, so that this equation was equivalent to
Eq. (8-13), namely,
V •E = —•
12-14] maxwell’s equations for material media 265
From Eq. (12-22), we see that the equation V *D = p holds for any
medium so long as we understand that in general D equals eE and not e0E.
If we replace e0 with e in the equation for V *E, we get V -E = p/e.
Is this equation equivalent to V • D = p? The answer is “yes” only when
e is independent of the coordinates, such as at a point within a homogeneous
dielectric, but not at a point on its surface. (Why?) The correct general
form of Maxwell’s first equation is thus V • D = p.
Let us turn to Maxwell’s second equation,
V x E = -B.
V -B = 0.
It was pointed out in the last section that this equation always holds true.
The fourth of Maxwell’s equations (11-21) was
V X H = J + b.
For a steady state this reduces to Ampere’s law expressed in the form
V x H = J.
In Chapter 11, H stood for B/p0. For magnetic media we get the same
equation provided that we take the relation between H and B to be
H = B/p rather than H = B/p0. What holds true in this respect for con¬
duction currents is also found to hold for displacement currents.
Maxwell’s equations for any medium then are:
V •D = p, V X E = —B,
(12-50)
V - B = 0, VxH = D|J.
From the definitions given for a, e, and p, we add to Eqs. (12-50) the
supplementary constitutive equations
1 (12-52)
v
c
n (12-54)
v
that is, the ratio of the velocity of an electromagnetic wave in vacuo to its
velocity in the medium. For transparent media p generally equals p0, so
that in this case
n = Vc/ £o = y/K~e , (12-55)
n = E x H, (12-56)
but for the energy density of an electromagnetic field the expression is now
B
V • e0E = p Vacuum (or air) —* V X —— J
Po
PROBLEMS 267
V • («oE + P) = p
VX(^“M) = J
6oE + P = D ——M=H
Mo
V • D = p V X H= J
B
D = eE H
M
V • (eE) = p VX-=J
M
eo MO
B
V • Kee0E — p V X
KmPO
=J
Problems
1. Derive the expression for the re¬ 4. A point charge q is placed at the
sistance R equivalent to the resistances center of a small spherical cavity in an
Ri, R2, and R3 in parallel. infinite dielectric whose constant is Ke.
2. A circuit contains a resistance R, Find the induced polarization charge q'
self-inductance L, capacitance C, and on the wall of the cavity and show that
emf given by Soe1"*, all in series. Let I E in the dielectric is the same as it
be the current and q the charge on C, would be if q' were added to q and the
at any time l. (a) Write the equation medium were free space.
expressing the equality between the 5. A line of force representing E
applied emf and the sum of the poten¬ meets the interface between two di¬
tial differences across R, L, and C at electric media, of constants K1 and K2,
any instant. (b) Using I — dq/dt, respectively. The line is refracted,
obtain a differential equation for I and making an angle 61 with the normal in
compare it with the mechanical one for the first medium, and an angle 62 in
forced oscillations with damping. the second medium. Find an expres¬
3. How will the current I vary with sion relating di, 62, Ki, and K2.
the voltage V in a material for which 6. Two large parallel plates are 3 mm
(a) cr = <tq\/E/Eo, and (b) cr = apart and are maintained at a poten¬
aoE2 3/Eo, where op and Eo are con¬ tial difference of 600 volts. Between
stants ? the plates are two dielectric layers,
268 ELECTROMAGNETIC PROPERTIES OF MATERIAL MEDIA [CHAP. 12
Part III
WAVES, PARTICLES, AND
RELATIVITY
#
(
< >
CHAPTER 13
PHYSICAL OPTICS
of refraction, etc. Similarly, the subscript 2 will refer to the other medium
into which the light may penetrate. We shall take the interface between
the two media to be the plane x = 0.
The electric vector E has been shown to satisfy the wave equation
V • VE (13-1)
where
(13-2)
<32E 1 d2E
(13-3)
dx2 V2 dt2
for which the solution was shown to be some function/(x ± vt). It has
been pointed out that any periodic wave may be considered to be built up
of harmonic waves, so we need only concern ourselves with the case where
the function / is a harmonic one, say
/ = Ae*rXx-vt)l\t (13_4)
where j = \/—l (in this chapter), and X is the wavelength. Changing this
to
Ae2* j(vt—x)/\
/ = (13-5)
/ = AeMt-ix/v)]' (13_6)
(13-7)
(13-8)
(13-11)
sm i sm r
(13-12)
274 PHYSICAL OPTICS [CHAP. 13
Ei — |E ]. | = E\y, Hi = |H 11 = Hiz
are the amplitudes of the electric and magnetic vectors of, say, the incident
wave. (We shall continue to use primes for the reflected wave and the sub¬
script 2 for the refracted wave.)
In the last chapter (Eq. 12-25) we saw that at a boundary the tangential
component of E must be continuous. In medium 1 we must add the fields
due to the incident and the reflected waves, so we have
Ei I Mi
(13-16)
Hi ~ yjei ’
For the reflected wave either E or H must be reversed in phase, since the
direction of propagation, which is that of E x H, is reversed. If Ex and
H i are both positive, we have
(13-17)
E2
(13-18)
h2 ~
13-2] REFLECTION AND REFRACTION AT NORMAL INCIDENCE 275
Using these last three equations to eliminate the H’s in Eq. (13-15), we get
i
U3-19)
Between this equation and Eq. (13-14) we may eliminate E2, which yields
E\ _ ni — n2 Hi
(13-21)
Ei nx n2 Hi
The ratio of the reflected power, and energy, to the incident power, and
energy, is
E'iH'i _ (nx — n2V
(13-22)
EiH} — \nx + n2)
In the same manner we may find the ratios E2/Ex and E2H2/EiHx.
The results are
E2 _ 2ni
E\ nx -f- n2
(13-23)
E2H2 __ 4win2
EXHX ~ (nx + n2)2 ‘
The last equation gives the fraction of the incident power, and energy, that
is transmitted into the second medium.
276 PHYSICAL OPTICS [CHAP. 13
Example. Given that n = 1.5 for ordinary glass. Find the percent of power
transmitted and the percent reflected for light incident normally from air to glass.
e[h[
= ttp X 100% = 4% of power is reflected.
EiH! Zb
Thus a glass plate in air reflects only 4% of the light intensity falling normally
on its surface.
El
Ej
vi
change phase upon reflection, as in the figure, so that the tangential com¬
ponent of Hi is H'i cos i.
Using the condition that the tangential component of E must be con¬
tinuous at the boundary, we have
Ex + E\ = E 2, (13-24)
Between Eqs. (13-24) and (13-26) we may eliminate E2 and solve for E( in
terms of Ei. Let us at the same time put Vei/e0 — n-i and Ve2/e0 = n2.
The result is
n i cos i | j n i cos i i
= Ei
Ln2 cos r j [_n2 cos r J
sin (r — i)
E i = Ei (13-27)
sin (r + i)
Note that when n2 > n\, and r < i, Ei and E\ have opposite signs, in¬
dicating a change in phase for E, as we found for normal incidence. The
case of grazing incidence, for which i = 90°, is interesting; for it we find
that E[ = Ei sin (r - 90°)/sin (r + 90°) = -Elt or the coefficient of
reflection equajs —1, which means that all of the light is reflected.
For case (b), where E lies in the plane of incidence, Fresnel found in a
similar way that
tan (r — i) _
E\ = Ei (13-28)
tan (r + i)
278 PHYSICAL OPTICS [CHAP. 13
An interesting situation arises in this case when r + i — 90°, that is, when
the incident and refracted rays are at right angles to each other. Then in
Eq. (13-28) the denominator becomes infinite and so E[ = 0. This means
that for light which is polarized so that E lies in the plane of incidence,
and which is incident at the angle i = 90° — r, no light is reflected and
all is transmitted. If unpolarized light is incident at this so-called polarizing
angle, then no reflected light will have its electric vector in the plane of
incidence, i.e., the reflected light will be completely polarized, with its
electric vector at right angles to the plane of incidence, or parallel to the
interface. It was this effect that led to the discovery of polarized light.
Equations (13-27) and (13-28) are known as Fresnel’s equations be¬
cause in 1823 Fresnel derived equivalent equations for transverse waves in
a hypothetical elastic ether. Of course, Fresnel did not know about elec¬
tromagnetic waves; what we called the electric vector E he thought of as a
displacement.
(13-29)
where the integration is taken between two points Q and P on the same
ray. It usually turns out that the optical length for the actual path is a
minimum, which also means that the optical path is the path of least time.
We shall prove this theorem for three cases, (a) a homogeneous medium,
(b) reflection, and (c) refraction.
(a) Homogeneous medium. Here n is a constant, so that
Q
13-4] fermat’s principle 279
the point 0, where the ray enters the second medium, to be our origin.
Let L be the optical length of this actual path. Then
Suppose that the varied path crosses the interface at the point O', whose
coordinates are (0, by, bz). The optical length L' of this path will be
nxV x2 + y\ — n2V x2 + y\
n2y2
+ ,- sy
)
-
\V x2 + y22 V x2 + y2
where i and r are the angles of incidence and refraction, respectively. (If
yi is positive, y2 must be negative, so that sin r = —y2/{x\ + y\)1/2.)
From Snell’s law, ni sin i = n2 sin r, so that
bL = 0,
1/2
*That is, Vx2 -J- y2 — 2yby = X^x2 -f- y2 1 2 yby
x2 + y2
(see Eq. 13-6). Adding and using the trigonometric formula cos a+
cos /3 = 2 cos £(a + j8) cos £(« — 0), yields
282 PHYSICAL OPTICS [CHAP. 13
E = E\ + E2 = 2A cos |
-£)+?(■ -5)]
, x\ C02 / xy
cos t-)-[ t-)
L 2 V Vx) 2 \ v2 /_
Example. For deep-sea waves, we find that v — \/~g\/2t, where g is the accel¬
eration due to gravity. Find the group velocity for waves with a small range of
frequency.
13-6] HUYGENS’ PRINCIPLE 283
1 d /v\ 47xv
G dv \v) g
Here the longer waves travel faster and, for a limited range in wavelength, the
group velocity is half the average phase velocity.
13-6 Huygens’ principle. We are familiar with the fact that sound
waves can pass around corners and that a speaker may be heard even when
he is not in the line of sight of the listener. Similarly, water waves bend
to some extent around the edge of a breakwater. This phenomenon, called
diffraction, is observed for light waves when they pass through apertures
not many times wider than the wavelength of the light, or when the edge of
the shadow cast by a sharp object is examined under a microscope. The
observed effects can be explained only in terms of wave theory, and in
doing so we must start with some law or postulate. This postulate is an
extension of the principle of superposition given in the last section, and it
was first stated by Huygens as follows.
Suppose that light travels out from a source and reaches all points on a
certain surface at the same time t. Then this surface is one of constant
phase for a given wavelength, and we call it a wavefront. We wish to
find the shape and position of this wavefront at a slightly later time
t + 8t. This may be done, according to Huygens, by assuming that at
l =z t every point on the original wavefront is a source of a secondary dis¬
turbance, which spreads out with the velocity v of the wave at that point,
forming spherical wavelets. After the time 8t these wavelets will have
radii of v8t and the envelope of these wavelets, Huygens postulated, will
be the new wavefront at the time t 8t. In justification of this procedure
it may be mentioned that points on different wavelets correspond to the
same phase, and that the envelope of these small spherical wavelets will
contain more such points of phase agreement than any other surface that
could be drawn. Thus, according to the principle of superposition, the
envelope will be4a surface of maximum reenforcement.
The laws of reflection and refraction may be derived from Huygens’
principle For example, in the case of a plane wave in medium 1 striking
the surface of a denser medium 2, as in Fig. 13-5, the wavelets will have
radii Vi8t = c8t/n\ in the first medium and v28t = c St/n2 in the second;
284 PHYSICAL OPTICS [CHAP. 13
with n2 > rii, the wavelets travel more slowly in the second medium and
the new wavefront in that medium is more nearly parallel to the interface,
so that r < i. A quantitative treatment leads to Snell’s law.
There are, however, some objections to Huygens’ principle. First, it
leads to a backward as well as a forward wave because the secondary
wavelets have two envelopes, one on each side of the original wavefront.
Of course, we usually know which way the wave is progressing and so
can arbitrarily rule out the backward wave, but it is still a formal weak¬
ness of the theory. Second, Huygens’ principle does not tell us the relative
intensity at a point, in the case of diffraction.
Huygens’ principle was modified and extended to cover diffraction by
Young and Fresnel. They retained the idea that every point on a wave-
front may be regarded as a source of a secondary disturbance, but they
carried out the superposition of the secondary waves in a different way.
Rather than taking the envelope of the wavelets at the time t + 8t, they
considered the individual phase of each secondary wave as it passed, at
some definite time, through a given point. They postulated that the result
of adding these secondary waves, allowing for phase differences, determined
the actual disturbance at the point. This amounts to treating all the ele¬
ments of area of the original wavefront as interfering sources of light.
Simple applications of this method are given in elementary texts. Here
our task will be to derive a mathematical expression for diffraction which
embodies Fresnel’s modification of Huygens’ principle, but is even more
general in its approach. Before doing so, we must derive a purely mathe¬
matical theorem in vector analysis, which we shall need to use.
(13-33)
s r
13-7] green’s theorem 285
Let us put A = uVv, where u and v are scalar functions of the coordinates.
We then have
and (13-39)
nV
V • Vv =-5- v.
c2
Upon substituting these relations into the right side of Eqs. (13-36), we
find that
2 2
n co
(uV • Vv vV • Vu) dr (uv — vu) dr = 0.
286 PHYSICAL OPTICS [CHAP. 13
e-jkr
v = (13-41)
r
where k = nw/c and r is the distance from P to a given point. The function
V = vejut will then also satisfy the wave equation (see problem 11-13);
while this is the solution for spherical waves spreading out from the point
r = 0, here we just consider v a mathematical function. Since v becomes
infinite at the point P, we must exclude this point from the region of integra¬
tion t in Green’s theorem. We therefore surround P with a very small
spherical surface S’ and take r to be the region between S' and some outer
closed surface S. Then the surface of integration in Green’s theorem must
include both S and S'. Remembering that the positive direction of the
normal is outward, we see that for S' the positive normal is toward P. Now
we may apply Eq. (13-40) to our present functions, and write
13-8] KIRCHHOFF S DIFFRACTION FORMULA 287
+ • dS' = 0. (13-42)
S'
The second integral, the one over the small spherical surface S' about P,
may be easily computed. Take the term
-jkr
- (Vu) p 4ttr2
and, as we let r approach zero, this term vanishes. Now consider the term
4:irUp.
Lim
r~*0
uf - b s
Vu — uV •dS. (13-45)
288 PHYSICAL OPTICS [CHAP. 13
(13^6)
where to — 2-kv and k = u/v = 2ir/\. Taking out the time factor and
13-8] kirchhoff’s diffraction formula 289
remembering that we are letting r', rather than r, represent the distance
from Q, we have
A. — i kvr
u = — e , (13-47)
r
so that
_ , du , jk
Ae
—jkr'
V“ = r‘ 6? = 11
,—jkr
jk -jkr
rl y»2 r
When we substitute these expressions back into Eq. (13-45) we get the
terms r( • dS and rx • dS. Let r[ • dS = cos (n, r') dS, where (n, r') is the
angle between r( and the normal dS, the latter being outward on the side
away from P. Similarly, let rx • dS = cos (n, r) dS. We shall assume that
so that 1/r2 « jk/r, and similarly for r'. This amounts to saying that
while the phase may not be considered constant over a narrow aperture,
the distances may. Substitution in Eq. (13-45) then yields
which is Kirchhoff’s formula for the diffraction of light from a point source.
The factor j in Eq. (13-48) represents a constant phase shift of 90°.
This is unimportant, however, since we may start counting our time when¬
ever we wish. The quantity A depends upon the strength of the source.
The exponential factor is a phase factor; in it slight changes in r or r' may
mean a large change in phase, since k(r + r') = 2?r(r + r')/\ and X « r
or r'. However, the other factors may be treated as approximately constant
when the integration is taken across an aperture whose dimensions are small
compared with r and r'. In such cases, we may write
Up = #
2X TT
A
[cos (#, r) - cos («, r')] (/
JJ
dS. (13-49)
disturbance at the aperture is what it would be if the screen did not exist
(Eq. 13-47), and that r and r' are large compared with A. Nevertheless,
Kirchhoff’s formula proves quite adequate for finding the disturbance up
at a point which is not too near the aperture.
Illustration 1. Variation with direction of the amplitude of the secondary
wavelets. As in Fig. 13-7, let Q be on the normal of a small aperture in a
screen, so thatr'x and dS are antiparallel and cos (n, r') = cos 180° = — 1.
Let 0 be the angle between iq and dS, so that
When P is on the opposite side of the screen from the source and in line
with the source and aperture, cos 0=1 and the factor in Eq. (13-50) has
the maximum value of 2. On the other hand, if P is taken to be between
Q and the aperture and we consider only the secondary wavelets and not
the direct disturbance from Q, cos 0 = — 1 and the cosine factor is zero.
The backward wave is thus ruled out, and our theory also tells us how the
amplitude of the secondary wavelets varies with the direction, i.e., the
angle 0. The objections to the original Huygens’ principle have thus been
removed.
Illustration 2. Fresnel zones. We turn now to the phase factor e~ik(-r+r'\
Suppose that we have a monochromatic point source Q on one side of a
plane screen, with the point P on the other side, and that the straight line
connecting Q and P intersects the screen at 0 (Fig. 13-8). For the line
QOP the sum r + r' will have a minimum value which we shall call R.
Let us draw on the screen about the point 0 a curve such that for every
point on this curve r + r' = R + A/2, where X is the wavelength. Let us
next draw the curve for which r + r' = R + 2X/2, then the one for which
r + r' = R + 3X/2, and so on. These curves will be circles if the screen
13-8] kirchhoff’s diffraction formula 291
up — a\ — «2 + a3 — «4 + • • ' • (13-51)
(13-52)
Since the successive a’s differ only slightly in value, the quantities in
parentheses are very small and may be neglected, so that we have
ai
Up = T'
The disturbance at P for no screen is half that due to the first zone acting
alone. «
(b) The aperture in the screen uncovers just the first zone, then
Up = <Xi,
Up = A+ \2 - “2 + 27 + " • + V ’
while if n is even, we can put
/ Qre—1 ^n + l\
+
V 2
O-n
2 )
cannot be neglected. We must evaluate each an and use Eq. (13-51).
The contribution of a zone depends on (1) its area, (2) its distance, and (3)
the obliquity factor cos (n, r) — cos (n, r').
from Q to P, but intersecting S at points O', 0", etc., all close to 0 (Fig.
13-9). We may, if we wish, imagine S to be a screen with a small hole
just permitting the pencil of rays to pass through. According to Fermat’s
modification of Huygens’ principle, all of these rays must reenforce one
another at P, which they will do if their paths all have nearly the same
optical lengths, thus assuring close phase agreement. This means that the
first variation in the optical length, as we go from the path QOP to the path
QO'P, or to QO"P, etc., must be zero, or
nds = 0,
as in Section 13-4.
Problems
1. Show that total reflection occurs the prism, then sin J(A -)- D) =
for ni > ri2 and sin i > U2/ni. Dis¬ n sin \A.
cuss for this case (a) the angle r, (b) 8. For ripples on the surface of a
the damping of the transmitted wave, liquid the phase velocity is given by
and (c) the Poynting vector for the v = (2tT/p\)1/2, where T is the sur¬
transmitted wave. face tension, p the density of the liquid,
2. Show that for reflection at normal and X the wavelength of the ripples.
incidence H[/Hi = —E[/E\. Find the group velocity for waves with
3. Derive Eq. (13-23) and show a small range of frequencies.
that the fraction of power trans¬ 9. How must the phase velocity
mitted plus the fraction reflected equals depend on the frequency for the group
unity. velocity to be infinite?
4. Show that for oblique incidence 10. By expanding both sides, prove
and E normal to the plane of incidence, that
E2 = 2Ei sin r cos I'/sin (r -f- i). V • (uVo) = (Vm) • (Vv) + uV • Vv.
5. Derive Eq. (13-28).
6. In terms of optical path, discuss 11. Plot up against 6 graphically,
the focusing effect of a double convex using polar coordinates with up as the
lens. radial coordinate.
7. A ray of light enters, passes 12. Consider a source emitting light
through, and emerges from a prism of wavelength X to be very far in front
whose vertex angle is A (see Fig. of a plane (that is, r' = °o), and let P
13-9). Inside tlje prism the index of be a distance l behind the plane, (a)
refraction is n and outside it is unity. Find, in terms of X and l, the outer
The ray travels parallel to the base in radii of the 1st, 2nd, 3rd, . . . , nth
the prism. Use Fermat’s principle to Fresnel zones on the plane, (b) How
show that if D is the angle through do the width and area of a zone depend
which the ray is deviated because of on n? Take nX <K l.
294 PHYSICAL OPTICS [CHAP, 13
13. Discuss the relative intensity at is a circular disk centered on the line
P in problem 12 if the zone pattern is QP, then, if the diameter of the disk is
gradually uncovered by moving to one small compared with r' and r, the value
side a semi-infinite screen that origi¬ of up is the same as if the disk were
nally covers the zonal plane in front not present.
of P' 16. Take Fresnel zones on a spheri¬
14. Take the zonal plane in problem cal surface of radius r' about Q, the
12 to be a screen with a circular hole distance QP being R. Show that
centered on the line QP. Discuss the Up = iai, or half the contribution
relative intensity at P as the radius from the first zone. Compute this con¬
of the hole is gradually increased. tribution by integration, showing that
15. Show that if the only obstacle up = Ae~jkR/R, as it should for a
between the source Q and the point P spherically spreading wave.
CHAPTER 14
The fundamental laws of physics that we have considered so far all date
from before the year 1900, whereas the principle of relativity and the
quantum principle were only discovered in the present century. The
physics of the period prior to 1900 is frequently referred to as “classical
physics” and we have seen that it succeeded in discovering the laws and
building the theories in terms of which the ordinary occurrences of our
physical world can be explained. The reason that so much could be accom¬
plished without the use of the relativity and quantum principles, is that the
speed of light c is so great and Planck’s constant h is so small. In fact, if
the speed of light were infinite, those effects peculiar to the relativity theory
would not occur and, of course, if h were zero it (h) would be of no im¬
portance. As it is, the speeds of material objects in everyday life are very
much less than c and the uncertainties in their action* are much greater
than h. We shall see that when the expressions of the relativity theory or
quantum theory differ from those of classical physics, the former reduce
to the latter when c is relatively infinite and h is relatively zero, so that the
newer physical theories embrace the older ones as a special case; this case,
however, includes most common occurrences.
*See Vol I p 193. The action between two states of a system is defined as the
time integral between the states of twice the kinetic energy; it has the dimensions
of the product of energy and time.
295
296 THE SPECIAL THEORY OF RELATIVITY [CHAP. 14
We shall see that the second postulate states what has been found to be
an experimental fact. The basis of the first postulate is that it leads to de¬
ductions which have also been found to be true. The two postulates to¬
gether tell us that there is no physical experiment that can be performed
relative to one inertial system only by means of which the uniform velocity of
that inertial system relative to another inertial system can be detected. This
is Einstein’s principle of relativity.
Suppose, for example, that we are enclosed in a large box without
windows, like a freight car or an elevator, and that this box can move
smoothly and quietly relative to the ground. If the motion is uniform,
we will not be able to tell when our box is moving. If, however, the motion
is accelerated, it would be immediately detectable, since, when a car speeds
up, we are accelerated backwards relative to the car. Similarly, when a
car is accelerated centripetally on a curved track, we appear to be thrown
toward one side of the car. In an elevator we can detect the speeding up
and slowing down by the apparent change in our weight, or the force of
gravity.
*See Vol. I, p. 67. We shall see that in relativistic mechanics the. invariance
of F = d{mv)/dt is maintained by postulating that mass varies with velocity.
14-3] SPACE-TIME AND FIELD TRANSFORMATIONS 297
V X E = -B, (14-1)
dE z dEy 9BX
dy dz dt
The invariance principle requires that this expression must transform into
the equation
V' X E' = -B', (14-2)
which are called the Newtonian (or Galilean) space-time transformation equa¬
tions. Referring to Fig. 14—1, we see that after a time t the Y'Z'-plane
has moved a distance vt, so that the rc’-coordinate of a point would appear
to be vt less than the ^-coordinate. Lorentz found, however, that with
these simple transformation equations the invariance of Maxwell’s equa¬
tions could not be maintained, so he abandoned Eqs. (14-3). Working on
the basis that Eq. (14-1) must transform to Eq. (14-2), and so on for each
of Maxwell’s equations, Lorentz arrived at the following expressions, which
we shall later derive from the principle of relativity:
. x — vt
x' — — j
Vl — v2/c2
^ t — vx/c2
Vl — V2/c2
k =
VT~— v2/c2
which are the Lorentz field transformation equations. He also derived addi¬
tional transformation equations for charge and current density, subject
to the conservation of charge principle; but in order not to complicate
matters further, these equations will not be given here. In other words,
let us take the case where p = J = 0.
As for many expressions in physics, it is much easier to test for the
correctness of Lorentz’ equations than it was to derive them initially. If
the Lorentz transformations are correct, then Maxwell’s equations for
empty space,
V • E = 0, VxE = —B,
(14-6)
V • B = 0, V X B — e0/i()E,
14-3] SPACE-TIME AND FIELD TRANSFORMATIONS 299
where V' = i'd/dx' + j'd/dy' + k'd/dz' and B' = dB'/dt', etc. We shall
apply this test to the Lorentz equations by checking the first of Eqs.
(14-7); we shall leave the checking of the others to be done as problems.
We wish to show that Eqs. (14-4), (14-5), and (14-6) are consistent with
the equation
dE'x dE'y dE'z
(14-8)
dx' dy' dz'
If we solve Eqs. (14-4) for x, y, z, t in terms of x', y', z', t', we obtain
the relations
which show that motion of S' in the X-direction with velocity v is equiva¬
lent to motion of S in the X'-direction with velocity —v. Differentiating
Eqs. (14-9) yields
dx dx dx , dt _ kv
dy' dz' dt' ~ kV> dx' ~ c2
etc., so that
d _ d dx_ d_dy_.d_dz_,d_dt_
dx' ~~ dx dx' + dy dx' + dz dx' dt dx'
300 THE SPECIAL THEORY OF RELATIVITY [CHAP. 14
£ ^ dEx V dEx
0.
dx ' c2 dt / \ dy dy / \ dz
I •
dB z dBy 1 dEx
— CqMO Ex
dy dz c2 dt
TV 1 ?1?2
F “ 4^ V5"11'
Ex = E'x, Bx = 0,
Ey = kE'y, By — ~kvE’z / ^ ,
Ez = kE'z, Bz = kvEy/c2.
Suppose that each observer is at a point for which E'y = E'x and E'y ^ 0.
Then Ey = kEx and Bz ^ 0.
We find that the way in which an electromagnetic field is separated into
electric and magnetic parts depends on the motion of the observer’s frame
of reference.
We have seen that from the mathematical standpoint the Lorentz
transformations keep Maxwell’s equations invariant. There are, however,
other consequences. (1) We note that t' ^ t and that t' is a function of x
as well as of t. Thus the value of t' depends on where an observer in the
14-4] THE MICHELSON-MORLEY EXPERIMENT 301
ce = Vc2 — V2.
h = 21 - = —a (14-10)
Vc2 — v2 c Fig. 14-2. Race between two swim¬
mers in a flowing stream.
2 Ic 2 ir
(14-11)
h c — v C + V
302 THE SPECIAL THEORY OF RELATIVITY [CHAP. 14
We must take v < c, or the second swimmer could not reach the point C.
Then k > 1 and hence t2 > tx. The first swimmer wins, but the differ¬
ence in times, involving only terms in the square and higher powers of v/c,
will be small if v « c. Note that by measuring tx, t2, and l we can de¬
termine the drift velocity v, if we know c.
Let us assume that the two swimmers are accurately timed. Suppose
(this is a purely hypothetical case) that the two times were found to be
exactly the same, what explanations might be offered? We might conclude
that (1) the ratio v/c is zero and k = 1, within the limits of measurement,
or (2) that the distance from A to C was inaccurately measured and that it
is actually less than the distance across the stream. Conclusion (l)Jmplies
either no appreciable current (v = 0), or very fast swimmers (c —» oo).
Conclusion (2) would necessitate a shortening of the distance AC to l/k,
where l is the width of the stream; with l/k in place of l in the expression
for t2, we see that we would have t2 = tx.
The reason that tx and t2 are different is because the current, which
carries the swimmers along, affects them differently. In the same way,
sound waves are carried by the air and, relative to the ground, travel
faster with the wind than across or against the wind. In the time of
Maxwell much thought was given to the question of whether there was
any medium which carried light. Some physicists suggested that there
was such a medium, pervading all space, which they called the ether, and
they regarded it as an absolute frame of reference. If such an ether existed,
the earth could not always be at rest relative to the ether, since in six
months’ time the motion of the earth in its orbit is reversed. Suppose that
the earth is moving through the ether, then this relative motion, called
ether drift, should affect the motion of light just as the current of the stream
was seen to alter the effective speeds of the two swimmers.
Maxwell suggested a crucial experiment for testing the ether theory.
This experiment was performed by Michelson, who was shortly aided by
Morley, and it is called the Michelson-Morley experiment. The apparatus
employed was essentially a Michelson interferometer floating on a pool of
mercury. The interferometer, diagrammed in Fig. 14-3, is the optical
analog of the case of the stream and the two swimmers. A beam of light
from a source Q strikes a half-silvered glass plate G, so that part of the light
is reflected to the mirror M x and part passes through G and is reflected by
the mirror M2. Upon returning to G, the two beams are again partly re¬
flected and partly transmitted. The transmitted part of the beam from
Mi joins the reflected part of the beam from M2 and the combined light
passes into the telescope T. The optical lengths l of the two paths are made
the same. Then if there is an ether drift v parallel to GM2, the light from
M2 will be, with respect to phase, a little behind that from Mx. A pattern
of interference fringes is observed through the telescope T.
14-4] THE MICHELSON-MORLEY EXPERIMENT 303
The crucial part of the experiment comes when the whole apparatus is
rotated 90° about a vertical axis, so that any ether drift originally parallel
to GM2 will now be parallel to GM i, causing a shift in the relative phases of
the two combining beams. During the rotation the observer must watch
for a resulting shift in the interference pattern seen through the telescope.
Taking for v the earth’s orbital velocity of 30 km/sec, Michelson and
Morley computed for their apparatus a shift of about half a fringe, ac¬
cording to the ether theory. Actually they found no significant shift, al¬
though they believed that a shift of 1/100 of a fringe could have been
observed. This experiment has been repeated many times since Michelson
first attempted it in 1880, and it is now generally agreed that the earth’s
motion through space cannot be detected as a motion through the ether.
As in the case of the two swimmers, the null result in the Michelson-
Morley experiment may be interpreted as indicating (1) that the drift
current is effectively zero, or (2) that distances parallel to the drift current
are effectively shorter by a factor 1/k than they appear to be to an ob¬
server moving with the apparatus. As we have said, the earth moves in dif¬
ferent directions at different times of the year, and therefore if there were
an ether the earth could not always be at rest relative to it. Conclusion (1)
then necessitates giving up the ether drift hypothesis. The Irish physicist
Fitzgerald suggested possibility (2), which amounts to the following.
Suppose that an observer measures the length of an object to be V when
he is moving with the object, but he finds the same length to be l = V/k
when the object is moving longitudinally with a speed v relative to the
observer In the Michelson-Morley experiment the observer is moving
with the apparatus and he finds that the V of the longitudinal path is equal
to the l of the transverse path. An observer at rest relative to the fixed
stars would say that the longitudinal path is shorter. Lorentz, who was
304 THE SPECIAL THEORY OF RELATIVITY [CHAP. 14
_ (x2 + y2 + zy12
c~ t
x2 + y2 + z2 — c2t2 — 0. (14-12)
where K is independent of x and t. The reasons for this “guess” are (a) a
linear relation is the simplest and should be tried first, and (b) the relation
must reduce to that given by the Newtonian transformation x' = x — vt,
when »«c. We shall see that the linear relation is sufficiently general to
satisfy all conditions and that it does reduce to the Newtonian relation
when v <3C c.
Since motion is relative, we may also regard S as moving in the — in¬
direction with the speed v, in which case we must write
x = K'{x' + vt').
Eliminating x' between this equation and (14-15) and solving for t' yields
t' = K (14-16)
In Eq. (14-14) we substitute for x' from Eq. (14-15) and for t' from Eq.
(14-16), to obtain
2 xt
+ c2K2
v V KK'J v2 V
Since this equation must be an identity, it must hold for all values of x
and t. This means that the coefficients of the x2, xt, and t2 terms must
vanish separately, or
K‘
0 it
KK h) = 0’
(v2 - c2)KK' + c2 = 0,
and
-c2 - K2v2 + K2c2 = 0.
1
K = K' = k.
y/1 — w2/c2
306 THE SPECIAL THEORY OF RELATIVITY [CHAP. 14
These values of K and K' also satisfy the first conditional equation.
Equations (14-15) and (14-16) then become
which are the same as the Lorentz equations (14-4). Thus these equations
are consistent with each of Einstein’s postulates for the special theory of
relativity.
or
Now (x'a — x'b) is the length of the stick as checked by the observer on
the train, and hence is exactly one meter. Then the length measured by
the ground observers is less than one meter by the factor 1 /It.
14-7] SIMULTANEITY AND RELATIVITY 307
= (xa — xb)/k.
If we take xa > xb, we see that tb > t'a. Observer C then claims that
by his clock B was a little later than A in making his observation. C might
then argue that since B looked at the moving meter stick later than A did,
B would see the back end at a point farther along the track than he would
have if he had looked at his end just when A was looking at the front end.
The result of B’s lateness would be to make the back end position seem
nearer that of the front end, i.e., the length would come out short of its
true value. . 7 . ,,
We see that two events, at xa and xb, which occur simultaneously in the
system S, are not observed to be simultaneous in the system S'. This theorem
is also a reciprocal one.
308 THE SPECIAL THEORY OF RELATIVITY [CHAP. 14
l - = vT^o^ = 0.8.
The ground observers report that the moving stick appears to them to be 0.8 m
long. In communicating this information to the moving observer, the ground
observers hold up a stick of their own which they have cut to be 0.8 m long. The
moving observer looks at this stick and concludes that it is 0.8/k = 0.64 m long
and that the ground observers missed the length of the moving stick by 0.36 m.
The moving observer explains this as the result of B’s reading being made
(y/c2)( 1.0 m) sec later than A’s reading. The moving observer points out that in
this time the back end of his stick would advance a distance
2
v(t'b -t'a) =^X(lm) = 0.36 m,
c*
which is what the moving observer believes to be the error made by the ground
observers.
Since k > 1, the time between A’s flashes is found to be longer by ob¬
servers in the S' system than it is according to A (who is in the system S).
Conversely, due to the relativity of motion, the time between two events
occurring at the same place in S' is found to be longer by observers in *S
than by observers in S'. We may sum up by saying that a clock will be
found to run more and more slowly the greater the relative motion between the
clock and the observer.
This effect becomes important in the case of high-energy particles of low
mass, such as mesons, whose speeds relative to the ground may be only
slightly less than that of light. These particles are observed to have short
lifetimes, but with their high speeds they may travel considerable distances
during their lifetimes. Since the speed of a meson gradually decreases as it
loses energy in the earth’s atmosphere, the velocity of the ground relative
to the meson also changes. Hence for a given meson the lifetime listed is
that for a system in which the meson is at rest. Calling this lifetime to,
the lifetime r observed in the earth system, relative to which the meson
is moving, may be considerably greater than r0- The apparent distance
traveled by the meson will be the value of v dt integrated over the apparent
lifetime r.
Example. A meson has a speed v = 0.6c relative to the ground. Find how far
the meson travels relative to the ground if its speed remains constant and the
time of its flight, relative to the system in which it is at rest, is 2 X 10-8 sec.
Solution. Since v = 0.6c, l/k = 0.8, k = 5/4. Relative to the earth the time
of flight At is
At = k X 2 X 10~8 = 2.5 X 10~8 sec.
v, _ dv'
V = * V"~W’
» dt
Let us use these expressions to find the relation between Vx and Vx. We
get
dx' _ dx — v dt _ (dx/dt) — v Vx — v
V' = --
y X - 7./
=
-
dt' dt — vdx/c2 1 — vdx/c2 dt 1 — vVx/c2
y _ -Yjl- (14-20)
k(l-vVx/c2)’
V' =
v2
k( 1 - vVx/c2)
The corresponding equations for Vx, Vy, Vz in terms of V'x, V'v, V'z
follow in the usual way, i.e., by changing the sign of v and interchanging
primed and unprimed quantities. For example,
V'x + v
(14-21)
~ 1 + vV'Jc2 '
Suppose now that a bullet is fired from a train in the forward direction
(Fig. 14-4). The muzzle velocity of a bullet is measured relative to the
gun, and since the gun moves with the train, we may let Vx be the muzzle
velocity. Then Vx is the velocity of the bullet relative to the ground. We
shall show that Vx cannot exceed c so long as V’x and v are less than c.
Vx =
(V'x/c )+ (y/c)
c
(1 - V'x/c)(l - v/c)
c.
LI + (v/c)(V'x/c) J 1 + (v/c){V'x/c)
(14-22)
Since each factor in brackets on the right is less than unity, we must have
Vx < c.
14-10] RELATIVISTIC DYNAMICS AND THE VARIATION OF MASS 311
(0-l)(0.1)
Vx 1 0.9945c,
1 + 0.81 J °
whereas according to the Galilean transformations the answer would be Vx = 1.8c.
I. The net work done by the forces equals the gain in kinetic energy of the
system* and
II. If the net force is zero, the momentum must remain constant.
and we now inquire whether this may have the property of invariance
under a Lorentz transformation if we consider the mass m to vary with
the velocity The relation which we shall derive was first arrived at from
the study of the motion of charged particles in electromagnetic fields, a
subject called electrodynamics. We shall find it easier to derive the relation
*Work done by the earth’s gravitational force when a body is raised and work
done by frictional forces must be considered negative.
312 THE SPECIAL THEORY OF RELATIVITY [CHAP. 14
In S' In S
A B A B
— V'
V x —>
7 V'
vX V' -4 V' 0^0 vx » vx
—
V y
r ' —> —V'
V y - V'y -> V'y Vay * Vay V by * V by
k ’ Vy ~ k(l - vVx/c*) ‘
lM= i
(14-25)
Vay C2
m0 m0
ma = mb (14-26)
\/l — V2/c2 Vl - \E2/c2
(mbVby\2 _ [ 1 - Vl/c2
W-Ll -(VI+V1J/C*
$
To show that this is unity, we must relate Vx to Vx v, using Eq. (14 21),
which gives us
17 -
~ 1 +V2/c2’
314 THE SPECIAL THEORY OF RELATIVITY [CHAP. 14
so that
2 2
VVs 2vJ C — V
= 1 -
c2 4- v2 c2 -1- v2
V by
“ (x " ^ S+
c2 + v2> '
V2
vX
,
+
V2
V by ~
4v'c4 + (c2 -
—
(C2
V*)2vi
+ P2)2
and
(rnbVby\2 _ 1 - Vl/c2
\ma Vayj C2 -V
+ yy2/
4»V + (cz -
1 -
c2(c2 -j- V2)2
WaVay — m^Vby-
Since this is the mass that satisfies Eq. (14-24), it is also called the mo-
mental mass. From Eq. (14-27), we see that as v —> c, m —> oo, which
14-11] einstein’s mass-energy relationship 315
again tells us that no material object can attain a speed equal to or greater
than the speed of light.
Since objects in everyday life do not have speeds at all close to that of
light, their variation of mass with velocity is not measurable. However,
in the case of such small particles as the electron, proton, deuteron, etc.,
it has been possible to verify Eq. (14-27) experimentally. A beta particle
from a radioactive source, or an electron accelerated to a high energy in a
betatron, may have a speed sufficient for its apparent mass to be several
times its rest mass. For the heavier proton to show a mass increase of the
same proportion as that for an electron, the energy of the proton must
be 1840 times as great; nevertheless our powerful proton accelerators,
such as the synchrotron and cosmotron, reach energies well past the
threshold beyond which mass variation must be taken into account.
d , \ dv dm , dm . nn\
We see that the equation F = ma does not hold at high speeds if we take
m _ mo> nor is it valid in general if we take m to be the apparent mass.
In fact, if v and a are in the same direction, we find (see problem 6 at end
of this chapter) that
m0 (14-29)
F = a,
(1 - v2/c2)3'2
d ds
K.E. = \ F ds = I % (mv) ds - I o dt
“ (mv) dt dt
v=0
dt
= / v f (mv) dt = I vd(mv).
.1 n dt
316 THE SPECIAL THEORY OF RELATIVITY [CHAP. 14
m0v
K.E.
V1 — v2/c2
1
C2(l _ V2/C2)3I2
v dv
-V1 — v2/c2
rv
/ V dv 2 1
= m0
Jo (1 — P2/c2)3/2 “ m°C (1 - P2/C2)1/2
m0c (14-30)
(a/ 1 Vc5
This is the relativistic expression for kinetic energy. If the first term is
expanded by the binomial theorem, we find that
K.E. = tuqc'
2 , 3 ^
c2t8c4 '"O'
Thus the kinetic energy of a moving body equals its gain in mass times c2.
We may also say that the apparent mass of a body increases linearly with
its kinetic energy, so that an increase in mass is an indication and measure
of the gain in kinetic energy. It is also found* that an increase in the
potential energy of a system of particles is accompanied by a similar in¬
crease in mass equal to the gain in energy divided by c2. Therefore we may
say, in general, that the gain (or loss) in the energy of a system is equal to the
gain (or loss) in its apparent mass multiplied by c2.
We may go one step further and interpret the term m0c2 in Eq. (14-31)
as the rest energy of a body whose rest mass is m0. This rest energy may
= total energy.
If here we let E stand for the total energy, we arrive at Einstein’s famous
principle of the equivalence of mass and energy.
E = me2. (14-32)
Example. An electron and positron which are practically at rest come to¬
gether and annihilate each other, producing two photons of equal energy. Find
the energy and equivalent mass of each photon.
Solution. The rest mass m0 of an electron is 9.1 X 10"31 kg. This is equivalent
(in the mks system) to the energy
318 THE SPECIAL THEORY OF RELATIVITY [CHAP. 14
The positron has the same rest mass and energy. Since the combined energy is
shared equally by two photons, each photon will have an energy of 512,000 ev.
While the rest mass of a photon has no meaning, since a photon cannot be
thought of as existing in a state of rest, we may say that during its existence each
of the above photons will have an “associated mass” equal to 9.1 X 10-31 kg.
Problems
1. Show that all four of Maxwell’s length, what should the fringe or phase
equations (14-6) are invariant under shift in the interference pattern be,
the Lorentz transformations. according to the ether theory, when the
2. In the system S take E to be in apparatus is rotated 90° ?
the F-direction and B to be in the Z- 5. A certain young lady decides on
direction. For the two cases when (a) her twenty-fifth birthday that it is
E > cB and (b) cB > E, find the time to slenderize. She weighs 200 lb.
velocity v < c of the system S' in She has heard that if she moves fast
which the field will appear to be either enough she will appear thinner to her
entirely electric or entirely magnetic. stationary friends, (a) How fast must
For each case state whether it is E' or she move to appear slenderized by a
B' that is zero. factor of 50% ? (b) At the speed found,
3. Referring to the two swimmers in what will her mass appear to be (to her
Section 14-4, take the current v — stationary friends)? (c) If she main¬
3 ft/sec and the speed c of each swim¬ tains her speed until the day she calls
mer in still water to be 5 ft/sec. Find her twenty-ninth birthday, how old
(a) the times t\ and <2 when AC = will her stationary friends claim she is
BC = 100 ft, (b) the distance AC for according to their measurements?
which, with BC = 100 ft, the swim¬ 6. Calling the ratio F/a the inertial
mers would have the same times. mass mi, show that, according to rela¬
4. In the Michelson-Morley experi¬ tivistic dynamics, (a) mi = too/(1 —
ment take l — 11 m and v/c = 10~4 v2/c2')3/2 when F is in the direction of
(the ratio of the earth’s orbital velocity the velocity v, and (b) to; = mo/(l —
to the speed of light), (a) Neglecting v2/c2)1/2 whenF is normal to v. (These
terms smaller than v2/c2, show that values of mi are called the longitudinal
the time difference <2 — £1 = lv2/(?. and transverse inertial masses, respec¬
(b) For light of 5 X 10-7 m wave¬ tively.)
PROBLEMS 319
7. (a) Define the momentum p of a weighs close to 1 kg. If the sphere cools
body, according to relativistic dy¬ 1200°C, what is the equivalent loss in
namics. (b) Show that p2 = m2c2 — mass?
TOqC2. 13. An atomic mass unit (amu) is
8. For what energy in electron volts defined as 1/(6.02 X 1026) kg. (a)
will the ratio m/mo for an electron be Find the energy equivalence of 1 amu.
(a) 1.1, (b) 2, (c) 10, (d) 100? (b) If a deuteron (H2) and a triton
9. Repeat problem 8 for protons in¬ (H3) combine in a fusion process to
stead of electrons. form an alpha particle (He4) and a
10. For what energy in electron volts neutron (n), find the percent loss in
will the ratio v/c for an electron be (a) mass if the respective masses are
0.10, (b) 0.50, (c) 0.90, (d) 0.99? 2.0147, 3.0170, 4.0039, and 1.0090 amu.
11. Assume an electron to be a (c) Compute in kilowatt-hours the
charged spherical shell of radius a and energy released per kilogram in (b).
charge e. (a) Find the work W to put 14. Suppose that a plane electro¬
a charge e on a sphere of radius a. (b) magnetic wave is incident normally on
Equate this work to the rest energy a plane absorbing wall. Use Eq.
tooc2 and get an expression for a in (14-32) to show that the radiation will
terms of e, mo, c, and eo. (c) Substitute exert a pressure equal to the Poynting
numerical values and find a for an elec¬ flux divided by the speed of light. Com¬
tron. (d) Similarly, find a for a proton. pute this pressure for sunlight (for
12. A red-hot sphere of iron (speci¬ which the Poynting flux = 1400
fic heat capacity = 0.11 cal/gm-°C) watts/m2).
CHAPTER 15
WAVE MECHANICS
15-1 Introduction. The laws of classical physics apply very well to the
behavior of objects that we can see with our eyes. However, in modern
atomic physics we talk about and experiment with particles too small to
be seen even under the highest-powered microscope. Physicists have be¬
come so familiar with the properties of, say, the electron, that the existence
of this particle seems just as real as that of a tennis ball. It is often con¬
venient to form a mental picture, or model, of something that we cannot
see. Such a model helps us to assimilate the known facts about the object
in question. Rutherford’s planetary model of the atom is a good example.
A model may also suggest and lead to the discovery of new properties
possessed by the object. Important discoveries have been made in this
way. However, we must guard against taking a model too literally, since
a model may also suggest properties that the object in question does not
actually possess. For example, physicists talked so much about the
planetary model of the atom that some began to think that the electrons
in an atom really were tiny spheres circulating in definite orbits; it seemed
to them that with a more powerful microscope one might actually be able
to see an electron occupying a definite position at a given time, just as in
the case of the familiar tennis ball. We shall see that this conclusion is
misleading, and that we are on safe ground only when we speak of the
observable properties of the electron.
In describing most of the properties of atomic particles a new universal
constant, not needed in classical physics, must be introduced. This
constant is h, Planck’s constant, and its introduction forms the basis for
the quantum principle. We saw in Chapter 6 how this constant was
originally introduced by Planck to explain cavity radiation. We also
saw how this postulate led, in the case of a linear oscillator, to discrete
energy levels and to a recognition of the futility of specifying the state
of the oscillator, in px — x momentum space, closer than to within an
area equal to h. This latter result, it was pointed out, is a special case of
Heisenberg’s uncertainty principle.
A law of physics can usually be expressed in many different, but equiv¬
alent, ways. Heisenberg’s principle is the most concise statement of the
quantum principle, so that in Planck’s work we have the essence of
the quantum theory. Unfortunately, the uncertainty principle is not
the easiest starting point in quantum theory, and Planck did not work out
the mechanics for applying this principle to atomic systems in general.
This was eventually done by Heisenberg about twenty-five years later.
320
15-1] INTRODUCTION 321
hv — Ei — Ef. (15—1)
By 1924 there still remained the problem of finding a logical theory that
would account for the discrete energy levels of an atom and that would
lead to the numerical calculation of these energies. The postulate of the
Bohr-Sommerfeld theory, fpdq= nh, was arbitrary, and it was found to
apply only to one-electron atoms. The solution of the problem developed
independently along two different lines, one being the matrix mechanics
of Heisenberg and the other the wave mechanics of Schroedinger, Born,
and others. These two approaches to the problem were found to be
equivalent, and since wave mechanics is easier to understand, we shall
use this approach here.
Wave mechanics is a theory. Like kinetic theory, it is based on funda¬
mental laws plus certain additional postulates. The validity of these
postulates must be determined by how successful the theory is. A good
theory should lead to (1) the correct functional relationship between
quantities involved, (2) the correct numerical values for any constants in
the formulas, and (3) the prediction of new experimental discoveries.
Wave mechanics has accomplished all of these things.
In presenting a theory, a choice must be made between a historical ap¬
proach and the most logical approach. The latter was chosen in mechanics,
thermodynamics, and electromagnetism. Here we shall follow the historical
development of our theory, as we did in the last chapter, because it is
instructive to see how the new theory grew out of and is related to con¬
cepts and theories presented earlier, and this approach may make the
theory seem more reasonable than if it were presented in the abstract.
We shall also see that just as relativistic mechanics includes Newtonian
mechanics as a special case, so ivave mechanics reduces to ordinary me-
322 WAVE MECHANICS [CHAP. 15
15-2 The dual nature of light. In discussing light, we may say that the
three basic processes are emission, 'propagation, and absorption. In emis¬
sion and absorption we are concerned with how light energy is produced
at the expense of some other form of energy, and vice versa. Actually, we
do not detect the presence of light until it falls on some absorbing object
such as the human eye, a strip of film, or a photocell, in which cases the
light energy is transformed into an electric nerve pulse, chemical energy,
and the energy of photoelectrons, respectively.
Photoelectric and other measurements indicate that light energy is
emitted and absorbed in quantum units, rather than continuously, but
note that the emission and absorption processes have nothing to do with
determining what path the light will follow as it travels from its source
to its sink. However, suppose that we use a system of slits to limit the
path of the light and that we observe along which paths the greatest
amount of energy will travel. Then we can explain or predict our results
correctly by means of the wave theory. In wave theory, we must intro¬
duce the concept of wavelength X and velocity of propagation c, both of
which may be determined experimentally. The frequency v associated
with light of wavelength X is found from the wave relation v = c/X.
We see, then, that the photon theory is used to explain emission and
absorption, and the wave theory to explain the propagation of light.
These theories complement each other, since they apply to different phe¬
nomena and describe different aspects of light. Using both the corpuscular
and the wave descriptions tells us more about the nature of light than we
could learn if we adopted only one viewpoint.
It is tempting to think of photons as maintaining their identity along
the path between where we observe them to be emitted and where we see
them absorbed, but such a postulate has no experimental basis. When
we use the wave theory to calculate the intensity of the light falling on a
certain area of a screen, we are computing how many photons will be
received there in a given time; this in turn may be interpreted as finding
what fraction of a large number of emitted photons will reach the given
area, or the probability that one photon will do so. While we may also
think of the intensity as being proportional to the square of the magnitude
of the electromagnetic field vectors, these quantities are not directly
15-2] THE DUAL NATURE OF LIGHT 323
i
measurable for light in the range from the near infrared on up to gamma
rays because the frequencies are so high. Conversely, in radiowaves the
energy per photon is too small to measure, and so the photon theory
cannot be employed. We do not use both theories at the same time.
The important difference between classical wave theory and the photon
viewpoint is that the former is concerned with continuously varying
quantities, while the latter deals with discontinuous processes. For ex¬
ample, according to classical physics, the intensity of an electric field E
can be decreased gradually from some finite value to zero. A photon,
on the other hand, cannot be subdivided; “less than one photon” must
mean “no photon.” We shall see later that the indivisibility of the photon
is related to Heisenberg’s uncertainty principle. The completely causal
relationship between past and future events which is assumed in classical
theory is replaced in the quantum theory by a statistical relationship
giving the probability that a certain future event will occur.
It was realized that there must exist borderline phenomena for which
an equally good explanation can be found by either the wave or the photon
theory. It was also realized that a subject can never be completely under¬
stood if we look at only one aspect of it at a time. The bridging of the two
theories was accomplished by the correspondence principle, which was
first proposed by Bohr. This principle states that quantum expressions
must revert to the corresponding classical ones in the limiting case for
which the contribution of one photon to the total energy may be regarded
as infinitesimally small. In this case the average value predicted for a
quantity by the quantum theory becomes the stated value by the classical
theory. As an example, we may refer back to Eqs. (6-27) and (6-28)
where, if hv « kT, so that the summation may be replaced by an integra¬
tion, the classical expressions are obtained.
If a photon possesses energy, we may, if we wish, speak of the mass as¬
sociated with, or equivalent to, that energy. We shall call this “associated
mass” of the photon m. Note, however, that a photon does not have a rest
mass since there is no such thing as a photon at rest. A photon either
moves with the speed of light, or it has been absorbed and no longer exists.
Einstein postulated that the energy E of a photon is
E = hv, (15-2)
where v is the frequOncy of the light. The “associated mass must then be
Since a photon travels with the speed of light, it must possess a momentum
p given by
324 WAVE MECHANICS [chap. 15
hv h
p = me = — = — (15-4)
c A
This last result may be obtained in another way. As the speed of a par¬
ticle approaches that of light, its momentum and energy both become
very large, but the ratio of the momentum to the energy approaches the
finite value 1/c. Thus
EE. = hv - W. (15-5)
This equals the rest mass of an electron (ra0 = 9.1 X 10 31 kg) when
E is about 512,000 ev, so that for x-ray photons of about 100,000 ev m is
less than one-fifth of ra0. It is shown in mechanics that in an elastic impact
between a moving and a stationary particle very little K.E. is transferred,
15-2] THE DUAL NATURE OF LIGHT 325
even in a head-on collision, unless the two masses are of the same order of
magnitude. Accordingly, we might expect that the Compton effect would
be of little practical significance in the case of low-energy photons corre¬
sponding to the visible and ultraviolet regions; this is verified experi¬
mentally.
As in Fig. 15-1, consider a photon of energy hv to strike an electron of
low kinetic energy. Let the electron recoil at the angle 9 with the speed v
and let the photon be scattered at the angle <j> with the reduced energy
hv'. From the principle of conservation of energy, we have
1
hv = m0c2 + hv'. (15-6)
y/1 — v2/c2
The relativistic expression for kinetic energy must be used, because, for the
recoil electrons, v may be an appreciable fraction of c.
K.E. = me2 — m0c2
mnV . „ hv' .
0 = ---sin 9-sin <f>. (15-8)
VT — v2/c2 c
Between these last three equations we may eliminate 9 and v, solving for
/ in terms of v and </>. We now introduce the wavelengths X = c/v and
326 WAVE MECHANICS [CHAP. 15
X' = c/v', since these are what we measure, and find (see problem 3 at
end of chapter) that
The factor h/m0c, called the Compton wavelength, has a numerical value
of about 0.024A, where 1A = 10-10m. For monochromatic x-rays
scattered through 90°, we find that part of the beam undergoes just this
change in wavelength.
The kinetic energy of the recoil electrons is given by
K.E. = hv — hv'.
Such x-rays are obtained from a tube operating at over 100,000 volts. If
X' — X = 0.024A, X' will be 0.148A, or about 6/5 of X. Then hv' will be
about 5/6 of hv, or about 83,000 ev, so that the K.E. of the recoil electron
will be about 17,000 ev. The coincident production of a scattered photon
and recoil electron has been observed by means of Geiger counters.
15-3 The dual nature of atomic particles. Louis de Broglie first pro¬
posed in 1924 that a material particle such as an electron might have a dual
nature, just as light does. In the study of light the wave properties, in¬
volving X, v, and c, are the more familiar ones. In terms of these properties
we have defined the mechanical properties of the light corpuscles as follows:
E = hv, V = \- (15-10)
A
For a material particle it is the other way around; we are used to speaking
of mass m, momentum p, and energy E. These are measurable quantities.
Therefore when de Broglie postulated that such a particle may also have
wave properties, he defined the associated wavelength X and frequency v
in terms of m, p, and E. In doing so, de Broglie was guided by the relations
for light (Eqs. 15-10), and so he postulated that associated with a particle
are waves of wavelength X and frequency v, given by
h E
X = v (15-11)
V h '
15-3] THE DUAL NATURE OF ATOMIC PARTICLES 327
Since v < c for a material particle, c2/v > c. This need not concern us
because u is neither the velocity of a material particle nor, as will be shown
later, the group velocity of a set of waves. In fact, the waves associated
with a particle are not to be regarded as being mechanical or electro¬
magnetic, but rather “probability waves.” By this we mean that the in¬
tensity of the waves at any point will be taken as giving the fraction of a
large number of similar particles, emitted with the same initial velocity,
that will reach a given area in unit time, or the probability of one particle
reaching that area. The waves are thus a device for computing the prob¬
ability that a particle will behave in a certain way. De Broglie argued
that in the case of light one cannot think of a unit of energy without
associating with it a wavelength and frequency, and that therefore material
particles, like protons, must be accompanied by phase waves.
The behavior of a material particle is usually computed from the laws
of mechanics. Is there, then, any need for a new method? If so, what are
its advantages? De Broglie suggested that wave mechanics might stand
in relation to particle mechanics as wave optics does to geometrical optics.
In geometrical optics light is assumed to travel through a homogeneous
medium in straight lines or rays. To explain the details of the diffraction
pattern caused by an obstacle, one must resort to wave optics and be con¬
cerned with distances of the order of magnitude of one wavelength. WTen
we compute the wavelength X = h/mv associated with an ordinary object,
it turns out to be insignificantly small, but for an electron the mass m
is very much less than for ordinary objects and h/mv is found (see example
below) to be comparable to x-ray wavelengths. Since x-rays are diffracted
by crystals, why should not such electrons be, too? This question was
answered by Davisson and Germer who, while studying the scattering of
electrons from metals, found that electrons are indeed diffracted by crystals
just as x-rays are, and in accordance with the relation X h/mv.
De Broglie also realized that in dealing with atomic systems, where the
dimensions are also comparable to x-ray wavelengths, wave mechanics
might be able to explain things that ordinary mechanics could not. For
example, according to the Bohr theory of the hydrogen atom, only those
electron orbits are allowed for which the angular momentum pe is an in¬
tegral multiple of H/2t, or
(15-13)
328 WAVE MECHANICS [CHAP. 15
This whole number rule was empirical when it was first proposed, the argu¬
ment in its favor being that it worked. De Broglie suggested, by way of
explanation, that if waves accompany an electron around its orbit, then
a sort of resonance might occur when the circumference of the orbit equals
an integral multiple of the electron’s wavelength. This idea, which was
later refined by Schroedinger, amounted to saying that the allowed
quantum orbits correspond to the various possible modes of vibration, or
standing waves, in a string. For such a string, waves are reflected back
and forth with constructive interference only when the wavelength X is
such that the complete distance down the string and back is an integral
number n of wavelengths. Here the restriction that n must be a whole
number has a physical basis, hence it was appealing to carry the idea over
to the case of the atomic orbit and its electron waves.
Consider a circular orbit of radius r. Taking X = h/p, where p is linear
momentum, de Broglie put the orbital circumference 2-kr equal to nX, or
2-7rr = n\ = n— >
V
h
Pe = rp = n
mv = V2 meV,
X _ h _ h
mv \/~2meV
sf 2Tdt=0, (15-14)
Jc
where T is the kinetic energy, t is the time, and C and D refer to the
terminals of the path.
In optics a beam of light, say from a flashlight, must also be properly
aimed to strike a given target. According to Fermat’s principle (see Sec¬
tion 13-4), if the optical path is compared with all neighboring paths that
might be traversed, at the speed appropriate to each medium, from source
to target, it will be found that the actual path is the one traversed in the
least time. This principle may be written as
rD
8 - = 0, (15-15)
Jc u
E — T + V,
where E is the total energy of the particle and V its potential energy at a
given point. As we have said, E is assumed to be constant, or the system
to be conservative, but V may be a function of the coordinates. Neglecting
relativity enables us to take
9 ds
8
[D mv ds = 0. (15-16)
Jc
To eliminate v, we write
-D
8 V/2m(E — V) ds — 0. (15-17)
Jc
The principle of least action in the form of Eq. (15-17), and the principle
of least time, Eq. (15-15), become identical if we postulate that
V2m{E - V) = -,
u
(15-18)
(15-19)
only here we must remember that E stands for T + V and not for me2.
15-4] analogy: ordinary mechanics and geometrical optics 331
The wavelength of the waves follows from X = ufv, so that only the
constant C remains to be determined.
Condition (15-18) ensures the right path for our particle, but it does not
ensure that the particle will traverse the path with the correct speed v
at every point. Fortunately, C may be chosen in such a way as to make
this so.
From Eqs. (15-18) and (15-19) we see that u varies with the frequency.
This is dispersion. In a dispersive medium the group velocity G is not
the same as the average phase velocity u of a set of waves of varying fre¬
quency. We have seen that group velocity is identified with signal velocity,
which suggests that we should try to make the group velocity of our
particle waves the same as the actual velocity v of the particle. This im¬
plies that we must take our waves to have a range of frequency Av about
the frequency v = E/h. From Eq. (13-32), we have
where we now write u for the phase velocity. Putting v — E/h and using
Eq. (15-18), we get
I _ A (E\ = A.
\EV2m(E - V) (15-20)
G ~ dE\u) ~ dE L C
1 m_rn_
v ~ mv y/2m(E — V)
♦ __= (15-22)
V2m{E - V) mv P
u _ E h h _ h_
~ ^ 7~T
p E p mv
V • VE
E = E*0, y, z)e±2*ivt.
V • VE* + E* = 0.
This equation and the boundary conditions determine the amplitude factor
E* at any point.
When we consider the application of this procedure to the de Broglie
waves, we must note that the latter are not physical waves in the sense
that elastic and electromagnetic waves are, but that they are waves which
we invent and study mathematically. Nevertheless, it seems like a good
“guess” to assume that they satisfy equations similar to those for E above.
However, another problem arises in connection with the fact that the total
energy E of our particle will be constant only for steady-state problems.
This means that when de Broglie waves are propagated, the frequency v,
which according to Eq. (15-19) equals E/h, will not necessarily be con¬
stant. Since the relationship between the amplitude equation and the
wave equation depends on the time factor e±2*ivt, it will vary with the
15-5] schroedinger’s equation 333
total energy. This suggests that we start with just the amplitude part
of our wave function and with the amplitude equation.
Let y, z) be the amplitude function for our de Broglie waves. We
postulate that it must satisfy the differential equation
4 2 2
V • V* + * = 0. (15-23)
For these waves we must put v = E/h and give u the value chosen in Eq.
(15-22). Equation (15-23) then becomes
V • V* + (E - V)ip = 0, (15-24)
where SI' is the complete wave function. (The minus sign in the exponent
is preferred only for relativistic reasons.) Since v = E/h,
d* . 2iriE
— 2 TlvV —-T—
dt h
or
h 34' _ ih d’L
E'ir
2wi dt 2ir dt
so that we have
4:irm d'f' 8ir2mV^ _ ^ (15-26)
V • V’F
~Hh ~dt h2 _ :
d2\p r2m
87
(E — \kx2)\J/ = 0. (15-27)
dx2 h2
We first simplify this equation by putting
87v2mE 4:ir2mk 7
h2 = a, ~h^ = b-
15-7] SIMPLE HARMONIC OSCILLATORS 335
which yields
(15-28)
The solution of Eq. (15-28) is not simple, but fortunately it had already
been worked out by mathematicians before Schroedinger’s equation was
known. We shall sketch the solution, emphasizing only those points
which help illuminate Schroedinger’s theory.
As a solution of Eq. (15-28) we try
* = e-q2'2 £ (15-29)
s=0
and equating to zero the coefficients of q°, g\ q2, and qn, we find that
2 — p P a2, •
a2 = — ^ «o> a3 =
3-2
ai, a4
4-3
2 n — ft (15-30)
dn-
a„+2 — (n _)_ 2)(n + 1)
2n 2
Lim n
n—>°° n2
336 WAVE MECHANICS [CHAP. 15
while if we expand
n+2
i L.
e9 = 1 + q*2 +
2! (n/2)! 1 [in + 2)/2]!
+
— 1 b2q~ T~ q4 • + bnqn + &n+2 Qn h2 +
then
which —4oo. This makes our solution appear physically impossible until we
consider the possibility that the infinite series might break off after, say,
the nth term. For this to happen, we must have an+2 — 0, then am+4,
an+6; etc., will all vanish as well, and our series will become a polynomial
Hn>2 known as a Hermite 'polynomial. These polynomials, multiplied by
e~q 12, approach zero as q —» oo. They also give solutions that are single¬
valued and continuous.
Evidently, the important condition is that
2n - )8 = 0, (15-31)
2t E = (n + n = 0, 1, 2, 3, • • • .
hence we get
E = (n + £)hf, to = 0, 1, 2, 3, • • • . (15-32)
have a wavelike character, the number of maxima plus minima being equal
to n. Note the resemblance, in this respect, to the modes of vibration of a
string. If we plot l^l2 as a function of x, it is found that when n is large
\in\2 is a maximum in the neighborhood of ±A, where A is the amplitude
of the oscillator in classical mechanics. An oscillator such as a pendulum is
observed to spend more time near the ends of its range of swing than
elsewhere, for then its velocity is less, so that, for large values of n, Born’s
probability interpretation of \\J/\2 seems reasonable. However, for small
values of n the situation is different. For example,
\io\2 = Cle~q,
which has the form of the error function, with a maximum at q = x = 0.
This does not mean that Born’s postulate is incorrect, for when n = 0 the
energy of a visible oscillator, like a spring or a pendulum, would be so
small that its amplitude would be too minute to observe. For an atomic
oscillator, such as a molecule, the important fact is that its spectral fre¬
quencies, and hence energy levels, can be observed, and that wave mechan¬
ics predicts these correctly.
1 M
sin2 6 d(j>2)
872m
+ (15-34)
i = R(r)(d(0)$(i). (15-35)
1 d2®
4> d<j)2 '
We may then put this equal to a constant of separation, say —a2, and
obtain
t —ia<f>
a = 0, 1, 2, 3, • • • .
n = 1, 2, 3, • • • . (15-36)
8e2h2n2
If we combine this equation with Eq. (15-1), the correct spectral fre¬
quencies for hydrogen are obtained.
Since Eq. (15-36) was also derived by Bohr, the real success of wave
mechanics does not become apparent until we investigate a more com¬
plicated problem, such as that of the normal helium atom. Here the
Bohr-Sommerfeld theory failed, but if the methods of wave mechanics
are extended by the application of perturbation theory, f the correct energy
levels are obtained for helium.
*1 = o, (15-37)
ax2 n2
- (E - V,M'2 = 0, (15-38)
= Aieikx + Bie~ikx,
. ikx
$3 = e ,
where
In writing the expression for ^3, we assume that no wave travels back
from infinity in region 3. The amplitude of *3 is made unity because we
are interested only in relative amplitudes.
340 WAVE MECHANICS [CHAP. 15
The A’s and B’s are determined by conditions at the two boundaries
x = 0 and x = L. One condition is that \p must not change discontinu-
ously at a boundary, or
’/'i = ^2 at x = 0, — ^3 at x — L.
Ai + Bi — A2 + B 2,
Bl + + D®2
eikL = + B2<r<ZJ1,
or
A2=K1+1) •**■“
(15-42)
B2 = | (i _ *) e«*+M
The term .die1**, when multiplied by the time factor e~2irlvt, represents
waves in region 1 moving in the plus X-direction, or toward the barrier.
We then want to compare Ai (the amplitude of these waves) with unity
(the amplitude of the waves in region 3);
;)(1 + *)e.— +
Al~4 1 + i-
15-9] PENETRATION OF A POTENTIAL BARRIER 341
l = ~ V2m(V0 - E) = ik',
When 2k'L is large, cosh 2k'L behaves like e2k’L. The important factor
then is the product of the width of the barrier and VFo ~~ E.
In a radioactive nucleus a particle has a finite chance of escaping in a
given time, even though the total energy E of the particle may be con¬
siderably less than the value of V0 at the top of the nuclear barrier (Fig.
15-3). As a result, we observe that in a sample of radioactive material a
certain fraction of the nuclei decay in any given time. This fraction is
called the decay constant. While the potential barrier for a nucleus is not
the simple rectangular one that we assumed above, the essential features
are the same. When the decay constants for different radioactive sub¬
stances are measured and compared with the corresponding energies of the
emitted particle, we find a relationship qualitatively similar to the one
above between l/|Ai|2 and y/V0 — E, namely (when 2k'L » 1),
_ -oVFo—e
oc e >
Coordinate Qm
II
O
h d
Momentum Pm
V° ~ 2n dq
Thus, for the observable x the operator is x itself, while for linear mo¬
mentum px it is (h/2-iri) (d/dx).
Postulate II states that the only possible values which measurement of
an observable may yield are those for which the equation
leads to a solution for \p subject to the same restrictions (\f/ must be single¬
valued, etc.) as those listed for Schroedinger’s equation.
(a) Position. Equation (15-44) for the coordinate x is
xi = XrJ, (15-45)
or
(x — xm)\p = 0. (15-46)
15-10] THE OPERATOR APPROACH 343
2= *-*• ^
2s ah(1M8)
We then integrate as in (b) to get
* =
We now find that pgm must be restricted to a discrete set of values. Adding
2t to 6, which means a complete revolution in space, must bring ^ back
to its original value if \f/ is to be single-valued at a point. Thus we must
limit pem to
n = 0, 1, 2, 3 • • • ,
= Cmeine.
*Dirac’s delta function 8(q) is defined by 8(q) = 8(—q), 8(q) - 0 except for
q = 0, 8{q)dq = 1.
344 WAVE MECHANICS [CHAP. 15
H =
V
G
8ir2m \dx2
—
3
+—) + F
dy2 " dz2/ '
d2j
- = E\p,
8ir2m \dx2 dy2 ^ 522
or
8ir 2m
v• 4- (E - F)* = 0,
- = ///Pl^l2 dx dy dz
(15-50)
fSjWdxdydz
Xm — dx.
E = §j$Em\\pm\2 dx dy dz
= Emfff\ipm\2 dx dy dz
== Em,
since ij/m is assumed normalized. The result seems obvious, but it lends
support to postulate III and points up the difference between an observable
whose discrete values may be found experimentally and one which must
(for atomic systems) be treated statistically.
For further development of the general theory of quantum mechanics
the reader is referred to one of the many specialized texts available. The
above treatment will show the broad abstract nature of the theory, which
has successfully handled such matters as the intensity and polarization of
spectral lines, electron spin, etc.
x-rays, but then the trouble is that one photon of such radiation strikes our
electron with greater momentum and energy than does one photon of radia¬
tion of longer wavelength. We have seen in the Compton effect that when a
photon is scattered by an electron, the latter recoils in some forward direc¬
tion. The energy imparted to the electron increases with the energy of the
incident photon. When we use radiation of shorter wavelength we improve
our resolving power and thus may locate an object more accurately in
position, but at the same time we increase the disturbance of the particle
and make its subsequent momentum more uncertain. Let Aq be the smallest
resolvable separation for the wavelength used, and let Ap be the corre¬
sponding uncertainty in the momentum of the scattered quantum, then the
product Aq Ap is of the order of magnitude of Planck’s constant h, or
Aq Ap ~ h. (15-51)
Aq
= n, > n + 1,
T
Aq (> ~ — X. (15-52)
|Ap| = p AX.
We then have
|A$| |Ap| ~ h,
Problems
1. Bitter, F., Currents, Fields and Particles. New York: Wiley, 1956.
2. Di Francia, Toraldo, Electromagnetic Waves. New York: Interscience,
1955.
350
REFERENCES 351
Optics
Quantum Theory
1951. • AT V 1
2. Lindsay, R. B. and Margenau, H., Foundations of Physics. New York:
Wiley, 1936. .
3. Pauling, L. and Wilson, E. B., Introduction to Quantum Mechanics. New
York: McGraw-Hill, 1935.
4. Richtmyer, F. K, Kennard, E. H. and Lauritsen, T., Introduction to
Modern Physics. 5th ed. New York: McGraw-Hill, 1955.
5. Schiff, L. I., Quantum Mechanics. 2nd ed. New York: McGraw-Hill, 1956.
ANSWERS TO PROBLEMS
CHAPTER 1
1. 250°, 350°
2. (a) 491.8°, 671.8°, (b) —459.8°F
3. 115.5°, 215.5°
4. (a) 1.183, 1.40, (b) 161.7°, 46.2°
5. 101°
6. Extrapolate p vs. T to zero pressure.
7. (a) p(Vb - Va), (b) RT In (Vb/Va)
8. (a) 1.013 X 105 n/m2, (b) 1.69 X 105 joules
CHAPTER 2
9. (a) 37.5% vs. 50%, (b) 750 joules, 500 joules, (d) —5 joules/deg,
2.5 joules/deg, 2.5 joules/deg, 0
10. (a) 0, (b) 3.33 joules/deg, (c) 3.33 joules/deg
11. (a) 0, (b) 0, (c) 0; 1000 joules vs. 0
12. —24,800 joules/kg-mole-deg
13. (a) —R In 2, (b) R In 2, (c) 0
14. (a) 0, (b) R In 2, (c) R In 2
352
ANSWERS TO PROBLEMS 353
CHAPTER 3
19. 3, 3, 2
20. 3, 2, 1
CHAPTER 4
CHAPTER 5
1. (a) 1, 10, 45, 120, 210, 252, 210, 120, 45, 10, 1, (b) 34.4%
2. (a) 1, (b) 5, (c) 10, (d) 20, (e) 30
3. (a) 51.4%, (b) 18.6%, (c) 7.5%
4. —1, —1, 1260
5. (a) 221, (b) 0
6. 0.42
7. 0.818
8. 0.922
9. 377 m/sec, 425 m/sec, 461 m/sec
10. (a) 2.36 X 1023, (b) 1.21 X 1024, (c) 1.09 X 1024, (d) 3.94 X 1023
11. 2
13. kT/2
14. (a) 366 m/sec, 259 m/sec, (b) 0.52 cm, 0.73 cm
15. 3R, 4/2, 1.33
16. 772/2, 9/2/2, 1.29
CHAPTER 6
1. (a) 7.4 X 106 m/sec, (b) 3.3 m/sec, (c) 2.2 X 10~9 m/sec
2. (a) 15, (b) 20, (c) 18, (d) 12, (e) 5
3. (a) 0, (b) 2, (c) 3, (d) 0, (e) 0
6. 1.8°K
10. (8?rhc d\/\5)/(eh^kT - 1)
12. 6.71 X 10~^4 joule-sec, 1.40 X 10~23 joule/deg
13. 2.4 X 103
14. (a) 0.98, (b) 0.50, (c) 0.02
CHAPTER 7
3. 4 X 1033 *
4. 8.85 X 10-12 coul2/n-m2, 12.56 X 10-7 n/amp2
5. ri is dimensionless.
6. 9 X 1011 n-m2/franklin2, 10-5 n-sec2/franklin2, 9 X 1016 m2/sec2
7. (a) 10 amps (b) 9 X 1020 dyne-cm2/abcoul2
(c) 3 X 1010 cm/sec (d) 10-4 w/m2
8. (a) 3 X 109 statcoul
(b) (1/9 X 1020) dyne-sec2/statcoul2
(c) 3 X 1010 cm/sec
9. (a) 3.14 X 10~9 volt, (b) 3.14 X 10-7 volt
10. 3.5 X 10-15 w/m
CHAPTER 8
2. qx/4ireo(x2 -f- a2)3/2
3. cr/eo
4. (a) 0, (b) pr/3e0
5. — q', q + q’
8. 1.3 X 10~15 kg
9. (a) — axy, (b) 4 X 10-6 joule, (c) 0
10. —a/y, ax/y2, —2eoax/y3
11. qd cos 6/^ireor2
12. (a) (gd2/167reor3)[(3x2/r2) — 1]
(b) 3qd2xy/47reor5
13. V = (V0/L)x + k(L2x - x3)/6e0
14. V = 2n2mCnm sin (nirx/Li) sin sin (hrz/Ls), (n/Li)2 +
(m/L2)2 + (l/L3)2 = 0
15. (a) 4:ireoVoab/(b — a), —47reoEoab/(b — a)
(b) —47T€oFoab/(b — a), 47reoEo52/(^ ~ a)
16. — q
17. (a) —q at 90°, 210°, and 330°; q at 150° and 270°; (b) 0.013 q2/t0 d2
18. Behind plane: —47reoaEo[l + (o/d) + (a2/d2)]
On sphere: 47reoaFo[l + (a/d) + («2/d2)]
19. (a) 47reo«Eo, ■—^rreoaVob/d
(b) 4-]reoaVo(l + ab/d2), —i-ireoaVob/d
20. Infinite charged plane
21. Semi-infinite charged plane
22. (a) q2/8ire0a, (b) q2/8Tre0a
356 ANSWERS TO PROBLEMS
CHAPTER 9
(d) 0
(e) 0.192 X 10-7 n in — F- and -f-X-directions
2. (a) 10~° w/m2 in Z-direction
(c) 3.53 X 10~7 w/m2 in Z-direction
(d) 0
(e) 2.7 X 10~7 w/m2 at 45° with —Y- and +Z-directions
3. (a) 2 X 10-5 w/m2 in Z-direction
(c) 2 X 10~5 w/m2 in Z-direction
(d) 0
(e) 1.41 X 10-5 w/m2 at 45° with —F- and -(-^-directions
5. (a) 7.8 X 10~n m/sec, (b) 1.6 X 10-6 m
6. (a) 0
(b) 1.6 X 10 _14 n in Z-direction
(c) 1.6 X 10-14 n in — F-direction
(d) 1.13 X 10-14 n in Z-direction
(e) 1.31 X 10_14 n in FZ-plane
7. y0!R2/2{x2 + R2)3/2
CHAPTER 10
CHAPTER 11
CHAPTER 12
4. -q(Ke - 1)/Ke
5. K2 tan 9i = K1 tan 02
ANSWERS TO PROBLEMS
358
CHAPTER 13
CHAPTER 14
9. (a) 9.42 X 107 ev, (b) 9.42 X 108 ev, (c) 8.48 X 109 ev, (d) 9.33 X 1010 ev
10. (a) 2.57 X 103 ev, (b) 7.86 X 104 ev, (c) 6.60 X 105 ev, (d) 3.62 X 106 ev
11. (a) e2/87re0a, (b) a = e2/87re0m0c2, (c) 1.41 X 10~15 m, (d) 7.66 X 10~19 m
CHAPTER 15
4. S _
5. (a) X = (h/V2m0eV)( 1 — 56 + )
(b) X = (hc/eV)[ 1 - (1/6) + f(l/62)-]
>
• >
' -7
.
'
•fV
Jt
-■" t