Fred - Light Matter Interaction
Fred - Light Matter Interaction
John Weiner
IFSC-CePOF-INOF, Universidade de São Paulo, São Paulo, São Carlos, Brazil
Frederico Nunes
Universidade Federal de Pernambuco, Recife and IFSC-CePOF-INOF, Universidade de São
Paulo, São Paulo, São Carlos, Brazil
1
To Annick and Marize for patience and encouragement.
Preface
we show how a “real” antenna can be built up from an array of oscillating charges
and how an array of macro-antennas can be used to concentrate the spatial direction
of emission or reception.
After these three Complements to Chapter 2 we devote Chapter 3 to a fairly
extensive discussion of surface waves at the interface between dielectrics and metals,
because they play such an important role in “plasmonic” structures and devices. In fact
this propagation can be expressed in terms of circuit and waveguide theory, familiar to
electrical engineers; and Chapter 4, Transmission Lines, Waveguides, and Equivalent
Circuits, establishes the correspondence between classical electromagnetics and circuit
properties such as capacitance, inductance, and impedance. Furthermore we illustrate
how waveguide modal analysis and impedance matching can be used to guide the
design of nanoscale optical devices.
On the atomic scale (a few hundred picometers) and at interaction energies less
than or comparable to the chemical bond, light-matter interaction can be very well un-
derstood through a semiclassical approach in which the light field is treated classically
and the atom quantally. We therefore retain the classical electrodynamics treatment
in Chapter 5, Radiation in Classical and Quantal Systems, while presenting a very
simple quantum atomic structure with dipole transitions among atomic and molecu-
lar internal states. We take a very down-to-earth wave mechanical approach to the
quantum description in order to bring out the analogies between classical light waves,
quantum matter waves, classical dipole radiation and atomic radiative emission.
In addition to the Complements we have included a number of Appendices that
provide some supplementary discussion of the analytical tools used to develop the
physics and engineering of light-matter interaction. Appendix A is a brief discussion of
systems of units in electricity and magnetism. Although the Système International (SI)
has now been almost universally adopted, it is still worthwhile to understand how this
system is related to others; what quantities and units can be chosen for “convenience”
and what are the universal constraints that all systems must respect. Appendix B
is a brief review of vector calculus that readers have probably already seen, but who
might find a little refresher discussion useful. Appendix C discusses how the important
differential operations of vector calculus can be recast in different coordinate systems.
Although the Cartesian system is usually the most familiar, spherical and cylindrical
coordinates are practically indispensable for frequently encountered problems. Much
of the book deals with harmonically oscillating fields, and Appendix D is succinct
review of the quite useful phasor representation of these fields. Finally Appendices E,
F, G present the properties of the special functions, Laguerre, Legendre, and Hermite
that are so commonly encountered in electrodynamics and quantum mechanics. These
Appendices are an integral part of the book, not just some “boiler plate” nailed on at
the end. Readers are strongly encouraged to pay as much attention to them as they
do to Chapters and Complements.
Most of the material in this book is not new. Excellent texts and treatises on
classical electrodynamics, physical optics, circuit theory, waveguide and transmission
line engineering, atomic physics and spectroscopy are readily available. The real aim of
this book is take the useful elements from these disciplines and to organize them into a
course of study applicable to light-matter interaction at the nanoscale. To the extent,
Preface ix
for example that waveguide mode analysis and sound design practice in microwave
propagation informs the nature of light transmission around and through fabricated
nanostructures, it is relevant to the purposes of this book. Rugged, reliable laser sources
in the optical and near-infrared regime together with modern fabrication technologies
at the nanoscale have opened a new area of light-matter interaction to be explored.
This exploration is far from complete, but the present book is intended to serve as
point of entry and useful account of some of the principal features of this new terrain.
Acknowledgements
This work has benefited from discussions, suggestions and remarks of many people over
the past several years. In particular Ben-Hur Borges, Nader Engheta, Henri Lezec, Ma-
sud Mansuripur, and Euclydes Marega, Jr., have contributed invaluable insight. JW
was also the grateful recipient of a visiting fellowship at the Center for Nanoscale Sci-
ence and Technology (CNST) of the National Institute of Standards and Technology
(NIST) from 2009-20011. The CNST provided a uniquely stimulating human and ma-
terial infrastructure for studying fundamental questions of light-matter interaction at
the nanoscale. During the year 2011-2012 JW has also benefited from the hospitality of
the Instituto de Fı́sica de São Carlos (IFSC) of the Universidade de São Paulo (USP),
and within the institute the Centro de Pesquisas em Óptica e Fotônica (CePOF). Fi-
nancial support for this year was provided by a generous Visiting Researcher grant
from the Fondação de Amparo à Pesquisa do Estado de São Paulo (FAPESP). JW
would also particularly like to thank Vanderlei Bagnato for the years of collaborative
work and fruitful interactions over the past quarter century, studying atomic optical
cooling in general and “cold collisions” in particular. Research in light-matter interac-
tion at the atomic level has contributed significantly to a deeper appreciation of these
phenomena at the nanoscale. Finally JW thanks Euclydes Marega, Jr. for his generous
and warm hospitality during a year in which he shared much of his time, his office
space, and know-how, allowing the timely completion of this project. FN acknowledges
financial support from FAPESP during 2011-2012 that permitted him to work closely
with Ben-Hur Borges and JW at the São Carlos campus of USP. Scientific discussions
with Professor Borges together with his able technical assistance on computer pro-
gramming contributed greatly to the work presented here. FN also acknowledges the
Universidade Federal de Pernambuco (UFPE) for a year sabbatical that made his stay
in São Carlos possible and the hospitality of CePOF and the IFSC.
Contents
The phrase “light-matter interaction” covers a vast realm of physical phenomena from
classical to quantum electrodynamics, from black holes and neutron stars to to meso-
scopic plasmonics and nanophotonics to subatomic quantum objects. The term “inter-
action” implies that light and matter are distinct entities that influence one another
through some intermediate agent. The history of scientific inquiry from the earliest
times to the present day might be neatly summarized into three questions: what is
the nature of light itself, of matter itself, and of the interaction agent. We now know
from Einstein’s celebrated equation E = mc2 that light (E) and matter (m) are fun-
damentally manifestations of the same “thing,” related by a universal proportionality
constant, the square of the speed of light in vacuum (c2 ). Nevertheless under ambient
physical conditions normally found on earth, the distinction between light and matter
makes sense; their interaction meaningful and worth studying.
physical contact. Together with Newton’s theory of gravitation these ideas elevated
action-at-a-distance to serious consideration. The next step was the great unification
of electric, magnetic, and optical phenomena effected by James Clerk Maxwell (1831-
1879). In 1865 Maxwell published A Dynamical Theory of the Electromagnetic Field in
which he set forth the proposition that light was in fact a transverse electromagnetic
wave. In modern form the four Maxwell equations and the Lorentz force law constitute
a unified classical theory of electricity, magnetism, and light.
While Young, Fresnel, and Maxwell were putting light, electricity, and magnetism
on a firm foundation, understanding of ponderable matter and chemical interaction was
also advancing rapidly. Building on Boyle’s experiments with gases, Joseph Priestly
(1733-1804) conducted extensive experiments on “airs” and identified nitric (NO) and
nitrous (N2 O) oxides and oxygen (O2 ), which he called “dephlogisticated air”. This
term refers to “phlogiston,” a substance that was thought to be contained within
matter and expelled during combustion. Although now discredited by modern un-
derstanding of oxidation, Priestly used the principle of phlogiston to rationalize his
observations. In France Joseph Louis Gay-Lussac (1778-1850) annunciated another
physical gas law, complementary to Boyle’s law. Gay-Lussac found that for a given
quantity of gas, the volume was directly proportional to the temperature at constant
pressure. Back in England John Dalton (1766-1844) determined that elements com-
bine in simple number ratios and began to determine atomic masses. Dalton also
determined that the pressure of a mixture of gases was equal to the sum of the pres-
sures of the individual constituents. But the major unifying advance in understanding
chemical reactivity was carried out by Antoine Lavoisier (1743-1794), considered the
father of modern chemistry. Lavoisier was the first to clearly annunciate that between
reactants and products of a chemical reaction mass was conserved. The recognition
of mass conservation as a fundamental principle distinguished light, now recognized
as an electromagnetic field wave, from ponderable matter. Finally Dimitri Mendeleev
(1834-1907) unified the growing list of elements into a rational order with the periodic
table in 1869. As the nineteenth century drew to a close, there were grounds for great
satisfaction. Although the inner structure of the atomic elements was not known, their
chemical identity and relationships to each other were firmly established. The periodic
table not only rationalized known elements but predicted those not yet discovered.
Oliver Lodge could write in 1888 “The whole subject of electrical radiation seems to
be working itself out splendidly.” There was only the somewhat perplexing null result
of the Michelson-Morley experiment in 1887 that failed to measure the velocity of
the “luminiferous aether” through which light was supposed to travel, and the fail-
ure of the equipartition of energy principle to account for the spectral distribution of
black-body radiation. But these were considered minor blemishes on a near perfect
masterwork of human understanding.
of the inertial reference frame in which it is measured. It follows that any experiment
attempting to determine the luminiferous aether wind with respect to the earth’s mo-
tion must yield a null result. Furthermore, in a subsequent paper published later in the
same year, Einstein showed that special relativity implied the equivalence of matter
and energy through the famous formula, m = E/c2 . The “new philosophy” originat-
ing with Descartes and culminating in the achievements of Maxwell and Mendeleev
appeared to establish a fundamental distinction between light and matter. Within
a few decades of their master works, however, special relativity announced a funda-
mental equivalence between them. Furthermore, in a subsequent paper of the same
year, Einstein proposed the quantization of radiation that reestablished some of the
old Newtonian corpuscular properties of light. Energy quanta in the form of E = hν,
where ν is the frequency of light and h is Planck’s constant, also removed the “ultravio-
let catastrophe” from the Rayleigh-Jeans formulation of black-body radiation. Finally
6 Historical Synopsis
2.1 Introduction
Before we can discuss the interaction of light and matter we have to establish a working
vocabulary for the vector fields of classical electrodynamics and the equations that
relate them. These equations, Maxwell’s equations, together with the Lorentz force
law serve as the postulates of that theory. A class of solutions to these equations
describes the propagation of plane electromagnetic waves, and we will explore this
propagation in vacuum, in dielectrics, and in good conductors. In complements to the
main chapter we will study polarization fields in bulk matter, the relation between the
macroscopic polarization field and the microscopic atomic polarizability, and finally
develop further the central role played by the radiating dipole in the construction of
antennas and antenna arrays. Later in a complement to Chapter 5 we will also describe
how standing waves are established in a cavity and how to calculate the energy density
by counting modes. The energy density is then applied to the problem of classical black-
body radiation theory, and we will describe how it leads to a fundamental contradiction
between the predictions of classical theory and experimental measurement.
µ0
≡ 10−7 (2.3)
4π
and the numerical value of the permittivity of free space is fixed by the condition that
1
= c2 (2.4)
0 µ0
where c is the speed of light in vacuum. Therefore we must have
1
= 10−7 c2 (2.5)
4π0
The time-independent vector fields issuing from these force expressions are the electric
field, E, and the magnetic induction field, B. The E-field is the Coulomb force per
unit charge,
1 q
E= r (2.6)
4π0 r3
and the magnetic induction B-field is given by the Biot-Savart law,
I×r dl × r
Z Z
µ0 µ0
B= dl = I (2.7)
4π r3 4π r3
where I is the current running in a wire and dl is an element of the wire length. The
Lorentz force law succinctly summarizes the effect of the E- and B-fields on a charged
particle moving with velocity v,
F = q(E + v × B) (2.8)
In addition to these two fields, the displacement field D and magnetic field H are
needed to describe the modification of force fields in dielectric or conductive matter.
In linear, isotropic materials these two additional fields are linked to E and B by the
“constitutive relations”
D = E (2.9)
1
H= B (2.10)
µ
where and µ are the permittivity and permeability of the material. These material
parameters are related to those of free space by
= 0 r (2.11)
µ = µ0 µr (2.12)
where r and µr are the relative (and unitless) permittivity and permeability respec-
tively. The relative permittivity r is often called the dielectric constant, but the rel-
ative permeability rarely finds application and does not have a common alternative
name. Unfortunately many authors use for the dielectric constant so there is danger
of confusion between permittivity (with MKS units of C 2 /N · m2 ) and the unitless
dielectric constant. However one can always discern from the context what is meant
by the symbol . There is also some difference of opinion among authors as to the most
appropriate nomenclature for the B-field and H-field. Here we follow conventional us-
age and call the B-field the “magnetic induction field” and the H-field the “magnetic
field”.
Classical Fields in Matter 9
D = 0 E (2.13)
B = µ0 H (2.14)
When these force fields act on a material medium, however, the D and H fields take
on added terms. Matter consists of positively charged core nuclei surrounded by dis-
tributions of negatively charged electrons. If the core nuclei are arranged according to
some symmetric spatial extension the material is crystalline; if not the material may
be a glassy solid, a liquid or a gas. The electric charge distribution may be bound to
the nuclei or delocalized throughout the crystal structure. In any case if E and B are
present, the the Lorentz forces acting on the electric charge distribution will clearly
modify it. These modifications can be characterized by the introduction of two new
fields, the polarization P and the magnetization M. In the presence of matter the
displacement and magnetic fields now are expressed as
D = 0 E + P (2.15)
B = µ0 H + M (2.16)
For E-and B-fields that are not too strong the P and M fields themselves are propor-
tional to E and H.
P = 0 χe E (2.17)
M = µ0 χm H (2.18)
where χe , χm are electric and magnetic susceptibility. Equations 2.15, 2.16 then become
D = 0 (1 + χe )E (2.19)
B = µ0 (1 + χm )H (2.20)
and from Eqs. 2.9, 2.10 we see that the relative permittivity and permeability can be
written in terms of the corresponding susceptibilities as
r = 1 + χ e (2.21)
µr = 1 + χm (2.22)
The relative permittivity and permeability are unitless but may be complex, the imag-
inary parts reflecting absorptive loss.
to electric and magnetic forces. The equations governing the spatial and temporal
behavior of these fields are Maxwell’s equations,
∇ · D = ρfree (2.25)
∇·B=0 (2.26)
∂B
∇×E=− (2.27)
∂t
∂D
∇ × H = Jfree + (2.28)
dt
The first two equations make statements about field sources. The first states that the
source of the displacement field D is the “free” electric charge density ρfree . In fact
the charge density is composed of two terms: the free charge density and the “bound”
charge, ρbound .
ρ = ρfree + ρbound (2.29)
The bound charge density is defined as the negative divergence of the polarization
field,
ρbound = −∇ · P (2.30)
so that from Eqs. 2.25, 2.30 we see that
ρ
∇·E= (2.31)
0
The second source equation, Eq. 2.26, states that the magnetic induction field B does
not originate from a magnetic charge density, ρm . In fact Equation 2.26 implies that
magnetic source “charges” do not exist; and magnetic monopoles have never been
found in nature (although Dirac’s quantum electrodynamics implies that that must
exist somewhere or have existed at some time). The second pair of equations, termed
Faraday’s Law and the Maxwell-Ampère Law respectively, describe the spatial and
temporal behavior of the fields. Equation 2.28, Ampère’s Law, introduces another field,
the charge current density Jfree . Just as Eq. 2.29 expresses the total charge density
as the sum of the free and bound charge densities so the total current density J is
composed of the sum of free and bound current densities.
The bound charge density is defined by two contributions: one from the polarization
field and the other from the magnetization field.
∂P 1
Jbound = + ∇×M (2.33)
∂t µ0
and consequently, by Eqs. 2.15, 2.16, 2.28, 2.33,
∂D
Jfree = ∇ × H − (2.34)
∂t
The term ∂D/∂t is called the displacement current.
Static Fields, Potentials, and Energy 11
∇·B=0 (2.43)
but we find from the Biot-Savart law describing the magnetic field issuing from current
flowing in a wire, Eq. 2.7, that the curl of the B-field is given by
∇ × B = µ0 J (2.44)
where J is the distribution of the current densities giving rise to B. Just as we intro-
duced the scaler potential V (r) to characterize the curl-less E-field, so we can introduce
a vector potential A to characterize the divergence-less B-field.
B=∇×A (2.45)
and setting the divergence of the vector potential to zero, we find the standard ex-
pression for A is
J(r0 ) 0
Z
µ0
A(r) = dτ (2.46)
4π r
where r = |r − r0 |, the distance between the point r where A is evaluated and r0 the
position of the current density J. Just as it is possible to write the energy of some
distribution of electric charge as an integral over the product of the charge density
distribution ρ(r) and the scalar potential V (r) (Eq. 2.41), we can write the magnetic
energy as the integral over scaler product of the current density distribution J(r) and
the vector potential A(r) Z
1
Emag = J · A dτ (2.47)
2
By recognizing that J is proportional to the curl of B from Eq. 2.45, substituting in
Eq. 2.47 and integrating by parts, we can write the magnetic energy entirely in terms
of the B-field in an expression analogous to Eq. 2.42
Z
1
Emag = B 2 dτ (2.48)
2µ0
and this result can in fact be obtained directly from one of the basic postulates of
classical electrodynamics, the Lorentz force law,
Static Fields, Potentials, and Energy 13
F = q(E + v × B) (2.50)
The force F acting on charged particles set in motion with velocity v does work on
the particles at the rate
dEmech
= qE · v (2.51)
dt
where Emech is the mechanical energy of the particle system. Of course the B-field,
always acting
R at right angles to the direction of motion, can do no work on the particles.
But q = ρdτ and v = Jfree /ρ so
Z
dEmech
= E · Jfree dτ (2.52)
dt
Now we can use the Ampère-Maxwell law, Eq. 2.28, to eliminate Jfree in Eq. 2.52. The
dot product in the integrand becomes
∂D
E · Jfree = E · (∇ × H) − E · (2.53)
∂t
The next step is to invoke a vector-field identity and the Faraday law, Eq. 2.27, to
rewrite the first term on the right of Eq. 2.53. The vector-field identity is
∇ · (E × H) = H · (∇ × E) − E · (∇ × H) (2.54)
where the integral of the second term on the right is taken over the surface σ bounding
the volume integral in the first term. The two terms in the volume integral are identified
with the electric and magnetic parts of the electromagnetic field energy. In a free-space
14 Elements Classical Field Theory
volume or in any medium with negligible polarization the decrease in the field energy,
as work is being done on the system of particles, can be written,
Z
dEem d 1 1 2
− =− B + 0 E 2 dτ (2.58)
dt dt 2 µ0
confirming that the total electromagnetic field energy can be identified as the sum
of the separate static magnetic and electric terms as in Eq. 2.49. We can now write
Eq. 2.57 as an energy conservation expression,
I
dEmech dEem
+ = − E × H · da (2.59)
dt dt
Where the term on the right is interpreted as an integral over the energy flux (energy
per unit area per unit time) flowing across the bounding surface. This cross product
expression is termed the Poynting vector, S.
S = (E × H) (2.60)
Notice that energy increasing within the volume is equal to a negatively signed energy
flux into the volume across the boundary surface. The negative sign on the Poynting
vector is simply a consequence of the sign convention on the vector surface differential
da, positive pointing outward.
Solution 2.1
To solve this problem, let us consider the following model for the hydrogen atom. We
assume that the electron is represented by a cloud of charge density ρ, symmetrically
distributed in a sphere of radius R centered at the origin x = 0 where a positively
charged proton is located.
e e
ρ= = (2.61)
V 4/3πR2
The force of attraction between the proton and the electron charge distribution is
given by Coulomb’s law.
e2
F=− r̂ (2.62)
4π0 r3
Under the action of a constant external E-field, the cloud will be displaced from its
equilibrium position. Let us call x the displacement distance between charge centers at
Three Illustrative Applications 15
Fig. 2.1 Displacement of the electron charge cloud under the influence of a constant external
E-field.
which a new balance of forces will occur. Figure 2.1 illustrates this displacement. We
seek to calculate the restoring force on the electron charge density using Gauss’s law.
This restoring force will be exerted by the charge fraction δq located at the surface
of the sphere with radius x. The fraction of charge outside the volume Vx = 4/3πx3
produces no force on the proton. Taking ρ as the electron charge density, we have
x 3
δq = (ρ)(Vx ) = −e . (2.63)
R
The magnitude of the effective Coulomb force between the proton and the displaced
electron charge cloud is now
eδq e2 x3
|F| = 2
=− = −Kx (2.64)
4π0 x 4π0 x2 R3
which shows that small displacements result in a linear restoring force with the force
constant K.
e2
K= (2.65)
4π0 R3
Remembering that the frequency of oscillation is given by
r s
K e2
ω= = (2.66)
m 4π0 R3 me
and taking R = 0.529−10 m as the radius of the hydrogen atom in the ground state,
and me = 9.109 × 10−31 Kg, the rest mass of the electron, we calculate the charge
frequency of oscillation ωe = 4.13 × 1016 s−1 .
Example 2.2 Calculate the E-field of a three-dimensional static electric dipole from
the potential.
16 Elements Classical Field Theory
Solution 2.2
In Section 2.8 we studied the oscillating dipole and emphasized the time-dependence
of the field and radiation. Here we study the static dipole and illustrate the usefulness
of calculating the E-field by first writing down the scalar potential, then taking its
divergence.
A dipole is a distribution of two equal but opposite charges, separated by a distance
a as shown in Fig. 2.2. The dipole is a vector which by convention points from the
negative to the positive charge. In Fig. 2.2 the dipole p is oriented along z, p = qaˆ
z .
The calculation of the E-field is carried out through the electric potential using E =
−∇V (x, yz). Since the dipole is oriented along the z-axis and has cylindrical symmetry
around this axis, the dipole field is invariant with rotation about the azimuthal angle
ϕ. From Fig. 2.2, at point r we have for the potential,
q 1 1
V (x, y, z) = − (2.67)
4π0 r1 r2
a 2 a 2
r12 = r2 + − a r cos θ and r22 = r2 + + a r cos θ (2.68)
2 2
Eq. 2.67 takes the form
q 1 1
V (r, θ) = 1/2 − (2.69)
4π0
1/2
a 2 a 2
r2 + 2 − ar cos θ r2 + 2 + ar cos θ
Taking advantage of the dipole cylindrical symmetry, we calculate the electric field by
applying the gradient operation to the potential in polar coordinates,
with
∂V 1 ∂V
Er = − and Eθ = − (2.71)
∂r r ∂θ
Carrying out the derivative operations on Eq. 2.69 we find
a a
q r− 2 cos θ r+ 2 cos θ
Er = 3/2 − (2.72)
4π0 2
3/2
a 2 a 2
r + 2 − ar cos θ r2 + 2 + ar cos θ
and
a a
q 2 sin θ 2 sin θ
Eθ = 3/2 + (2.73)
4π0
3/2
a 2 a 2
r2 + 2 − ar cos θ r2 + 2 + ar cos θ
In many practical problems the dipole potential and field are of interest when r >> a,
the region of space termed the far field. Under these conditions we can write
a 2 n a 2 a n
2 2n
r + ± a r cos θ = r 1+ ± cos θ
2 2r r
2n a 2 a
'r 1+n ± cos θ (2.74)
2r r
and therefore write the dipole potential Eq. 2.67 in the far field as
q 1 1 a 2 a 1 a 2 a
V (r, θ) = 1− − cos θ − 1 − + cos θ
4π0 r 2 2r r 2 2r r
(2.75)
which, after simplification, yields
qa cos θ
V (r, θ) ' (2.76)
4π0 r2
and the far-field dipole E-field
18 Elements Classical Field Theory
Fig. 2.3 Relation between cartesian and polar coordinates for dipole oriented along z axis
We can write the dipole potential and E-field, Eqs. 2.76, 2.77, in vector form as
p · ˆr
V (r, θ) = (2.78)
4π0 r2
and
(2p · ˆr )ˆ
r − (p · ˆθ )ˆ
θ
E(r, θ) = 3
(2.79)
4π0 r
where ˆr , ˆθ are unit vectors in polar coordinates. As discussed at some length in
Complement C, the dipole vector E-field can be expressed either in polar or cartesian
coordinates. The unit vectors of the E-field in polar coordinates are ˆr , ˆθ , ˆϕ , and
the dipole p in Fig. 2.3 can be written in terms of the polar components as
The relations between the unit vectors in polar coordinates and cartesian coordinates
are given by
ˆr = ˆx sin θ cos ϕ + ˆy sin θ sin ϕ + ˆz cos θ (2.81)
ˆθ = ˆx cos θ cos ϕ + ˆy cos θ sin ϕ − ˆz sin θ (2.82)
ˆϕ = −ˆ
x sin ϕ + ˆy cos ϕ (2.83)
Three Illustrative Applications 19
Fig. 2.4 Dielectric sphere immersed in a uniform electric field oriented along the z direction.
By multiplying the first of these relations by cos θ and the second by − sin θ we can
easily show that
r − sin θˆ
cos θˆ θ = ˆz (2.84)
In Fig. 2.3 we see that in an r, θ plane, carrying out the dot products in Eq. 2.80 with
the help of Eqs. 2.81, 2.82, 2.83 results in
r − sin θˆ
p = p(cos θˆ θ ) (2.85)
and therefore
(p · ˆθ )ˆ
θ = −p sin θˆ
θ = p − (p cos θ)ˆ
r = p − (p · ˆr )ˆ
r (2.86)
Solution 2.3
Consider a dielectric sphere placed in a uniform electric field E as shown in Fig. 2.4
We analyze the problem in polar coordinates. The potential along the z direction is
given by
V = −E · ẑ = −E0 r cos θ (2.88)
Our first goal is to determine the potential within and external to the sphere. To find
the potential we solve Laplace’s equation,
∇2 V = 0 (2.89)
20 Elements Classical Field Theory
in both regions and the join the solutions at the boundary. We show in Complement
C, Eq. C.38 how to express the Laplacian operator in polar coordinates. An equivalent,
slightly different form is
1 ∂2 ∂2V
1 ∂ ∂V 1
∇ · ∇V = (rV ) + · sin θ + (2.90)
r ∂r2 r2 sin θ ∂θ ∂θ r2 sin2 θ ∂ϕ2
The problem of the dielectric sphere subject to a constant electric field along the z axis
is obviously cylindrically symmetric. Before attacking this problem directly we examine
the form of the solutions to Laplace’s equation for spherical symmetry. Assuming for
the moment a spherically symmetric potential V , we write V as a product of a radial
function R(r)/r, a polar angular function Θ(θ), and an azimuthal angular function
Φ(ϕ).
R(r)
V (r, θ) = Θ(θ) Φ(ϕ) (2.91)
r
Now we substitute this form back into Eq. 2.90 and arrange all terms with r and θ
dependence on the left side and all terms with ϕ dependence on the right side. After
some algebra the result is
r2 sin2 θ ∂ 2 R(r) ∂ 2 Θ(θ) 1 ∂2Φ
sin θ ∂Θ(θ)
+ cos θ + sin θ = − (2.92)
R(r) ∂r2 Θ(θ) ∂θ ∂Θ2 (θ) Φ(ϕ) ∂ϕ2
Since r, θ, ϕ are independent variables the only way that Eq. 2.92 can be true is for
each side to be equal to a constant, called the “separation constant”. We can choose
the form of this constant to be anything we want, so we choose it to be m2 . From the
right side of Eq. 2.92 we then have
1 ∂2Φ
= −m2 (2.93)
Φ(ϕ) ∂ϕ2
and we can see by simple inspection and can verify by substitution that the solutions
are
Φ(ϕ) = e−imϕ (2.94)
In order for Φ to be a single-valued function in ϕ (modulo 2π), m must be an integer,
m = 0, ±1, ±2, . . .. After division by sin2 θ the left side of Eq. 2.92, also equal to m2 ,
can be rearranged to
r2 ∂ 2 R ∂ 2 Θ(θ) m2
1 ∂Θ(θ)
+ cos θ + sin θ − =0 (2.95)
R(r) ∂r2 sin θΘ(θ) ∂θ ∂θ2 sin2 θ
The first term in Eq. 2.95 depends only on r and the next two only on θ. Therefore
they can also be separated and set equal to a separation constant, the form of which
we choose to be l(l + 1).
r2 d2 R
= l(l + 1) (2.96)
R(r) dr2
d2 Θ(θ) m2
1 dΘ(θ)
− cos θ + sin θ − = l(l + 1) (2.97)
(sin θ)Θ(θ) dθ dθ2 sin2 θ
Three Illustrative Applications 21
Equation 2.97 can be made more concise by recognizing that the term in square brack-
ets is an expanded differential of a product of functions. Multiplying both sides by Θ
we have
m2
1 d dΘ
sin θ − Θ = −l(l + 1)Θ (2.98)
sin θ dθ dθ sin2 θ
The radial and angular differential equations can finally be rearranged to
d2 R l(l + 1)
− =0 (2.99)
dr2 r2
d2 Θ m2
dΘ
+ cot θ + l(l + 1) − Θ=0 (2.100)
dθ 2 dθ sin2 θ
We have already observed that m must be zero or an integer in order to maintain Φ as
a single-valued function. In this field-matching problem of a dielectric sphere immersed
in a constant electric field, oriented along z, the E-field will show no ϕ dependence,
and we can set m = 0. The resulting expression is called Legendre’s equation.
d2 Θ dΘ
+ cot θ + [l(l + 1)] Θ = 0 (2.101)
dθ2 dθ
Often the independent variable of the Legendre equation is taken to be x = cos θ
rather than θ itself. Then Eq. 2.101 takes the form
d dΘ
1 − x2 + l(l + 1)Θ = 0 (2.102)
dx dx
The physically admissible solutions to Eq. 2.102 are a family of polynomials in cos θ
called the Legendre polynomials. They are labeled by the index l which must only
assume the positive integer values l = 0, 1, 2, . . .. The first few Legendre polynomials
are listed in Table 2.1 The Rodrigues’ formula is a concise expression of any desired
member l of the Legendre polynomials
1 dl l
Pl (x) = l l
x2 − 1 (2.103)
2 l! dx
The solution to the separated radial equation, Eq. 2.99, is
R(r) = Arl + Br−(l−1) (2.104)
where A and B are as yet undetermined constants. That Eq. 2.104 really is a solution
can be verified by direct substitution.
22 Elements Classical Field Theory
Now we are in a position to write the general solution, Eq. 2.91, to Maxwell’s
equation governing the response of a dielectric sphere immersed in a constant electric
field.
∞ h
V (r, θ) X i
= U (r, θ, ϕ) = Arl + Br−(l+1) Pl (cos θ)e−imϕ (2.105)
r
l=0
Because the problem has cylindrical symmetry and the solution is invariant about the
z axis of the applied E-field, the m = 0 solution of Φ(ϕ) = e−imϕ is the appropriate
choice, and Φ(ϕ) = 1. Furthermore, we must fit the solution to the physical boundary
conditions at the origin, at infinity and at the interface on the surface of the sphere.
Since the potential must remain finite as r → 0, it is clear that we should choose B = 0
for r inside r0 , the radius of the sphere. In this region
∞
X
Uin (r, θ) = Al rl Pl (cos θ) (2.106)
l=0
Outside the sphere, in the limit r → ∞ we see from Eq. 2.88 that the asymptotic
solution must take on the form −E0 cos θ. In order to fit this result, A1 = −E0 , with
all other Al = 0. The solution outside the sphere must therefore have the form
∞
X
Uout (r, θ) = −E0 r cos θ + Bl r−(l+1) Pl (cos θ) (2.107)
l=0
for the first continuity condition. For the second condition we remember that
∂U (r, θ)
Er = − (2.109)
∂θ
and that components of the D-field normal to the boundary must be continuous. Then
we have for the second continuity condition
∞ ∞
" #
(l−1) −(l+1)
X X
−in Al lr0 Pl (cos θ) = −out −E0 r0 cos θ + Bl r0 Pl (cos θ) (2.110)
l=0 l=0
where in , out are the permittivities inside and outside the sphere. Next we will elim-
inate all terms with summations in Eqs. 2.108, 2.110 except one by using the orthogo-
nality and normalization properties of the Legendre polynomials.
Z +1
2
Pl (cos θ)Pm (cos θ) d(cos θ) = δ(ml) (2.111)
−1 2m +1
Three Illustrative Applications 23
The result is
2 2 −(m+1) 2
Am r0m = − E0 r0 + Bm r0 (2.112)
2m + 1 3 2m + 1
and
−(m+1) 2 out m + 1 2 out 1
Bm r0 1+ = E0 r0 1 − · (2.113)
2m + 1 in m 3 in m
or, h i
2
2 1 − out 1
·m
−(m+1) in
Bm r0 = E0 r0 h i (2.114)
2m + 1 3 1 + out m+1in m
Substituting Eq. 2.114 into the right side of Eq. 2.112 results in the following expression
involving Am and r0 .
h i
out 1
2
2 2 1 − in · m
Am r0m = − E0 r0 + E0 r0 h i (2.115)
2m + 1 3 3 1+ out m+1
in m
The continuity condition should hold for any choice of r0 , the radius of the sphere,
and we see that if we choose m = 1, then the expression for Am will be independent
of r0 . Then we find
3out
A1 = −E0 (2.116)
in+2out
and from Eq. 2.114
in − out
B1 = r03 E0 (2.117)
in + 2out
Now we can substitute A1 and B1 back into Eqs. 2.107, 2.106 to obtain the expressions
for the scalar potential U (r, θ) outside and inside the dielectric sphere.
r3
in − out
U (r, θ)out = −E0 r cos θ + E0 02 cos θ outside potential (2.118)
r in + 2out
3out
U (r, θ)in = −E0 r cos θ inside potential (2.119)
in + 2out
From Eqs. 2.118, 2.119 we can easily calculate the E-field outside and inside the sphere
which was the initial goal we set out to reach.
3out
Ein (r, θ) = E0 êz (2.120)
in + 2out
Fig. 2.5 Charges q1 , q2 , q3 , . . . are point sources on which E-field lines originate. The integral
form of Gauss’s law states that the surface integral of E over the volume enclosing these
charges is equal to the total charge divided by the permittivity.
ρfree ∂B
∇·E= ∇×E=− Faraday’s Law
0 ∂t
∂D
∇·B=0 ∇×H= +J Maxwell-Ampère Law
∂t
Dynamic Fields and Potentials 25
In the case of static fields the E-field is irrotational, ∇ × E = 0, and the B-field
is “divergenceless”, ∇ · B = 0. Therefore the E-field can be specified by the gradient
of a scalar function, the scalar potential V and the B-field is determined by the curl
of a vector function, the vector potential A.
In the case of fields changing in time the E-field is no longer irrotational because
∂B ∂A
=∇×
∂t ∂t
and substituting into Faraday’s Law we have
∂A
∇ × E = −∇ ×
∂t
or
∂A
∇× E+ =0 (2.125)
∂t
The quantity E + ∂A/∂t is itself irrotational and can therefore be written as the
gradient of a scaler function.
∂A
E+ = −∇V
∂t
or
∂A
E = −∇V − (2.126)
∂t
Then from Maxwell’s E-field divergence equation
∂A ρfree
∇ · E = ∇ · −∇V − =
∂t 0
or
∂ ρfree
∇2 V + (∇ · A) = − (2.127)
∂t 0
Substituting B = ∇ × A into the left side of the Maxwell-Ampère Law and taking the
time derivative of Eq. 2.126 we find
∂2A
∂V
∇ × (∇ × A) = µ0 J − µ0 0 ∇ − µ0 0 2
∂t ∂t
∂2A
∂V
∇2 A − µ0 0 2 − ∇ ∇ · A + µ0 0 = −µ0 J (2.128)
∂t ∂t
Equations 2.127 and 2.128 are just a reformulation of the Maxwell relations in terms
of the time-dependent potentials rather than fields.
26 Elements Classical Field Theory
and Z " #
µ0 J(r0 , tr ) J̇(r0 , tr ) ˆ 0
B(r, t) = + × `dτ (2.135)
4π `2 c`
In Eqs. 2.134, 2.135 J̇ denotes the time derivative of the source current density, and
thus we write Z
∂A µ0 J̇ 0
= dτ (2.136)
∂t 4π `
which is used in the third term on the right side of Eq. 2.134.
Fig. 2.7 Electric dipole oscillator aligned along z axis. The maximum length of the dipole
is a and the coordinates of field point p, with respect to the dipole midpoint, are indicated
in spherical coordinates r, θ, φ.
Substituting the current into Eq. 2.133 the vector potential for the harmonic oscillator
is
µ0 a/2 −q0 ω sin ω t − rc ẑ
Z
A(r, t) = dz (2.139)
4π a/2 r
But since the length a of the oscillator is short compared to the wavelength we can
simply replace the integral with the integrand multiplied by a.
µ0 p0 ω h r i
A(r, t) ' − sin ω t − ẑ (2.140)
4πr c
We can write down the retarded scaler potential for some point r at time t, V (r, t) by
using Eq. 2.132 and Fig. 2.7.
1 q0 cos [ω(t − r1 /c)] q0 cos [ω(t − r2 /c)]
V (r, t) = − (2.141)
4π0 r1 r2
Now if the point where the field is to be evaluated is sufficiently far from the dipole
such that|r| >> a and the length of the dipole is sufficiently short such that the
Dipole Radiation 29
wavelength of light λ = ω/c >> a then it can be easily shown by expanding 1/r1,2 in
a Taylor series that
p0 cos θ ω h r i 1 h r i
V (r, θ, t) ' − sin ω t − + 2 cos ω t − (2.143)
4π0 rc c r c
∂A µ0 p0 ω 2 h r i
=− cos ω t − ẑ (2.145)
∂t 4πr c
∂A µ0 p0 ω 2 h r i
=− cos ω t − (cos θr̂ − sin θθ̂) (2.146)
∂t 4πr c
Substituting Eqs. 2.143 and 2.146 into 2.144 and writing the gradient operator ∇ in
spherical coordinates (see Appendix C),
∂ 1 ∂V
∇= r̂ + θ̂
∂r r ∂θ
r i ω 2
µ0 p0 sin θ ω h h r i
Bϕ = − sin ω t − + cos ω t − ϕ̂ (2.149)
4π r2 c rc c
30 Elements Classical Field Theory
We see that in the quasi-stationary regime the surviving E-field and B-field terms from
an oscillator source (the first terms on the right in Eqs. 2.157–2.159) are in quadra-
ture, not in phase as in far-field radiation. This regime corresponds to conventional
engineering circuit response where the product of frequencies ω (radio and microwave)
and circuit size r are well within the ωr/c . 2π criterion.
Note that when the quasi-stationary regime goes to the truly stationary, ω → 0,
Eqs. 2.157–2.159 go to their electrostatic dipole limits,
P = 0 χ e E (2.160)
where χe is the linear electric susceptibility, an intrinsic property of the medium re-
sponding to the light field. Recall the relation between the electric field E, the po-
larization P, and the displacement field D in a material medium. In the rationalized
MKS system of units the relation is
D = 0 E + P (2.161)
Furthermore, for isotropic materials, in all systems of units, the so-called constitutive
relation between the displacement field D and the imposed electric field E is written
D = E
D = 0 (1 + χ)E
and
= 0 (1 + χ)
32 Elements Classical Field Theory
as the permittivity, and one must decipher from the context whether the permittivity
(units of C 2 /J · m) or relative permittivity–dielectric constant (unitless) is intended.
For example it is clear from expressions involving the refractive index such as
√
n = η + iκ =
or a propagation parameter √
k = k0 n = k
that is the dielectric constant whereas in an expression involving a constitutive
relation such as
D = E
the factor denotes a material permittivity.
In a dielectric medium the traveling wave solutions of Maxwell’s equation become,
ηz
E = E0 ei(kz−ωt) −→ E0 e[iω( )−ω κc z]
c −t
and the complex spatial part of the field is called a phasor. Unless otherwise explicitly
stated otherwise we will assume from here forward that the usual fields of electromag-
netism, E, B, D, H, P, M, J, S are in the phasor form.
1 The reader should be mindful that physics literature and engineering literature use different
conventions to describe time-harmonic phasor fields. The form A = A0 ei(k·r−ωt) is the physics
convention. The form A = A0 e−j(β·r−ωt) is the engineering convention. An important consequence
of this choice is that complex quantities ã must be expressed as ã = a+iα with the physics convention
and ã = a−jα with the engineering convention. Thus a complex propagation parameter in the physics
convention would be written k̃ = k + iκ while engineers would write β̃ = α − jγ. Most of the circuit,
transmission line and waveguide literature is written by engineers.
Plane Electromagnetic Waves 35
Fig. 2.8 Plane electromagnetic wave propagating in free space. The E-field is aligned along
x, the B-field along y, and the wave vector k is oriented along z.
2π
ω= v = kv (2.185)
λ
In free space v is the speed of light c and in other homogeneous materials is given by
c
v=
n
where n is the index of refraction of the material. The index of refraction in free space
is unity and in matter is always greater. In materials with negligible absorption, n is a
real number greater than unity, but in absorbing, “lossy” materials n is complex with
a positive imaginary term. As light of a single frequency propagates through materials
of different indices of refraction, ω remains constant, but the phase velocity changes as
c/n and the wavelength shortens by λ/n. It is important not to confuse the “angular”
frequency ω with the frequency of optical cycles ν. The relation is always
ω = 2πν
The magnetic induction field B, the other constituent of the plane electromagnetic
wave propagating in free space in the z direction, is given by
with complex amplitude B̃0 and polarization direction êB . The reason for denoting
the field amplitudes complex is to express a possible phase relation between them. In
fact the divergence and curl relations in Maxwell’s equations insure that the E- and
B-fields are orthogonal to the direction of propagation and that the amplitudes are in
Plane Electromagnetic Waves 37
phase. The E and B fields combine to form the electromagnetic (E-M) field. They are
mutually orthogonal and orthogonal to the direction of propagation.
where êk is the unit vector of the propagation parameter k. The relative amplitude
between magnetic induction and electric fields in vacuum is given by,
1 √
B0 = E0 = 0 µ0 E0 (2.187)
c
and for a traveling wave through free space and homogeneous matter, the time oscil-
lation of the E and B fields is always in phase.
∂Ex ∂Hy
= −µ = iωµHy (2.188)
∂z ∂t
∂Ey ∂Hx
=µ = −iωµHx (2.189)
∂z ∂t
and
∂Hy ∂Ex
= − = iωEx (2.190)
∂z ∂t
∂Hx ∂Ey
= = −iωEy (2.191)
∂z ∂t
Equations 2.188 and 2.190, relating Ex and Hy , are coupled as are Eqs. 2.189 and
2.191, relating Ey and Hx . The first of these pairs can be decoupled by differentiating
Eq. 2.188 by z and Eq. 2.190 by t. The order of differentiation is interchangeable which
results in
∂ 2 Ex ∂ 2 Ex
− µ =0 (2.192)
∂z 2 ∂t2
Taking into account the harmonic time variation at frequency ω, we have
∂ 2 Ex
+ µω 2 Ex2 = 0 (2.193)
∂z 2
The product of the material parameters µ = 1/v 2 , where v is the phase velocity
of the plane wave in the medium. The plane wave propagation parameter k = ω/v
38 Elements Classical Field Theory
so Eq. 2.193 can be written finally as the one-dimensional wave equation for the Ex
component of the plane wave.
∂ 2 Ex
+ k 2 Ex2 = 0 (2.194)
∂z 2
Clearly analogous equations can be written for all the field components in the relations
Eqs. 2.188–2.191. Furthermore, substitution of plane wave E-field solution, Eq. 2.184,
into the second-order wave equation, 2.179, results in
∇2 E + k 2 E = 0 (2.195)
√
with k = ω/c = ω µ0 0 when the waves are propagating in free space. Equation 2.195
is clearly valid for each of the Cartesian components and can be written simply in
terms of the wave amplitude,
∇2 E + k 2 E = 0 (2.196)
as such it is called the Helmholtz scalar wave equation. The analogous equation is
obviously valid for the plane-wave H-field, and the relation between the E-field and
H-field plane-wave amplitudes is specified by Eq. 2.189.
r
E µ0
B= and therefore E= H (2.197)
c 0
p
The proportionality factor between the amplitudes, µ0 /0 , has units of impedance
and is variously called the impedance of free space, or intrinsic impedance, or wave
impedance and is denoted by Z0 .
r
µ0
Z0 = = 376.73 Ohms (2.198)
0
In lossless media µ and are real numbers, but in lossy dielectrics or conductive
materials such as metals or semiconductors, the permittivity becomes complex. We
examine the interaction of plane waves and lossy or conductive media in the next
section.
We start again with Faraday’s law and the Maxwell-Ampère law including both
displacement and charge current in a medium characterized by µ, , σ. Assume further
that Ohm’s law obtains so that J = σE.
∂B ∂H
∇×E=− = −µ
∂t ∂t
∂D ∂E
∇×H= +J= + σE
∂t ∂t
Let us posit plane solutions with the E-field aligned along x, B-field aligned along y,
and propagating along z.
Ex = E0 ei(kz z−ωt)
(2.199)
By = B0 ei(kz z−ωt)
The single frequency, harmonic time dependence results in
∇ × E = iµωH (2.200)
∇ × H = (−iω + σ)E (2.201)
Take the curl of both sides of Eq. 2.200 and apply the standard “curl-curl” identity
∇ × ∇ × E = iµω∇ × H (2.202)
2
∇(∇ · E) − ∇ E = iµω∇ × H (2.203)
2
−∇ E = iµω∇ × H (2.204)
Because, as usual, we assume a source-free environment where the plane wave is prop-
agating, ∇ · E evaluates to zero in Eq. 2.203. Now substituting Eq. 2.201 into the right
side of Eq. 2.204 results in
σ
∇2 E + µω 2 1 + i E=0 (2.205)
ω
This is clearly a “Helmholtz-like” wave equation, and since we have posited the E-field
along x,
∂ 2 Ex σ
2
+ µω 2 1 + i Ex = 0 (2.206)
∂z ω
and the H-field along y
∂ 2 Hy σ
2
+ µω 2 1 + i Hy = 0 (2.207)
∂z ω
The k of Eq. 2.196, there assumed real, generalizes to a complex expression
√ σ 1/2
k = µω 1 + i (2.208)
ω
For nonmagnetic lossy materials we can write µr = 1 an = 0 r = 0 (0 + 00 ). Then
1/2
2π 0 00 σ
k= +i + (2.209)
λ0 ω0
40 Elements Classical Field Theory
Plane Waves in a Lossy Dielectric (but poor Conductor). If σ/ω0 << 00 then the
material is considered a poor conductor or good insulator and the propagation param-
eter becomes
2π 0
k→ ( + i00 )1/2 (2.210)
λ0
A plane wave propagating through this medium reflects dissipative loss due to 00 .
0 00 1/2
Ex = E0 ei[2π/λ0 ( +i ) z−ωt]
(2.211)
In many cases |00 /0 | << 1 in which case we can write
2π √ 0 00
k' 1+i 0 (2.212)
λ0 2
and Eq. 2.211 can be written
00 √
−(2π/λ0 ) √ z i[(2π/λ0 ) 0 z−ωt]
Ex = E0 e 2 0 e (2.213)
The first exponential factor with the real argument damps the amplitude E0 as the
wave propagates along z. The second exponential factor√ represents the usual harmonic
wave oscillation through a medium characterized 0 . The two factors evidently rep-
resent a damped travelling plane wave. As 00 → 0 the damping vanishes and the
oscillatory term represents√a plane wave propagating through a lossless dielectric with
real refractive index η = 0 . The wave impedance in a lossy medium is again deter-
mined from Faraday’s law. We find in this case
r
µ0
Ex = Hy (2.214)
0 (0 + i00 )
and r
µ0
Z= (2.215)
0 (0 + i00 )
In the limit of a lossless, nonmagnetic dielectric, 00 → 0,
r
µ0 1
Z= = Z0 √ (2.216)
0
Plane Waves in a Good Conductor (low frequency limit). As in Section 2.10.4 here
we continue to treat σ as a real quantity. We shall see in Section 2.10.7 that although
this assumption is adequate for radio and microwave frequencies, it breaks down at
optical frequencies.
When σ/ω0 >> 00 we have a good conductor. The generalized propagation pa-
rameter, Eq. 2.209 becomes, as 00 ω0 /σ → 0,
1/2
2π 0 σ
k→ +i (2.217)
λ0 ω0
The good conductor (often a metal such as silver or gold) is then characterized by
the real part of the permittivity and the conductivity of the metal at frequency ω.
Plane Electromagnetic Waves 41
If the dissipative part 00 of the permittivity is retained, then the imaginary terms in
Eq. 2.209 can be grouped together and labeled as a “total” 00tot .
σ
00tot = 00 + (2.218)
ω0
and an “equivalent conductivity” including both terms of the imaginary part of the
permittivity can be defined as
σequiv = ω0 00tot (2.219)
Returning to Eq. 2.273, we see that if σ/ω0 >> 0 then a plane wave entering the
conductor will by damped by the conductivity term.
1/2
2π √
σ
k' i (2.220)
λ0 ω0
√ √
Using the identity i = (1 + i)/ 2,
1/2
2π σ
k' (1 + i) (2.221)
λ0 2ω0
The imaginary term damps the plane wave propagating into the conductor. The depth
of penetration, the skin depth δ is defined as inverse of this imaginary term
1/2
λ0 2ω0
δ= (2.222)
2π σ
As a specific example take silver metal as a good conductor and λ0 = 514.5 nm, the
“green line” of an Argon-ion laser. At this wavelength, 0 ' −15 and 00 ' 0.3. The
conductivity of silver is σ ' 6.3 × 107 S/m. The ratio σ/ω0 ' 2 × 103 much greater
than the real or imaginary parts of the dielectric constant. The expression for the skin
depth is therefore valid and for this example δ ' 2.6 × 10−9 m. At optical frequencies
an incident plane only penetrates a few nanometers below the surface of a good metal.
The impedance is again determined from Faraday’s law, specifying the relation
between the E- and H-fields in the conductor.
r −1 r
µ0 (1 + i) ω0
Ex = √ Hy (2.223)
0 2 σ
and
r
ω0
Z = Z0 e−i(π/2) (2.224)
σ
Note that the E-field lags the H-field in the conductor by a phase angle of 45 degrees.
42 Elements Classical Field Theory
and the time-rate of doing work on or by these charges by the electromagnetic field is
is
dE
= E · Jtot (2.230)
dt
Now we use the Maxwell-Ampère law and Faraday’s law in their forms appropriate
for time-harmonic phasors. We generalize the permittivity to include the possibility
of lossy dielectrics and conductors and write = 0 (0 + i00 ), but do not consider the
much less frequent case of magnetic materials. Therefore we leave the permeability as
µ0 and write
∂E∗
∇ × H∗ = ∗ + J∗ tot = iω∗ E∗ + J∗ tot (2.231)
∂t
∂H
∇ × E = −µ0 = iωµ0 H (2.232)
∂t
Plane Electromagnetic Waves 43
∇ · (E × H∗ ) = (∇ × E) · H∗ − (∇ × H∗ ) · E (2.233)
Then applying Stokes’ theorem to the term on the left, rearranging the terms on the
right, and dividing all terms by 2,
Z I Z Z
1 ∗ 1 ∗ 1 00 2 1
− Jfree · E dV = (E × H ) · da + ω0 |E| dV + σ|E|2 dV
2 2 S 2 2
Z
iω 0 2 2
+ 0 |E| − µ0 |H| dV (2.235)
2
This power balance equation requires careful interpretation. The term on the left can
be considered a source term from which energy is emitted. The negative sign denotes
power flowing away from the emitter through the surface S of the enclosing volume V .
The first term on the right is immediately recognizable as the cycle-averaged power
passing out of the volume enclosed by surface S. This term is the phasor version of
Eqs. 2.57, 2.59; and we can identify the integrand with the Poynting vector
S = E × H∗ (2.236)
The next two terms indicate dissipative loss: absorptive loss from 00 and conductive
loss from σ due to material present within the enclosed volume. The last term describes
energy “stored” in the material. Taking into account Eq. 2.227, we can write this last
term as Z Z
dEstored iω 0 2 1 dP
= χ 0 |E| dV = − E· dV (2.237)
dt 2 2 dt
and we see that this term represents power flow into the polarizability of the material.
Since polarizability is a density of dipoles, we can think of this term as that part of the
total emitted energy that is stored in the dipole oscillators of the material rather than
the dipole oscillators of the field. The negative sign indicates that the stored energy
and field energy oscillate out of phase.
General Case. We posit a plane propagating from left to right along z, with E-field
linearly polarized along x and H-field along y as shown in Fig. 2.8. The interface is in
the x − y plane and situated at z = 0. The half-space to the left (z < 0) is vacuum,
and the half-space to the right (z > 0) is characterized by , µ0 , σ. The incident E-
and H-phasor fields are given by
Ei = E0 eik0 z x̂ (2.238)
E0 ik0 z
Hi = H0 eik0 z ŷ = e ŷ (2.239)
Z0
p
where Z0 is the impedance of free space, Z0 = µ0 /0 . At the interface a fraction of
the wave is reflected and a fraction is transmitted. We write the reflected wave as2
Et = T E0 eikz x̂ (2.242)
E0 ikz
Ht = T e ŷ (2.243)
Z
In the material to the right, the propagation parameter k is given by Eq. 2.209 and
rewritten here for convenience,
1/2
2π 0 σ
k= + i 00 +
λ0 ω0
At z = 0 the total parallel field amplitudes on either side of the boundary must be
continuous across it. Therefore
Ei + Er = Et (2.244)
Ei − REi = T Ei (2.245)
Hi + RHi = T Hi (2.246)
Ei Ei Ei
+R =T (2.247)
Z0 Z0 Z
From Eqs. 2.245, 2.247 we have
1−R=T (2.248)
1 R T
+ = (2.249)
Z0 Z0 Z
2 The choice of phase for the reflected wave is a matter of convention. The reflected waves could
also be written Er = RE0 e−ik0 z x̂ and Hr = −R(E0 /Z0 )e−k0 z ŷ. The first convention is often used
in physical optics while the second is common in electrical engineering. The second choice results in
a change of sign for the reflection coefficient with the consequence that R = (Z − Z0 )/(Z0 + Z) and
1 = T − R.
Plane Electromagnetic Waves 45
1=R+T (2.250)
Z0 − Z
R= (2.251)
Z0 + Z
2Z
T = (2.252)
Z0 + Z
Lossless Dielectric. For a lossless dielectric 00 = 0 and σ = 0 so the propagation
vector becomes k = (2π/λ0 ) 0 . The Poynting vector on the left side of the boundary
is
E02
1 − R2 ẑ
S− = (2.255)
Z0
and since R < 1, the net energy flux propagates from left to right. In the region z < 0
the oscillatory term in Eq. 2.254 is due to the standing wave set up by reflection of
the plane wave at the boundary. Notice that the phase of the standing wave is in
quadrature with respect to the incident wave. The Poynting vector on the right side is
Lossy Dielectric. In the case of a lossy dielectric the propagation vector becomes
2π 0 1/2
k= ( + i00 ) (2.258)
λ0
2π
= (η + iκ) (2.259)
λ0
and the wave impedance becomes complex.
r r
µ0 µ0 Z0
Z= = = (2.260)
0 (0 + i00 ) η + iκ
46 Elements Classical Field Theory
Therefore, from Eqs. 4.60, 2.252 the reflection and transmission coefficients become
complex, and the parallel field component continuity conditions at the boundary now
give
2
T = (2.261)
(η + 1) + iκ
(η − 1) + iκ
R= (2.262)
(η + 1) + iκ
Figure 2.9 shows a plane wave incident from the left on a slightly absorbing slab of
SiO. The shortened wave length and slight loss of amplitude due to absorption can be
discerned within the material. The Poynting vector in the z < 0 region is
2
|E0 | 2
Sz<0 = 1 + R∗ ei2k0 z − Re−i2k0 z − |R| ẑ (2.263)
Z0
and in the z > 0 region
2 2
2 |E0 | −2κz |E0 | 4(η − iκ) −2κz
Sz>0 = |T | e ẑ = · e ẑ (2.264)
Z∗ Z0 (η + 1)2 + κ2
So we see that the complex Poynting vector is conserved across the boundary. When
the Poynting vector is complex, we interpret the energy flux or the power flow as the
cycle-averaged real part of the Poynting vector. The net energy flux traversing the
boundary from the z < 0 region is therefore
2
1 |E0 | 2η
P− = Re[S − ] = · (2.267)
2 Z0 (η + 1)2 + κ2
and the energy flux penetrating the lossy dielectric on the z > 0 side of the boundary
is
2
1 |E0 | 2η
P + = Re[S + ] = · e−2κz (2.268)
2 Z0 (η + 1)2 + κ2
Note that the incident energy flux on the boundary from the left is
2
E0∗ −ik0 z
1 |E0 |
Pi = Re E0 eik0 z · e = (2.269)
2 Z0 2Z0
Plane Electromagnetic Waves 47
Fig. 2.9 Top panel shows the real part of the incident E-field Ei , reflected E-field Er and
total E-field ET in air to the left of the air-dielectric interface. The wave length of the
propagating wave in air is λ0 = 620 nm. Within the dielectric the transmitted E-field Et is
shown. The dielectric is SiO with index of refraction n = η + iκ = 1.969 + i0.01175. Bottom
panel shows the real part of the incident H-field Hi . reflected H-field Hr and total H-field HT
in air to the left of the air-dielectric interface. Within the dielectric the transmitted H-field
Ht is shown. Note the shortening of the wave length and the slight decrease in amplitude as
the wave penetrates into the lossy dielectric.
2 |E |2
|E0 | 2 0 2η
PT = Pi + Pr = 1 − |R| = · (2.271)
2Z0 Z0 (η + 1)2 + κ2
accords with the net power flow calculated from the superposed incident and reflected
fields (Eqs. 2.263, 2.267).
Good Conductor (Low Frequency Regime). A plane wave incident on a perfect con-
ductor and on a good conductor are shown in Figs. 2.10, 2.11. The difference between
“perfect” and “good” is essentially the magnitude of the conductivity, σ. When σ → ∞
a conductor approaches the perfect conductor idealization. In the case of a good con-
ductor
1/2
2π 0 σ
k= +i (2.272)
λ0 ω0
As we saw in the case of silver metal, for good conductors σ/ (ω0 ) is on the order of
thousands while 0 is on the order of unity. We can then to a quite good approximation
write
1/2
2π σ
k' i (2.273)
λ0 ω0
and recalling from Eq. 2.222 that the skin depth is defined as
1/2
λ0 2ω0
δ=
2π σ
we can express k as
√
2i 1+i
k' =± (2.274)
δ δ
The sign of k is chosen so that the wave amplitude decays exponentially as it propagates
into the good conductor in the z > 0 half space. Wave impedance in the conductor is
r r
µ0 µ0 −1/2 ω0 1/2 k0 δ
Z= = ·i = Z0 · (1 − i) (2.275)
0 σ 2
Fig. 2.10 Top panel shows the real part of the incident E-field Ei , reflected E-field Er
and total E-field ET in air to the left of the air-perfect metal interface. The wave length
of the propagating wave in air is λ0 = 632 nm. Within the perfect conductor (σ → ∞) no
electromagnetic field penetrates. Bottom panel shows the real part of the incident H-field
Hi . reflected H-field Hr and total H-field HT in air to the left of the air-dielectric interface.
Within the perfect conductor the transmitted H-field Ht is null.
2
− |E0 |
· 1 + R∗ − R − |R|2
Sz=0 =
Z0
2
|E0 | 4k0 δ
= · 2 (1 − i) (2.280)
Z0 2 + 2k0 δ + (k0 δ)
Fig. 2.11 Top panel shows the real part of the incident E-field Ei , reflected E-field Er
and total E-field ET in air to the left of the air-dielectric interface. The wave length of the
propagating wave in air is λ0 = 632 nm. Within the dielectric the transmitted E-field Et is
shown. Bottom panel shows the real part of the incident H-field Hi , reflected H-field Hr and
total H-field HT in air to the left of the air-dielectric interface. Within the dielectric the
transmitted H-field Ht is shown.
2 2
|E0 | |T | −(2/δ)z
Sz>0 = e ẑ
Z∗
2
|E0 | 4k0 δ −(2/δ)z
= · 2 (1 − i)e ẑ (2.281)
Z0 2 + 2k0 δ + (k0 δ)
2 2
+ |E0 | |T |
Sz=0 =
Z∗
2
|E0 | 4k0 δ
= · 2 (1 − i) (2.282)
Z0 2 + 2k0 δ + (k0 δ)
− +
and it is clear that the complex Poynting vectors Sz=0 = Sz=0 . Because the skin depth
is only a few nanometers we see that the energy flux penetrates significantly only a
few tens of nanometers. As in the case of the lossy dielectric the power penetrating
into the metal is given by
1 +
P+ = Re Sz=0
2
2
|E0 | 2k0 δ
= · 2 (2.283)
Z0 2 + 2k0 δ + (k0 δ)
(2.284)
And the power entering the metal from the z < 0 side is
1 −
P− = Re Sz=0 (2.285)
2
2
|E0 | 2k0 δ
= · 2 (2.286)
Z0 2 + 2k0 δ + (k0 δ)
(2.287)
So we see that P − = P + and the power across the boundary is conserved. Furthermore
we can confirm that the sum of the incident power Pi and reflected power Pr is equal
to the power transmitted across the boundary.
1 |E0 |2
1 − |R|2
Pi + Pr = (2.288)
2 Z0
|E0 |2 2k0 δ
= · 2 (2.289)
Z0 2 + k0 δ + (k0 δ)
(2.290)
Now from the earlier example in Section 2.10.4 for an air-silver metal interface, the
value for k0 = 1.22 × 107 m−1 , the skin depth δ = 2.6 × 10−9 m, and the product
k0 δ = 3.2 × 10−2 . Therefore
|E0 |2
P− = P+ ' · k0 δ (2.291)
Z0
The cycle-averaged incident power is
|E0 |2
Pi = (2.292)
2Z0
52 Elements Classical Field Theory
and therefore the fraction of the incident power entering the metal is
P+
= 2k0 δ = 6.4 × 10−2 (2.293)
Pi
|E0 |2
Pr = −|R|2 (2.294)
2Z0
Pr
' (1 − k0 δ) = 96.8 × 10−2 (2.295)
Pi
Current Density and Joule Heating. We can easily calculate the magnitude and phase
of the current density induced in the real metal by using Ohm’s law and the field
continuity conditions at the interface. From Ohm’s law we have
J = σEt x̂ (2.296)
Ei (1 − R) = Ei T = Et
where the expression for the conductor permittivity m is obtained from Eq. 2.273.
Since |Z| << |Z0 | the transmission coefficient T can be written
r
2Z 2Z ω0
T = ' = 2 e−iπ/4 (2.298)
Z0 + Z Z0 σ
We see that the transmitted E-field exhibits a phase lag with respect to the incident
E-field by π/4, and Eq. 2.296 shows that the current density induced in the metal also
lags the incident E-field by π/4.
Plane Electromagnetic Waves 53
Both the current density J and the transmitted E-field are rapidly attenuated expo-
nentially as they penetrate the conductor. From Eq. 2.274 we can write the propagation
parameter in the conductor in terms of the skin depth.
√
J(z) = 2 σω0 E0 ei(z/δ−π/4) e−z/δ x̂ (2.300)
r
ω0
Et (z) = 2 E0 ei(z/δ−π/4) e−z/δ x̂ (2.301)
σ
The power density (W m−3 ) associated with “Joule heating” in the conductor is given
by
dPJoule
= Et · J∗ (2.302)
dV
In order to compare this Joule heating power to P + or P − we need first to integrate
over the penetration depth in the z direction, then integrate over a cross sectional
area of one square meter in the x − y plane in order to calculate the volume power
dissipated in the conductor. This power dissipation corresponds to the third term on
the right in the power-balance, Eq. 2.235.
Z 1 Z 1 Z ∞
1
PJoule = Re Et · J∗ dxdydz
2 0 0 0
2 δ
= 2 ω0 |E0 |
2
2
|E0 |
= · k0 δ (2.303)
Z0
and we see that this result agrees with our earlier power calculation, Eq. 2.291, based
on the Poynting vector energy flux entering the conductor normal to its surface.
Substituting the complex quantities into Eq. 2.306 and gathering the real and imagi-
nary parts,
02 00 2
km − km = µ0 0 0 ω 2 − µ0 σ 00 ω
σ 00 σ 00
= µ0 0 ω 2 0 − = k02 0 − = α2 (2.308)
0 ω ω0
0 00
2km km = µ0 0 00 ω 2 + µ0 σ 0 ω
σ0 σ0
= µ0 0 ω 2 00 + = k02 00 + = β2 (2.309)
0 ω ω0
The two coupled equations, 2.308, 2.309 can be solved for the real and imaginary parts
of km .
r
α
q
0 4
km = ± √ 1 ± 1 + (β/α) (2.310)
2
r
α
q
00 4
km = ± √ −1 ± 1 + (β/α) (2.311)
2
We also know that the propagation parameter within the metal km is related to the
free-space propagation parameter k0 through the complex index of refraction n in the
metal. 2
km
= n2 = (η + iκ)2 (2.312)
k0
But n is also related to the real and complex “effective” dielectric constants in the
metal by
n2 = 0eff + i00eff (2.313)
The relations between η, κ and 0eff , 00eff are
r r
0eff q
η= 1 ± 1 + (00eff /0eff ) (2.314)
2
r r
0
eff q
κ= 1 ± −1 + (00eff /0eff ) (2.315)
2
But
Plane Electromagnetic Waves 55
0
km = k0 η (2.316)
00
km = k0 κ (2.317)
Then comparing Eqs. 2.316, 2.317 to Eqs. 2.310, 2.311 we see that
σ 00
0eff = 0 − (2.318)
ω0
σ0
00eff 00
= + (2.319)
ω0
The two terms 0 , 00 are the real and imaginary parts of the metal dielectric constant
excluding the highly polarizable conduction electrons. They represent the response of
the much more tightly bound valence electrons to the transmitted plane wave. The
terms involving σ 0 , σ 00 represent the frequency-dependent response of the conduction
electrons to the transmitted wave. In high-conductivity materials these latter two
terms dominate the response. Now real metals are dispersive which means that 0eff and
00eff depend on the frequency. A simple dispersion relation can be found if the metal
conductor is modeled as a harmonically oscillating free-electron gas. The harmonic
motion is driven at the incident wave frequency ω and damped at a rate Γ, inserted into
the model phenomenologically and corresponding to acoustic and radiative dissipation.
Damped Harmonic Oscillator Model for Conduction Current. We posit that the equa-
tion of motion of the conduction electrons is governed by harmonic acceleration driven
by a plane-wave electromagnetic field propagating in the metal and polarized along
the x direction.
d2 x dx
me 2 + me Γ = eEt = eT E0 eikm z−ωt (2.320)
dt dt
where me , e are the electron mass and charge, Γ a phenomenological damping constant,
and eEt = eT E0 ei(km z−ωt) the driving force on the conduction current. Dropping the
common e−iωt factor we have for the solutions of position, velocity and acceleration
1
x=− eT E0 (2.321)
me (ω 2+ iΓω)
dx ω
=i eT E0 (2.322)
dt me (ω 2 + iΓω)
d2 x ω2
2
= 2
eT E0 (2.323)
dt me (ω + iΓω)
Now the amplitude of the conduction current density is given by
dx ω
Γω − iω 2 eT E0
Jc = eNe = eNe (2.324)
dt me
where the electron density in the metal conduction band Ne is related to the resonance
frequency of the oscillating electrons ωp by
e 2 Ne
ωp2 = (2.325)
me 0
56 Elements Classical Field Theory
The resonant frequency is called the “bulk plasmon frequency”. The conduction cur-
rent Jc can now be written as
ωp2
Jc = 0 T E 0 (2.326)
Γ − iω
But from the standard constitutive relation we have
1
Jc x̂ = σEt x̂ = σ0 T E0 x̂ (2.327)
0
Comparing Eqs. 2.327 and 2.326 and separating real and imaginary parts, we see that
σ Γ ω
= 2 + i (2.328)
0 Γ ω2 Γ2 ω2
ω2 + p ωp2 ωp2 + ωp2
Now from Eq. 2.325 we can estimate the bulk plasmon frequency and compare it to ω
and Γ. A high-conductivity metal such as silver exhibits a conduction electron density
Ne ' 6 × 1028 m−3 and therefore ωp ' 1.3 × 1016 s−1 . The damping rate Γ due to
radiative loss and acoustic coupling is typically ∼ 1014 s−1 , and ω in the visible and
near IR is ∼ 1015 s−1 . Therefore we can write
σ ωp2 ωp2
'Γ 2 +i (2.329)
0 ω ω
Thus at relatively low frequencies (radio and microwave) the real term dominates, and
the conductivity is said to be “ohmic”. As the driving frequency into the optical range,
however, the imaginary term dominates. At ω ∼ 1015 s−1 the real term is only about
5% of the imaginary term. We can, from Eqs. 2.329, 2.318, 2.319, write the frequency
dependence of the “effective” dielectric constants in the metal as
!
0 0
ωp2
eff = − 2 (2.330)
ω
!
00 00
Γωp2
eff = + 3 (2.331)
ω
The valence electrons in a metal are very tightly bound with a large energy gap between
the valence ground and excited states. Therefore we can to good approximation write
0 = 1 and 00 = 0. The effective, frequency-dependent dielectric constants then simplify
to
!
2
ωp
0eff ' 1 − 2 (2.332)
ω
!
00
Γωp2
eff ' (2.333)
ω3
Exercises 57
Fig. 2.12 Plane wave propagating from left to right, reflected at the interface.
2.11 Exercises
1. A plane wave propagates in free space with an E-field and B-field given by
Eqs. 2.184 and 2.186, calculate the Poynting vector S associated with this wave.
2. As shown in Fig. 2.12 a plane wave of wavelength 620 nm is incident from the
left on a lossless, nonmagnetic dielectric slab with dielectric constant 0 = 2.25.
Calculate the reflection and transmission coefficients R, T , the reflected power
and the transmitted power, assuming and E-field amplitude of 1 V/m. Assume
the dielectric slab to be infinitely thick.
3. Now reconsider the dielectric slab in Fig. 2.12 as a slightly lossy dielectric with an
index of refraction corresponding to SiO at λ0 = 620 nm. At this wavelength n =
η + iκ = 1.969 + i0.01175. Calculate the reflection and transmission coefficients,
the reflected power and the transmitted power for this case.
58 Elements Classical Field Theory
60
A
Energy Flow in Polarizable Matter
∇ · S = ∇ · (E × H) (A.3)
= −E · (∇ × H) + H · (∇ × E) (A.4)
From Maxwell-Faraday law and the Maxwell-Ampère law, Eqs. 2.27, 2.28, we have
∂B ∂H
∇×E=− = −µ0 (A.5)
∂t ∂t
and
∂D
∇×H= + Jfree (A.6)
∂t
so that
∂D ∂H
∇·S=−E· + Jfree + H · −µ0 (A.7)
∂t ∂t
∂D ∂H
= − E · Jfree + E · + µ0 H · (A.8)
∂t ∂t
Now specify the case where there no free currents (Jfree = 0), but material within the
enclosed volume is polarizable. Then the displacement field is
D = 0 E + P
62 Polarizable Matter
and
∂D ∂H
−∇ · S = E · + µ0 H · (A.9)
∂t ∂t
1 ∂ 1 ∂ ∂P
= 0 · (E · E) + µ0 (H · H) + E ·
2 ∂t 2 ∂t ∂t
∂ 1 ∂P
0 E 2 + µ0 H 2 + E ·
= (A.10)
∂t 2 ∂t
The term in brackets on the right side of Eq. A.10 is the field energy and the second
term is the time rate of change of the “mechanical energy” of the material. We see that
the flow of energy into or out of any material included within the volume is controlled
by the time dependence of the polarization field in that material.
∂2P ∂P
+γ + ω02 P = ωd2 0 E (A.11)
∂t2 ∂t
where ωd2 is a “strength of external field coupling” parameter and E is the external
driving field. The coupling parameter ωd clearly is the frequency of the driving field
E. After a few lines of tedious but straight-forward algebra we find
stored
z }| {
dissapative
z }| {
2 2
∂P 1 1 ∂ ∂P 2 2
γ ∂P
E· = + ω0 P + (A.12)
∂t 0 ωd2 2 ∂t ∂t | {z } 0 ωd2 ∂t
| {z } potential
kinetic
Remembering that for a polarizable but nonmagnetic material the bound current den-
sity is given by,
∂P
Jbound = (A.13)
∂t
and
dEmech
= E · Jbound (A.14)
dt
we see immediately that Eq. A.12 represents the flow of mechanical energy into and
out of the ensemble of electric dipoles constituting the material of the system. Three
terms constitute this mechanical energy: the kinetic and potential energy stored in the
dipoles and the dissipated mechanical energy associated with the damping rate γ as
denoted in Eq. A.12.
Drude-Lorentz Dispersion 63
the fields in complex representation and calculate the absolute value of the energy
terms. In complex notation the E-field and its time derivative are written
∂E
E = E0 e−iωt = −iωE
and
∂t
and the absolute value of the stored energy factor is
2 " 2 #
∂P 2 2 2 2 ∂E 2 2
+ ω0 P = ( − 1) 0 + ω0 E
∂t ∂t
2
∂P 2 h 2
i
+ ω02 |P| =20 ω02 + ω 2 (0 − 1)2 + (00 ) E02 (A.23)
∂t
Now from the dispersion relation Eq. A.17 we can also write
2
0 2 00 2
ωp4 ω02 − ω 2 γ 2 ωp4 ω 2
( − 1) + ( ) = h i2 + h i2 (A.24)
2 2
(ω02 − ω 2 ) + (γω)2 (ω02 − ω 2 ) + (γω)2
ωp4
= 2 (A.25)
(ω02 − ω 2 ) + (γω)2
Therefore, from Eqs. A.22, A.23, A.25 we have the absolute value of the stored energy
from Eq. A.12
" 2 # " #
1 ∂P 2 2 0 ωp2 ω02 + ω 2
+ ω0 |P| = E02 (A.26)
20 ωp2 ∂t 2 (ω02 − ω 2 )2 + (γω)2
By “stored energy” we mean that part of the energy density that flows from the fields
into the dipoles, is stored there and can flow back to the fields.
Making analogous substitutions we have for the dissipative part of dEmech /dt,
2
γωp2 ω 2
γ ∂|P|
= 2 0 E02
0 ωp2 ∂t 2 2
(ω0 − ω ) + (γω)2
and writing the magnetic field energy in terms of the electric field energy
0 0 p 0 2
H2 = ||E 2 = + 00 2 E 2 (A.31)
µ0 µ0
the stored energy density Us within the curly brackets on the right side of Eq. A.30
can be written p
0 0 2ω 00 2 2
Us = + 0
+ + 00 E2 (A.32)
2 γ
and averaging Us over an optical cycle
p
0 2ω 00
hUs i = 0 + + 0 2 + 00 2 E02 (A.33)
4 γ
In the limit of a lossless material, 00 , γ → 0,
1 0 2
hUs i = 0 E 0 (A.34)
2
as expected.
Finally, we can express this conservative energy density in terms of the real and
imaginary parts of the index of refraction, η and κ, respectively.
0 = η 2 − κ2
(A.35)
00 = 2ηκ (A.36)
So that
2 2ωηκ
U s = 0 η + E2 (A.37)
γ
and averaging Us over an optical cycle,
0 2 2ωηκ
hUs i = η + E02 (A.38)
2 γ
66 Polarizable Matter
and
∂P 2 ∂E
q
2
= 0 (0 − 1) + (00 ) (A.42)
∂t ∂t
So substituting A.42 into A.39 we have
∂Ud γ0 ω 2 h 0 2 2
i
= 2
( − 1) + (00 ) E 2 (A.43)
∂t ωp
where E0 is the electric field amplitude. This expression can be further simplified by
noting that
2 2 ωp4
(0 − 1) + (00 ) = 2
(ω02 − ω 2 ) + (γω)2
∂Ud
= 0 00 ωE 2 (A.44)
∂t
and is therefore in phase with the driving field energy flux, Sz = Ex Hy cos2 ωt. There-
fore the rate of change of stored energy is in quadrature
2
dUs 2 2ωηκ
= −2ω0 η + E02 cos ωt sin ωt (A.48)
dt γ
In contrast from Eq. A.44 the rate of change of the dissipated energy density is in
phase with the driving field energy flux
dUd
= 0 00 ωE02 cos2 ωt (A.49)
dt
while the dissipated energy itself accumulates over the optical cycles
ωt sin 2ωt
Ud = 0 00 E02 + (A.50)
2 4
Figure A.1 shows the time dependence of the various constituents of the energy density
in a polarizable medium.
Us ' 0 0 E 2 (A.51)
The stored energy exhibits the same “dispersive” frequency dependence as a driven
harmonic oscillator, in phase with the driving field below resonance, ω0 − ω > 0 and
π out of phase at frequencies above resonance, ω0 − ω < 0. In the simplest case of
a metal modeled with a single resonance at the bulk plasmon frequency, ω0 = ωp ,
the stored energy responds in phase at all frequencies “to the red” of ωp . For most
68 Polarizable Matter
Fig. A.1 Top panel: Red (full) curve shows time dependence of stored energy (normalized
to 0 E02 ), Eq. A.47, and blue (dashed) curve shows time dependence of the rate of change
of stored energy, Eq. A.48. Bottom panel: Red (full) curve shows accumulation of dissipated
energy with time, Eq. A.50 and blue (dashed) curve shows rate of change of dissipated energy
(dissipated power), Eq. A.49. These graphs are plotted with the following parameters taken
for silver metal: 0 = −9.108, 00 = 0.753, ω = 3.768 × 1015 s−1 ,(λ = 500 nm), γ = 1.0 × 1014 .
common noble metals ωp is in the blue or near ultraviolet regions of the spectrum.
Therefore, for incident driving fields in the visible or near-infrared, the usual case in
plasmonic studies, the stored energy is in phase with the incident energy flux.
The dispersion relation for the material loss term is
γωωp2
00 (ω) = 2 2
(A.54)
(w02 − ω 2 ) + (γω)
and, using Eq. ?? the dissipative energy density is peaked at the resonance frequency
ω0 , decreasing symmetrically as the driving frequency is tuned above or below reso-
Drude-Lorentz Dispersion 69
nance. " #
γωωp2
ωt sin 2ωt
Ud = 0 2 2
E02 + (A.55)
(w02 − ω 2 ) + (γω) 2 4
The dispersive and absorptive frequency profiles are simply those of a driven, damped
harmonic oscillator. This behavior is consistent with the polarization of the material
modeled as a density of dipoles.
B
Macroscopic Polarization from
Microscopic Polarizability
B.1 Introduction
In Chapter 2 the polarization vector field was introduced in Eqs. 2.15, 2.17. The polar-
ization field P adds vectorially to the applied electric field (multiplied by the permittiv-
ity) 0 E to produce the total displacement field D, and in many cases the polarization
field itself is proportional to the applied electric field. The proportionality constant,
the susceptibility χ, is a property of the material; and the dielectric constant of the
material is simply related to this susceptibility by = 1 + χ. As the term suggests, the
susceptibility is a measure of the response of the material to an external applied field.
Since matter is actually composed of atoms and molecules, the macroscopic response
must somehow be related to the distortion of local electric fields around the constituent
microscopic particles. The goal of this complement is to develop that relation.
Fig. B.1 External electric field Eext is applied to two external conducting parallel plates.
The material slab with dielectric constant and susceptibility χ exhibits a polarization field
−P the source of which is the surface charge density σP induced at the slab-plate interface.
The volume of the sphere inside the slab is large with respect to the size of individual dipoles
comprising the material but small compared to the bulk volume of the slab itself.
the bulk volume. Using Gauss’s law, we can compute this E-field EP at the center of
the sphere due to the bound charge density around the surface. The net field at the
center of the sphere, experienced by a molecule at that site, is then Eext − P/0 + EP .
The displacement field D = 0 Eext must be continuous across the slab boundary,
and therefore the E-field at the point of interest due to the external applied field is
D P
= Eext + (B.1)
0 0
The surface charge density at the two slab interfaces σPslab is the source of a polariza-
tion field at all points within the slab equal to −P/0 . Now we invoke Gauss’s law to
calculate the E-field EP at center of the sphere from the polarization charge density
σPsphere at the sphere surface. This bound charge density is determined from the po-
larization component normal to the sphere, and therefore the distribution of charge
density around the surface goes as σPsphere (θ) = |P| cos θ where θ is the angle between
the z axis and the surface normal as shown in Fig. B.1. The E-field at the sphere center
is then
72 Polarization and Polarizability
σPsphere cos θ
Z
1
EP = dS (B.2)
4π0 S r02
where r0 is the sphere radius. The surface differential is dS = r02 sin θ dθ dϕ, and
integrating over the azimuthal angle ϕ we have
where p is the induced dipole moment of an individual molecule. Since the macroscopic
polarization P can be interpreted as a dipole density, we can write
Polarization and Polarizability 73
P
P = N p = N α0 Eeff = N α0 E + (B.11)
30
where N is the number of molecules per unit volume. Remembering that for linear
response
P = 0 ( − 1)E (B.12)
We can set B.11 equal to B.12 and find
−1 Nα N0 ρm α
= = (B.13)
+2 3 3M
where in the last term on the right N0 is Avogadro’s constant, ρm is the density and M
is the molecular mass. This expression is called the Clausius-Mossotti relation. It tells
us how we can calculate the dielectric constant , a macroscopic property, from the
microscopic polarizability, α. As such it furnishes a key bridge between microscopic
and macroscopic physics of matter.
C
The Classical Charged Oscillator and
the Dipole Antenna
p0 sin θ ω 2 h r i
Eθ = − cos ω t − θ̂ (C.1)
4π0 rc2 c
µ0 p0 sin θ ω 2 h r i
Bϕ = − cos ω t − ϕ̂ (C.2)
4π rc c
and taking account of µ0 = 1/0 c2 we note that the electric and magnetic field ampli-
tudes are related by, B = E/c. We can associate a characteristic “current” ic of the
dipole with length a oscillating at frequency ω,
ωp0
ic = (C.3)
a
and a charactersitic electric field Ec
1/2
µ0 ic a sin θ
Ec = (C.4)
0 2λr
where λ is the wavelength of light at frequency ω. We recall from Eq. 2.155 that the
cycle-averaged energy flux emitted by the oscillator is given by
µ0 p20 ω 4 sin2 θ
hSi = (C.5)
32π 2 c r2
or in terms of ic and Ec ,
1/2 2
1 µ0 ic a sin θ
hSi = (C.6)
8 0 λr
Figure C.1 shows the distribution of radiated power density from a point dipole. The
The Proto-antenna 75
Fig. C.1 Angular distribution of power density (power radiated through a unit area) from
a radiating point dipole
where we have used the fact that ω 4 p20 = ω 4 p20 /2. Now from a units analysis it is
easy to show that
1/2
µ0
' 377 ohms (C.8)
0
and the cycle-averaged power emitted by the oscillator can be set equal to the average
power required to drive the oscillator
1
hPi = Rc i2c (C.9)
2
where Rc is the characteristic oscillator impedance to radiation,
1/2
2π µ0
Rc =
3 0
=790 ohms
and for a given dipole length and wavelength (a << λ) the radiative impedance Rr is
1/2
2π µ0 a 2
Rr = (C.10)
3 0 λ
76 Dipole Antenna
Fig. C.2 Geometric disposition for determining the E-field of a line antenna of length R at
point M by integrating contributions from point dipoles aligned along z.
Finally the directional gain of any antenna, G(θ), is defined as the ratio of power
emitted in a given direction to the total radiated power. In the case of the dipole
proto-antenna, from Eqs. C.5, C.7,
hSi 3
G(θ) = 4πr2 = sin2 θ (C.11)
hPi 2
We seek to write the distance P M in terms of r and then integrate the resulting
expression for E over the length of the antenna. Invoking the law of cosines we can
write
Real Antennas 77
2 !1/2
2 2
1/2 OP 2OP cos θ
P M = r + OP − 2r OP cos θ =r 1+ − (C.13)
r r
At distances far from the antenna where r >> OP we have P M → r. Then substitut-
ing this result into C.12 and using k = ω/c we have
1/2 "Z #
+R/2
µ0 sin θ
dz e−[ω(t− c )] θ̂
r
ikz z
E(M ) ' ic (z)e (C.14)
0 2λr −R/2
where
2π
kz = k cos θ =
λ
C.2.1 The Half-wave Antenna
We posit that the antenna length is λ/2 and that a proto-dipole oscillator source,
located at the midpoint, is driving current to the antenna extremeties. A standing
wave is set up along the antenna such that
2πz
ic (z) = ic cos = ic cos (kz) (C.15)
λ
Then the integral in Eq. C.14 becomes
Z λ/4
2πz
Ic (k) = ic cos eik cos θz dz (C.16)
−λ/4 λ
and the resulting E-field at P is
1/2 θ
µ0 ic cos π cos 2
h r i
Ehw (P ) = − cos ω t − θ̂ (C.17)
0 2πr sin θ c
The cycle-averaged energy flux for the half-wave dipole antenna, hSihw , is
ic cos2 π cos θ2
2
1
hSihw = (C.18)
8π 2 0 c r2 sin2 θ
and the total cycle-averaged power of the half-wave antenna is
1 2 π cos2 π cos θ2
Z
hPihw = i dθ (C.19)
4π0 c c 0 sin θ
hPihw =36.5i2c
1
hPihw = i2c Rr
2
and therfore the radiative impedance of the half-wave antenna is Rr = 73 ohms. A
typical FM radio emission frequency is 100 MHz, and from the above calculation we
see why a typical FM radio antenna consists of a 75 ohm cable about 1.5 meters in
length.
78 Dipole Antenna
Fig. C.3 Plot of the radiation distribution, normalized to the maximum at θ = π/2, from a
linear half-wave antenna and single point-dipole. Angles 78◦ and 90◦ correspoind to half-power
points for half-wave antenna and point dipole, respectively.
m=(N −1)
X
= Ehw (P )einϕ e−imϕ
m=0
1 − e−iN ϕ
= Ehw (P )einϕ
1 − e−iϕ
sin(N ϕ/2)
= Ehw (P ) (C.20)
sin(ϕ/2)
where ϕ is the azimuthal phase shift associated with the displacement a along x.
Figure C.5 shows the angular distribution of the E-field of a five-element linear array
as shown in Fig. C.4.
Real Antennas 79
Fig. C.4 Five λ/2 antennas arranged in a linear array along the x axis, each element of the
array separated by a distance d.
2π
ϕ = kx a = sin θ cos ϕ a
λ
Fig. C.5 Angular E-field distribution of a half-wave antenna array with five elements.
wavelength λ = 0.3 m. The separation between the elements of the array is d = λ/2 =
0.15 m. We see that the maximum of the principal lobes of intensity occur for θ = π/2
80 Dipole Antenna
Fig. C.6 Angular distribution of radiated power from the five-element array shown in
Fig. C.4. Note that the power directed along x with half-power points of 10.4◦ compared
to a single half-wave antenna with half-power points at 78◦ .
but we see small adjacent lobes, reminiscent of a diffraction pattern. This result is
hardly surprising since Eq. C.20 is the same as for linear diffraction grating of N
elements. The corresponding radiated power distribution of the five-element array is
shown in Fig. C.6. The concentration of the radiated power along x is clearly evident.
The angular aperture of the half-power points in the five-element array is only 10.4◦
compared to 78◦ for the single-element antenna.
3
Surface Waves
3.1 Introduction
We treat here an important wave phenomenon, the surface wave. It occurs in many
physical systems both mechanical and electromagnetic. Mechanical surface waves can
exist at the interface between two media with different densities such as the oceans
and earth. Ordinary sea waves and tsunamis are surface waves. Seismic events can pro-
duce both longitudinal and shear waves at the earth’s surface, resulting in earthquakes.
Electromagnetic surface waves can occur at the interface between dielectric and con-
ductive media or between two dielectrics satisfying appropriate boundary conditions.
Air-salt water or glass-metal interfaces support electromagnetic surface waves. Here
we focus on electromagnetic waves at the interface between common dielectrics and
noble metals such as air or glass and silver or gold. We will characterize these waves by
their distinctive properties of dispersion, spatial field distribution, and polarization.
These properties will be used to analyze useful phenomena such as wave guiding within
subwavelength structures and spatial light localization below the diffraction limit.
interface between air and land, considered as a weakly conductive material. His anal-
ysis showed that the Hertz vector Π, that describes the electromagnetic field in each
medium is given by the sum of two fields, P and Q corresponding to two waves. The
wave Q represents a space propagating wave and P a surface pguided wave. For long
distances R from the source, Q varies as (1/R), where R = (r2 + z 2√ ), (see Fig. 3.1
for a definition of the coordinates) while P varies asymptotically as 1/ r and is con-
fined to the Earth’s surface. Over the course of the next two decades a controversy
arose over the existence of the so-called Zenneck waves because Sommerfeld’s original
analysis contained a sign error, and it was found that it was difficult to excite the
surface-guided wave with practical antenna sources or to detect and distinguish the
true surface wave from the spatially propagating wave very close to the surface. In
any event, as far as long-distance wireless communication was concerned, the whole
question became irrelevant when it was discovered that reflection from the ionosphere
was the principal agent responsible for over-the-horizon radio transmission.
Fig. 3.2 Configuration of field components for TE and TM polarized plane waves incident
on a dielectric-metal interface.
1 Although in Chapter 2 the direction of propagation is always along z, in this chapter we must
consider both the incident propagating wave and the surface wave. The incident space-propagating
wave is taken to be incident along z, while the surface wave propagates along x.
84 Surface Waves
∂Ey
= (iωµ0 )Hx (3.3)
∂z
∂Hx ∂Hz TE polarization
− = −i(ω0 )Ey (3.4)
∂z ∂x
∂Ey
= −(iωµ0 )Hz (3.5)
∂x
and
∂Hy
= (iω0 )Ex (3.6)
∂z
∂Ex ∂Ez TM polarization
− = (iωµ0 )Hy (3.7)
∂z ∂x
∂Hy
= −(iω0 )Ez (3.8)
∂x
The incident and reflected waves propagate in the dielectric medium characterized
by the dielectric constant d . The propagation vector in free space is denoted k0 and
within the dielectric kd = k0 n where n is the index of refraction. The dielectric con-
stant and the index of refraction of any medium are related by = n2 . Since the
incident k = ki is in the x − z plane we can write
Similarly on the metal side of the interface for the transmitted propagation vector k m
we have
i ∂Eyd kd
Hxd = − = z Eyd (3.14)
ωµ0 ∂z ωµ0
d
i ∂Ey kd
Hzd = = − x Eyd (3.15)
ωµ0 ∂x ωµ0
Plasmon Surface Waves at Optical Frequencies 85
The same analysis applies to the metal side of the interface with the result
m m
Eym (x, z) = E0y
m i[(kx x+kz
e z)−ωt]
(3.16)
and
i ∂Eym km
Hxm = − = z Eym (3.17)
ωµ0 ∂z ωµ0
m
i ∂Ey km
Hzm = = − x Eym (3.18)
ωµ0 ∂x ωµ0
But this last equation can only be valid for k0 = 0 since the dielectric constants on the
two sides of the boundary can never be equal. Therefore there can be no wave solution
at the boundary for TE polarization.
Then using Eqs. 3.6, 3.8, we find expressions for Exd and Ezd .
kzd
Exd = H d (x, z) (3.24)
ω0 d y
kd
Ezd = − x Hyd (x, z) (3.25)
ω0 d
d
kzs,d = ± √ k0 (3.37)
d + m
m
kzs,m = ±√ k0 (3.38)
m + d
Note that if the real part of the dielectric constant in the metal is negative and if
d + m < 0, then kzm , kzd will be imaginary. The Ez component of the surface wave on
both sides of the interface will then be evanescent.
d
kzs,d → ±iκds = ±i √ k0 (3.39)
d + m
m
kzs,m → ±iκm
s = ±i √ k0 (3.40)
d + m
The choice of sign (±) ensures that the wave amplitude decreases exponentially with
increasing distance (±z) from the interface. With the choice of coordinate axes as
shown in Fig. 3.2,
Plasmon Surface Waves at Optical Frequencies 87
n2s = 0 m · m
00 (3.48)
m + d 1 + i 0 m
+d m
For the two metals commonly used in plasmonic devices, silver and gold, |00m /0m | <<
1 over the optical range of interest, and this characteristic can be used to simplify
88 Surface Waves
Fig. 3.3 Vertical (z) components of the surface wave electric field (E-field) and displacement
field (D-field). Note that Ez is discontinuous at the surface while Dz is continuous. The units
of the E-field and D-field are arbitrary.
Plasmon Surface Waves at Optical Frequencies 89
Fig. 3.4 Real part of the dielectric constant 0 vs. wavelength for silver from the Johnson
and Christy measurements (see citation 8 in the reading list at the end of the chapter).
Eq. 3.48. Figure 3.4 shows a plot the real and imaginary terms of the dielectric constant
of silver metal as a function of wavelength over the optical range; and Fig. 3.5 plots
the ratio of |00m /0m | << 1 for the same silver data. Separating Eq. 3.48 in real and
imaginary terms while dropping fractional terms quadratic in 00m , we find
0m d 00m d
2
ns = 0 · 1+i 0 0 (3.49)
m + d m (m + d )
and from the definition of the complex index of refraction
Setting the real and imaginary terms in Eqs. 3.49, 3.50 equal,
0m d
ηs2 − κ2s = (3.51)
0m + d
0m 00m d
2ηs κs = 2 (3.52)
0m (0m + d )
Equations 3.51, 3.52 can be decoupled and solved for ηs and κs , the two components
of the surface complex index of refraction. The result is
90 Surface Waves
Fig. 3.5 Imaginary part of the dielectric constant 00 vs. wavelength for silver from the
Johnson and Christy measurements (reference 8 in the reading list).
s
0m d
ηs = 0
(3.53)
m + d
3/2
1 00m 0m d
κs = (3.54)
2 (0m )2 0m + d
These expressions can be simplified further if the dielectric is vacuum or air. In that
case d can be set equal to unity and
s
0m
ηs = (3.55)
0m + 1
1 00m
κs = p · (3.56)
2 0m (0m + 1)3/2
Fig. 3.6 Ratio 00 /0 vs. wavelength for silver from the Johnson and Christy measurements
(reference 8 in the reading list).
kzd kd d d
dielectric ω0 d H( x, z, t) − ω0xd Hy (x, z, t) Hy0 ei(kx x+kz z−ωt)
kzm km m m
metal ω0 m Hy (x, z, t) − ω0xm Hy (x, z, t) Hy0 ei(kx x+kz z−ωt)
iκd ks d s
d side − ω0sd Hy (x, z, t) − ω0xd Hy (x, z, t) H0y eκs z ei(kx x−ωt)
interface
iκm ks m s
m side ω0 m Hy (x, z, t)
s
− ω0xm Hy (x, z, t) H0y e−κs z ei(kx x−ωt)
ηs = 1.0336
κs = 2.3534 × 10−3
k0 = 2π/λ0 = 9.942 × 106 m−1
Taking the “effective length” xeff to be the distance over which the amplitude of the
surface waves falls to 1/e of the initial value, we find xeff = 42.74 µm. Figure 3.7
shows the behavior of the surface wave Ex -field amplitude at the interface (z = 0),
propagating in the x direction. Figure 3.8 shows the variation of Ex , Ez near the
metal-dielectric interface.
92 Surface Waves
Fig. 3.7 The surface wave Ex -field amplitude at the interface (z = 0), propagating in the
x direction. The distance along which the amplitude decreases to 1/e of the initial value at
x = 0 is called the “effective” length.
d2 x
me = −eEx (3.57)
dt2
and if the E-field is time-harmonic, E = E0 e−iωt , we have
−ω 2 me x = −eEx (3.58)
e2
P = −Ne E (3.59)
me ω 2
with Ne the electron concentration of the plasma. From the definition of the displace-
ment field D and the constitutive relation between D and E we have
Plasmon Surface Wave Dispersion 93
Fig. 3.8 Instantaneous distribution of the electric field Ex and Ez in the x and z directions,
respectively. The signal variation of the field Ez along x is accompanied by the variation of
electric charge (+,-) corresponding to the condition of Ez field discontinuity. The represen-
tation of surface charge density along z is exaggerated for clarity.
D = 0 E + P (3.60)
D = 0 E (3.61)
P Ne e2
(ω) = 1 + =1− (3.62)
0 E 0 me ω 2
The characteristic plasma frequency is defined by
Ne e 2
ωp2 = (3.63)
0 m e
So finally we have the lossless dielectric constant
ωp2
(ω) = 1 − (3.64)
ω2
3.4.2 The Drude Model of Metals
The free-electron plasma model describes harmonic electron motion in the limit of
a negligible restoring force acting on the electron and no “collisional” or dissipative
losses. The Drude model does not introduce a finite restoring force, but does add
dissipation to the equation of motion for the electrons in the plasma gas (when a
94 Surface Waves
linear restoring force is introduced into the conduction electron equation of motion,
the theory is called the Drude-Lorentz model of metals). The equation of motion
for the simple Drude model of the conduction electrons can still be described by a
one-dimensional driven harmonic oscillator expression, where the driver issues from
an electromagnetic field incident at the dielectric-metal boundary. If we assume the
incident radiation is polarized along x, then the Drude model equation of motion can
be written
d2 x dx
me 2 + me Γ = −eE(x, t) = −eE0m e−iωt (3.65)
dt dt
where me is the electron mass, Γ is a phenomenological damping constant, and eE0m e−iωt
the driving force of an E-M field in the metal with amplitude E0m and frequency ω.
Again dropping the common oscillatory phase term, the solutions for the position,
velocity and and acceleration are
1
x= eE0m (3.66)
me (ω 2 + iΓω)
dx iω
=− eE0m (3.67)
dt me (ω 2 + iΓω)
2
d x ω2
=− eE0m (3.68)
dt2 me (ω 2 + iΓω)
Again we write the polarization as a density of dipoles, Px = −Ne ex, and now the
dielectric constant is complex
Ne e 2 ωp2
(ω) = 1 − = 1 − (3.69)
0 me (ω 2 + iΓω) ω 2 + iΓω
ω 2 − ωp2 + Γ2 ωp2 Γ
(ω) = + i (3.70)
ω 2 + Γ2 ω(ω 2 + Γ2 )
propagation constant k of the light in the material. There is a simple chain of relations
between the propagation constant k(ω) and the dielectric constant (ω). First
2π λ0
k= and λ= (3.72)
λ n
where λ0 is the free-space wavelength and n the index of refraction. Then the material
index of refraction can be complex, n = η + iκ and is related to the dielectric constant
by
p
n(ω) = (ω) (3.73)
p
0 00
η(ω) + iκ(ω) = (ω) + i (ω) (3.74)
and therefore
η 2 − κ2 = 0 (ω) (3.75)
00
2ηκ = (ω) (3.76)
1
0m = 1 − 2 (3.78)
y
1 Γ
00m = 3 (3.79)
y ωp
and taking the propagation constant of the surface wave along the dielectric-metal
interface as ks = k0 ns . The factor ns is the surface of index of refraction, the real and
imaginary parts of which we write from Eqs. 3.53, 3.54.
" 1/2 3/2 #
d 0m 00m d 00m
ks = k0 ns = k0 +i 0 2 (3.80)
d + 0m 2(m ) d + 0m
ks = ks0 + iks00 (3.81)
Thus
96 Surface Waves
Fig. 3.9 Real (panel a) and imaginary (panel b) terms of the index of refraction and the
dielectric constant calculated from the Drude model (Eq. 3.71) vs. frequency ω normalized to
ωp , the plasma frequency for silver. The relaxation rate Γ is taken to be Γ = 1014 s−1 and
ωp = 1.35 × 1016 s−1 .
Fig. 3.10 Dispersion curve for a silver-glass interface. The real dielectric constant for glass
is taken to be 0 = 2.25. The loss term in the metal is not included.
Plasmon Surface Wave Dispersion 97
1/2
d 0m
ks0 = k0 (3.82)
d + 0m
3/2
00m d 00m
00
ks = (3.83)
2(0m )2 d + 0m
ks0 c k0
x= = s (3.84)
ωp kp
The solution constitutes the dispersion curve (in “reduced” units) of the frequency vs.
the propagation parameter for the surface wave. The reduced variable x is defined in
terms of the real part of the propagation parameter ks0 and therefor this expression de-
scribes only the real part of the dispersion curve. The solution consists of two branches,
plotted in Fig. 3.10. The lower branch describes the surface-wave dispersion from the
long wavelength limit (y → 0; x → 0) to an asymptote (y → ysp = ωsp /ωp ; x → ∞).
This frequency asymptote at the short wavelength limit corresponds to the surface
plasma resonance where the surface charge density oscillates collectively over the en-
tire surface and the phase velocity of the surface wave dω/dks slows to near zero. An
expression for this surface plasma asymptotic frequency can be obtained from Eq. 3.86.
ωp
ω → ωsp = √ (3.87)
1 + d
The surface plasma resonance is often called the surface plasmon resonance and bares
a simple relation to ωp , the bulk plasmon resonance derived from the free-electron
plasma model. Between these two frequencies there is no charge density surface wave
or bulk wave propagation. This “forbidden zone” is sometimes termed a stopband. At
frequencies above ωp propagation once again occurs on the surface and in the bulk.
This frequency limit corresponds to the point where 0m → 0. Below ωp , 0m < 0 and
above ωp , 0m > 0. Also shown in Fig. 3.10 is the linear “light line”, the dispersion
curve of light freely propagating in the dielectric medium, ωll = k0 c/η.
Inclusion of the imaginary part of the surface parameter ks in the definition of the
reduced variable x in Eq. 3.84 and subsequent solution of Eqs. 3.85, 3.86 modifies the
dispersion curve and opens propagation in the stopband. The real and imaginary parts
of the full dispersion curve are shown in Fig. 3.11. Note that in panel (a) of this figure
98 Surface Waves
Fig. 3.11 Panel (a): dispersion curve for the real part of the surface propagation parameter
ks0 . Panel (b): Dispersion curve for the imaginary part of the surface propagation parame-
ter ks00 . Both propagation parameters have been normalized to kp = ωp /c. Inclusion of the
imaginary term results in finite propagation in the stopband (compare with Fig. 3.10).
Conduction Current in a Drude Metal. The conduction current in the metal is given
by
dx ω
Jc = eNe = eNe eE0m (3.89)
dt me (Γω − iω 2 )
and with the electron density related to the bulk plasma frequency, Eq. 3.63, we have
ωp2
Jc = 0 E0m (3.90)
(Γ − iω)
From the standard constitutive relations
σ
Jc = σE = 0 E (3.91)
0
and so, comparing Eqs. 3.90, 3.91, the expression for the complex conductivity can be
expressed as
σ Γ ω
= 2 + i (3.92)
0 Γ
2 +
ω 2
2
Γ2
2 +
ω2
2
ωp ωp ωp ωp
We can compare the damping rate Γ to the plasma frequency ωp if we can estimate
the latter, and from Eq. 3.63 we see that this requires knowledge of the conduction
Energy Flux and Density at the Boundary 99
Fig. 3.12 Indication of the components of the Poynting vector of the dielectric-metal inter-
face in which a wave propagates SPP.
electron density. A typical “good” metal such as silver exhibits an electron density
Ne ' 6 × 1028 m−3 . and therefore ωp ' 1.5 × 1016 s−1 . The damping due to radiative
or collisional loss is typically ∼ 1014 s−1 , and therefore Γ << ωp . Therefore in the
optical regime we can write
σ Γωp2 ωp2
' 2 +i (3.93)
0 ω ω
This last expression for the conductivity shows that at relatively low frequencies (RF,
microwave, radio wave) the real term dominates, and the conductivity is in phase
with the driving field and “ohmic”. As the frequency increases into the optical range,
however, the electrons can no longer follow the driving field in phase, and the imaginary
term dominates. At ω ' 1015 s−1 , typical of the visible optical regime, the real term is
only about 5 per cent of the imaginary term. Figure 3.10 plots the dispersion relation
of the Drude model, Eq. 3.71
S=E×H (3.94)
and the energy flux of an electromagnetic wave averaged over an optical cycle is given
by
1
S = Re[E × H∗ ] (3.95)
2
Figure 3.12 shows the x and z components of the energy flow on the two sides of the
surface between a dielectric and a metal. Writing out these components explicitly from
100 Surface Waves
Table 3.1, we have at normal incidence the transmitted energy flux impinging at the
surface from the dielectric into the metal along the z direction.
d
1 1 kz
Szd = Re Exd Hy∗ = Re 2
H0y (3.96)
2 2 d 0 ω
m
m 1 m ∗ 1 kz 2
Sz = Re Ex Hy = Re H0y (3.97)
2 2 m 0 ω
From the continuity relation at the surface, Eq. 3.33, we see immediately that the en-
ergy flux is continuous across the boundary in the z direction. On the dielectric side
the permittivity 0 d and the propagation constant kzd are real (for a lossless mate-
rial), and therefore the expression for the energy flux is also real. On the metal side
the dielectric constant is complex, m = 0m + i00m , as is kzm = k0 nm = k0 (ηm + iκm ).
The metal index of refraction nm is related to the metal dielectric constant through
√
nm = m , and therefore the expression for the product of Exm and Hy∗ in Eq. 3.97
exhibits real and imaginary terms. The real part of Eq. 3.97 represents the transmit-
ted, exponentially decreasing energy flux into the metal. In order to obtain explicit
z
expressions for nm and km we need to examine the propagation of electromagnetic
waves inside real metals.
Plane-wave solutions to these equations for waves propagating in the z direction can
be written
m
Em = E0m ei(kz z−ωt)
(3.104)
i(kzm z−ωt)
Bm = B0m e (3.105)
The amplitudes E0m , B0m and the propagation vector kzm are generally complex quan-
tities. Substituting these solutions back into the uncoupled wave equations 3.102, 3.103,
results in an expression for the propagation parameter in terms of the permeability,
complex permittivity, and complex conductivity of the metal,
2
(kzm ) = µm m ω 2 + iµm σω (3.106)
with
kzm = kr + iki m = r + ii σ = σr + iσi (3.107)
Equating real and imaginary terms after substitution into Eq. 3.106 yields
where we have set µm = µ0 . Now writing the complex permittivity in terms of the
dielectric constants, r = 0 1 and i = 0 2 we have
σi σi
kr2 − ki2 = µ0 0 ω 2 1 − = k02 1 − (3.110)
0 ω 0 ω
σr σr
2kr ki = µ0 0 ω 2 2 + = k02 2 + (3.111)
0 ω 0 ω
But the complex propagation parameter km can also be expressed in terms of the
complex index of refraction, nm .
102 Surface Waves
Fig. 3.13 Design of a planar waveguide formed by three zones of different permittivity 1 , 3 ,
the “cladding” or shell can be considered metallic and 2 , the “core,” a dielectric. This type
of structure is sometimes termed a MIM waveguide (metal-insulator-metal). The complemen-
tary structure, IMI waveguides, are also possible. Propagating modes can be symmetric or
antisymmetric with respect to the core centerline.
rameter q as a function of the core width a. Here we carry out in some detail the
matching operation for TM modes as an illustrative example.
Now we write down the real field components in the transverse z direction in each
of the three zones indicated in Fig. 3.13, while expressing the propagation in the x
direction as eikx .
For z < 0
Hy = Beikx epz (3.131)
−iB ikx pz −ip
Ex = pe e = Hy (3.132)
ω3 0 ω3 0
−Bk ikx pz −k
Ez = e e = Hy (3.133)
ω3 0 ω3 0
For 0<z<a
Hy = Aeikx cos(qz + ϕ) (3.134)
i
Ex = qAeikx sin(qz + ϕ) (3.135)
ω2 0
−k ikx −k
Ez = A cos(qz + ϕ) = Hy (3.136)
ω2 0 ω2 0
For z > a
Hy = C ikx e−rz (3.137)
iC ir
Ex = reikx e−rz = Hy (3.138)
ω1 0 ω1 0
−Ck ikx −rz −k
Ez = e e = Hy (3.139)
ω1 0 ω1 0
Plasmon Surface Waves and Waveguides 105
and since the channel width cannot be negative, this result indicates that TE modes
are not supported in the deep subwavelength limit.
3.7.2 TM Modes
TM polarization specifies the existence of Ex , Ez , Hy polarized amplitudes. We seek
solutions to the scaler wave equation in the region above the 0 /d interface and in the
region within the d dielectric slab. We match these solutions at the interface.
Surface Waves at a Dielectric Interface 107
Fig. 3.14 Surface waves propagate along z at the interface between vacuum 0 and material
dielectric d . The ground plane truncates wave propagation in −x direction and sets field
amplitudes to zero at the d -ground plane interface.
where kd2 = d k02 and Ez = Ez (x)eiβz . We assume that there is no variation in the
fields along the y direction. Within the dielectric slab the general solution, in terms of
linear combinations of complex traveling waves, is
where
kc2 = kd2 − β 2 = d k02 − β 2 (3.160)
Above the interface the E-field of the surface wave must decrease exponentially,
and the form of the solution must be
where
κ2 = β 2 − k02 (3.162)
and Eza (x)is the phasor amplitude in the air above the dielectric slab. We anticipate
that the propagation velocity of the surface wave will be somewhat slower than a wave
of the same frequency propagating in air since a significant faction of the surface wave
amplitude will be immersed in the dielectric slab. Therefore we expect β > k0 and
Eq. 3.162 is written so that κ is real and can be be chosen with positive sign. This
108 Surface Waves
choice assures that Eq. 3.161 represents an exponentially decaying amplitude in the
+x direction.
At x = 0 the boundary condition requires Ezd (0) = 0. Therefore B = A and
Where Ezd (x) is the amplitude for the surface-wave phasor Ez (x) = Ez (x)eiβz . At x = t
the amplitudes for the E-field amplitudes on both sides of the interface have to match.
The H-field amplitudes have to match as well, and we obtain the H-fields from the
E-fields by using the relations obtained in Chapter 4, Section 4.5.1. In the dielectric
slab we have
i ∂Ez
Hyd (x) = ω0 d
kc2 ∂x
i
= ± ω0 d A2i cos kc x (3.165)
kc
and on the air side
i
Hya (x) = − ω0 Ce−κx (3.166)
κ
At the interface H-field amplitudes must match. Since Hya (x) has a negative amplitude,
we must choose the negative amplitude solution for Hyd (x).
i i
− ω0 d A2i cos kc t = − ω0 Ce−κt (3.167)
kc κ
Now divide Eq. 3.164 by Eq. 3.167, the two matching equations at the interface. The
result is
kc tan kc t = d κ (3.168)
which provides a relation between the wave vector kc , the thickness of the dielectric
slab t, the damping constant κ, for a given slab material with dielectric constant d .
Equations 3.159, 3.160 specify that kc is the wave vector component along x of that
part of the surface wave within the dielectric slab. We can write κ in terms of kc and
k0 by using the fact that the propagation parameter β along z must be the same for
the two parts of the surface wave: one part in the dielectric slab and the other in the
air. We can therefore eliminate β from Eqs. 3.160, 3.162 resulting in the expression
In principle we could eliminate κ between Eqs. 3.168, 3.169 and obtain an expression
for kc as a function of three parameters: the free-space wave vector k0 , the material
dielectric constant d and the slab thickness t. However, Eq. 3.168 is a transcendental
equation with no simple analytic solution for kc . The best we can do is find solutions
for kc numerically and plot them to gain some insight into the properties of the TM
modes.
Surface Waves at a Dielectric Interface 109
Fig. 3.15 The expressions y1 , y2 plotted as a function of kc t. The function y2 is plotted for
a range of thicknesses t from 100 nm to 1000 nm. Dashed curves show four representative
solutions where the thickness t permits modes TM0 and TM1 . Note that TM0 has no cutoff
thickness but TM1 cuts off at kc t = π. Intersection of the curves y1 , y2 > 0 constitute solutions
to the transcendental equation 3.170.
One way to plot the solutions is to divide both sides of Eq. 3.168 by kc and rearrange
it as
d κ
tan(kc t) = (3.170)
kc
and consider kc as an independent variable y. We can then plot both sides of the
equation, and identify solutions to the transcendental equation as the points of inter-
section.
y1 = tan(kc t) (3.171)
1/2
d κt d k02 (d − 1) − kc2 t
y2 = = (3.172)
(kc t) (kc t)
Figure 3.15 shows schematically the appearance of the two families of curves y1 , y2
when plotted against kc t. Clearly y1 is just a plot of the tangent function with branches
starting at y1 = 0 for kc t = 0, π, 2π . . . It is also clear that only positive values of y1
can provide legitimate solutions to Eq. 3.168 since the numerator of y2 consists of three
factors all of which are intrinsically positive. For fixed d and k0 , y2 , as a function of
110 Surface Waves
(kc t), is a family of curves parametrically increasing with the slab thickness t. From
Eq. 3.172 we see that y2√→ ∞ as kc t → 0 and that y2 crosses 0, becoming pure
imaginary, when kc = k0 d − 1. Intersections of y2 with y1 within the positive half
of the first branch (0 ≤ kc t ≤ π/2) constitute solutions to the transcendental equation
3.168 belonging to the lowest TM mode, TM0 . We see immediately that the TM0 mode
has no cutoff for kc as kc t → 0. For fixed fixed kc , t and as k0 increases, y2 will begin
to intersect y1 both in the first and in the second positive branch (π ≤ kc t ≤ 3π/2).
These intersections in the second branch constitute √ TM1 solutions to 3.168. Higher
order TM modes begin to propagate as the kc = k0 d − 1 upper limit increases with
k0 . From Fig. 3.15 it clear that the cutoff points for TM1 , TM2 , . . . occur at kc t = nπ
with n = 1, 2, 3, . . . The general expression for the cutoff frequency (in Hz units) is
given by
kc
νc = √ (3.173)
2π µ
√
where 1/ µ is the local, “effective” speed of light. For the TM modes of this surface
wave, therefore, we have
nc
νc = √ n = 0, 1, 2, . . . (3.174)
2t d − 1
where d − 1 represents an “effective” dielectric constant for the TM modes in the 1-D
surface wave.
iβ 0
Ex (x, z) = ± A cos(kc x)eiβz 0≤x≤t (3.177)
kc
iβ
Ex (x, z) = ∓ A0 sin(kc t)e−κ(x−t) eiβz x≥t (3.178)
κ
and for Hy
iω0 d 0
Hy (x, z) = ± A cos(kc x)eiβz 0≤x≤t (3.179)
kc
iω0 0
Hy (x, z) = ∓ A sin(kc t)e−κ(x−t) eiβz x≥t (3.180)
κ
Surface Waves at a Dielectric Interface 111
3.7.4 TE Modes
In contrast to the case of plasmon surface waves, the 1-D waveguide between two
dielectrics can support TE waves as well as TM waves. The fields specified by TE
polarization are Hz , Hx , Ey and the scaler wave equation above the dielectric slab is
written 2
∂ x 2
− κ Hz = 0 (3.181)
∂x2
and, since the dielectric slab is nonmagnetic, within the slab the wave equation is
2
∂ x 2 2
+ k0 − β Hz = 0 (3.182)
∂x2
The form of the solutions within the slab is
with kc2 = k02 − β 2 . On the air side above the slab we again posit an exponentially
decreasing amplitude.
Hza (x) = Ce−κx (3.184)
Again at x = 0 the E-field parallel to the ground plane must vanish. Obtaining the
E-field from the H-field using the relations in Section 4.5.1, we have
i ∂Hz
Eyd (x) = − 2
ωµ0
kc ∂x
ωµ0
=∓ A2i sin kc x (3.185)
kc
and in the air space above the dielectric slab
i
Eya (x) = ωµ0 Ce−κx (3.186)
κ
The expression for Eyd in Eq. 3.185 implies that the general solution for Hzd (x) in
Eq. 3.183 specializes to
Hzd (x) = ±A2 cos kc x (3.187)
and that therefore the H-field is maximum at x = 0, the ground-plane boundary.
At first thought this condition might seem perplexing since no field should exist in
the ground plane. What the boundary condition really means is that an infinitesimal
distance above the boundary the H-field amplitude is ±2A and an infinitesimal distance
below the boundary the H-field is null.
At the boundary x = t, Hz and Ey must match.
κ
− cot kc t = (3.190)
kc
Again we set
y1 = − cot kc t (3.191)
s
κt k02 (d − 1)
y2 = = −1 (3.192)
kc t kc2
and plot y1 , y2 with kc t the independent variable. Solutions to 3.190 are at the positive
intersections of y1 , y2 . The general form of the solutions is shown in Fig. 3.16. In the first
zone 0 ≤ kc t ≤ π/2 no TE modes are possible because all the y1 solutions are negative
and therefore unphysical. The first TE mode, TE1 appears in the zone π/2 ≤ kc t ≤ π.
From Figs. 3.15, 3.16 we see that the energy ordering of the first three modes is TM0 ,
TE1 , TM1 . Subsequent TE modes first appear at kc t = 3π/2, 5π/2, 7π/2 . . . The cutoff
frequencies are then
nc
νc = √ n = 1, 3, 5, . . . (3.193)
4t d − 1
Exercises 113
3.8 Exercises
1. It is a well-known fact that aluminum (Al), soon after exposure to air, exhibits a
transparent aluminum oxide layer (Al2 O3 ). This oxide layer is about 100 nm thick
and has an index of refraction nAl2 O3 = 1.77. What would be the effective index
of refraction of a surface plasmon polariton wave propagating at the metal/metal
oxide interface if the incident light has a wavelength of 400 nm? The index of
refraction of Al at = 400 nm is n = 0.381 + i4.883.
2. A surface plasmon polariton (spp) wave propagates at the interface between air
and a gold (Au) layer. Calculate the phase and group velocity if the incident
wavelength is 780 nm. In the common case of good conductors where |0m | 00 ,
the group velocity is, to good approximation,
c
vg =
nspp
(s 0 3/2 0 )−1
d 0m
ω d m 1 dm 1 dd
'c + · · 02 + 2 (3.200)
d + 0m 2 d 0m m dω d dω
and from the Drude model the complex dielectric constant for Au can be expressed
as
ω 2
p
0m (ω) = 1 − (3.201)
ω
!
ωp2
00m = (3.202)
ω3 τ
114 Surface Waves
4.1 Introduction
In chapter 2 we summarized the elements of electromagnetics in terms of electric and
magnetic vector force fields related by Maxwell’s equations. We saw in section 2.4 that
the two time-varying equations, Faraday’s Law, Eq. 2.27 and the Maxwell-Ampère
Law, Eq. 2.28, give rise to electromagnetic wave solutions that propagate through a
medium or, as we saw in chapter 3, at the interface between media. These solutions
are generally distributed through out a spatial extent large with respect to the char-
acteristic wavelength.
In contrast the conventional radio and microwave circuit theory of electrical en-
gineering considers time-varying electromagnetic phenomena from the standpoint of
various combinations of lumped elements, (capacitors, inductors, resistors) localized in
space and linked by interconnects of negligible impedance. Furthermore, the spatial
extent of these circuit elements, together with their sources of voltage and current,
oscillating from tens of kilohertz to hundreds of megahertz, is usually much smaller
than the characteristic wavelength. The result is that all the elements of the circuits
are subject to the same time dependence without retardation. This time behavior
is termed quasistatic, and the physical arrangement of the circuit can be said to be
subwavelength. The subwavelength scaling similarity between dielectric and metallic
nanostructures interacting with optical driving fields and electrical circuits driven by
conventional voltage and current oscillators suggests that circuit analysis might find
useful application in the design of functional plasmonic and photonic devices. The
main goal of this chapter therefore is to explore this possibility.
Fig. 4.1 Schematic of an electrical circuit showing lumped elements of resistor, inductor,
capacitor in series with a voltage source. Also shown is the direction of current relative to
the polarity of the voltage source.
(2) The scalar sum of charge currents Ii flowing out of a junction node is null.
X
Ii = 0 (4.2)
i
Kirchhoff’s Voltage Rule. We consider first the voltage rule which is the more relevant
of the two for our purposes. Equation 4.1 derives from the differential form of Faraday’s
Law. By applying Stokes’ Theorem to Eq.2.27 we have
Z I Z
∂B
(∇ × E) · dS = E · dl = − · dS (4.3)
S S ∂t
where E, B are the usual electric and magnetic induction fields, S is the outward
surface normal through which the curl of E protrudes, and l is the line around the
boundary of S, the positive direction of which is taken by using the right hand rule.
With respect to Fig. 4.1 we take the line integral around the circuit and define the
voltage difference between two reference points a, b as
Z b
Vba = − E · dl (4.4)
a
Z b Z d Z f Z h Z Z
∂B ∂
− E · dl − E · dl − E·− E · dl = · dS = B · dS (4.5)
a c e g S ∂t ∂t S
or Z
∂
V0 (t) + Vdc + Vf e + Vhg = B · dS (4.6)
∂t S
where V0 (t) is the voltage source connected to terminals a, b, Vdc is the resistive voltage
drop, Vf e and Vhg the inductive and capacitive voltage drops, respectively. In order
to identify Eq. 4.6 with the Kirchhoff rule we have to make two assumptions: (1) that
there is no changing magnetic flux traversing the plane of the circuit loop. In simple
circumstances where we only consider R, L, C lumped circuits, this condition is not
hard to realize. Even if there is a changing magnetic field present, it can usually be
incorporated into the inductive voltage term to make an “effective” voltage drop. (2)
Voltage drops along the wire connecting the lumped elements in the loop are negligible.
Resistance. We know from Ohm’s law the current density J through a conductor is
proportional to the E-field along the conductor, J = σE, where σ is the conductivity.
Then the voltage drop across the resistor is
Z d Z d
J
Vdc = − E · dl = − · dl (4.7)
c c σ
R
We also know that J·dl can be written in terms of the total current I flowing through
the conductor and the resistivity ρ, the inverse of the conductivity
Z d
I
Vdc = − ρ dl = −IR (4.8)
c A
and A is the cross sectional area of the conductor. The familiar expression Eq. 4.8 is
valid at low frequencies where the current flows through the conductor uniformly over
A. At high frequencies the current in not uniform across A, passing only near the
surface and within the skin depth.
Induction. Here we apply Faraday’s law to the local inductive element labeled L in
Fig. 4.1 (not the whole circuit loop). The integral has to be over a closed path linking
the terminals e, f . The first branch of the loop goes through the current-carrying coil
from e to f . The second branch is the return path from f to e along an arbitrary path
in the space outside the coil. This second branch contributes nothing to the voltage
drop across the coil.
I Z f Z e
E · dl = E · dl + E · dl = 0 (4.10)
e f
118 Waveguides and Equivalent Circuits
Then Z f Z
∂
Vf e = E · dl = − B · dS (4.11)
e ∂t S
With the definition of the inductance L as the integral of the magnetic induction B
over the surface surrounding the inductive element, per unit current I passing through
the coil of the inductor, we have
R
B · dS
L= (4.12)
I
Therefore
∂(LI) ∂I
Vf e = − = −L (4.13)
∂t ∂t
Capacitance. We assume that the capacitive element can be represented by a par-
allel plate capacitor. From elementary electrostatics we know that the definition of
capacitance is the total charge Q accumulated on one plate divided by the voltage Vhg
across the plate.
Q
C= (4.14)
V
But Z
Q = I dt (4.15)
and thus the voltage drop across the capacitor in terms of the capacitance C and the
current looping through the circuit is
R
I dt
Vhg = −Vgh = − (4.16)
C
Putting all three lumped circuit elements together with the positive voltage source
Vba = Vs and using the Kirchhoff voltage rule, we have
Z
dI 1
Vs − IR − L − I dt = 0 (4.17)
dt C
Kirchhoff’s Current Rule. Kirchhoff’s current rule, that the charge currents Ii flowing
into and out of a circuit junction sum to zero, is illustrated in Fig. 4.2. The basis for
Eq. 4.2 is essentially the charge conservation relation that derives from the Ampère-
Maxwell equation
∂D
∇×H=J+ (4.18)
∂t
Taking the divergence of both sides of this equation and remembering that div·curl = 0
we find
∂∇ · D
∇·J+ =0
∂t
∂D
∇· J+ =0 (4.19)
∂t
Elements of Conventional Circuit Theory 119
Fig. 4.2 Circuit junction point from which currents enter and leave. The Kirchhoff current
rule states that the sum of the currents into and out of the junction must be zero.
Fig. 4.3 Lumped circuit element representation of a transmission line repeating segment
with length ∆z. The transmission line extends along the z direction
become significant and in this regime Kirchhoff’s current rule has to be checked on a
case-by-case basis. We shall assume as a working assumption that Kirchhoff’s current
rule is valid and write I
∂D X
J+ · dl ' Ii ' 0 (4.23)
c ∂t i
case) the forward and backward amplitudes will be related by a reflection coefficient.
From Eq. 4.28 we can write I(z) in terms of V0+ and V0− .
dV /dz iβ − γ +
I(z) = I0+ + I0− = − V0 − V0−
=− (4.38)
R − iωL R − iωL
In analogy to Eq. 2.198 we define an impedance Z0 as the ratio of the forward voltage
amplitude to the forward current amplitude.
V0+
Z0 = (4.39)
I0+
where we have used the negative root of β̃ 2 from 4.33 in the denominator of 4.40.
The transmission line current can now be written in terms of the voltage and the
characteristic impedance.
1 h + β̃z i
I(z) = V0 e + V0− e−β̃z (4.41)
Z0
In most practical transmission lines R and G are very small, and in the limit of a
lossless transmission line we find r
L
Z0 → (4.42)
C
4.3.2 Lossless Plane-Parallel Transmission Line
If we start with a lossless transmission line as the point of departure, we can calculate
the inductance per unit length, a function of the transmission line geometry, directly
from Faraday’s law. Figure 4.4 shows the magnetic flux lines between two parallel
plates carrying a constant current I in opposite directions. The separation between
the plates is given by s and the width of the plates is w. The ratio of w/s is sufficiently
great that the magnetic flux density B is mostly concentrated between the plates and
can be considered constant. The magnetic field between the plates can be related
to the current flowing in one of the plate conductors through the integral form of
the Maxwell-Ampère law, Eq. 2.28. Using Stokes’ theorem as we did in Eq. 4.3 for
Faraday’s law, we write
Z I Z
(∇ × H) · dS = H · dl = J · dS (4.43)
S
Applying this relation to the cross sectional area ABCD of Fig. 4.4 we have
H0 w = I (4.44)
Transmission Lines 123
Fig. 4.4 Idealized parallel-plate transmission line with no resistive losses in the current-car-
rying plates and no shunt conductance between the plates.
R
where I = J · dS, the integral of the current density over ABCD. Now R we substitute
the definition of inductance L from Eq. 4.12 into Eq. 4.44, taking the B · dS over the
surface shown in Fig. 4.4. The inductance per unit length along z is then
dL s
= L̄ = µ0 (4.45)
dz w
Voltage and Current along a Transmission Line. Equation 4.45 expresses the constant
distributed inductance along the planar wave guide of Fig. The voltage gradient along
the guide is then obtained from Eq. 4.13
∂V ∂I
= −L̄ (4.46)
∂z ∂t
and the current gradient from the distributed capacitance, obtained from Eq. 4.17.
∂I ∂V
= −C̄ (4.47)
∂z ∂t
4.3.3 Correspondence to Plane Waves
Transmission Line Voltage and Current Waves. Equations 4.46, 4.47 bear a close re-
semblance to Eqs. 2.188–2.191. Just as we found for Eqs. 4.30, 4.31 differential “Helmholtz-
like” equations for V and I separately can be obtained for the lossless transmission
124 Waveguides and Equivalent Circuits
line following the same procedure that produced Eqs. 2.193, 2.194 for plane waves.
Differentiation of 4.46 by z and 4.47 by t results in
∂2V ∂2V
2
− LC 2 = 0 (4.48)
∂z ∂t
and eliminating V in favor of an equation for I by reversing the differentiation variables
for each equation results in
∂2I ∂2I
− LC =0 (4.49)
∂z 2 ∂t2
Just as in the plane-wave case we associate LC with 1/v 2 where v is the phase velocity
of the V − I transmission wave running in the two-slab transmission line of Fig. 4.4.
Furthermore if we assume time-harmonic phasor solutions to Eqs. 4.48, 4.49 we can
write the solution to the voltage equation as
V = V0 ei(βz−ωt) (4.50)
∂2V
+ LCω 2 V = 0 (4.51)
∂z 2
We see therefore that the phase velocity v and propagation parameter β for the V − I
excitation of the transmission line is
1 ω √
v=√ and β = = LCω (4.52)
LC v
Given the voltage solution, Eq. 4.50, we can find the current by substituting 4.50 into
4.46 and integrating with respect to time.
∂I 1 ∂V 1
=− = − iβV (4.53)
∂t L ∂t L
then Z r
1 1β C
I = − iβ V dt = V = ·V (4.54)
L Lω L
The ratio of the voltage to current amplitudes has units of resistance and is identified
with the characteristic impedance of the transmission line.
r
V L
Z0 = = (4.55)
I C
which is, as expected, in agreement with Eq. 4.42.
load impedance ZL across the line conductors, the “impedance mismatch” will provoke
some fraction of the incident wave to reflect at the load position. the transmission line
is said to be “terminated” by ZL . Suppose we put the load at z = 0. We know that in
general V and I on the otherwise lossless line are related by
V (z) = V0+ eiβz + V0− e−iβz (4.56)
1 + iβz
V0 e − V0− e−iβz
I(z) = (4.57)
Z0
At z = 0 we must have
V (0) V + + V0−
ZL = = 0+ Z0 (4.58)
I(0) V0 − V0−
so the amplitude of the reflected wave in terms of the line and load impedances is
ZL − Z0
V0− = V0+ (4.59)
ZL + Z0
Now, in analogy to Eqs. 2.240, 2.241 for plane wave reflection, we define2 a reflection
coefficient Γ as
V− Z0 − ZL
Γ ≡ − 0+ = (4.60)
V0 Z0 + ZL
The voltage and current on the transmission line are then given by
V (z) = V0+ eiβz − Γe−iβz
(4.61)
Fig. 4.5 Transmission line impedance Zline (−d) with the line terminated by a load ZL 6= Z0 .
The impedance mismatch sets up a reflection on the line and a standing wave due to the
superposition of V + (z) and V − (z) waves.
Fig. 4.6 Two transmission lines joined at z = 0. The line to the left (z < 0) is characterized
by line impedance Z1 , and the line to the right (z > 0) by Z2 . Reflection coefficient Γ and
transmission coefficient T determine the amplitude of the reflected and transmitted waves,
respectively.
The transmission junction is illustrated in Fig. 4.6. On the z > 0 side, the voltage
travels only in the +z direction and the transmitted amplitude is some fraction T of
V1+ . The transmitted wave is then represented by
V1 (z = 0) = V2 (z = 0) (4.70)
V1+ (1
− Γ) = TV1+ (4.71)
2Z2
=T (4.72)
Z1 + Z2
So we find that in terms of the characteristic impedances in the two lines the reflection
and transmission coefficients are analogous to reflection and transmission of plane
waves at a material surface.
Z1 − Z2
Γ= (4.73)
Z1 + Z2
2Z2
T= (4.74)
Z1 + Z2
4.3.4 Equivalence of Plane Waves and Transmission Lines
The correspondence between electromagnetic plane wave propagation (Section 2.10,
Fig. 2.8) through media characterized by µ, and voltage/current transport along a
lossless transmission line (Section 4.3.2) characterized by L, C is indicated in Table 4.1
below. Analogous Helmholtz-like wave equations govern propagation, reflection, and
transmission in both cases.
128 Waveguides and Equivalent Circuits
Table 4.1 Equivalent quantities between plane waves and transmission lines
Note that at the load point (z = 0) the voltage is null and the current is maximum as
would be expected from a short circuit. Note also that since the product V (z)I ∗ (z) is
pure imaginary no power is delivered to the load. The impedance along the line, from
Eq. 4.63 is
Zline = −iZ0 tan βd (4.77)
As expected the voltage is maximum at the load point and the current is null. The
expression for the impedance along the line, from Eq. 4.63, is
Fig. 4.7 Diagram of how a segment of transmission line with impedance Z1 and length λ/4
can be used to match an incoming transmission line Z0 with a load ZL .
Zline = ZL (4.81)
This quarter-wave transformer can be used as a λ/4 length of line, with impedance
Z1 to match an input transmission line of impedance Z0 to a given load ZL . Figure
4.7 shows how this matching can be accomplished. In order to suppress reflection at
the Z0 /Z1 junction, Γ must be set equal to zero there. Therefore,
and p
Z1 = Z0 ZL (4.85)
The line impedance at the quarter-wave point, Eq. 4.84, is called a quarter-wave trans-
former because the load impedance is “transformed” by the square of the characteristic
impedance of the transmission line.
130 Waveguides and Equivalent Circuits
4.5 Waveguides
Up to now we have studied plane waves in Chapter 2 and transmission lines in the
present chapter. Waves propagating within or on guiding structures, which essentially
define the transverse field boundary conditions, are related to unbounded plane waves
and even more closely related to transmission lines. But guided waves also exhibit
unique characteristics that require extension of our familiar ideas of voltage, current,
capacitance and inductance. Waveguiding at microwave frequencies was developed for
radar during WWII, but now integrated optical circuits at the nanoscale use waveg-
uides to transport optical signals onto and around integrated microchips. We discuss
in this section the elements of waveguide theory and the relation guided waves bear
to plane electromagnetic waves and to transmission lines.
The prototypical transmission line, consisting of two parallel slab conductors sep-
arated by a dielectric, is shown in Fig. 4.4. In a sense this structure can be considered
a wave guide since the E-field terminates on the two conducting slabs separated by a
constant distance in the y − z plane, although the H-field is unbounded along y. It is
clear from the front and side views, that the E- and H-field lines are confined to the
x−y plane with no field components along the direction of propagation, z. Waves with
this property are called transverse electromagnetic (TEM) waves. An ideal plane wave
propagating in space also exhibits phase fronts confined to the plane orthogonal to
the propagation direction, so they are TEM waves as well. Note that if we added two
conducting slabs in the x − z plane so as to enclose a volume with a single conductor
of rectangular x − y cross-section, no TEM field could exist since the four conducting
sides would define an interior space of constant voltage (and hence a null E-field).
However, waveguides with rectangular, cylindrical, and even arbitrarily shaped cross-
section do exist, but in order to understand them we have to extend our ideas beyond
the familiar TEM wave. Waves propagating within structures of a single conductor
are classified into two types: transverse electric (TE) waves with an H-field component
along the propagation direction z (but E-field only in the transverse x − y plane), and
transverse magnetic (TM) waves with an E-field component along the propagation
direction (but only with transverse H-field components).
4.5.1 Field Solutions for TM and TE Waves
We assume field solutions to the Helmholtz equation that will have some spatial varia-
tion in the x − y plane, subject to the boundary conditions of a known perfect electric
conductor (PEC) guide structure, and propagate along the z direction with eiβz . For
the time being we will take the propagation parameter to be real (lossless medium).
The general form of the phasor fields are then
E(x, y, z) = [E(x, y) + Ez (x, y)ẑ] eiβz (4.97)
iβz
H(x, y, z) = [H(x, y) + Hz (x, y)ẑ] e (4.98)
Once again we write the two curl equations of Maxwell with harmonic time dependence,
no current sources in nonmagnetic material (µ = mu0 ).
∇ × E = iωµ0 H
∇ × H = −iωE
132 Waveguides and Equivalent Circuits
and write out explicitly the various components of H and E. For the H-field compo-
nents we have
∂Ez
− iβEy = iωµ0 Hx (4.99)
∂y
∂Ez
iβEx − = iωµ0 Hy (4.100)
∂x
∂Ey ∂Ex
− = iωµ0 Hz (4.101)
∂x ∂y
∂Hz
− iβHy = −iωEx (4.102)
∂y
∂Hz
iβHx − = −iωEy (4.103)
∂x
∂Hy ∂Hx
− = −iωEz (4.104)
∂x ∂y
We can use Eqs. 4.99, 4.103 to write Hx in terms of derivatives of Ez and Hz in the
transverse x − y plane. Similarly we can use Eqs. 4.100, 4.102 to also write Hy in terms
of transverse derivatives of Ez , Hz . Proceeding in the same way for Ex and Ey we have
finally four equations,
i ∂Ez ∂Hz
Hx = − ω − β (4.105)
(k 2 − β 2 ) ∂y ∂x
i ∂Ez ∂Hz
Hy = 2 ω +β (4.106)
(k − β 2 ) ∂x ∂y
i ∂Hz ∂Ez
Ex = 2 ωµ 0 + β (4.107)
(k − β 2 ) ∂y ∂x
i ∂Hz ∂Ez
Ey = − 2 ωµ0 −β (4.108)
(k − β 2 ) ∂x ∂y
where k, as always, is related to the frequency ω and velocity of light through the
√ √
medium (v = 1/ µ0 ) by the expression k = ω µ0 . The factor k 2 − β 2 can be
interpreted as the square of a propagation parameter in the x − y plane since β is
always along the z axis. This transverse propagation parameter is called the “cutoff”
parameter kc .
kc2 = k 2 − β 2 (4.109)
i ∂Hz
Hx = ·β (4.110)
kc2 ∂x
i ∂Hz
Hy = ·β (4.111)
kc2 ∂y
i ∂Hz
Ex = 2
· ωµ0 (4.112)
kc ∂y
i ∂Hz
Ey = − 2
· ωµ0 (4.113)
kc ∂x
In order to find the transverse fields we have to use the Helmholtz equation to find
the permitted values of Hz . We write the expression
∂2 ∂2 ∂2
2 2 2
∇ +k Hz = + 2 + 2 + k Hz = 0 (4.114)
∂x2 ∂y ∂z
But since
∂ 2 Hz
= −β 2 Hz (4.115)
∂z 2
the Helmholtz equation reduces to a differential equation for the transverse Hz field
gradients
2
∂2
∂ 2
+ + kc Hz = 0 (4.116)
∂x2 ∂y 2
Allowed solutions will be a function of the transverse boundary conditions.
TM Waves. For TM waves we follow the same procedure except we set Hz = 0 and
write Eqs. 4.105–4.108 as
i ∂Ez
Hx = − · ω (4.117)
kc2 ∂y
i ∂Ez
Hy = · ω (4.118)
kc2 ∂x
i ∂Ez
Ex = ·β (4.119)
kc2 ∂x
i ∂Ez
Ey = 2
·β (4.120)
kc ∂y
∂2 ∂2
2
+ 2 + kc Ez = 0 (4.121)
∂x2 ∂y
and allowed solutions will again be a function of the transverse boundary conditions.
134 Waveguides and Equivalent Circuits
then β becomes pure imaginary. The wave no longer propagates in the waveguide
but decays exponentially to 1/e of its initial amplitude at a characteristic distance of
x = 1/β. The propagating wave becomes an evanescent wave. This threshold behavior
for propagation of mode n within the waveguide is the reason that kc is called the
“cutoff” parameter. Below cutoff, the wave does not propagate. The cutoff parameter
kc is related in the usual way to wavelength and frequency.
2π
kc = and ωc = kc v (4.130)
λc
where v is the propagation velocity in the gap dielectric of the waveguide. If the
dielectric is air or vacuum, then is the velocity of light, c. For a given waveguide
“mode” n the cutoff frequency must be greater than kcn v.
Once we have the solutions for Hz (x, y) we can get the solutions for all the trans-
verse field components from Eqs. 4.110–4.113. Since there is no transverse dependence
in the y direction, Eqs. 4.111, 4.112 show that Hy = Ex = 0 and
β nπ
Hx = −i Bn sin x eiβz (4.131)
kc s
ωµ0 nπ
Ey = i Bn sin x eiβz (4.132)
kc s
Note the difference in sign between Hx and Ey . This sign difference is consistent with
the direction of energy propagation, determined by the Poynting vector, along the
positive z direction, S = E × H. The wave impedance is given by the ratio of the
E-field amplitude to the H-field amplitude,
|Ey | ωµ0 k
ZTE = = = Z0 (4.133)
|Hx | β β
For the parallel plate waveguide the TE modes consist of a “triplet” of components,
Hz , Hx , Ey and the modes are labeled TEn , n = 1, 2, 3 . . .
TM Modes. To find the field components for the TM modes we follow a parallel
procedure to the TE case. The first step is to find the Ez solutions from the reduced
Helmholtz equation, Eq. 4.121, subject to the boundary conditions on the parallel plate
waveguide at x = 0 and x = s. Once again the general solution is
with r
p nπ 2
β = ± k 2 − kc2 = ± k2 − (4.137)
s
From Eqs. 4.117–4.120 we find the transverse field components.
ω nπ
Hy = i An cos x eiβz (4.138)
kc s
β nπ
Ex = i An cos x eiβz (4.139)
kc s
The impedance for the TM waves is
Ex β β
ZTM = = = Z0 (4.140)
Hy ω k
4.6.1 TE Modes
The transverse electric (TE) modes are defined as those waves that propagate in the
guide with E-fields only transverse to the z (propagation) direction. Therefore we seek
Hz solutions to the transverse Helmholtz equation.
2
∂2
∂ 2
+ + kc Hz (x, y) = 0 (4.141)
∂x2 ∂y 2
The solutions will be functions of both coordinates, but we can expect them to be
uncoupled. Therefore we can posit solutions of the form Hz (x, y) = X(x)Y (y). Sub-
stitution into Eq. 4.141 and division by X(x)Y (y) results in
1 ∂2X 1 ∂2Y
+ = −kc2 (4.142)
X ∂x2 Y ∂y 2
Rectangular Waveguides 137
Fig. 4.8 Rectangular waveguide with sides x = 2a and y = 2b. Inside the conducting
boundaries the waveguide is filled with a lossless dielectric characterized by µ0 , .
Since this result must be true for all values of x, y, the terms on the left must themselves
be equal to constants that we write
1 ∂2X 1 ∂2Y
= −kx2 = −ky2 (4.143)
X ∂x2 Y ∂y 2
and
kx2 + ky2 = kc2 (4.144)
The solution for Hz (x, y) is a product of the general solution in x and the general
solution in y.
The transverse E-fields are obtained from the longitudinal H-field Hz by using Eqs. 4.112, 4.113
ωµ0
Ex = i ky (A cos kx x + B sin kx x) · (−C sin ky y + D cos ky y) (4.147)
kc2
ωµ0
Ey = i 2 kx (−A sin kx x + B cos kx x) · (C cos ky y + D sin ky y) (4.148)
kc
4.6.2 TM Modes
The transverse magnetic modes are defined as those modes with H-fields only in the
transverse (x − y) plane. Therefore the field component in the z direction must be an
E-field. The waves propagate along z and Ez is written as the product Ez (x, y)eiβz .
Therefore we seek Ez solutions to the same Helmholtz wave equation as for the TE
modes. 2
∂2 ∂2
∂ 2
+ 2 + 2 + kc Ez = 0 (4.158)
∂x2 ∂y ∂z
Rectangular Waveguides 139
The general solution for Ez has the same form as that for Hz , Eq. 4.145, and subject
to boundary conditions,
Fig. 4.9 Schematic of cylindrical waveguide. The cylinder surface is assumed to be a perfect
conductor. Cylindrical coordinates are r, ϕ, z as shown. The radius of the cylinder is a.
4.7.1 TE Modes
As always, TE modes are defined by the presence of electric fields only in the trans-
verse plane. The field function along the z axis must be an H-field, and we write the
Helmholtz equation
∇ 2 Hz + k 2 Hz = 0 (4.171)
Then we write the ∇2 or Laplacian operator in cylindrical coordinates and factor
Hz (r, ϕ, z) = Hz (r, ϕ)eiβz . We take the expression for the Laplacian operator from
Eq. C.33 of Appendix C.
2
1 ∂2
∂ 1 ∂ 2
+ + + kc Hz (r, ϕ) = 0 (4.172)
∂r2 r ∂r r2 ∂ϕ2
Cylindrical Waveguides 141
r2 d2 R r dR 1 d2 Φ
2
+ + r2 kc2 = − (4.174)
R dr R dr Φ dϕ2
The left side is a function only of r and the right side only of ϕ. In order for the
equation to be valid for the entire range of r, ϕ the two sides must be equal to a
2
constant. We set the separation constant equal to lϕ and write
1 d2 Φ 2
− = lϕ (4.175)
Φ dϕ2
or rearranging
d2 Φ 2
+ lϕ Φ=0 (4.176)
dϕ2
From the left side of Eq. 4.174 we have
d2 R dR
r2 + r2 kc2 − lϕ
2
+r R=0 (4.177)
dr2 dr
Note that lϕ is unitless. From inspection we can write down a solution to Eq. 4.176.
Φ = eilϕ ϕ (4.178)
d2 R dR
r2 + r2 kc2 − n2 R = 0
2
+r (4.181)
dr dr
This expression is one of the forms of Bessel’s differential equation. There exists a
family of functions that are solutions to this equation. In fact there are two independent
families, Bessel functions of the first kind, Jn (x), and of the second kind, Nn (x). The
functions of the second kind, the Neumann functions, tend to −∞ as x → 0 so they
are not physically acceptable solutions. The Bessel functions of the first kind are finite
at the origin and exhibit physically acceptable behavior. The first few Bessel function
142 Waveguides and Equivalent Circuits
Jn (x) are plotted in Fig. 4.10. Putting together the angular and radial solutions to the
reduced Helmholtz equation, we have
The next step is to apply boundary conditions at r = 0 and r = a, where a is the radius
of the hole. But in order to do so we must write the E-field and H-field components
in the r, ϕ directions in terms of the z-axis components. These components are found
from writing down the six Maxwell’s curl equations in cylindrical coordinates and
reducing them to four relations of r, ϕ in terms of z as was done for Eqs. 4.105–4.108.
The result is
1 ∂Ez ωµ0 ∂Hz
Er = i 2 β + (4.183)
kc ∂r r ∂ϕ
1 β ∂Ez ∂Hz
Eϕ = i 2 − ωµ0 (4.184)
kc r ∂ϕ ∂r
1 ω ∂Ez ∂Hz
Hr = −i 2 −β (4.185)
kc r ∂ϕ ∂r
i ∂Ez β ∂Hz
Hϕ = i 2 ω + (4.186)
kc ∂r r ∂ϕ
On the perfectly conducting wall of the cylinder the E-field tangent to the wall must
vanish. Therefore Eϕ (r = a, ϕ) = 0. From Eq. ?? we have
ωµ0
Eϕ (r, ϕ, z) = −i (A sin nϕ + B cos nϕ) Jn0 (kc r)eiβz (4.187)
kc
Cylindrical Waveguides 143
Fig. 4.11 Derivatives of the first three Bessel functions, showing the positions of the first
few zero-crossings.
where Jn0 (kc r) is the radial derivative of the Bessel function. Applying the boundary
condition at r = a
ωµ0
Eϕ (a, ϕ, z) = −i (A sin nϕ + B cos nϕ) Jn0 (kc a)eiβz = 0 (4.188)
kc
But this requirement fixes kc to the zeros of Jn0 (kc a). Each function Jn0 , labeled by
n will have a series of zeros where Jn0 [(kc a)nm ] corresponds to the mth zero of the
nth Bessel function derivative. The arguments of the zeros (or roots) of Jn0 are not
simple multiples of π as in the case of sin and cos functions. They have to be found
numerically and 4.2 provides values for the first few arguments (kc a)nm corresponding
to the zeros of the Jn0 functions. The first three Bessel function derivatives are plotted
in Fig. 4.11. Evidently we obtain kc for the nmth mode from
144 Waveguides and Equivalent Circuits
kc anm
kcnm = (4.190)
a
We obtain the propagation parameter along z for the nmth mode in the usual way
q
βnm = k 2 − kc2nm (4.191)
and with βnm in hand we construct Hz (r, ϕ, z) = Hz (r, ϕ)eiβnm z . Then from Eqs. 4.183–
4.186 we calculate the transverse fields for the nmth mode. The notation nm has been
omitted from kcnm and βnm so as not encumber the expressions more than necessary.
ωµ0 n
Er = i 2 (A cos nϕ − B sin nϕ) Jn (kc r)eiβz (4.192)
kc r
ωµ0
Eϕ = −i (A sin nϕ + B cos nϕ) Jn0 (kc r)eiβz (4.193)
kc
β
Hr = i (A sin nϕ + B cos nϕ) Jn0 (kc r)eiβz (4.194)
kc
βn
Hϕ = i 2 (A cos nϕ − B sin nϕ) Jn (kc r)eiβz (4.195)
kc r
In fact the A and B amplitude factors in the angular part of the transverse field expres-
sions are not independent. We can easily see this from Eq. 4.178. The “normalization
integral” of the angular solution in complex form is
Z 2π
Φ∗ Φ dϕ = 2π = const. (4.196)
0
and writing the normalization of the angular solution in terms of real sin and cos
functions Z 2π
2 1 2
A + B 2 = const.
(A sin nϕ + B cos nϕ) dϕ = (4.197)
0 2
or in other words the only restriction on the amplitudes of the sin and cos terms is that
the sum of the squares of A and B must always be equal to some constant. Therefore
we can set A = 0 and let B 2 = const. Then the transverse fields can be written in
somewhat simpler form
ωµ0 n
Er = −i 2 (B sin nϕ) Jn (kc r)eiβz (4.198)
kc r
ωµ0
Eϕ = −i (B cos nϕ) Jn0 (kc r)eiβz (4.199)
kc
β
Hr = i (B cos nϕ) Jn0 (kc r)eiβz (4.200)
kc
βn
Hϕ = −i 2 (B sin nϕ) Jn (kc r)eiβz (4.201)
kc r
The TE mode impedance is given by
Er k
ZTE = = Z0 (4.202)
Hϕ β
Cylindrical Waveguides 145
4.7.2 TM Modes
In the case of TM modes, there is no component Hz along the axis of symmetry, and
we seek solutions to the Helmholtz equation for Ez (r, ϕ, z) = Ez (r, ϕ)eiβz . As with the
TE modes we have a reduced Helmholtz equation for the transverse coordinates.
2
1 ∂2
∂ 1 ∂ 2
+ + + k c Ez (r, ϕ) = 0 (4.203)
∂r2 r ∂r r2 ∂ϕ2
This expression is the same as Eq. 4.172 so the general solutions are the same.
But in this case the boundary conditions on the E-field can be applied directly the
solutions for the Helmholtz equation.
Ez (r = a, ϕ) = 0 (4.205)
Jn (kc a) = 0 (4.206)
and we get the cutoff parameter for the TMnm mode from the zeros of Jn Bessel
function. Table 4.3 shows some values for the roots of Jn . Once again the cutoff
parameters are obtained from
kc anm
kcnm = (4.207)
a
The propagation parameter along z is
p
βnm = k 2 − kc2 (4.208)
Vn = Vn+ + Vn−
In = In+ − In−
and relating the voltages and currents by the impedance Z for each port of the four-
port network of Fig. 4.12,
V1 Z11 Z12 Z13 Z14 I1
V2 Z21 Z22 Z23 Z24 I2
= · (4.214)
V3 Z31 Z32 Z33 Z34 I3
V4 Z41 Z42 Z43 Z44 I4
The inverse of the impedance is called the admittance Y , and the inverse of the
impedance matrix [Z] is the admittance matrix [Y ].
[Y ] = [Z]−1 (4.216)
Networks of Transmission Lines and Waveguides 147
Fig. 4.12 A schematic waveguide network with two ports S1 , S2 showing inputs and outputs
and two ports S3 , S4 showing only outputs. The outputs at S1 , S2 can be thought of as
reflected waves.
or written out explicitly for the four-port case Eq. 4.216 becomes
I1 Y11 Y12 Y13 Y14 V1
I2 Y21 Y22 Y23 Y24 V2
I3 Y31 Y32 Y33 Y34 · V3
= (4.217)
I4 Y41 Y42 Y43 Y44 V4
The matrix equation 4.215 means that for each element Zij we have
Vi
Zij = (4.218)
Ij
where all Ik with k 6= j have been set to zero. Equation 4.218 means that we can
determine the impedance Zij if we inject current I at port j and measure the resulting
voltage V at port i with all the other ports closed or “open-circuited” in engineering
parlance. For the admittance matrix we have the analogous situation.
Ii
Yij = (4.219)
Vj
148 Waveguides and Equivalent Circuits
which means that in order to determine the admittance matrix element Yij , we apply
a voltage V to port j and measure the current I at port i with all other ports dis-
connected from the network (open-circuited). It is important to remember that the
network can be lossy in which case the matrix elements Zij or Yij are complex. It can
be shown that lossless networks have all Zij and Yij pure imaginary and that recipro-
cal networks are represented by [Z] and [Y ] matrices that are symmetric with respect
to interchange of indices. That is, for example, Zij = Zji . Physically the reciprocal
property means that if a current injected at port m produces a voltage at port n,
then injecting the same current in port n will produce the same voltage at port m.
Networks of linear, passive components such as resistors, capacitors, and inductors are
reciprocal. Networks with active nonlinear components with gain, such as amplifiers,
or not reciprocal.
V1
Z11 = = ZA + ZC (4.220)
I1
because no current runs through ZB . The “transfer impedance” Z12 is obtained by
shutting off port S1 and injecting current into port S2 . No current flows through ZA
so all the injected current I2 flows through ZB and ZC . Then V2 = I2 (ZB + ZC ).
The voltage V1 appearing at the port S1 is just the current I2 flowing through the
“pull-up” resistor ZC . Therefore Z12 is ZC . It is easy to verify that this network is
reciprocal and that therefore Z22 = Z11 and Z12 = Z21 .
The usual rules of matrix multiplication obtain here. Thus, for a given network, the
voltage amplitude of a reflected wave at port i is related to the voltage amplitude of
an incident wave at port j by
Vi− = Sij Vj+ (4.222)
Networks of Transmission Lines and Waveguides 149
Fig. 4.13 (A) Two-port impedance network. (B) Two-port impedance network implementing
a voltage divider. Note that the directions of I1 , I2 point into the ports towards the network.
with the implicit assumption that all other ports are quiescent with voltages at the
port inputs set to zero. The diagonal matrix elements express the ratio of reflected
amplitude to incident amplitude at a give port, assuming all other ports are quiescent
(i.e. matched impedances at all other ports). Thus Sii is a reflection coefficient at
port i. Since the S-matrix is essentially intended for waveguide networks, the ports
are assumed to have some characteristic impedance (for example, Z0 = 50Ω). The Sij
matrix element expresses the transmission coefficient from port j to port i.
150 Waveguides and Equivalent Circuits
Fig. 4.14 Schematic ABCD network. Note that the positive current I2 is pointing away from
the port entrance. This ABCD-matrix convention differs from the S-matrix where all positive
quantities point into the port.
V1 A1 B 1 A2 B 2 V
= · · 3 (4.234)
I1 C1 D1 C2 D2 I3
The ABCD matrices can be used to construct elaborate networks from a few simple
elements. For example, suppose we have have an elementary two-port network con-
sisting of one impedance element as shown in Fig. 4.15 By applying Ohm’s law we
can identify the ABCD matrix elements. By inspection it is easy to determine that
A = 1, B = Z, C = 0, D = 1. Another example is a two-port network with a single
admittance as shown in Fig. 4.16 Again by Ohm’s law and remembering the Y = 1/Z,
a simple inspection of the coupled equations reveals that A = 1, B = 0, C = Y, D = 1.
Figure 4.17 shows an impedance T-network. The ABCD matrix for this network
can be obtained by a more systematic approach. First consider the A element and
Eq. 4.232. We see that if no current flows at the output (open circuit with I2 = 0),
then A = V1 /V2 . But in that case from Ohm’s law V1 = I1 (Z1 + Z3 ) and V2 = I1 Z1 .
Therefore A = (Z1 + Z3 ) /Z1 . To determine the B element we set V2 = 0 (output short
circuit). Then B = V1 /I2 . The result of applying Ohm’s law (or the Kirchhoff voltage
and current rules) is that B = Z1 + Z2 + Z1 Z2 /Z3 . Similarly we consider Eq. 4.233
152 Waveguides and Equivalent Circuits
and set successively I2 , V2 = 0 to find the matrix elements C and D. The result is
C = 1/Z3 and D = 1 + Z2 /Z3 .
Another common network is the admittance network shown in Fig. 4.18. This
arrangement is often called the Pi-network. By methodically “open-circuiting” and
“shorting” the output terminals the elements of the ABCD matrix can be determined.
Two more useful elementary functions useful for constructing more elaborate net-
works are the transmission line segment and the inductive transformer. Figures 4.19
and 4.20 sketch these two circuit functions, and the ABCD matrix elements for all
these elementary circuit functions are entered in Table 4.4.
Fig. 4.19 Two-port ABCD network element: transmission line segment of length z.
is the number of ports in the network. For components with linear, passive response,
however, the matrices are always symmetric so that Xij = Xji . All the network matrix
representations considered here are symmetric. Furthermore it can also be shown that
if the network is lossless, containing only reactive elements that store power but do
not dissipate it, the network elements will be pure imaginary quantities. Coupling
networks that contain capacitance and induction but no significant resistance fall into
this category.
Network A B C D
Single Z 1 Z 0 1
Single Y 1 0 Y 1
Z1 Z1 Z2 1 Z2
T-Impedance 1+ Z3 Z1 + Z2 + Z3 Z3 1+ Z3
Y2 1 Y1 Y 2 Y1
Pi-Admittance 1+ Y3 Y3 Y1 + Y2 + Y3 1+ Y3
V1 Z11 Z12 I1
= · (4.235)
V2 Z21 Z22 −I2
Note that the sign of I2 has been reversed. The reason is that we seek to compare
the ABCD network scheme with the Z scheme. But the impedance network shown in
Fig. 4.13 assumes that positive current I2 points into port 2, whereas in the ABCD
network I2 points outward from the port. Written out in equation form we have
The set of Z-coupled equations 4.236, 4.237 must be compared to the set of ABCD-
coupled equations, 4.232, 4.233. Suppose we take I2 output current to zero (open-circuit
port 2). Then from Eq. 4.232 A = V1 /V2 and from Eqs. 4.236, 4.237 Z11 = V1 /I1 and
Z21 = V2 /I1 . From these relations we find that the A element of the ABCD matrix
is related to the impedance matrix elements by A = Z11 /Z21 . If we short-circuit port
2 (V2 = 0) we find B = Z11 (Z22 /Z21 ) − Z12 . Proceeding similarly for C, D we find
C = 1/Z21 and D = Z22 /Z21 .
Nanostructures and Equivalent Circuits 155
Table 4.5 Equivalence between ABCD and Z-matrix elements in two-port networks
The equivalence between the ABCD matrix elements, the impedance matrix ele-
ments, and the impedance-equivalent circuit for the T-network is summarized in Table
4.5. The T-network, implemented with impedance elements, is shown in Fig. 4.21.
Fig. 4.22 Polarized sphere subject to an external E-field Ez = E0 e−iωt . E-field lines are
sketched in the vicinity of the sphere, and displacement field (D-field) lines are shown within
the sphere and looping outside the sphere in a dipole pattern. Note that the D-field lines
normal to the surface are continuous through the interface.
3out
Ein (r, θ) = E0 êz (4.238)
in + 2out
in − out r03
Eout (r, θ) = E0 êz + E0 (2 cos θ êr + sin θ êθ ) (4.239)
in + 2out r3
Figure 4.22 shows the shape and direction of the applied E-field lines (along the z
direction) in the vicinity of the sphere and the displacement field inside the sphere,
Din = in Ein , and outside the sphere, Dout = out Eout . At the surface of the sphere,
matching conditions impose that the normal component of the displacement field be
continuous across the surface.
in − out
in Ein · n̂ = out E0 êz · n̂ + E0 (2 cos θêr · n̂ + sin θêθ · n̂)
in + 2out
3out in − out
in E0 êz · n̂ = out E0 êz · n̂ + E0 2êz · n̂ (4.240)
in + 2out in + 2out
The net or residual displacement field within the sphere is the difference between the
total displacement field (left side of Eq. 4.240) and the applied displacement field.
out − in
Dnet = in E0 êz (4.242)
in + 2out
Nanostructures and Equivalent Circuits 157
The component normal to the sphere surface of this net displacement field is
out − in
Dnet · n̂ = in E0 êz · n̂ (4.243)
in + 2out
Now we subtract Eq. 4.243 from both sides of Eq. 4.240 and rearrange the terms to
put the applied fields inside and outside the sphere on the left and everything else on
the right.
in − out in − out
(in − out ) E0 êz · n̂ = in E0 êz · n̂ + out E0 2êz · n̂ (4.244)
in + 2out in + 2out
The displacement current is given by the time derivative of the displacement field.
Taking the time derivative of Eq. 4.244 and integrating over the surface of the half-
sphere with positive charge in Fig. 4.22 yields
2 in − out
−iω (in − out ) E0 · πr0 = −iωin E0 · πr02
in + 2out
in − out
− iωout E0 · 2πr02 (4.245)
in + 2out
The integration over the surface is carried out by noting that the êz · n̂ factor in
Eq. 4.244 is cos θ and that therefore the integration over an infinitesimal of half the
sphere surface dS is
Z Z π/2
2 2
r0 êz · n̂ dS = r0 cos θ sin θ dθ dϕ
0
π/2
1
= r02 2π sin2 θ = πr02
2 0
The term on the left side of Eq. 4.245 is a bound current source arising from the
polarization of the sphere due to the external applied electric field. The first term
on the right is the bound current passing through the sphere surface and the second
term is the external “dipole current” looping back to the source (Fig. 4.22). The three
terms of Eq. 4.245 express the Kirchhoff current law (Section 4.2.1), that all currents
entering and leaving a node in a circuit must sum to zero.
Dielectric Sphere. We suppose for the moment that in represents an (almost) lossless
dielectric. In that case the permittivity of the sphere is complex but with a negligible
imaginary term and in > 0 . The circuit of the displacement current can then be
represented as in Fig. 4.23. Now from the electric field at the surface of the sphere we
the potential difference across the sphere. The potential is given by
Z
Vsphere = − Esurf · dl
σ
Z 0
out − in
= −2 E0 cos θ r0 sin θ dθ
−π/2 in + 2out
out − in
= r0 E0 (4.246)
in + 2out
158 Waveguides and Equivalent Circuits
Fig. 4.23 Equivalent circuit for a subwavelength dielectric sphere subject to an applied oscil-
lating electric field. The displacement currents into and out of the round node at the branch
obey Kirchhoff’s current rule. The dielectric sphere is represented by capacitor in parallel
with a very low-loss resistor, and the dipole loop displacement current is also represented by
a capacitor.
Now from Vsphere and Isphere we can identify the characteristic impedance of the sphere
as
Vsphere
Zsphere =
Isphere
i
= (4.247)
ωin πr0
and the impedance of the dipolar field is
Vsphere
Zdipole =
Idipole
i
= (4.248)
ωout 2πr0
From the phasor form of the impedance (Section D.2 in Appendix D) we have
Z = R + i (XL + XC ) (4.249)
XL = −ωL inductive reactance (4.250)
1
XC = capacitive reactance (4.251)
ωC
Comparing 4.251 with 4.247 and 4.248 we see that
Csphere = in πr0 (4.252)
Cdipole = out 2πr0 (4.253)
Nanostructures and Equivalent Circuits 159
Fig. 4.24 Equivalent circuit of a subwavelength metallic sphere subject to an external os-
cillating electric field. The source displacement current drives the sphere and Kirchhoff’s
current rule applies to the branch point node. Current through the sphere exhibits inductive
reactance in parallel with a nonnegligible resistive loss. The dipole branch exhibits capacitive
reactance as in the case of the dielectric sphere.
Metallic Sphere. In the case of a metallic sphere the real part of the permittivity can
be strongly negative and the resistive loss nonnegligible. In that case the equivalent
circuit can be construed as shown in Fig. 4.24. The sphere impedance becomes
metal i
Zsphere = (4.254)
ωRe[in ]πr0
Thus we see that the nanosphere can act as a capacitive element or an inductive
element by choosing the material property: insulating or conducting.
panel the plane wave incident normal to the interface and the right panel the equiv-
alent impedance T-network representing a transmission line. The expressions for the
impedances are (see reference 4 at the end of the chapter),
βl
Zs = Zc tanh (4.256)
2
Zc
Zp = (4.257)
sinh(βl)
where
γc
Zc = (4.258)
iωc
and
γc2 = β 2 − c k02 = −kc2 (4.259)
where β is the complex propagation constant,l the length of the transmission line,
Zc the characteristic impedance, and c the characteristic dielectric constant of the
medium. Using these expressions for the series impedance Zs and parallel impedance
Zp , and applying the rules for finding the ABCD matrix elements of the T-circuit we
developed in Section 4.8.5, we find that for long transmission lines (l >> 1/β).
A = cosh βl (4.260)
B = Z0 sinh βl (4.261)
1
C= sinh βl (4.262)
Z0
D = cosh βl (4.263)
in accordance with the expressions listed in Table 4.4. Now as the length of the trans-
mission line tends to infinity, Z → 0; and in the limit of very long lines we have the
situation shown in Fig. 4.26. In fact the plane wave propagates through the dielectric
and the metal, although the propagation length in the metal penetrates only as far as
the skin depth. Therefore we can characterize the light incident on the interface as two
transmission lines connected at the interface with two characteristic impedances, Z0d
and Z0m for the dielectric and metal respectively. Figure 4.27 shows the correspondence
between plane wave propagation in the dielectric and metal and the equivalent circuit
of the interface. Now the transmission line impedances can be written in a particularly
useful form if we consider Maxwell’s equations in a Laplace-transformed space. The
Laplace transform is an integral operator of the form
Z ∞
L̂ = e−γz dz (4.264)
0
with
Nanostructures and Equivalent Circuits 161
Fig. 4.25 Comparison between plane wave and equivalent impedance t-network. Panel(a)
Plane wave incident on a dielectric-metal interface. (b) Equivalent circuit for a transmission
line representing the plane wave.
γ 2 = −kz2 (4.266)
where γ may be complex to take into account transmission line losses and kz the
propagation parameter in the z direction. Details of this transformation and its con-
sequences are discussed in a recent publication (reference 6 at the end of the chapter).
Here we simply state one of the key results for the Ampère-Maxwell law in Laplace
space for incident waves with TM polarization.
γ H̃y = iωẼx (4.267)
where H̃y and Ẽx are the Laplace-transformed y and x components of the H- and E-
field, respectively, ω and have their usual meanings of frequency and permittivity. We
identify the characteristic impedance of the transmission line with the corresponding
impedance of the Laplace-transformed wave.
Ẽx γ
Z0 = = (4.268)
H̃y iω
162 Waveguides and Equivalent Circuits
Fig. 4.26 The limiting form of the equivalent circuit of a long transmission line.
Any surface waves traveling in the x direction must have the same propagation pa-
rameter β in the dielectric and metal materials. With k0 the wave propagation vector
in free space we have from k-vector addition on each side of the interface
q
γd = β 2 − d k02 = ikzd (4.269)
q
γm = β 2 − m k02 = ikzm (4.270)
The picture we have now is of two transmission lines along z that join at the dielectric-
metal interface. The characteristic impedances for the dielectric and metal transmis-
sion lines are
γd
Z0d = (4.271)
iωd
γm
Z0m = (4.272)
iωm
Transmission Line Transverse Resonance. When a transmission line is populated by
a wave that is just at the cutoff condition, kc , a standing wave is present in the
line, and no net energy propagates along the line. A transmission line in this state
is said to be in “resonance”. The two transmission lines, along z in our problem, are
transverse to the interface x. In order for stable surface waves to propagate along
the interface, they must be subject to the same restriction that no energy propagates
along z. Therefore a necessary condition for stable, propagating surface waves along x
is that our transmission lines along z fulfill the resonance condition. Since no net wave
propagates along z, the net propagation parameter along z must be null. Therefore,
according to Eqs. 4.271, 4.272, a necessary condition for transverse resonance in our
two joined transmission lines is
Fig. 4.27 Correspondence between plane waves propagating in dielectric and metal, panel
(a), and two equivalent circuit transmission lines joined at the dielectric-metal interface.
Therefore
γd γm
= (4.274)
iωd iωm
or
kzd km
= z (4.275)
d m
This “impedance matching” condition at the interface, arising from the transmission
line at resonance, is equivalent to the field matching condition that we found in Section
3.3, Eq. 3.32. Thus we see that a transmission line point of view gives us a supplemen-
tary insight into the physical conditions required for stable surface waves.
Fig. 4.28 Schematic of a subwavelength slit of width w milled into a subwavelength metal
layer. The dielectric in the slit is the same as the dielectric above and below the metal surface.
In the usual case the metal is the same on both sides of the slit so Z1 = Z2 . The equivalent
circuits of the two dielectric-metal interfaces and the T-circuit of the slit itself are shown in
the impedance equivalent circuit.
subwavelength. Figure 4.28 shows a schematic of the physical slit and the correspond-
ing impedance equivalent circuit. The equivalent circuit consists of the T-circuit that
we considered earlier in Section 4.9.2 for the slit and two symmetrical voltage sources
and impedances that represent voltage and current runnning in the metal near the
surfaces. We posit that a plane wave propagating along z is incident on the top sur-
face and sets up a standing wave there. We saw in Section 2.10.6 that the magnetic
field component of the wave is adjacent to the surface and penetrates to the skin depth
within a real metal. This time-harmonically oscillating magnetic field, via Faraday’s
law, induces currents and voltages within the skin depth on the top surface and on
the slit walls. A detailed discussion of the currents circulating in the skin depth near
the slit is given in Reference 7 at the end of the chapter.
The first step in the analysis is to assume that the circuit is symmetrical with the
metal wall impedances Z1 = Z2 = Zm . Then we consider the circuit with only one
source, say the one on the left associated with Z1 in Fig. 4.28 and effectively “short-
circuit” the other source on the right. The resulting circuit consists of a single voltage
source and a series-parallel network of impedances that can be combined into a single
impedance ZT . The expression for ZT is
(Zm + Zsd )Zpd + (Zm + Zsd )(Zm + Zsd + Zpd )
ZT = (4.276)
Zm + Zsd + Zpd
Now we impose the “resonant transmission line” condition to insure that no power is
propagated perpendicular to the slit walls (along the x direction). As we saw in Section
Exercises 165
3.6 power is propagated only along z with exponential fall-off of the fields penetrating
the metal walls of the slit. This condition is that ZT = 0, which results in the following
impedance expression
2 2 2
(Zsd + Zpd ) + (2Zm ) (Zsd + Zpd ) + Zm − Zm + Zpd =0 (4.277)
This is the final expression for the slit transcendental function in terms of the equivalent
circuit quantities. It is consistent with the transcendental equations found for the slit
in Chapter 3 obtained from a wave treatment.
4.10 Exercises
1. Suppose two lossless transmission lines with different characteristic impedances
are coupled. If Z1 = 50 Ω and Z2 = 75 Ω, calculate the fractonal power reflected
and transmitted at the interface between the two lines.
166 Waveguides and Equivalent Circuits
2. For a tranmission line of length λ/4 calculate the impedance of the line if it is (a)
shorted, (b) open-circuited.
3. For a rectangular waveguide calculate the cutoff wavevector kc for TM modes
if the guide is square with side s = 800 nm and if the guide is very thin with
s1 = 800 nm and s2 = 100 nm. Assume the guide is excited by a plane wave
of λ = 500 nm impinging normally on the front fae of the guide. Calculate the
characteristic impedance of this transmission line.
4. Consider the interface between silver and glass. Assume that 1 mW power is cou-
pled to the interface and converted to spp waves. From what you learned in
Chapter 3 calculate the stable surface wave propagation parameter assuming the
exciting field wavelength is 832 nm. Use the following dielectric constants: glass
0 = 2.40, silver = −32.8 + i0.46.
5. Using the same data as in Exercise 4 calculate the impedance Zp of the equivalent
T-circuit and the charactersitic impedance Z0 of the corresponding transmission
line.
6. Calculate the TE and TM modes that can propagate in a planar metal-dielectric-
metal (MDM) waveguide with water between the two metal walls. The distance
between the two metal slabs is 0.25 µm and the free-space wavelength is 1.55 µm.
The dielectric constant of water at λ0 = 1.55µm can be considered real and equal
to 1.77.
7. Calculate (a) the capcitance of a glass nanosphere with radius equal to 50 nm and
(b) the capcitive reactance of the sphere at an excitation wavelength of 832 nm.
The dielctric constant of glass is 2.25.
8. Suppose we have a sphere with the same dimensions as in Exercise 7 but fabricated
from silver metal. Using the dielectric constant data of Exercise 4, calculate (a)
the inductance of the sphere and (b) the inductive reactance.
5.1 Introduction
The hydrogen atom is the simplest atomic structure interacting with light in the optical
regime. We know from the earliest days of atomic spectroscopy that the hydrogen atom
emits and absorbs dipole radiation. But the emitted and radiated light only appears
at certain frequencies; the hydrogen atom does not exhibit a continuous spectrum as
the classical dipole antenna. The reason is that the atom is a quantum object, not a
classical dipole. Nevertheless a “semiclassical” picture of the light-atom interaction is
possible. In this picture the electron, with dynamics defined by the quantal nature of
atomic internal states, still acts as a receiving antenna or a source of classical (usually
dipole) radiation. This hybrid picture of quantal matter and classical electrodynamics
captures a great deal of the atom-optical physics. Nonrelativistic quantum mechanics
and the classical radiation field combine to explain almost all the features of the
hydrogen atom spectrum. We will use these tools to develop an explanation for the
quantization of the energy levels of the atom and the “selection rules” for dipole
transitions between them.
e2 2 a 2
hW i = (5.4)
4π0 3c3
and we see that the average power emitted by the oscillating electron is proportional to
the average square of the electron acceleration. A charged particle must be accelerating
to emit radiation; an electron moving at constant velocity, not subject to external
forces, does not emit.
The energy (cycle-averaged) emitted by the oscillator must come from mechanical
(kinetic + potential) energy, Emech .
1
me ω02 r02
Emech = (5.5)
2
where me is the electron mass. The average power emitted by the oscillator can be
equated to the diminution of mechanical energy with time.
Emech
hW i = − (5.6)
τ
From Eqs. 5.2, 5.5, and 5.6 we see that that
2 −1
e 3me c3
τ= (5.7)
4π0 2ω02
The “lifetime” of the oscillator τ can be written a little more simply by introducing
the classical radius of the electron,
e2 1
rc = ' 2.8 × 10−15 m (5.8)
4π0 me c2
and
3c
τ= (5.9)
2rc ω02
In differential form Eq. 5.6 is written
dEmech
hW i = −
dt
so
dEmech dt
=− (5.10)
Emech τ
and
Emech (t) = Emech (0)e−t/τ (5.11)
We see that the internal energy of the atom, modeled as a classical electron oscillating
around a positive charge center, decreases exponentially with time as the energy is
radiated away.
Radiative Damping and Electron Scattering 169
A specific example provides some feeling for the relevant time and energy scales.
Consider a sodium atom, structured with 10 electrons in a “closed shell” and one
electron outside. This exposed or “active” electron is subject to an effective positive
charge at the nucleus roughly equal to that of of one proton. The potential and electric
field experienced by the active electron is radial and the simple dipole oscillator model
might therefore be appropriate. Emission from the first excited state to the ground
state is concentrated in a narrow band of wavelengths centered at 590 nm, the well-
known “D line” of atomic spectroscopy. Considering this emission as emanating from
a dipole oscillator with resonant frequency ω0 ,
2πc
ω0 = = 3.2 × 1015 s−1
λ
and
3c
τ= ' 16 × 10−9 s
2rc ω02
This result is actually quite close to the measured lifetime of the first excited state
of the sodium atom and lends credence to the harmonic oscillator model. In reality,
however, the atom must be treated quantum mechanically because there is no lower
limit to the classical radiative emission, and the classical electron would fall into the
nucleus as its mechanical energy diminishes to zero. Furthermore, the first excited
state with a quantum mechanically “allowed” transition possesses one quantum of
orbital angular momentum, and this angular momentum does not appear in the simple
oscillator model.
d2 z 1 dz eEz −iωt
+ + ω02 z = − e (5.12)
dt2 τ dt me
where as before ω0 is the characteristic frequency of the oscillator, e, me are the electron
charge and mass, respectively, τ the classical oscillator lifetime, and Ez is the amplitude
of the external driving field oscillating at frequency ω. The solution to this equation
of motion is
−eEz e−iωt
z= (5.13)
me [(ω02 − ω 2 ) − iω/τ ]
From this result we can write down the frequency-dependent dipole moment,
−eEz e−iωt
p = −ez ẑ = ẑ (5.14)
me [(ω02 − ω 2 ) − iω/τ ]
e4 Ez2 ω 4
hW i = h
2
i (5.15)
12π0 m2e (ω02 − ω 2 ) + (ω/τ )2
Using the classical electron radius, Eq. 5.8, we can write this expression for the dissi-
pated power as the product of an energy flux and a scattering cross section,
( " #)
0 Ez2 8πrc2 ω4
hW i = c· 2 2
(5.16)
2 3 (ω02 − ω 2 ) + (ω/τ )
| {z } | {z }
energy flux cross section
where Ψn (r, t) represents the state n of the atom, and Ĥ is the Hamiltonian operator,
Ĥ = i~∂/∂t. According to the postulates of quantum mechanics, a characteristic or
“eigen” state of a system must be represented by a wavefunction that is single-valued,
normalized, and orthogonal to all the other eigenstates of the system. An arbitrary
system state may be expressed as a linear combination of these eigenstates that form
a “complete set” in a linear, orthogonal vector space. If the state is “stationary” or
time-independent, as it is for the isolated atom, we may write the state function as a
product of a spatial term ψ(r) and a time-varying phase, exp(−iωt)
The frequency of the phase is related to the energy En of the stationary state by
the Planck relation, En = ~ω. The spatial, time-independent part can factor and be
separated by substitution of Eq. 5.19 into Eq. 5.18 with the result
Ĥ [ψ(r) − En ] = 0 (5.20)
where ψ(r) is the spatial wave function that is the solution to the time-independent
Schrödinger equation, 5.20. Now the Hamiltonian operator Ĥ is constructed from the
Bohr correspondence principle that permits us to identify certain mathematical op-
erators with their dynamical variable counterparts in the Hamiltonian formulation of
The Schrödinger equation for the hydrogen atom 171
µe4 me e4
En = 2 '− 2 (5.43)
(4π0 ) 2~2 n2 (4π0 ) 2~2 n2
where n can take on the integer values n = l, l + 1, l + 2, . . . and l can be an integer
l = 0, 1, 2, . . .. The radial solutions take the form of three factors: a normalization
factor, a polynomial, and an exponential. In fact Eq. 5.40 can be rearranged into a
form that bears a striking resemblance to the differential equation whose solutions are
the “modified Laguerre functions”. The rearranged form is obtained by multiplying
Eq. 5.40 by −2µ/~2 and transposing two terms.
~2 1 d e2 ~2 l(l + 1)
2 dRnl
− 2
r − Rnl + Rnl = En Rnl (5.44)
2µ r dr dr 4π0 r 2µ r2
The differential equation for the modified Laguerre functions and its solutions, Eqs. E.18, E.19,
are discussed in Appendix E. Now if we make the following substitutions, Eq. 5.44 can
be cast into the form of Eq. E.19.
r
8µE
ρ = αr and α= − 2 (5.45)
~
where it is understood that E < 0 since we only consider the allowed bound-state
energies of the hydrogen atom, and the reference zero of energy is at the ionization
limit r → ∞. The quantity α has units of m−1 so ρ is unitless. We make an additional
substitution to consolidate many constants,
µe2
λ= (5.46)
2π0 α~2
In order for the λ substitution to be consistent with the physically admissible values
of the energy En , λ must be restricted to integers, λ = 1, 2, 3, . . .. and defining χ(ρ) =
R(ρ/α), we can recast Eq. 5.44 as,
1 d 2 dχ(ρ) λ 1 l(l + 1)
ρ + − − χ(ρ) = 0 (5.47)
ρ2 dρ dρ ρ 4 ρ2
Using one last substitution trick,
1 d 2 dχ 1 d2
ρ = (ρχ) (5.48)
ρ2 dρ dρ ρ dρ2
174 Radiation in Classical and Quantal Atoms
we have finally
d2 (ρχ)
1 λ l(l + 1)
+ − + − ρχ = 0 (5.49)
dρ2 4 ρ ρ2
Now we compare this last result to Eq. E.19 rewritten here for convenience,
d2 Φkn (x) 1 2n + k + 1 k 2 − 1
+ − + − Φkn (x) = 0
dx2 4 2x 4x2
k−1
ρ↔x l↔ λ→n+l+1 (5.50)
2
Then the solutions to Eq. 5.49 are
where Lkn (x) are the associated Laguerre polynomials. Thus Eq. 5.51 expresses the
solutions to Eq. 5.49 that in turn is just the the hydrogen radial equation, Eq. 5.40,
in a different guise. Now we recover from the scaling parameters α and ρ terms of
physically meaningful quantities. From the expression for the allowed bound energies
En of the hydrogen atom, Eq. 5.43, we can write α and ρ in terms of n the principal
quantum number.
2 2
α= and ρ= r (5.53)
na0 na0
where a0 = 4π0 ~2 /me e2 , the well-known atomic radius of the Bohr model. Finally
we can write out the solutions to the hydrogen radial equation, Eq. 5.40,
−αr/2
Rnl (r) = (αr)l L2l+1
n−l−1 (αr)e (5.54)
Fig. 5.1 Normalized radial probability density as a function of the radial coordinate (in
units of a0 ) for three radial wave functions.
n l Rnl
3/2
1 0 1
a0 2e−r/a0
3/2 h i
1 r −r/2a0
2 0 2a0 2 − a0 e
3/2 2
3 0 1
3a0 2 1 − 2 a0 + 27 ar0
r 2
e−r/3a0
properties are discussed in Complement F. The solutions to Eq. 5.42, the functions of
the azimuthal angle ϕ, can be obtained by inspection. It is easy to verify that
Φml (ϕ) = eiml ϕ (5.60)
According to the postulates of quantum mechanics, the solution to a differential equa-
tion must be single-valued in the independent variable in order for the function to
suitably represent a physical state. Therefore, as ϕ cycles through integer multiples of
2π, increasing from 0 → 2π, 2π → 4π, . . . the value of the function Φml should remain
single-valued. Thus Φml (ϕ) must have the same value as Φml (ϕ0 ) if ϕ0 = ϕ+n2π where
n = 1, 2, 3, . . .. In order to assure this behavior, the value of ml must be an integer,
ml = ±1, ±2, ±3, . . .. Furthermore, Φml is subject to a normalization requirement
analogous to that of Eq. 5.56.
1 2π ∗
Z
Φml (ϕ)Φml (ϕ) dϕ = 1 (5.61)
N 0
The normalized azimuthal wavefunction is therefore
1
Φml (ϕ) = √ ei(±ml )ϕ (5.62)
2π
Functions of the polar angle θ . The solution to Eq. 5.41 constitutes a family of
functions, Plml (cos θ) called the associated Legendre functions. The properties of the
Legendre functions and the Legendre polynomials, the relevant functions when ml = 0,
are discussed in some detail in Complement F. These functions describe the polar-angle
dependence of the hydrogen atom wave function, ψ(r, θ, ϕ) = Rnl Plml (cos θ)Φml (ϕ).
Just as Rnl and Φml must be normalized, so must the functions Plml (cos θ). The
normalization requirement specifies that
1 π ∗ml
Z
P (cos θ)Plml (cos θ) sin θ dθ = 1 (5.63)
N 0 l
with N expressed by
2 (l + ml )!
N= · (5.64)
2l + 1 (l − ml )!
The family of functions that are the product of Pl (cos θ) and Φ(ϕ) is called the spherical
harmonics, Ylml (θ, ϕ). The normalized spherical harmonics are
The Schrödinger equation for the hydrogen atom 177
ml
Table 5.2 Angular Functions, Spherical Harmonics Yl
n l ml Ylml
q
1 1
1 0 0 2 π
q
2 1 −1 1
2
3
2π sin θe−ϕ
q
1 3
2 1 0 2 2π cos θ
q
2 1 1 − 21 3
2π sin θeϕ
q
3 2 −2 1
4
15
2π sin2 θe−2ϕ
q
3 2 −1 1
2
15
2π sin θ cos θe−ϕ
q
1 5 2
3 2 0 4 π cos θ − 1
q
3 2 1 − 12 15
2π sin θ cos θeiϕ
q
3 2 2 1
4
15
2π sin2 θei2ϕ
s
2l + 1 (l − ml )! ml
Ylml (θ, ϕ) = (−1)ml · P (cos θ)eiml ϕ (5.65)
4π (l + ml )! l
and Z π Z 2π
Yl∗ml (θ, ϕ)Ylml (θ, ϕ) sin θ dθ dϕ = 1 (5.66)
0 0
In order to maintain physically admissible, well-behaved functions there are certain
restrictions on the function labels, n, l, ml . We summarize them here without proof.
n = 1, 2, 3, . . . (5.67)
l = 0, 1, 2, . . . n − 1 (5.68)
ml = −l, −l + 1, −l + 2, . . . 0 . . . , l − 2, l − 1, l (5.69)
ml
Table 5.3 Hydrogen wave functions Rnl Yl , n=1 − 3
n l ml Rnl Ylml
3/2
1 0 0 √1
π
1
a0 e−r/a0 1
3/2 h i
2 0 0 √1
4 2π
1
a0 2− r
a0 e−r/2a0 1
3/2
2 1 0 √1
4 2π
1
a0
r
a0 e−r/2a0 cos θ
3/2
2 1 ±1 8 π
1
√ 1
a0
r
a0 e−r/2a0 sin θe±iϕ
3/2 2
3 0 0 1
√
81 3π
1
a0 27 − 18 ar0 + 2 ar0 e−r/3a0 1
q 3/2
1 2 1 −r/3a0
3 1 0 81 π a0 6 − ar0 r
a0 e cos θ
3/2
−r/3a0
3 1 ±1 1
√
81 π a0
1
6 − r
a0
r
a0 e sin θe±iϕ
3/2 2 2
3 2 0 √1
81 6π a0
1 r
a0 e−r/3a0 3 cos2 θ − 1
3/2 2 2
3 2 ±1 1
√
81 π a0
1 r
a0 e−r/3a0 sin θ cos θe±iϕ
3/2 2 2
3 2 ±2 1√
162 π a0
1 r
a0 e−r/3a0 sin2 θe±i2ϕ
Table 5.3 lists the normalized ψnlml solutions to the hydrogen atom Schrödinger equa-
tion for n = 1 − 3.
5.4.3 Orthogonality
If ψn,l,ml represents an eigenstate of the hydrogen atom with an eigen energy and
angular momentum, then according to the postulates of quantum mechanics it must
be not only single-valued and normalized, but also orthogonal to all other states with
different energies or angular momenta. More succinctly,
Z
∗
ψn,l,ml
ψn0 ,l0 ,m0l dτ = δnn0 δll0 δml m0l (5.76)
all space
State Energy and Angular Momentum 179
where the δ function takes the value of unity when n, l, ml are equal to their primed
counterparts but is null otherwise. Wavefunctions that exhibit this property are said
to be “orthonormal” because they are orthogonal and normalized. In polar coordinates
the wavefunctions of the hydrogen atom are products of radial and angular functions
ψn,l,ml (r, θ, ϕ) = Rnl (r)Ylml (θ, ϕ) = Rnl (r)Plml (cos θ)Φml (ϕ) (5.77)
and therefore these radial and angular functions must be orthonormal as well.
The eigen energies En are labeled by the principal quantum number n and are charac-
terized by the analytical expression given in Eq. 5.43. The constants in this expression
can be collected together to facilitate numerical calculation in convenient units.
The allowed values of n are n = 1, 2, 3, . . .. The lowest lying (ground) state of the
atom corresponds to n = 1, and E1 = −13.6 eV. The first excited state corresponds to
n = 2, and E2 = −3.40 eV, and so forth. As n → ∞, En → 0, and the zero-reference
energy corresponds to the ionization limit where the electron is no longer bound by
the Coulomb potential. Figure 5.2 shows a diagram of the hydrogen atom energy levels
bound within the Coulomb potential.
Fig. 5.2 Coulomb potential to which the bound electron in the hydrogen atom is subject.
The lowest four bound states (n = 1 − 4) are shown.
in Section 5.4. From classical mechanics we have for the angular momentum of the
electron L from the position r and the linear momentum p
L=r×p (5.80)
r → r̂ (5.84)
p → −i~∇ (5.85)
∂ ∂
L̂x = −i~ y −z (5.86)
∂z ∂y
∂ ∂
L̂y = −i~ z −x (5.87)
∂x ∂z
∂ ∂
L̂z = −i~ x −y (5.88)
∂y ∂x
and the operator corresponding to the square of the total orbital angular momentum
is
Lˆ2 = Lˆ2x + Lˆ2y + Lˆ2z (5.89)
1 ∂2
1 ∂ ∂
Lˆ2 = −~2 sin θ + (5.93)
sin θ ∂θ ∂θ sin2 θ ∂ϕ2
From Eqs. 5.41, 5.42 and the definition of the spherical harmonic functions, Eq. 5.65
we see that
The value of the orbital angular momentum for a given eigen state of the hydrogen
atom is therefore p
L = ~ l(l + 1) (5.96)
and the Lz component is given by
Lz = ~ml (5.97)
Thus we see that the ground state exhibits no orbital angular momentum, with n = 1,
l = 0 and L, ml = 0. The first excited state, n = 2, can have values of l = 0, 1. Therefore
√
the n = 2 level exhibits two states of orbital angular momentum, L = 0 and L = ~ 2.
The level with l = 1 possesses three possible values of Lz : ~, 0, −~. It is already clear
from these results that Schrödinger equation yields a physical interpretation of the
internal atomic structure quite different from that of the Bohr orbits. First, since the
ground state has no angular momentum, we cannot imagine the electron as orbiting
the nucleus. The electron is restricted to radial motion. Second, even when the orbital
182 Radiation in Classical and Quantal Atoms
state does have nonzero angular momentum, the projection L on the z axis can only
take on discrete values. In the case of l = 1 only three Lz values are allowed. In a
classical orbit, there would be no restriction on Lz from −L to L. A third nonclassical
feature of the angular momentum is the peculiar status of the two other components,
Lx , Ly . A consequence of the uncertainty principle is that the position and momentum
of a particle cannot be known (measured) simultaneously. If all three components of
the angular momentum were measurable and the magnitude of L was known as well,
then the position and momentum of the electron in the atom would be determined
simultaneously in violation of this fundamental postulate of the quantum theory.
lar structures can be explained on the basis of the linear combination of orbitals. For
example the methane CH4 exhibits a spatial structure with the carbon atom at the
center and the four hydrogen atom at the vertices of an equilateral tetrahedron. This
structure is not explained by the p-orbitals of the central carbon atom alone but is
due to a further orbital mixing. The l = 0 s-orbital combines with the three p-orbitals
to produce the tetrahedral structure. This further mixing of s- and p-orbitals is called
hybridization because it involves linear combinations of solutions to the Schrödinger
equation with different angular momenta. As such these “hybridized” orbitals are not
pure quantum angular momentum states, but they nevertheless appear in nature be-
cause the hybridization lowers the energy of the molecule. Highly symmetric molecules
like methane CH4 or sulfur hexafluoride SF6 do not exhibit dipole absorption or emis-
sion between rotational or vibrational states because they have no permanent dipole
moment. However, a simple substitution of a chlorine atom for one of the hydrogen
atoms in methane, for example, yields chloromathane CH3 Cl which does have a per-
manent dipole moment. Rotation and vibration of the molecular structure results in
oscillation of the dipole which in turn permits rotation-vibration transitions through
a dipolar interaction with incident light. We will examine in more detail this type
of molecular spectra for simple diatomic molecules when we discuss the rigid-rotor,
harmonic oscillator model of molecular spectra in Section 5.9.
the Planck relation between quantized energy and frequency (see Complement D at the
end of this chapter). For the case of the hydrogen atom, the label n can be considered
184 Radiation in Classical and Quantal Atoms
Fig. 5.3 Plots of the p-orbitals and three of the d-oribitals showing their spatial directions
and orientations.
with
~ω0 = ~ (ω2 − ω1 )) = E2 − E1
We now add the V̂ (t) term, the interaction energy of the atom with the driving field
with frequency close to ω0 .
Interaction of Light with the Hydrogen Atom 185
Now, if we substitute the time-dependent solution, Eq. 5.103, back into the time-
dependent Schrödinger equation, Eq. 5.98, multiply from the left with the complex
conjugate, ψ1∗ eiωt , and integrate over all space, we obtain
Z Z
dC1
C1 ψ1∗ V̂ ψ1 dr + C2 e−iω0 t ψ1∗ V̂ ψ2 dr = i~ (5.104)
dt
To simplify notation we write the integral “matrix elements” ψ1∗ V̂ ψ1 dr and ψ1∗ V̂ ψ2 dr
R R
Atom-field coupling operator. We now turn our attention to the details of V̂ (t). This
term is an interaction energy operator, and from the classical theory of electrostatics
we know that the leading interaction V between an electric field and neutral matter
is the dipole term.
V = −p · E (5.107)
where p is the dipole moment and E is the electric field of, say, a propagating elec-
tromagnetic wave through a polarizable material. The classical dipole itself is defined
as X X
p= pi = qi ri (5.108)
i i
where qi is the charge at position ri with respect to the origin of the coordinate sysem
(see Section ??). In the case of a classical atom with one electron orbiting a massive
proton, the instantaneous classical dipole can be taken as
186 Radiation in Classical and Quantal Atoms
p = −er (5.109)
where e is the charge on the electron and r is the radial position of the electron with
respect to the origin, centered at the proton. If an external E-field is now applied to
the classical atom, the interaction energy is given by
V = −p · E = e r · E (5.110)
Passing to the quantal atom, we use the correspondence principle to write the r and
p operators.
r → r̂ (5.111)
p → p̂ (5.112)
but we do not quantize the E-field. The rate of atomic absorption and emission we will
obtain is based on this semiclassical approximation in which the atom is quantized
but the field remains classical. We write the dipole interaction operator as
V̂ = −p̂(r) · E (5.113)
= e r̂ · E (5.114)
where r̂ = r is the electron radial coordinate centered at the proton. The operator V̂
exhibits odd parity with respect to this electron coordinate so that the matrix elements
V11 and V22 in Eqs. 5.105, 5.106 will vanish. Only atomic states of opposite parity can
be coupled by the dipole interaction. For the E-field we write the real form
E = êE0 cos(ωt − kz)
were ê is the polarization unit vector, E0 the field amplitude, ω the frquency, and k the
propagation parameter. We have the chosen the field to be propagating along z. The
interaction takes place over the spatial extent of the atom, ∼ a0 , which is only about
0.5 nm. For electromagnetic fields ranging in wavelength from the near ultraviolet to
the naear infrared, the wavelength is hundreds of nanometers. Therefore, the kz term in
the field argument is negligible and the E-field can be considered effectively constant
over atomic dimensions. The absolute interaction matrix element can therefore be
written
|V12 (t)| = |V12 | cos ωt
Z
∗
= ψ1 (r)erψ2 (r) dV E0 cos ωt
where the primes on the quantum number labels indicate a final state different from
the intial state. This dipole matrix element can be simplified by factoring the wave-
functions into their r, θ, ϕ constitutents and resolving the vector dipole moment into
its x, y, z components.
Z
∗
p12 = e rRnl (r)Rn0 l0 r2 drPl∗ (cos θ)Pl0 (cos θ)e−iml ϕ eiml0 ϕ sin θ dθ dϕ (5.118)
pz = r cos θ (5.119)
px = r sin θ cos ϕ (5.120)
py = r sin θ sin ϕ (5.121)
Since the integral over the period of any oscillatory exponential function with imagi-
nary argument is zero, the only way this factor can be nonzero is if the argument itself
is zero. Therefore we have the first seclection rule
Next, still considering pz , we see that that we will have another factor in transition
moment integral
Z π
pz ∼ Pl∗ (cos θ)Pl0 (cos θ) cos θ sin θ dθ (5.124)
0
Now, as we show in Apppendix F, the Legendre functions Pl (cos θ) have definite parity.
Functions labeled with even l are unchanged (even) with respect to the parity operation
and functions labeled with odd l exhibit odd parity (change sign). The parity operation
188 Radiation in Classical and Quantal Atoms
is sign reversal of the function variable. In the case of the Legendre functions the
variable is cos θ. The parity operation is
The integrand of the integral in Eq. 5.124 must be even with respect to θ over the
limits of integration. Since the factor cos θ sin θ exhibits odd parity, the product Pl Pl0
must also be odd so as to render the overall integrand even. Therefore we see that
∆l = 0 is a “forbidden” transition and ∆l = ±1 is allowed, or at least not forbidden,
by the integrand symmetry. Turning now to px , py , we will have factors
Z 2π
px ∼ e−iml ϕ cos ϕeiml0 ϕ dϕ (5.126)
0
Z 2π
py ∼ e−iml ϕ sin ϕeiml0 ϕ dϕ (5.127)
0
and since cos ϕ and sin ϕ are linear combinations of eiϕ and e−iϕ , it is clear that in
order to avoid a null integral another selection rule for these two components of the
transition moment is
∆ml = ±1 (5.128)
More generally we have to consider the overall parity operation that transforms the
dipole vector er to −er. In polar coordinates this parity inversion is accomplished by
r → −r (5.129)
θ →π−θ (5.130)
ϕ→ϕ±π (5.131)
We see from Eq. 5.118 that the radial part of the integrand has odd parity and therefore
the product of the Legendre functions must show odd parity as well. Therefore l must
differ from l0 by ±1 or we have the selection rule
∆l = ±1 (5.132)
Summarizing, the selection rules for allowed dipole transitions in the hydrogen atom
are
∆l = ±1 (5.133)
∆ml = ±1, 0 (5.134)
The various allowed atomic transitions are often summarized in a schematic chart
called a Grotrian diagram. Figure 5.4 shows a Grotrian diagram for some of the
Interaction of Light with the Hydrogen Atom 189
Fig. 5.4 Grotrian diagram for nl energy levels of the hydrogen atom. The arrows indicate
some of the allowed transitions. Even with the restrictive selection rule ∆l = ±1, the com-
plicated cascading transition from high-lying energy states to lower-lying states is evident.
transtions in the hydrogen atom. The notation for the levels derives from atomic
spectroscopy. The prinicipal quantum number n is indicated as n = 1, 2, 3, 4, . . . but
the l quantum number is assigned a letter s, p, d, f, . . .. The letters indicate progres-
sions of spectroscopic lines that have some common property. The quantum number
l = 1 is labeled s for “sharp” lines. The l = 2 levels are labeled p for “principal” lines,
l = 2 → d for “diffuse” lines.
Calculation of atomic absorption and emission rates. Now let us go back to the
coupled equations 5.105, 5.106 and rewrite them in terms of the Rabi frequency and
ω0 , the frequency difference between the ground and excited states.
dC1
Ω0 cos ωt e−iω0 t C2 = i (5.135)
dt
dC2
Ω∗0 cos ωt eiω0 t C1 = i (5.136)
dt
We take the initial conditions (at t = 0) to be unit probability of finding the two-state
system in state 1 and null probability for finding it in the excited state 2. At later
2
times the probability for finding the system in state 2 is P = |C2 (t)| . At early times
the rate of increase of this probability is
190 Radiation in Classical and Quantal Atoms
2
dP |C2 (t)|
= (5.137)
dt t
so our task is to calculate C2 (t) at times just later than t = 0. Applying the inital
conditions, it is not difficult to show that
Ω∗0 1 − ei(ω0 +ω)t 1 − ei(ω0 −ω)t
C2 (t) = + (5.138)
2 ω0 + ω ω0 − ω
If the frequency ω of the driving wave approaches the two-state difference frequency
ω0 it is quite clear that the first term in Eq. 5.138 will be negligible compared to the
second. Near resonance we can therefore write to good approxiamtion
Ω∗0 1 − ei(ω0 −ω)t
C2 (t) ' (5.139)
2 ω0 − ω
We have therefore
2 2 sin2 [(ω0 − ω)(t/2)]
|C2 (t)| = |Ω0 | 2 (5.140)
(ω0 − ω)
and when ω → ω0 , around resonance, we obtain a limiting expression
2 1 2
|C2 (t)| = |Ω0 | t2 (5.141)
4
Thus with ω near ω0 and at early times under weak-field conditions, the probability
rate for finding the system in the excited states increases linearly with time.
dP 1 2
= |Ω0 | t (5.142)
dt 4
5.7.3 Finite spectral width of absorption
The incident driving field always has some spectral width. The source might be, for
example, a broad band arc lamp or the output from a monochromator coupled to
a synchrotron. Therefore we need to write the field eneergy as an integral over the
spectral energ density of the excitation source in the neighborhood of the transition
frequency.
Z ω0 + 21 ∆ω
1 2
0 E0 = ρω dω (5.143)
2 ω0 − 12 ∆ω
where ρω is the spectral energy density of the excitation source with a line width
of ∆ω. Note that the MKS units of spectral energy density are Jm−3 s. Now from
Eq. 5.140 we can write
2
sin2 [(ω0 − ω)t/2]
2 p12 E0
|C2 (t)| = 2 (5.144)
~ (ω0 − ω)
and substituting from Eq. 5.143 we have
ω0 + 12 ∆ω
2p212
Z
2
|C2 (t)| = ρω dω (5.145)
0 ~2 ω0 − 21 ∆ω
For conventional broad-band excitation sources we can safely assume that the spectral
density is constant over the line width of the atomic transition and replace ρ(ω) in the
Interaction of Light with the Hydrogen Atom 191
integrand with ρ(ω0 ) outside it. Assuming a continuous wave (cw) excitation source,
the integral can be evaluated
ω0 + 12 ∆ω
sin2 [(ω0 − ω)t/2]
Z
πt
2 dω = (5.146)
ω0 − 21 ∆ω (ω0 − ω) 2
So finally the expression for the time rate of increase of the probability of finding
the system in state 2 (i.e. the radiation absorption rate), integrated over the spectral
width of the source, is
2
dP |C2 (t)| πp212
= = ρ(ω0 ) (5.147)
dt t 0 ~2
5.7.4 Comparison to Einstein B coefficient
Now from Complement D we know that the Einstein A and B coefficients represent
spontaneous emission and stimulated absorption under conditions of equilibrium be-
tween a radiation field and a collection of material absorber-emitters. These coefficients
were introduced into phenomenological rate equations and originally had no underlying
interpretation of just how the light was absorbed or emitted. However, by recoginizing
that the Einstein absorption rate, B12ρω , is equivalent to the semiclassical dP/dt we
can write the B coefficient in terms of the quantum mechanical dipole matrix element
p12 . More succinctly
dP πp212
= ρ(ω0 ) = B12 ρ(ω0 ) (5.148)
dt 0 ~2
or
πp212
B12 = (5.149)
0 ~2
The p12 that appears in Eq. 5.148 has a specific orientation with respect to the po-
larization of the exciting field. Equation 5.115 shows the angle θ of p12 with respect
to the E-field of the exciting light is zero. However the Einstein B coefficient assumes
isotropic radiation, so in order to really make the comparison of B and p12 meaningful
we have to average the right side of Eq. 5.149 over all angles. The average value of the
square of the dipole moment matrix element is given by
1 2
p212 = p212 cos2 θ = p (5.150)
3 12
So, finally we have
πp212
B12 = (5.151)
30 ~2
We have restricted this development to the consideration of an artificial atom, the
two-level system without any substructure or energy-level degeneracy. No real atom
corresponds to this simple scheme. Even the simplest atom, hydrogen, with one elec-
tron and one proton, exhibits orbital angular momentum degeneracy in the excited
p level. However it is a simple matter to take degeneracy into account in writing the
expressions for B12 , B21 and A21 , the stimulated absorption, stimulated emission, and
192 Radiation in Classical and Quantal Atoms
The rationale for this expression comes from the interpretation of the product of a
wavefunctions and its complex conjugate as a probability density P.
P = Ψ∗ (r, t)Ψ(r, t)
Assuming the wavefunction is normalized,
Z Z
P dV = Ψ∗ (r, t)Ψ(r, t) dV = 1
and the average value of the dipole moment is the integral of this differential probability
over all space.
Z
∗ ∗
C1 ψ1 (r)eiω1 t + C2∗ ψ2 eiω2 t (−er)·
hp(t)i = −e hr(t)i =
C1 ψ1 (r)e−iω1 t + C2 ψ2 (r)e−iω2 t dV
Z
= 2 cos(ω0 t) C1 C2∗ ψ1∗ (r)(−er)ψ2 (r) dV
Fig. 5.5 Rigid rotor model of a heteronuclear diatomic molecule. Internuclear separation
r is considered constant. Mass m2 > m1 . Permanent electric dipole is aligned along r and
rotates with the molecule.
the plane of rotation reveals an effective dipole oscillation and therefore the possibility
of absorption and emission of diople radiation. Note that this dipolar oscillation is
due to nuclear motion; not the electron orbital motion. The nuclei of molecules move
much more slowly than the electrons in the charge density binding the nuclei togther.
Because of this great difference between effective nuclear and elecron velocities, their
motions are, in many cases, effectively decoupled. The energy of the molecule can then
be calculated as the sum of the rotational, vibrational, and electronic energy; and we
can write down a Schrödinger equation for each type of motion, solve them separately
and write the total wave function as a product of three. Thus
where
m1 m2
r1 = r and r2 = r (5.167)
m1 + m2 m1 + m2
196 Radiation in Classical and Quantal Atoms
Fig. 5.6 Energy level diagram of the rigid rotor eigen states.
In the realm of molecular spectroscopy the energy levels are often classified ac-
cording to terms, by which is meant that the transition energy is expressed in units of
reciprocal length. Thus Eq. 5.179 is expressed as
Erot (J) h
F (J) = = J(J + 1) = BJ(J + 1) (5.181)
hc 8π 2 cI
where h, c have their usual meanings and B is called the rotational constant. It charac-
terizes the rotational energy in units of reciprocal length. Traditionally in spectroscopy
the conventional unit is cm−1 , called a “wavenumber”. Pure rotational transitions
typically have rotational constants of some tens of cm−1 . For example the rotational
constant of hydrogne iodide, HI, is about 6.5 cm−1 and that of H2 is about 61 cm−1 .
Fig. 5.7 Harmonic oscillator model of molecular vibration along the internuclear axis r. If
m1 6= m2 then a charge dipole exists along the axis, the amplitude of which will vary with
the oscillation of r.
1 2
V (r) = kr (5.182)
2
where k is the force constant given by
k = µω 2 (5.183)
with ω the angular frequency of oscillation. Once again we use the correspondence
principle to form the Hamiltonian operator.
∂
p → p̂ = −i~ (5.184)
∂r
r → r̂ = r (5.185)
2
p̂ 1
Evib → Ĥvib = + kr̂2 (5.186)
2µ 2
The harmonic oscillator Schrödinger equation then becomes
h i
Ĥvib − Evib ψvib = 0 (5.187)
Fig. 5.8 Energy level diagram showing allowed energies of the quantum harmonic oscillator.
Note that the ground state energy E0 = 1/2~ω.
√ 2
ψvib = Nnorm Hv ( αr)e−µω/~x (5.189)
√
where Nnorm = 1/ 2v π 1/2 v! is a nomalization factor and α = µω/~ is a scaling factor
for the coordinate r. Figure 5.8 shows the equally spaced ladder of energies Evib = ~ω
starting from the ground state with zero-point energy 1/2~ω.
In spectroscopy the energy of vibrational transitions are expressed as vibrational
terms analogous to the spectroscopy of pure rotations. The vibrational terms are
expressed as
Evib 1
G(v) = =ω v+ ; v = 0, 1, 2, . . . (5.190)
hc 2
where ω means the vibrational energy, usually reported in units of cm−1 . This vibra-
tional ω should not be confused with the angular frequency of some wave or transition.
Unfortunately spectroscopic notation became entrenched long before the modern as-
sociation of ω with a unit of angular frequency. The vibrational transitions of diatomic
molecules exhibit much greater energy than rotational transitions. For example the vi-
brational termG(v) in HI is 2309 cm−1 , factor of about 350 greater than the rotational
term.
5.9.3 Electronic states of molecules
In addition the rotational and vibrational transitions, arising from the motion of per-
manent dipoles, molecules can undergo electronic transitions, somewhat analogous to
200 Radiation in Classical and Quantal Atoms
the transitions we have studied in the hydrogen atom. Diatomic molecules have lower
symmetry (cylindrical) than atoms (spherical), and therfore the great simplification of
variable separation is not so easy to carry out. Nevertheless one can characterize molec-
ular orbitals and assign symmetry quantum number to electronic states. The proper
study of molecular structure and spectroscopy is outside the scope of this book, but
suffice it to state that molecular electronic transitions do occur and their energies are
again much greater than pure vibrational or rotational transitions. Continuing with
the example of HI, the first electronic transition between molecular bound states is the
X → B transition. Here X indicates a molecular ground electronic state and B labels
a bound excited electronic state. The electronic spectroscopic term for this transition
is about 5.6 × 103 cm−1 , about 25 times greater than the vibrational term. Hierarchy
of Transitions Rotation, vibration, and electronic transitions follow an energy hierar-
chy from relatively low to high. Table 5.5 summarizes the typical energies for these
dipole transitions. Rotational spectroscopy can be studied with microwave technology,
but pure vibrational transitions (at least for “typical” diatomic molecules) fall in an
awkward part of the electromagnetic spectrum. Radiaton sources and detectors in the
THz (1012 Hz and far infrared (IR) regimes are not, at present, a well-developed and
readily available technology. Electronic spectroscopy, in contrast, can be studied with
a vast array of sources, optical dispersion instrumentation, and detectors. Of course
when a molecule absorbs or emits light in the visible-ultraviolet range it undergoes a
“rovibronic” transition in which all three types of internal molecular motion change.
Figure 5.9 shows schematically how these rovibronic transitions couple lower states
to higher states. The rotation-vibration motion gives rise to underlying manifolds of
transition lines, riding on top of the electronic transition. The amplitude and spacing
of the transition lines within the manifold provides information on vibrational and
rotational motion.
5.10 Exercises
1. Calculate the average value of the electron radius for the hydrogen atom in its
ground state, R10 . Use the expression from Table 5.1.
2. For the same state, calculate the maximum probability for finding the electron at
some distance from the proton.
3. Explain why the average value for the electron radius and the most probable value
for the electron radius are not equal.
4. Given the ground state and first excited states of the hydrogen as
1 1
ψ100 = √ 3/2 e−r/a0
πa
0
Further Reading 201
Fig. 5.9 Schematic of rovibronic transitions where radiation connects a ground-state mani-
fold of vibration-rotation states to an excited-state manifold. Separations between states are
not to scale.
and
1 1
ψ210 = √ re−r/2a0 cos θ
π (2a0 )5/2
show that
215/2 ea0
p12 =
35
where e is the charge on the electron and a0 is the Bohr radius.
5. The energy level spacing for this first transition in hydrogen is called the Ly-
man α transition. The wavelength at line center is λ = 121.7 nm. Calculate the
spontaneous emission rate A21 (s−1 ) for this transition.
202
D
Classical Blackbody Radiation
∇·E=0
Harmonic waves standing-wave solutions factor into oscillatory temporal and spatial
terms. Now, respecting the boundary conditions for a three-dimensional box with sides
of length L, we have for the components of E
Fig. D.1 Mode points in k space. Left panel shows one-half the volume surrounding each
point. Right panel shows one-eighth the volume of spherical shell in this k space.
πn
kx = n = 0, 1, 2, ...
L
and similarly for ky , kz . Notice that the cosine and sine factors for the Ex field
component show that the transverse field amplitudes Ey , Ez have nodes at 0 and L as
they should and similarly for Ey and Ez . In order to calculate the mode density, we
begin by constructing a three-dimensional orthogonal lattice of points in k space as
π
shown in Fig. D.1. The separation between points along the kx , ky , kz axes is L , and
the volume associated with each point is therefore
π 3
V =
L
Now the volume of a spherical shell of radius |k| and thickness dk in this space is
4πk 2 dk. However, the periodic boundary conditions on k restrict kx , ky , kz to positive
values, so the effective shell volume lies only in the positive octant of the sphere. The
number of points is therefore just this volume divided by the volume per point:
1 2
8 4πk dk 1 3 k 2 dk
Number of k points in spherical shell = = L
π 3 2 π2
L
Remembering that there are two independent polarization directions per k point, we
find that the number of radiation modes between k and dk is,
k 2 dk
Number of modes in spherical shell = L3 (D.3)
π2
and the spatial density of modes in the spherical shell is
Planck mode distribution 205
dρ(k) k2
= ρk = 2
dk π
and therefore the mode number as
k2
ρk dk = dk
π2
with ρk as the mode density in k - space. The expression for the mode density can be
converted to frequency space, using the relations
2π 2πν ω
k= = =
λ c c
and
dν c
=
dk 2π
Clearly
ρν dν = ρk dk
and therefore
8πν 2 dν
ρν dν =
c3
The density of oscillator modes in the cavity increases as the square of the frequency.
Now the average energy per mode of a collection of oscillators in thermal equilibrium,
according to the principal of equipartition of energy, is equal to kB T , where kB is the
Boltzmann constant. We conclude therefore that the energy density in the cavity is
8πν 2 kB T dν
ρRJ
E (ν)dν = (D.4)
c3
which is known as the Rayleigh–Jeans law of blackbody radiation; and, as Fig. D.2
shows, leads to the unphysical conclusion that energy storage in the cavity increases
as the square of the frequency without limit. This result is sometimes called the “ul-
traviolet catastrophe” since the energy density increases without limit as oscillator
frequency increases toward the ultraviolet region of the spectrum. We achieved this
result by multiplying the number of modes in the cavity by the average energy per
mode. Since there is nothing wrong with our mode counting, the problem must be in
the use of the equipartition principle to assign energy to the oscillators.
Fig. D.2 Left panel: Rayleigh–Jeans blackbody energy density distribution as a function
of frequency, showing the rapid divergence as frequencies tend toward the ultraviolet (the
ultraviolet catastrophe). Right panel: Planck blackbody energy density distribution showing
correct high-frequency behavior.
This probability distribution Pi comes from statistical mechanics and can be written
∞
in terms of the Boltzmann factor e−i /kB T and the partition function q = e−i /kB T :
P
i=0
e−i /kB T
Pi =
q
Now, Planck suggested that instead of assigning the average energy kB T to every oscil-
lator, this energy could be assigned in discrete amounts, proportional to the frequency,
such that
1
i = n i + hν
2
where ni = 0, 1, 2, 3 . . . and the constant of proportionality h = 6.626 × 10−34 J·s. We
then have
ni
e−hν/2kB T e−ni hν/kB T e−hν/kB T
Pi = ∞ = P
∞ ni (D.5)
e−hν/2kB T e−ni hν/kB T e−hν/kB T
P
ni =0 ni =0
ni
= e−hν/kB T 1 − e−hν/kB T (D.6)
∞ ni
e−hν/kB T = 1/ 1 − e−hν/kB T . The average
P
where we have recognized that
ni =0
energy per mode then becomes
∞
X
ε̄ = Pi i =
i=0
∞ ni
X hν
e−hν/kB T 1 − e−hν/kB T (ni )hν = (D.7)
ni =0
ehν/kB T −1
The Einstein A and B coefficients 207
and we obtain the Planck energy density in the cavity by substituting ε̄ from Eq. D.7
for kB T in Eq. D.4
8πh 3 1
ρP l
E (ν)dν = 3
ν hν/k T dν (D.8)
c e B −1
This result, plotted in Fig. D.2, is much more satisfactory than the Rayleigh–Jeans
result since the energy density has a bounded upper limit and the distribution agrees
with experiment.
dN1 dN2
=− = −N1 B12 ρω + N2 B21 ρω + N2 A21 (D.9)
dt dt
dN1
At thermal equilibrium we have a steady-state condition dt = − dN
dt = 0 with
2
ρω = ρth
ω so that
A
ρth
ω = 21
N1
N2 B12 − B21
and the Boltzmann distribution controlling the distribution of the number of atoms
in the lower and upper levels,
N1 g1
= e−(E1 −E2 )/kT
N2 g2
where g1 , g2 are the degeneracies of the lower and upper states, respectively. So
A
21
A21 B21
ρth
ω = = (D.10)
g1 ~ω0 /kT g1 ~ω0 /kT B12
g2 e B12 − B21 g2 e B21 − 1
208 Blackbody Radiation
But this result has to be consistent with the Planck distribution, Eq. D.8:
8πh 3 1
ρP l
E (ν) dν = ν dν (D.11)
c3 0 ehν0 /kB T − 1
~ 1
ρP l 3
E (ω) dω = 2 3 ω0 ~ω0 /kB T dω (D.12)
π c e −1
Therefore, comparing these last two expressions with Eq. D.10, we must have
g1 B12
=1 (D.13)
g2 B21
and
A21 8πh
= 3 ν03
B21 c
or
A21 ~ω 3
= 2 03
B21 π c
These last two equations show that if we know one of the three rate coefficients, we
can always determine the other two.
It is worthwhile to compare the spontaneous emission rate A21 to the stimulated
emission rate B21
A21
= e~ω0 /kT − 1
B21 ρth
ω
which shows that for ~ω0 much greater than kT (visible, UV, X - ray), the spontaneous
emission rate dominates; but for regions of the spectrum much less than kT (far IR,
microwaves, radio waves), the stimulated emission process is much more important. It
is also worth mentioning that even when stimulated emission dominates, spontaneous
emission is always present.
Appendices
209
A
Systems of Units in
Electromagnetism
and would have units of l2 /qt2 , where q is some unit of electrical charge. The number of
basic units is now four: mass, length, time, and electric charge. We could also decrease
the number of units by setting some constants equal to unity and without dimensions.
For example setting me = e = ~ = 1 is such a choice, sometimes called atomic units,
and convenient in atomic and molecular quantum mechanical expressions.
Here is another example: The volume can be defined as function of the length unit,
V = kl3 (A.4)
where k is an arbitrary constant, usually set equal to unity. But suppose we had some
problem dealing only with spherical volumes,
4
V = k πr3 (A.5)
3
3
We could define k to be 4π which would make the volume read
Coulomb’s Law 211
V = r3 (A.6)
in some system of ’rationalized’ units.
Now electromagnetic (E-M) theory differs from classical mechanics in that a fun-
damental constant, ”the speed of light”, c appears explicitly. This constant has di-
mensions, l/t, and if these units are changed, the appearance of E-M expression will
change as well. We will see how the speed of light makes it’s appearance by comparing
electrostatic and magnetostatic forces.
We can use Coulomb’s law to write the electric field as ’force per unit charge’,
F q2
=E= 2 (A.8)
q1 r
Or more properly in vector form
Z
r̂ r̂
E=q → ρ(x)d3 x (A.9)
r3 V r3
where r̂ is the unit vector pointing in the E direction, and ρ(x) is the charge density.
Now from Gauss’s law we know
I Z
E · n da = 4π ρ(x)d3 x (A.10)
S V
Up to now we have only considered E-M fields in a vacuum, but when we generalize
to dielectric media the electric vector field E, becomes the displacement vector D and
is expressed as
D = 0 E + κP (A.13)
The vector field P is called the ’polarization’ and represents the density of dipole
moments in the material. The constants 0 and κ are proportionality constants. These
constants are not the same in esu and MKS units. In the esu system the displacement
vector field is
D = E + 4πP (A.14)
In this system 0 is chosen to be a pure number with unit value. This choice implies
that D, E, and P all have the same dimensions.
Eqs. A.7 and A.19 we can do a dimensional analysis of the ratio, K1 /K2 . The result
is that the ratio of the two constants has units of velocity squared (l/t)2 It turns
out from measurement that for equal electrostatic and magnetostatic forces the value
of this ratio is equivalent to the speed of light in vacuum squared, c2 . So we have
K1 /K2 = c2 .
The magnetic induction field B is defined as the ’force per unit current’ analogous
to the ’force per unit charge’ that defines the electric field E.
I
B = 2K2 (A.20)
d
However, Eq. A.20 is not quite the whole story. A defning relation between E and B
would imply that B is just proportional to the ’force per unit current’ so that strictly
speaking we should write
I
B = 2K2 K3 (A.21)
d
and then make some convenience argument about the choice of K3 . In order to really
set K3 correctly we need the equation that relates E to B. Such an equation exists. It
is Faraday’s law of induction.
∂B
∇ · E = −K3 (A.22)
∂t
where the ’permeability of free space’ µ0 has been introduced. The permeability serves
the same function for magnetic fields that permittivity serves for electric fields. The
difference between the Electrostatic system and the Gaussian system is in the the
choice of K3 . In the Electrostatic system of units K3 = 1 while in the Gaussian
system K3 = 1/c. The Heaviside-Lorentz system is similar to the Gaussian except
it is “rationalized” thereby eliminating factors of 4π that often appear in the field
equations of the Gaussian, Electrostatic, and Electromagnetic systems.
The magnetic fields in matter, analogous to Eqs. A.14 and A.15, are
1
H = c2 B − 4πM (esu) and H= B − M (MKS) (A.25)
µ0
The vector field H is, strictly speaking, the magnetic field while B should always
referred to as the magnetic induction field. Obviously the magnetic field H plays an
analogous role to the displacement field D in the macroscopic equations of electrostat-
ics. The vector field M is called the ’magnetization’ and plays a role analogous to P,
the polarization. Note that neither in the esu system nor in the MKS system does B
have the same units as H.
Note finally that the choice of K2 = µ0 /4π in the MKS system implies that
1
= c2 (A.26)
0 µ0
Many of the quantities, symbols, and units (in the rationalized MKS system) are
compiled in TableA.2.
B
Review of Vector Calculus
B.1 Vectors
A spatial vector is essentially a geometric entity comprising a magnitude (or length)
and a direction. A scalar is some entity characterized only by magnitude. Many physi-
cal quantities can be represented by scalars and vectors. As an example Fig. B.1 shows
an object of mass m moving in a curved trajectory with velocity v and subject to a
force F. The acceleration ac points toward the origin of the central force. The distance
from the object to the the force center is R. In this example m and R are scalars with
only one number (amplitude) associated with these physical quantities. The vector
quantities, v, F, ac have two associated properties: the amplitude and the direction.
The geometric entity of the vector can be represented by the coordinates of the two
end points. If the coordinate system is transformed to some other, the vector remains
unchanged but its representation changes with the coordinate values of the endpoints.
For example, the direction can be represented by two numbers along a line in one-
dimensional (1-D) space, two pairs of numbers in two-dimensional (2-D) space, and a
pair of three numbers in a three-dimensional (3-D) space. Very often the “tail” of the
vector is referred to the origin of the coordinate system. The 3-D velocity vector, for
example, in Cartesian coordinates takes on the form
v = vx x̂ + vy ŷ + vz ẑ (B.1)
Fig. B.1 Examples of vector quantities and a scalar quantities. Velocity v, force F, and
acceleration ac are vectors. The distance to the origin R and the mass m are scalers.
216 Vector Calculus
where the unit vectors in the x, y, z directions are x̂, ŷ, ẑ and the magnitude of the vec-
tor components in theq x, y, z directions are vx , vy , vz . The overall length or amplitude
of the vector is |v| = vx2 + vy2 + vz2 . A vector can also be expressed by a column of
numbers, each element corresponding to the component along a Cartesian coordinate
axis.
vx
v = vy (B.2)
vz
A vector can be rotated within a coordinate space or alternatively the coordinate axes
themselves can be rotated (transformed so that the vector is represented in a new
coordinate space. The rotation operation can be represented by matrix. For a 2-D
counterclockwise rotation we have, for example,
v0
cos θ − sin θ v
0
v = x0 = · x (B.3)
vy sin θ cos θ vy
and Fig. B.2 shows the coordinate rotation. This type of coordinate transformation of a
vector is called a unitary transformation because the length of the vector is preserved.
Axioms of vector addition and scaler multiplication 217
This property is easy to see by taking the absolute value of the original vector and the
transformed vector. The length of the original vector is give by
1/2
|v| = vx2 + vy2 (B.6)
1/2
= vx2 + vy2 = |v| (B.7)
In 3-D we can represent the rotation matrix about the three Cartesian coordinate axes
as
1 0 0
Rx (θ) = 0 cos θ − sin θ (B.8)
0 sin θ cos θ
cos θ 0 sin θ
Ry (θ) = 0 1 0 (B.9)
− sin θ 0 cos θ
cos θ − sin θ 0
Rz (θ) = sin θ cos θ 0 (B.10)
0 0 1
These matrices rotate the vectors counterclockwise about each axis, assuming that the
rotation axis considered is pointing toward the reader.
Ax = A cos α ≡ A · x̂
Ay = A cos β ≡ A · ŷ
Az = A cos γ ≡ A · ẑ
We use the associative property to get a generalization of the dot product between
two vectors A and B. We can write B in terms of its components and the unit vectors
along the Cartesian axes.
B = Bx x̂ + By ŷ + Bz ẑ
Then
A · B = A · (Bx x̂ + By ŷ + Bz ẑ) (B.13)
But then invoking the associative property
Vector Multiplication 219
A · B = Bx A · x̂ + By A · ŷ + Bz A · ẑ
= B x Ax + B y Ay + B z Az
Therefore X X
A·B= B i Ai = Ai B i = B · A (B.14)
i i
and we see that the scaler product between two vectors is commutative as well as
associative. In the same way we defined Ax = A · x̂ = A cos α we could define the
projection of A onto the B direction as
AB = A cos θ ≡ A · B̂ = A · B/B
BA cos θ = A · B
B · A + C · A = (B + C) · A
A × B = −B × A (B.17)
The cross product is anticommutative. From the definition we can directly obtain basic
relations among the Cartesian unit vectors.
x̂ × ŷ = ẑ ŷ × ẑ = x̂ ẑ × x̂ = ŷ (B.18)
and
ŷ × x̂ = −ẑ ẑ × ŷ = −x̂ x̂ × ẑ = −ŷ (B.19)
It is also clear from the definition, Eq. B.16 that
x̂ × x̂ = ŷ × ŷ = ẑ × ẑ = 0 (B.20)
Notice that these relations can be generated by a cyclic commutation of x̂, ŷ, ẑ starting
from any one of them. In analogy to the scaler product we posit that the cross product
operation is distributive and associative. Thus
A × (B + C) = A × B + A × C (B.21)
(A + B) × C = A × C + B × C (B.22)
A × (sB) = sA × B = (sA) × B (B.23)
Now let us take A × B and decompose this expression into its Cartesian components.
We see that the Cartesian components of C are constructed from combining products
of the Cartesian components of A and B. Note also that these combinations commute
cyclically. The easiest way to generate the components of the vector product is by
writing a determinant and expanding it. The first row of the determinant consists
of the Cartesian unit vectors, x̂, ŷ, ẑ. The Cartesian components of A composes the
second row and the Cartesian components of B, the third.
x̂ ŷ ẑ
C = Ax Ay Az (B.26)
Bx By Bz
A · (B × C) = B · (C × A) = C · (A × B) (B.27)
A · (B × C) = −A · (C × B)
Associated with vector and scaler fields are three differential operations impor-
tant in physics and engineering: the gradient, the divergence, and (in 3-D) the curl
operations.
B.4.1 Gradient
If ϕ(x, y, z) is a scaler field then the gradient of ϕ is defined as
∂ϕ ∂ϕ ∂ϕ
∇ϕ(x, y, z) = x̂ + ŷ + ẑ (B.29)
∂x ∂y ∂z
Notice that the gradient of scaler field is a vector field consisting of the partial deriva-
tives of the scaler field pointing in the x, y, z directions. If we just considered the total
differential of ϕ in terms of its partial derivatives we have
∂ϕ ∂ϕ ∂ϕ
dϕ = dx + dy + dz (B.30)
∂x ∂y ∂z
In fact we can write this scaler differential as the scaler product of ∇ϕ and the vector
coordinate differential dr = dx x̂ + dy ŷ + dz ẑ.
dϕ = ∇ϕ · dr = |∇ϕ||dr| cos θ (B.31)
where the angle θ is the angle between the gradient and the coordinate vector. Clearly
in order for Eq. B.31 to be in accord with B.30, the angle θ = 0. If θ were any other
value, then the dot product in B.31 would be less than dϕ. So ∇ϕ as defined in Eq. B.29
points in the direction of maximum change in ϕ and its magnitude |∇ϕ| yields the
greatest rate of change (slope) along this direction.
The ∇ = ∂/∂x x̂ + ∂/∂y ŷ + ∂/∂z ẑ is a vector operator which acts on a scaler
field to produce a vector field. The most familiar example of the use of the gradient
in electromagnetism is the relation between an electrostatic field E and an electrical
potential V (x, y, z).
E = −∇V
The gradient operator is sometimes called “grad” or “del” and the gradient operation
on a scaler field ϕ is termed “grad ϕ” or “del ϕ”. The ∇ symbol itself is called the
“nabla”.
B.4.2 Divergence
The divergence operation is the dot product of the vector gradient operator with a
vector field. Suppose we have a vector field ϕ. The result of the divergence operation
is
∂ ∂ ∂
∇·ϕ= x̂ + ŷ + ẑ · [ϕx x̂ + ϕy ŷ + ϕz ẑ]
∂x ∂y ∂z
∂ϕx ∂ϕy ∂ϕz
= + + (B.32)
∂x ∂y ∂z
Notice that the divergence operation results in a scaler field because it is a “dot prod-
uct” operation between the vector operator ∇ and the vector field ϕ. The divergence
Vector Fields 223
Fig. B.6 Panel (A) shows a vector field with very high negative divergence. Vector ampli-
tudes decrease with distance from origin. Panel (B) shows a constant vector field along x
which exhibits null divergence.
of a vector field is a measure of how much the field increases or diverges along the field
coordinates. A vector field radiating from a point in all directions has a very high di-
vergence. A vector field whose magnitude is invariant along some coordinate has a null
divergence. A positive (negative) divergence denotes an increasing (decreasing) field.
This geometrical representation of divergence is shown in Fig. B.6. The divergence of
vector field is sometimes written as div ϕ. The most familiar example of the use of the
divergence in electromagnetism is the differential form of Gauss’s law,
ρ
∇·E= (B.33)
0
where E is the electric field and ρ the electric charge density.
B.4.3 Curl
The vector product of ∇ and a vector field ϕ is another vector field χ, called the curl
of ϕ.
χ=∇×ϕ (B.34)
From the determinant rule for cross product of two vectors we can write
x̂ ŷ ẑ
∂ ∂ ∂
χ= ∂x ∂y ∂z
(B.35)
ϕx ϕy ϕz
Expanding the determinant we have explicitly
∂ϕz ∂ϕz ∂ϕz ∂ϕx ∂ϕy ∂ϕx
χ= − x̂ − − ŷ + − ẑ (B.36)
∂y ∂y ∂x ∂z ∂x ∂y
224 Vector Calculus
Fig. B.7 A vector field circulating anticlockwise in the x − y plane producing a curl field
along z but no divergence.
The curl of a vector field is measure of its “rotation” around an axis perpendicular
to the plane of rotation. A vortex field has a very large curl while a field emanating
radially from a point in all direction has no curl. Figure B.7 shows a field of all curl
and null divergence. The explicit expression for this vector field is
y x
V(x, y) = − p î + p ĵ (B.37)
2
x +y 2 x + y2
2
Figure B.8 shows a vector field with finite divergence but null curl along the x = y
diagonals. The expression for this vector field is
y x
V(x, y) = p î + p ĵ (B.38)
x2 + y2 x2 + y2
One can show that the divergence of the curl of vector field is null
∇ · [∇ × (∇ϕ)] = 0 (B.39)
∇ × (∇ϕ) = 0 (B.40)
Fig. B.8 A vector field with a null curl at the origin and along the x = y diagonals but with
finite divergence for all x, y except at the point x = 0, y = 0 where the divergence is null.
convenient choice, depending on the symmetry of the problem. Two commonly encoun-
tered curvilinear coordinate systems in physics are cylindrical and polar coordinates.
The expressions for grad, div, and curl in these coordinates is discussed in Appendix
C.
Equation B.41 says that the line integral of the vector field ∇ϕ using the path dr is
equal to the difference between the scaler field values at the two end points. Since this
difference does not depend on the path, the result of the line integral is independent
of the the path taken; and in particular
I
∇ϕ · dr = 0 (B.42)
for any closed loop since for these the ending point is identical to the starting point.
The integral over a path of the gradient of a scaler field is commonly encountered in
226 Vector Calculus
electrostatic problems where the task is to calculate the potential difference between,
say, two conductors that form an electric field between them. The potential difference
is given by Z 2 Z 2
V21 = V2 − V1 = − E · dr = − ∇V · dr (B.43)
1 1
Where D is the displacement vector field, E is the electric field emanating from the
charge density ρ enclosed by the surface boundary S of volume V . Note that the surface
differential element is a vector differential with direction pointing outward from the
boundary and perpendicular to it.
The line integral on the right is taken around the boundary of the surface integral on
the left. A convenient way to remember and visualize the geometric interpretation of
the curl integral is to think of Ampère’s law. In differential form it is
∇ × H = Jfree
where H is the magnetic field and Jfree is a free charge current density. The integral
form of this equation is, taking into account the curl theorem,
Z I Z
(∇ × H) · dS = H · dr = Jfree · dS = I (B.46)
surf surf
where I is the charge current running in a wire of cross section S. This relation can be
pictured as a loop of H-field surrounding a straight wire and generated by a current I
running through it.
C
Gradient, divergence and curl in
cylindrical and polar coordinates
Although Cartesian coordinates are the most familiar and serve many purposes, they
are not the only orthogonal coordinate system that can be used to define a set of basis
vectors. Often we can take advantage of the natural symmetry of a problem to express
vectors in some other orthogonal curvilinear coordinate system. The most common of
these are the cylindrical and polar coordinates because they are appropriate for many
practical problems.
In general we can expand a vector V in basis vectors of the Cartesian system or
some other system with basis vectors {q},
However, we cannot treat the position vector with this general rule. In this special
case
(C.2)
r = xx̂ + y ŷ + z ẑ cartesian coordinates (C.3)
r = ρρ̂ + z ẑ cylindrical coordinates (C.4)
r = rr̂ spherical coordinates (C.5)
In the general case we convert from the Cartesian system to another through the
relations
x = x(q1 , q2 , q3 )
y = y(q1 , q2 , q3 )
z = z(q1 , q2 , q3 )
∂x ∂y ∂y
dx = dq1 + dq2 + dq3
∂q1 ∂q2 ∂q3
and similarly for y and z. The differential of the position vector r in the Cartesian
basis is
dr = dx̂ + dŷ + dẑ
228 Curvilinear coordinates
and the square of the distance between two neighboring points can then be written
ds2 = dr · dr = dx2 + dy 2 + dz 2
But the differential of dr can also be expanded in the curvilinear coordinate system
∂r ∂r ∂r
dr = dq1 + dq2 + dq3 (C.6)
∂q1 ∂q2 ∂q3
and
X ∂r ∂r
ds2 = dr · dr = · dqi dqj (C.7)
ij
∂qi ∂qj
where we have identified gii = h2i . We see from Eq C.7 that hi is a scale factor such
that the differential length segment dsi can be written
∂r
dsi = hi dqi and = hi q̂i
∂qi
and from Eq. C.6 the position vector differential dr can be expanded in terms of these
scale factors in the curvilinear basis as
From the line segment dsi we can easily write the surface and volume elements σ, τ
in the curvilinear system.
and
dτ = ds1 ds2 ds3 = h1 h2 h3 dq1 dq2 d3
We can expand an area vector σ in the curvilinear basis set analogous to the position
vector r in Eq. C.8
For example we can take an ordinary vector quantity F and expand it in Cartesian
coordinates or in spherical coordinates.
The gradient in curvilinear coordinates 229
F = Fx x̂ + Fy ŷ + Fz ẑ (C.10)
F = Fr r̂ + Fθ θ̂ + Fϕ ϕ̂ (C.11)
The unit vectors r̂, θ̂, ϕ̂ in terms of the Cartesian unit vectors are
The differential length vector along the surface in spherical coordinates is then
dl = dr r̂ + r dθ θ̂ + r sin θ dϕ ϕ̂ (C.15)
and the differential volume element in spherical coordinates is obtained from the prod-
uct of ds1 ds2 ds3 with
The expressions for dr and dσ in Eqs. C.8, C.9 give us the tools to write vector line
and surface integrals in the curvilinear coordinates
Z XZ
V · dr = Vi hi dqi (C.20)
i
Z XZ
V · dσ = Vi hj hk dqj dqk j 6= k 6= i (C.21)
i
x̂ ŷ ẑ
∂ ∂ ∂
∇×V = ∂x ∂y ∂z
Vx Vy Vz
As with the gradient and divergence operations, the curl can also be written as the
limiting integral operation
R
dσ × V
∇ × V = R lim R
dτ →0 dτ
It can be shown that by using this integral limiting form and applying Stokes’ theorem
the curl operation can be written in curvilinear coordinates as
Figure C.1 shows the relation between the Cartesian coordinates x, y, z and the cylin-
drical coordinates, ρ, ϕ, z. The z coordinate is common to both systems. From Fig. C.1
it is evident that
p
ρ = (x2 + y 2 ) x = ρ cos ϕ y = ρ sin ϕ z=z
h1 = hρ = 1 h2 = hϕ = ρ h3 = hz = 1
232 Curvilinear coordinates
Fig. C.1 Cylindrical coordinates ρ, ϕ, z and unit vectors ρ̂, ϕ̂, ẑ and their relations to Carte-
sian coordinates and unit vectors, x, y, z, x̂, ŷ, ẑ.
Then from the general expressions for grad, div, curl, Eqs. C.25, C.27, C.29 we write
these operators in cylindrical coordinates as
∂ψ 1 ∂ψ ∂ψ
grad ∇ψ(ρ, ϕ, z) = ρ̂ + ϕ̂ + ẑ (C.30)
∂ρ ρ ∂ϕ ∂z
1 ∂(ρVρ ) 1 ∂Vϕ ∂Vz
div ∇·V = + + (C.31)
ρ ∂ρ ρ ∂ϕ ∂z
ρ̂ ρϕ̂ ẑ
1 ∂ ∂ ∂
curl ∇×V = ∂ρ ∂ϕ ∂z
(C.32)
ρ
Vρ ρVϕ Vz
We can also find the scalar Laplacian in cylindrical coordinates by applying the general
expression Eq. C.28
1 ∂2ψ ∂2ψ
2 1 ∂ ∂ψ
Laplacian ∇ ψ= ρ + 2 + (C.33)
ρ ∂ρ ∂ρ ρ ∂ϕ2 ∂z 2
Polar coordinates. Polar coordinates consist of three independent variables, r, θ ϕ
and their associated unit vectors, r̂, θ̂, ϕ̂. The limits on these variables are,
0≤r≤∞ 0≤θ≤π 0 ≤ ϕ ≤ 2π
Figure C.2 shows the relation between the Cartesian coordinates x, y, z and the polar
coordinates, r, θ, ϕ. From Fig. C.2 we see that
Expressions for grad, div, curl in cylindrical and polar coordinates 233
Fig. C.2 Polar coordinates r, θ, ϕ and unit vectors r̂, θ̂, ϕ̂ and their relations to Cartesian
coordinates and unit vectors, x, y, z, x̂, ŷ, ẑ.
p
r= x2 + y 2 + z 2 x = r sin θ cos ϕ y = r sin θ cos ϕ z = r cos θ
From the differential volume dV and inspection of Fig. C.2 we can identify the scale
factors h1 , h2 , h3 .
dV = dsr dsθ dsϕ = dr rdθ r sin θdϕ
hr = 1 hθ = r hϕ = r sin θ
Then from Eqs. C.25, C.27, C.29 we write expression for the grad, div and curl opera-
tors in polar coordinates
∂ψ 1 ∂ψ 1 ∂ψ
grad ∇ψ = r̂ + θ̂ + ϕ̂ (C.34)
∂r r ∂θ r sin θ ∂ϕ
∂r2 Vr
1 ∂ sin θVθ ∂Vϕ
div ∇·V = sin θ + r + r (C.35)
r2 sin θ ∂r ∂θ ∂ϕ
1 ∂ 2 ∂ψ ∂ ∂ψ
Laplacian ∇2 ψ = sin θ r + sin θ +
r2 sin θ ∂r ∂r ∂θ ∂θ
1 ∂2ψ
(C.37)
sin θ ∂ϕ2
∂ 2 ψ 2 ∂ψ
1 ∂ ∂ψ
Laplacian ∇2 ψ = + + sin θ +
∂r2 r ∂r r2 sin θ ∂θ ∂θ
1 ∂2ψ
(C.38)
sin θ ∂ϕ2
D
Properties of Phasors
D.1 Introduction
In Chapter 2, Section 2.10.1 we discussed the phasor form for electromagnetic prop-
agating fields consisting of a single-frequency harmonic time dependence factor e−iωt
and the spatially dependent amplitude modulation factor ei(k·r+ϕ) . The general form
of the field is
A(r, t) = A0 ei(k·r+ϕ) e−iωt (D.1)
and we separated the field into a phasor factor, involving only the spatial dependence,
and a harmonic time factor that was usually invariant.
In general a “phasor” is any function with a sinusoidal modulation, and in this
Appendix we examine some of the useful properties of phasors and their application
to circuit theory.
The product of two phasors is another phasor with a phase angle equal to the algebraic
sum of the two individual arguments of the exponential factors.
The addition of two phasors is less obvious. In many cases we are interested in
real sin and cos functions so let us examine the following sum of two real phasors,
V1 (t) = A sin ωt and V2 (t) = B sin(ωt + ϕ).
On intuition we might suppose that the sum of two sin functions differing only in
amplitude and relative phase would be another sin function. We posit therefore that
the form of the sum is
V3 (t) = C sin(ωt + δ) (D.4)
where C and δ are to be determined. Expanding V3 (t) in Eqs. D.3, D.4 and setting
coefficients of sin ωt and cos ωt equal, we find
236 Phasors
Fig. D.1 Relations among two phasors A and B which differ in amplitude and phase. Phase
of A leads phase of B by ϕ. The interior angle γ is related to ϕ by γ = π − ϕ.
−1 B sin ϕ
δ = tan (D.5)
A + B cos ϕ
1/2
C = A2 + B 2 + 2AB cos ϕ
(D.6)
The expression for C, the amplitude of the phasor sum, is highly reminiscent of the
result we would expect for the length of a vector C that is the vector sum of A and
B. In this case the length of the vector C would be determined by the law of cosines
as indicated in Fig. D.1. The interior angle γ between the two component vectors is
closely related to ϕ, the relative angle between the two phasors. In fact it is easy to
show that
γ =π−ϕ (D.7)
and substitution in Eq. D.6 results in
1/2
C = A2 + B 2 − 2AB cos γ
(D.8)
confirming that two phasors add with amplitude equivalent to the resultant “length”
from the the sum of two vectors.
where V, R, L, C have their usual meanings of voltage, resistance, inductance and ca-
pacitance and i(t) is the time-dependent current running through all the lumped circuit
Application of Phasors to Circuit Analysis 237
We see that the voltage drop across the resistor is in phase with the source, while the
inductive reactance, ωL shows a phase advance of π/2 and the capacitive reactance,
1/ωC lags in phase by π/2. The resistive voltage drop is purely dissipative while the
two reactances store energy originating at the source and return it to the circuit at
different points along the harmonic cycle. Equation D.9 can be rewritten in terms of
the impedance, Z(ω),
where XL and XC are the inductive and capacitive reactances in phasor form. We
can take this analysis further by considering the RLC circuit in series and in parallel.
Figure D.2 shows circuit and phasor diagrams for the two cases. In the series circuit
the property common to all lumped elements is the current i(t) so the relevant phasor
quantities are the voltages across the resistive, inductive and capacitive elements. For
the parallel circuit the common property is the voltage V (t) and the relevant phasor
diagram is expressed in terms of the individual currents passing through each circuit
element.
238 Phasors
Fig. D.2 (a) Series RLC circuit with harmonic voltage source and a common current i(t)
running through each circuit element. (b) Voltage phasor diagram showing the phase “lead”
for the capacitive voltage term, the phase “lag” for the inductive voltage term and the net
phase difference between the driving voltage and the resistive voltage drop. Panels (c) and
(d) show similar diagrams for current phasors in RLC parallel circuit.
E
Properties of the Laguerre Functions
The Laguerre polynomials and the “associated” Laguerre polynomials are solution to
Laguerre’s differential equation and are closely related to the radial solutions of the
Schrödinger equation for the hydrogen atom.
As we did with Legendre and Hermite polynomials, we find recurrence relations be-
tween adjacent members of the Laguerre polynomials by differentiating the generating
function with respect to z and x.
∂g(x, z)
(1 − z 2 ) = (1 − x − z)g(x, z) (E.2)
∂z
∂g(x, z)
(z − 1) = zg(x, z) (E.3)
∂x
Substituting Eq. E.1 into Eq. E.2, adjusting the summation indices, and equating terms
with equal powers of z n , results in the first recurrence relation
2n + 1 − x
Ln+1 (x) = Ln − nLn−1 (E.4)
n+1
It is often useful to have a recurrence relation involving the derivatives of the polyno-
mials, and we can obtain one by using Eqs. E.2 and E.3 to derive a third relation,
where the polynomials defined by the generating function, Eq. E.1 are already normal-
ized. As in the case of Hermite polynomials we can identify a closely related Laguerre
function ϕn (x) that has the orthonormal properties of a complete set of functions
involving the Laguerre polynomials,
The ordinary differential equation that ϕn (x) satisfies, however, is not the physically
relevant one with which we associate the Schrödinger radial equation for the hydrogen
atom. To find the solutions for that equation we need to examine the “associated”
Laguerre polynomials.
We can obtain a similar series representation for the associated Laguerre polynomials
by substituting Eq. E.11 into the definition, Eq. E.9
n
X (n + k)!
Lkn (x) = (−1)m xm k > −1 (E.12)
m=0
(n − m)!(k + m)!m!
Lk0 (x) = 1
Lk1 (x) = −x + k + 1
x2 (k + 2)(k + 1)
Lk2 (x) = − (k + 2)x +
2 2
Associated Laguerre Polynomials 241
from which recurrence relations can be derived by differentiation and equating of terms
with like powers in z in the usual way. The resulting recurrence relation is
(n + 1)Lkn+1 (x) = (2n + k + 1 − x)Lkn (x) − (n + k)Lkn−1 (x) (E.14)
The associated Laguerre polynomials are solutions to the associated Laguerre ordinary
differential equation, but this equation is not “self-adjoint” or hermitian; and therefore
cannot function as a legitimate quantum mechanical operator. However it can be cast
into a form suitable for quantum mechanics, by multiplying the associated Laguerre
polynomials by a factor, e−x/2 xk/2 Lkn (x). This factor transforms the polynomials into
the associated Laguerre functions.
ϕkn (x) = e−x/2 xk/2 Lkn (x) (E.15)
These associated Laguerre functions have the desired orthonormality
Z ∞
(n + k)!
e−x xk Lkn (x)Lkm (x) dx = δmn (E.16)
0 n!
and they satisfy the “quantum mechanically correct” ordinary differential equation
d2 ϕkn (x) dϕkn (x) x 2n + k + 1 k 2
x + + − + − ϕkn (x) = 0 (E.17)
dx2 dx 4 2 4x
This equation is “correct” in the sense that the operator integrals corresponding to
average values of observable variables and the eigenvalues themselves are guaranteed to
be real. But in fact Eq E.17 is still not the ordinary differential equations corresponding
to the form of the radial Schrödinger equation for the hydrogen atom. The problem is
that the Schrödinger equation does not exhibit a term with a first-derivative operator.
However, this term can be eliminated by multiplying the Laguerre function by a factor
x1/2
Φkn (x) = e−x/2 x(k+1)/2 Lkn (x) (E.18)
This family of modified Laguerre functions is a solution to the ordinary differential
equation
d2 Φkn (x) 1 2n + k + 1 k 2 − 1
+ − + − Φkn (x) = 0 (E.19)
dx2 4 2x 4x2
and has a normalization
Z ∞
2 (n + k)!
e−x xk+1 Lkn (x) dx =
(2n + k + 1) (E.20)
0 n!
Notice that in Eq. E.19 there is no term in the first derivative of x, and therefore this
second-order differential equation and its family of modified Laguerre functions can
serve as a prototype for the radial Schrödinger equation of the one-electron atom.
F
Properties of the Legendre functions
The Legendre functions are a family of solutions to the Legendre differential equation.
This equation determines the angular behavior of many physical problems including
the scalar Helmholtz wave equation in optics, Maxwell’s equations in classical electro-
dynamics, and the Schrödinger wave equation in quantum mechanics. We discuss here
the properties of these functions common to all these physical situations.
The Legendre equation itself is
The left side of this equation can be cast into a power series in tn such that left side
and right side, also a power series in tn can be equated term by term.
∞ [n/2] ∞
X X (2n − 2k)! X
(1 − 2xt + t2 )−1/2 = (−1)k (2x)n−2k) n
t = Pn (x)tn
n=0 k=0
22n−2k n!k!(n − k)! n=0
(F.5)
Recurrence Relations 243
where the upper limit on the sum, [n/2] means n/2 when n is even and (n − 1)/2 when
n is odd. Therefore we can obtain an expression for each Pn (x) in terms of a sum over
the index k.
[n/2]
X (2n − 2k)!
Pn (x) = (−1)k n xn−2k (F.6)
2 k!(n − k)!(n − 2k)!
k=0
Using Eq. F.6 function we find, for example,
1 5 3
3x2 − 1
P0 (x) = 1 P1 (x) = x P2 (x) = P3 (x) = x − 3x
2 2
Using Eq. F.4 we can write this last expression as sum of four terms, each of which is
a power series in t.
∞ ∞
X X
1 − 2xt + t2 nPn (x)tn−1 + (t − x) Pn (x)tn = 0 (F.8)
n=0 n=0
or
∞
X ∞
X ∞
X
nPn (x)tn−1 − x(2n + 1)Pn (x)tn + (n + 1)Pn (x)tn+1 = 0 (F.9)
n=0 n=0 n=0
Now we seek to rewrite the various terms so that they are all sums of a power series
in tn . We can do this by adjusting the indices n. Thus,
∞
X ∞
X
n−1
nPn (x)t is equivalent to (n + 1)Pn+1 tn (F.10)
n=0 n=0
and
∞
X ∞
X
(n + 1)Pn (x)tn+1 is equivalent to nPn−1 (x)tn (F.11)
n=0 n=0
and therefore we can write
(2n + 1)xPn (x) = (n + 1)Pn+1 (x) + nPn−1 (x) (F.12)
Equation F.12 is the recurrence relation for Legendre polynomials. It is a prescription
for expressing a given Pn (x) in terms of the adjacent polynomials in the series, Pn−1 (x)
244 Legendre Functions
and Pn+1 (x). For example, it is easy to remember the first two Legendre polynomials,
P0 (x) = 1, P1 = x. Using Eq. F.12 we find immediately P2 = 1/2(3x2 − 1). Then,
with P1 (x) and P2 (x) in hand we find P3 (x) = 1/2(5x3 − 3x). Thus,in principle any
member of the series may be found from the recurrence relation.
F.3 Parity
It appears by inspection that even-index Legendre polynomials are even and odd-index
polynomials are odd. In general the parity of the polynomials can be demonstrated by
recourse to the generating function, Eq. F.4.
which shows that in general even-index Legendre polynomials are even and odd-index
polynomials are odd in the x → −x parity operation.
and in fact they form a complete set, spanning the space of x from -1 to +1. The
normalization factor comes from the determination of the integral when m = n. We
start with the generating function, square it and write
" ∞
#2
X
(1 − 2tx + t2 )−1 = Pn (x)tn (F.16)
n=0
Then we integrate both sides and note that all the Pn Pm cross terms will vanish by
virtue of Eq. F.15.
Orthogonality and Normalization 245
Z 1 ∞ Z 1
dx X 2
= [Pn (x)] dx t2n (F.17)
−1 1 − 2tx + t2 n=0 −1
The integral on the left can be readily evaluated by making a change of variable,
y = 1 + 2t + t2 and dy = −2tdx. Making the substitution in Eq. F.17, together with
the change in limits x = −1 → y = (1 + t)2 and x = 1 → y = (1 − t)2 we have
Z 1 ∞ Z 1
dx 1 1+t X 2
2
= ln = [Pn (x)] dx t2n (F.18)
−1 1 − 2tx + t t 1−t n=0 −1
and therefore
∞ ∞ Z 1
X t2n X 2
2 = [Pn (x)] dx t2n (F.20)
n=0
2n + 1 n=0 −1
Finally, equating each term in the power series t2n on each side of Eq. F.20, we have
Z 1
2 2
[Pn (x)] dx = (F.21)
−1 2n + 1
The Hermite polynomials constitute the essential part of the solutions to the Schrödinger
equation for the one-dimensional harmonic oscillator. Their properties are the start-
ing point for interpreting vibrational motion and dipole transition selection rules in
diatomic molecules. The Hermite polynomials can be defined from the “Rodrigues
representation”
dn
Hn (x) = (−1)n exp x2 exp −x2
(G.1)
dxn
or by use of the generating function.
from which we can obtain the recurrence relations similarly to the procedure followed
for the Legendre polynomials in Complement F. First we take the partial derivative of
g(x, t) with respect to t,
∞ ∞
∂g(x, t) 2 X ntn X tn−1
= 2(x − t)e−t +2tx = Hn (x) = Hn (x) (G.3)
∂t n=0
n! n=0
(n − 1)!
Terms of equal powers of tn must be equal and the recursion relation is therefore
We can also get a recursion relation involving the derivative of the Hermite polynomials
by taking the partial derivative of the generating function with respect to x.
Orthogonality and Normalization 247
∞ ∞
∂g(x, t) 2 X tn X tn
= 2te−t +2tx = 2t Hn (x) = Hn0 (x) (G.6)
∂x n=0
n! n=0 n!
Then
∞ ∞ ∞
X tn+1 X tn X tn
2 Hn (x) = 2n Hn−1 (x) = Hn0 (x) (G.7)
n=0
n! n=0
n! n=0 n!
and
2nHn−1 (x) = Hn0 (x) (G.8)
We can get the first two members of the Hermite polynomials by expanding the gen-
erating function exponential in a Taylor series,
∞ n
2 X 2tx − t2
e−t +2tx = = 1 + (2tx − t2 ) + . . . (G.9)
n=0
n!
that results in H0 (x) = 1 and H1 (x) = 2x. Then from these and Eq. G.5, all the
polynomials up to arbitrary n can be calculated. The first few Hermite polynomials
are shown in Table G.1
G.2.2 Normalization
We find the normalization of the Hermite functions, Eq. G.12 by multiplying the gen-
erating function for Hm (x) with that for Hn (x), similar to the approach we take with
the squared generating function of the Legendre polynomials, Eq. F.16.
∞
X sm tn
exp(−s2 + 2sx) exp(−t2 + 2tx) = Hm (x)Hn (x) (G.18)
m,n=0
m!n!
Then multiply both sides by exp(−x2 ) and integrate over the interval from −∞ to
∞. Cross terms in the exp(−x2 )Hm (x)Hn (x) products drop out of the integral due to
orthogonality.
∞
(st)n ∞
X Z Z ∞
2 2
exp(−x ) [Hn (x)] dx = exp −(x − s − t)2 exp(2st) dx (G.19)
n=0
n!n! −∞ −∞
Now in the usual way we equate equal powers of (st)n that results in
Z ∞
2 √
exp(−x2 ) [Hn (x)] dx = 2n πn! (G.21)
−∞