Foundationsofphysics PDF
Foundationsofphysics PDF
OF
PHYSICS
LICENSE, DISCLAIMER OF LIABILITY, AND LIMITED WARRANTY
By purchasing or using this book (the “Work”), you agree that this license grants
permission to use the contents contained herein, but does not give you the
right of ownership to any of the textual content in the book or ownership to
any of the information or products contained in it. This license does not permit
uploading of the Work onto the Internet or on a network (of any kind) without
the written consent of the Publisher. Duplication or dissemination of any text,
code, simulations, images, etc. contained herein is limited to and subject to
licensing terms for the respective products, and permission must be obtained
from the Publisher or the owner of the content, etc., in order to reproduce or
network any portion of the textual material (in any media) that is contained in
the Work.
MERCURY LEARNING AND INFORM ATION (“MLI” or “the Publisher”) and anyone
involved in the creation, writing, or production of the companion disc,
accompanying algorithms, code, or computer programs (“the software”), and
any accompanying Web site or software of the Work, cannot and do not warrant
the performance or results that might be obtained by using the contents of the
Work. The author, developers, and the Publisher have used their best efforts
to insure the accuracy and functionality of the textual material and/or programs
contained in this package; we, however, make no warranty of any kind, express
or implied, regarding the performance of these contents or programs. The
Work is sold “as is” without warranty (except for defective materials used in
manufacturing the book or due to faulty workmanship).
The author, developers, and the publisher of any accompanying content, and
anyone involved in the composition, production, and manufacturing of this
work will not be liable for damages of any kind arising out of the use of (or the
inability to use) the algorithms, source code, computer programs, or textual
material contained in this publication. This includes, but is not limited to, loss
of revenue or profit, or other incidental, physical, or consequential damages
arising out of the use of this Work.
The sole remedy in the event of a claim of any kind is expressly limited to
replacement of the book, and only at the discretion of the Publisher. The use of
“implied warranty” and certain “exclusions” vary from state to state, and might
not apply to the purchaser of this product.
FOUNDATIONS
OF
PHYSICS
Steve Adams
Original title and copyright: Principles of Physics. Copyright ©2017 by The Pantaneto Press. All rights
reserved. Published by The Pantaneto Press.
This publication, portions of it, or any accompanying software may not be reproduced in any
way, stored in a retrieval system of any type, or transmitted by any means, media, electronic display
or mechanical display, including, but not limited to, photocopy, recording, Internet postings, or
scanning, without prior permission in writing from the publisher.
The publisher recognizes and respects all marks used by companies, manufacturers, and developers as
a means to distinguish their products. All brand names and product names mentioned in this book are
trademarks or service marks of their respective companies. Any omission or misuse (of any kind) of service
marks or trademarks, etc. is not an attempt to infringe on the property of others.
Library of Congress Control Number: 2019937248
192021321 This book is printed on acid-free paper in the United States of America.
Our titles are available for adoption, license, or bulk purchase by institutions, corporations, etc.
For additional information, please contact the Customer Service Dept. at 800-232-0223(toll free).
All of our titles are available in digital format at authorcloudware.com and other digital vendors.
The sole obligation of MERCURY LEARNING AND INFORMATION to the purchaser is to replace the book,
based on defective materials or faulty workmanship, but not based on the operation or functionality
of the product.
For Alison
CONTENTS
Preface xxv
4.4.3 Couples 60
4.4.4 Principle of Moments 61
4.5 Stability 63
4.5.1 Types of Mechanical Equilibrium 63
4.5.2 Degrees of Stability 65
4.6 Frictional Forces 66
4.6.1 Origin of Frictional ForcesBetween Surfaces in Contact 66
4.6.2 Static and Dynamic (Kinetic) Friction 67
4.6.3 Coefficients of Friction 67
4.6.4 Measuring the Coefficient of Static Friction 68
4.6.5 Measuring the Coefficient of Dynamic (Kinetic) Friction 69
4.7 Exercises 69
Glossary 683
Index 709
PREFACE
The aim of this book is to draw on the essential physical principles that typical
physics courses use to provide a strong conceptual base for the further study of
more advanced topics. As such this book provides support for both introductory
courses (calculus-based) and for readers interested in a basic review of key topics
in physics. It will also be a useful reference work for instructors.
The focus is on physical principles. Applications are used to exemplify the
physics but do not divert attention from the underlying concepts. Mathematics
is the language of physics and a mathematical approach is taken throughout,
drawing mathematical techniques including basic calculus. The approach here
acknowledges this and helps to secure a foundation of relevant mathematical skills
in the context of real physical problems.
Practical techniques, including the collection, presentation, analysis, and
evaluation of data, are discussed in the context of key experiments linked to the
theoretical spine of the work. There are also sections on testing mathematical
relationships, the analysis of uncertainties, and how to approach, carry out, and
write-up experimental investigations.
Every chapter concludes with a set of exercises and an appendix on Fermi
problems provides an open-ended challenge that allows the reader to practice
their skills in unfamiliar contexts.
1
THE LANGUAGE OF PHYSICS
1.0 Introduction
NASA’s Mars climate orbiter was launched in 1998 and should have gone into
orbit around Mars 286 days later. Instead it fell too close to the planet and
broke up in the atmosphere. The mission had cost upwards of $100,000,000.
Why did this happen? Because Lockheed Martin, who was calculating the
thrust to maneuver the spacecraft, used English units (pound-seconds),
while NASA, which controlled the thrusters, was expecting metric units
(newton-seconds). Units matter!
You will notice that the meter is actually defined in terms of the speed of
light. This has been the case since 1983 when the speed of light, which
had been measured with ever-increasing precision, was defined to have the
value 299,792,458 ms1.
Derived units
Many physical quantities are measured in derived units. These are
combinations of base units such as m3 (for volume) or kgm3 (for density).
Some common derived units that are given their own names are shown in
the table.
Derived quantity Name Symbol
S.I.-derived units S.I. base units
Force Newton N kgms2
Pressure Pascal Pa kgm1s2
Energy Joule J kgm2s2
Power Watt W kgm2s3
When you come across a new physical quantity that is unfamiliar, you can
work out its S.I. units by making sure that an equation containing the new
quantity is balanced. All equations in physics must balance in terms of units.
Here are two examples.
Energy
Any equation involving energy will suffice, e.g., E ½ mv2. The units of
the right-hand side are kgm2s2, so these must be the units of energy. The
name “joule” is a convenient alternative for such a common physical unit, so
1 J 1 kgm2s2.
THE LANGUAGE OF PHYSICS • 3
Viscosity
There is an equation for viscous drag on a sphere moving through a fluid
called “Stoke’s law.” This has the form F 6rv where r and v are the
radius and velocity of the sphere, respectively. is the viscosity of the fluid.
What are the S.I. units of viscosity? First of all, rearrange the equation to
give F/6rv and then balance the units (6 has no units; it is simply a
number). The right-hand side has units of Nm2s or Pas (this is because
1 Nm2 1 Pa). This can be reduced to base units by substituting for N
( kgms2). The base units for viscosity are, therefore, kgm1s1.
1.2 Dimensions
All mechanical quantities depend on mass (M), length (L), and time (T).
These are the fundamental dimensions of mechanics and are measured in
terms of the base units of kg, m, and s. All equations in physics must balance
in terms of numerical value, units, and dimensions. When they do, they
are said to be homogeneous. This can be very useful for checking your
working during a derivation (if you made a mistake in the algebra, then the
dimensions might not balance) and for testing proposed equations to see
if they are viable. It can also be used to construct possible equations if you
have an idea of the relevant parameters.
To indicate that we are dealing with dimensions, we use square brackets
so that [E] means “the dimensions of” energy and [v] means “the dimensions
of” velocity. The fundamental dimensions are related in a similar way to
base units, so it is possible to work them out by balancing simple equations.
For example, if we want to find the dimensions for energy, we can use the
equation E mgh. The dimensions of E will be the same as the dimensions
of mgh:
[E] [m][g][h] MLT 2 L ML2T 2
If the dimensions are now replaced by base units, we see that the S.I. unit
of energy is kgm2s2 as before.
Five basic dimensions can be used to express most quantities in physics.
Dimension Mass Length Time Current Temperature
Symbol M L T I
Dimensionless numbers play an important role in many areas of physics.
These are combinations of physical quantities where the dimensions cancel
so that the result is a pure number. These often have (or seem to have) great
4 • FOUNDATIONS OF PHYSICS
These go up (or down) in multiples of 103, but there are some prefixes
that are in common use that do not fit this pattern: “centi-” multiples by
102 and “deci-” multiplies by 101. In chemistry it is quite common to state
volumes in decimeters-cubed. 1 decimeter is 10 cm or 0.10 m, so this is a
volume of 1000 cm3 or 1 liter. In S.I. base units, it is 0.001 m3. Beware!
When presenting data it is important to use an appropriate number
of significant figures, even if some of these are zeroes. The number of
significant figures used represents the precision of the data, so a length
of 1.20 m is more precise than 1.2 m. In principle, 1.2 m could have been
rounded from anything between 1.150 m and 1.249 m, with an uncertainty
of 0.05, whereas a length of 1.20 m must really lie between 1.195 m and
1.205 m, with an uncertainty of 0.005. Quoting the third significant figure
has increased the precision by a factor of 10.
However, it is also important not to quote data to too many significant
figures, if this is not justified by the measurements that were used to obtain
the data. For example, if you were trying to calculate the density of a block
of wood and had measured a mass of 40.5 g and a volume of 24.2 cm3,
the value of density is 40.5/24.2 1.673553719 gcm3. The result must be
rounded off so that it is consistent with the data used to calculate it. In this
case, data was given to three significant figures, so an appropriate result is
1.67 gcm3.
Q A useful rule of thumb: Quote calculated values to the same number
of significant figures as the least precise piece of data used in the
calculation.
1.4 Uncertainties
In 2012, the discovery of the Higgs boson was announced at the Large
Hadron Collider at CERN. Physicists had tuned the collider to the energy
range in which the particle was expected to be found and sure enough
it turned up! This allowed physicists to measure the mass of the Higgs:
125.090.21 GeV/c2 (the GeV/c2 is a convenient mass unit used in particle
physics). The quoted uncertainty is < 0.2% of the mass. This pins the mass of
the Higgs into a small enough range so that its properties can be compared
with theoretical predictions.
If the uncertainty in a measured value is too large, it is not very
useful. For example, if you were asked to measure the acceleration due to
gravity in a laboratory and you got a value of 9.8 ms2, you might be quite
THE LANGUAGE OF PHYSICS • 7
happy and feel that you had done a good job. However, if the uncertainty
associated with that value was 1.0 ms2 then the acceleration due to gravity
as measured in your experiment could lie anywhere between 8.8 and
10.8 ms2, so the fact that the actual value turns out to be close to the true
value was probably just by chance. If you repeated the experiment in the
same way, you would probably get a very different value.
Q Whenever you calculate a value from experimental data, you should
include the estimated uncertainty. This is a measure of the reliability of
the measured value.
While a detailed analysis of uncertainties and their effect on calculated
values involves a lot of statistics, there are some simple methods that will
give a reasonable estimate and that can be used quite easily.
While this method can be used, there are also some simple ways in
which uncertainties can be combined mathematically. Here are some
simple rules.
Combination Rule
Uncertainty in a sum: y a b Add absolute uncertainties y a b
Uncertainty in a product: y ab Add fractional uncertainties y a b
y a b
Uncertainty in a quotient: y a/b Add fractional uncertainties y a b
y a b
Uncertainty in a power: y an Multiply fractional uncertainty y a
by power n
y a
Using the data from the example above (acceleration of free fall) and
applying the rules from the table, we have:
g 2s/t2 (2 × 2.500)/(0.710)2 9.92 ms2
g s t 0.005 0.020
2 2 0.058
g s t 2.500 0.710
g 0.58 ms2
This gives a result of g 9.92 0.58 ms−2 (almost identical to the first
method).
Random errors
If a quantity is measured repeatedly, the results are likely to vary as a result
of measurement errors. If these errors are random, the results will be
scattered above and below the actual value of the quantity being measured.
Q The best way to reduce the impact of random errors is to use a large
number of repeats and to take an average.
Repeated measurements are also helpful for estimating experimental
uncertainties. A useful estimate is half the range of the data (but it is
important to eliminate any anomalous results before calculating the range).
Systematic errors
A systematic error is one that introduces a consistent bias to all of the
measured data, usually making it all too big or too small. Taking an average
will not help to reduce the impact of a systematic error. However, if a
systematic error is known, then it can be corrected for by adjusting the
measurements. For example, if there is a zero error, this must be subtracted
from all subsequent measurements.
If a systematic error is present but unidentified, the final result might
seem very precise and yet disagree with an accepted value. It is always
important to compare your results with accepted values (where these exist).
If you have carried out a good experiment, your value with its uncertainties
should overlap the accepted value for the quantity measured.
Charge
Gradient represents
current at this instant:
I = dQ/dt =gradient
Time
Some common derivatives (rates of change) that often occur in physics are
listed in the table below.
THE LANGUAGE OF PHYSICS • 11
Function (y)
dy
Derivative (rate of change) dx
Second derivatives
A derivative is a rate of change, so the derivative of displacement with
respect to time is velocity and the derivative of velocity with respect to time
is acceleration. This means that acceleration is the rate of change of the rate
of change of displacement.
dv d ds
a
dt dt dt
This is called the second derivative of displacement and is written as:
d2 s
a
dt 2
Whereas first derivatives are equal to gradients, second derivatives are
related to the sharpness of curvature of a graph (the rate of change of the
gradient). Acceleration is related to the sharpness of curvature of a graph of
displacement against time.
The maximum power Pmax occurs at some resistance Rpeak. This is where
the curve has a gradient of zero. However, gradients are equal to derivatives,
so we can find the maximum value if we can find where the gradient is zero.
To do this we need an equation for power in terms of the resistance, P(R).
It turns out that this is easy to find (see Section 18.5.1). Once we have the
equation, we differentiate it, set it equal to zero, and solve the resulting
equation to find the value of R.
Power
Pmax
Resistance
Rpeak
dP(R)
0
dR
To find a maximum or minimum of some function y(x):
Q Differentiate to find the derivative dy
dx
dy
Q Set the derivative equal to zero, 0
dx
Q Solve for x
Calculus can be used to show the nature of the stationary point:
2
Q Find d y
dx2
2
Q If d y is positive, the gradient is increasing: minimum position
dx2
2
Q If d y is negative, the gradient is decreasing: maximum position
dx2
2
Q If d y is zero, the position is a saddle point, neither a maximum nor a
dx2
minimum
THE LANGUAGE OF PHYSICS • 13
y Maximum: gradient
decreasing through zero
d2y
negative
dx2
Minimum: gradient
increasing through zero
d2y
positive
dx2
Saddlepoint
gradient stationary
d2y
zero
dx2
x
s v(t)t
t1
THE LANGUAGE OF PHYSICS • 15
Velocity
δt
v(t)
Time
t 1
t t
2
s lim
t0
v(t)t vt dt
t1 t1
The table below gives some derivatives and related integrals that are used
in this book.
Context Differential form Integral form
Dynamics
v
ds
dt
s vdt
Dynamics dv
a
dt
v adt
Newton’s laws
F
d(mv)
dt
mv mu Fdt
Electric circuits
I
dQ
Q Idt
dt
Radioactivity dN
dt
dN
N
dt
Capacitors dQ Q dQ dt
dt
RC Q RC
resulting body has a mass of 10 kg. Scalars do not have a direction but can
have a sign, and this must be taken into account when they are added.
Vectors are physical quantities that have magnitude and direction, e.g.,
displacement, velocity, force, momentum. Vectors can be represented by
arrows; the length of the arrow represents magnitude, and the direction of
the arrow is the direction of the vector. They are often distinguished from
scalar quantities by underlining them, e.g., v is a velocity vector and v is its
magnitude.
10 m 10 m 10 m
14 m + =
14 m
A sin θ A
θ
x
A cos θ
Scalar product
The work done by a force is the product of the force and the displacement
of the point of action of the force parallel to the force. Since both force and
displacement are vectors but work is a scalar, this is an example where the
product of two vectors results in a scalar. For this reason it is called a scalar
product (sometimes referred to as a “dot product”).
The magnitude of a scalar product is equal to the product of the
magnitudes of the two vectors multiplied by the cosine of the angle between
them.
18 • FOUNDATIONS OF PHYSICS
F
θ Scalar product = F . s = Fs cos θ
s
Vector product
The force on a moving charged particle in a magnetic field has a magnitude
given by F qvB sin where B is the magnetic field strength, v is the velocity
of the particle, and is the angle between v and B. Since force, magnetic
field strength, and velocity are all vectors, this is an example where the
product of two vectors results in another vector. For this reason it is called
the vector product (sometimes referred to as the “cross-product”). The
direction of the resulting force is found by using a right-hand rule and is
perpendicular to both of the vectors v and B.
A vector product is written as: vector product of v and B v^B
The force on a moving charge in a magnetic field is therefore
F q (v^B)
The magnitude of the vector product is equal to the product of the
magnitudes of the two vectors multiplied by the sine of the angle between
them.
The direction of the vector product is perpendicular to the plane defined
by the two vectors being multiplied together. To work out the direction of
the product vector, imagine rotating from the first vector to the second
vector and then the resultant is in the direction of movement of a right-
handed screw! The vector product is zero if the vectors are parallel ( 0).
v^B
Direction of resultant
B B
v v
THE LANGUAGE OF PHYSICS • 19
The shape of the triangle is invariant under these rotations and reflections.
In Newtonian mechanics the laws of physics are the same in all uniformly
moving (inertial) reference frames, so Newton’s laws are invariant under a
change of velocity. Einstein’s theory of special relativity goes further and
includes all of the laws of physics (see Chapter 24). Hermann Minkowski
realized that Einstein’s equations for relativity were similar to those for a
geometrical rotation and identified physical quantities that are the same
for all inertial observers—these are four-dimensional quantities called
invariants and are constructed from space and time components.
Emmy Noether showed that symmetry principles are linked to
conservation laws. This is not really surprising because a conservation law
identifies some quantity that stays the same (is invariant) when other things
change. For example, the total linear momentum of a collection of colliding
bodies is the same before and after the collisions, and the total energy of
20 • FOUNDATIONS OF PHYSICS
the universe is the same before and after an explosion. Noether showed
that conservation of momentum is linked to the laws of physics remaining
the same under translation; conservation of angular momentum is linked
to the laws staying the same under rotation; and the law of conservation of
energy is linked to the laws staying the same at all times. This link between
mathematical symmetries and conservation laws is a powerful idea in
theoretical physics.
In particle physics, the search for symmetry in mathematical equations
can lead to the prediction and discovery of new types of particle. Paul Dirac
realized that the equation for the rest energy of an electron is a quadratic
equation, one that has both positive and negative solutions:
E2 p2 c2 m02 c 4
When p 0 we can write
E2 m02 c 4
which has the solutions
E m0 c2
The positive solutions correspond to normal electrons. By taking the negative
energy solutions seriously, he predicted the existence of anti-electrons or
positrons. The symmetry between matter and antimatter is now built into
the Standard model and is linked to other fundamental symmetries such as
time reversal. There is even a sense in which anti-matter can be regarded
as ordinary matter traveling backward in time!
The time reversal symmetry in particle physics is one of three symmetries
that are fundamentally linked:
T: Time reversal symmetry—changing the sign of the time coordinate
P: Parity symmetry—changing the sign of all three spatial coordinates
C: Charge (charge conjugation) symmetry—changing the signs of all of
the charges
It turns out that none of these symmetries are observed in all interactions,
but if C, P, and T are all applied, the symmetry is observed. No violation of
CPT symmetry has ever been seen.
The Standard model also contains some apparent symmetries that have
not been explained. There are three pairs of leptons and three pairs of
quarks (see Section 26.4.5) but no known physical process that links them
THE LANGUAGE OF PHYSICS • 21
or that will allow the transformation of quarks into leptons or vice versa.
Attempts to construct a theory that would allow this has led physicists to
predict the existence of additional “super-symmetric” particles for all the
existing particles. So far none have been discovered.
Symmetry principles have also been used to investigate alternative
approaches to physics. Richard Feynman and John Wheeler, for example,
realized that Maxwell’s equations for electromagnetism could be solved both
forwards and backwards in time. This sounds odd, but by including both
solutions they were able to construct a version of the theory that worked
just as well as the original one and did not neglect half of the solutions.
However, the interpretation of the so-called “absorber theory” requires us
to take the idea of future events influencing the present seriously. Perhaps
we should.
1.11 Exercises
1. Express:
(a) 267 g in kg, (b) 25 km in mm, (c) 5.0 m3 in cm3,
(d) 80 km/h in m/s, (e) 45 cm2 in m2.
2. Light travels at 3.0 × 108 m/s and it takes about 8 min for light to travel
from the Sun to the Earth.
(a) How far away is the Sun in km?
(b) How far is a light year? Give your answer in meters.
(c) The distance from the Earth to the Moon is 380,000 km. How far is
this in light-seconds?
2
REPRESENTING AND
ANALYZING DATA
2.0 Introduction
Experiments generate data and it is important to know how to select,
record, and process this data in order to find out what the experiment
has revealed. This is a huge problem for large experiments. According to
the CERN website, the four experimental stations on the Large Hadron
Collider (LHC) each generate between 750 Mbs1 and 4 Gbs1 of data when
the accelerator is operating! The task of selecting and analyzing relevant
data is carried out by the Worldwide LHC Computing Grid, which might
reject the majority of results in order to focus on those that could reveal
interesting new physics. Writing the algorithms for this system is extremely
challenging.
When you carry out experiments in a laboratory, you will not face such a
daunting task. However, presenting your data clearly and to an appropriate
precision—rejecting anomalous results, deciding how to process it, and
then extracting information from the processed data—is an essential part
of physics.
experiment to be a fair test, you must keep all other parameters that might
affect the dependent variable constant; these are called control variables.
For example, imagine you are investigating how the acceleration of a
trolley along a runway depends on the force applied to it. The dependent
variable is the acceleration, which might be measured using light gates.
The independent variable is the applied force, which might be varied
using falling masses attached to the trolley via a pulley. There are several
control variables, all of which must be kept constant. The obvious one is the
mass of the trolley. If this was not constant, then changes in acceleration
might be because of changes in mass rather than force, so the results of
the experiment would not be clear. Another control variable is the angle of
the runway; it should be horizontal in each trial. It would also be important
to control frictional forces (e.g., by using the same trolley and runway and
measuring the acceleration over the same distance on the same part of the
runway).
Q Independent variable: the one you deliberately vary
Q Dependent variable: the one that changes in response to the change in
the independent variable
Q Control variables: the ones that you have to keep constant so they do not
also affect the dependent variable
When you display results on a graph it is conventional for the dependent
variable to go on the y-axis and the independent variable to go on the x-axis.
Dependent
variable
Indendent
variable
Much of the analysis described above is done “behind the scenes,” but
when you need to present your data in a report, it is important to make
sure that you include the correct table headings (and units) and that you
round data to an appropriate number of significant figures. You should be
in charge of the format; don’t let the spreadsheet present its default style!
x x x
Straight line through Straight line with Straight line with
the origin: y is directly positive intercept y negative intercept y
proportional to x increases linearly decreases linearly
with x with x
It is also important to realize that a linear graph with negative gradient does
not imply inverse proportion.
Intercept Gradient = m
at y = c
0 x
0
If there is a linear relationship between two physical variables x and y, then
a graph of y against x can be used to find m and c and hence determine an
equation for this relationship. If c 0 then y is directly proportional to x.
100 100
0 90
0 0
28 • FOUNDATIONS OF PHYSICS
Q If the origin (0, 0) is included on the graph, then mark it by a zero on the
scale on each axis.
Q Mark values onto each axis in equal simple intervals, e.g., 0.0, 10.0, 20.0,
30.0 … or 0.02, 0.04, 0.06… Avoid unusual or awkward intervals such
as 3, 5, 7….
Q Label both axes with the physical quantity and unit in the same way as if
you were putting in a table heading, e.g., velocity/ms1.
Q Mark each data point carefully with a cross or a point surrounded by a
circle using a sharp pencil. If you are plotting on millimeter graph paper,
the points need to be within half a millimeter of the correct value.
Q Look carefully at the pattern of the data and decide whether any of
the points are anomalies. This means that they seem “out of place”
compared to the others. If possible these data points should be checked
by repeating the experiment. If not they should be labeled and ignored
when drawing a best-fit line.
Q Look at the pattern of data and decide whether it is best represented
by a straight line or a smooth curve. Remember, this is experimental
data, so each point has a degree of uncertainty—this means that the
line does not have to pass through the points; it is there to represent the
relationship revealed by the data.
Q If the data is best represented by a straight line, then use a ruler to draw
it, trying to balance approximately equal numbers of points on either
side of the line. It is also possible to find the gradient and y-intercept of
a linear graph directly from the data by using an algebraic method called
linear regression (omitting data from anomalous points).
Q If the data is best represented by a curve, then draw a smooth curve,
again trying to balance the distribution of points on either side of the
curve.
25
(6.0, 20.2)
20
Extension /cm
15
10
(1.6, 5.0)
5
0
0 1 2 3 4 5 6 7
Load / N
(20.2 5.0)
gradient of line 3.5 cmN1
(6.0 1.6)
2.4.3 Using a Spreadsheet Program
Whether you are plotting a graph by hand or using computer software, your
aims are the same—to represent the relationship between two variables
accurately and clearly and to extract any useful physical information or
values from the graph as accurately and precisely as possible. The great
advantage of a spreadsheet program is that once data has been stored it
can be used to generate a graph at the click of a button. However, this can
30 • FOUNDATIONS OF PHYSICS
also lead to problems. The default settings of the program will determine
how the graph appears, and this may not be the best way to display this
particular dataset. The ease and rapidity of creating a graph can also mean
you forget to do basic things such as ensuring the axes are labeled (or labeled
correctly), having sensible scales with appropriate numbers of significant
figures, etc.
Here are some questions to ask yourself when using a spreadsheet to
produce a graph and analyze your data:
Q Could I do a better job by plotting the graph by hand?
Q Do I want the program to fit a line or curve to this data, or shall I print
out the graph once the points have been plotted and then draw the line
by hand?
Q Does the graph have an appropriate heading?
Q Do I want to include gridlines and if so with what divisions?
Q Are the axes correctly labeled with quantity and unit?
Q Are the scales marked correctly and are they easy to read?
Q Are values shown with the correct number of significant figures?
Q Do I need to use a false origin?
Q Should the line be extrapolated back or forward?
Q Should the line go through the origin?
Q What kind of fit do I want to use (e.g., linear, exponential, polynomial,
etc.)?
Q Do I want to include error bars?
Q Do I want the program to display a formula and if so how should it be
formatted?
Q How large should the graph appear in my report (avoid tiny graphs!).
Remember, you—not the computer—should be in control!
(7.0, 24.5)
25
20
AL
W
15
Extension /cm
10
5
(0.6, 1.2)
0
-1 0 1 2 3 4 5 6 7 8
-5
Load / N
(24.5 1.2)
gradient of worst acceptable line = 3.6 cmN1
(7.0 0.6)
Using this value and the gradient of the best fit line gives
gradient 3.5 0.1 cmN1
32 • FOUNDATIONS OF PHYSICS
2.5 Logarithms
2.5.1 Logarithmic Scales and Logarithms
Many quantities in physics have values that spread over an extremely wide
range, so it is often convenient to represent them using a scale that increases
in multiples rather than equal amounts. Such a scale is called logarithmic,
e.g., 1, 10, 100, 1000…; 1, 2, 4, 8, 16…
Each step raises the power of some base quantity by 1:100, 101, 102,
10 …; 20, 21, 23, 24….
3
The logarithm of any number to a particular base is the power that the
base must be raised to in order to get the number. For example,
Base 10: logarithm to base 10 of 1000 3 or log10 (1000) 3
Base 2: logarithm to base 2 of 8 3 or log2 (8) 3
Another common base is the number e. Logarithms to base e are called
“natural logarithms” and are written using the prefix “ln.”
Base e: logarithm to base e of 10 2.3026 or ln (10) 2.3026
The values of logarithms to base 10 and of natural logarithms can be
found directly with your calculator.
If you are working with logarithms you will also need to be able find
anti-logarithms or inverse-logarithms. For example, if you know that the
logarithm to base 10 of some physical quantity is 5, you can find the value
of the physical quantity by raising the base to the power 5. For example,
if log10 (x) 5 then x 10 log10 ( x) 10 5 100 000
if log10 (x) 2.3 then x 10 log10 ( x) 102.3 199.5
Antilogarithms (or inverse-logarithms) can be found directly with your
calculator.
This relationships is useful when analyzing data for variables that are
linked by a power law (if the base is omitted, it is assumed that everything
is to base 10):
If y Axn then log(y) log (A) n log (x)
A graph of log (y) against log (x) (called a “log-log graph”)will be linear with
gradient n and intercept log (A).
Exponential relationships can also be analyzed using natural logarithms:
If y Aebx then ln (y) ln (A) bx
A graph of ln (y) against x (called a “log-lin graph”) will be linear with
gradient b and intercept ln (A).
2.7 Exercises
(b) Find an equation to relate pressure to volume and calculate the value
of any constants in this equation (stating their units and the associated
uncertainty).
5. The table below gives some data for the planets in the solar system.
Planet Av. distance from the Sun Orbit time, days (d)/ years (y)
(million km)
Mercury 58 88 d
Venus 108 ?
Earth 150 365 d
Mars 228 687 d
Jupiter 778 11.9 y
Saturn 1430 29.5 y
Uranus 2870 84 y
Neptune 4500 165 y
Kepler proposed that the orbital period of the planets T is linked to their
mean radius of orbit r by an equation of the form: T rn
(a) Use a graphical method to verify that such a power law is valid and
determine the value of n.
(b) Use your graph (or the equation) to predict the orbital period of
Venus (check online to see if your value is acceptable).
6. When a capacitor discharges through a resistor, the discharge current I
falls exponentially according to an equation of the form:
t
I I0 e
where I0 is the initial value of the current and is the “time constant” for
the decay (in seconds).
The table below shows how current falls with time when a particular
capacitor is discharged through a resistor.
Current/mA 48 38 30 24 19 15 12 9 8 6 5
Time/s 0 10 20 30 40 50 60 70 80 90 100
3
CAPTURING, DISPLAYING, AND
ANALYZING MOTION
3.0 Introduction
Kinematics is the study of motion. Dynamics is the study of how forces
affect motion. In this chapter, we focus on how motion is described in space
and time, and how we can capture data from moving objects in order to
display their motion graphically. In Chapter 5, we will see how Newton’s
laws describe how motion is affected by unbalanced or resultant forces.
s
average velocity =
t
Where s is the change in displacement during a time interval t. If the
time interval is small (but finite), this might be written as:
s
average velocity=
t
To obtain the instantaneous velocity we would need to take the ratio of
s to t in the limit that t0. This is the same as taking the derivative of
displacement with respect to time:
s ds
instantaneous velocity lim
t0 t dt
ds
or v
dt
The table below lists symbols used to describe changes and rates of
change.
x A change in x
x A small change in x
d(x)/dt The rate of change of x
Average and instantaneous acceleration are defined in a similar way:
v
average acceleration =
t
dv
instantaneous acceleration =
dt
The relationships between the three graphs are summarized in the diagram
below, which shows graphs of motion for an object that starts from rest and
accelerates at a constant rate.
CAPTURING, DISPLAYING, AND ANALYZING MOTION • 39
Displacement
Differentiate:
v = ds/dt
Velocity is gradient
of displacement
time graph
Time
Velocity
Differentiate: Integrate:
a = dv/dt s = ∫ vdt
Acceleration is Displacement is area
gradient of velocity under a velocity time
time graph graph
Acceleration Time
Integrate:
v = ∫ adt
velocity is area under
an acceleration
graph
Time
B (v − u)
which is equivalent to the integral: ds (u at)dt (a is constant)
0 0
1
which gives s ut at 2 (suvat equation 3)
2
The other equations can be derived from these by a series of substitutions.
These equations are valid for motion at constant acceleration.
25
Vertical position/m 20
15
10
0
0 5 10 15 20
Horizontal position/m
Each data point represents the position of the projectile at 0.10 s intervals.
Notice that the horizontal displacement in each 0.10 s interval is always the
same (0.80 m).
u
1
0.8
0.6
0.4
0.2
θ
0
0 1 2 3 4 5 6
Horizontal displacement
Initial vertical component of velocity u sin
Final vertical component of velocity u sin (the motion is symmetric up
and down)
Vertical acceleration g
The displacement when the projectile returns to the surface is s 0
Using suvat equation 1 from Section 3.2:
u sin u sin gt
2 u sin
So the time of flight is: t
g
2 u2 sin cos u2 sin 2
and the range is: R
g g
This has a maximum value when sin 2 1 suggesting that maximum range
will be achieved when the projectile is launched at an angle of 45° to the
horizontal (in the absence of friction).
To computer
Reflector
Ultrasound
pulses Moving trolley
Datalogger
Ultrasound
transducer
CAPTURING, DISPLAYING, AND ANALYZING MOTION • 45
To datalogger
Light gate
Light gate Light gate A Light gate B
Card
Light beam
46 • FOUNDATIONS OF PHYSICS
If two light gates (A and B) are used, the following measurements can be
made and recorded:
Q Time at A and time at B (from starting the experiment)
Q Time to move between A and B
Q Velocity at A and velocity at B
Q Acceleration from A to B
Accelerometer sensor
This detects the real-time acceleration of the mobile phone along three
perpendicular axes. The accelerometer samples the acceleration (a sampling
interval of 20 ms is typical), and the captured data can be exported to a
spreadsheet program such as Excel. When used with a suitable app, tables
of data and graphs for acceleration, velocity, and displacement against time
can be displayed.
The acceleration due to gravity can be measured by dropping the phone
(but make sure it has a soft landing!), and the acceleration of a dynamic
trolley can be measured by simply attaching the phone to the trolley. The
portability of the phone makes it ideal for investigating acceleration in real-
world situations such as cars or trains.
Video capture
The high frame rates available on some cameras are ideal for investigating
short-lived, transient phenomena such as the impact of a water droplet
with the surface of water or the formation of a crater when a ball bearing
falls into sand. The cameras in mobile phones typically offer 240 frames
per second for slow-motion playback, but more sophisticated cameras can
record video at several thousand frames per second.
CAPTURING, DISPLAYING, AND ANALYZING MOTION • 47
3.7 Exercises
1. A ball is thrown vertically upwards at 4.0 ms1 from an initial position 1.2 m
above the ground on Earth and is allowed to fall down to the ground.
Sketch graphs to show:
(a) displacement versus time
(b) velocity versus time
(c) acceleration versus time.
Take upwards as positive and label your axes with appropriate values.
2. An experiment performed on the Moon finds that a feather falls 20.75 m
from rest in 5 s.
(a) Calculate its speed as it hits the Moon’s surface.
(b) Calculate the acceleration of free fall at the Moon’s surface.
3. A sprinter reaches his maximum velocity of 12 ms1 after running
30 m from rest. How long does this take him and what is his average
acceleration (assume acceleration is constant).
4. A car is moving at 50 kmh1, what is this speed in ms1?
(a) When the brakes are applied the car can stop in 4.0 s;what is its
braking distance?
(b) A cat runs out in front of your car when you are traveling at 50 kmh1
and your reaction time is 0.6 s. What is the total distance the car will
travel before stopping (i.e., its “stopping distance”)?
5. A stone is dropped into a deep well. It strikes the surface of the water at
the bottom of the well 2.2 s after its release. How deep is the well?
6. A boat slows down from 6.0 to 4.0 ms1. During the deceleration it
travels 150 m. Calculate the average deceleration.
7. Two dragsters line up for a 500 m race. Car A accelerates at 4.0 ms2 and
then maintains a constant maximum speed of 50 ms1. Car B accelerates
at 5.0 ms2 and then maintains its maximum speed of 45 ms1. Which car
wins the race and by how much?
8. A cricket ball is bowled horizontally at a speed of 20 ms1 from a height
of 2.5 m above the ground.
(a) How far from the bowler does it first hit the ground?
(b) What is the maximum distance from the cricketer that the ball hits
the ground if he throws the ball upwards at the same initial speed at
an angle of 45° to the horizontal? Ignore air resistance.
CAPTURING, DISPLAYING, AND ANALYZING MOTION • 49
4
FORCES AND EQUILIBRIUM
Slope with
rough surface
θ
W: weight of block
52 • FOUNDATIONS OF PHYSICS
θ x
W
Here is an example. Three forces act in the same plane on a single object.
We need to find the resultant force.
6.0 N
20°
9.0 N
120°
8.0 N
FORCES AND EQUILIBRIUM • 53
Resu
ltant
force
8.0 N
6.0 N
9.0 N
8.0 N
20° 9.0 N x
120°
8.0 N
Resolving parallel to the x-axis: Fx 9.0 6.0 cos (20°) 8.0 sin (30°)
10.64 N
Resolving parallel to the y-axis: Fy 6.0 sin (20°) 8.0 cos (30°) 4.88 N
54 • FOUNDATIONS OF PHYSICS
10.64 N x
φ
4.88 N Resul
tant for
ce
4.2.2 Weight
Weight is the gravitational force acting on a body and is a vector quantity.
The weight of a body depends on the strength of the gravitational field in
which it is placed, given by the formula:
W mg
where m is the mass in kilograms and g is the gravitational field strength
measured in Nkg1.
The gravitational field strength near the surface of the Earth is on
average about 9.81 Nkg1 (standard gravity is 9.80665 Nkg1) but varies by
about 0.7% at different locations (from about 9.76 Nkg1 on a mountain in
Peru to about 9.83 Nkg1 in Oslo).
Here are some values for gravitational field strength elsewhere in the
solar system.
Moon Sun Venus Mars Jupiter Saturn
1
Surface gravity/Nkg 1.62 275 8.87 3.69 24.8 10.5
Surface gravity/gEarth 0.165 28.0 0.904 0.376 2.53 1.07
CG CG CG
If an object is suspended from a point it will align itself with the center
of gravity vertically below the point of suspension. This means that the
location of the center of gravity can be found by suspending the object
separately from two different points (A and B) and noting where the two
lines of suspension intersect inside the object. This is shown below for a
two-dimensional object.
56 • FOUNDATIONS OF PHYSICS
The reason that the center of gravity lies beneath the point of suspension
is that its line of action then passes through the point of suspension and so
has no moment or turning effect about that point. If the object is rotated
slightly so that the center of gravity lies on either side of this line, then there
would be a resultant moment causing the body to rotate back toward the
equilibrium position. Equilibrium of moments (see Section 4.4.4) can be
used to calculate the position of a center of gravity for an extended body.
The idea is simple.
Center of
A gravity where
B
both lines cross
A
B
The resultant moment of the body about any point must be equal to the
moment produced by its weight acting through the center of gravity. From
this equality we can find the distance of the center of gravity from the point
about which we are taking moments. Here is an example for a uniform rod
of length l and mass m. We take moments about one end of the rod.
l
xCG
A
x δx
mg
This must equal the moment of the weight, mg, acting through the center
of gravity, about A.
mgl
mgxCG
2
so xCG l /2; the center of gravity is at the midpoint of the rod, as expected.
The terms “center of gravity” and “center of mass” are often used
interchangeably, but they are only in the same position when the object
concerned is placed in a uniform gravitational field. If the field varies
significantly across the object, then they will be in different positions. For
example, for the rod above, if the value of g increases from left to right
along the bar then the contributions to the resultant moment would be
greater from strips near the right-hand end. This would move the position
of the center of gravity to the right of the center of the bar, i.e., xCG > l/2.
The center of mass on the other hand would not be affected and would
remain at the center of the uniform bar. In practice the difference between
the two positions is rarely significant.
T1 T2
T1
T2
W
58 • FOUNDATIONS OF PHYSICS
If there are more than three forces in equilibrium then they will form a
closed quadrilateral of four or more sides.
Here is an example where the triangle of forces is used to calculate an
unknown force keeping the system in equilibrium. The diagram shows a
pendulum held in equilibrium by a horizontal force F. The problem is to
find the magnitude of F.
Triangle of forces
25°
25°
8.0 N
F F
8.0 N
From the triangle of forces, tan (25°) F/8.0, so F 8.0 tan (25°) 3.7 N
Using a triangle of forces is sometimes a convenient method to solve a
simple equilibrium problem, especially if two of the forces are perpendicular
to one another. However, for more complex problems we need a more
general method; this involves resolving forces along perpendicular axes.
Dividing (1) by (2), tan (25°) F/8.0, soF 8.0 tan (25°) 3.7 N as before.
In this particular case the method of resolving is a little more involved
than using the triangle of forces, but this method is much simpler if there
are a large number of forces and if none of them are perpendicular.
ion d
o f act
Line
P
Moment about P Fd
The S.I. unit of moment is Nm and the direction of the moment is usually
described as clockwise or counterclockwise about the pivot. In this example
it is clockwise.
You might be tempted to think that this unit is equivalent to the joule,
since 1 J 1 Nm. However, when calculating a moment, the force and
displacement are perpendicular, whereas for work done, they must be
parallel.
F
Moment about P = Fl sin θ
l θ
P
d
60 • FOUNDATIONS OF PHYSICS
Notice that d is the perpendicular distance from the pivot, not the
distance between the point where the force acts on the body and the pivot.
This means that we often have to resolve the force to calculate the moment.
The example below shows how to calculate the moment of an inclined force
acting on a uniform rod at a distance l from the pivot P.
This can be regarded as the magnitude of the force multiplied by the
component of l perpendicular to the line of action of the force (F l sin ),
OR the component of force perpendicular to l multiplied by l (F sin l).
The term torque is often used in engineering or when studying rotational
dynamics. This is simply another name for a moment and is calculated in
the same way and measured with the same units.
If the line of action of a force acts through P its moment about P is zero.
0.7 m
0.3 m 0.5 m
8.0 N 5.0 N
3.0 N 12.0 N
. . F
. .
F
CM CM CM CM
y
d
x CM
F
The perpendicular distance of each line of action from the center of mass
is x and y respectively.
Moment of couple Fx Fy F (x y) Fd
Notice that the moment of the couple is independent of the individual
values of x and y and depends only on their sum—in other words, on the
separation of the two lines of action. This means that the moment of a
couple has the same value about any point in the body (or outside it).
Moment of a couple magnitude of one force perpendicular
separation of lines of action
Moment of a couple Fd
5m
0.3
5m
0.4
300 N
28°
Hinge Strut
471 N
θ
300 N
28°
Resolving horizotally,
H sin 471 sin (28°) or H sin 221 (1)
Resolving vertically, H cos 471 cos (28°) 300 or cos 116 (2)
Dividing (1) by (2) givs tan 221/116 1.91 and = 62.3°
Substituting back into (1) gives H 250 N
When coplanar forces act on an extended structure (as in the example
above), the principle of moments gives one equation and the equilibrium of
forces gives two more, so with three independent simultaneous equations,
it is possible to solve for up to three unknowns.
4.5 Stability
4.5.1 Types of Mechanical Equilibrium
When a system is in equilibrium the forces and moments acting on it
are balanced. However, if the system is disturbed, it might return to
equilibrium or depart from it. The behavior of the system when disturbed
is determined by its stability. Consider the three objects below, all of which
are in equilibrium and all of which are given a small clockwise displacement
from equilibrium.
Another way to look at this is in terms of potential energy. In A any
disturbance tends to raise the center of mass and increase GPE, so forces
act toward the equilibrium position because this is the local minimum of
potential energy. In B the disturbance lowers the center of mass and the
GPE, so forces continue act in the direction of the disturbance to lower the
potential energy of the system. In C the disturbance has no effect on the
64 • FOUNDATIONS OF PHYSICS
A B C
A B C
Moment returns Moment takes No resultant
object to object further moment:
equilibrium: from equilibrium: NEUTRAL
STABLE UNSTABLE EQUILIBRIUM
EQUILIBRIUM EQUILIBRIUM
A B C
Small Larger
displacement displacement
W W W
The limit of stability is reached when the line of action of the weight passes
beyond the corner of the base, which acts as a pivot. The resultant moment
then changes from counterclockwise to clockwise, and when the object is
released, it continues to rotate and falls over. The potential energy curve
looks something like this:
Potential energy
Angular
displacement
The dotted lines represent the limits of stability. These will move farther
apart if the object has a wide base and low center of gray.
66 • FOUNDATIONS OF PHYSICS
Applied
force
Frictional
force
Temporary bonds
The work done by the frictional force is W Fs, where s is the displacement
of one surface relative to the other.
In most cases, when two surfaces are in contact the frictional force
depends on:
Q The nature of the surfaces
Q The normal reaction force between the surfaces; the frictional force F
between two surfaces is directly proportional to the normal reaction
force, N: F N
Q But is independent of:
Q The area of contact
FORCES AND EQUILIBRIUM • 67
These simple rules work very well in many cases, but there are exceptions.
For example, a pointed object might dig into a surface. Another common
exception is for car tires in snow. Having wider tires does not affect the
normal force, but it does increase the friction because the snow does
not pack so much. Another way of thinking about this is to say that the
coefficient of friction is dependent on the normal force.
Frictional force
Gradually increasing Limit of static
applied force friction
Fractional Dynamic
force friction
Rough surface
Applied force
The limit of static friction is the maximum frictional force acting between
the two surfaces when they are at rest. To move the block, an applied force
greater than this limit must be used. Once the block moves and the surfaces
are sliding over one another, the friction drops to a lower dynamic value.
M Pulley
Frictional
force F
Mg
Masses are added to the hanger until the block just begins to slip. The limit
of static friction is then mg.
The normal reaction is N Mg.
Therefore mg SMg, so S m/M
M
Gradually lifted
until block
starts to slip
Angle at which block
just begins to slip θ
FORCES AND EQUILIBRIUM • 69
Flimit = µsN
W = Mg
4.7 Exercises
1. In which of the following situations (if any) are the forces acting on a man
in equilibrium?
Q Lying still in his bed
Q Sitting in a car seat when the car is traveling at constant velocity along
a motorway
Q Standing in a lift that is moving upwards at constant velocity
Q Floating apparently weightless inside an orbiting spacecraft
Q Floating in a swimming pool
70 • FOUNDATIONS OF PHYSICS
60°
30°
6.0 N 4.0 N
1.0 N
3.0 N
(a) (b)
3. A frame of mass 1.2 kg is suspended from a rigid support by two wires as
shown below.
Calculate the tension in each wire.
40 cm
15 cm
30 cm
0.15 cm
1.0 N
FORCES AND EQUILIBRIUM • 71
5. The diagram below shows a paving slab of mass 300 kg resting on two
wooden supports, A and B. Assume that the center of gravity of the slab
is at its geometric center and that the forces between the supports act at
the centers of their areas of contact with the slab.
2.00 cm
0.30 cm 0.50 cm
A B
5
NEWTONIAN MECHANICS
5.0 Introduction
Newton’s masterwork, Philosophiæ Naturalis Principia Mathematica
(“Mathematical Principles of Natural Philosophy”) is probably the most
famous and important book (actually three books) in the history of physics. It
sets out the laws of motion and the law of gravitation and then applies these
laws to the motion of planets in the solar system. It also marks the beginning
of mathematical physics. This is not because mathematical arguments had
not been used in physics before Newton but because Newton provided a
mathematical framework in which to tackle an enormous range of physical
problems. Newton’s work had, and still has, a remarkably wide impact on
science and philosophy and is even used as a model in other apparently
unrelated disciplines (e.g., economics!). If you want to be good at physics,
you need to be good at mechanics.
Imagine releasing a small ball from the top of a U-shaped ramp. In the
absence of friction we would expect it to rise up to the same height on the
far side of the U (A below). Now let the ramp on the far side slope up more
gradually. We would expect the ball to travel further along the ramp until,
once again, it reached its starting height (B below). Now let the ramp on the
far side continue on horizontally; what will happen to the ball? Logically it
seems it must continue to move at a constant velocity until such time as the
ramp rises back up again (C below). Galileo argued that there is no need for
an unbalanced force to keep things moving at constant velocity.
A B C
Comments
Q Most moving objects are acted upon by many forces (e.g., thrust,
gravity, contact forces, drag). If they are moving at constant velocity, the
resultant of all these forces must be zero.
Q If we know that the forces acting on a particular object are in
equilibrium, we cannot assume it is at rest; it might be moving at
constant velocity.
Q If an object is accelerating, decelerating, or changing direction, the
forces acting on it must be unbalanced; there is a resultant force. An
example is an object moving at constant speed along a curved path: since
the speed is unchanging, there is no force component parallel to the
motion, but there must be a component of force perpendicular to it in
order to cause a change of direction.
Shut yourself up with some friend in the main cabin below decks on
some large ship, and have with you there some flies, butterflies, and
other small flying animals. Have a large bowl of water with some fish
in it; hang up a bottle that empties drop by drop into a wide vessel
beneath it. With the ship standing still, observe carefully how the
little animals fly with equal speed to all sides of the cabin . . . When
you have observed all these things carefully (though doubtless when
the ship is standing still everything must happen in this way), have
the ship proceed with any speed you like, so long as the motion is
uniform and not fluctuating this way and that. You will discover not
the least change in all the effects named, nor could you tell from any
of them whether the ship was moving or standing still.1
This was a profound observation. Einstein realized that if the laws of
physics are the same in stationary and uniformly moving reference frames,
then there is no fundamental difference between rest and motion; it just
depends on what you choose as your reference frame. It led Einstein to
the special theory of relativity (see Chapter 24), but in Galileo’s time, it
served a different purpose. Galileo was convinced that the Earth orbited
the Sun rather than the other way around as was believed by most people
at the time. They thought that it was obvious that the Earth was not moving
because we cannot feel the motion. Galileo’s thought experiment showed
that you would not expect to feel the motion—everything would happen on
Earth as if it was at rest; so the argument against the Earth’s motion was
flawed.
Galilean relativity is the idea that the laws of mechanics are the same in
all uniformly moving reference frames.
1.
Dialogue Concerning the Two Chief World Systems, translated by Stillman Drake, University
of California Press, 1953, pp. 186–187 (second day).
76 • FOUNDATIONS OF PHYSICS
Definition of newton
A resultant force of 1 N accelerates a mass of 1 kg at 1 ms−2.
This makes the constant of proportionality 1, and the equation becomes:
F
a
m
Comments
Q This is a vector equation: the resultant force and acceleration vectors are
in the same direction.
Q The term inertia is given to the property of mass that resists
acceleration. The m in the equation above is sometimes called “inertial
mass.” The greater the mass of an object, the greater its inertia.
Q Newton argued that because objects in free fall accelerate toward the
Earth, they must be acted upon by a resultant force directed toward the
Earth—a gravitational force.
Q Newton realized that because the planets follow curved paths, they
must also be acted upon by a resultant force. This is a gravitational force
toward the Sun.
Q Newton thought that the orbital motion of the Moon was caused by
a gravitational force toward the Earth. This force has the same origin
as the force that makes objects close to the surface fall downwards.
By comparing the acceleration of an object near the surface with that
of the Moon, he was able to derive the famous inverse-square law of
gravitation (see Section 23.1.1).
experiment that showed that both should fall at the same rate. It is simple
but compelling, and it goes something like this.
According to Aristotle the combined object should fall faster than either
of the original objects because it is more massive. However, shouldn’t
attaching the smaller mass to the faster mass slow the larger mass down and
shouldn’t attaching the larger mass to the smaller mass speed the smaller
mass up? This argument suggests that the composite body should have a
speed intermediate between the speeds of the small mass and large mass
alone. Aristotle’s idea leads to a contradiction. On the other hand, if all
objects fall at the same rate then there is no problem.
Galileo is said to have tested this idea by dropping two cannon balls
of different sizes from the top of the Tower of Pisa and showing that they
landed at the same time. Most historians doubt whether he actually did this,
but in 1971, during the Apollo 15 mission to the Moon, a hammer and a
feather were dropped to the Moon’s surface; both landed at the same time.
Newton’s second law can explain this. The resultant force on an object
of mass m in a gravitational field of strength g is its weight, mg. Using the
equation above,
F mg
a g
m m
gravitational field strength is equal to the acceleration of free fall in that
field.
Comment
Q It seems obvious that we can cancel the two m’s in the equation above.
However, if we think about this a little more deeply, it is not so obvious
that they are the same thing. The m on the bottom of the equation is
the inertial mass; the m on the top is related to how strongly the mass
responds to a gravitational field, sometimes called the gravitational
mass. They are two different properties of mass, so the fact that all
objects fall with the same acceleration in the same gravitational field
shows that they are at least proportional to one another and possibly
78 • FOUNDATIONS OF PHYSICS
identical. This subtle point was one of the clues that helped Einstein
toward his general theory of relativity, a new theory of gravity.
EXAMPLES
2. When a ball rests on the ground, there are two “action–reaction” pairs.
Contact force from ground
acting on the ball
Gravitational force
from Earth on ball
Gravitational force
from ball on Earth
Notice that while it is true that the weight of the ball and the upward
contact force from the ground have equal magnitudes and act in opposite
directions, they do not form an action–reaction pair. They fail in three
respects: they are not the same type of force; they do not act on different
bodies; and they are not part of the same interaction.
3. When a ball rests on an accelerating surface, the contact force and the
weight are not equal. While each action–reaction pair remains balanced,
there is now a resultant upward force on the ball, as shown by the free
body diagram on the right in the figure below.
Ground Ball
accelerating accelerating
upwards upwards
Gravitational
force from
Earth on ball Gravitational
Gravitational force from
force from Earth on ball
Contact force from ball
ball on Earth
acting on the ground
80 • FOUNDATIONS OF PHYSICS
so Ft (mv – mu)
Or impulse change of momentum
The S.I. unit for impulse is newton-second (Ns), which is equivalent to
the unit for momentum, kgms−1.
We can derive a more general equation for impulse by integrating
dp
Newton’s second law F dt :
Fdt dp p
The term on the left-hand side is equal to the area under the graph of force
against time, and this is equal to the change of momentum. For example,
the force exerted on a football when it is kicked might vary as shown in the
graph below. The area under the graph will be equal to the momentum
transferred to the ball as it is kicked.
Force (N)
= Impulse (Ns)
Time (s)
FB on A FA on B
A B x
Taking positive values in the x-direction, the impulse on each body during
a short time t is given by:
Impulse on A FB on A t
Impulse on B FA on B t − FB on At
During the interaction, the impulse given to each body, calculated from the
integral Fdt, is also of equal magnitude but opposite in direction. Since
82 • FOUNDATIONS OF PHYSICS
change of momentum of system FA on Bdt FB on A dt 0
0 0
Linear momentum is conserved in an interaction between two bodies:
Newton’s third law tells us that all forces arise as a result of interactions,
so the above argument will apply many times over for a complex system of
interacting bodies, as long as we include all the pairs of forces. This can be
stated as the law of conservation of linear momentum.
The linear momentum of a closed system is constant.
Comments
Q A “closed system” means that we include all pairs of forces within the
system. Another way of saying this is that the linear momentum of a
system is constant if no external resultant force acts on the system.
Q Linear momentum is a vector quantity, so the conservation of linear
momentum implies a separate conservation of each component of linear
momentum.
Q An object moving along a curved path has a continuously changing
momentum (it is changing direction even if its magnitude is constant).
To solve problems involving conservation of momentum, it is useful to
consider the total momentum of the objects before and after they interact
and then to equate them. Here is a simple one-dimensional example from
nuclear physics.
Alpha decay: An unstable nucleus of mass M is initially at rest and decays
by emitting an alpha particle (this consists of 2 neutrons and 2 protons) of
mass m. The new nucleus recoils with velocity u. What is the velocity, v, of
the emitted alpha particle?
Unstable nucleus
New nucleus Alpha particle
M M m m
u v
Before After
NEWTONIAN MECHANICS • 83
Before After
m1v1 m2v2
m1u
84 • FOUNDATIONS OF PHYSICS
The sum of the vector momenta after the collision must equal the total vector
momentum before the collision. Solving the triangle using trigonometry is
equivalent to taking components and deriving the equations (1) and (2)
above.
5.3.1 Work
Work is the transfer of energy when the point of application of a force
moves in the direction of the force:
Work done (J) force applied (N) × displacement parallel to force (m)
The S.I. unit of energy is joule (J).
1 joule of energy is transferred when a force of 1 N moves through 1 m
(1 J 1 Nm).
If the force is at an angle to the displacement, then the component of
force parallel to the displacement must be used.
NEWTONIAN MECHANICS • 85
Displacement: s
Displacement (m)
h F
The diagram above shows a mass m being lifted through a height h by two
different routes: vertically to A and along an inclined path to B. The vertical
force needed to lift the mass is always equal to mg since the applied force
must balance the weight of the mass. In the absence of frictional forces,
there is no horizontal component of force to consider.
Work done lifting mass vertically: W mgh
Work done along the inclined path: W mg (h/cos ) cos mgh
The work done is independent of the path taken and depends only on the
height through which the mass is lifted. If the mass is allowed to fall back
to the ground, it will gain kinetic energy, which could be used to do some
useful work (e.g., generating electricity). Lifting it in a gravitational field
has given it the potential to do work.
Q Gravitational potential energy (GPE) is the potential energy an object
has due to its position in a gravitational field.
In a uniform gravitational field, change in GPE is given by:
GPE mgh
If a mass is moved around a closed loop inside a gravitational field (any
gravitational field, not just a uniform field), the work done by an external
agent in lifting it is equal to the work done by the gravitational field as it
NEWTONIAN MECHANICS • 87
comes back down. If it returns to its original position, the work done in
the loop is zero. This is an example of what is called a “conservative field.”
Gravitational fields are conservative fields.
In a short time t, the work done on the mass is given by: W Fs Fvt
v v
Using F ma m , W m vt mvv
t t
Taking the limit of t0 and integrating between initial and final
velocities, we get:
v
1 1
W mvdv mv2 mu2
2 2
u
The work done is equal to the change in kinetic energy, so KE ½ mv2
It is also possible to derive this relationship by considering a constant
resultant force acting on a constant mass to produce a constant acceleration.
The displacement during acceleration is given by:
(v2 u2 )
s
2a
Multiplying throughout by mass m and acceleration a gives:
m(v2 u2 )
mas
2
Using F ma, we obtain:
W Fs ½ mv2 − ½ mu2 as before
5.3.4 The Law of Conservation of Energy
The law of conservation of energy can be stated in several different ways.
Here are two of them:
Q Energy is never created or destroyed, but one form can be transferred
to another.
Q The total energy of a closed system is constant.
88 • FOUNDATIONS OF PHYSICS
Unstable nucleus
New nucleus Alpha particle
M M m m
u v
Before After
Total momentum mv − (M − m) u 0
Total energy ½ mu2 ½ mv2 0 (direction does not the affect sign
of KE)
Notice that, in this example, kinetic energy is not conserved. It is
important to realize that the law of conservation of energy is about total
energy and not any one kind of energy. In some interactions and collisions
(usually on the atomic scale), kinetic energy is conserved. When this is
the case, the interaction is described as an elastic interaction. In avast
majority of interactions, some of the initial kinetic energy is transferred to
other forms (such as thermal energy), and these are described as inelastic
interactions.
Type of interaction Momentum Kinetic energy
Elastic conserved conserved
Inelastic conserved not conserved
v2
y
u1
v1
100 J 60 J used
Electrical Radiant
energy energy
40 J wasted
Thermal energy
The efficiency of a transducer is the ratio of useful output energy to total
input energy, usually expressed as a percentage. In the case of our compact
fluorescent light, this would be 0.40 or 40%.
Useful output energy
Efficiency= 100%
Total input energy
5.3.7 Power
Power is the rate of transfer of energy.
Energy transferred
Power=
Time
dE
For continuous energy transfers, P
dt
NEWTONIAN MECHANICS • 91
Exhaust Intake
The force exerted on the jet engine will be equal in magnitude to the rate
of change of momentum of the gas passing through the engine. While it
is true that the combustion of fuel inside the engine adds mass to the air
passing through it, this is in practice a very small contribution; so we can
simplify the analysis by assuming that the mass flow in and out of the system
is solely due to the air. This means that the mass flow rate in and out of the
jet is the same.
NEWTONIAN MECHANICS • 93
If u is the air speed of the aircraft and v is the speed of exhaust gases, then
the thrust is given by:
dm
F (v u)
dt
Clearly v must be greater than u for the jet to produce a forward thrust.
There are two ways to increase the thrust:
Q Increase the mass flow rate (dm/dt)
Q Increase the exit velocity of gases (v − u)
Different types of jets use different methods.
5.5.2 Rockets
A jet cannot work in a vacuum, because there would be no external material
against which to push. Rockets get around this problem by ejecting a large
amount of matter (in the form of burnt fuel) at a very high velocity. The
change in momentum of the ejected matter is equal and opposite to the
change in momentum of the rocket. Once again, we can derive an equation
for the thrust of a rocket using Newton’s second law, but this time we are
dealing with an object of changing mass.
d(mv)
F
dt
If the rocket is in space with no external forces acting upon it, then we
use the conservation of momentum to show that the change in velocity of
the rocket depends on the proportion of its mass expelled as burnt fuel.
Consider a short time t during which the rocket expels a mass m at a
velocity u relative to the rocket. At this time, the rocket has a forward
velocity v and a mass m.
Mass m Mass m
Exhaust velocity
relative to rocket = u Rocket velocity = v
leading to:
m
v f v0 u ln 0
mf
The final velocity of the rocket can be increased by:
Q Increasing the velocity of exhaust gases
Q Increasing the ratio of m0/mf.
The second condition requires that the final mass of the rocket be small
compared to its initial mass, so the majority of the rocket’s mass at launch
is in its fuel.
photon (from a positive value to an equal negative value). While the radiation
pressure from ordinary light sources on human-sized mirrors might be tiny
(< 10−6 Pa for 100 W of radiation falling onto a mirror of area 1 m2), it
has been suggested that high-intensity laser beams directed from the Earth
could accelerate a reflective micro-spacecraft up to very high speeds so that
they can make trips to the nearest stars within human lifetimes.
the air the ship runs many spans. The fish in their water will swim
toward the front of their bowl with no more effort than toward the
back, and will go with equal ease to bait placed anywhere around the
edges of the bowl. Finally the butterflies and flies will continue their
flights indifferently toward every side, nor will it ever happen that
they are concentrated toward the stern, as if tired out from keeping up
with the course of the ship, from which they will have been separated
during long intervals by keeping themselves in the air. And if smoke
is made by burning some incense, it will be seen going up in the form
of a little cloud, remaining still and moving no more toward one side
than the other. The cause of all these correspondences of effects is the
fact that the ship’s motion is common to all the things contained in it,
and to the air also. That is why I said you should be below decks; for
if this took place above in the open air, which would not follow the
course of the ship, more or less noticeable differences would be seen
in some of the effects noted.2
This is an important observation. This means that there is no way to
distinguish rest from uniform motion, so there is no way to be sure that
any particular reference frame will beat rest in space. All uniformly moving
reference frames are called inertial reference frames. It is often helpful
when selecting a particular inertial reference frame to simplify a problem,
and often the best one to choose is the center of mass frame.
2. Dialogue Concerning the Two Chief World Systems, translated by Still man Drake, University
of California Press, 1953, pp. 186–187 (second day).
NEWTONIAN MECHANICS • 97
c
u
Before After
If this could occur, the photon would take away the energy and momentum
of the electron–positron pair. However, this is in fact impossible, as can be
seen if we transform to the center of mass frame by subtracting u/2 from
the electron and photon.
u/2 u/2
?
Before After
The momentum before annihilation may be zero since the electron and
positron have equal mass but opposite velocities. This means that a single
photon would have to carry away energy but have no momentum. We have
seen earlier (Section 5.5.3) that photons have momentum p E/c, so this
is impossible. In reality, such an annihilation results in a pair of photons
emitted in opposite directions in the center of mass frame.
Before
After
This allows energy to be carried away and the total momentum to remain
at zero. The creation of a pair of identical photons when an electron and a
positron annihilate is utilized in PET scanners (see Section 29.5).
vt x′
x P
x x′
The equations of this transformation are:
x x − vt
y y
z z
t t
The laws of Newtonian mechanics are invariant (do not change) under
Galilean transformations. These transformations assume that physics takes
place against a background of absolute space and absolute time, which are
the same for all observers. However, early in the 20th century, Einstein
realized that the laws of electromagnetism are not invariant under Galilean
transformations. This realization ultimately led him to the special theory
of relativity, which postulates that the laws of physics should be the same
in all inertial reference frames. The only way that this could be true was if
measurements in space and time were all relative and not absolute. This is
explored further in Chapter 24.
of complex problems. They also provided new ways to think about physical
processes. One of the most significant approaches was developed by the
Italian-French mathematician Joseph-Louis Lagrange in 1788. A great
advantage of the Lagrangian method is that it helps to show the links
between Newtonian mechanics and quantum mechanics.
position (x, y, z). One of the beauties of the Lagrangian method is that it can
work equally well with polar or spherical coordinates (this is often difficult
when we start from F ma), so we can set up the theory with an arbitrary
set of “generalized” coordinates:
Positions: q1, q2, q3 (x, y, z) in Cartesian coordinates
Velocities: q 1 , q 2 , q 3 (vx, vy, vz) in Cartesian coordinates
The dot above the symbol represents differentiation with respect to
time (e.g., q dq/dt ).
Once the Lagrangian for the system is known, we can derive the
equation of motion using the Euler-Lagrange equation:
L L
t
q q
The method itself is quite simple:
1. Write down the Lagrangian for the system under consideration.
2. Use the Euler-Lagrange equation to derive the equation of motion.
Consider a simple example where a particle of mass m is released from rest
in a uniform gravitational field of strength g. We set x 0 at the surface and
consider only the x-direction (i.e., solve the problem in one dimension).
x
1 2 1
1. T mx and V mgx; so L mx 2 mgx
2 2
L
2. L.H.S. of Euler-Lagrange equation: mx
t x t
L
R.H.S. of Euler-Lagrange equation: mg
x
These are equal: mx mg
t
leading to: x g
NEWTONIAN MECHANICS • 101
5.8 Exercises
1200 N 2000 N 20
20
1500 N
Before After
6
FLUIDS
6.0 Introduction
The particles inside a solid vibrate about fixed positions unless the material
is placed under extreme stress. Particles inside a fluid, however, can move
past each other and change position; this allows them to flow when stresses
are applied to the fluid. Liquids and gases are fluids and their behavior can
be modeled using Newton’s laws.
Here are some key ideas to describe the behavior of fluids:
Density (kgm−3): m/V where m is the mass of the fluid (kg) and V
is the volume occupied by the fluid (m3).
Pressure (Pa): p F/A where F is the force (N) exerted by the fluid
perpendicular to an area A (m2). 1 Pa 1 Nm−2.
Pressure in a fluid acts in all directions, and the
pressure at the same level in a static fluid is constant.
A
F
F
Shear stress (Pa): When two parallel layers of area A are pulled in
opposite directions by a force F, the shear stress
acting on the layers is F/A.
Incompressible fluids: Liquids such as water do not compress easily, so
a useful model assumes that they have constant
volume and so their density is constant.
104 • FOUNDATIONS OF PHYSICS
W
h
FLUIDS • 105
This can also be derived by simply calculating the total weight of the
column and dividing by the area of the base: p0 Agh/A gh. However,
the approach above can be used when the density of the fluid changes with
depth, for example, to derive an expression of the atmospheric pressure at
altitude h.
We can now substitute this expression for density into the equation of
hydrostatic pressure gradient in a fluid:
dp mgp
g
dz kT
The expression on the R.H.S. depends on p, so we have to separate variables
before we can integrate between the ground (z 0) and some height (z h):
px
dp mg x
p
dz
kT
p0 0
p mg
ln x x
p0 kT
mg
x
kT
px p0 e
A B
FLUIDS • 107
In the manometer shown on the left, both tubes are open to the
atmosphere at pressure p0, so the levels are equal. In the manometer shown
on the right, an excess pressure p is connected to one tube. The pressures
at A and B inside this manometer must be equal because these points are
at the same level in the same fluid. However, the pressure at B is also equal
to the pressure p0 plus the pressure caused by the column of fluid BC of
height h.
pA p0 p pB p0 gh
p gh
The lower the density of the liquid used, the greater the sensitivity of
the manometer (i.e., the greater the change in height per unit change in
pressure).
6.1.4 Barometers
Imagine a manometer with one end open to the atmosphere and the other
end attached to an effective vacuum pump. The pressure difference is
equal to atmospheric pressure pAt, so the height of the column can be used
to measure atmospheric pressure. This is the principle of the barometer.
However, instead of connecting one end of a manometer to a pump, one
end is sealed and the fluid is allowed to fall away from the sealed end so that
a vacuum forms above it.
The pressure at A is equal to the pressure at B because both points
are at the same level in the same liquid. Since A is on the surface of the
liquid exposed to the atmosphere, the pressure at both points must be
atmospheric pressure.
Connected to Vacuum
PAt. vacuum pump:
p=0
PAt.
h
A B Mercury
reservoir
108 • FOUNDATIONS OF PHYSICS
6.1.5 Dams
The design of a dam must take into account all the forces that act on the
structure. The most important of these is hydrostatic pressure from the
trapped water. In a simple case, this could be the only force acting on the
dam assuming that the containing wall of the dam is vertical.
Water
h
F
Horizontal hydrostatic
force from water
Vertical downward
hydrostatic force from
water above the sloping
part of dam wall Weight of dam
Narrow column
of base area A
h
This is the weight of fluid displaced by the total volume of the block as
stated in Archimedes’ principle.
6.2.3 Flotation
An object will float if the buoyancy force can support its weight. The
maximum buoyancy force is when the object is completely submerged; so
for an object of volume V and average density obj to float in a fluid of
density fluid,
weight of object < buoyancy force
object Vg < fluid Vg
object < fluid
An object will float if its density is less than the density of the fluid in which
it is placed.
If an object’s density is equal to the density of the fluid in which it is
submerged, it is said to have “neutral buoyancy.” Divers and submarines
use neutral buoyancy to remain at the same depth under water.
6.3 Viscosity
Moving layers
Stationary layer
The greater the viscosity, the smaller the velocity gradient dv/dz for the
same shear stress.
112 • FOUNDATIONS OF PHYSICS
Two regimes exist because there are two competing effects in the moving
fluid: inertial forces related to the density and speed of motion of the fluid,
and viscous forces related to the viscosity of the fluid and inversely to the
physical size of the channel in which the fluid is flowing (e.g., the diameter
of the pipe). If viscous forces dominate, eddies cannot form and the flow is
laminar. If inertial forces dominate, the flow will be turbulent.
The Reynolds number Re is a dimensionless constant that represents
the ratio of inertial forces (v) to viscous forces (/L) in a particular flow
situation. It is defined as:
vL
Re
where v is the flow velocity, is the fluid density, is the coefficient of
viscosity, and L is the characteristic length. For flow in a pipe, L would be
the diameter of the pipe; for flow between two parallel plates, it would be
the separation of the plates.
FLUIDS • 113
A1 v1 A2 v2
The flow velocity in the wider part of the pipe is v1 and in the narrower part
is v2. In a short time t, the mass flow through area A1 is 1A1v1t and that
through area A2 is 2A2v2t. These must be equal, so in general:
1 A1v1 2 A2v2
This is called the equation of continuity.
For an incompressible fluid, the density is constant, so:
A1v1 A2v2
In this case, the flow velocity is inversely proportional to the area of the
pipe:
v1 A2
v2 A1
6.4.3 Drag Forces in a Fluid
When an object moves through a fluid, it exerts a force on the fluid to make
it flow around the moving object. By Newton’s third law, the fluid exerts
an equal but opposite force on the moving object; this is the origin of the
drag force.
The drag force depends on the nature of the flow around the object
(e.g., laminar or turbulent) and is determined by the properties of the fluid,
such as density and viscosity, and on the velocity, size, and shape of the
moving object. It may also be affected by nearby boundaries (e.g., if the
fluid is inside a container).
There are two particular situations that give simple expressions for the
drag force.
Q Viscous forces dominate (Re<< 1000): drag force is directly proportional
to velocity.
114 • FOUNDATIONS OF PHYSICS
The drag force arises as a reaction to shearing the layers of the fluid
as they flow around the object. These forces depend on the velocity
gradient and therefore on the velocity of the moving object.
Q Inertial forces dominate (Re >> 1000): drag force is directly proportional
to velocity-squared.
The inertial force arises as a reaction to the force needed to accelerate
the fluid in front of the moving object up to the velocity of the object.
This is directly proportional to the rate of change of momentum of the
fluid in front of the object, which is proportional to the mass of fluid
encountered per second multiplied by the velocity of the object. Since
the mass encountered per second is also proportional to the velocity, the
drag force will be proportional to the velocity-squared.
Motion Drag
y
MLT2 (ML1T1 ) x LT1 L
z
F ( ) x (v)y ( A)z
y z
MLT2 (ML3 ) x LT1 L2
For the dimension of mass: 1x
For the dimension of time: −2 − y, so y 2
For the dimension of length: 1 −3x y 2z, so z 1
116 • FOUNDATIONS OF PHYSICS
1 1
P1 v12 gh1 P2 v22 gh2
2 2
where h1 and h2 are heights above some reference level.
The Bernoulli equation only applies when several conditions are met:
Q The flow is steady.
Q The flow is laminar and not turbulent.
Q The fluid is inviscid.
1 2 1 2
Using the Bernoulli equation, P1 2 v1 P2 2 v2
The equation of continuity shows that v2 > v1, so P2 < P1, and the static
pressure falls as the flow velocity increases. The kinetic energy of the fluid
has increased, so its potential energy has decreased.
Another way to think about this is by considering Newton’s second law.
The fluid must accelerate as it enters the constriction, so there must be a
resultant force from the wider part of the pipe. This comes from the greater
static pressure.
The derivation has two parts: (i) we use the equation for viscosity to work
out an expression for the velocity of flow at radius r from the center. Then
118 • FOUNDATIONS OF PHYSICS
(ii) we use this to work out the total rate of volume flow inside the pipe by
integrating over cylindrical shells.
(i) The applied force created by the pressure difference must balance the
viscous forces along its surface:
F r 2 p
F r 2 p rp
shear stress along surface =
(area of surface) 2 rl 2 l
rp dv
This must balance viscous forces, so
2l dr
The negative sign is because the velocity decreases from r 0 to r a.
0 a
pr
dv dr
2l
v 0
p 2
which gives v 4 l a r
2
Fluid Velocity
flow profile
(ii) Since different layers flow at different velocities, the total flow can be
found by integrating the volume flow for all thin cylindrical shells inside
the pipe.
Thin shell of radius
r and thickness r
flowing with velocity v(r)
r
r+
r
This is Poiseuille equation. The volume flow rate depends on the fourth
power of the pipe radius a and is directly proportional to the pressure
gradient p/l.
Narrow tube
Fill Drain of length l Collection
beaker
Area A Area a
2. Lower
static pressure
1. Higher static
pressure
Fluid
flow
Dynamic Static
pressure pressure
Total Static
pressure pressure
Pressure transducer
The tube points into the direction of air flow, and pressure sensors are
used to measure the difference between the total and static pressure. Since
total pressure is equal to the sum of the static and dynamic pressures, the
difference between these is just the dynamic pressure ½v2. This value can
then be used to calculate the speed of the fluid relative to the Pitot tube.
Pitot tubes are used to measure the speeds of aircrafts and boats.
2 ptot pstat
v
122 • FOUNDATIONS OF PHYSICS
6.6 Exercises
1. Scuba divers estimate that the excess pressure they experience when
they dive increases by an amount equal to the atmospheric pressure for
every additional 10 m of depth.
(a) Show that this is approximately correct.
Density of seawater 1030 kgm−3. Atmospheric pressure at
sea level 101 kPa.
(b) Estimate the total pressure at the bottom of the Mariana Trench in
the Pacific Ocean (about 11 km below sea level).
(c) Seawater is slightly compressible—how will this affect your answer
to (b)?
2. A vertical dam wall contains water of density to a depth h. The
horizontal length of the dam wall is l.
(a) Write down an expression for the excess pressure at depth x below
the surface of the water.
(b) Show that the total horizontal force acting on the wall of the dam
from the water is given by the expression F ½ glh2.
(c) By considering equilibrium of moments show that the line of action
of this force is 1/3h above the base of the dam.
3. The laminar flow of a viscous fluid through a narrow capillary tube
depends only on the radius a, the pressure gradient along the tube (p/l),
and the viscosity of the fluid, .
(a) Use the method of dimensions to show that the volume flow rate Q
is given by an expression of the form:
a4 p
Qconstant
l
(b) Explain why the method of dimensions cannot be used to determine
the value of the constant in the expression above.
(c) Explain why this formula is likely to break down for high flow rates
or larger diameter pipes.
4. (a) Show that the flow of air around a car is likely to be turbulent. You
will need to estimate the relevant quantities.
(b) A formula used for turbulent drag forces is:
F ½ CD Av2
where CD is the “drag coefficient.”
FLUIDS • 123
(c) Use your answer to (b) to calculate the Reynold’s number for the
falling droplet. Comment on your answer.
8. The diagram below shows two pipes, each of length l, but with
diameters d and d/2 respectively. The volume flow rate into X is Q and
the flow is laminar. The pressure at X is pX and the pressure at Z is pz.
The viscosity of the fluid is . Find an expression for the pressure at Y
assuming the fluid is incompressible.
d X Y Z
l l
9. (a) Explain how it is possible to drink water from a glass through a straw.
(b) Discuss whether there is a limit to the length of straw that can be
used to drink water. The density of water is 1000 kgm−3.
10. Show that the S.I. base units for viscosity are kgms−1.
11. The pressure difference measured by a Pitot tube on an aircraft’s wing is
20 kPa. What is the aircraft’s air speed?
12. When an inflated balloon is connected to one side of a water manom-
eter, the height difference between the manometer arms is 14.0 cm.
The atmospheric pressure is 102 kPa.
(a) What is the excess pressure inside the balloon?
(b) What is the total pressure inside the balloon?
(c) Suggest one advantage and one disadvantage of using mercury
instead of water in a manometer.
13. In a famous experiment, the French physicist Pascal placed one mercu-
ry barometer at the base of a mountain and carried a second one to the
top of the mountain. Both barometers had mercury columns of equal
height when they were together at the base of the mountain. However,
Pascal noticed that the mercury column on the barometer he carried
with him fell gradually as he climbed the mountain. Explain this effect
as carefully as you can.
14. The circulation of blood in the body can be considered as a continuous
circuit. Blood is pumped from the heart into the aorta, splits into the
arteries, splits again into the capillaries, and then returns to the heart
via the veins and finally the vena cava. This can be represented as an
electric circuit consisting of series and parallel resistors:
FLUIDS • 125
The table below gives the total area of each type of blood vessel along
with the average flow speed and volume flow rate.
(a) Complete the table.
Area Speed Volume flow
(cm2) (cms−1) rate (cm3s−1)
Aorta 3.0 30 90
Arteries 100
Capillaries 900 0.10
Veins 0.45
Vena cava 18
(b) Blood has a viscosity between 0.003 and 0.004 Pas. Discuss whether
blood flow in the human circulation is likely to be laminar or
turbulent.
CHAPTER
7
MECHANICAL PROPERTIES
7.1 Density
Density is a property of each material and is independent of the amount of
that material. This is in contrast to mass, which depends on the amount of
material present. Density is defined as:
Mass
Density
Volume
The S.I. unit of density is kgm3, but gcm3 is also in common use. The
relation between these is:
1000 kgm3 1 gcm3
1 gcm3 0.001 kgm3
The densities of some common materials are listed below.
Density/kgm3 Density/gcm3
Air (sea level, 15°C) 1.225 0.001225
Water 1000 1.000
Wood 160 (balsa)–1300 (ebony) 0.16–1.3
Concrete ~2400 ~ 2.4
Aluminum 2700 2.7
Steel 7750–8050 7.75–8.05
Copper 8960 8.96
Lead 11340 11.34
Mercury 13560 13.56
Uranium 19100 19.10
128 • FOUNDATIONS OF PHYSICS
Density/kgm3 Density/gcm3
Gold 19320 19.32
Osmium (densest naturally 22590 22.59
occurring element)
r0 Separation
Attraction
MECHANICAL PROPERTIES • 129
origin and vary with distance as shown below (a positive force represents
repulsion and a negative force represents attraction).
The distance r0 is equal to the equilibrium separation of the particles. For
small displacements either side of this position, the graph is approximately
linear in many materials, particularly metals. This has two consequences:
Q When a force is applied to the material its extension (or compression)
will be directly proportional to the force. In other words it will obey
Hooke’s law in this linear region.
Q When a particle is displaced from its equilibrium position it will
experience a restoring force directly proportional to its displacement, so
that oscillations about the equilibrium will be simple harmonic
(see Chapter 11).
The work that must be done to separate two particles from their equilibrium
separation is equal to the area between the negative part of the graph and
infinity. This is also equal to the energy released when the bond between
the particles is formed. Since work must be done to push the particles closer
together or to separate them, the potential energy is a minimum when their
separation is r0. They are in a bound state and each particle is in a potential
well.
The graph below shows how the potential energy varies with particle
separation.
Potential energy
r0 Separation
130 • FOUNDATIONS OF PHYSICS
Extension/m
O
Permanent
plastic deformation
Section OP is (almost) a straight line through the origin, so in this region the
extension (x) is directly proportional to the force applied to the spring (or
the tension in the spring) (F).
F x or F kx
k is the “spring constant” with S.I. unit Nm1.
MECHANICAL PROPERTIES • 131
Series combinations
When n springs are connected in series and the system is stretched, the
tension in each spring must be the same and equal to F. This means the
extension of each spring will be the same as the extension of an individual
spring under the same load. The total extension of the system of n springs
is therefore ne.
Using Hooke’s law for the system of n springs in series gives:
kSERIES F/ne k/n
Using similar reasoning, a system of different springs with spring constants
k1, k2, k3, etc., connected in series has an overall spring constant given by:
1 1 1 1
Kseries k1 k2 k3
Connecting springs in series reduces the stiffness of the system.
Parallel combinations
When n springs are connected in parallel and the system is stretched by a
force F, the tension in each spring must be the same and equal to F/n. The
total extension of each spring (and the system) will be e/n.
132 • FOUNDATIONS OF PHYSICS
Work done
x Extension
If the spring obeys Hooke’s law then the work done to stretch it to an
extension x is equal to the shaded area in the graph above:
Area ½ Fx
or, using Hooke’s law,
Work done ½ kx2
If we assume that no energy is lost as the spring is stretched, the elastic
potential energy in a stretched spring is given by:
EPE ½ kx2
In general, the work done to stretch something is calculated from the
integral:
W F( x)dx
1
If the spring obeys Hooke’s law this is simply W kxdx kx2 . giving
2
the same result as before.
MECHANICAL PROPERTIES • 133
Some definitions
Tensile stress: is the force per unit area when the force is applied along
the axis of the sample as shown below.
e
A
F F
Force (N)
Stress(NM2 )
Cross-sectional (m2 )
F
A
2
The S.I. unit of stress is Nm or Pa.
Tensile strain: is the extension per unit length under a tensile stress.
Extension
Strain
Original length
e
l0
This is a ratio of two lengths, so it is dimensionless.
Stress (Pa)
ε Strain
d 2 E
m e
4 gl0
2
A graph of m on the y-axis against e on the x-axis has a gradient d E .
4 gl0
The Young’s modulus is given by:
4 gl
E gradient × 20
d
MECHANICAL PROPERTIES • 135
Clamp l
Test wire
Bench
Wire diameter is
measured by placing it
into this gap Collar: mm/100 scale (0.35 mm)
The reading is taken from the point where the lines on the shaft and collar
are aligned. The last reading in millimeters from the shaft is added to the
reading from the collar. In the example below the gap is 3.00 0.35 mm
3.35 mm. Any zero error for the micrometer must be subtracted from this.
(You can check for a zero error by closing the micrometer gap completely
and checking the reading; it should be 0.00 mm.)
The wire must then be clamped securely at one end.
Two light markers are attached to the wire a distance l0 apart. This
distance should be measured with an unkinked, unloaded but taut wire.
Using a larger value of l0 will increase the extensions and give better results.
136 • FOUNDATIONS OF PHYSICS
The wire is then loaded, adding one mass at a time and recording the
mass added and the length of the wire in a suitable results table. This is
repeated for at least seven different values of mass, and preferably many
more. However, care must be taken not to exceed the elastic limit for the
wire. If this does occur the graph will begin to curve, and the Young’s
modulus must only be calculated using the gradient of the straight part of
the graph.
Extensions should be calculated for each measured length and added
to the results table:
extension, e l l0
Care must be taken throughout the experiment because there is always a
danger that the wire could snap. Safety glasses should be worn!
Finally a graph of mass against extension is plotted and the Young’s
modulus can be determined from the gradient of this graph as shown above
(Section 7.4.1).
Strain
O
Q Region OP: The material obeys Hooke’s law in the initial linear region of
the graph where stress is directly proportional to strain.
Q The Young’s modulus of the material is equal to the gradient of line OP.
Q Up to Y the material behaves elastically. If the stress is removed it will
return to its original dimensions.
MECHANICAL PROPERTIES • 137
Q Y is the “yield point.” Beyond this level of stress the material will deform
plastically, and when the stress is removed there will be a residual
deformation.
Q The ultimate tensile strength(UTS) is at a stress uts.
Some typical values of ultimate tensile strengths are:
Material UTS/MPa
Carbon fiber 5650
Spider silk 1400
Stainless steel 860
Copper 220
Nylon 75
The dip in the curve is a result of the change in diameter of the sample. As
it begins to yield, its diameter becomes smaller, but in this graph the stress
has been calculated using the original diameter of the sample, so while the
calculated stress decreases the actual stress does not.
The area under a stress–strain graph represents energy per unit volume.
area under the curve d FA dxl Al1 Fdx
(Here we have assumed that the cross-sectional area of the sample has
remained constant during strain. This is approximately true for small
strains.)
The larger the area up to fracture, the more energy the material absorbs
before fracture and the tougher the material is said to be.
Stress
g
Loadin
g
adin
Unlo
Strain
O
Long chain Most molecules
molecules randomly aligned—stress
arranged, stress Molecules aligning but acts along
begins to straighten connected by cross-links chains and is
them by causing resisted by stiff
bond rotation covalent bonds
Ceramics are solid non-metallic materials such as brick, tile, pottery, and
china that are formed by high-temperature firing. They usually consist of
tiny ionic crystals bound together by amorphous glassy regions that formed
during firing. Ceramics are usually stiff, hard, and strong and have high
melting points and good chemical resistance, but they can be brittle.
Glasses are closely related to ceramics but are characterized by being
completely amorphous and result from a rapidly cooled melt. Glasses are
distinguished from ceramics by their microscopic structure. For example,
silica glass and quartz have identical composition consisting of SiO4 units,
but in the glass they are arranged randomly, while in the ceramic they are
arranged regularly.
Composite materials are combinations of two or more different
materials designed to take advantage of the desirable properties of each
individual component. Examples of composites are fiber glass, concrete,
steel, reinforced concrete, etc. Many composites consist of a matrix and a
reinforcement. Concrete is strong in compression, but weak in tension and
brittle, whereas steel is strong in tension and can prevent crack formation in
the concrete, providing a versatile and economical building material.
7.7 Exercises
1. A rectangular wooden block has sides of length 5.0 cm, 8.0 cm, and
12 cm. Its mass is 960 g.
(a) Calculate its density in kgm3.
(b) The absolute uncertainty in each measurement above (i.e., of side
lengths and mass) is ±4%. Calculate the absolute and fractional
uncertainty in the density.
2. A spring is extended 0.18 m by a force of 30 N.
(a) Calculate the spring constant of the spring.
(b) Calculate the work done to stretch the spring.
(c) A load of 20 kg is supported by 10 of these springs connected in
parallel with one another. Calculate the extension of the system
under this load.
3. The data below was collected from an experiment to stretch a steel
spring. The unstretched length of the spring was 10.0 cm.
Extension/cm Force/N
0 0
2.8 1.0
MECHANICAL PROPERTIES • 141
Extension/cm Force/N
6.2 2.0
9.6 3.0
13.2 4.0
16.5 5.0
20.1 6.0
23.2 7.0
26.6 8.0
30.2 9.0
33.0 10
36.3 11
40.2 12
44.4 13
49.4 14
57.1 15
68.5 16
96.5 17
113.2 18
(a) Plot a graph of force (y-axis) against extension (x-axis).
(b) State the limit of proportionality for this spring.
(c) Calculate the spring constant of the spring.
(d) What was the total length of the spring when it was pulled by a force
of 7.0 N?
(e) Calculate the extension of the spring if it supported a load of 850 g.
(f) For forces up to about 10 N the graph is a straight line through the
origin. Describe the relationship between force and extension over
this range. What law is the spring obeying?
(g) If two springs like the one above were connected in series, how much
would they extend when stretched by a force of 14 N?
(h) When the force of 18 N was removed, the spring did not return to its
original length. Explain this.
(i) Does the spring become stiffer or less stiff beyond the limit of
proportionality? Explain your answer.
4. The steel cable used to moor a ship has a length of 12.0 m and a cross-
sectional area of 8.2´104 m2. The force in the cable is 15,000 N.
(a) Calculate the extension of the cable.
(b) Calculate the strain energy (elastic P.E.) stored in the cable.
Density of steel 8050 kgm3, Young’s modulus of steel 200 GPa
142 • FOUNDATIONS OF PHYSICS
Strain
8
THERMAL PHYSICS
8.0 Introduction
Thermal energy (often simply referred to as heat energy or heat) is the
energy an object has as a result of the random thermal motions of its
particles. This is not the same thing as temperature. For example, the
Atlantic Ocean has a lower temperature (perhaps 20°C) than a freshly made
mug of coffee (perhaps 75°C) but has much more thermal energy because
it contains many more particles. While the formal definition of temperature
is quite complex and involves an understanding of the concept of entropy,
a simple way to think about temperature is to relate it to the mean energy
per particle. The mean energy of the water particles in a mug of hot coffee
is greater than the mean energy of water molecules in the Atlantic Ocean.
8.4.1 Conduction
Conduction involves the transfer of energy between particles as a result of
collisions. It is the most important process for heat transfer inside a solid
because particles in a solid remain in fixed positions and so cannot form
convection currents. Conduction also occurs in liquids and gases, but here
it is often less important than convection since particles in liquids and gases
are able to move in convection currents transferring heat energy as they do
so. The process of conduction is a dynamic one, but for macroscopic objects
the transfer of energy from more energetic particles to less energetic
particles has a higher probability than transfer in the opposite direction, so
heat flows from higher to lower temperatures. Conduction cannot occur in
a vacuum because there are no particles to conduct the heat.
dQ
The rate of heat flow (in watts) through an insulated block of
dt
material depends on the type of material, its cross-sectional area A, and the
temperature gradient between opposite sides of the block.
The temperature gradient in the block is:
l
Direction of heat flow A
Temperature θ1
d 2 1
dx l
THERMAL PHYSICS • 147
The high values for metals is because they contain large numbers of
free electrons that rapidly transfer heat. Diamond has an especially high
coefficient of thermal conductivity because its atoms are bonded very
tightly together and this couples their motions, so that when one atom is
disturbed it has an almost immediate effect on its neighbors and so transfer
energy quickly.
148 • FOUNDATIONS OF PHYSICS
8.4.2 Convection
Convection can only occur in fluids, i.e., liquids or gases. This is because
convection involves bulk movement of hot matter from one part of
the medium to another. Natural convection currents arise as a result of
changes of density inside the fluid. In most fluids the material expands as its
temperature increases. This increases its volume and decreases its density
so that it is then displaced by cooler, denser material from above it. The
overall effect is to set up a convection current with warmer material rising
and cooler material falling. This is often the dominant mechanism for heat
transfer within a fluid.
Heat source
8.4.3 Radiation
All bodies emit and absorb a spectrum of electromagnetic radiation that
depends on the nature of their surface and their temperature. For many of
the hot objects we encounter in everyday life, the peak of this spectrum is
in the infrared region, so thermal radiation is often referred to as infrared
radiation although the actual spectrum of thermal radiation is continuous.
Thermal radiation is part of the electromagnetic spectrum, so it can be
transferred in a vacuum. While the absorption and emission of radiation
is a complex subject, it is usually true that matt black surfaces are good
absorbers and emitters, whereas light shiny surfaces are poor absorbers
and emitters. Thermal radiation, like light, can be reflected from a silvered
surface, e.g., on the inside of a thermos flask.
THERMAL PHYSICS • 149
Wavelength
Black body radiation
spectrum
150 • FOUNDATIONS OF PHYSICS
Some terminology
Q Heat capacity is the energy needed to increase the temperature of a
particular sample of a substance by 1°C (or 1K).
Q Specific heat capacity is the energy needed to increase the temperature
of 1 kg of a substance by 1°C (or 1K), i.e., this is the heat capacity per
unit mass.
Q Molar heat capacity is the energy needed to increase the temperature
of 1 mole of a substance by 1°C (or 1K), i.e. the heat capacity per unit
amount (the S.I. unit of amount is the mole).
V
Thermometer
A
Insulation
Electrical
heater
Substance
under test
Temperature/°C
At higher temperatures
heat loss through the
insulation increases, so the
Initially some of the graph curves downwards
energy supplied heats
the heater, so the graph
curves upwards In the linear
section almost all
of the heat
supplied is heating
the substance
under test
Time/s
E
P mc
t t
Rearranging this equation gives:
P
c
m
t
is equal to the gradient of the linear part of the graph above.
t
m can be measured using a top pan balance.
P is calculated from the ammeter and voltmeter readings (P IV).
8.8 Exercises
1. A brick hut with a flat wooden roof is heated so that the inside temperature
is 22°C when the outside temperature is 12°C. The dimensions of the
hut’s base are 4.5 × 3.5 m and the walls are 2.6 m high and 10 cm thick.
There is a glass window of area 1.5 m2 in one of the walls. The thickness
of the glass is 6.0 mm and the thickness of the wooden roof is 3.0 cm.
(a) Calculate the minimum power of the heater needed to maintain the
temperature at 22°C when the outside temperature is 12°C.
(b) Explain why, in practice, more powerful heating will be needed.
Thermal conductivities: brick, 0.80 Wm1K1;wood,0.16 Wm1K1; glass,
0.96 Wm1K1
2. An 800 g block of copper at 76.0°C is immersed in 2000 cm3 of water,
which is initially at 18.0°C. The water is in a thermally insulated
container.
Calculate the final equilibrium temperature of water and copper
assuming there is no heat loss to the surroundings.
Specific heat capacities: copper, 385 Jkg1K1;water, 4180 Jkg1K1
3. The table below gives the specific heat capacities and molar masses of
four metals:
Metal Specific heat capacity/ Molar mass/g
Jkg1K1
Copper 385 63.5
Aluminum 897 27.0
Iron 449 55.8
Mercury 140 200.6
Calculate the molar heat capacity of each metal. Comment on your
results.
4. Calculate the energy needed to change 2.0 liters of water at 20°C into
steam at 120°C.
THERMAL PHYSICS • 155
Ambient
temperature
Time
5. When molten metal is poured into a cast it cools down and solidifies.
The graph below shows a typical cooling curve for the metal. Explain the
shape of the graph and the significance of the three regions.
CHAPTER
9
GASES
9.1.0 Introduction
The state of a fixed amount of gas is defined by three parameters: its
volume, pressure, and temperature. These are all dependent on one
another so that when a gas is compressed its volume and temperature might
change. The gas laws are a set of macroscopic empirical laws that describe
these relationships and which will be explained later when we consider the
microscopic kinetic theory model of a gas (see Section 9.3.2). Given that
there are three parameters, there are also three gas laws, each of which
holds when one of the parameters is constant.
Boyle’s law: dependence of pressure on volume (constant temperature)
Charles’s law: dependence of volume on temperature (constant pressure)
Gay-Lussac’s law: dependence of pressure on temperature (constant
volume)
While many gases under a wide range of physical conditions will
approximately obey these laws, an ideal gas is a theoretical gas that obeys
them perfectly.
The three gas laws are expressed most clearly when temperature is
measured using the thermodynamic or kelvin scale and can be combined
into a single equation—the ideal gas equation.
158 • FOUNDATIONS OF PHYSICS
Hand pump to
Length of air increase pressure
column: directly
proportional to
the volume of
the gas
Bourdon
gauge to
measure gas
pressure
Colored water
acts as a piston to
compress the gas Fluid
reservoir
The length of the air column is measured for a range of different pressures.
Since the tube containing the air has a constant cross-sectional area, the
volume of air is directly proportional to the length of the column. The
results of such an experiment show that the pressure p and volume V are
inversely proportional to one another. This can be expressed algebraically
by p 1 / V or pV constant. This is called Boyle’s law:
Q Boyle’s law: pV constant for a fixed amount of an ideal gas at constant
temperature
Pressure Pressure
Volume 1/Volume
GASES • 159
Temperature/°C
More mass
Volume
Less mass
Temperature/°C
−273.15 0
More mass
Volume
Less mass
Temperature/K
0 + 273.15
Thermometer To pressure
sensor
Water bath
Air
Heat
Less mass
Temperature/K
0 + 273.15
This is called the ideal gas equation or equation of state for an ideal gas.
Holding any one of the three parameters constant reduces it to one of the
three gas laws, but it also applies if all three parameters change together.
The constant is directly proportional to the amount of gas. The value
of the constant for one mole of an ideal gas is R 8.314 JK1. The ideal gas
equation is often written in the form:
pV nRT
where n is the number of moles.
m v
x-direction
molecule will have velocity components, vx, vy, and vz, and the magnitude of
velocity will be given by:
v2 v2x v2y v2z
The molecules are moving randomly, so the mean-squared values of vx, vy,
and vz will all be equal:
v2x vy2 vz2
Therefore, v2 v2x vy2 vz2 3 v2x (2)
Using equations (1) and (2) above we can find an expression for the
average force on the wall of the container from a gas consisting of N particles
in rapid random motion:
Nm 2 Nm 2
F v v
a x 3a
where v2 is the mean-squared speed of the molecules.
We can now derive an expression for the pressure:
F Nm 2 Nmv2
p v
A 3 a3 3V
This is the kinetic theory equation for gas pressure, usually written as:
1
pV Nmv2
3
Since density nm/V we can also write this equation in the form:
1
p (v)2
3
Mean-squared and root mean-squared (rms) speeds
v12 v22 v32 vN2
Q Mean-squared speed, v2
N
2
Q rms speed, vrms = v
Note that the rms speed is not equal to the mean speed and that the mean-
squared speed is not the same as the mean speed-squared!
z
168 • FOUNDATIONS OF PHYSICS
Molecular speed
Q vp is the most probable speed. This means that there are more molecules
with speeds between vp and vp v than in any other similar-sized
range.
Q vm is the mean speed (vm 1.13 vp).
Q vrms is the rms speed (vrms v2 1.23 vp).
As the temperature of the gas increases the peak moves to the right and
falls. The number of molecules with higher speeds increases.
The area under each graph is the same because the total number of
molecules has not changed.
170 • FOUNDATIONS OF PHYSICS
Fraction of
molecules
between v
and v + ∆v
Higher temperature
Molecular speed
Fraction of
molecules
between v
and v + ∆v
Molecules in the
shaded region can
escape from the
liquid surface
Molecular speed
Minimum speed
needed to escape
If this is the case then a graph of ln (R) against 1/T should be a straight line
with a negative gradient and positive y-axis intercept:
ln(R)
1 /T
energy of the gas is constant for all isothermal changes (changes at constant
temperature).
δx
Work done by gas as piston moves a distance x:
W Fx pAx pV
where V is the small increase in volume when the piston moves a distance
x.
GASES • 173
W pdV
x1
This is also equal to the area under a graph of pressure against volume.
If the pressure is constant, the work done is simply:
W pV
GAS
Isochoric changes
An isochoric change is one that takes place with no change in the volume
of the system. If the volume is constant, there is no work done on or by the
system, so W 0 and U Q. The change in internal energy is equal to the
heat supplied to the system. An example of an (approximately) isochoric
change is the explosive combustion of a petrol and air mixture inside the
cylinder of a petrol engine.
Isobaric changes
An isobaric change is one that takes place at constant pressure. For a fixed
amount of an ideal gas, Charles’s law is obeyed: V/T constant. An example
of an (approximately) isobaric change is the expansion of a diesel–air
mixture as fuel is injected and ignited inside the cylinder of a diesel engine.
Another example is the change of state of a liquid to a vapor at constant
temperature and pressure. The vapor occupies a much larger volume than
the liquid, and as it expands against external pressure, it does work (e.g.,
pushing back the atmosphere).
When an ideal gas expands isobarically, work done is given by:
W pV
Work done
by heat
engine: W
Heat
dumped
into sink:
Q2
Sink at lower
temperature T2
GASES • 175
P2
C D
P1
B A
Volume
V2 V1
176 • FOUNDATIONS OF PHYSICS
Inlet valve
Spark plug Exhaust valve
Cylinder Working
head fluid
Piston
Cylinder
block
Connecting
rod
Crankshaft
The diagram shows an idealized indicator diagram for the Otto cycle.
Pressure
pC C
pD
D
pB
B
pA
A
O
Volume
V2 V1
Q CD: adiabatic expansion of gas (work is done by the gas but no heat is
transferred)—power stroke
Q DA: isochoric cooling of gas (heat is lost from the gas but no work is
done by it)
AB corresponds to the compression stroke and CD to the power stroke.
The closed loop OAO represents the intake and exhaust strokes. The area
contained by the loop ABCD represents the net work done in one cycle.
Analysis of the Otto cycle
For an heat engine,
T2
efficiency 1
T1
where T2 is the sink temperature and T1 is the source temperature. In the
idealized diagram above, T1 TC and T2 TD, so the theoretical efficiency
of the Otto cycle is:
T
efficiency 1 D
TC
This can be linked to the compression ratio,V1/V2, by using the equation of
state for an ideal gas and the relation pV constant for adiabatic changes.
We need to find an expression for TD/TC. Consider the power stroke
CD. This is an adiabatic change, so:
pC V2 pD V1 (1)
The ideal gas equation also applies:
pC V2 pDV1 (2)
TC TD
Combining these equations,
1
TD pD V1 V2 V1 V2
TC pC V2 V1 V2 V1
The theoretical efficiency of the Otto cycle is, therefore,
V2 1
efficiency 1
V1
Using a typical compression ratio V1/V2 10 and taking 1.4 gives a
maximum theoretical efficiency of (1 0.100.4) 0.60 (or 60%).
GASES • 179
pD D
pA A
O
V2 V2 V2 Volume
180 • FOUNDATIONS OF PHYSICS
9.8 Exercises
Gas
(a) Sketch a graph to show how the pressure inside the syringe varies
with volume as the gas is slowly compressed (e.g., as if someone put
their finger over the end of the syringe and slowly pushed the plunger
in).
Assume this is slow enough for the temperature to remain constant.
(b) On the same graph draw a second line to indicate how you would
expect pressure to change with volume if the plunger is pushed in
quickly.
(c) Explain the difference between the lines in (a) and (b) by referring to
work and internal energy.
4. Air at atmospheric pressure and temperature has a pressure of about 105
Pa and a density of about 1.2 kgm−3.
(a) Calculate the rms speed of an air molecule.
(b) Air consists mainly of nitrogen and oxygen and the rms speeds of
oxygen and nitrogen molecules in the atmosphere are different.
Explain why.
5. Suggest why the ideal gas equation will cease to apply:
(a) at very high pressures
(b) at very high temperatures
(c) at very low temperatures
6. The speed of sound in air is given by:
p
v
where is a dimensionless constant, p is the air pressure, and is the
air density. By treating air as an ideal gas, show that the speed of sound
in air is independent of pressure but directly proportional to the square
root of absolute temperature. (Hint: consider one mole of air and use
the ideal gas equation to relate density to pressure and temperature by
introducing the molar mass M).
7. Eight molecules in a gas are moving parallel to the x-axis and have
x-components of velocity equal to 510.0 m/s, 550.0 m/s, 495.0 m/s,
548.0 m/s, 498.0 m/s, 502.0 m/s, 518.0 m/s, 535.0 m/s
182 • FOUNDATIONS OF PHYSICS
14. Here is the Otto cycle for an ideal four-stroke petrol engine:
Pressure
pC C
pD
D
pB
B
pA
A
O
Volume
V2 V1
10
STATISTICAL THERMODYNAMICS
AND THE SECOND LAW
10.0 Introduction
The gas laws and the ideal gas equation are all derived empirically, that
is, from experimental data. They give a good mathematical description of
the behavior of an ideal gas, but they offer no explanation of why the gas
behaves in this way. The reason for this is that they deal with macroscopic
properties such as volume, temperature, and pressure and do not engage
with the microscopic behavior of the molecules that constitute the gas.
Kinetic theory (see Section 9.3) provides a more fundamental explanation
of the behavior of an ideal gas, despite being based on the random motion
of particles. Statistical thermodynamics takes this further and provides one
of the most important and enigmatic of all physical laws—the second law
of thermodynamics.
happens. In one sense this is true. Macroscopically the ink droplet has
spread uniformly throughout the water and that is the end of it. However,
on the microscopic scale, particles of water and ink are continually moving,
colliding, and changing positions. These microscopic collisions are all
governed by Newton’s laws and are completely reversible. This means that
if you were to analyze a particular collision and then run it backwards, the
reversed collision would be a perfectly acceptable physical process. In fact,
if you were presented with videos of a single collision running forwards and
backwards, you would be unable to tell which one goes forward in time.
Both would obey Newton’s laws and conserve energy, momentum, etc.
Direction of time
Direction of time
Newton’s laws are reversible. So are the laws of electromagnetism and
gravity. They do not distinguish between the past and the future. However,
macroscopic processes are almost always irreversible. The pale blue water
does not resolve itself back into separate clear water and a dark drop, a
broken glass remains broken, a scrambled egg remains scrambled and, more
personally, we remember the past but not the future and we all eventually
grow old and die. This raises two very important questions.
Water returned
at 16 °C (289 K)
δQ
T1 T2
STATISTICAL THERMODYNAMICS AND THE SECOND LAW • 189
Q
SA
T1
Q
SB
T2
Q Q
Ssystem 0 (second law of thermodynamics)
T1 T2
Q Q
Therefore, , so T1 T2
T2 T1
If entropy cannot decrease then either T1 > T2 and heat flows from hot
to cold or T1 T2 and the system is in thermal equilibrium with maximum
entropy. This shows that the second law expressed in terms of entropy is in
agreement with the statement about direction of heat transfer. Heat cannot,
in isolation, flow spontaneously from a colder to a hotter body because this
would decrease the entropy of the system.
Q2 Work done: W
Entropy of sink
Sink at T2 increases as heat
is dumped
Work done has no effect on the entropy of the system, so the entropy
changes of the system are:
Q
Ssource 1
T1
Q2
Ssink
T2
Q Q
Ssystem 1 2 0 (second law of thermodynamics)
T1 T2
190 • FOUNDATIONS OF PHYSICS
states. The graph below shows how the “number of ways,” W, changes with
the distribution of particles:
Number of
micro-states W
Macro-state
N:0 N/2 : N/2 0 : N distribution L:R
As N increases, the peak becomes incredibly sharp (tall and narrow) so that
virtually all of the micro-states cluster close to the N/2:N/2 equilibrium
distribution. Most of the micro-states correspond to macro-states that are
indistinguishable from this 50:50 distribution. Since the particles move
randomly we can assume that, left alone, the system will explore all of the
microstates available to it. The system we considered started off in a macro-
state which corresponds to only 1 micro-state. As time goes on it explores
other adjacent micro-states, most of which lie closer to the peak because
the number of micro-states increases that way. This means the system tends
to evolve toward the peak. Given the fact that the number of micro-states
close to the peak vastly outweighs all of those even a short distance away
from it then, if we leave the system for any length of time, the overwhelming
probability will be that we will find it in a macro-state close to the peak of
the distribution—i.e., close to equilibrium. And once in equilibrium it is
highly unlikely to fluctuate far from it because the number of micro-states
drops rapidly away from equilibrium. For the systems we usually interact
with, N is enormous (e.g.,>1020), so the probability of a large fluctuation
from equilibrium is so small that it can be assumed to be zero.
We can now give a more fundamental, microscopic, explanation of the
irreversible evolution toward equilibrium. It is based on three assumptions.
Q Macroscopic systems can be composed of a number (usually a very large
number) W of microscopic configurations or micro-states (sometimes
called “number of ways”).
STATISTICAL THERMODYNAMICS AND THE SECOND LAW • 193
Number of
micro-states W
Macro-state
N:0 N/2 : N/2 0 : N distribution L:R
When two systems at temperatures Thot and Tcold are placed in thermal
contact heat flows from the system at higher temperature to the system
at lower temperature. We showed (see Section 10.2.2) that this is a
consequence of increasing entropy. Removing heat Q from the hotter
system reduces its entropy by an amount Q/Thot which is smaller than the
increase in entropy of the cold system, Q/Tcold resulting in a net increase
in entropy of the combined system and therefore satisfying the second law.
What this must mean is that transfer of heat Q to or from a hotter system
has a smaller effect on its entropy than transfer of the same amount of heat
to or from a colder system. When two systems are in thermal equilibrium
the entropy change when heat Q is transferred between them is zero.
Richard Feynman likened it to drying yourself with a small towel. At first
you are wetter than the towel and water is transferred from you to the
towel. However, once you have used the towel for a while it becomes so
wet that it no longer dries you—as you rub yourself with the towel as much
water transfers from you to the towel as transfers from the towel to you—
both the towel and your skin have reached the same degree of “wetness.”
Continuing to use the towel results in no further change in wetness. This is
analogous to the approach to thermal equilibrium.
Temperature is therefore related to the rate at which entropy changes
when heat is transferred to or from a system, and we can define the
thermodynamic temperature using this equation:
1 dS
T dQ
How does this relate to the statistical description of entropy in terms of
microscopic configurations? This can be understood by considering the
effect of adding 1 quantum of energy to a system that already contains N
quanta. Whilst this will always increase the number of configurations of the
system it has a greater effect on the entropy of a system with an initially
smaller number of configurations than on one with an initially larger
number. Mathematically this is a consequence of the natural logarithm in
Boltzmann’s formula.
T2 V2
3RdT T2 V2
S 2T
RVV 23 R ln T R ln V
1 1
T1 V1
Qhot Qhot
Work done by Work done on
the heat engine: W the refrigerator: W
Qcold Qcold
For a heat engine, the heat reservoirs are burnt fuel (hot) and the
environment (cold). For a refrigerator the heat reservoirs are the foodstuff
198 • FOUNDATIONS OF PHYSICS
that must be cooled down (cold) and the environment (hot). Typically, a
domestic refrigerator removes heat from food at about 4°C and dumps heat
into a room at about 20 °C although different types of refrigerator can work
in different temperature regimes. The reason you need to plug an electric
refrigerator into a mains supply is so that electricity can supply the work
that will ultimately be dumped as additional heat in the environment.
For a refrigerator to operate it must obey the second law of
thermodynamics so the entropy increase caused by the heat dumped in the
environment must be equal to or greater than the entropy decrease caused
by extracting heat from the stuff inside the refrigerator:
Qhot
Senvironment
Thot
Q
Sfood cold
Tcold
Qhot Qcold
Suniverse 0
Thot Tcold
Qhot Qcold
Thot Tcold
Qhot T
which implies that hot , so Qhot (the heat dumped in the
Qcold Tcold
environment) must be greater than Qcold (the heat extracted from the food).
The refrigerator can only operate if we supply additional energy. This is the
work W Qhot Qcold. As far as the universe is concerned a refrigerator is
a heater!
The coefficient of performance (CoP) is a measure of how effective the
refrigerator is and is the ratio Qcold/W. The higher this is, the more joules of
heat are removed from objects inside the refrigerator per joule of electrical
work done, so it is like the “efficiency” of the refrigerator. The second law
sets a limit to the theoretical thermodynamic efficiency of a heat engine
and it also sets a limit to the theoretical coefficient of performance for a
refrigerator.
Q Qcold 1 1
CoPrefrigerator cold
W Qhot Qcold Qhot Thot
1 1
Qcold Tcold
10.6.2 Heat Pumps
Heat pumps are used to extract heat from a cooler environment and dump
it into a warmer environment and so, like a refrigerator, they are really
STATISTICAL THERMODYNAMICS AND THE SECOND LAW • 199
a reversed heat pump. However, the point of the heat pump is not the
amount of heat extracted from the cold reservoir but the amount of heat
delivered to the hot reservoir. The coefficient of performance for a heat
pump is therefore defined as the ratio Qhot/W and the second law once
again sets a limit on this:
Q Qhot 1 1
CoPheat pump hot
W Qhot Qcold Q T
1 cold 1 cold
Qhot Thot
10.7 Implications of the Second Law
10.7.1 The Second Law, the Arrow of Time, and the Universe
When the second law of thermodynamics is applied to the universe it states:
Q The entropy of the universe tends to a maximum value.
This implies that the arrow of time points in the direction of increasing
entropy. The past is a low entropy state and the future is a high entropy
state.
Low High
entropy Arrow of Time entropy
past future
the arrow of time ceases to exist once the system has reached equilibrium,
so one large question remains: why did the universe start in a low entropy
state? According to our analysis the initial configuration of the universe
must have been one that can be realized in only a relatively small number
of ways (low W). This was a state of low probability in the sense that if
we considered the totality of macro-states the universe can have then the
actual state in which it began belongs to a tiny subset of these. This is rather
like the first example we considered, the ink droplet in water—the system
starts in a very special low probability, low entropy state and evolves toward
a higher probability, high entropy equilibrium. Our universe is also evolving
toward a high entropy equilibrium state. The assumption that the universe
began in a low entropy state is sometimes called the “past hypothesis.”
10.8 Exercises
Q Number of ways
11
OSCILLATIONS
11.0 Introduction
An oscillator undergoes regular periodic motion about a fixed equilibrium
position (or value). The simple model of a mass oscillating on a spring
can be adapted to explain a wide variety of physical phenomena, from
lattice vibrations in a crystalline solid to the effects of seismic waves. The
mathematical analysis developed to describe mechanical oscillators can also
be used for electrical and electromagnetic oscillations and is the starting
point for understanding all kinds of wave motion.
Support
Spring
Amplitude of
oscillation: A
Mass Equilibrium position
Connection to
oscilloscope
Ultrasonic
position Data logger
sensor
-2
-3
-4
-5
OSCILLATIONS • 205
Displacement/cm
5
4
3
2
1
0 Time/s
-1 0 1 2 3 4 5 6
-2
-3
-4
-5
v = dv/dt
15
Velocity/cms–1 Max. velocity Vmax =
ωA = 2πfA occurs at
10
zero displacement
0
0 1 2 3 4 5 6 Time/s
-5
-10
-15
50
a = dv/dtAcceleration/cms–2 Max. acceleration
40 amax = ω2A = 4π2fA
30 occurs at max.
displacement
20
10
0
-10 0 1 2 3 4 5 6 Time/s
-20
-30
-40
-50
OSCILLATIONS • 207
-2
-3
-4
–5
k 1 k
So and f
m 2 m
11.3 Mass-Spring Oscillator
An ideal mass-spring oscillator has a mass m attached to a massless spring
with spring constant k. In equilibrium the spring has an extension x0. At this
point the weight of the mass is supported by the tension in the spring.
kx0 mg
When the spring is displaced from equilibrium by an additional amount x,
the magnitude of the resultant force on the mass becomes:
k(x x0) mg kx
kx0 k(x0 + x)
Equilibrium
x
mg
mg
OSCILLATIONS • 209
l Direction of
resultant
force
θ
mg
A B
x
The free body diagram on the right shows that the resultant of the two
forces (tension and weight) acting on the bob will be mg sin along the line
shown. If we consider the horizontal motion (along AB), then the resultant
force acting horizontally is:
F mg sin cos
210 • FOUNDATIONS OF PHYSICS
1 g l
which gives f and T 2
2 l g
Notice that this is independent of the mass of the bob. This is because the
resultant force is directly proportional to the mass, so mass will cancel when
calculating accelerations:
a F/m (constant m)/m.
This is for the same reason that objects of different mass fall with the same
acceleration in the same gravitational field.
2
TE
1.5 PE
KE
1
0.5
Time/s
0
0 0.1 0.2 0.3 0.4 0.5 0.6
Notice that both KE and PE reach their maximum values twice per
oscillation; this is because they are related to the square of the sine and
cosine.
x0 x0
212 • FOUNDATIONS OF PHYSICS
2
TE
1.5
PE
1 KE
0.5
0 Position/cm
-6 -4 -2 0 2 4 6
11.5.3 Damping
Real oscillators are subject to frictional forces that oppose their motion.
The oscillator must do mechanical work against these forces, so (unless
it is driven by an external energy source) its total energy decreases with
time and so does its amplitude. The oscillator is damped. The heavier the
damping, the greater the rate at which the oscillator loses energy and the
greater the rate of decay of its amplitude. In many cases the amplitude
decays approximately exponentially. This occurs when the oscillator loses
the same fraction of its total energy on each oscillation.
OSCILLATIONS • 213
Displacement/cm
5
4
3
2
1
Time/s
0
-1 0 0.1 0.2 0.3 0.4 0.5 0.6
-2
-3
-4
response gets smaller; and when you shake the string at a high frequency,
the bob stays almost still at the center of its motion. Closer inspection shows
that it does oscillate with the driver frequency but with a tiny amplitude
and a phase lag of almost The strong response occurs when the driving
frequency is equal to the natural frequency of the oscillator and is called
resonance. This pattern of response is shown in the graph below.
Amplitude Strong
of oscillator resonance
peak
Effect of
increasing
damping
Driver
amplitude Driver
frequency, fd
fd = f 0
Phase difference
between driver and
driven
π
π/2
0 Driver frequency,
fd
11.7 Exercises
(a) it is stationary?
(b) it has maximum velocity to the right?
(c) it has zero acceleration?
(d) it has maximum acceleration to the left?
(e) it has maximum kinetic energy?
(f) it has maximum potential energy?
2. The diagram below shows a vehicle tethered between two similar
springs. When it is displaced from equilibrium and released, it
undergoes periodic oscillations about an equilibrium position. Neither
spring goes slack during these oscillations and both springs obey
Hooke’s law in compression and extension. The vehicle has a mass of
0.80 kg.
x
M
A B
Equilibrium position
216 • FOUNDATIONS OF PHYSICS
(iii) Add a second line to your graph to show the variation of potential
energy with time.
4. During an earthquake the floor of a building oscillates vertically with
an amplitude A and frequency f. Derive a formula for the frequency at
which objects in contact with the floor will just lose contact during the
earthquake.
5. An atom of mass 5 1026 kg is in a cubic lattice with all bonds between
adjacent pairs of atoms having a spring constant about 100 Nm1.
(a) Estimate the natural frequency of oscillation of the atom.
(b) If it is able to absorb radiation at this frequency, what part of the
electromagnetic spectrum does it absorb?
12
ROTATIONAL DYNAMICS
12.0 Introduction
The circle is a simple geometric shape that is used throughout science to
model cyclic processes and is often the starting point for more complex
theories (e.g., of planetary motion or electron orbits). Simple harmonic
motion can be regarded as a projection of circular motion onto a diameter,
making the concept of phase very clear. Rotating vectors, or phasors, are
powerful ways to model oscillations and waves.
12.1 Angles
12.1.1 Measuring Angles in Radians
The degree is an arbitrary division of the circle into 360 equal parts.
This is a useful measure of angle, but in physics it is often simpler to work in
a different unit, the radian. The reason for this is that the radian is defined
directly from the geometry of the circle.
Q Definition: The angle at the center of a circle subtended by an arc of
length l is equal to the ratio l/r where r is the radius of the circle.
220 • FOUNDATIONS OF PHYSICS
r l
θ = l/r
For a complete circle, l = 2πr
θ
Therefore, θcircle = 2π radians
To convert between degrees and radians,
2π radians = 360°; π radians = 180°;
π/2 radians = 90°
l
θ= (h= radius of
h
circle)
h o
sin θ =
h
o
a
cos θ =
θ h
As θ → 0 o → l and
a a → h (radius)
therefore:
sin θ → θ (in radians)
cos θ → 1
tan = sin θ → θ
cos θ
The significance of this is that, for small angles, we can replace the sine
or tangent of the angle with the angle itself (in radians). But how small
is small? Like all approximations this depends on how precise a value is
needed. The table below shows that the approximations work to better than
1% when the angle is 0.1 radian and better than 2% for 0.2 radian.
ROTATIONAL DYNAMICS • 221
v v
v
v
same constant velocity
In the top case we are describing the physics from an external non-
accelerating (inertial) frame of reference. Newton’s laws of motion apply, so
the object continues to move in a straight line in the absence of a resultant
force. There is no acceleration or resultant force until the side of the vehicle
has moved inwards and collided with the sliding object.
In the bottom case the apparent acceleration is because the observer is
inside the vehicle and does not take into account his own acceleration. In
order to explain the apparent acceleration of the object, he introduces an
imaginary outward force, the centrifugal force.
Centrifugal forces are examples of inertial forces, introduced in
order to explain observed physics from a non-inertial (accelerating) frame
of reference. Unlike the centripetal forces in an inertial reference frame,
inertial forces have no physical origin and that is why they are referred to as
imaginary. They are helpful if we need to solve physical problems inside a
rotating reference frame, but we must always bear in mind that they are an
artifact of our reference frame and do not arise from physical causes.
ROTATIONAL DYNAMICS • 225
a
v a a
a v
a
a
v v
When you swing a stone in a circle on the end of a string, the forces acting on
the stone are directed toward the center of the circle. If the string suddenly
breaks, the stone has no resultant force acting on it and flies off along a
tangent; it does not accelerate outwards because there is no centrifugal
force. It moves in a straight line at constant velocity. (Here we ignore the
effects of other external forces such as gravity.)
A v
Initial path of stone if string
breaks when stone is at point A
Note that centripetal force is not a new kind of force. It is the magnitude of
the resultant force on a body moving with constant uniform circular motion.
It arises as a result of the real physical forces that act on it. In some cases this
is simple. For example, the Moon’s orbit around the Earth is approximately
circular, and the centripetal force is provided by the gravitational attraction
toward the Earth.
4 2 mr
Fgrav mr 2
T2
226 • FOUNDATIONS OF PHYSICS
Fgrav
Moon
Earth
Fgr
av Moon
later
T is the Moon’s orbital period, r is its orbital radius, and m is its mass.
A more complex situation involves an object moving in a vertical circle
in a uniform vertical gravitational field. Examples might be a person on a fun
fair ride, a plane looping the loop, or just a stone on a string. If the motion
is uniform (constant angular velocity) the resultant force stays constant in
magnitude, but the forces that contribute to it change. The example below
shows a person standing in a capsule on a fairground ride and indicates the
forces in four different positions. The capsule is rotating at constant angular
velocity.
Only two forces act on the man. His weight mg is the same in all
positions, but the contact force R from the floor of the capsule changes
with position. The resultant force at all points is the vector sum of weight
and reaction force and must be F mr2 toward the center of the circle.
A
R ω R
D θ θ B
mg mg
C
mg
ROTATIONAL DYNAMICS • 227
Position A
Weight and contact force act in the same direction, so mr2 mg RA
R A mr2 mg
For one particular angular velocity 0, the contact force falls to zero,
g
RA 0. This is when mr 20 mg, so 0 .
r
Under these conditions the contact force between the man and the floor
is zero and he feels “weightless.” This is only an apparent weightlessness
because he is in fact free-falling at this moment and the capsule is
accelerating downwards at g. For higher values of angular velocity, there
must be a contact force from the floor. For lower values of angular velocity
he will lose contact with the floor and begin to fall downwards toward the
roof of the capsule. In practice this sets a minimum value for the practical
angular velocity in such a fairground ride.
Position C
Weight and contact force act in opposite directions, so mr2 RA mg
R A mr2 mg
The contact force is greater than his weight. He experiences this through
the reaction force from the floor pushing up on his feet. This makes him
feel heavy at the bottom, as if his weight has increased. Once again this is
only an apparent increase in weight since the gravitational forces have not
changed.
RD RB
mg mg
θ θ
Resultant = mrω2 Resultant = mrω2
RD sin θ = mg RD sin θ = mg
RD cos θ = mrω2 RD cos θ = mrω2
tan θ = g/rω2 tan θ = g/rω2
Positions B and D
In both positions the contact force has a vertical component that balances
the weight and a horizontal component that provides centripetal force.
228 • FOUNDATIONS OF PHYSICS
t = 3T/4 Time
t = T/4
t = T/2 −A
t=0 t = T/4 t = T/2 t = 3T/4 t=T
These are used in exactly the same way as the original suvat equations.
rotating about its center of mass). The total kinetic energy is equal to the
sum of translational and rotational kinetic energies.
The diagram below shows a rigid body rotating about its center of mass
(CM). It consists of N particles and the ith particle has mass mi and is at a
distance ri from the axis.
mi
ri vi
CM
RKE ke 12 m r
i i i
2
2
i1 i1
The angular velocity is the same for all particles, so:
1
iN
RKE
2
m 2 2
i i
r
i1
The term in parentheses is called the moment of inertia I of the body:
iN
I m r 2
i i
i1
2
The S.I. unit of moment of inertia is kgm .
Moment of inertia in rotational dynamics plays an analogous role to
mass (inertia) in linear mechanics. However, unlike mass, the moment of
inertia is not a fixed quantity for a body because it depends on both the
mass and its distribution about the axis of rotation.
Using moment of inertia we can extend the analogy between linear and
rotational motion:
Linear kinetic energy ½ mv2 Rotational kinetic energy ½ I2
ROTATIONAL DYNAMICS • 231
h
ω
Neglecting energy losses due to friction, the gain in total kinetic energy
must equal the loss of gravitational potential energy. If the cylinder starts
from rest, then its final kinetic energy will be (using v/r):
mgh translational KE rotational KE
½ mv2 ½ I2 ½ mv2 ½ Iv2/r2
This can be rearranged to give the linear velocity at the bottom:
2mgh
v
I
m 2
r
The larger the moment of inertia, the lower the final linear velocity. This is
because a larger fraction of the energy goes into rotation. A cylinder has a
larger moment of inertia than a sphere because more of the mass is farther
from the axis. If a cylinder and ball are rolled down the same slope side by
side, then the ball will reach the bottom first.
If the surface is completely smooth and the object slides without rolling,
then there is no rotational kinetic energy and the expression above reduces
to:
v 2 gh
This shows that a block sliding down a smooth slope will always get to the
bottom faster than any round rolling object on the same slope.
ri vi
CM
L m rli i i
2
i1 i1
The angular velocity is the same for all particles, so:
iN
L
m i ri
2
I
i1
iN
i Torque, Γ = force ×
F
perpendicular distance
m from pivot P = Fr
S.I. unit of torque is Nm
P
ri fi
CM
= f i ri m i ai ri m r i i
2
I
i1 i1 i1
This has the same form as Newton’s second law in linear mechanics:
Linear case, F ma Rotational case, I
234 • FOUNDATIONS OF PHYSICS
d
We can use to relate this to angular momentum.
dt
d(mv) d(I)
Linear case, F →
dt dt
We can now state Newton’s laws of motion in a form that applies to
rotational motion.
Q Newton’s first law for rotational motion
An object continues to rotate with constant angular momentum until acted
upon by a resultant external torque.
Q Newton’s second law for rotational motion
The resultant external torque on a system is equal to the rate of change of
angular momentum of that system.
Q Newton’s third law for rotation
When object A exerts a torque on object B, B exerts an equal but opposite
torque on A.
i1
m i ri2 , and we can
use this to derive useful expressions for the moments of inertia of a range of
standard mass distributions.
iN
I
m 2
i i mr
r 2
i1
r1 m1
r2
r3 m2
m3
For several point masses at various distances from the center of rotation,
iN
I
m i ri 2 m1 r12 m2 r2 2 m3 r32
i1
12.6.2 Moment of Inertia of a Rod
For a continuous mass distribution we need to convert the sum into an
integral. For a thin rod lying along the x-axis this can be done by considering
a short section of length x.
l
P
x
δx
236 • FOUNDATIONS OF PHYSICS
Let the total mass of the rod be m. The mass of the short section of length
m x
x at distance x from the center of rotation is m , and the moment
l
mx2 x
of inertia of this section (treated as a point mass) is I .
l
The moment of inertia of the rod about one end is, therefore,
iN xl xl
mx2 dx 1 2
2
Irod end m i ri äI
l
ml
3
i1 x0 x0
A similar approach can be used to determine the moment of inertia about
any point in the rod. For example, the moment of inertia about the center
of the rod would be found by taking x 0 at the center and then integrating
from l/2 to l/2:
iN xl xl /2
mx2 dx 1
Irod CM
m i ri 2 äI l
ml 2
12
i1 x0 xl /2
This moment of inertia is smaller than the moment of inertia about one end
because more of the mass is now closer to the axis of rotation.
You can get a qualitative “feel” for moment of inertia by taking a rod
(e.g., ruler) holding it in the center and trying to rotate it back and forth.
You will feel a resistance to rotation. Now do the same thing while holding
it near one end. The resistance to rotation has increased significantly—the
moment of inertia is larger.
Solid cylinder is
made up of an
r infinite number
of infinitely thin
shells
ROTATIONAL DYNAMICS • 237
The diagram that follows shows an end view of a solid cylinder of total mass
m and radius r. A thin shell at radius x is shown.
Thin cylindrical shell
of thickness δx and
mass δm
The moment of inertia of a cylindrical shell is I mx2, and the mass of the
2 xmx 2 xmx
shell is a fraction of the mass of the cylinder given by m .
r 2 r2
This is because the area of the shaded strip above is effectively 2xx if
treated like a long thin rectangle.
The moment of inertia of the entire cylinder is then:
xr xr
2 mx3 dx 1 2
Icylinder I r2
mr
2
x0 x0
12.6.4 Moment of Inertia of a Uniform Sphere
A uniform sphere of mass m and radius r can be considered to be made up
of an infinite number of thin discs of varying radius. The total moment of
inertia is the sum of moments of inertia of all such discs.
The moment of inertia of the sphere is:
Isphere 12 x dm
2
0
In order to integrate this we must express m in terms of d. It simplifies
things if we introduce the density of the material of the sphere at this
stage.
m r 2sin 2 x
238 • FOUNDATIONS OF PHYSICS
h θ θ h
CM CM
mg
Equilibrium
1. (a) Use Newton’s laws of motion and a suitable diagram to show that
for an object to move in uniform circular motion there must be a
resultant force acting toward the center of the circle.
(b) Explain why a centripetal force cannot do work on an object moving
in circular motion.
2. A small ball of mass m is released from a height h on a track that leads to
a looping section as shown below.
h
r
B
A
(a) Derive an expression in terms of r for the minimum height h from
which the ball can be released if it is to complete the loop without
losing contact with the track. Ignore the rotational motion of the ball
and assume friction is negligible.
(b) Draw free body diagrams to show the forces acting on the ball at
points A, B, and C.
(c) Discuss whether the resultant force on the ball is toward the center
of the circle at all points as it completes the loop.
(d) Derive an expression for the velocity of the ball at point C.
ROTATIONAL DYNAMICS • 241
D B
400 m
(a) Estimate the moment of inertia of the space station about its
rotation axis by treating the pods as point masses concentrated at
242 • FOUNDATIONS OF PHYSICS
13
WAVES
13.0 Introduction
There are many different types of waves, but the underlying physics
is common to all of them, whether they are mechanical vibrations in a
medium, like sound, or vibrations of an electromagnetic field, like light.
They all transfer energy, reflect, refract, diffract, and interfere. The wave
model is one of the most important models in physics, and in the next four
chapters, we will investigate waves in considerable detail.
The time period T for the formation of one wave is equal to the time period
of the source oscillations, so all particles in the wave oscillate with the same
frequency f as the source. During one oscillation of the source the wave
moves forward a distance equal to its wavelength, so the phase velocity v of
the wave is given by:
v f
T
We can now define some terms:
Q Traveling or progressive wave
A wave where all the particles oscillate with the same (or a decaying)
amplitude but with a progressive phase delay in direct proportion to their
distance from the source.
Q Wavelength
Shortest distance between two particles oscillating in phase in the wave.
Q Time period T
Time for one complete wave to leave the source or time for a particle to
complete one cycle of oscillation.
Q Frequency f
1
Number of waves leaving the source in 1 second, f .
T
WAVES • 245
Q Phase velocity v
Velocity at which a point of constant phase (e.g., a wave crest) travels away
from the source, v f
Q Wavefront
Line of constant phase in the wave pattern; perpendicular to rays.
Q Ray
Arrow in the direction of energy transfer; perpendicular to wave fronts.
Transverse waves
Vibration directions are perpendicular to the direction of energy transfer.
Phase velocity v
A
Direction of
oscillations
Direction of
energy transfer
−A
As the wave moves to the right, the disturbance at each position varies
vertically with the same amplitude as the source (assuming no energy
dissipation). All electromagnetic waves are transverse, as are seismic
S-waves (secondary or shear waves).
Compression Compression Compression
Rarefaction Rarefaction
Oscillation
direction
Phase
Wavelength λ velocity v
246 • FOUNDATIONS OF PHYSICS
Longitudinal waves
Vibration directions are parallel to the direction of energy transfer. This
results in regions of compression (shown as dark areas below) and rarefaction
(low density, shown as lighter areas below) in the medium through which
the wave passes.
As the wave moves to the right, the disturbance at each position also
varies horizontally, with the same amplitude as the source (assuming no
energy dissipation). Sound and ultrasound are longitudinal waves, so are
seismic P-waves (primary or pressure waves).
Some waves are a combination of longitudinal and transverse waves.
This results in particles undergoing elliptical motions as the wave passes.
Surface water waves are like this.
Diection of motion
of water wave
Longitudinal Transverse
component component
Wave disturbance
(e.g., displacement)
versus position at x
one time
–A
λ
WAVES • 247
Wave disturbance
(e.g., displacement) t
versus time at one
position
–A T
These graphs are similar to each other but represent different things. The
repeat distance in space is the wavelength, and the repeat distance in time
is the time period of the wave.
If the wave is moving in the negative x-direction, the sign in the equation
changes:
y A cos(t kx)
Derivation of phase velocity
The phase velocity is the velocity at which a point of constant phase, e.g., a
wave crest, moves along the axis. The phase of the wave is given by:
(t kx)
d dx
k kv 0 (for position of constant phase)
dt dt
2 f
v f
k 2
v f
13.1.5 Amplitude and Intensity
The intensity of a wave is the power delivered per unit area (Wm1). This
will be proportional to the energy of each oscillator in the wave. For a
harmonic wave the disturbance at each point is simple harmonic, so the
total energy of each oscillator is given by an expressions of the form:
1
E m 2 A 2
2
It follows that the intensity of a harmonic wave is directly proportional to
the square of its amplitude. This is a general and very useful relation:
I A2
Doubling amplitude increases intensity by a factor of four, etc.
13.2 Reflection
When a wave strikes a boundary it can be wholly or partially reflected. The
law of reflection states that:
Q The incident and reflected rays make equal angles to the normal to the
surface at the point of incidence, and both rays and the normal lie in the
same plane.
WAVES • 249
Normal
i r
If two plane reflectors are placed at 90° to each other, then a ray striking
either one of them will return parallel to its original path. This is used in
car reflectors and was used by the Apollo astronauts who left an array of
corner reflectors on the surface so that lasers sent from Earth would reflect
back and allow the distance between the Earth and Moon to be measured
precisely.
i1 = r1; i2 = r2 = 90 – i1
Original incident ray has turned
i1 through a total angle of: (180 – (i1 + r1)) +
r1 (180 – (i2 + r2)) = 360 – 2i1 – 2i2 = 180
i.e., it travels back parallel to the
i2 r initial ray
2
13.3 Refraction
13.3.1 Refraction at a Boundary Between Two Different Media
Wave velocity depends on the medium through which the wave is traveling.
For example, light slows down when it travels from air into glass, and surface
water waves slow down when they travel from deeper to shallower water. If
the wave strikes the boundary between two different media at an angle to
the normal, then the wave direction changes. This is called refraction.
250 • FOUNDATIONS OF PHYSICS
Medium 1 Medium 2:
λ1
λ1
λ2 λ2
Medium 2 Medium 2
Sheet of A3 paper
Ray
fixed to a drawing
box
board
θ1
θ2
Rectangular
glass block
The position of the edge of the block and the normal to this line are marked
onto the white paper, and the ray box is used to direct a single fine ray at the
WAVES • 251
point where these two lines intersect. The ray is traced using optical pins
(shown by X on the diagram) pushed into the board. The block can then be
removed and the path of the ray outside and inside the block can be drawn
using a pencil and ruler. The incident angle 1 and the refracted angle 2 are
then measured using a protractor. If this is done for incident angles in the
range 0–90° then a graph of sin 1 against sin 2 is a straight line.
sin θ1
1
sin θ2
This shows that:
sin 1 sin (incident angle)
constant
sin 2 sin(refracted angle)
This is called Snell’s Law, and the constant is called the relative refractive
index 1n2 for a ray passing from medium 1 into medium 2.
θ1
θ2
Snell’s law:
sin 1
n
sin 2 1 2
If medium 1 is the vacuum, then this constant is the absolute refractive
index for medium 2.
In practice, since the speed of light in air is almost the same as the
speed of light in a vacuum, the absolute refractive index is usually used
when medium 1 is air.
252 • FOUNDATIONS OF PHYSICS
B
θ1
Medium 1 θ1
Medium 2 A θ2 C
D
θ2
AB and DC are adjacent wave fronts so they are separated by one wavelength:
v1 2
BC 1
f u2
v
AD 2 2
f
AC is the common hypotenuse to the two right-angled triangles containing
angles 1 and 2, so we can write expressions for the sines of each angle:
v1
BC f
sin 1
AC AC
v2
AD f
sin 2
AC AC
Dividing the first equation by the second gives us Snell’s law in terms of the
wave velocities in each medium:
sin 1 v1
1n2
sin 2 v2
The index of refraction is the ratio of wave speeds in the two media on
either side of the boundary. It is obvious from this equation that if the ray
WAVES • 253
direction is reversed, it will retrace its path. In other words, the refractive
index when going from medium 2 to medium 1 is the reciprocal of the
refractive index when going from medium 1 into medium 2:
v2 1
2 n1
v1 1 n2
It is also clear that with a greater ratio of velocities, more refraction will
occur and that rays of light will bend toward the normal as they enter a
medium with a lower speed of light and away from the normal when they
enter a medium with a higher speed of light.
α
Air
P Glass
to ay β
or em
ig er
in g
al es
in pa
cid ra β
en lle
tr l
ay
TIR
θ1 < c θ1 = c θ1 > c
The condition for the critical angle is that the refracted angle must be equal
to 90°.
n1 sin 1 n2 sin 2
n1 sin c n2 sin 90 n2
n2
sin c
n1
WAVES • 255
Each time the ray reaches the boundary, its incident angle is greater than the
critical angle, so it is repeatedly totally internally reflected. Light has a very
high frequency, so it can be modulated to carry a great deal of information.
There are two main types of optical fiber: mono-mode fibers, which
effectively only allow a single path for the light (by having an extremely
narrow core of about 10 mm), and multi-mode fibers, which are much
thicker and allow multiple light paths. The disadvantage of multi-mode
fibers is that, over a long distance, different parts of the signal have traveled
significantly different distances and developed time delays. If these become
comparable to the time between ones and zeroes in the digital signal, then
the information is lost. The longer the fiber, the lower the maximum data
transfer rate, so they tend to be used over shorter distances, e.g., within a
single building. The advantage of multi-mode fibers is that they are cheaper
and can carry light of multiple wavelengths, so they are often used with
LEDs rather than lasers. Mono-mode fibers use laser sources working at a
single wavelength and can transmit high data rates over great distances (up
to thousands of kilometers).
13.3.6 Dispersion
The amount of refraction at a boundary depends on the refractive index
at that boundary. However, absolute refractive indices depend on the
wavelength of light (because the speed of light in a medium depends on
256 • FOUNDATIONS OF PHYSICS
wavelength). This means that if a polychromatic ray (i.e., one with a range
of wavelengths present, such as white light) refracts at a boundary, different
wavelengths will refract different amounts. This is called dispersion and is
familiar from the way a triangular glass prism can disperse white light into
a spectrum of colors.
For glass, the (higher frequency)shorter wavelength (blue/violet) end
of the spectrum travels more slowly than the (lower frequency)longer
wavelength end (red/orange) of the spectrum, and so has a higher refractive
index. For a certain type of crown glass, nred 1.509,while nviolet 1.521, a
small difference but one that is clearly demonstrable.
light
White Red lig
ht
Viol
et li
ght
13.4 Polarization
13.4.1 What is Polarization?
Longitudinal waves vibrate parallel to the direction in which they transfer
energy, so there is a unique vibration direction. Transverse waves, however,
vibrate at 90° to the direction in which they transfer energy, so they can
vibrate in any direction perpendicular to the direction in which the wave
is traveling. If a transverse wave is confined to oscillate only in one plane,
it is said to be plane-polarized. Transverse waves can be polarized, but
longitudinal waves cannot.
Possible
vibration Direction of energy transfer
directions
WAVES • 257
For example, if the only vibration direction is vertical, then the wave is
vertically plane-polarized.
Vertical
vibrations
Vertical
vibrations Direction of
energy transfer
In the diagram below, we are looking in the direction of wave travel with
the wave moving away from us. The diagram shows a polarizing filter that
will transmit vertically plane polarized light. The light incident on the filter
is plane-polarized at an angle to the vertical.
Transmitted
A cos θ
θ A
Absorbed
A sin θ
258 • FOUNDATIONS OF PHYSICS
0.6
Mean value:
0.4
0.2 Angle of
polarizing filter
0
0 30 60 90 120 150 180 210 240 270 300 330 360
Direction
of wave
Sample rotates
Light source: polarization
unpolarized direction by θ Filter must be
rotated by θ to
find maximum
Unpolarized Polarized
θB θB θB
θ2
Partially
polarized
We can derive an expression for the Brewster angle B by using Snell’s law
and the law of reflection. Let the refractive index of the first and second
media be n1 and n2, respectively.
n1 sin B n2 sin 2
But 2 180 (90 B) 90 B
n1 sin B n2 sin (90 B) n2 cos B
n2
tan B
n1
This is Brewster’s law.
Photographers use polarizing filters to enhance contrast, e.g., to darken
the sky compared to the clouds, or to reduce glare from reflective surfaces
(e.g., water or windows). Polarizing sunglasses are also used to reduce
reflected glare by absorbing one component of polarization.
13.5 Exercises
(c) Her friend is also treading water but is 15 m further out to sea. What
is the phase difference between the oscillations of the two swimmers?
Assume that the waves are traveling directly toward the shore.
3. The diagram below shows a pin placed symmetrically between two
plane mirrors that are perpendicular to each other.
Pin
An observer looking into the mirrors can see three images of the pin.
Draw a careful ray diagram to locate the positions of these images.
4. Here are some absolute refractive indices for different media.
Vacuum n 1 (exactly)
Air n 1.000293 (at s.t.p.) .... usually taken to be 1.00
Diamond n 2.42
Glass n 1.50
Water n 1.33
Speed of light in a vacuum c 3.0 108 ms1
The table below refers to a ray of light traveling from medium 1 to
medium 2 as shown in the diagram.
θ1
θ2
Medium 1 Medium 2
(a) Complete the table using values from the list above:
Medium 1 Medium 2 v1 v2 1 2
Glass 3.0 10 ms
8 1
40°
Glass Air 40°
Water Diamond 20°
2.0 108 ms1 55° 70°
262 • FOUNDATIONS OF PHYSICS
(b) Explain what is meant by total internal reflection and state the
conditions under which it occurs.
(c) Calculate the critical angle for an interface between:
(i) water and air
(ii) glass and air
(iii) diamond and air
(d) Suggest why a real diamond sparkles more than a fake glass “diamond.”
5. The refractive index n of glass can be determined by measuring the
minimum angle of deviation D when light passes through a triangular
prism of apex angle A:
A
D
6. Two polarizing filters are placed at 90° to each other. A third polarizing
filter is placed between them and slowly rotated. Unpolarized light is
directed into the system of three filters.
(a) Explain why the intensity of light passing through the system of
three filters reaches a maximum when the third filter is at 45° to the
direction of the first polarizing filter.
(b) Calculate the fraction of the incident intensity that passes through
the system when the middle polarizing filter is at 30° to the first
polarizing filter.
CHAPTER
14
LIGHT
14.1.2 Electromagnetism
In the 19th century, Michael Faraday and James Clerk Maxwell tried to
make sense of all the different electromagnetic phenomena that had been
discovered up that time. This included:
Q how charges exert forces on one another (see Section 17.3.1)
Q how electric currents create magnetic fields (see Section 20.3.1)
Q how magnetic fields exert forces on moving charges and electric
currents (see Section 20.2.2)
264 • FOUNDATIONS OF PHYSICS
Q how changing magnetic fields can induce voltages (see Section 21.1.1)
In order to do this Faraday introduced the idea of an electromagnetic
field. Electromagnetic fields are created by electric charges and exert
forces on electric charges. For example, a positive charge at one point in
space creates an electric field through all of space, and another charge some
distance from the first experiences a force from the field. The beauty of
this model is that the field itself has properties, and charges do not interact
by “instantaneous action-at-a-distance” but by interacting with the local
electromagnetic field.
Maxwell discovered a set of four equations—the Maxwell equations
that describe how fields are created by, and interact with, charges and how
the field in one place affects the field nearby. The equations showed that
if the field is disturbed in one place, e.g., because a charge is vibrating,
that disturbance spreads outwards from the source at a constant speed.
The speed could be calculated from the equations and is equal to the
measured speed of light. This suggested very strongly that light must be an
electromagnetic disturbance. It also showed that visible light is just part of
a much wider spectrum of electromagnetic waves, all of which travel at the
same constant speed in a vacuum.
Micro
visible
Gamma X-rayss UV IR
wave Radio waves
rays
10–1 10–14 10–1 10–1 10–8 10–6 10–4 10–2 100 102 104 106 108
Wavelength/m
The first terrestrial measurement of the speed of light was made by Armand
Fizeau in 1849. He employed an ingenious method using a rapidly spinning
toothed wheel to chop light into short pulses. The pulses then hit a distant
mirror and reflected back to the wheel. As the speed of the wheel increased,
the returning light hit the next tooth and was blocked. This meant that the
time taken for the light to travel to the mirror and back was equal to the
time taken for the wheel to rotate by the angle between a tooth and the next
gap. He measured the rate of rotation at which this occurred and used this
to find the time of flight t of the light pulse. The speed of light was then
calculated from the distance to the mirror, d, and back divided by the time
of flight:
c 2d/t
This method was modified and improved by Jean Leon Foucault and then by
Albert Michelson. They replaced the toothed wheel with a rotating mirror,
but the principle of the method was similar. Foucault even measured the
speed of light in water by placing a tube of water in the light path between
LIGHT • 267
To oscilloscope to
record emitted pulse
Mirror
LED base producing short
pulses at high frequency
Detector
d
To oscilloscope to
record detected pulse
the rotating mirror and the fixed distant mirror. His results showed that
light travels more slowly in water than in air, a result that reinforced the
wave model of light because it was consistent with the wave explanation
of refraction at an air–water boundary (the particle model predicted that
the particles of light would speed up as they entered the denser medium).
Michelson spent most of his life-refining methods for measuring the speed
of light. He also carried out the famous Michelson-Morley experiment,
which seemed to show that the speed of light is independent of the motion
of the observer, an effect that was eventually explained by Einstein’s special
theory of relativity (see Chapter 24).
What makes it difficult to measure the speed of light in a laboratory
is the fact that it is so fast. This means that we have to measure extremely
short time intervals. Nowadays high-speed electronic devices can do this
and the measurement can be carried out successfully over a distance of
just a few meters. The principle is very similar to that used by Fresnel,
Foucault, and Michelson, wherein short pulses of light are generated by an
LED laser and a fast oscilloscope is used to detect the emission of the pulse
and the detection of its reflection from a mirror placed a few meters away.
t t
One slight complication with this method is that the time it takes for the
electronic processing of the signals is comparable to the time of flight of the
light pulses, so the time t1 measured on the oscilloscope must be corrected
for this. One way to do this is to place the emitter and detector so that they
are facing each other and the light path is virtually zero. There will still be
two separate peaks on the screen, and the time delay t2 is now entirely due
to the equipment. The actual time of flight of the light pulse is therefore
(t1 t2) and the speed of light is 2d/(t1 t2). Typical oscilloscope traces are
shown in the diagram.
n r r
LIGHT • 269
F
Principal axis
1
P=
f
1
P
f
Concave, or diverging, lenses are shaped in such a way that light traveling
parallel to their principal axis diverges from a virtual focal point before the
lens.
F
Principal axis
In a real lens, light will refract as it enters the lens and refract again as it
leaves on the opposite side. A thin lens is one where both refractions can
be assumed to take place at the same point, on a line perpendicular to
the principal axis and passing through the optical center of the lens. Ray
diagrams involving thin lenses are often drawn with the lens itself shown as
a single vertical line passing through the optical center and a small symbol
showing the type of lens.
Thin lens
Thick
Thin lens
In ray optics the direction of the ray is reversible, so a ray passing through
the focal point before it reaches the lens will then travel parallel to the
principal axis on the far side of the lens.
Convex lens—predictable ray 2: A ray passing through the optical center
of the lens does not change direction.
Thin lens
Thin lens
The direction of the ray is reversible, so a ray heading toward the focal point
on the far side of the lens will then travel parallel to the principal axis after
it passes through the lens.
272 • FOUNDATIONS OF PHYSICS
Thin lens
14.2.3 Images
An image is a point-by-point representation of an object. A real image is
formed when the rays creating the image pass through the image points, for
example, the image formed on the retina of the eye, or the image projected
onto a cinema screen. A virtual image is formed when the rays creating
the image diverge from points on the image but do not pass through those
points, for example, the image in a plane mirror, or the image of the bottom
of a swimming pool seen through the surface. Since the rays responsible for
the image do not pass through it, a virtual image cannot be formed onto
a screen.
The size and position of an image can be determined by drawing two
predictable rays from a single point on an object and tracing their paths.
The place where they intersect is the position where the image point is
formed. Objects are often shown as vertical arrows and the two rays to be
traced are from the tip of the arrow.
14.2.4 Image Formation with a Convex Lens
The distance from the object to the optical center of the lens is called the
object distance, u. The distance from the optical center of the lens to the
image is called the image distance, v.
u v
ho
F Image
Object F
hi
Thin lens
LIGHT • 273
F
Object F
To image
Thin lens at infinity
Image u< f
F Object
Diverging
Thin lens rays
F
Object Image
Thin lens
The image is erect (same way up as the object), diminished, and virtual.
F F
Thin lens
LIGHT • 275
For an extended object at great distance, e.g., the Sun, there will be an
angle between the rays from a point on one side of the Sun and the rays
from a point on the opposite side of the Sun. They will be focused in the
focal plane above and below the focal point of the lens, and an image of the
Sun will be formed between them.
Rays from a Focal plane
single point
on top of Sun
Image
of Sun
Rays from a
single point
on bottom of Sun Thin lens
This creates an image of the Sun that can be projected onto a screen—e.g.,
to look for sunspots.
The power of a lens depends on its shape and on the material from
which it is made. The higher the refractive index, the more the light is
refracted and the shorter the focal length of the lens. However, the refractive
index depends on wavelength, so objects that have a range of colors are not
focused sharply at any one point. This is called chromatic aberration.
Achromatic lenses are composite lenses that use a range of different lenses
to compensate for chromatic aberration.
14.2.7 Lens Equation
The equation relating u, v, and f is called the lens equation. It can be derived
from the ray diagram for a convex lens, but, with a suitable sign convention,
it also applies to concave lenses. Look at the two shaded triangles in the ray
diagram below.
u v
ho
F Image
Object F
hi
Thin lens
276 • FOUNDATIONS OF PHYSICS
u f v
ho
F Image
Object F
hi
Thin lens
Therefore,
hi (v f ) v
ho f u
which can be rearranged to give the lens equation:
1 1 1
f u v
This can be used for convex and concave lenses using the REAL IS
POSITIVE sign convention:
Q f is positive for a convex lens (real focus) and negative for a concave lens
(virtual focus).
Q u is positive for a real object and negative for a virtual object.
Q v is positive for a real image and negative for a virtual image.
u v
Object Image
Normal
Normal
Concave mirrors are used to focus microwaves and radio waves so that a
detector placed at or near the focal point receives a strong signal (e.g., for
a radio telescope or a satellite TV dish). The way to think about curved
mirrors is to realize that the ray diagrams for a concave mirror are like those
for a convex lens, but with a reflection, and the ray diagrams for a convex
mirror are like those for a concave lens with a reflection.
F F
If a convex mirror is actually part of a sphere, rays nearer the edge of the
mirror focus to a different position than rays near its center. This results in
an unfocused image and is an effect called spherical aberration. To avoid
this the correct shape for a concave mirror is a paraboloid.
the water. The reason for this is that when light refracts at the water surface
it creates a virtual image of the floor of the pool that is closer to the surface.
If you look directly down into the pool, the ratio of the real depth to the
apparent depth is equal to the refractive index of water.
Air x
Water
β hreal
happ
α
Image point
β
Object point
sin
nwater
sin
From the diagram,
x x
sin and sin
happt hreal
Therefore,
hreal
nwater
happt
14.3 Optical Instruments
The behavior of an optical instrument (e.g., its magnifying power and the
nature and position of the image it produces) can be determined by ray
tracing.
parallel rays. The telescope is used to increase the visual angle, that is, the
angle subtended by the image at the eye. It does this using two convex
lenses. The first, the objective lens, is a low-power convex lens that creates a
real, inverted image of the distant object on the objective focal plane inside
the barrel of the telescope. The second, the eyepiece lens, is a higher-
power lens used as a magnifying glass to magnify the intermediate image.
In normal adjustment, the telescope accepts parallel rays and parallel rays
leave the eyepiece; therefore, the distance from the first image to the
eyepiece lens must equal the eyepiece focal length. This makes the total
distance between the two lenses equal to the sum of focal lengths of the
lenses.
Rays from
point at top of
distant object f0 fe
ay
ction r
stru
Con
α F β
Intermediate
image
In the ray diagram above, the rays from the top of the object enter the
objective at an angle to the principal axis. Rays from the bottom of the
object are assumed to enter along the principal axis but are not shown in
the diagram. The angle is therefore equal to the angle subtended by the
object at the objective, and is the angle subtended by the image at the eye.
Q is the angle subtended by the distant object at the objective lens.
Q is the visual angle—the angle subtended at the eye by the rays forming
image.
The angular magnification M of the telescope is the ratio /.
Using similar triangles and alternate angles, it is easy to see that
tan fo
.
tan fe
However, the angles involved are small, so we can use the approximation
tan (in radians) to obtain an expression for angular magnification:
f
M o
fe
280 • FOUNDATIONS OF PHYSICS
The image is, as can be seen in the diagram, inverted. For astronomical
work, this is not an issue. However, a terrestrial refracting telescope has
a third convex lens in between the intermediate image and the eyepiece,
and this lens simply inverts the intermediate image so that the final image
is erect.
Secondary mirror
Eyepiece: convex
lens of focal length
LIGHT • 281
Radio telescopes are also reflecting telescopes, using a large concave dish to
reflect radio waves to a detector. Radio waves have much longer wavelengths
than light, so radio telescopes must have much larger diameters than optical
telescopes (the resolution of a telescope depends on its diameter—see
Section 15.3.4). The world’s largest radio telescope is the Tianyan (Heavenly
Eye) telescope in Guizhou Province, China. Its objective reflector has a
diameter of 500 m and is built into a natural depression in the landscape.
The signal from a radio telescope is detected by placing a receiver close to
the focal point of the objective mirror.
Eye
Inverted
magnified
virtual image
Frequency heard by
stationary observer
Siren
Frequency of siren
approaching
at rest
f0
Siren receding
Time
Doppler shifts
v
Q Source and observer approaching with relative velocity v: O O0
c
v
Q Source and observer receding with relative velocity v: O O0
c
LIGHT • 283
all red-shifted. The implication is that distant galaxies are all moving away
from us and from each other. In addition to this, Hubble showed that the
red-shift is directly proportional to the distance of the galaxy, so that very
distant galaxies are moving away from us more rapidly than closer galaxies.
Since redshift is directly proportional to recession velocity, this discovery
showed that recession velocity v is directly proportional to distance d. This
is called Hubble’s law:
v H0 d
where H0 is the Hubble constant.
This discovery led to the idea of the expanding universe (see Section
28.3). This was eventually explained by Einstein’s general theory of relativity,
which reinterpreted the galactic redshifts. According to Einstein’s model,
the reason for the red shifts is not that galaxies are flying apart in pre-existing
space but that space itself is expanding so that the separation between the
galaxies increases. Either way we can use the redshift to calculate the speed
at which any particular galaxy is receding from us.
In 1964, the American radio astronomers Arno Penzias and Robert
Wilson discovered that the universe is bathed in low-intensity microwaves
with an almost perfect black body radiation spectrum. These are believed
to be an “echo” of the Big Bang. According to the Big Bang theory the early
universe was filled with high-energy short-wavelength gamma radiation,
but in the past 13.7 billion years of expansion, this has been hugely red-
shifted and is now present as the cosmic microwave background radiation.
14.5 Exercises
1. The distance to the Moon has been measured by reflecting light from
arrays of mirrors left on the Moon’s surface by Apollo astronauts. The
mirrors are arranged as corner reflectors that reflect incoming beams
back on themselves. However, the spreading of the beam means that
only about 1 in 1017 photons leaving Earth are detected in the reflection!
The uncertainty in the measurements of distance are of the order of
1 cm.
The speed of light is 299,792,458 ms1
The distance to the Moon is (on average) 385,000.6 km.
(a) Calculate the time taken for light to make the return trip to the moon.
(b) How precisely must this time be measured in order to achieve a
distance accuracy of ±1 cm?
LIGHT • 285
15
SUPERPOSITION EFFECTS
Screen or
detection plane
Maximum
Minimum
Source 1 ∗
SUPERPOSITION Maximum
Source 2 ∗ Minimum
Maximum
In order for stable clear interference effects to be created, the two sources
must be coherent.
Connection
Signal S to oscilloscope
generator
S
B
SUPERPOSITION EFFECTS • 289
In the experiment above, the microphone is moved along the line AB and it
detects a sequence of maxima and minima. When the microphone is in any
position, the contributions from each speaker can be displayed separately
by switching each one off in turn. The resultant disturbance depends on
the phase difference between the waves as they reach the microphone. This
depends on the path difference x S2M S1M. A path difference of one
whole wavelength corresponds to a phase difference of 2 radians, so the
relationship between path difference and phase difference is:
2 x
For a maximum, the waves must reach the microphone in phase; this occurs
when the path difference is a whole number of wavelengths. For a minimum,
they must arrive in antiphase (an odd multiple of phase difference); this
happens when the path difference is an odd number of half wavelengths.
There will be a maximum at the center of the pattern because S2M
S1M here, so the path difference and phase difference are both zero.
Minima will occur when x /2, 3/2, 5/2,etc., and these minima will
be positioned symmetrically about the central maximum. In between the
minima there will be further maxima when x , 2, 3,etc. The intensity
between maxima will vary continuously from maximum to zero and then
back to a maximum. The path difference, phase difference, and effects at
maxima and minima are tabulated below.
Path difference Phase Resultant
difference/rad intensity
0 0 Central maximum
½ Minimum
2 Maximum
3
/2 3 Minimum
2 4 Maximum
5
/2 5 Minimum
LASER SUPERPOSITION
Single
slit
Interference
pattern
Double
slit Screen
The diagram above has exaggerated the angle of spread from each slit.
In practice the angles are small, and this allows us to use the small-angle
approximation when deriving the relationship between the wavelength of
the light and the structure of the pattern. It is also important to realize
that the slit separation is very small compared to the distance between the
double slits and the screen.
Here is an image of an actual double slit pattern formed using a green
laser pen.
15.1.3 Using the Double Slit Experiment to Find the Wavelength of Light
Young’s double slit experiment was used to measure the wavelength of
visible light. We can derive an equation that relates the wavelength of light
to three parameters: the separation of the double slits s, the distance from
the double slits to the screen d, and the separation of adjacent maxima in
the interference pattern y (fringe separation).
P
First max.
y
∗
S1
s O θ max.
∗
S2 T
d
Screen or
detection
plane
P is the position of the first maximum above the central maximum. A line
drawn from O, half-way between the two slits, to P makes an angle with
the central axis. If a line is now drawn from S1 perpendicular to OP then
distances S1T and TP are equal. The distance S2T is therefore equal to the
path difference, and this must be equal to one wavelength.
y ST
sin 2
d s s
Rearranging,
sy
d
The wavelength of light can be determined by measuring the slit separation,
the fringe separation, and the distance to the screen.
where is the phase difference between the two oscillations. This will
depend on the path difference x between waves from the two sources.
This can be simplified using a well-known trigonometric identity:
R T R T
sin R sin T 2 sin cos
2 2
to give:
Y 2 A sin t cos
2 2
This is most easily interpreted by grouping it differently so that there is a
phase-dependent amplitude multiplied by a sinusoidal oscillation:
Y 2 A cos sin t
2 2
The term in brackets represents the amplitude. When /2 0, 2, 4,
etc., the cosine term will be 1, representing a maximum in the interference
pattern and the amplitude will be 2A. When /2 0, , 3,etc., it will be
0,representing a minimum.
The graphs below show the resultant oscillations when two waves, each
of amplitude 2 units, combine with various phase differences. The two
broken lines represent oscillations from waves 1 and 2,while the solid line
is the sum of these.
6
4
2 Phase difference = 0
3
2
Phase difference = π
1
Lines for waves 1 and 2
0
Y
are in antiphase.
–1
Resultant amplitude = 0
–2
–3
Time
SUPERPOSITION EFFECTS • 293
2 2√2
2 π/2 π/4
Phasors are particularly helpful for visualizing how two or more waves will
superpose.
Second-order maximum
First-order maximum
Central (zeroth-
LASER
order) maximum
First-order maximum
Second-order maximum
Diffraction
maxima (orders
of diffraction)
Diffraction
Screen grating
Laser pen
consider parallel rays traveling at an angle to the normal to the grating and
superposing at a point P on the screen. The first diagram shows the large-
scale situation and the second diagram is a highly magnified picture of some
of the rays leaving slits in the grating.
P
O θ
Laser
Central (zeroth-
L order) maximum
Diffraction grating with
N lines per meter and
slit separation d
Often the grating is described by the number of lines per meter(or per
millimeter) N, rather than the slit separation d. These are related by d 1/N.
The path difference between the ray from S1 and the ray from S2 is
S2Q d sin .
Rays superpose at
point P on screen
S1
d
θ
Q
S2
S3
296 • FOUNDATIONS OF PHYSICS
It is clear from the diagram that the path difference between the ray
from S1 and the ray from S3 is just double this, and that the path difference
to a slit m times further away is m times greater. It follows that if the rays
leaving adjacent slits are in phase, then ALL rays across the entire grating
will be in phase at that angle and an intense maximum will be created.
These maxima are called orders of diffraction.
For rays from adjacent slits to be in phase, the path difference between
them must be an integer number of wavelengths. The condition for a
maximum is, therefore,
n
sin or n d sin
d
where n is the order of diffraction (equal to the number of wavelengths
path difference between rays from adjacent slits).
This is the diffraction grating equation. If we know d and can measure
, we can calculate .
Intensity of maxima
Intensity I is proportional to amplitude-squared (I µ A2), so in the double
slit experiment, the maxima will have double amplitude and four times the
intensity of the light from a single slit. When we use a diffraction grating,
N slits contribute to each maximum, so the amplitude is N times greater
and the intensity is N2 times greater. This makes the maxima much more
intense than in the double slit experiment.
Sharpness of maxima
In the double slit experiment the intensity varies gradually between one
maximum and the next minimum so that the maxima are not sharp. This is
because the path difference between adjacent slits varies slowly as we move
away from a maximum position. This is also true for adjacent slits in the
diffraction grating, but the resultant effect on the screen is the sum of waves
from slits all the way across the grating, and even a small displacement from
the maximum position will result in a large change in path difference from a
distant slit. The result is that the sum of rays across the grating falls to zero
very rapidly as we move away from each maximum. This is best illustrated
using phasors. The diagrams below compare the effect of moving the same
short distance from a maximum for both the double slit and for a grating
with just 8 slits (in practice gratings have many more than this).
As the angle moves away from a diffraction grating maximum, the
phasors from slits across the width of the grating curl up more and more so
SUPERPOSITION EFFECTS • 297
that the maximum itself is sharp and has a series of closely packed secondary
maxima on either side of it. The more slits, the sharper the maximum. This
increases the precision with which wavelengths can be measured.
2 phasors have a
+ = small phase
difference so
resultant is large
Double slit
+ + + + + + + =
Grating
Number of orders
The maximum possible path difference between rays leaving adjacent slits
occurs when the rays leave parallel to the grating surface ( 90°) and is
then equal to the slit separation d. The maximum value of n is therefore
d/. However, n can only be an integer, so the maximum number of orders
must be the largest integer less than this; e.g., if d/ 5.7, then 5 orders of
diffraction could be formed either side of the central maximum. In practice
these might not all be visible. The intensity of light diffracted at large angles
might be too low for them to be seen, or they might correspond to missing
orders where there is a minimum of the single slit diffraction pattern (see
Section 15.3.1).
15.2.2 Spectroscopy
Spectroscopy is the analysis of electromagnetic radiation to identify
the wavelengths present in a source. This is one of the most important
298 • FOUNDATIONS OF PHYSICS
Continuous
spectrum
Line emission
spectrum
Line absorption
spectrum
When atoms or molecules are excited to higher energy states, the electrons
can then return to lower energy states by making quantum jumps and
emitting photons. The wavelength of the emitted photon is related to the
energy change by the equation hc/E (see Section 27.2.1), so the greater
the energy jump, the shorter the wavelength. For isolated atoms the energy
levels are very distinct, so there is a discrete set of emitted wavelengths
forming a line emission spectrum. The spectrum of each atom is unique,
so an analysis of its spectrum can be used to identify the types of atom
present in the source (e.g., a distant star). Atoms in molecules interact,
producing a more complex spectrum that includes bands corresponding
to small allowed ranges of energy. The atoms in solid materials are
packed closely together, and electrons occupy wide energy bands so the
characteristic emission from a hot solid is a continuous spectrum. When
electromagnetic radiation is absorbed by atoms or molecules in lower
energy states, photons whose energies correspond to allow quantum jumps
can be absorbed and electrons are excited to higher energy states. This
removes certain wavelengths from the radiation so that the transmitted
radiation will contain dark lines or bands called an absorption spectrum.
The wavelengths missing in the absorption spectrum are the same ones
that would be present in the emission spectrum from the same element or
compound.
SUPERPOSITION EFFECTS • 299
θ1 Central
Source∗ θ1 maximum
Diffraction
grating with slit First
separation d maximum
The angle 1 can be found by measuring x and L with a ruler and then using
tan 1 x/L.
In practice it is best to use an average value for 1 by measuring the first
order on both sides of the central maximum. This helps to reduce errors
due to alignment. The wavelength is then calculated from:
d sin 1
If several orders are visible, each one can be used and wavelength can be
calculated from:
d sin n
n
Another approach is to plot sin n against n. The gradient of this graph is
/d.
15.2.3 Spectrometers
For more precise spectroscopy a specialized instrument called a spectrometer
must be used. A traditional spectrometer consists of a collimating tube,
which ensures that rays from the source arrive along a normal to the grating,
and a telescope that is used to detect the orders of diffraction. These are all
mounted on a rotating base so that the angle between the normal and each
order can be measured from a Vernier scale engraved on the base.
300 • FOUNDATIONS OF PHYSICS
Diffraction grating
Collimator
Source θ
∗
The eyepiece of the telescope has a built-in cross-hair so that the position
of the image can be found precisely. The observer sees a fine vertical line
in each wavelength at each order of diffraction. For greatest precision, the
collimator slit must be adjusted so that it is very narrow.
Setting up a spectrometer requires great care to ensure that the rays
leaving the collimator are parallel and the diffraction grating is in the plane
perpendicular to these rays.
Digital spectrometers
Digital spectrometers are used to display spectra and to measure
wavelengths but are rarely able to provide the level of precision achievable
with a traditional setup. Most simple digital spectrometers use an optical
fiber to direct light into the device where it then falls onto a reflection
grating. This consists of a large number of parallel reflecting lines (like
on the surface of a compact disc) and produces orders of diffraction by
reflection. The light is detected by a CCD detector and the position on the
detector corresponds to wavelength, so this can then be recorded digitally
and used to generate a graph of intensity against wavelength.
The intensity of the diffracted waves varies with angle and usually has
maxima and minima.
Broad central
Laser
maximum
Narrower
Single slit of width w secondary
maxima
Position on screen
302 • FOUNDATIONS OF PHYSICS
The pattern for a thin wire of diameter d is identical to that for a slit of
width w when w d.
A
D
w B
E
Rays from all points across the slit reach the same point on the screen and
add to zero. Consider rays from A and B. If these two rays are out of
phase, the pair will add to zero. Now consider the two rays a small distance
further down the slit from A and a small distance further down the slit from
B. These will have the same small phase difference from A and from B but
will also be out of phase with each other and so will also add to zero. In
SUPERPOSITION EFFECTS • 303
fact, continuing this argument, all the rays will cancel in pairs across the slit
and the resultant at the screen will be zero. The condition when this first
occurs is when the ray at B is out of phase with the ray at A and the path
difference BE is /2. Consider triangle BCE when this occurs:
B
E
BE/BC = sin θ
The condition for the nth minimum is when rays can cancel in pairs n times
across the width of the slit. The path difference between the rays at each
edge of the slit must therefore be n so that:
n
sin n
w
This has a similar form to the equation for the maxima of a diffraction
grating—don’t forget that this gives the positions of minima!
If the pattern subtends a small angle at the slit, then the minima are
equally spaced and the central maximum, which goes from the minima
on either side, is exactly double the width of the secondary maxima. This
contrasts with the double slit pattern where the maxima have equal widths.
Narrower
secondary
maxima
Broad central
Laser maximum
Narrower
Circular hole of
secondary
diameter D
maxima
Diffraction patterns from slits, wires, holes, and circular objects can be used
to measure the size of the diffracting object. There is an inverse relationship
between the size of the object and the size of the pattern, so the smaller the
object, the larger the angle at which the first minimum occurs. In practice
the wavelength of the source needs to be known and the experiment consists
of measuring the angle at which the first minimum occurs.
Object 1
∗
θ
∗
Object 2 Objective
diameter D
Rayleigh criterion:
1.22
not resolved
D
1.22
limit of resolution
D
1.22
resolved
D
In practice diffraction is only one factor that limits resolution, so these
diffraction limits are rarely achieved. Other factors are aberration,
atmospheric effects, sensitivity of the sensors, etc.
It is interesting to compare the limit of resolution of the human eye
with that of a large optical and radio telescope. We will take 500 nm as
a representative visible wavelength and 20 cm as a representative radio
wavelength.
Human eye: pupil diameter 5.0 mm, wavelength 500 nm, limit of resolution
1.2 104 rad
Large optical telescope: objective diameter 10 m, wavelength 500 nm,
limit of resolution 6.1 108 rad
Large radio telescope: objective diameter 500 m, wavelength 20 cm,
limit of resolution 4.9 104 rad
The resolving power of the large optical telescope is, as expected, much
greater than that of the unaided human eye, but it might be surprising to
realize that the world’s largest radio telescope “sees” the universe in less
detail than our eyes, albeit seeing different features.
Signal Vibrator
generator
The ends of the string are effectively fixed. These are the boundary
conditions, so there must be nodes at each end. Standing waves can
then be formed at frequencies that “fit” the length of the string—in other
words, patterns that have nodes at the ends. The lowest frequency, longest
wavelength wave that sets up a standing wave is called the fundamental.
The sequence of different standing waves are harmonics. Since the
speed v of transverse waves on a string is determined only by the tension
and mass per unit length of the string, it is the same for all harmonics, and
the frequencies form a simple sequence. The diagrams that follow show the
first few harmonics for the string above.
It is clear that the harmonic frequencies are all integer multiples of the
fundamental frequency f1. The frequency of the nth harmonic will be nf1.
A similar series is obtained if the boundary conditions have an antinode at
both ends (e.g., standing sound waves in a tube open at both ends).
If the boundary conditions are different at each end, so that there is a
node at one end and an antinode at the other, then the sequence is f1, 3f1,
5f1, etc., all the odd multiples of the fundamental frequency (e.g., standing
waves in a tube open at one end and closed at the other).
SUPERPOSITION EFFECTS • 307
Antinode
Node Node
L = λ1/2
Fundamental (first harmonic): f1 = v/λ1 = v/2L
Antinode Antinode
15.5 Exercises
A 20 cm
D
26 cm
4.0 m
Speakers A and B are 26 cm apart along a line parallel to CD. They both
emit sound waves of a single frequency and the same amplitude a, and
they are in phase with one another. D is equidistant from A and B. When
a microphone is placed at D it records a sound of maximum intensity.
When it is moved along the line DC it gradually fades to become a
minimum at C; immediately beyond C, the intensity increases.
(a) State the phase difference between the waves arriving at D.
(b) State the phase difference between the waves arriving at C.
(c) State the path difference (in wavelengths) between waves reaching C
from sources A and B.
(d) State the amplitude and intensity of the resultant sound at D (assume
I k2 where k is a constant).
(e) State the amplitude and intensity of the resultant sound at C (assume
I k2 where k is a constant).
(f) What would happen to the amplitude and intensity of the sound at D
if one of the speakers was switched off?
(g) Use the dimensions on the figure to work out the wavelength and
frequency of the sound. The speed of sound is 340 ms1.
(h)How would CD change if the wavelength of the sound was doubled?
(i) How would the distance CD change if the separation of the speakers
was doubled?
(j) Explain why the intensity of sound increases beyond D.
310 • FOUNDATIONS OF PHYSICS
Q how you will calculate the wavelength from the data you collect
Q how you will maximize accuracy and precision
T
10. The speed of transverse waves on a string is given by v , where T
is the tension in the string (N) and is the mass per unit length of the
string (kgm1).
A steel wire of diameter 0.12 mm is held at a tension of 26 N between
two fixed points 0.75 m apart. The density of steel is 7800 kgm−3.
(a) Calculate the mass per unit length of the steel string.
(b) Calculate the speed of transverse waves in the steel string.
(c) What is the frequency of the fundamental?
(d) List the frequencies of the first three harmonics.
(e) The temperature of the wire increases. State and explain how this will
affect the frequencies of sound emitted by the vibrating wire.
CHAPTER
16
SOUND
Oscillation
direction
Wavelength λ
The sound wave itself can be described either by the varying particle
displacements at each point or by the varying pressure at each point.
The speed of sound in an ideal gas is given by:
RT
v
M
where M is the molar mass of the gas and is the adiabatic gas constant.
The speed of sound in a solid material is approximately given by:
E
v
where E is the Young modulus and is the density of the solid.
The speed of sound in a liquid is given by a similar equation, but the
Young modulus is replaced by the bulk modulus of the liquid, a quantity
that measures the compressibility of the liquid.
Typical values for the speed of sound are:
Air, at atmospheric pressure and 20°C: v 343 ms1 (this falls to about
300 ms1 at the altitude of a commercial jet)
Water: v 1482 ms1
Steel: v 6000 ms1 (varies with type of steel)
Vertical cylinder
L
containing air
Second harmonic:
L + e = 3λ/4
Third harmonic:
L + e = 5λ/4
The particles move toward the displacement node and increase the
pressure and then move away from the displacement node and decrease
the pressure. The variation of pressure has a large amplitude here, so it is a
pressure antinode.
Method 1
Mic. Mic.
Sig
1
gen.
To To
oscilloscope oscilloscope
can also be measured from the oscilloscope (using f 1/T), and the speed of
sound is calculated from v f.
Method 2
Standing wave pattern
Sig A N A N A
gen.
To
oscilloscope
Solid reflector
16.5 Ultrasound
Ultrasound has frequencies >20 kHz, i.e., above the highest sound frequency
audible to humans. Ultrasound scanning is used for medical imaging (e.g.,
in prenatal scans). Ultrasound pulses from a transmitter on the surface of
the patient’s skin partially reflect at each boundary inside the patient. The
times of the returning pulses can be used to determine the depth of the
boundary and to map out structure. This is done automatically so that the
ultrasound scanner connects to a computer that displays a digital image of
the organ or foetus being examined.
The fraction of the incident ultrasound intensity that is reflected is
determined by the nature of the tissues on either side of the boundary. The
key parameters are the density and speed of sound in the medium These
are combined in the acoustic impedance Z of the medium:
Z (speed of sound) × (density)
The S.I. unit of the acoustic impedance is kgm2s1
SOUND • 319
The greater the difference in acoustic impedances between the two media,
the greater the ratio. Here are some values of acoustic impedance:
Medium Speed of sound/ms1 Density/kgm3 Z/kgm2s1
Air 340 1.2 410
Water 1480 1000 1.5 106
Muscle 1590 1070 1.7 106
Blood 1550 1060 1.6 106
Bone 4000 1500 6.0 106
The low value of Z for air compared to body tissues means that if there is
an air gap between the ultrasound transmitter and the patient, then most
of the ultrasound reflects from the surface and does not enter the body. To
solve this problem, a gel is spread on the skin under the transmitter. The gel
has an acoustic impedance similar to that of water or body tissues, so most
of the ultrasound is transmitted rather than reflected.
Excess pressure
Time
The graph above shows the result of adding three sinusoidal sound waves
of the same amplitude but with frequencies f, 2f, and 4 f. This can also be
represented by the equation:
y A sin t A sin 2 t A sin 4 t
16.7 Exercises
Signal
generator
Vertical cylinder
40 cm
containing air
h Water
supply
17
ELECTRIC CHARGE AND
ELECTRIC FIELDS
17.2 Electrostatics
Electrostatics deals with situations where objects become charged, remain
charged, or lose charge. Most of these situations can be explained in terms
of a transfer of electrons. The reason for this is because the electrons are
on the outside of the atom, so when atoms interact (e.g., if two materials
are rubbed together), the electrons can move from one place to another.
Protons are locked inside the atomic nucleus, so proton transfer does not
occur. When a neutral object loses electrons, it becomes positively charged;
and when it gains electrons, it becomes negatively charged.
An object is said to be “earthed,” when it is connected to the earth
by a good conductor (e.g., an electrical wire). From the point of view of
electrostatics, the Earth itself is a huge conducting sphere that can gain or
lose any number of electrons while remaining neutral. Another
way of looking at this is to say that the earth is always at zero
potential so that any object connected to it is also at zero potential.
The symbol to show an earth connection is shown on the right.
Surface negative
charge on rod
Polythene rod
Polythene rod
Electron transfer + +
+ + +
+ ++
+
Surfacee
Cloth rubbed back and forth positive
along surface of rod
a positive surface charge. This is because atoms on the surface of the rod
are no longer electrically neutral; some of them have fewer electrons than
protons and so have a net positive charge. However, this positive charge
has, once again, been brought about by the movement of the negatively
charged electrons.
Insulator
Steel rod
Gold leaf
Earthed box
The cap rod and leaf are all metallic conductors, but they are isolated from
the earth by an insulator. If a positively or negatively charged rod is held
close to the cap but not touching it, the leaf will rise. This is because the
charge on the rod exerts electrostatic forces on the free electrons making
them move. The rod and leaf gain the same charge and repel one another.
326 • FOUNDATIONS OF PHYSICS
The leaf is very thin, so it responds to this by rising. In the diagrams below,
only the cap, leaf, and rod are shown.
++++++
− − − − − − Cap gains electrons
and becomes negative
Electrons attracted
toward positive rod
+
+
+
+++
Rod and leaf lose
electrons and
become positive
−−−−−−
+ + + + + + Cap loses electrons
and becomes positive
Electrons repelled
from negative rod
The process by which the cap gains a charge opposite to the charge on the
rod is called electrostatic induction. In the examples above, there is no net
charge on the cap leaf and rod; the electrons have just been redistributed.
However, it is possible to use induction to give the electroscope a net positive
or negative charge—this is called charging by induction and is explained
in the sequence of diagrams below (resulting in a net positive charge).
−−−−−− −−−−−− Electrons
++++++ ++++++ + + +
Electrons
Electrons
+ +
+ +
1. Hold a negatively charged rod close the cap. The leaf rises.
2. Keeping the negatively charged rod in place; momentarily earth the cap
and then disconnect the earth. Electrons flow to earth and the leaf falls.
3. Remove the rod. The electroscope has lost electrons, so the cap, rod,
and leaf have a net positive charge. When the rod is removed, electrons
spread out and there is a positive charge on the cap, leaf, and rod. The
leaf rises.
ELECTRIC CHARGE AND ELECTRIC FIELDS • 327
Charge is measured in
nano-coulombs
θ θ
T T
F Q Q F
r
mg mg
T cos mg
T sin F
So F mg tan
Q can be found by discharging the sphere to a coulomb meter.
If Q is kept constant (charging from the same high voltage) and the mass
of the spheres is varied, then F and r will change, so the inverse-square law
can be tested.
Q can be varied by charging the spheres from a different voltage, but
this will also change r unless a compensating change in mass is made.
F
E
q
The S.I. unit of electric field strength is NC1, which is equivalent to Vm1.
The electric field is a vector field: the electric field points in the direction
of the force on a positive charge.
This can be represented by drawing field lines. The direction of the
field lines is the direction of the electric field, and the separation of the field
lines represents the strength of the field. The two diagrams below show
the shape of the electric field close to a point positive and a point negative
charge.
+ −
EB EC
C EA
A +
B +
The diagram below shows the electric field close to a dipole—two charges
of equal magnitude separated by a small distance. Many molecules have
electric dipoles.
− +
This also makes sense from a geometrical point of view. Imagine the
field lines from a point positive charge spreading uniformly out in all
directions in 3D space. They would spread over an area 4r2 at distance
r from the charge, so the density of field lines would fall off as an inverse-
square law.
This formula can be used to find the resultant electric field from a
number or distribution of charges.
The diagram below shows an electric dipole consisting of two charges,
Q and Q, separated by a distance 2a. Consider the electric field at a point
Pat distance x from the center of a dipole along its axis:
x
+Q –Q
E–
E+
2a P
Q Q axQ
EP 2 2 2
4 0 x a 4 0 x a 0 x2 a2
The negative sign indicates a resultant field to the left.
Here is another example at another point in the dipole field:
E+
P
E–
x
θ
+Q –Q
2a
Sometimes the electric fields of two or more charged particles sum to zero
at a point. This is called a neutral point. For example, half way between
two charges of the same magnitude and sign:
Q Q
EP 0
4 0 a2 4 0 a2
ELECTRIC CHARGE AND ELECTRIC FIELDS • 333
Neutral point
a a
+ +
P
Spherical surface
of area A = 4πr2
The electric flux E is the product of the normal component of the electric
field E and the area A of the surface, then:
E EA E × 4r2
When this is rearranged and compared with the expression for the field
strength of a point charge, it is clear that the flux through the surface is
directly related to the charge within that surface.
Q
E E2
4 r 4 0 r 2
Q
so that: E
0
This is an example of a general result known as Gauss’s theorem. This
states that:
Q The total flux through any closed surface is equal to the total charge
contained within that surface divided by the permittivity of free space
(or of the medium if the charge is not in a vacuum).
This is a powerful theorem that can be used to understand how the electric
field behaves in a range of important situations. Gauss’s theorem can be
stated more precisely using an integral:
334 • FOUNDATIONS OF PHYSICS
iN
Qi
E E.dS i1
0
surface
Where dS is an infinitesimal element of the surface area at some point and
E.dS is the product of the perpendicular component of E at that point and
the area dS. The integral sums these contributions to find the total flux (in
the previous example, the field strength was constant and perpendicular to
iN
the surface, so we simply multiplied the values together). i1
Qi is the
sum of charges contained inside the closed surface. The next section shows
how Gauss’s theorem can be used to derive some important results.
+ + + + + + + + + + + + + + + +
Conductor
Q The electric field through the lower surface must be zero because it is
inside the conductor.
Q The electric field through the sides of the cylinder must also be zero
because there can be no component of electric field in the surface of the
conductor.
Q All of the electric field must pass through the upper surface.
Hollow
conducting Gaussian
box surface
The electric field strength is zero inside the conductor, so the flux through
the Gaussian surface is also zero. Using Gauss’s theorem the total charge
contained within the Gaussian surface and therefore inside the hollow box
must also be 0. This remains the case even if other charged objects are
brought close to the box or if electromagnetic waves are incident on the
box. The field strength inside the box remains 0.
This is an example of a Faraday cage—a conducting box used to
shield its contents from external electric fields and electromagnetic waves.
Metal cars and aircraft act as Faraday cages protecting their occupants
from lightning strikes. A room with conducting walls, floor, and ceiling can
also be used to provide security against unwanted communications—e.g.,
mobile phone signals (which cannot enter or leave the room). The effect
can be demonstrated by placing a mobile phone inside a metal cookie tin.
When the tin is closed it is impossible to ring the mobile phone.
B
The electric field exerts + The electric field
exerts a force
a force downwards on
downwards on B and B
A but A moves upwards
moves downwards so
so work is done on A by
work is done by the
an external agent and
electric field and the
the electric potential + electric potential
energy increases A energy decreases
V dV Edx
VA xA
Three immediate consequences of these equations are:
Q There is no change in potential when a charge is moved in a direction
perpendicular to the electric field lines. In this direction E 0,
so dV/dx 0 too and the potential is constant.
Q The potential changes more slowly with distance when the field is weak
(dV/dx is lower because E is lower).
Q The direction of the electric field is toward lower potential (this is the
significance of the minus sign).
Equipotential surfaces can be drawn perpendicular to the field lines.
The diagram below shows the equipotential surfaces (spherical surfaces)
surrounding a point charge. The steps in potential between adjacent
equipotentials is constant, so their separation increases farther from the
338 • FOUNDATIONS OF PHYSICS
central charge; this is because the field is weaker further out so the potential
is changing more slowly.
Equipotential
surface
Sccelerating
voltage V
Vacuum
Electron beam
Heater supply Electrons
(low voltage)
V Edx
xA
can be used to find the absolute potential at point B if we know the potential
at point A. In order to find the potential at any point in the universe, we
must define the position where potential is zero. This is an arbitrary decision
because forces and fields only depend on differences in potential and not
on its absolute value. However, it makes sense to choose the position for the
zero of potential in such a way that it is easy to use in calculations. The zero
of potential in the electric field is taken to be at infinity. In other words, if
all charges were separated so that they were at infinite distance from one
another, then the total electrical potential energy would be zero.
Here are two qualitative examples of how this work:
Q Consider two point positive charges a distance r from each other. These
will repel each other, and if no other forces act on them, they will move
toward infinity. While they are moving, electrical forces are doing work
on them, transferring electrical potential energy to kinetic energy, so
ELECTRIC CHARGE AND ELECTRIC FIELDS • 341
The electric field strength at a point distance r from a point positive charge
Q is:
Q
E
4 0 r 2
So the potential difference between P (at distance r from Q) and infinity is:
x x
Qdx
V V VP Edx 4 x
0
2
xr xr
x
Q Q
4 0 r xr 4 0 r
Q Q
VP V
4 0 r 4 0 r
The potential varies as 1/r.
342 • FOUNDATIONS OF PHYSICS
1. (a) A small electric cell can supply a continuous current of 2.0 A for
2 hours. How much charge passes through the cell in this time?
(b) A large capacitor (component for storing charge) is charged using a
steady current of 4.0 mA. It takes 0.75 s to reach full charge. How
much charge does it store?
(c) An electrostatic generator stores 8.0 C on a large metal dome. This
charge leaks away through the air in 40 s. What is the average electric
current during this time?
2. Write an instruction sheet explaining how to charge an electroscope
positively using just a polythene rod and a cloth.
3. The diagram below is a model of an electric dipole.
y
D −2.0 µC +2.0 µC B
4.0 µm
E
ELECTRIC CHARGE AND ELECTRIC FIELDS • 343
The dipole is located on the x-axis with its center at the origin.
(a) Calculate the electric field strength (and direction) and the electric
potential at points A to E:
A (0, 0)
B (4.0 m, 0)
C (0, 4.0 m)
D (4.0 m, 0)
E (0, 4.0 m)
4. A hydrogen atom consists of a proton and an electron orbiting at a
distance of 0.053 nm.
(a) Calculate the force between the electron and the proton.
(b) Assuming this force provides the centripetal force for circular motion
of the electron, work out the electron’s orbital speed and kinetic
energy.
(c) Calculate the electrostatic potential energy of the electron.
(d) Calculate the total energy of the electron and explain the significance
of its sign.
(e) State the ionization energy of the hydrogen atom.
5. (a) Sketch the field lines and equipotentials round an isolated charged
conducting sphere of radius 2.0 cm at a potential of 5000 V (include
equipotentials at 1000, 2000, 3000, and 4000 V).
(b) Calculate the electric field strength and potential at distances of 4.0
and 11 cm from the center of the positively charged sphere.
6. Copy the diagram below and sketch the electric field lines and
equipotentials between the charged sphere and the earthed conducting
plane shown below.
(c) that the flux of electric field entering a volume of empty space is
equal to the flux of electric field leaving that volume of space
8. (a) Prove that the uniform electric field between two parallel conducting
plates separated by distance d and connected to a potential
difference V is given by E V/d.
(b) Sketch the electric field lines and equipotentials between two parallel
metal plates 2.0 cm apart with a potential difference of 5000 V
(include equipotentials at 1000, 2000, 3000, and 4000 V).
(c) Calculate the electric field strength between the plates.
(d) Calculate the force on an alpha particle (charge 2e) half way between
the plates.
(e) How does this force vary if the alpha particle moves close to the
positive or negative plate (ignore induction effects)?
(f) An air molecule between the plates loses two electrons and becomes
ionized. What is the ratio of the acceleration of the electron to the
acceleration of this positive ion in the electric field? Assume that the
mass of the ion is 60,000 times greater than that of the electron.
9. An electron in a vacuum tube is accelerated horizontally in an electron
gun through a potential difference of 1500 V. It then enters a region of
uniform vertical electric field of strength 2.0 104 Vm1 that extends
5.0 cm horizontally. It is deflected by the field and emerges into a field-
free region.
(a) Calculate the velocity of the electron as it enters the region of vertical
electric field.
(b) Describe the shape of the electron’s path in the field. Assume the
direction of the field is vertically upwards.
(c) Calculate the time spent by the electron in the vertical field.
(d) Calculate the vertical component of velocity gained by the electron in
moving through the vertical field.
(e) Calculate the angular deflection of the beam.
(f) Calculate the work done by the vertical electric field on the electron.
CHAPTER
18
D.C. ELECTRIC CIRCUITS
+ + +
+
+ + + +
346 • FOUNDATIONS OF PHYSICS
Positive and negative charge carriers (e.g., when current flows through
an electrolyte): The positive charges move in the direction of conventional
current and the negative charges move in the opposite direction.
Conventional current I
− + −
−
+ + − +
In all the examples above the velocities of the charge carriers represent
their average drift velocities; the particles will also have random thermal
motion, which might involve speeds that are several orders of magnitude
greater than the drift velocities.
+ v + v + v
+ v P
+ v + v + v
vδt
There are n charge carriers per unit volume in the conductor, and each
charge carrier has a charge q.
D.C. ELECTRIC CIRCUITS • 347
Consider the charge leaving the right-hand end of the wire at P during
a short time t.
All charge carriers within a distance vt will leave in this time. The
volume within this distance of point P is Avt, so the number of charge
carriers is N nAvt and the total charge Q passing P in time t is:
Q qnAvt
so the current is:
Q
I nAvq
t
Very often the charge carriers have a charge equal to the electronic charge,
e; so this equation becomes:
I nAve
Typical values of the charge carrier density n for two different metals
and two types of semiconductors are shown below. The values are for a
temperature of 300 K.
Copper, 8.5 × 1028 m3
Silver, 1.1 × 1028 m3
Silicon, 1.5 × 1013 m3
Germanium, 2.4 × 1016 m3
Note that the carrier density in the pure semiconductors is very much
smaller than for the metals, so the drift velocity in a semiconductor carrying
the same current as a metal of the same dimensions will be much greater
than in the metal. The carrier density is highly temperature-dependent in
semiconductors and increases rapidly with temperature. Pure, or intrinsic,
semiconductors contain both negative charge carriers (electrons) and
positive charge carriers (holes) in equal numbers, but doped, or extrinsic,
semiconductors have small quantities of other elements added to increase
their carrier density of a factor of 106 or more.
I1 A A I1 I1 = I2 + I3 + I4
I1
If there are no junctions in the circuit, then the current is the same
everywhere—i.e., at all points around a series circuit.
The current that flows in each branch of a parallel circuit is inversely
proportional to the resistance of the branch.
An ideal voltmeter has infinite resistance so that it does not draw any
current from the circuit. In practice a real voltmeter will have a large but
not infinite resistance, and this can affect readings if there are also very
large resistors in the circuit.
The longer line represents the positive end of the cell or battery. The emf
of a battery consisting of cells in series is equal to the sum of the emfs of the
individual cells. An ideal cell provides a constant emf and has no internal
resistance. A real cell does have an internal resistance, so must be treated
as an ideal cell of emf E in series with a resistor of resistance r and is usually
represented as:
E
r
emfs p.d.s
closed closed
loop loop
If the potential differences are all across resistive components, this can be
written as:
iN
emfs I R i i
closed loop i1
where there are N resistors in the loop and the ith resistor has a resistance
Ri and current Ii.
The example below shows how Kirchhoff’s second law can be applied
to two different loops in an electric circuit:
Loop
V2
V1
E1 V V3
Loop
E2
E3 V4
D.C. ELECTRIC CIRCUITS • 351
18.3 Resistance
Resistance R is defined as the ratio of potential difference V across a
component to current I through the component.
V
R
I
The S.I. unit of resistance is , and 1 1 VA1. Large resistances are
measured in k and M.
1 k 1000 103
1 M 1000 000 106
resistance =
voltmeter reading
divided by ammeter
A
352 • FOUNDATIONS OF PHYSICS
Resistance
ranges Rotating switch
to select range
D.C. D.C. voltage
current ranges ranges: this
meter is set
A.C. A.C. voltage
current ranges ranges
10A
terminal: VΩ terminal:
the second the second
lead must be lead must be
connected connected
to this to this terminal
terminal COM terminal: “common” for the meter
for the meter connection for all uses of the to work as a
to work as an meter; one lead must be voltmeter or
ammeter connected to this terminal ohm-meter
D.C. ELECTRIC CIRCUITS • 353
P
IP
VP
RP =
IP
Potential
0 difference/V
0 VP
R
A
To reverse the current through the component, the battery or the component
itself can be turned around. This allows both positive and negative values
for I and V. The current–voltage characteristics for three components are
shown below.
354 • FOUNDATIONS OF PHYSICS
Voltage/V
Current/A
Voltage/V
The fact that the graph is not a straight line through the origin shows that
this is a non-ohmic conductor.
The ratio of V to I is increasing, so the resistance has increased as the
current has increased.
The reason this happens is because the charge carriers passing through
the metal transfer energy to the metal ions, making them vibrate more
rapidly, and this in turn increases the scattering of charge carriers. More
work has to be done to maintain the current (re-accelerate the charge
carriers after scattering), so a higher voltage is needed for the same current.
This increases the ratio V/I and increases the resistance.
Components whose resistance changes with temperature are called
thermistors. The resistance of a metal increases with temperature, so
it is a positive temperature coefficient (PTC) semiconductor. Platinum
is commonly used as a PTC thermistor and can be used as a resistance
thermometer. Semiconductors have a resistance that falls as temperature
increases, so they are used as negative temperature coefficient (NTC)
thermistors. The circuit symbol for a thermistor is:
Q A semiconductor diode
The circuit symbol of a semiconductor diode is:
This is a conductor that conducts with very low resistance in one direction
(forward bias) and acts as an insulator (infinite resistance) in the other (up
to its breakdown voltage, at which point the diode resistance suddenly
drops and it is likely to be destroyed by the current surge).
A small voltage is required in the forward direction before the diode
begins to conduct. This “switch-on” voltage depends on the material
from which the diode is made. For silicon diodes it is about 0.6 V; and for
germanium diodes it is about 0.2 V. Once the forward voltage exceeds this
value, the diode begins to conduct with a very low resistance, and care must
be taken not to allow the current to grow so large that it melts the diode
(e.g., by having a fixed resistor in series with it).
Diodes are important components in rectifier circuits, used to convert
alternating current to direct current.
Current/A
Forward bias
Breakdown + −
voltage
Voltage/V
+ − Switch-on
voltage
VS
I I
RS
V
I I
RP
358 • FOUNDATIONS OF PHYSICS
18.3.4 Resistivity
The resistance of a component depends not only on the type of material from
which it is made but also on the dimensions of the component. Resistivity
is a property of the material alone and does not depend on its dimensions.
For example, the resistances of copper wires of different lengths and cross-
sectional areas differ, but the resistivity of the copper from which they are
made is the same (at the same temperature).
For a cylindrical wire of length l, cross-sectional area A, and resistance
R, the resistivity is given by:
RA
l
The S.I. unit of resistivity is m.
D.C. ELECTRIC CIRCUITS • 359
Typical resistivities (at 20°C) for different conductors are shown below:
Metal Resistivity
Silver 1.6 × 108 m
Copper 1.7 × 108 m
Aluminum 2.8 × 108 m
Tungsten 5.8 × 108 m
Platinum 1.1 × 107 m
Constantan 4.9 × 107 m
Steel 7.2 × 107 m (varies depending on type of steel)
Nichrome 1.3 × 106 m
Resistance wire
l Sliding contact
E
r
Representation
of a real cell
When the cell is connected to an external circuit, current flows through it,
and there is a voltage drop across the internal resistance. This results in the
potential difference at the cell terminals, the terminal p.d. V, falling below
the emf of the cell.
E lr
Current I to
external circuit
Terminal p.d. V
The potential difference across the internal resistance is equal to the work
that must be done per coulomb of charge to move charge carriers through
the cell. The terminal p.d. is therefore equal to the difference between the
cell emf and these “lost volts”:
V HIr
As the current drawn from the supply increases, the terminal p.d. falls.
The internal resistance also limits the maximum current that can
be drawn from the cell. This occurs when the cell is short-circuited by
connecting its terminals together with a conductor of negligible resistance.
The only resistance in the circuit is the internal resistance, so the short-
circuit (maximum) current from the cell is:
H
ISC
r
The internal resistance of a typical 1.5 V alkaline AA cell is about
0.15 , so the maximum current that could be drawn from it is 1.5/0.15
10 A. However, shorting the cell would drain it very quickly, so the working
current for a device operated by AA cells must be much less than this. If
a cell or battery is required to provide a very large current, it must have a
very low internal resistance. Car batteries have an emf of 12 V but need to
provide over 100 A when the ignition is switched on and the battery turns
the starter motor. Typical car batteries have an internal resistance < 0.01 .
362 • FOUNDATIONS OF PHYSICS
As the value of the load resistor is changed, the current I in the circuit and
the terminal p.d. V change too. I and V are related by the equation:
V HIr
comparing this to: y mx c
A graph of V against I is a straight line with a negative gradient and a
positive intercept:
Terminal p.d. / V
Gradient = − r
0 Current/A
0
Q intercept on y-axis emf
Q gradient r
D.C. ELECTRIC CIRCUITS • 363
The terminal p.d. is equal to the cell emf when no current is drawn, so
a simple way to measure the emf is to connect a voltmeter with very high
resistance across the cell terminals. The open-circuit terminal voltage is
equal to the emf of the cell.
H
I
R r
H2R
P I2R
R r 2
It is clear that P is 0 when R 0.
P also becomes asymptotic to 0 as R since the denominator then
dominates the expression.
The expression is positive for all values of R, so there must be a maximum
value for P at some point. This occurs when R r, i.e., the load resistor has
a resistance equal to the internal resistance.
(The fact that maximum power transfer occurs when R r can be
demonstrated using calculus. You need to find the condition for dP/dR
0; this must be a stationary value of the function and, in this case, it is a
maximum.)
364 • FOUNDATIONS OF PHYSICS
Power transfer P
Pmax
Load resistance R
0
0 R=r
R2
Vin
R1 Vout
R2 R2
Vin Vin
R1
R2 R3
i1 i2
i3 R4
E2
h+ h+ e−
e−
h+ e−
Current I h+ Current I
e−
h+ h+
e− e−
h +
e−
e −
h+
h+
Intrinsic semiconductor
the greater the amplitude of ionic vibrations. This increases the electron
scattering and the resistivity of the metal. The rate of increase is roughly
linear, so the variation can be modeled by the equation:
0 (1 TT0 )
resistivity at temperature T
0 resistivity at reference temperature T0
Values for the temperature coefficient of resistance for different metals
based on T0 293 K are shown below:
Metal Temperature coefficient of resistance
Aluminum 4.3 × 103 K1
Copper 4.0 × 103 K1
Silver 4.0 × 103 K1
Tungsten 4.5 × 103 K1
18.7 Exercises
200 Ω 200 Ω
200 Ω
A
V
D.C. ELECTRIC CIRCUITS • 371
(b) In fact the internal resistance is 10 . How does this affect your
answers to part (a)?
8. You are provided with three identical resistors each of resistance R.
Calculate and list the values of all the resistances you can make using
one or more of these resistors stating which arrangement corresponds to
which value.
9. A resistance network is constructed using 12 identical 1 resistors, with
each resistor placed so that it forms one edge of a cube. Calculate the
resistance between two opposite corners of the cubic array.
10. The circuit below is used to detect changes in light intensity. The resis-
tance of the LDR is 50 k in the dark and 100 in bright light.
1.0 kΩ
6V
RLDR Vout
(a) Calculate the output voltage in the dark and in bright light.
(b) How does the value of the top-fixed resistor affect the range of the
output voltages?
11. Use Kirchhoff’s laws to determine the readings on the three ammeters
and the voltmeter in the circuit below. Assume that the cell’s internal
resistance can be neglected.
6.0 V
A1
50 Ω
30 Ω 10 Ω
A3
A2
3.0 V
CHAPTER
19
CAPACITANCE
A A
S R C
Connection
to
oscilloscope
Time
19.1.2 Capacitance
The charge stored on a capacitor is directly proportional to the potential
difference between the two conductors. This can be verified using the
arrangement shown below:
CAPACITANCE • 375
Flying lead
Coulomb
meter
Potential difference/V
Charge
R
Q stored/C
The two expressions for E are equivalent since C Q/V. There is also a
third form of this expression in terms of just V and C (found by using Q
CV to eliminate Q):
E ½ CV 2
Here are the three equations for energy stored on a capacitor:
Q2 CV 2 QV
E
2C 2 2
CAPACITANCE • 377
R C
+Q −Q
Charge Q arrives
Charge Q leaves
negative at plate
positive plate
+Q −Q
The electric field strength is related to the charge density by the equation
(see Section 17.4.4):
Q
E
0 A0
Where is the charge density on one plate.
The electric field strength is also equal to the (negative) potential
gradient:
V
E ()
d
Equating the two expressions for E (ignoring signs):
Q V
A0 d
The capacitance of a parallel plate air capacitor is, therefore,
Q A
C 0
V d
And if the air gap is replaced by a dielectric material of relative dielectric
constant r, the capacitance is given by:
A
C 0 r
d
To create large capacitance, we need a high dielectric constant, large area,
and small separation.
CAPACITANCE • 379
Q(t)
ln RC
Q
t
Q t Q0 e RC
The graph below shows how the charge, current, or p.d. across the
capacitor changes as it discharges:
Percentage of Q0, I0 or V0
100
90
80
70
60
50
40
30
20
10
0
t = RC t = 2RC Time/s
90
80
70
60
Increasing time
50 constant (RC)
40
30
20
10
0 Time/s
I R C
+Q −Q
VR VC
We can use Kirchhoff’s second law to set up a differential equation for the
charge on the capacitor:
VS VR VC
Q
VS IR
C
382 • FOUNDATIONS OF PHYSICS
dQ Q
VS R
dt C
dQ 1
(CVS Q)
dt RC
dQ 1
(Q Q)
dt RC F
Where QF CVS is the final charge when the voltage across the capacitor is
equal to the supply voltage.
This is a first-order linear differential equation whose solution is:
Q QF 1 e
t
RC
The difference between Q and QF decays exponentially toward zero.
The voltage across the capacitor is directly proportional to the charge
stored (V Q/C), so the voltage rises in a similar way to the charge:
V VS 1 e
t
RC
Current is the rate of change of charge, so the charging current decays
exponentially from an initial value I0 VSR
t
I I0 e RC
80
60
40
RC = 2.0 s
20
0 Time/s
0 2 4 6 8 10 12
+Q −Q +Q −Q +Q −Q
V1 V2 V3
The total charge on the set of series capacitors is therefore Q, and the
voltage across the set is equal to the sum of voltages across the individual
capacitors.
V V1 V2 V3
Q Q Q Q
Cseries C1 C2 C3
1 1 1 1
Cseries C1 C2 C3
For n capacitors in series,
in
1
Cseries
C1 i
i1
This has the same form as the equation for resistors in parallel, but here it
applies when the capacitors are in series.
19.4.2 Capacitors in Parallel
When capacitors are connected in parallel, they all have the same voltage
across them, and the total charge stored is the sum of the charges stored on
each capacitor.
V
C1
C2 Q Q1 Q2 Q3
Cpara V C1 V C2 V C3 V
C3
Cpara C1 C2 C3
384 • FOUNDATIONS OF PHYSICS
This has the same form as the equation for resistors in series, but here it
applies when the capacitors are in parallel.
19.6 Exercises
A1 A2
R C
S
V2 V3
S1
C1 = 50 µF
S2
C2 = 100 µF
Initially both switches are open and both capacitors are discharged.
S1 is now closed and C1 charges.
(a) Calculate the charge and energy stored on C1.
S1 is now opened and then S2 is closed connecting the two
capacitors together.
(b) Calculate the charge on each capacitor once they reach equilibrium.
386 • FOUNDATIONS OF PHYSICS
(c) Calculate the total energy stored on both capacitors and compare
this to the value you obtained in (a). Account for the difference.
Now consider a different scenario. Both capacitors are discharged
and then:
S1 is now closed and C1 charges. S1 is kept closed and S2 is also
closed connecting the two capacitors together while still connected
to the power supply.
(d) Calculate the charge on each capacitor once they reach equilibrium.
(e) Calculate the total energy stored on both capacitors and compare
this to the value you obtained in (a). Account for the difference.
5. You are provided with three identical capacitors each of capacitance C.
Calculate and list the values of all the capacitances you can make using
one or more of these capacitors. State which arrangement corresponds
to which value.
6. The diagram below shows a parallel plate air capacitor of area A and
plate separation d connected to a power supply and a resistor.
The plates are pulled apart by an external agent and their separation is
doubled. Describe and explain what happens:
Q to the charge stored on the capacitor
Q to current flow in the circuit
Q to the energy stored on the capacitor
Q to the voltage across the capacitor
7. A 470 F capacitor is charged to 10 V and then discharged through
a 2200 resistor. Calculate the charge and voltage remaining on the
capacitor after 1.5 s.
CHAPTER
20
MAGNETIC FIELDS
Michael Faraday explained these effects by saying that a bar magnet has
two poles, north and south, and that these create a magnetic field in the
space surrounding the magnet. The lines of magnetic field begin on north
388 • FOUNDATIONS OF PHYSICS
poles and end on south poles; they cannot start or end in space (although
they can form closed loops). The direction of the field lines is the direction
of force that would be exerted on a free north pole (even though free north
poles do not exist in nature). This is all very similar to the description of the
electric field and the link between electric field lines and electric charges.
N S
F
N
F
S
The Earth has its own magnetic field. The north pole of a compass points
toward the Earth’s North Pole. This implies that the geographic North Pole
of the Earth is actually a south magnetic pole. Sometimes the poles on
permanent magnets are called “north-seeking” or “south-seeking” poles, so
that the north-seeking pole of a compass always points toward magnetic
north.
MAGNETIC FIELDS • 389
From a distance, the magnetic field in space created by the Earth has
a similar shape to that of a large bar magnet. However, the center of the
Earth is very hot, and this high temperature would destroy any permanent
magnetism (it is above the Curie point for iron and nickel). The Earth’s
field is generated by electric currents.
Nickel: 637 K
Neodymium magnets: ~590–640 K
Circuit to
control current
A
Current-carrying wire of length l
l
N X S
l
Top pan Constant uniform
balance magnetic field directed
out of page
In the diagram above, the X on the end of the wire represents a current into
the page.
A permanent magnet with flat pole pieces is placed on a top pan
balance. The magnet creates a uniform horizontal magnetic field (directed
from N to S). A separate circuit is used to control the current through a wire
that passes horizontally between the poles perpendicular to the magnetic
field lines. As the current is increased from zero, the reading on the top pan
balance decreases, showing that the magnet is being pulled up and the wire
(by Newton’s third law) is being pulled down. The magnitude of the force
increases as the current is increased. Experiments like this can show that
the magnetic force F
Q is directly proportional to current I
Thumb:
“thrust”
or force
Second finger:
current
Conventional
Conventional q v q v q q v current I
current I v q
q v q v v
l Cross-sectional
area A
Magnetic field of strength B
directed into the page
MAGNETIC FIELDS • 393
Magnetic field of
f q strength B directed
into page
v
q
force act along the same line and can be made to oppose one another. By
adjusting the values of the fields, the two forces can be made to cancel out
so that the beam is undeflected.
For the beam to be undeflected,
fmagnetic felectric
Bqv Eq
E
v
B
Uniform electric field of strength E
fmagnetic
q q q
Magnetic field of
strength B directed
into page
felectric
For any ratio of electric field strength to magnetic field strength, there
is just one velocity that will be undeflected. This arrangement is called a
velocity selector. If a stream of charged particles with a range of incident
velocities enters the region of crossed fields, then only those satisfying the
equation above go straight through.
Mass spectrometer
A velocity selector is used to send ion beams with a particular velocity into
a mass spectrometer. The beams are then deflected in a constant magnetic
field so that they move in a semicircular path. The radius of curvature of
this path can be used to determine the masses of the ions. It is also possible
to measure the amount of each type of ion that is present in the beam, so
mass spectrometers are ideal for analyzing ionic ratios in samples.
396 • FOUNDATIONS OF PHYSICS
Region of uniform
magnetic field B2
E
B1
Source of ions
with a range of
velocities
B2
Current
element I
δl
I x
P δB
The direction of B is found using the right-hand rule and using the
continuation of the current element as the current direction. The magnitude
of B is given by:
I sin
B 0 2 l
4 x
where 0 is the permeability of free space, a constant that determines the
ability of a vacuum to support a magnetic field.
x
δB
0 I sin / 2
B l
4 r 2
l2 r
0 I sin / 2
B
4 r 2
l
l0
398 • FOUNDATIONS OF PHYSICS
All the terms in the curly brackets are constant, so this is a very simple
integral resulting in:
I
B 0
2r
If the coil has N turns the magnetic fields add together:
NI
B 0
2r
An expression for the field strength at other points along the axis of
the narrow coil can also be determined using the Biot-Savart law. The
integration is a little more complicated, but the result is:
0 Ir 2
B 3/2
2 z2 r 2
where z is the distance along the axis from the center of the coil. When
z 0 the expression reduces to the previous equation (as it should!).
r x
θ δl
l I
The resultant magnetic field forms concentric rings around the line of the
electric current.
I sin
B 0 2 l
4 x
The contributions from current elements to the left and right of P are the
same, so the resultant magnetic field strength at P is:
l
0 I sin
B 2 4 x2
dl
l0
MAGNETIC FIELDS • 399
The three variables , x, and l are all related, so in order to carry out the
integration we need to express two of them in terms of just one of them and
the constant distance r.
r
x
sin
r
l
tan
dl r
so 2
d sin
The integral now becomes:
0
0 I sin d
B 2 r
2
0 I
B
2 r
20.3.4 Magnetic Field Along the Axis of a Solenoid
A solenoid is a long coil, so the magnetic field along its axis can be found
by integrating the contributions from narrow coils along the length of the
solenoid. The result is that the field strength at the center of a long solenoid
is:
NI
B 0
l
where N is the number of turns, I is the current in the solenoid, and l is the
length of the solenoid.
The field strength drops to half of this value at each end and is small or
negligible outside the solenoid (except near the ends).
I I
0 NI 0 NI 0 NI
Bend Bcenter Bend
2l l 2l
The external field pattern is similar to that of a dipole bar magnet, one end
of the solenoid acts like a north pole and the other acts like a south pole.
400 • FOUNDATIONS OF PHYSICS
δl
I
X
r
MAGNETIC FIELDS • 401
B.dl 2rB 0 I 0 I
closed enclosed
loop by loop
0 I
B
2 r
B C
South
pole
I I
b
North
pole
A D
a
Sides AB and CD are perpendicular to the magnetic field, so magnetic
forces act on these wires. On AB the force is into the page and on CD it is
out of the page.
The magnitude of each force is F BIb. These two forces create a
turning effect about the central vertical axis (dotted line). The resultant
couple or torque is:
a
2 BIb BIab BIA
2
402 • FOUNDATIONS OF PHYSICS
where A ab is the area of the coil. If the coil has N turns, the torque is:
NBIA
20.4.2 A Simple D.C. Electric Motor
To make a working motor using a coil in a uniform magnetic field, the
turning effect on the coil must always be in the same direction. For this
to be the case, the direction of current in the coil must be reversed every
half rotation. The reason for this is clear from the diagrams below, which
show the end view of a motor coil as it rotates in the field. The end of the
coil is shown in three different positions with side AB on the left and CD
on the right. For the coil to continue to rotate in the same direction when
AB moves past the vertical dotted line, the current direction must reverse.
The thick black arrows represent the magnetic forces on the wires as the
coil turns. Current
reverses
X CD
AB
South pole
North pole
AB X CD
X
AB CD
The device used to reverse the current is called a split-ring commutator. For
a simple motor with a single coil, the commutator consists of a conducting
cylinder split in half so that the halves are separated by an insulator. The
commutator is attached to the axis of the motor and rotates with it. Current
enters and leaves the coil via brushes that make a sliding contact with the
surface of the commutator.
MAGNETIC FIELDS • 403
Brushe
Conductors
Coil rotates with
commutator
Rotation axis
20.5 Exercises
1. The Earth’s magnetic field is very similar to the field of a dipole bar
magnet. However, geophysicists are sure that the field is not caused by a
permanent magnet inside the Earth. Explain why not.
2. Two long straight parallel wires separated by 0.045 m each carry a
current of 2.0 A in the same direction.
(a) Draw a diagram showing the magnetic field around one of the wires
interacts with the other wire, and use this diagram to explain why
the wires exert a force on one another. State the direction of this
force.
(b) Calculate the magnitude of the force per unit length on each wire.
(c) Explain why coils of wire carrying a very large current must be able
to withstand large stresses.
3. A mass spectrometer is used to measure the masses and abundancies
of different isotopes. It does so by accelerating ions of each isotope to
the same speed and then deflecting them into a semicircular path in a
strong uniform magnetic field.
(a) Explain how a velocity selector, consisting of perpendicular electric
and magnetic fields, can be used to select ions of the same speed
from a group containing a wide range of different speeds.
(b) Ions of mass m1 and m2 and equal charge q enter the same uniform
magnetic field of strength B at the same velocity v at right angles
to the field lines and both are detected after moving through a
complete semicircle in the field. Derive an equation for their
separation.
404 • FOUNDATIONS OF PHYSICS
(a) Use a diagram to explain why there is a resultant moment on the coil
and state the direction of this moment.
(b) Describe qualitatively how the moment on the coil changes as
varies from 0° to 90°.
The coil has 80 turns and an area of 0.012 m2. It carries a constant
current of 0.65 A.
(c) Calculate the moment on the coil when 0°, 30°, 45°, 60°, and
90°.
(d) Explain how the coil would move if it was released from a horizontal
position and allowed to move freely about a central axis directed
into the page.
(e) Explain what has to be done to the current in the coil if it is to
operate as a D.C. motor.
CHAPTER
21
ELECTROMAGNETIC INDUCTION
Generator
Mechanical Electrical
energy energy
Galvanometer deflects
N S when wire moves, and
G
reads zero when wire is
stationary in or out of field
Bar magnet S
moved along
dotted line
N
Coil G
Soft iron
core
Soft iron
core
A.C.
To oscilloscope
input 2
To oscilloscope
input 1
Note that the peaks of the induced emf occur at times when the rate
of change (gradient) of the magnetic field through the secondary coil is
greatest and that the induced emf is zero at times when the rate of change
of the magnetic field through the secondary coil is zero (gradient is zero).
Magnetic flux
If a constant uniform magnetic field of strength B passes normally through
a surface of area A, the flux is defined as the product BA.
Magnetic field of strength B
Area A perpendicular
to magnetic field
Φ = BA
δA
The total flux is found by integrating these contributions across the surface:
Bcos dA
surface
where B might vary from point to point.
The S.I. unit of magnetic flux is Tm2, which is called Wb.
1 Wb 1 Tm2 or 1 T 1 Wbm2
Writing the tesla in this way emphasizes the fact that it is the magnetic flux
density.
Region of
magnetic field of
strength B into
G v v d
Conducting
vδ
The conductor runs along two parallel conducting rails connected back to
a stationary galvanometer. As the conductor moves, it cuts through lines of
magnetic field. In time t it cuts all the magnetic field lines in the darker
shaded area, and the flux through the circuit increases by an amount equal
to the flux through that additional area.
Flux cut in time t, Bdvt
Rate of flux-cutting:
d
Bvd
dt
Using Faraday’s law, the induced emf in the moving conductor is:
H = ()Bvd
412 • FOUNDATIONS OF PHYSICS
Lenz’s law
The negative sign in the equation of Faraday’s law can be interpreted in the
following way:
Q The direction of the induced emf is such as to oppose the change that
caused it.
This sounds a little obscure, but becomes clear when we apply it to the
example above. In order to induce an emf, we needed to push the wire
to the right. The existence of an emf in the circuit causes a current to
flow. However, when a current flows in a conductor lying perpendicular
to the magnetic field, there is a motor effect force on the conductor that
is perpendicular to both the current and the field. In this case the force
must lie either in the direction of v or in the opposite direction. If it was in
the direction of v then once we had started the wire moving, the induced
emf would create a current that experienced a force in the direction of
motion and the wire would accelerate with no need for us to do any further
work on it. Energy would be generated from nothing! This violates the law
of conservation of energy, so it cannot occur. The force on the induced
current must oppose the force moving the wire so that we do work to move
it and “pay” for the electrical energy it generates. Lenz’s law ensures that
energy is conserved when an emf is induced. It is the reason we need to
burn fuel to turn the generators in a power station.
δt
ELECTROMAGNETIC INDUCTION • 413
B = B0 cos ωt
21.3 Inductance
When there is current in a coil, the coil becomes an electromagnet and
creates a magnetic field. If the magnetic field changes, there is changing
flux linkage inside the coil, so an induced emf is created that opposes the
external supply that is changing the current. This is often called a “back
emf.” This opposition to changing current is called inductance. Coils act as
inductors in electric circuits.
414 • FOUNDATIONS OF PHYSICS
21.3.1 Self-Inductance
The tendency for a coil to oppose changes of current is called inductance.
The greater the inductance, the stronger the opposition to changing
currents. Inductance is defined by:
dI
H L
dt
H
L
dI
dt
Inductance is equal to the back emf per unit rate of change of current.
The S.I. unit of inductance is VsA1 or the henry (H). 1H 1 VsA1.
An expression for the self-inductance of a long solenoid can be
determined by comparing the definition of inductance with Faraday’s law:
dI dN
H L
dt dt
0 N 2 IA
NBA
l
So
2
dI 0 N A dI
L
dt l dt
0 N 2 A
L
l
The symbol of an inductor is:
VS Current
VS /R
VR VL Time
0
0
VS VR VL
dI
VS IR L
dt
This is a first-order differential equation that can be solved to give an
expression forI.
R
t
I I0 1 e L
This has a similar form to the equation for the charging of a capacitor, and
(R/L) is the effective time constant.
If S is open the current falls to zero in a very short time so that dI/dt
is very large. This results in a large back emf that can cause sparks across
the switch terminals. Interrupting the current in a coil is a way to generate
spikes of high voltage—e.g., for a car’s spark plugs.
E Pdt ILdI 12 LI 2
t0 I0
416 • FOUNDATIONS OF PHYSICS
21.4 Transformers
Electromagnetic induction is the key principle on which the generation
and distribution of electrical energy depends. Thermal power stations use
fossil or nuclear fuels to transfer chemical or nuclear energy to thermal
energy and then use this to power generators that output A.C. electricity.
Transformers step up the voltage so that electricity can be transmitted with
ELECTROMAGNETIC INDUCTION • 417
low losses over large distances. More transformers are used to step down
the voltage for consumers.
21.4.1 An Ideal Transformer
A transformer is a device that uses mutual induction to step up or step down
an A.C. voltage.
Soft iron laminated core
I
I
Output
A.C. supply V1 R
voltage
Flux in core
Induced emf in
secondary
Time
418 • FOUNDATIONS OF PHYSICS
The peaks of emf correspond to times when the flux in the core has its
greatest rate of change.
For an ideal transformer, all of the flux created by the primary passes
through the secondary. If there are equal numbers of turns on both
primary and secondary, the back emf in both coils will be equal and will
equal the supply voltage. However, the voltage on the secondary is also
directly proportional to the number of turns on the secondary (V2 µ N2), so
changing the turns ratio (N2/N1) must change the voltage ratio (V2/V1) in a
similar way. This leads to the transformer equation:
V2 N2
V1 N1
The voltage ratio is equal to the turns ratio.
Q If N2 > N1 then V2 > V1 and it is a step-up transformer.
Q If N2 < N1 then V2 < V1 and it is a step-down transformer.
While a transformer can increase voltage, it cannot create energy; the power
supplied to the transformer must be equal to or greater than the power it
delivers to a load. An ideal transformer has 100% efficiency, so:
Pin Pout
I1 V1 I2 V2
I 2 V1 N 1
I1 V2 N 2
The current ratio is the inverse of the voltage and turns ratios.
Resistance R of V3 to consumer
V1 from generator
transmission lines
I1 I2
V1 V2 V3
Step-up Step-down
transformer transformer
ω
South
North
Flux linkage:
NBA sin
The coil is rotating with constant angular velocity :
t
dN
Induced emf, H () ()NBA cos t
dt
ELECTROMAGNETIC INDUCTION • 421
Induced
voltage
ωNBA
Time
T = 2π/ω
The negative sign in the equation has been ignored in the graph. The output
is an A.C. voltage with peak value E0 NBA.
S
Oscillations
N
Switch
422 • FOUNDATIONS OF PHYSICS
Copper
pipe FEM
Neodymium
v
magnet
mg N
As magnet falls, its flux cuts the conductor surrounding it. This induces
current loops above and below the falling magnet. These create magnetic
fields that oppose the magnet’s motion.
ELECTROMAGNETIC INDUCTION • 423
conductor inducing emfs. The induced emfs create current loops and the
magnetic fields of these loops oppose the motion of the magnet. There is a
magnetic drag force that slows the magnet down. This can be a surprisingly
large effect if the metal sheet is titled at a large angle.
Another intriguing demonstration involves dropping a cylindrical
neodymium magnet down a copper water pipe with an internal diameter
slightly greater than the diameter of the magnet. The magnet falls at a slow
terminal velocity. Electromagnetic damping forces balance its weight.
Motion of magnet
F
The moving magnetic field induces emfs in the metal that create current
loops. By Lenz’s law these create magnetic fields that oppose the relative
motion, resulting in the forces on the magnet and conductor as shown
above. The effect is that the conductor tends to follow the moving magnetic
field.
In an induction motor, coils are used to create a rotating magnetic field.
A non-ferromagnetic rotor is placed in the rotating field and it too rotates.
This has the big advantage that there are no brushes. The idea was first
suggested by Nikola Tesla, and induction motors are now widely used from
DVD players to electric cars.
A rotating magnetic field can be created by superposing two alternating
magnetic fields at right angles to one another and giving them a phase
difference of /2. If we represent each magnetic field by a phasor, then the
resultant of the two phasors represents the rotating field.
424 • FOUNDATIONS OF PHYSICS
B
By
θ
Bx
Bx B0 cos t
By B0 sin t
Rotor
Vy = V0 sin ωt
Vx = V0 cos ωt
ELECTROMAGNETIC INDUCTION • 425
21.8 Exercises
Bar magnet S
moved along
dotted line
N
Coil G
While the magnet is moving toward the coil, the galvanometer deflects
to the right.
(a) Explain why this occurs.
(b) State and explain, using Faraday’s law, what happens when:
(i) the magnet is moved toward the coil at a higher speed
(ii) the magnet is moved away from the coil
(iii) the magnet is stationary inside the coil
(c) The experiment is repeated but with the galvanometer removed
so that the ends of the coil are not connected to anything. Discuss
whether the motion of the magnet near the coil has any effect now.
2. The Earth’s field has a strength of about 50 mT. A square coil of side
20 cm having 200 turns is placed so that its plane is perpendicular to the
Earth’s field. The coil has a total resistance of 50 ohms.
(a) Calculate the flux through the coil.
(b) Calculate the flux linkage through the coil.
(c) The coil is quickly turned (about an axis in the plane of the coil
perpendicular to the field) through 90° so that its plane is now
parallel to the Earth’s field. It takes 0.50 s to complete the rotation.
Explain why there is a current in the coil as it rotates.
(d) Calculate the average current as the coil is rotated.
(e) Calculate the charge that has moved around the coil during the
process.
426 • FOUNDATIONS OF PHYSICS
(f) How (if at all) would your answers to parts (d) and (e) be affected if
the coil was turned through 90° in 0.10 s instead of 0.50 s? Explain.
3. When a strong magnet is dropped through a copper tube, it falls at a
constant velocity. Use your knowledge of electromagnetism to explain
this as fully as you can. Use diagrams if this helps.
4. An aircraft is flying through due north at 200 ms1 in a place where the
vertical component of the Earth’s magnetic field is 40 mT. Its wingspan
is 50 m and its wings are made of a conducting material.
(a) Explain why there is a potential difference between its wing tips.
(b) Calculate the potential difference between its wing tips.
(c) Discuss whether the voltage generated could be used to power an
electrical device using wires connected to each wing tip.
5. (a) Draw a labeled diagram of a transformer that could step down a
main supply of 240 V 50 Hz A.C. to 20 V 50 Hz A.C. The primary coil
has 1200 turns.
(b) Explain how the transformer works.
(c) Explain the function of the core and explain why soft iron is a
suitable material for it.
(d) Explain how laminating the core reduces energy loss from the
transformer.
(e) Transformers are very efficient, but there are still losses. State three
ways in which energy can be dissipated by a transformer.
(f) A 20 V, 40 W lamp is connected to the secondary of the transformer.
Calculate the current drawn from the primary (assume that the
efficiency is 100% and neglect inductive effects).
(g) Sketch a graph to show how the voltage in the secondary coil is
related to the flux in the core. Explain how the graph illustrates both
Faraday’s and Lenz’s laws.
6. A coil with inductance and resistance is connected via a switch to a
power supply of 1.0 V. The graph below shows how the current in the
coil increases with time from the moment the switch is closed.
ELECTROMAGNETIC INDUCTION • 427
2.5
1.5
Current/A
0.5
0
0 0.05 0.1 0.15 0.2 0.25
Time/s
(a) Use the graph to calculate the inductance and resistance of the coil.
(b) Calculate the energy stored when there is a current of 2.0 A in the
coil.
(c) When the switch is opened, a spark is observed to jump across its
contacts. Explain why this occurs.
7. The diagram below shows an end view of a simple generator consisting
of a coil rotating at a constant rate in a uniform magnetic field. The A.C.
output is connected via slip rings to a purely resistive load. South
North
428 • FOUNDATIONS OF PHYSICS
22
ALTERNATING CURRENT
0 Time
0
430 • FOUNDATIONS OF PHYSICS
½ I0V0
Time
0
0
The power is always positive and peaks twice as frequently as the current
or voltage. The peak power is I0V0, and the average value of the power is
½ I0V0.
This can be shown using the trigonometric relation:
1 1
sin 2 A cos2 A
2 2
2 1 1
sin t cos2 t
2 2
The average value of a cosine term over a whole number of cycles is zero,so
the average of a sine-squared term is ½.
The average A.C. power is, therefore,
1
PAC I0 V0
2
The rms values of current and voltage (for sinusoidal variation) are:
I
Irms 0
2
V0
Vrms
2
So the average A.C. power is:
PAC Irms Vrms
By using the rms values instead of peak values, we can express the formula
for average A.C. power in the same way as we express the formula for D.C.
power. This means that an A.C. supply voltage of 240 V rms would light
a lamp to the same brightness as a D.C. supply of constant value 240 V.
ALTERNATING CURRENT • 431
In this sense the rms value is the “D.C. equivalent” value. However, an
A.C. voltage of 240 V rms actually peaks at 339 V. We can show where the
factor of 1/ 2 comes from:
V V0 sin t
V 2 V02 sin 2 t
1
V 2 V02 sin 2 t V02
2
V0
Vrms V 2 V02 sin 2 t
2
A.C. meters are usually calibrated to give rms values.
Current in
I0 capacitor
Voltage
across
V0
capacitor
0 Time
0
The ratio of V/I is of little use because it varies continuously (being zero at
some points and infinite at others). However, the ratio of the peak values
of voltage and current is constant;this is the reactance C(“kie-cee”) of the
capacitor.
V 1 1
C 0
I0 C 2 fC
The S.I. unit of reactance is , but reactance is not the same as resistance
because the peak values occur at different times.
Reactance can be used to find the peak or rms current in a capacitor if
we know the peak or rms A.C. voltage across it. However, the interesting
thing about reactance is its frequency dependence.
1
C
f
At low frequencies the reactance is high, becoming infinite at f 0 (i.e.,
for D.C.). This makes sense because the capacitor is actually a break in the
circuit so no D.C. can pass through it. At high frequencies the reactance
ALTERNATING CURRENT • 433
I0
Current in
inductor
V0
Time
Voltage
across
inductor
434 • FOUNDATIONS OF PHYSICS
L R
VL = IXL
VR = IR
I
VC = IXC
The supply voltage is equal to the sum of the phasors across the three
components.
22.3.2 Impedance
The impedance Z of a load that contains resistive, capacitive, and inductive
components is defined as the ratio of the peak voltage across the load to the
peak current in the load:
V
Z 0
I0
and is measured in .
For a series circuit V0 is found by adding the voltage phasors for the
components, and except in special cases, the voltage and current in the
circuit are not in phase, so the peak voltage and peak current occur at
different times. The general expression for the impedance of a series A.C.
circuit containing all three types of components is derived below:
VL = IXL
VC = IXC
VR = IR
VS
φ
I
A.C.
R C
VR VC
ALTERNATING CURRENT • 437
VR = I R
I
φ
VC = IXC VS
VS VR2 VC2 I R 2 C2
VS 1 2
Z R 2 XC2 R 2
I 2 fC
C 1
tan
R 2 fRC
22.4.2 RL Series Circuit
The behavior of an RL series circuit is also straightforward. As frequency
is increased, the reactance of the inductor increases, so VL, the circuit
impedance, and the phase difference between the supply voltage and the
current also increase, and the amplitude of the A.C. falls toward zero. The
maximum current is approached at low frequencies. In D.C., limit Imax
VS/R and the supply voltage and current are in phase.
VS
A.C.
R L
VR VL
VL = IXL VS
φ VR = IR
I
VS VR2 VL2 I R 2 X L2
VS 2
Z R 2 X L2 R 2 2 fL
I
L
tan C
R 2 fR
438 • FOUNDATIONS OF PHYSICS
A.C.
C
R
VR VC VL
VL = IXL
VS
φ
I
VR = I R
VC = IXC
1
2 f0 L
2 f0C
1 1
f0
2ð LC
At this frequency the current in the circuit has its maximum value:
VS
Imax
R
The circuit acts as a purely resistive load, and the current and supply voltage
are in phase.
The graph below indicates how the resistance, impedance, and circuit
current vary with frequency for an RCL series circuit.
Resistance
Impedance Impedance
Current
Resistance
0
0 f = f0 Frequency
For f < f0 the reactance of the capacitor is greater than that of the inductor,
and the circuit is said to be capacitive. For f > f0 the reactance of the inductor
is greater than that of the capacitor and the circuit is said to be inductive.
VS
A.C.
IR
IC
IL
IC = VS/XC
IR = VS/R
VS
IL = VS/XL
C L
When the capacitor is connected to the inductor, the voltages across them
must be equal, so that:
Q dI
L
C dt
Since I dQ/dt we can form a second-order differential equation for Q and t:
Q d 2Q
L 2
C dt
d 2Q 1
Q
dt 2 LC
This equation has exactly the same form as the equation of motion for
simple harmonic motion, so the solutions will have the same form too:
Q Q0 sin t
where
1
LC
The charge on the capacitor oscillates with a frequency,
1 1
f
2 LC
which is the same as the resonant frequency for an RCL circuit.
The voltage across the capacitor oscillates at the same frequency:
Q0
VC sin t
C
and the current is
dQ
I Q0 cos t
dt
442 • FOUNDATIONS OF PHYSICS
Analogies like this occur all over physics. They help us understand new
phenomena in terms of ones we are already familiar with, but they are also
useful in their own right. For example, this correspondence between the
mechanical and the electrical means that we can model mechanical systems
using electrical circuits—e.g., to test new designs for structures.
22.6 Exercises
The A.C. supply is adjusted until the lamp lights equally brightly from
both supplies.
(a) Calculate the peak value of the rms voltage.
(b) State the rms value of the A.C. voltage.
(c) Discuss whether the lifetime of the bulb (a filament lamp) will
depend on the type of supply used to light it.
3. Copy and complete the table below by adding resistance or reactance
values at each frequency:
Component/frequency 0 Hz (D.C.) 100 Hz 1000 Hz 10,000 Hz
100 resistor
100 F capacitor
100 H inductor
4. The circuit below contains a resistor and an inductor.
16 V rms
A.C.
22 Ω 6.5 mH
IR IL
A.C.
10 Ω 22 µF 50 µH
VR VC VL
(a) Calculate the impedance of the circuit at 0 Hz, 1.0 kHz, 5.0 Hz,
10 kHz, and 100 kHz.
(b) Explain what is meant by resonance and calculate the resonant
frequency for this circuit.
(c) Sketch a graph to show how the rms current varies with frequency
from 0 to 100,000 Hz.
(d) Sketch a graph to show how the phase difference between the
supply voltage and the current varies as the frequency changes from
0 to 100,000 Hz.
(e) Describe the energy transfers that take place in the circuit at
resonance.
CHAPTER
23
GRAVITATIONAL FIELD
Gm1 m2
F
r2
The minus sign indicates attraction.
G is the universal constant of gravitation, G 6.674 1011 Nm2kg2.
Newton was also able to show that the force of attraction between
spheres of uniform density is the same as the attraction between two point
masses placed at their centers. This means that we can treat object like
446 • FOUNDATIONS OF PHYSICS
planets and stars as point masses when considering orbital motion. It is also
important to note that, by Newton’s third law, the forces on each mass have
the same magnitude, even if the masses are different. For example, the
weight of an apple in the Earth’s gravitational field is the same as the weight
of the Earth in the apple’s gravitational field. It is also the case that the
gravitational force exerted on the Earth by the Moon is equal in magnitude
to the gravitational force exerted on the Moon by the Earth.
The resultant gravitational force on a body affected by the gravitational
fields of several other objects (e.g., the Earth affected by the Sun, Moon,
and other planets) is the vector sum of the gravitational forces from each of
the other objects.
M
GRAVITATIONAL FIELD • 447
r1
Uniform spherical shell of
mass M
r2
Inner Gaussian surface encloses
no mass, so by symmetry:
4 r12 g 0
g0
The negative sign again arises because of the attractive nature of the
gravitational field; the flux entering through a closed surface is equal to
4G times the total mass enclosed by the surface. In the same way as in
electrostatics, this form of Gauss’s law is particularly useful for situations
with spherical symmetry. For example, we can use it to show that the
gravitational field strength inside a hollow uniform spherical shell is zero
everywhere.
The field outside the surface is that of a point mass ME located at the
center of the Earth:
GM
g(r RE ) 2 E
r
We can determine an expression for the field strength inside the earth
by recalling that the field strength inside a hollow sphere is zero. This
implies that the field strength at distance r < RE from the Earth’s center is
that of the mass inside radius r.
g
r = RE
r
0
GME
gsurface = −
RE2
(559 km) the gravitational field strength is about 8.2 Nkg1, i.e., over 80%
of its surface value. The apparent weightlessness of astronauts inside an
orbiting spacecraft is not due to g being zero—it isn’t. They are in free fall
with the same acceleration as their spacecraft, so they do not experience a
reaction force to their own weight. The very fact that they are orbiting is
because of their weight!
23.2 Gravitational Potential Energy and Gravitational Potential
Gravitational potential energy is the energy a body has because of its position
in the gravitational field. When it is moved from one position to another,
energy is either transferred to or from gravitational potential energy. For
example, lifting a case from the floor and placing it on a table requires an
external agent to apply an upward force and to move this force; so work is
done on the case and its gravitational potential energy increases.
If an apple falls from a tree to the ground, a gravitational force (its
weight) acts on the apple and does work on it, so its gravitational potential
energy decreases.
23.2.1 Change in Gravitational Potential Energy
Consider a particle of mass m moving from point A to B in a gravitational
field as shown below.
A mg
m
θ
δx
The work done by the gravitational field to move it a short distance x along
the line AB is:
W mg cos x
450 • FOUNDATIONS OF PHYSICS
θ h
x
B
GPE AB m g.dx mgx cos mgh
A
While this is only exact when g is constant, it is a good approximation for
vertical displacements close to the surface of the Earth that are small
compared to the radius of the Earth.
M P mg m x
To infinity
r
δx
Increasing
Surface of Earth: potential
VG∼− 6.3 × 10−7 Jkg−1
M
v
r
Black holes
The speed of light is a limiting speed in the universe, so if the escape
velocity reaches this value, then nothing—not even light—can escape.
Black holes are objects that are sufficiently massive and compact that the
escape velocity at a certain distance from the object is equal to the speed of
light. This distance is called the Schwarzschild radius, and while a thorough
analysis of black holes requires general relativity, it is possible to derive
some useful results from Newton’s law of gravitation.
Consider a black hole of mass M. At a certain distance from the center,
called the Schwarzschild radius RS, the escape velocity is equal to the
speed of light, c.
2GM
vesc c
RS
2GM
RS
c2
Another way to look at this is to realize that a body of mass M will become
a black hole if all of its mass is compressed inside its Schwarzschild radius,
RS. For the Earth to become a black hole this radius is just under 1 cm! The
Earth would have to be compressed to the size of a table tennis ball!
A sphere of radius RS surrounding the black hole is called the event
horizon. This is because no events inside this radius can communicate with
the outside Universe. In a sense everything inside the event horizon has
been cut off from the rest of the Universe.
Earth
system was the fact that it did not explain the motions of the planets; each
planet’s motion had to be set up independently of the others, so there was
no simple unifying principle for the system.
In 1542, Nicolaus Copernicus published his De Revolutionibus Orbium
Coelestiium(“On the Revolutions of the Heavenly Spheres”) in which he
argued that planetary motions could be explained more simply by assuming
that the Sun was at the center of the system (a heliocentric model). This
idea was controversial because it moved the Earth from the center of the
universe and conflicted with contemporary religious views about the nature
of the universe.
Johannes Kepler was a German mathematician and astronomer and
strong proponent of Copernicus’s heliocentric model. Kepler used the
detailed astronomical observations of the Danish astronomer Tycho Brahe
to develop three laws of planetary motion. He replaced the complex cycles
and epicycles of Ptolemy and Copernicus with elliptical orbits. The fact that
all of the planets could then be described by the same mathematical laws
suggested that there was an as-yet-undiscovered law that governed all of
their motions.
Perihelion: Aphelion:
closest point to farthest point
Sun, greatest from Sun,
orbital speed lowest or bital
speed
GMm
F=
r2
M
mv2 GMm
2
r r
2 r
v
T
3
r GM
2
T 4 2
458 • FOUNDATIONS OF PHYSICS
that need to scan the entire surface of the Earth because they pass over
a different strip of the surface on each orbit and can observe the entire
surface in a 24-hour period.
GPS satellites are further out, at an altitude of about 20,000 km
and have an orbital period of about 12 hours. The GPS system creates a
constellation of satellites around the globe and relies on several satellites
being visible to a GPS receiver at any point on the Earth’s surface at any
time. The minimum number of operational satellites required to achieve
this is 24, but the target number is 33. Over 70 GPS satellites have been
launched, but not all of them are still operational.
Geostationary satellites are placed in an equatorial orbit at an altitude
of about 35,800 km (r 42,200 km) so that they have an orbital period of
24 hours. This ensures that they remain stationary above the same point
on the Earth’s equator because they complete one orbit as the Earth
itself rotates once. This is called a geosynchronous orbit and is used for
communications satellites. The fact that they remain stationary in the sky
means that ground-based antennae do not have to track them.
Small squares
represent small
masses m at each
D
position
Body 1 of
radius r Gravitational field
A B C
Distance from
E
center of body 1 to
tory
center of body 2 = R
l trajec
Mass of body 2 = M
Orbita
The tidal forces shown are those that act in the freely falling reference
frame.
ctory
al trajeOrbit
GRAVITATIONAL FIELD • 461
Tidal forces are differential forces and are defined as the difference between
the actual gravitational force FA on a small mass m and the force that would
be exerted on it if it was at the center of the body, FB. For example, the tidal
force on the mass at A is given by:
GMm GMm GMm(2 Rr r 2 )
tidal force A FA FB
R r 2 R 2 R r 2 R 2
For r << R, this simplifies to:
2GMmr
tidal force A
R3
The negative sign here indicates that the direction of the tidal force is away
from the center of the body. This obeys an inverse-cube law with respect to
distance, so tidal forces will only be significant for bodies that are relatively
close together.
The Moon’s tidal effect is roughly double that of the Sun, but both are
significant. As the Earth, Moon, and Sun change their relative positions,
the tidal effects of the Moon and Sun on the Earth’s oceans can reinforce
or weaken one another. The largest or “spring tides”occur when the three
bodies are aligned and the smallest or “”neap tides”occur when the lines
from the Sun and Moon to the Earth are perpendicular. These effects are
exaggerated in the diagrams below, which are not to scale!
Earth Sun
Moon
Moon
Relative positions
for spring tides
The period of the Earth’s rotation is significantly less than the period of the
Moon’s orbit, so the Earth effectively rotates under the tidal bulges and
there are approximately two high tides in each 24-hour period. However,
the Earth’s rotation pushes the bulge slightly ahead of the Moon’s position.
Moon
Earth
Sun
Relative positions
for neap tides
Moon
GRAVITATIONAL FIELD • 463
This has the effect of creating a retarding torque on the Earth that reduces
its rotation period, making day length gradually increase and reducing its
angular momentum. However, angular momentum for the system must be
conserved, so the radius of the Moon’s orbit also increases. The present rate
of increase of day length is about 1.7 ms per century, and the Moon’s orbital
radius increases by about 3.8 cm per year. When the Moon first formed,
the length of a day on Earth was only about 5 hours. The long-term effect
of this interaction is that the Moon’s orbital period and the Earth’s rotation
period will eventually become the same—they will be tidally locked.
Ball projected on
surface
464 • FOUNDATIONS OF PHYSICS
follows a curved path around the mass. The geometry of the rubber sheet
has changed and the path followed by the ball has changed too.
Seen from above the presence of the central mass has caused the
deflection of the ball. As far as the ball is concerned, it has simply followed
the curvature of the surface. John Wheeler summarized Einstein’s theory
of general relativity by saying that:
Q Matter tells space how to curve.
Q Space tells matter how to move.
In Newtonian mechanics, an object continues to move in a straight line
unless acted upon by a resultant external force. In Einstein’s universe,
objects move along the shortest paths in curved space–time. These paths
are called geodesics. A good analogy is found in the paths followed by
aircraft on long-haul flights around the Earth. The shortest path between
two points on the Earth’s surface is an arc of a great circle around the center
of the Earth. According to Einstein’s theory, the Earth’s orbit around the
Sun is actually a geodesic in space–time. The Earth is not being deflected
by a gravitational force; it is simply following the local curvature of space–
time and that curvature has been caused by the presence of the Sun.
rocket feel exactly the same as if they were in a uniform gravitational field
of strength g a. Now imagine that a horizontal beam of light enters the
rocket through a port hole on one side and leaves through a port hole on
the other side. Since the rocket has constant acceleration the beam will
follow a parabolic path inside the rocket. Now imagine that the same rocket
is at rest on the surface of a planet with surface gravity g. If a beam of
light enters horizontally through the same window, it must, according to the
equivalence principle, follow the same parabolic path as before and deflect
downwards toward the surface of the planet. If this was not the case, then
an observer inside the rocket could tell whether the rocket was at rest in a
gravitational field or accelerating simply by observing the motion of light
beams.
a
g
were in agreement with Einstein’s theory. The theory has been tested
many times and in many different ways since then, and results have all
been in agreement with theory. General relativity is regarded as one of the
fundamental theories of physics.
M
A B
T V
T ~
c2
V
T T1 2
c
The greater the change in potential, the greater the difference in clock
rates.
Gravitational time dilation becomes extreme in very strong gravitational
fields (where the equation above is not valid). If we were to observe clocks
close to the event horizon of a black hole, they would tick very slowly
indeed, and time would stop at the event horizon. In this sense anything
that actually enters the black hole has passed beyond the infinity of time in
the external universe!
Direction of
gravitational wave
r
te
lit
sp
am
Be
Laser
Mirror
beam
Superposed beams
to detector
Monochromatic light is split so that half of the beam travels along the
“vertical” arm and half along the “horizontal” arm (in reality, these will both
be horizontal—e.g., NS and EW, and perhaps several kilometers long). The
returning beams superpose and interfere. When a gravitational wave passes,
the lengths of the arms fluctuate, introducing a periodic phase difference
that makes the interference pattern change at the frequency of the wave.
These fluctuations were finally discovered at the Laser Interferometer
Gravitational Waves Detector (LIGO) in 2015 and announced in February
2016 (100 years after they were predicted). The signal detected by LIGO
was consistent with the source being the inward spiral and coalescence of a
pair of black holes of around 36 and 29 solar masses. This discovery opens
GRAVITATIONAL FIELD • 469
the door to a new age of astronomy in which the most violent cosmic events
are observed using gravitational wave “telescopes.”
23.6 Exercises
(e) the escape velocity if the mass is projected away from the planet
from rest at point P
(f) the escape velocity if the mass is projected away from the planet
from rest at point Q
(g) the kinetic energy that must be supplied to m if it is to be projected
tangentially from Q and then enter a circular orbit
3. (a) Derive an equation for g as a function of radius r from the center
of the Earth assuming the Earth’s density is constant and that the
Earth is a perfect sphere.
(b) The Earth is actually an oblate spheroid, flattened at the poles.
Discuss how this affects the value of g at different points on the
surface.
(c) The fact that the Earth rotates on its axis means that part of the
gravitational force holding us to its surface must provide the
centripetal force needed for us to rotate with the Earth. This affects
our measured weight. Calculate the percentage change in weight of
a man at the equator. G 6.7 10–11 Nm2kg–2, ME 6.4 1024 kg,
RE 6.4 106 m, T 89,800 s (rotation period).
4. Consider the Earth and Moon as an isolated system. At what distance
from the Earth (along a line connecting it to the Moon) will a space craft
experience no resultant gravitational force? G 6.7 10–11 Nm2 kg–2,
ME 6.4 1024 kg, MM 7.2 1022kg, RE 6.4 106 m, r 3.8 108 m
(separation of centers of Earth and Moon).
5. The Moon orbits the Earth in about 27.3 days at a distance of
380,000 km.
(a) State Kepler’s three laws of planetary motion.
(b) Derive Kepler’s third law for a planet in a circular orbit.
(c) Kepler’s laws were originally stated about the planets in our solar
system, but they can be applied to any similar system where
satellites orbit a central body. Explain how the constant in Kepler’s
third law is affected when the law is applied to different systems.
(d) Use Kepler’s third law to calculate the distance from the Earth at
which an artificial satellite would be geostationary (i.e., have an
orbital period of 24 hours or 1 day).
(e) Why must geostationary satellites be placed in equatorial orbits?
(f) Use Kepler’s third law to derive a lower limit for the period of an
artificial Earth satellite.
GRAVITATIONAL FIELD • 471
24
SPECIAL RELATIVITY
Laboratory moves at
velocity v through ether
Half-silvered
mirror (beam Mirror
splitter)
Monochromatic
light
Interference pattern
delays the light along both arms, but the delay is greater along the path
parallel to motion through ether. If the apparatus is rotated through 90,
the greater delay should shift to the other arm of the interferometer and
the interference pattern should shift. The apparatus used by Michelson and
Morley was sensitive enough to detect shifts caused by relative velocities
comparable to the Earth’s orbital speed, so they expected to detect the
Earth’s motion through ether.
Despite carrying out the experiment carefully and repeating it at
different times of the year, they could not detect any shift in the position
of the interference fringes. Similar experiments have been carried out
using more modern equipment but with the same results—i.e., no shift in
interference fringes. This is called a “null result” and is probably the most
famous null result in the history of physics. All attempts to detect ether
have failed, and the fact that there is no delay in experiments such as the
476 • FOUNDATIONS OF PHYSICS
they undermine the idea of absolute space and, as we shall see, the idea that
there is an absolute time.
L c
c
Observer B
Mirror Mirror
c Observer A
Mirror
478 • FOUNDATIONS OF PHYSICS
′/2
cT′
Mirror Mirror
vT′/2 vT′/2
2L
For A, the time between ticks of his own light clock is T where T c
Also for A, let the time between ticks on the moving (B’s) clock be T.
Now use Pythagoras theorem to find the relationship between T and T.
Considering either one of the two right-angled triangles above:
c2 T2 v2 T2
L2
4 4
SPECIAL RELATIVITY • 479
γ=1
γ=0 Velocity of moving
v=0 v=c reference frame
480 • FOUNDATIONS OF PHYSICS
For relative velocities small compared to the speed of light, the gamma
factor is negligible, so relativistic effects can be neglected. This allows us
to assume that we all share the same time and space and can agree over
measurements of time intervals and distances. For example, even at the
speed of a jet airliner (about 300 ms1) the value of the gamma factor is just
1.0000000000005!
If A sees B’s time slow down, then what does B see when he looks back
at A? Both of the observers are in inertial reference frames, so the laws of
physics are the same for both of them and the speed of light is constant
in their reference frame. B will see A moving in the opposite direction at
speed v and will see the light path in A’s clock stretched out. B will see A’s
time run slow by the same time dilation factor that A saw B’s clock run slow!
The effect is the same for both observers. At first sight this seems to lead to
a contradiction, but while the clocks are in motion, they cannot be placed at
rest next to one another to compare them to find out which one has gained
or lost time. To do this we would have to take at least one of them on a
round trip.
v
B
A
B
v
Earth Star
run slowly and so concludes that A should be the younger one when they
reunite. Is this a valid way to describe the journey? In fact, it is not. If we
ignore the relatively low velocity of the Earth, then A stays in the same
inertial reference frame throughout and so A’s point of view is valid. B,
however, undergoes three separate periods of acceleration: as she leaves
Earth; as she turns around and accelerates back toward the Earth at the
distant star; and then again as she comes to rest back on Earth. To describe
the journey from B’s point of view, we must take into account the effects
of these periods of acceleration. From B’s point of view she will experience
inertial forces during these periods that are just like increases and decreases
in the local gravitational field. These changes introduce gravitational time
dilation effects (see Section 23.5.3) that are not present for A; so B’s view is
not equivalent to A’s.
The traveling twin (B) will be younger when they reunite and the
apparent paradox is resolved.
Clock A Clock B
same time according to local clocks in this space, then those events are
simultaneous in this reference frame.
The question is: will clocks synchronized in one inertial reference frame
be synchronized in all inertial reference frames, regardless of their motion?
Consider the process above carried out in a moving laboratory and observed
from another inertial reference frame. For simplicity we will assume that
the motion is parallel to the separation of the clocks and that the flash occurs
at the moment the light source passes the stationary observer. The dotted
clocks show their initial positions for the observer.
Clock A Clock B
Stationary
observer
For the stationary observer the flash of light leaves the source and travels
away in both directions at the speed of light. However, clock A is moving
toward the source position and clock B is moving away from it. Light reaches
clock A first so that clock starts first. Clock B starts later. The two clocks are
not synchronized in the stationary observer’s reference frame even though
they are synchronized in the moving reference frame. For the stationary
observer the synchronization “error” increases with the separation of the
clocks.
This is a profound result. Two events that occur near A and B when the
clocks show the same time are simultaneous in the moving reference frame
but occur at different times for the stationary observer. This means that
one moment in a particular inertial reference frame is spread over different
times in different inertial reference frames. This destroys the concept of
absolute time, which assumes a unique progression of moments for all
observers.
It should again be noted that there is nothing special about the so-called
“stationary observer.” He is just in a different inertial reference frame and if
SPECIAL RELATIVITY • 483
he were to synchronize two distant clocks in his own reference frame, then
an observer in the “moving” reference frame would think that there was a
synchronization error.
v v
L L
From the Earth’s reference frame From the space craft’s reference
(at rest in the galaxy), the diameter frame (moving at velocity v relative
is L but the spacecraft is contracted to the galaxy), the diameter is
because of its motion contracted to L′ but the space craft
has its proper length
(x, y, z, t)
(x′, y′, z′, t′) Inertial reference frame
moving in positive x-direction
at velocity v
y y′
x
x′
The diagram above shows the coordinate systems used in two inertial
reference frames. The primed frame is moving at velocity v in the positive
x- and x-direction. At time t 0 the origins of both coordinate systems
coincides. An event with coordinates (x, y, z, t) in the unprimed frame has
coordinates (x, y, z, t) in the primed frame. Lorentz transformation allows
us to transform from one frame to the other.
Q Lorentz transformation—to transform from the primed to the unprimed
reference frame:
x x vt
y y
z z
vx
t t 2
c
Q Inverse Lorentz transformation—to transform from the unprimed to
the primed reference frame:
x x vt
y y
z z
vx
t t 2
c
SPECIAL RELATIVITY • 485
These equations can be used to derive the formulae for time dilation, length
contraction, and synchronization differences.
We will use them to derive an equation for relativistic velocity addition.
x x′
If u and v are small compared to the speed of light, then this reduces to the
familiar equation for velocity addition: w (u v). The relativistic velocity
addition equation has the interesting property that no two sub-light speeds
can be added together to produce a speed greater than the speed of light.
However long something accelerates, it will approach—but never quite
reach—the speed of light. Consider the limiting case where u v c:
u v c c 2 c
w c
uv 2 2
1 2 1 c
c c2
24.5 Mass, Velocity, and Energy
24.5.1 Mass and Velocity
When an object is accelerated at a constant rate, as measured in reference
frames momentarily at rest with respect to the object, its acceleration
measured by a stationary observer will gradually fall. This is because the
time taken for the same increase in velocity is longer for the stationary
observer as a result of time dilation.
Consider a particle of charge q and mass m0 (as measured in a reference
frame at rest with respect to the particle) accelerated by a constant electric
field of strength E. The particle increases its velocity from v to v v in a
time t’ in a reference frame moving with the particle at speed v relative
to a stationary observer. The same increase in speed will take a longer time
t t for the stationary observer.
In the moving reference frame:
Eqt
v
m0
In the reference frame of the stationary observer:
Eqt Eqt
v
m m
Combining these two equations:
m m0
This shows that the mass will increase with velocity. The mass measured in
the rest frame of the particle is m0, and this is the fundamental mass of the
particle. The additional mass is related to the kinetic energy of the particle
as we shall see.
SPECIAL RELATIVITY • 487
24.6.1 Invariants
The fact that measurements of lengths and times made in different inertial
reference frames differ from one another can be disconcerting. Physicists
like to discover quantities that are the same for all observers. Such quantities
are called invariants.
Consider two points on a 2D surface. The location of each point can
be described by stating the coordinates of each point relative to a fixed
coordinate system consisting of two perpendicular axes. However, there is
an arbitrary choice about which set of perpendicular axes to use, and the
coordinates will be different for different choices, as shown in the diagram
below.
y
y′ A: (xA, yA) (x′,
A
y′A)
B
yB
y′A A
yA
xA x
xB
x′A
x′B
x′
While the coordinates change when the axes are rotated, the distance AB
does not. It is an invariant under rotation of the axes:
SPECIAL RELATIVITY • 489
2 2
AB xB xA 2 yB yA xB xA 2 yB yA
It is also possible to write down a set of transformation equations
to convert measurements made relative to the unprimed axes into
measurements relative to the primed axes. It was the similarity between
these transformation equations and the Lorentz transformation equations
that alerted Minkowski to the idea that changing from one inertial reference
frame to another is like a geometric rotation.
24.6.2 Space-Time
Space-time is a 4D continuum where the fourth dimension is related to time.
Whereas a point in space is defined by three coordinates (x, y, and z) a point
in space-time, called an event, has four (x, y, z, and t). In the S.I. system,
distances and times are measured in different units—meters and seconds—
so instead of simply using the time in seconds in a space-time diagram, we
multiply the time by the speed of light so that the units on the time axis are
also meters. Minkowskirealized that while measurements of time intervals
and distances will differ for observers in different inertial reference frames,
the Lorentz transformation ensures that the 4D “distance” between two
events is the same for all inertial observers, regardless of their velocity.
This 4D distance is calculated by Pythagoras theorem, but by treating the
time differences as if they are mathematically imaginary quantities;in other
words, the fourth dimension is actually the ict dimension where i is the
square root of –1.
ict Worldline of rocket
B moving at velocity v
ictB
A
ictA
x
xA xB
AB xB xA 2 ictB ict A 2
Since the interval is an invariant, it must be the same using the coordinates
of events A and B in any inertial reference frame, including that of the
rocket. However, in the rocket’s reference frame both A and B occur where
the rocket is—in other words, at x’ 0 but at different times tA’ and tB’. We
cannot assume that the times are the same as those in the other reference
frame. The space-time diagram in the rocket’s reference frame looks like
this:
ict′
ict′B B
Worldline of rocket at
rest in its own inertial
reference frame
ict′A A
0 x′
AB ictB ictA 2
We can equate the two expressions for the interval (because it is an
invariant):
ictB ictA 2 xB xA 2 ictB ict A 2
We can also express the distance traveled in the unprimed frame in terms
of velocity:
xB x A v t B t A
SPECIAL RELATIVITY • 491
After some algebraic rearrangement, we can express the time elapsed in the
rocket frame (primed frame) in terms of the time elapsed in the rest frame
(unprimed frame)
2 2 2
c2 tB tA v2 t B t A c2 t B t A
v2 t t
tB tA 1 B A
2 tB t A
c
This is time dilation. The time elapsed in the moving reference frame is less
than the time elapsed in the rest frame by the gamma factor.
This illustrates how a geometrical approach in flat space-time reproduces
the standard results of special relativity.
24.7 Exercises
X A Y v
Z B
25
ATOMIC STRUCTURE AND
RADIOACTIVITY
Alpha
source Gold foil
Later work showed that the charge on a nucleus is equal to Ze where Z
is the atomic number, equal to the position of the atom in the periodic table
of elements. The forces acting on the scattering nucleus and the scattered
alpha-particles are shown below.
The diagram below shows typical alpha-particle trajectories close to a
nucleus. On this scale the outside of the atom would be about 10 m away!
Therefore, the vast majority of alpha-particles, which pass much further
away from the nucleus than those shown in this diagram, experience weak
electrostatic forces and suffer small deflections.
2Ze2
F=
4πε0r
f alpha
Path o +2e
r
a
particle
+Ze
2Ze2
F=
Nucleus 4πε0r
Incident alpha
particles
Increasing
impact
Gold nucleus parameters
Closest
approach d
First minimum
of diffraction
High-energy
electron beam θ
Target
ATOMIC STRUCTURE AND RADIOACTIVITY • 499
The first minimum of the diffraction pattern for a spherical object is at:
1.22
sin
D
Where D is the diameter of the target nucleus and is de Broglie wavelength
of the electrons.
1.22
D
sin
The wavelength of electrons is calculated from de Broglie equation:
h
p
However, in order to obtain a wavelength comparable to nuclear dimensions,
the electron must be accelerated to very high energy, so relativistic equations
must be used to determine electron momentum. In particular,
2
p2 c2 E2 m0 c2
the energy required E is much greater than rest energy m0c2, so the equation
simplifies to:
E
p
c
giving:
1.22 hc
D
E sin
25.1.4 Nuclear Atom
The Rutherford nuclear atom consists of a small positively charged nucleus
containing protons and neutrons (collectively called nucleons) and most
of the mass of the atom surrounded by orbiting electrons. The number of
protons in the nucleus is called the atomic number and corresponds to the
position of the element in the periodic table. The atomic number is also
equal to the number of orbiting electrons in the neutral atom. If an atom
gains or loses an electron, it becomes an ion. The ratio of the atomic radius
to the nuclear radius is about 20,000 in most atoms.
Nuclear nomenclature
Q Z—the atomic number, equal to the number of protons in the nucleus
and the position of the element in the periodic table
Q A—the atomic mass number or nucleon number, equal to the number
of protons plus the number of neutrons in the nucleus
Q N A Z—the neutron number, equal to the number of neutrons in the
nucleus
500 • FOUNDATIONS OF PHYSICS
are all isotopes of carbon. They are chemically identical but have slightly
different physical properties because of their differing masses. Carbon-14
is also unstable and, in common with many neutron-rich nuclei, it decays
by beta-minus emission.
α α
γ γ
β β
E-field B-field
upwards into page
Sphere at radius r
from source
Source
502 • FOUNDATIONS OF PHYSICS
δx
I I + δI
x
The diagram above shows part of the path of a gamma-ray beam through
matter. Then intensity of the beam changes by an amount I as it passes
through a short thickness x of material. Since the beam is being absorbed,I
is negative. The proportion absorbed per unit length is constant; so:
I
I x
where is the absorption coefficient (a constant) for the medium with units
m1.
In the limit that x 0 this becomes the first-order differential
equation:
dI
I
dx
whose solution is:
I I0 ex
Intensity falls exponentially from its initial value I0. As with all exponential
changes, it has a constant proportion property in that the intensity will
ATOMIC STRUCTURE AND RADIOACTIVITY • 503
always fall by the same fraction in the same distance. We can therefore
derive an expression for the half-thickness x1/2 of the material, that is, the
thickness of material that will reduce any initial intensity of the radiation
by 50%.
1
ex1/2
2
ln 2
x1/2
The absorption coefficient and half-thickness depend on the energy of
the gamma-rays and the nature of the medium. Half-thickness (or “half
value layer,” HVL) is a useful quantity to use in radiological protection
when comparing the effectiveness of different shielding materials or the
penetrating power of radiation.
X-ray penetration also decays exponentially with distance, and the table
below gives half-thicknesses of human tissue, aluminum, and lead for X-rays
of three different energies.
Medium Half-thickness/mm
30 keV photons 60 keV photons 120 keV photons
Human tissue 20 35 45
Aluminum 2.3 9.3 17
Lead 0.02 0.13 0.15
Cloud
Source chamber
E1
E2
ATOMIC STRUCTURE AND RADIOACTIVITY • 505
cal
Medical
R
Radon
Cosmic rays
dN
N
dt
dN
N
dt
N is the number of nuclei present at time t and the minus sign indicates that
the number is falling with time.
is the decay constant and depends on the nuclide being considered. It
has an S.I. unit s1 and represents the probability of decay per unit time in
the limit of small time intervals.
This is a first-order differential equation that can be solved by separation
of variables. Its solution represents how the number of nuclei in the sample
varies with time:
N N0 et
This is exponential decay. The term et is equal to the fraction of the
initial number of nuclei remaining after time t.
510 • FOUNDATIONS OF PHYSICS
The half-life of the nuclide is the time taken for the number of nuclei
in the sample to halve.
1
et1/2
2
ln 2
t1/2
The graph below shows the decay curves for three nuclides, A, B, and C.
Their half-lives are shown on the time axis. A has a half-life of about 0.35y,
B has a half-life of about 0.69 y, and C has a half-life of about 1.4 y.
% of N0
100
90
80
70
60
50
40
30 A
20
10
B
0 C
0 1 t1(A) 2 3 4 5
t1(C) t1(B)
2 2
2
Kinetic
energy
Cut off
This differs from alpha decay where, with only two emerging particles, the
energy is shared in a definite way and the alpha-particles have a discrete
energy spectrum.
discrete quantum jumps to lower states until it reaches its lowest or ground
state. Each quantum jump results in the emission of a gamma-ray photon
whose energy is equal to the difference in energy between the nuclear
energy levels.
E hf
This results in a discrete energy spectrum for the gamma-rays.
Cobalt-60 decays to nickel-60 by emitting a beta-minus particle.
However, the decay can occur to one of two excited states of the nickel
nucleus (indicated by asterisks below) each of which then decays by emitting
a gamma-ray.
Energy
60
27 Co
Beta-minus:
0.31 MeV
60
27 Ni
∗∗
Beta-minus:
1.48 MeV Gamma-ray: 1.17 MeV
60
27 Ni
∗
60
27 Ni
Alpha source
Ion pair
+ −
Positive ions are
attracted to the cap
Negatively and negative ions are
charged repelled from it
electroscope
ATOMIC STRUCTURE AND RADIOACTIVITY • 515
Variable H.T.
Alpha source supply
Fine wire
Gap of ~ 1 mm
between this anode and the surrounding cylindrical cathode, there is a radial
electric field inside the tube. When ionizing radiation passes through the
low-pressure gas, some of the gas molecules become ionized. The electric
field between the anode and cathode accelerates these ions and there is a
small pulse of current in the external circuit. This generates a voltage pulse
that can be detected and counted.
Alpha- and beta-radiation enters the Geiger-Müller tube through a thin
end window, but gamma-radiation can also enter through the walls of the
tube. All of the pulses are identical, regardless of the type of radiation, so
a Geiger counter does not indicate the type of radiation detected. There
is also a “dead time” following each voltage pulse. During this time the
detector will not register any further ionization events. This limits the
maximum count rate that can be measured accurately.
Voltage pulse
Central
wire anode
(+) −
Variable supply
400–600V
Thin end window
The count rate from a Geiger counter should not be confused with the
activity of a source. The latter is the number of disintegrations per second
inside the source. The count rate on the Geiger counter is related to this
but is usually only a small fraction of the activity because of the emissions
that miss the counter, are not detected by it, or are absorbed inside the
source or between the source and the detector.
Count rates are usually recorded as counts per minute (cpm) and
often have to be corrected for the average background count where the
experiment is being carried out.
ATOMIC STRUCTURE AND RADIOACTIVITY • 517
GM tube
Sealed Counter or
source rate meter
This method assumes that there has been no contamination of the sample
and that the fraction of carbon-14 in the atmosphere has remained constant
over the time period being measured. Radiocarbon dating methods are
calibrated against other methods such as dendrochronology (tree ring
counting).
25.8 Exercises
Carbon-13
Carbon-14
Oxygen-16 8
Iron- 56
26 Fe
Gold- 79 197
Uranium-235 92
Uranium-238
(b) Use examples from the table to explain what is meant by an
“isotope.”
(c) The heavier nuclei, such as iron, gold, and uranium, have an excess
of neutrons. Suggest a reason for this.
4. A radioactive rock is tested in a school laboratory. Here are the results:
Setup Number of counts in 5 minutes
No rock present 100
Rock alone present 900
Rock behind card 402
Rock behind 2mm aluminum sheet 398
Rock behind 1 cm lead 123
Rock alone present 24 hours later 197
(b) The first three nuclides in the series are: uranium-238 (atomic
number 92), thorium-234, palladium-234, and uranium-234. Write
down balanced nuclear transformation equations for the first three
decays.
(c) The half-lives of the four nuclides above are: 4.51 109 years, 24.1
days, 77s, and 2.47 107 years. Suggest how the relative abundances
of each nuclide compare.
(d) The isotope bismuth-214 (Z 83) can decay by either alpha or beta-
minus decay, but whichever decay it undergoes, the series reaches
lead-210 (atomic number 82) in two steps. Write down the nuclear
transformation equations to show how this occurs by each route.
(The element with Z 81 is thallium and the element with Z 84 is
polonium.)
11. An experiment was carried out using a Geiger counter to monitor the
activity of a radioactive source. The table below gives the average cpm,
corrected for background, recorded during the experiment.
Time/s Activity/1013 s1
0 1.64
50 1.36
100 1.13
150 0.935
200 0.775
250 0.643
300 0.533
50 0.422
400 0.366
450 0.304
500 0.252
26
NUCLEAR PHYSICS
Repulsion
Attraction
nucleus. The B.E. of the nucleus is equal to the mass defect multiplied by
the speed of light squared.
Mass defect (sum of masses of free nucleons) (mass of nucleus)
m Zm p Nm n Mnucleus
B.E. c2m
B.E. per nucleon:
B.E./A c2m /A
When carrying out calculations of nuclear B.E. it is important to use nuclear
mass and not atomic mass. Most tables of data give atomic masses so the
mass of Z electrons must be subtracted from this. The data below has been
used to calculate the B.E. and B.E. per nucleon of an oxygen-16 nucleus.
Particle Mass/kg
Electron 9.10941031
Proton 1.67261027
Neutron 1.67501027
Oxygen-16 atom 26.56761027
Mass of oxygen-16 nucleus (mass of oxygen16 atom)
8 (mass of electron)
26.5603 1027 kg
Mass of nucleons 8 (mass of neutron) 8 (mass of proton)
26.7808 1027 kg
Mass defect m (mass of nucleons) (mass of nucleus)
0.2205 1027 kg
B.E. c2m 1.9845 1011 J 124.0 MeV
B.E. per nucleon B.E./16 7.752 MeV/nucleon
NUCLEAR PHYSICS • 525
234
90Th 234.04364 u
238
92 U 238.05082 u
The mass defect for the reaction is the difference between the nuclear
masses on each side of the equation. To calculate the nuclear masses we
need to subtract 92 me from the left-hand side and (90 2)me from the
right-hand side. These electron masses cancel so we can work directly with
the atomic masses:
m 238.05082 u (234.04364 4.002603) u 0.00458 u
The energy released is E 932.9 0.00458 4.27 MeV
This is shared between the alpha particle and the recoiling nucleus
and linear momentum must be conserved so (in the reference frame of the
original uranium-238 atom) the two must travel in opposite directions and:
m nucleus vnucleus m v
526 • FOUNDATIONS OF PHYSICS
so that:
m 234
v nucleus vnucleus v
m 4 nucleus
The alpha particle travels much faster than the recoiling nucleus and carries
away most of the kinetic energy.
2
1 2 1 mnucleus 2
E m v m v
2 2 m nucleus
m 234
nucleus Enucleus E
m 4 nucleus
The alpha particle gets 234/238 E.
Beta-minus decay of carbon-14 is:
14 14 0
6 C 7 N 1 00
The relevant atomic masses are:
Atom Mass
0
1 0.000549 u
14
6C 14.003242 u
14
7N 14.003074 u
Beta-minus
=
N
emitters
Beta-plus emitters
and electron capture
Proton number: Z
Q Beta-minus emitters: N/Z too high—neutron-rich nuclei that approach
stability by converting a neutron to a proton.
Q Beta-plus emitters and electron capture nuclei: N/Z too low—proton-
rich nuclei that approach stability by converting a proton to a neutron.
Q Alpha emitters: N/Z too low—heavy proton-rich nuclei that approach
stability by reducing both N and Z by 2. Since N Z this reduces the
ratio of N to Z.
528 • FOUNDATIONS OF PHYSICS
Q Nuclear fission: some heavy nuclei close to the top of the band can
undergo induced or spontaneous nuclear fission to create pairs of
neutron-rich daughter nuclei which lie about half-way down the band
and are beta-minus emitters.
The diagram below shows the effect of alpha and beta decays on a plot of
proton number against neutron number:
Neutron
number
Z− 2 Z− 1 Z Z+ 1 Z+ 2
Z+ 2
Z+ 1
β+
ec
Z
β−
α
Z− 1
Z−2
Proton number
4.0
3
1H
2.0
2
1H
Nucleon number
0.0
50 100 150 200 250
NUCLEAR PHYSICS • 529
It is better to compare B.E. per nucleon rather than total nuclear B.E.
because the latter depends on the number of nucleons in the nucleus so
that a large value does not necessarily mean that the nucleus is particularly
stable.
Q B.E. per nucleon increases rapidly with nucleon number for light nuclei.
Q The curve has a peak value that occurs for iron-56. This is the most
stable nuclide.
Q For nuclides heavier than iron-56 the B.E. per nucleon gradually falls.
Q Most nuclides (from Oxygen to Uranium) have a B.E. per nucleon
between 7.5 and 8.5 MeV/nucleon.
Q Some light nuclides, such as helium-4, carbon-12 and oxygen-16 have
particularly large B.E. per nucleon compared to other nearby nuclides.
Nuclear fission
Nuclear fusion
A
530 • FOUNDATIONS OF PHYSICS
The initial steepness of the curve shows that nuclear fusion releases
more energy per kilogram of fuel than nuclear fission. However, the high
nucleon number of the fissioning nucleus shows that nuclear fission releases
more energy per reaction than nuclear fusion.
We need to work with nuclear masses, but since there are equal numbers
of electrons to subtract from each side of the equation we can, once again,
simply use the atomic masses.
Mass defect (235.044 1.009) u (143.923 89.920 2 1.009)
u 0.192 u
The energy released is E 932.9 0.192 179 MeV
The fact that additional neutrons are emitted could lead to further
nuclear fission reactions. The Hungarian physicist Leo Szilardrealized that
it would be possible to initiate a chain reaction if more than one neutron per
fission on average went on to cause further fission reactions.
A chain reaction can release a huge amount of energy. If it is allowed to
run out of control this energy is released explosively—this is the principle
NUCLEAR PHYSICS • 531
Uranium-235 nucleus
undergoing fission
One way around this problem is to use enriched uranium, i.e., uranium
with a higher content of uranium-235, but this is difficult to obtain in
large quantities because uranium-235 and uranium-238 are isotopes of
the same chemical element and so have the same chemical properties.
Various separation techniques have been used but the most successful
involves centrifuging uranium hexafluoride, a gaseous uranium compound,
to increase the concentration of uranium-235. Since this method can be
used to produce fuel for nuclear reactors (about 4% enrichment) and for
weapons (about 80% enrichment) it is very difficult to distinguish between
the peaceful production of enriched uranium and its production for atomic
bombs.
Conventional
explosive
(detonator)
Sub-critical assembly of
plutonium-239
Compressed, critical
assembly of
plutonium-239
While atom bombs are incredibly powerful weapons they release far less
energy than a hydrogen bomb, which is based on nuclear fusion reactions. In
NUCLEAR PHYSICS • 533
Containment
Control rods
Steam to
Pressure vessel
turbo-
generator
Core
Fuel rods
Heat
Cool water
condensed
Light water
moderator
from steam
Steam
Pump generator
There are two cooling circuits. In the primary circuit water is pumped up
through the core and then returns to the core via a heat exchanger. In the
heat exchanger energy is transferred to water in the secondary circuit. This
generates steam that is used to drive a turbo generator to generate electricity.
The steam is then condensed and returned to the heat exchanger. Cold
water from a lake or river is needed to operate the condenser and large
cooling towers are used to cool this water once it has returned from the
condenser.
26.3.4 Plutonium
Plutonium is a fissile material that can be used in bombs and reactors.
However, it does not occur naturally on Earth in any significant quantities
NUCLEAR PHYSICS • 535
0 n 92 U → 92 U
1 238 239
239
92 U → 93 Np 1
239 0
00
239
93 Np → 239 0
94 Pu 1 00
Plutonium can be harvested from spent fuel rods. This is called
“reprocessing.”
We need to work with nuclear masses, but since there are equal numbers
of electrons to subtract from each side of the equation we can, once again,
simply use the atomic masses.
Mass defect (2.014102 3.0160492) u (4.002603 1.008664) u
0.0189 u
The energy released is E 932.9 0.0189 17.6 MeV
This is about 3.5 MeV/nucleon compared to about 0.76 MeV/nucleon from
nuclear fission (combustion releases less than 1 eV per nucleon!).
536 • FOUNDATIONS OF PHYSICS
For nuclear fusion to take place the reacting nuclei must come close
enough (a few times 1015 m) for the short range strong nuclear force to
bind them together. However, all nuclei are positively charged and repel
one another. To approach close enough for fusion to take place they must
have a very large kinetic energy. This can be achieved by accelerating
the nuclei and then crashing them together in a device such as the Large
Hadron Collider (LHC) at CERN or by confining the reactants and heating
them to extreme temperatures.
Three situations that involve nuclear fusion reactions are:
Q Nucleosynthesis—the formation of heavy nuclei from light nuclei in the
cores of stars.
Q Thermonuclear weapons—the nuclear fusion of isotopes of hydrogen in
a bomb.
Q Fusion reactors—commercial reactors designed to produce electrical
energy from nuclear fusion.
26.3.6 Nucleosynthesis
Soon after the Big Bang the early universe consisted mainly of hydrogen
with some helium and trace amounts of other nuclei. Nuclei of all the
heavier elements were formed (and are still being formed) by nuclear
fusion reactions taking place in stars. Nuclei of elements up to iron-56 are
formed in the cores of stars during most of their “normal life” (when they
are on the “main sequence”). Nuclei beyond iron-56 are formed when very
massive stars explode at the end of their lives (forming supernovae).
Stars form when clouds of gas and dust collapse under their own
gravitational forces. As gravitational potential energy falls the gas and dust
heats up and when the temperature and pressure at the core of the collapsing
mass become high enough nuclear fusion reactions begin. At this point a
proto star is formed. The higher the mass of the star the more extreme the
core conditions and elements higher up the periodic table can form. This
process of nucleosynthesis stops at iron-56 because this is the most stable
nuclide. Nuclides lighter than this are formed by exothermic reactions
whereas those beyond iron-56 are only formed in endothermic fusion
reactions and so need an external source of energy. The energy released
in the core by exothermic nuclear fusion reactions generates an outward
radiation pressure that supports the star against further gravitational
collapse during most of its “life.”
NUCLEAR PHYSICS • 537
As fuel for the fusion reactions in the core runs out gravitational forces
cause the core to collapse. What happens next depends on the mass of the
star (see Section 28.1.1) but when stars of mass greater than about 10 times
the mass of the Sun collapse they undergo a sequence of fusion reactions
and create all the nuclides up to iron-56 and then explode in a supernova.
Some of the energy released in the explosion creates the heavier nuclei up
to uranium and the explosion distributes them throughout space.
Our Sun is a medium-sized star and will spend almost all of its life
synthesizing helium from the hydrogen in its core. The net effect is to
convert four protons into a helium nucleus but the probability of this
happening in one step by a fortunate collision of four particles with enough
energy to get close enough to fuse is effectively zero. The main process by
which helium is created is called the proton-proton cyclethat proceeds in
three steps.
Q Step 1: Two protons collide to form a deuteron, a positron (anti-
electron) and a neutrino:
1
1H 1H → 1H 1 e 0
1 2 0 0
This process releases about 26 MeV per helium-4 nucleus produced. The
overall reaction for the proton-proton cycle is then:
4 11 H → 42 He 2 10 e 2 00
The two positrons created in the core almost immediately annihilate with
electrons creating high-energy gamma-rays that contribute to the outward
radiation pressure that supports the star. The neutrinos are very weakly
interacting and pass through the outer layers of the Sun and into space. The
flux of solar neutrinos detected on Earth gives astronomers a way to monitor
the fusion processes going on in the Sun’s core. Astronomers estimate that
there is enough hydrogen left in the core for the Sun, which was formed
about 5 billion years ago, to continue to shine for another 5 billion years.
Shaped explosives
Q When fusion reactions fail the reactor simply switches off, there is no
core and no possibility of a meltdown.
Q While the cost of a commercial fusion reactor is likely to be high the
decommissioning costs should be significantly less than for a fission
reactor because there is no highly radioactive core or spent fuel rods to
deal with.
The challenge of creating a commercial fusion reactor is very great and
stems from the need to create and maintain the extreme conditions under
which fusion reactions can take place. The two main approaches are:
Q Tokamaks—reactors that support a plasma (a gas of ionized particles)
in a toroidal magnetic field and then pump it to high temperature to
initiate the fusion reactions.
Q Inertial confinement—focusing intense lasers onto a pellet of fusion fuel
so that it implodes, reaches extreme temperatures and pressures and
fuses.
Both methods have achieved some success in producing fusion reactions but
have a long way to go before they can be used in a commercial reactor. The
largest current project is the International Thermonuclear Experimental
Reactor that is being built in France by an international collaboration of
countries representing over half the world’s population. Its aim is to test the
feasibility of magnetic fusion reactors (Tokamaks).
The tokamak method is likely to use the fusion of deuterium (about
0.015% of all hydrogen atoms on Earth are deuterium) and tritium, which
can be created from lithium (a common element in the Earth’s crust).
2
1 H 1 H → 2 He 0 n
3 4 1
The muon and tau are effectively more massive versions of the electron
and they tend to undergo decays that eventually produce electrons. The
neutrinos are all neutral and have very low rest mass. Beams of neutrinos of
any one type begin to oscillate between the different “flavors” of neutrino
so that soon the beam contains equal numbers of electron-, muon-, and
tau-neutrinos. The discovery that this occurs solved the so-called “solar
neutrino problem” where only about 1/3 of the electron-neutrinos emitted
by nuclear fusion reactions in the Sun were detected here on Earth. The
other 2/3 had oscillated to muons or tausen route.
called quarks. Most hadrons contain quark pairs (mesons) or quark triplets
(baryons) and the quarks, like the leptons, come in three generations, each
with a pair of different “flavors.”
Generation Particle/anti-particle Particle/anti-particle
1st Up and anti-up Down and anti-down
2nd Strange and anti-strange Charm and anti-charm
3rd Bottom and anti-bottom Top and anti-top
The underlying process that explains how these three quantum forces work
is based on the exchange of virtual particles. A virtual particle can be
created by “borrowing” energy from the universe for a short time and then
“paying it back” when the particle disappears. This is possible because of
the energy-time Uncertainty principle in quantum theory. For example, the
electromagnetic repulsion between two electrons comes about as a result of
an exchange of virtual photons.
The exchange particles for electromagnetism are photons, for the weak
force they are W, W, and Z0 particles and for the strong force they are
different kinds of gluons. Richard Feynman developed a pictorial way to
represent interactions. His method was useful because it provided a link
to the mathematical methods needed to solve problems in QED. The
diagrams are known as Feynman diagrams and the diagram on the next
page shows two of the many ways a pair of electrons might interact:
e– e–
e– e–
Exchange of a single photon Exchange of two photons
544 • FOUNDATIONS OF PHYSICS
All of the particles in the table on the next page have anti-particles (or
are their own anti-particle).
The table below summarizes all the particles in the Standard Model.
QUARKS FORCE
1st generation 2nd generation 3rd generation CARRIERS
UP CHARM TOP GLUON
Charge 2/3 e Charge 2/3 e Charge 2/3 e Charge 0
Spin ½ Spin ½ Spin ½ Spin 1
DOWN STRANGE BOTTOM PHOTON
Charge 1/3 e Charge 1/3 e Charge 1/3 e Charge 0
Spin ½ Spin ½ Spin ½ Spin 1
Z BOSON
Increasing mass Charge 0
Spin 1
ELECTRON MUON TAU W BOSON
Charge e Charge e Charge e Charge e
Spin ½ Spin ½ Spin ½ Spin 1
ELECTRON- MUON-NEUTRINO TAU-NEUTRINO
NEUTRINO Charge 0 Charge 0
Charge 0 Spin ½ Spin ½
Spin ½
LEPTONS
HIGGS BOSON
Charge 0
Spin 0
The fact that there are three generations of quarks and three generations
of leptons suggest that there is an underlying symmetry linking the quarks
and the leptons. This has led to several hypotheses about new particles
and mechanisms for changing leptons to quarks and vice versa, but so far
there has been no experimental evidence to support these ideas. While the
Standard Model is incredibly impressive, it is unlikely to be the last word on
particle physics; there are too many arbitrary constants that have to be put
into the model to make it work.
visible matter inside the galaxy is much less than the required centripetal
force for the outer parts to rotate as they do. This was first pointed out by
Fritz Zwicky in 1933 when he tried to understand the motion of the Coma
cluster of galaxies.
These observations led physicists to suggest that there must be a lot of
invisible or dark matter in the galaxies to provide the additional centripetal
force. This idea has been supported by evidence from gravitational
lensing—the deflection of light close to galaxies because their mass distorts
the local space-time geometry. The mass required to account for the
observed lensing effects is much greater than the mass of the visible matter
in the galaxies. While a small proportion of the dark matter is probably
cool baryonic matter (i.e., “ordinary matter”) the rest is yet to be identified.
Dark matter is thought to make up 23% of the mass of the universe whereas
ordinary matter is thought to make up just 4.6%. The remaining 72% of the
mass is thought to be dark energy.
Ordinary
matter
5%
Dark matter
23%
Dark energy
72%
26.5 Exercises
16 16.01470 u
6C
(atom)
16 16.006103 u
7N
(atom)
13
13.005738u
14
7N 14.003074 u
15
7N 15.000108 u
16
7N 16.006103 u
17
7N 17.00845 u
18
7N 18.0142 u
13
6C 13.003354 u
13
8O 13.0248 u
Proton 1.007276 u
Neutron 1.008665 u
27
QUANTUM THEORY
Prediction of classical
theory: ultraviolet
catastrophe
Wavelength
z-directions, so the energy per molecule is 3/2 kT. For more complex
molecules, vibrational and rotational degrees of freedom can also be excited
and these should, according to classical theory, get ½ kT as well. The greater
the energy per molecule at a particular temperature the higher the heat
capacity of the gas, so measurements of heat capacity allowed physicists to
test the predictions of classical equipartition theory. They found that while
the predictions were usually good at high temperatures they sometimes
broke down at lower temperatures, as if some degrees of freedom were not
contributing to the heat capacity. This was unexpected and could not be
explained using the classical theory.
Photoelectric effect
Heinrich Hertz discovered the photoelectric effect in 1887. He noticed
that when ultraviolet light was shone onto a pair of electrodes it was easier
to form sparks between them. Ultraviolet light seemed to be able to knock
electrons out of the surface of the metal electrode. Further investigation
showed that the effect depended on the frequency of the absorbed radiation
but not on its intensity. This was a complete surprise. According to classical
theory ejecting an electron from the surface of a metal should depend on
the energy of the incoming radiation and not on its frequency.
QUANTUM THEORY • 553
Metal surface
n=4
n=3
n=2
n=1
Low-frequency High-frequency
oscillator oscillator
554 • FOUNDATIONS OF PHYSICS
This makes it far less likely that a high frequency oscillator will be
excited because it needs a large energy to start vibrating. The effect is
to suppress high frequency vibrations and dramatically reduce the high-
frequency electromagnetic radiation they emit, thus preventing the
ultraviolet catastrophe. At low frequencies, the allowed energies are very
close together and behave more like the classical continuum of allowed
energies. That is why the long wavelength, low-frequency part of the curve
can be explained classically.
Planck’s quantization of energy was the first of several ad hoc
quantizations discovered in the early part of the twentieth century. This
was the beginning of quantum theory, but at that time no one understood
why quantization worked!
require more energy than because they are not at the surface of the
metal). For electrons to be ejected from the surface:
hf
This explains why there is a threshold frequencyf0 below which photoelectric
emission does not occur:
hf0
This was another radical departure from classical physics where light was
considered part of the electromagnetic spectrum and was assumed to
be a continuous wave that can take any energy. Einstein’s theory treated
electromagnetic radiation as if it consisted of discrete packets of energy
and transferred that energy discretely too. The photon theory treated light
more like a particle model than a wave model.
f > f0
f < f0
Cap: metal with
work function Φ
Electroscope is
discharged
Q If the metal is zinc, then visible light does not discharge the electroscope
even if it is very intense but ultraviolet radiation will discharge it even if
its intensity is low.
Q Increasing the intensity of the ultraviolet radiation discharges the
electroscope more rapidly.
Q When ultraviolet radiation is used, the leaf begins to fall as soon as it is
illuminated: there is no delay.
556 • FOUNDATIONS OF PHYSICS
− +
Collector Emitter
The circuit shown on the next page can be used to find the maximum
kinetic energy of the emitted electrons by applying an opposing voltage to
the cell and increasing this until the current in the circuit is reduced to zero.
Measurements of this “stopping voltage” for incident light with a range of
different frequencies can be used to find the Planck constant and the work
function of the emitter.
Photocurrent
0
f0 Frequency
−Φ/e
Diffraction
Electrons pattern
(rings)
Heater
supply (low
voltage) Hot cathode Anode Graphite
target
Fluorescen
t screen
QUANTUM THEORY • 561
d
Planes of atoms
acting like lines in a
diffraction grating
Atomic arrangement
in graphite
The radius of diffraction rings can be used to find the spacing of atomic
planes in the crystalline structure. Changing the accelerating voltage
changes the wavelength of the electrons and the radius of each ring. Higher
voltage gives the electrons greater momentum and smaller de Broglie
wavelength so the rings get smaller:
r
Electron
beam
θ
L
Graphite
h h
sin
d dmv d 2 meV
r
For small angles, sin ~ ~ , so that:
L
hL
r~
d 2 meV
562 • FOUNDATIONS OF PHYSICS
Scattered electron
with momentum p
− −
φ
θ
Scattered photon of
wavelength λ′ − λ
Compton treated this as a collision between particles and used the equations
for conservation of energy and momentum to find the relationship between
the X-ray scattering angle and the change in wavelength of the X-rays:
' mh c 1 cos
e
This was verified experimentally and showed once again that the particle
model must be used for some interactions between radiation and matter.
phenomenon, but there is a deeper problem because the two models seem
to contradict one another.
We have already seen that the wave model of light, with energy spread
continuously across the wave front, cannot be used to explain photoelectric
emission. However, if instead of directing the light onto a metal plate we
had passed it through a double slit apparatus we would have to assume
that the energy is spread continuously to explain the resulting interference
pattern.
EM radiation (e.g., light) Matter (e.g., electrons)
Evidence for the Young’s double slit experiment Electron diffraction
wave model Diffraction patterns Electron standing waves in atoms
Evidence for the Photoelectric effect Discrete nature of electric charge
particle model Compton effect Momentum of individual
electrons
How can these two apparently irreconcilable models be related?
27.3.1 Young’s Double Slit Experiment Revisited
The double slit experiment, first used to support the wave model of light,
is an ideal example to use when trying to reconcile the wave and particle
models.
In the double slit experiment a monochromatic light source is directed
at a double slit and an interference pattern consisting of regularly spaced
maxima and minima appears on a screen. The maxima occur in positions
where waves arriving from each slit are in phase and interfere constructively.
The minima occur where the waves arrive in antiphase ( phase difference)
and interfere destructively. The resultant intensity at any point on the
screen is calculated by adding the phasors from each slit and then squaring
the resultant amplitude. This approach, the wave model, works (up to a
point). It explains the intensity variation across the screen.
If we now assume that the light is emitted and absorbed as individual
photons we run into difficulty. Consider, for example, a minimum position.
When both slits are open the minimum is dark, so no photons arrive at
this position. If we cover either one of the slits, then light does arrive at
this position. So we have a problem. It seems that opening the second slit
and allowing light to reach that position from either slit results in less light
arriving …. how can identical photons cancel one another out? How can
energy disappear?
The diagrams on the next page illustrate this. In the top diagram, only
the top slit is open and NA photons reach point P. In the middle diagram,
564 • FOUNDATIONS OF PHYSICS
NA photons
Bottom slit from slit A
covered
P
B
Laser
A
Single
slit Diffraction
pattern
Single
Screen
slit B
NB photons
from slit B
Top slit
covered P
B
Laser
A
Single
slit
Diffraction
Single Screen pattern
slit A
Zero
Both slits photons
open
P
B
Laser
A
Single
slit
Interference
pattern
Double slit A and B Screen
only the bottom slit is open and NB photons reach point P. With both slits
open one might expect (NA NB) photons to reach point P but in fact zero
photons are detected at P.
QUANTUM THEORY • 565
It seems that opening the second slit affects where photons passing
through the first slit can go. On the face of it, this is bizarre!
One, unlikely, possibility is that when both slits are open photons from
A and B somehow interact on their way to the screen and this prevents
them reaching P. To rule this out the experiment has been repeated using
a filter in front of the source to reduce the intensity so far that only one
photon at a time is interacting with the apparatus. Now photons arrive one
at a time at the screen and cannot interact with one another en route. What
happens?
At first the photons seem to be arriving completely randomly but after
a short while it becomes apparent that the same patterns as before are
produced with single or double slits. If we insist that the photons behave
like particles then each photon can only pass through one of the slits. If it
passes through slit A when B is closed it can reach P. If it passes through slit
B when A is closed it can also reach P. However, if it passes through slit A
when slit B is open it cannot reach P! Similarly, if it passes through slit B
when A is open it cannot reach P! To maintain the particle model we would
need to assume that when the photon passes through one of the slits its
future path is affected by the state of the slit through which it did not pass.
Individual photons Photons cluster
pass through slit A around maxima and
OR slit B do not go to minima
Laser
Filter Single
slit
Double Screen
slit Interference pattern
builds up gradually
A single photon passing through a double slit apparatus cannot reach any of
the minimum positions, even though it could reach all of them if it passed
through either slit when the other one is closed! This shows that photons do
not interfere with each other but every photon interferes with itself. Where
does this leave us?
566 • FOUNDATIONS OF PHYSICS
where is the wave function,i is the square root of minus one, and m is the
mass of the electron. V is the potential energy that might vary with position
and time. If the electron is moving freely in space then V 0.
It is possible to find wave-like solutions to this equation. These represent
the de Broglie waves of the electron. Once Schrodinger had published
his equation physicists could use it to solve a vast range of problems, and
Schrodinger himself showed how it could explain the energy level structure
and spectrum of the hydrogen atom. In Schrodinger’s atomic model the
electron orbitals are three-dimensional standing wave solutions to the
equation.
According to the Copenhagen Interpretation, the square of the
magnitude of the wave function at each point in space is equal to the
probability per unit volume of finding the electron at that point. Since the
wave function is itself a complex quantity it is not directly observable, and
its magnitude is found by multiplying it by its own complex conjugate:
Probability of finding electron in small region xyz (x, y, z) (x, y, z)
if this is integrated over all of space it must equal 1: the electron will be
found somewhere in the universe!
Electron
rn
−
F mvn
+
Proton
The diagram above shows an electron in the nth orbit inside the hydrogen
atom. The dotted lines indicate other quantized orbits. The energy of the
nth orbit is derived below.
Angular momentum:
nh
mvn rn
2
Centripetal force:
mvn2 e2
rn 4 0 rn2
Kinetic energy:
1 e2
KE mvn2
2 8 0 rn2
570 • FOUNDATIONS OF PHYSICS
Potential energy:
e2
PE =
4 0 rn
Energy:
1 e2 e2
En KE PE mvn2
2 4 0 rn 8 0 rn
Bohr used these equations to eliminate rn from the energy equation and to
express the energy of an electron in the nth level in terms of the quantum
number n.
me4 1 13.6 eV
En 2 2 2
8 0 h n n2
This is a very important result as it provides an accurate method to calculate
the ionization energy of hydrogen and the frequencies of the lines in the
hydrogen line emission spectrum. The number n is called the principal
quantum number and n 1 represents the lowest allowed energy state for
an electron in the hydrogen atom: the ground state. All the energies are
negative because the electron is bound to the nucleus.
The diagram below is an energy level diagram for the hydrogen atom
based on the Bohr model.
Energy
13.6 n=∞
− = −0.85 eV 0
42 n=4
n=3
13.6
− = −1.5 eV
32 n=2
13.6
− = −3.4 eV
22
13.6
− = −13.6 eV n=1
12
As n increases the energy levels get closer and closer together becoming a
continuum of states as n approaches infinity. The energy needed to remove
an electron from the ground state of the hydrogen atom (its first ionization
energy) is 13.6 eV, the energy needed to move from n 1 to n where
the energy would be zero (a free electron).
QUANTUM THEORY • 571
13.6 0 n=∞
− = −0.85 eV n=4
42
n=3
13.6
− = −3.4 eV n=2
22
13.6
− = −13.6 eV n=1
12
572 • FOUNDATIONS OF PHYSICS
1s 2s 2p
QUANTUM THEORY • 575
Wolfgang Pauli realized that no two electrons in the same atom can
have the same set of quantum numbers. This is an example of the Pauli
exclusion principleand the consequence is incredibly important. If it
were not the case then all the electrons in a multi-electron atom would fall
into the lowest energy level and different elements would have very similar
chemical properties. The Exclusion principle prevents this and ensures that
the electrons must fill up each energy level in turn. In a very general sense
this is what gives us chemistry. It certainly accounts for the Periodic Table.
For example:
When n 1: l 0, m 0 and s 1. Two electrons can occupy these
states. These are the 1s states in the atom and correspond to a spherically
symmetric wave function. Hydrogen, with one electron has a half-filled 1s
shell. This can be represented by 1s1.
The first number is the energy level or principal quantum number,
the letter refers to the type of orbital (related to the value of l) and the
superscript is the number of electrons in the shell.
This accounts for its reactivity—it can complete the shell by reacting
with other elements with one or more outer electrons.
Helium, has two electrons, so the 1s shell is full and the atom is very
stable. Its electronic configuration is 1s2.
Lithium has three electrons so n 1, l 0, m 0 and s 1;states
are filled by two electrons so the 1s shell is full and the third electron has
to go into one of the eight available states with n 2 (the 2s shell). The
lowest energy state is the n 2, l 0, m 0, s 1 (or 1) state. This
leaves the 2s shell partially filled and lithium is again reactive. Its electronic
configuration is:
1s22s1
This filling of energy shells continues as we move to larger and larger atoms.
The periodicity of the periodic Table comes about as successive shells fill
up and electrons start to fill the next shell.
The fifth element in the Periodic table is Boron. This must accommodate
five electrons so both the 1s and 2s shells are full and the next electron must
have an orbital quantum number l 1. This is called a p-shell and has a
different shape to the spherical s-shells. There are three p-shells (m 1,
0 and 1) in different orientations and each is shaped like a dumbbell. Two
electrons can go into each shell (with s 1 or s 1). The electronic
configuration for Boron is:
1s22s22p1
576 • FOUNDATIONS OF PHYSICS
The table on the next page shows the order of the first few electronic energy
levels and the corresponding electronic configurations. Just a few examples
will show how the electronic configuration, determined by solutions to the
Schrodinger equation, affects chemical properties.
Q Atoms with filled shells tend to be very stable—e.g., helium, neon.
Q Atoms with a single electron in an otherwise empty shell tend to be very
reactive because they easily lose that electron to an atom that needs one
or more electrons to complete its own outer shell—e.g., sodium.
Q Atoms with an almost full shell are also very reactive, easily gaining
electrons from other atoms that have only one or a few electrons in their
outer shell—e.g., fluorine.
Q Carbon has 2 electrons in the 2p shell, so to complete this shell carbon
must gain four electrons. This makes it “4-valent.” Carbohydrates
(compounds of carbon and hydrogen) are the most important class
of compounds for living things and illustrate how valency is linked to
electronic configuration. In methane, CH4, each hydrogen atom shares
one electron so that the carbon 2p shell is completed and the hydrogen
1s shell is completed, so both achieve more stable (lower energy
configurations) and four covalent bonds are formed.
n l m s Configuration Element (ground state)
1s shell 1 0 0 1 1s1 Hydrogen
1 1s2 Helium
2s shell 2 0 0 1 1s22s1 Lithium
1 1s22s2 Beryllium
2p shell 1 1 1 1s22s22p1 Boron
1 1s22s22p2 Carbon
0 1 1s22s22p3 Nitrogen
1 1s22s22p4 Oxygen
1 1 1s22s22p5 Fluorine
1 1s22s22p6 Neon
3s shell 3 0 0 1 1s22s22p63s1 Sodium
1 1s22s22p63s2 Magnesium
…
individual photon, so if only one photon interacts with the apparatus the
same probability distribution is used to work out where it is likely to be
detected.
Now we reach the most controversial aspect of the Copenhagen
interpretation, the act of observation or measurement. Up to this point
the wave function has evolved continuously according to the Schrodinger
equation. At the moment of observation or measurement the probability
distribution, which spreads across the entire screen in the double slit
experiment, suddenly and discontinuously changes. It becomes instantly
zero everywhere except at the point where the photon is observed. This
is called the “collapse of the wave function.” This is not explained by the
Schrodinger equation and there is no agreement on any physical mechanism
by which it occurs. This is called the “measurement problem”and is the
main reason many physicists think that the Copenhagen Interpretation
is unacceptable (even if it does give us a useful way to describe quantum
processes).
The diagram below gives a simplified explanation of the double slit
experiment using the Copenhagen Interpretation.
side then it must have been localized in the region of the hole as it passed
through, so we know its location at that moment with an uncertainty about
equal to the diameter of the hole. However, we must take into account the
wave nature of the electron. As it passes through the hole its wave function
diffracts and spreads out in all directions. This is equivalent to giving the
electron a random sideways momentum as it passes through the hole. As the
hole is made smaller the diffraction effects increase and the random changes
of momentum are likely to be larger. Making a precise measurement of the
position of the electron in the plane of the hole introduces an uncertainty
in the momentum of the electron in that plane.
x
If the hole diameter is x the angular spread of the first diffraction maximum
(which is where most electrons will be found after passing through the hole)
is given by:
1.22
sin
x
For small angles this is approximately:
~
x
The de Broglie wavelength links wavelength to momentum, so:
h
p
h
~
px
where p is the original momentum of the electrons (perpendicular to
x). After the electron passes through the hole its momentum also has an
x-component p perpendicular to its original momentum:
QUANTUM THEORY • 581
Δp
θ
p
Possible path 1
Phasor:
Point B path 1
Possible path 2
Phasor:
Phasor at path 2
point A
Phasor: Sum of
Possible path 3 path 3 phasors
Considering double slits, the photon can reach the detector via either slit,
so each possible route contributes a phasor. The paths are different lengths
so there is a phase difference.
Path via B
B
Path B is longer than path A, so the phasor has rotated more times and
there is a phase difference between phasors via A and via B.
Via B
Via A
Resultant
The length of the resultant will vary as the detector is moved to different
positions. If the two phasors arrive in phase they reinforce so the probability
at that point is a maximum value. At positions where the phasors arrive in
antiphase ( phase difference) they undergo destructive interference and
cancel out. The probability at that point is zero so no photons actually arrive
there.
This “sum-over-histories”approach is particularly helpful in particle
physics where all possible mechanisms for a particular interaction
contribute a phasor and the sum of all the phasor amplitudes (squared)
gives the probability of the process. As an interpretation of quantum theory
it suggests that beneath what we regard as the real world of actual events
there are potential events (possible paths) that may not actually be where
the photon is found but which nonetheless contribute to what it can do.
ψ1 World 1
M1 [Observer 1]
[Observer]
World 2
ψ IN M M2 ψ2
[Observer 2]
ψ3 World 3
M3 [Observer 3]
state. Prior to opening the box and making an observation the most we can
deduce about the system is that the atom is in a superposition of the decayed
and undecayed states. But there is also a wave function for the state of the
detector and the vial of poison. These too must be in a superposition of
states. So must the cat! It is not dead or alive prior to opening the box but in
a superposition of those states! And we can go further—the experimenter
who opens the box can also be described by a wave function. Until we
interact with him and ask him what has happened to the cat we must
describe him, the cat, the detector and the atom by a wave function that
includes a superposition of both alternatives….
If we stick to the Copenhagen Interpretation each wave function is
collapsed by successive observations. This is where the many-worlds theory
offers a way out of these endless wave function collapses. The cat is both
dead and alive but not in the same world. In one world the atom decayed,
the poison was released and the cat died. In that world the experimenter
opened the box to find the dead cat. But in another world the atom did not
decay, the poison remained trapped in the vial and when the experimenter
opened the box the cat was alive and well.
Vial of
poison
Hammer
release
?
Detector
Radioactive
atom
27.6 Exercises
(e) How does the many-worlds theory account for what happens in this
thought experiment?
10. The energy levels of the hydrogen atom are given by the equation:
13.6 eV
En
n2
(a) Derive a similar equation for the singly ionized atom of helium
(Z 2).
(b) Describe how you would expect the line emission spectrum from
singly ionized helium to compare with that of hydrogen.
(c) Explain why we cannot use the same simple method to derive the
energy levels for the second electron in the neutral helium atom.
CHAPTER
28
ASTROPHYSICS
28.1 Stars
Before the start of the twentieth century, physicists were unable to explain
how a star like the Sun could continue to radiate energy for billions of
years. No known chemical or gravitational process could provide a large
enough source of energy. This was not a major problem until the theories of
long-term geological processes and evolution by natural selection became
established. Both required the Earth to have existed, and had a source of
energy, for billions of years.
The problem of the Sun’s energy source was solved by the discovery
of Einstein’s mass–energy equation and the process of nuclear fusion (see
Section 26.3.4). Stars fuse light nuclei into heavy nuclei in their cores
releasing a huge amount of energy. Radiation from the core supports the
star against gravitational collapse while energy is transferred to its surface
where it radiates out into space.
28.1.1 Mass
The most important parameter when modelling a star is its mass. The greater
the mass of the star the higher the temperature at its core. This can allow
fusion reactions to proceed faster and nucleosynthesis (see Section 26.3.5)
to produce heavier elements. Increasing mass rapidly increases the reaction
rate in the core so that more massive stars use up their fuel relatively more
quickly than less massive stars and reach the end of their lives earlier.
Mass determines the fate of a star.
Q Low mass stars are stable for a long time but their cores are not hot
enough to create nuclei beyond carbon. When they run out of fuel they
swell to become red giant stars before shedding their outer layers of gas
and forming a planetary nebula. This exposes the white-hot core. This
final state is called a white dwarf star. Fusion reactions have now ceased
so the white dwarf cools down over a long period of time (of the order of
a billion years) eventually becoming a dense black dwarf star.
Q High mass stars have shorter lives but the core becomes hot enough
for nuclear fusion to create iron. Iron is the most stable nuclide so
at that point fusion reactions stop suddenly and the star undergoes a
violent collapse and explosion called a supernova. This can increase the
luminosity of a star by a factor of around 1010 for a short period (days or
weeks). The supernova explosion has two effects—some of the energy
creates heavier nuclei than iron and the process blasts these out into
space, where they can become part of the raw material for second and
ASTROPHYSICS • 593
third generation stars to form. Our own Solar System must have formed
from supernovae remnants because it contains significant amounts of
the heavy nuclides (e.g., uranium in the Earth’s crust).
Mass also determines what happens to the core left after the supernova
explosion.
Q The core is so massive (typically 2–3 times the mass of the Sun) that
the forces that prevent collapse of ordinary matter—i.e., forces that
stop atoms being crushed (called electron degeneracy pressure)—
are overcome by gravity. This effectively forces orbital electrons and
nuclear protons to combine to form neutrons. When this happens the
core radius decreases enormously (to about 10 km!) and the density
of the core, which is now almost entirely made of neutrons, increases
spectacularly to around 1017 kgm3. One centimeter cubed of this
neutron star material would have a mass of one hundred million tons!
The core is now called a neutron star. This collapse causes the rotation
rate to increase too so neutron stars spin rapidly, sometimes completing
a revolution in milliseconds). Neutron stars form from the collapse of
stars with initial masses in the approximate range 10–30 times the mass
of the Sun.
Q Another effect of the collapse is to intensify the magnetic field of the
star. This has the effect of directing a beam of radio waves out along the
magnetic axis of the star. Since this axis can be in a different place to the
rotation axis the beam sweeps around like the light from a lighthouse. If
the Earth happens to be struck by this beam we receive regular pulses.
When these pulses were first discovered they were so regular that
astronomers thought they might be alien radio signals. They are called
pulsars.
Q If the mass of the core is greater than about 5 times the mass of the
Sun then the collapse to a neutron star is not the end of the story.
The gravitational forces are so strong that the neutrons themselves
are crushed and at the present time we know of no physical force that
prevents collapse to a point or singularity. A black hole is formed.
The reason for the name is that at a certain distance from the central
singularity the escape velocity is c, the speed of light. Since this is a
universal speed limit no material or information from points closer to
the singularity can reach the outside world. The surface at which this
occurs is called the event horizon of the black hole (see Section 23.2.5).
594 • FOUNDATIONS OF PHYSICS
Supernova
Planetary explosion
nebula
Nucleosynthesis-up
to uranium-238
Wavelength
λpeak
different types of spectral lines appear in the spectrum. The details of this do
not need to concern us here but what is interesting is that when luminosity
is plotted against temperature (spectral type) for all of the observable stars a
clear pattern emerges. This was first done by Hertzsprung and Russell and
the plot is called a Hertzsprung-Russell (HR) diagram. Note that the x-axis
points in the direction of decreasing temperature.
Luminosity
Increasing radius
(relative to the Sun)
M Deneb
ain
106 seq Rigel
Betelgeuse
ue
nc
e
Super giants
Antares
Short lifetime:
107 years
Siriu
Giants
Sun
White dwarf
M
stars ain
seq
ue
nc
Long lifetime: e
10 −4 1011 years
O B A F G K M
30,000 3000 K
Increasing temperature
The diagram above shows (in a very simplified form) the main regions of
the HR diagram. The letters refer to spectral classes used in astronomy.
The Sun is in class G and has a surface temperature of about 5800 K.
A diagonal band runs from large luminous hot blue-white stars at top
left to small dim red stars at bottom right. This is called the main sequence
and stars spend most of their lives on this band. At the end of their lives,
when nuclear fusion fuel in their core runs out, they move off the band as
they become red giants or super giants and eventually white dwarf stars,
neutron stars or black holes. These final two star types do not appear on
the HR diagram because their luminosity and spectrum is not measured
directly (and luminosity is very low).
ASTROPHYSICS • 597
28.2 Distances
One of the greatest challenges for astronomers and cosmologists is to find
ways to determine accurate distances to the objects they observe. Ancient
Greek astronomers managed to find ingenious methods to estimate the size
of the Earth and the distances to the Moon and Sun but modern space
exploration has provided accurate methods for surveying our immediate
surroundings in space. Distances to objects beyond the solar system are
determined by several different overlapping methods and these regions of
overlap can be used to calibrate one technique against another.
Background of
α
d
Telescopes can be used to measure the parallax angle . This is half of the
total angular shift in the star’s position during a 6-month period (as the
earth completes half of an orbit).
R
tan
d
Parallax angles are tiny so we can use the small angle approximation and
replace tan() with (in radians).
R
d
where R is the radius of the Earth’s orbit. This is known very accurately from
laser ranging within the solar system and trigonometry. If R is measured in
meters and in radians then d will also be in meters. Astronomers often use
different (non S.I. units):
1 Astronomical unit (AU) 149,597,870,700 m
598 • FOUNDATIONS OF PHYSICS
Mean flux
at Earth
Period of Cepheid
Variable, T
ASTROPHYSICS • 599
Period
T
The way to determine distance using Cepheid variables is summarized
below:
Q Monitor the flux from a Cepheid variable.
Q Measure its period T and mean intensity I.
Q Use the period-luminosity relation to find the luminosity L.
Q Use the inverse-square law to find the distance d.
This method can be calibrated against the parallax method using nearby
Cepheids.
Hubble used this distance method with groups of Cepheids in nebulae
and showed that the nebulae were actually separate galaxies outside the
Milky Way.
28.2.3 Hubble Law
In the 1920s, Edwin Hubble and Vesto Slipher measured the spectra and
distances for many observable galaxies. When a galaxy is in motion relative
to the Earth the spectrum measured on Earth is shifted relative to the
spectrum of the same elements from a stationary source. This is a Doppler
effect (see Section 14.4.1) and can be used to calculate the velocity of the
source, in this case the velocity of the galaxy. The wavelength shift is
given by:
0
600 • FOUNDATIONS OF PHYSICS
Distance
ASTROPHYSICS • 601
28.3 Cosmology
Cosmology is the science of the universe as a whole, dealing with its origin,
nature, evolution and end. All of the evidence that we have suggests that the
laws of physics we have discovered from our own planet operate throughout
the universe, so we use these to try to understand it. Whilst cosmology
deals with physics on the largest scale it is intimately linked to physics on
the smallest scales and discoveries in particle physics and cosmology are
often linked. The enormous energies present soon after the Big Bang are
reproduced in particle physics experiments such as the Large Hadron
Collider at CERN.
Big Bang A B C
A B C
Tim
28.4 Exercises
29
MEDICAL PHYSICS
29.1 Ultrasound
Cornea Retina
Gel Lens
Ultrasound
transducer
Ultrasound pulse
Time/µs
0 40
MEDICAL PHYSICS • 609
responsible for the peaks in the detected signal. The gel between the
transducer and the cornea reduces the amount of the incident signal that is
reflected by the outer surface and does not enter the eye.
B-scans of the eye are used to diagnose problems such as a detached
retina, glaucoma, or cataracts or to monitor the shape and size of a tumour.
In the B-scan the incident ultrasound is moved back and forth to scan slices
of the eye. These can then be combined to form an image. The eye is quite
small so ultrasound does not have to penetrate far and higher frequencies
can be used. Recently frequencies up to 50 MHz have been used to provide
extremely high resolution images of structures at the front of the eye. At
these frequencies penetration is just a few millimeters.
B-scans in prenatal scanning have to penetrate farther into the body so
these are limited to lower frequencies. This reduces their resolution but the
structures being imaged tend to be larger so this is not a major problem.
Blood flow
As the blood flow pulses the frequency difference shows a series of peaks:
Frequency
Time
610 • FOUNDATIONS OF PHYSICS
29.2 X-rays
29.2.1 Overview of Medical X-Rays
X-rays were discovered in 1895 by Wilhelm Roentgen and he won the very
first Nobel Prize for physics in 1901. X-rays are a form of high-frequency,
short-wavelength electromagnetic radiation emitted when electrons moving
at high speed crash into a target and stop suddenly. The radiation is ionizing
and highly penetrating and Roentgen took the first X-ray photograph, of his
wife’s hand, soon after his discovery.
X-rays are now used routinely in medicine to create images of the
inside of the human body. In conventional X-ray radiography the X-rays
pass through the body and are absorbed to differing extents by the tissues
through which they pass, creating a shadow image that can be captured
on film or by arrays of detectors. A more sophisticated technique, called
computed tomography (CT), involves rotating the X-ray source and
detectors around the body to create a 3D image.
X-rays are ionizing radiation so they can damage tissues and doctors
must always balance risk against benefit when deciding whether to use
them. The risks depend on the wavelength and intensity of the X-rays, the
7 6
6
5
Dose/mSv
4
Average annual radiation dose
3 2
2 1.5 1.5
1 0.1 0.4
0.001 0.005
0
CT scan of chest
CT scan of head
CT scan of spine
Body extremity
Dental X-ray
Chest X-ray
Mammography
Spinal X-ray
Procedure
MEDICAL PHYSICS • 611
duration of the procedure and the tissues being exposed. To assess the
risk, the X-ray dose is compared to the annual radiation dose from natural
background sources. The chart below shows typical doses from different
types of X-ray procedure.
Rotating tungsten
anode (target)
Cathode
Vacuum Cooling system for
anode
Electron
beam
X-rays
612 • FOUNDATIONS OF PHYSICS
energy of the electrons and the maximum frequency of the X-ray photons.
It also increases the number of photons emitted.
The spectrum of X-rays produced is continuous, but also contains some
sharp emission lines that are characteristic of the target element. These
lines are created when electrons strike atoms in the target and eject an
electron from an inner orbits (e.g. K-shell or L-shell). The vacancy is then
filled by electrons from higher orbits cascading down and emitting photons
as they do so.
X-ray intensity
K-lines
L-lines
Wavelengt
The largest energy jumps are for electrons dropping into the innermost
shell, the K shell, and these correspond to the shortest wavelength spectral
lines. The energy jumps into the K and L shells correspond to X-ray photon
energies.
The short wavelength cut off corresponds to all the energy of one
incident electron being transferred to a single X-ray photon. The rest of
the continuous spectrum corresponds to more complex interactions and
multiple collisions.
The higher frequency, shorter wavelength X-rays are the most
penetrating. These are called “hard X-rays” and are the ones needed for
image formation. Longer wavelength, “soft X-rays” are usually filtered out
because they do not contribute to the image but do increase the radiation
dose. The absorption of X-rays by tissues attenuates the beam. The amount
of attenuation increases with the density of the tissue, so bones absorb
X-rays more strongly than the surrounding soft tissues. This is what creates
the contrast in an X-ray image and X-rays are particularly good for imaging
bones and bone damage. Soft tissues do not create much contrast so often
a contrast medium is injected prior to the X-ray. CT scans are better at
imaging soft tissue than standard X-rays.
MEDICAL PHYSICS • 613
X-rays
Patient
Lead grid
Film
Some of the X-rays passing through the patient are scattered off-axis so
a lead grid is slowly moved between the patient and the film during the
exposure. This eliminates off-axis X-rays and increases the signal to noise
ratio for the image.
An intensifying screen can be used to increase the number of light
photons created from each X-ray photon. This consists of thin layers of
fluorescent material placed in front of the film. The fluorescent layers
absorb the X-rays and emit visible photons. The arrangement is housed in a
cassette that is placed under the patient for exposure.
An X-ray filter (usually a metal plate) is placed between the X-ray
source and the patient to filter out the low energy (long wavelength) X-ray
photons. This reduces the total dose given to the patient but does not affect
the intensity of the image because low energy photons would have been
absorbed in the body.
An X-ray image intensifier can also be used to increase the brightness
of the image. Incident X-rays strike a phosphor screen that converts the
Electrodes to accelerate and
focus electrons
Output phosphor screen
X-rays
To camera/viewer
Vacuum
X-rays X-rays
Fixed detectors
CT
X-rays
Clinical MRI scanners use magnetic field strengths in the range 0.2–3.0
T and research scanners use up to 11 T. These give Larmor frequencies that
correspond to radio waves. An additional weaker variable magnetic field is
applied along the axis of the patient so that the Larmor frequency varies
with position. Pulses of radio waves corresponding to the Larmor frequency
across a particular slice of the body are then transmitted through the body.
This disturbs the precessing nuclei in that slice so that they create a rotating
magnetic field at the same frequency. It is this field that is detected using
coils placed outside the body. The rate at which this field decays depends
on the type of tissue surrounding the nuclei so can be used to distinguish
different types of soft tissue and to achieve much higher contrast than X-ray
techniques. By varying the frequency of the radio waves nuclei in different
locations resonate and an image can be built up.
Superconducting coils in
liquid helium cryostat
Layers of thermal
insulation Magnetic field
gradient coils
Radio frequency
transmit coils
Scanner
table Moveable
detection coils
MRI
Gamma-rays
Collimator:
lead grid
Scintillator:
sodium iodide
Photomultipliers
Compute
Display
MEDICAL PHYSICS • 619
1 e 1 e →
0 0
2 00
The positron emitted in this decay only travels a short distance (<1 mm)
through the tissue before meeting an electron and annihilating. The
annihilation of the electron-positron pair results in emission of a pair of
gamma-rays that travel in opposite directions (a pair must be emitted in
order to conserve linear momentum). The position of the annihilation along
the line determined by the two gamma-rays is determined from the time
delay in arrival at detectors on either side of the patient.
The detectors only respond to near simultaneous pairs of photon arrivals
(within about 10 ns of each other) and then measure the small additional
time delays for each pair.
620 • FOUNDATIONS OF PHYSICS
B
PET
Annihilation
For a time delay t the annihilation event must have been at a position that
is ct/2 closer to A than the center of the chord AB, i.e. a distance AB/2
ct/2 from A and AB/2 ct/2 from B.
In practice the two gamma-rays emitted from an annihilation event
are not emitted at exactly 180 so this introduces an uncertainty into the
position of the chord. In addition to this the detector can only resolve events
that are more than about 0.50 ns apart so this introduces an uncertainty into
the positon along the chord. The image quality and resolution improves
as more events are detected (signal to noise ratio falls) and resolutions of
about 1-2 mm are possible with clinical scanners. This is not as good as a
CT image but PET scans can be used to investigate a very wide range of
metabolic pathways and when used alongside CT or MRI scans information
about both structure and function can be combined.
The injection of a radioactive tracer means that the patient remains
radioactive for a short time after the procedure. For a typical PET scan
involving fluorine-18 the total activity injected is about 370 MBq. The
patient will absorbs a radiation dose equivalent to that of a full body CT
scan (about 7 mSv). If the PET scan is combined with a CT scan the total
dose will be the sum of doses from the two procedures.
29.6 Exercises
1. (a) State and explain the risks associated with X-ray imaging of the human
body.
MEDICAL PHYSICS • 621
L-lines
Wavelength
(a) Explain how the continuous spectrum arises and why it has a short
wavelength cut-off.
(b) Calculate the cut-off wavelength for a 40 kV X-ray machine.
(c) Explain how the line spectra come about and account for the
difference between the K and L lines.
4. State and explain four factors that must be considered when selecting a
suitable radioisotope to use as a tracer in the human body.
5. (a) Explain why the annihilation of an electron and a positron during
a PET scan is likely to result in two gamma-rays of the same
wavelength emitted in opposite directions.
(b) Calculate the wavelength of these gamma-rays.
APPENDIX
A
ESTIMATIONS AND FERMI
QUESTIONS
Fermi
Ground zero
This gives a result of about 50 KT, much greater than Fermi’s estimate and
about 2.5 times the actual yield of the Trinity test. However, it is the correct
order of magnitude, which is pleasing given the incredibly simple model
used to make the estimate! Fermi would have used a more sophisticated
model, taking into account the actual pressure differences in the shock wave
and the proportion of the input energy that went into it. His estimate of
APPNEDIX A: ESTIMATION AND FERMI QUESTIONS • 625
10 kT was impressive, but did not win the bet. Isodor Rabi was the winner
with an estimated yield of 18 kT. We do not know how he did this (or maybe
he just got lucky).
Method 2
This method has some similarities to the first but uses the total mass of
the atmosphere and the mass of an “air molecule.” The total weight of the
atmosphere is equal to the atmospheric force exerted on the entire surface
of the Earth:
F pA 10 5 4 (6.4 106 )2 5.15 1019 N
The mass of the atmosphere is:
F
m 5.25 1018 kg
g
Air consists mainly of oxygen (molar mass 32g) and nitrogen (molar mass
28 g) so an “air molecule” is taken to have a molar mass of 30 g.
Number of moles of air molecules in the atmosphere:
5.25 1018
n 1.75 1020 moles
0.030
The number of air molecules in the atmosphere:
N 1.75 1020 6.02 1023 1.05 10 44 molecules
There are about 1044 air molecules in the Earth’s atmosphere.
This is (not surprisingly) consistent with our first method.
APPNEDIX A: ESTIMATION AND FERMI QUESTIONS • 627
ds = vδt
Air
During a time t the canopy moves down through a distance vt and sweeps
through a volume r2vt of air. The mass of air in this volume is equal to
r2vt where is the density of the air. If we assume that all of this air
628 • FOUNDATIONS OF PHYSICS
must be accelerated up to speed v as the canopy passes we can work out the
rate of change of momentum.
mass of air swept out in t speed
Rate of change of momentum =
t
2
r2v t
r 2 v2
t
This is equal to the drag force on the parachute and, at terminal velocity,
the weight of the parachute and parachutist:
r 2 v2 mg
This equation can be rearranged to give an expression for r, the radius of
the parachute:
mg
r 2
2v
Now we need to input some reasonable values: mass of parachutist and
equipment, m 100 kg; density of air, 1.2 kgm3; maximum speed,
v 6 ms1
This gives a minimum safe radius of:
100 9.8
r 7.2 m
1.2 2 62
This is probably an over estimate but is certainly of the correct order of
magnitude. It corresponds to a minimum area of about 160 m2.
The approach above was based on momentum but different estimates
result if we consider kinetic energy instead. To make the estimate more
realistic we should consider the drag coefficient for a parachute of a
particular shape. Nonetheless, our simple method has produced a useful
equation to begin an investigation of how parachute size and terminal
velocity are related.
13. What is the drag force on a large truck travelling at speed on a motor-
way?
14. What is the power output of a racing cyclist?
15. What is the greatest distance at which the human eye can resolve car
headlamps?
16. Will relativistic effects be important for an electron in the ground state
of a hydrogen atom?
17. What is the spring constant of a car’s suspension system?
18. How many hairs on your head?
19. How much energy can be supplied by a car battery?
20. What is the temperature at the center of the Sun?
APPENDIX
B
EXPERIMENTAL INVESTIGATIONS
B.1.1 Variables
To make your experiment a fair test, the parameter you are investigating
must be affected only by the parameter you are varying. For example, if you
are investigating factors affecting the acceleration of a dynamics vehicle you
might vary its mass by adding loads to the vehicle. However, if you then
pull it with different forces you have varied two parameters, both of which
affect its acceleration, so you will not be able to separate the effect of mass
from the effect of resultant force. To make this fair you need to keep the
resultant force constant while varying the mass and then carry out a separate
experiment in which the mass is kept constant and the resultant force is
varied. The three parameters, mass, resultant force and acceleration are
examples of the three types of variable parameter in all experimental work:
Independent variable: the unique variable that we change (e.g., mass)
638 • FOUNDATIONS OF PHYSICS
a steel spring can result in it flying up and hitting you in the eye. This
risk should be identified before attaching loads to the spring and suitable
eye protection should be worn. If you are carrying out an experiment that
involves the use of radioactive sources you will need to consider the likely
total radiation dose you will receive an ensure that this is low enough to
justify doing the experiment. There are three stages to a risk assessment:
Q Identify the risks to you and others.
Q Adapt the experimental procedure to minimize the risk—i.e., take
precautions.
Q Decide whether the risk is low enough to justify carrying out the
experiment.
There are guidelines that must be followed when you work with hazardous
chemicals, electricity, radioactive sources, vacuum containers, etc. Your
teacher or supervisor should be consulted about these and if you are ever in
doubt about the risk of an experimental procedure, do not proceed!
Uncertainties
Include an analysis of uncertainties so that any value calculated from
your data should be quoted to an appropriate degree of precision and be
accompanied by its own uncertainty. For example, if you are carrying out
an experiment to determine the specific heat capacity of water and come
up with a value of 4000 Jkg1C1 there is no way to know if your experiment
was a good one even though your value is within 5% of the expected value
(4200 Jkg1C1). However, if your result is 4000 400 Jkg1C1 it is clear that
the expected value is within the range of your experimental uncertainties
and that your result has an uncertainty of 10%. Refining your methods
should reduce the uncertainties but keep the expected value within the
range of your results. If you end up with a result of 4000 100 Jkg1C1 this
does not include the expected value so either you have underestimated the
uncertainties or there is a systematic error in your measurements.
642 • FOUNDATIONS OF PHYSICS
Conclusion
It is surprising how often students forget to state a conclusion to their
experiments. This should be a simple statement of what you have achieved
in your experiment and should relate back to the original aims.
Evaluation
Having carried out the experiment and drawn a conclusion you should
consider:
Q How strongly the conclusion is supported.
Q How any calculated values compare with expected or known values.
Q The significance of the uncertainties.
Q Likely sources of error.
Q Suitability of the apparatus used.
Q How the experiment could be improved.
If possible make your evaluation quantitative by referring to the range of
uncertainties in measurements and their impact on your final result.
Glossary
Include a list of technical terms you have used in your account along with
brief explanations.
References
Include a list of references indicating where you have used each source.
Provide enough information so that someone reading your report can easily
find the information you used. This should include:
Q the title of the work or article
Q the author or authors
Q publisher and publication date
Q the page or pages used
Q URL and date accessed (for websites)
B.2 Investigations
An investigation is a more open-ended project that involves a considerable
amount of preliminary research and pilot experiments (to try things out
APPNEDIX B: EXPERIMENTAL INVESTIGATIONS • 643
and to explore the phenomena). These will help you to plan a sequence
of experiments, but as you proceed you should be prepared to modify
your plans in the light of results, or when you discover that a particular
experiment does not work!
It takes time to get into an investigation, and you must not be easily
deterred, especially at the start when there is a lot of uncertainty about how
to begin. Researchers spend a lot of time getting nowhere, but without trying
a range of different approaches you are unlikely to hit upon the method that
actually works. You might also discover interesting and unexpected aspects
of the problem along the way.
Science is a collaborative endeavor and carrying out an investigation
usually requires you to discuss ideas with your teacher or supervisor and
to think about the feedback they give you. You are also likely to need a
fair amount of apparatus, and this could involve discussing your needs with
laboratory technicians or equipment suppliers (in advance!) In both cases
it is important to be as clear as you can about what you need and what you
need it for—asking for a “block of wood” is pretty meaningless: what type
of wood? What dimensions? What is it for?
Writing up an investigation can be a daunting task, especially if you
leave it all until you have finished in the laboratory. It is worth using a
laboratory notebook and it is essential to write up and process data from
every experiment before moving on. A good investigation cannot be
completely planned in advance, it evolves as you discover more about the
problem you are tackling. There are many ways to write up an investigation
but the written report could take the following form:
Aim: statement of what is to be investigated
Background physics: summary of research about the problem identifying
aspects for investigation.
Pilot experiments: these should be used to explore the phenomena
and test methods or instruments to see if they are suitable. These pilot
experiments are not intended to produce precise data for analysis although
they might suggest relationships to be investigated in more detail later.
Plan of investigation: this should outline a sequence of related experiments
that can be used to collect good relevant data to move the investigation
forward.
Experiments: each experiment should be written up fully (using the
guidelines in Section B.1) including risk assessments and taking account of
644 • FOUNDATIONS OF PHYSICS
C
UNITS, CONSTANTS, AND
EQUATIONS
Derived units
Derived quantity Name Symbol
S.I derived units S.I base units
Force Newton N kgms2
Pressure Pascal Pa kgm1s2
Energy Joule J kgm2s2
Power Watt W kgm2s3
Charge Coulomb C As
Resistance Ohm kgm2s2A2
646 • FOUNDATIONS OF PHYSICS
dv
Dynamics a
dt
v adt
d(mv)
Newton’s laws F
dt
mv mu Fdt
dQ
Electric circuits I
dt
Q Idt
dN
Radioactivity
dt
N dN
N
dt
dQ Q dQ dt
Capacitors
dt
RC
Q RC
C.6 Equations
Mechanics
Equations for constantly accelerated motion:
v u at
u v t
s
2
1
s ut at 2
2
648 • FOUNDATIONS OF PHYSICS
1
s vt at 2
2
v2 u2 2 as
u2 sin 2
Range of a projectile : R
g
Coefficient of friction :
Flimit = N, S = tan ( is limiting angle)
Weight : W mg
m
Density :
V
F
Pressure : p A
Linear momentum : p mv
dp F
Newton’s second law : F a
dt m
(constant mass)
Impulse and change of momentum : Fdt dp p
Ft mv mu (constant mass)
m
Rockets : v f v0 u ln 0
mf
Work done : W Fs cos
Gravitational potential energy : GPE mgh
Kinetic energy : KE ½ mv2
useful output energy
Efficiency : 100%
total input energy
dE ds
Power : P dt P F Fv
dt
E
Photon momentum : pphoton c
I
Radiation pressure : p c
Lagrangian : L T V
L L
Euler–Lagrange equations :
t q q
APPNEDIX C: UNITS, CONSTANTS, AND EQUATIONS • 649
Fluids
Hooke law : F ke
Spring systems : 1 1 1 1
kseries k1 k2 k3
kparallel k1 k2 k3
1 1
Strain energy (spring) : EPE = Fe ke2
2 2
EPE 1
Strain energy (wire) :
V 2
F e
Stress and strain :
A l0
Young modulus : E
Thermodynamics
Temperature scales : T 273.15; T 273.15( in
°C, T in K)
dQ d
Thermal conduction : kA
dt dx
Wien displacement law : T constant 2.9103 mK
Stefan–Boltzmann law : P eAT 4
E
Specific heat capacity : c
m
650 • FOUNDATIONS OF PHYSICS
3kT
RMS molecular speed : vrms v2
m
1 3
Internal energy : U Total KE N A mv2 RT
2 2
3 5
Heat capacities (ideal gas) : cV R cP R
2 2
c
Adiabatic gas constant : P
cV
p
Speed of sound in gas : c
E
Boltzmann factor : f e kT
Qcold 1
Refrigerator : CoPrefrigerator
W Thot
1
Tcold
Qhot 1
Heat pump : CoPheat pump W T
1 cold
Thot
Oscillations
1
Frequency : f
T
Angular frequency : 2f
d2 x
SHM : a 2 2 x
dt
: x = A cos(t )
x = A cos t
dx
v A sin t
dt
dv
a 2 A cos t 2 x
dt
1 k m
Mass-spring system : f T 2
2 m k
1 g l
Simple pendulum : f T 2
2 l g
1
Total energy : TE m 2 A2
2
Damped SHM : x Aet cos(t)
Resonance condition : fd f0
Rotational dynamics
l
Angles in radians :
r
Small angle approximations : As 0 : sin , cos 1,
tan ( in radians)
v
Angular velocity :
r
a
Tangential acceleration :
r
v2
Centripetal acceleration : a r 2
r
mv2
Centripetal force : F ma mr 2
r
652 • FOUNDATIONS OF PHYSICS
u v t
t
i f
s
2 2
1 1
s ut at 2 i t t
2 2
1 1
s vt at 2 f t t 2
2 2
v u 2 as
2 2
2f 2i 2
i1
m i ri2
1
Rotational KE : RKE = I2
2
Angular momentum : L I
d(L) d
Torque : I I
dt dt
Moments of inertia for uniform objects:
Point mass : I mr2
1 1
Rod : Iend ml 2 ICM ml 2
3 12
Thin hoop : I mr2
1
Disc/cylinder : ICM ma2 (radius a)
2
2
Sphere : ICM ma2 (radius a)
5
Waves
Wave speed : v f
T
1D traveling wave : y A cos t kx
Intensity : I A2
c
Wave speed in medium : v
n
Refraction : n1 sin 1 n2 sin 2
n
TIR : sin c 2
n1
APPNEDIX C: UNITS, CONSTANTS, AND EQUATIONS • 653
1
v (medium)
0 r 0 r
Refractive index : n r r
1
Power of a lens : P
f
h v
Linear magnification : m i
ho u
Lens equation : 1 1 1
f u v
f
Astronomical telescope : M o
fe
v
Doppler shift (light) : 0
c
0 v
Red shift : z
0 c
Hubble law : v H0d
2 x
Phase difference :
sy
Young’s double slit :
d
Diffraction grating maxima : n d sin
1.22
Single slit minima : sin
D
1.22
Rayleigh criterion : (for resolution)
D
T
Waves on a string : v
RT E
Speed of sound : v (ideal gas); v (solid)
M
654 • FOUNDATIONS OF PHYSICS
I
Decibel scale : intensity level B log10
I0
Acoustic impedance : Z (speed of sound) (density)
2
Reflection : I r ⎛ Z 2 − Z1 ⎞
=⎜ ⎟
I 0 ⎜⎝ Z 2 + Z 1 ⎟⎠
Electricity
dQ Q
Electric current : I ; I ; (constant current)
dt t
Q1Q2
Coulomb law : F
4 0 r 2
F dV
Electric field strength : E ; E
q dx
F Q
E-field strength (point charge) : E
q 4 0 r 2
EPE
Electric potential : V
Q
W
Potential difference : V
Q
Q
Electric potential (point charge) : E
4 0 r
iN
Qi
Gauss theorem : E.dS
0
i1
surface
V
Resistance (Ohm’s law) : R
I
iN
Resistors in series : Rseries R i
i1
iN
1
Resistors in parallel :
Rparallel
R1 i
i1
APPNEDIX C: UNITS, CONSTANTS, AND EQUATIONS • 655
RA
Resistivity :
l
Electrical energy : E VIt
Real cell : V E Ir
Capacitance :
Q2 CV 2 QV
Energy stored (capacitor) : E
2C 2 2
A
Parallel plate capacitance : C 0 r
d
t
Capacitor discharge : Q t Q0 e RC I t
t t
I0 e RC V t V0 e RC
Capacitor charging :
Q QF 1 e
t
RC
V VS 1 e
t
RC
t
I t I0 e RC
Q
Capacitance (charged sphere) : Csphere 4 0 a
V
Magnetism
Magnetic force : F BIl sin (on current)
f Bqv sin (on moving charge)
Lorentz force : f qE qvB
mv
Radius of curvature in B-field : r
Bq
I sin
Biot–Savart law : B 0 2 l
4 x
656 • FOUNDATIONS OF PHYSICS
0 NI
Magnetic field strength : B (narrow coil)
2r
I
B 0 (long straight current-
2 r
carrying wire)
NI
B 0 r (long solenoid at center)
l
NI
B 0 r (long solenoid at end)
2l
Ampère theorem : B.dl 0 I
closed enclosed
loop by loop
2
VS 2 1
Z R 2 2L C2 R 2 L (RCL series circuit)
I C
VL VC L C
Phase angle : tan
VR R
(RCL series circuit)
Resonant frequency : 1 1
f0
(RCL circuit) 2 LC
Gravitational fields
Gm1 m2
Newton’s law of gravitation : F
r2
Gravitational field strength : gravitational force F
g
mass m
F GM
g 2 (point mass or uniform spherical mass)
m r
GPE
Gravitational potential : VG
m
GM
VG r (point mass or uniform spherical mass)
r
GPE mgh (h >> r)
2GM
Escape velocity : vesc
r
2GM
Scwarzschild radius : RS 2
c
3
Kepler’s third law : r GM
2
T 4 2
2GMmr
Tidal forces : F
R3
V
Gravitational time dilation : T T1 2 (weak fields)
c
Special relativity
1
Gamma-factor :
v2
1
c2
Time dilation : T T
658 • FOUNDATIONS OF PHYSICS
L
Length contraction : L'
Lorentz transformations :
x x vt z z
vx
y y t t
Inverse Lorentz transformations :
x x vt z z
vx
y y t t 2
c
u v
Velocity addition : w
uv
1 2
c
Mass increase : m m0
2
Mass and energy : E c2 m; E2 p2 c2 m0 c2
h
de Broglie relation :
p
h
Compton Effect : 1 cos
me c
me4 1 13.6 eV
Hydrogen atom energy levels : En
8 0 2 h2 n2 n2
1 1 1
Balmer series : R 2 2
n 2 n
1 1 1
Rydberg formula : R 2 2
n m n
me4
Rydberg constant : R 2 3
8 c0 h
h
Heisenberg indeterminacy : xp
4
relation
Astrophysics
L
Radius of a star : r
4 T 4
Hubble law : v H0 d
1
Hubble time : TH
H0
Medical physics
Beer–Lambert law : Iout Iin ex
B0
Larmor resonant frequency : f
2
C.7 Constants
Speed of light in a vacuum c 3.00 108ms1 299 792 458 ms1
Electronic charge (magnitude) e 1.60 1019 C 1.60217662 1019 C
Planck constant h 6.63 10 Js
19
6.62607004 1034Js
Gravitational constant G 6.67 1011Nm2kg2 6.67408 1011Nm2kg2
Avogadro constant L 6.02 1023mol1 6.02214086 1023mol1
Boltzmann constant k 1.38 1023 JK1 1.38064852 1023 JK1
1
Molar gas constant R 8.31 J mol 8.3144598 mol1
660 • FOUNDATIONS OF PHYSICS
D
SOLUTIONS TO EXERCISES
1. (a) 0.267 kg; (b) 25 000 000 mm; (c) 5000 000 cm3; (d) 22.2 ms1;
(e) 0.0045 m2
2. (a) 1.44 108 km; (b) 9.47 1015 m; (c) 1.27 light seconds
3. (a) 2.00; (b) 0.00209; (c) 0.00950; (d) 3.14
4. (a) Nm2kg2 or m3kg1s2 (in base units)
5. (a) 0.012, 0.016, 0.050(b) 1.2 %, 1.6 %, 5.0 % (c) 5300 200 cm2
(d) 105 000 8000 cm3
T k
6. (a) 0.234 0.051kg; (b) time period 2 T k .
7. (a) Systematic errors affect all readings in the same way (e.g. constant
addition or subtraction) and can be corrected for if the error is
known (e.g. subtracting a zero error). Random errors cannot be
predicted and affect each data point independently.
(b) Repeat reading several times and use an average value (neglecting
obvious anomalies). (c) 0.32 mm
8. (a) 5.5 103; (b) 7 1010; (c) 1.2 1024
662 • FOUNDATIONS OF PHYSICS
9. (a) 9.0 1015; (b) 9.0 103; (c) 2.0 108; (d) 2.0 1016
10. (a) 3.3 107 s or 1 year 20.6 days; (b) 4.0 109 s or 126 years 275.8 days.
(c) 2.8 1013 s or 900 000 years;
(d) 9.5 1020 s or 3.0 1013 years (>> age of Universe!)
−1.2 m
Velocity
4.0 ms−1
Time
−1
7.5 ms
Acceleration
Time
−9.8 ms−2
APPENDIX D: SOLUTIONS TO EXERCISES • 663
1. All of them.
2. (a) 2.83 N at 45 to left of vertical; (b) 5.10 N at 32.1 left of vertical
3. 6.2 N
4. 0.75 N
5. (a) FA 1225 N; FB 1715 N; (b) 2940 N
6. (a) S 0.466; (b) K 0.443
(c) Force
2.69 N
2.56 N
Time
(b) There is an interaction between the bullet and gun when the
propellant explodes. The forward force on the bullet is equal to the
backward force on the gun (N3).
(c) To accelerate upwards there must be a resultant upward force on
you (N2). This arises because the contact force from the floor of the
lift is greater than your weight: FC mg ma. You do not feel your
weight but you do feel the contact force and it is this that makes you
feel “heavier.”
(d) These are opposite ends of the same interaction so they must be
equal (N3).
3. (a) 0.50 ms2 to the right; (b) a 19.4 ms2 to the right
4. (a) 30 000 kgms1; (b) 0.16 kgms1
5. 1.3 ms1 to the right
6. (a) v1 0.594 ms1; v2 0.316 ms1; (b) Initial KE 0.32 mJ and final
KE 0.23 mJ so the collision was inelastic. 094 mJ has been transferred
to other forms in the collision.
7. (a) 2500 N; (b)(i) Power needed to overcome drag is unchanged but
additional power must be supplied to increase the GPE of the car as it
rises; (ii) 111 kW
mu2
8. (a) s uT
2B
(b) mu2
s = uT +
2B
Distance
mu2
Braking distance = 2B
Thinking distance = uT
Speed
Chapter 6 Fluids
mass in each unit volume so the weight of water creating the pressure is
greater than if the water was incompressible.
2. (a) p gx; (b) F ½ glh2; (c) Integrate moments on a narrow strip
of width x and equate the result to the moment of the total force at
height a above the base. Show that a h/3.
3. (a) –; (b) The constant is dimensionless so changing its value does
not affect the balance of dimensions in the equation;(c) The Reynold
number will be too high; the flow will change from laminar to turbulent.
4. (a) Sensible estimates must be made for each quantity in the expression
for Reynold number. E.g. v 30 ms1, 1.2 kgm3, L 2 m,
2 105 Pas giving Re 4 106. This is very high and implies
turbulent flow.
(b) (i) Smooth surfaces, streamlined shape, reduced cross-sectional
area; (ii) 765 N; (iii) 8; (iv) 68 ms1; (v) Engine efficiency likely to
fall and other frictional forces will increase.
5. (a) 0.08 m3; (b) 800 N; (c) 1 N; (d) about 0.1 % so probably not
noticeable.
6. (a) 6.83 105 N; (b) 1.12 105 N
(c) Free body diagram has two upward forces: buoyancy B and drag
D(v) and one downward force, weight mg. The resultant force
F mg (B D(v)) so the downward acceleration is
a g (B D(v))/m. The weight and buoyancy forces are constant
but D increases with speed v. When the ball bearing is released
D 0 so the initial acceleration is a(0) g B/m. As D increases
the resultant force and acceleration decrease until mg (B
D(v)) a 0, so the ball bearing then falls at a constant terminal
velocity.
(d) 1.4 Pas assuming Stoke law is valid—i.e., ball bearing falling slowly
in the center of a tube of diameter >> diameter of ball.
(e) Droplets were very small. Stoke law assumes that fluid is a
continuum. For tiny droplets we need to take into account the
particle nature of the air—this affects average viscosity.
7. (a) air<< water; (b) 27 ms1; (c) 1600; (d) Assumption of laminar flow is
dubious. If flow is turbulent result is not valid.
16 pX pZ
8. pY 17
666 • FOUNDATIONS OF PHYSICS
1. (a) 6 kW; (b) Other heat losses—e.g. through ground, draughts, etc…
2. 20.2 C
3. Cu: 24.4 JC1 mol1, Al: 24.2 JC1 mol1, Fe: 25.0 JC1 mol1,
Hg: 28.1 JC1 mol1
4. 5.3 MJ
5. Molten iron cooling / state change, liquid to solid while latent heat is
dissipated at constant temperature / solid iron cooling
Chapter 9 Gases
Pressure
(b)
(a)
Volume
3. (a),(b)
(c) Slow compression: temperature is constant so U is constant. Heat
flow out of system is equal to work done on the system. Fast
compression: heat flow is minimal so work done on system increases
its internal energy, temperature, and pressure.
4. (a) 500 ms1; (b) Same mean KE at same temperature but molecular
masses are different: more massive molecules (oxygen) have lower
rms speed.
5. (a) Molecular volume can no longer be ignored.(b) Collisions become
inelastic.(c) Interactions can no longer be ignored (bonds will form).
6. –
7. (a) 0; (b) 519.5 ms1; (c) ) 519.9 ms1; (d) 7.3 1021 J
8. Internal energy U depends only on absolute temperature T. In an
isothermal change U is constant and so is T. Work done to compress the
gas equals the heat that flows out of the gas to the surroundings.
9. (a) –; (b)(i) U 0; (ii) W H
10. (a) –; (b)(i) exponential increase; (ii) exponential decay; (iii) exponential
decay; (c) Increasing T increases the B.F. so a larger fraction of
collisions exceeds the activation energy and the reaction rate increases.
11. 2.5
12. 1.3 105
13. (a) As T increases B.F. increases so a larger fraction of the electrons
have enough energy to jump to the conduction band.
E
(b) I GV const.×V×e kT so a graph of ln I against 1/T has gradient
E/k
14. (a) –; (b) 0.67; (c) 0.174 gs1
APPENDIX D: SOLUTIONS TO EXERCISES • 669
1. –
2. (a) –; (b) Newton’s laws are reversible.(c) Entropy increases so the
future is distinct from the past as long as the universe started in a low
entropy state.(d) Cosmological expansion (but this is also dependent on
the universe starting in a low entropy state).
3. (a) It was very low.(b) If all possible initial conditions are considered
equally likely the one in which the universe actually began was one of
very low probability (i.e. a microstate belonging to a very small number
of microstates).
4. –; 5. –; 6. –
Chapter 11 Oscillations
(b) N
N N
mg mg mg mg
(c) No. Ball will have tangential acceleration as well as centripetal
acceleration and resultant force is parallel to resultant acceleration.
(d) v rg
3. Velocities shown on diagram. Accelerations all equal to v2/r and directed
toward center.
2v
C
√2v
D 45° B
45°
√2v
v=0
A
4. (a) 1.82 s; (b) 0.55 Hz; (c) 3.5 rad s1; (d) 7.8 103 Js; (e) 0.054 Nm;
(f) 0.013 J
5. –
6. (a) 4.7 109 kgm2; (b) Astronaut experiences an inward contact force
which maintains his circular motion about the centerof rotation. This
feels like the reaction to a gravitational field of strength g rw2;
(c) 0.22 rad s1; (d) 94 MJ
7. (a) 0.63 Js; (b) 26.2 s
8. No external resultant torque. Angular momentum is conserved. L I.
I is reduced so increases. Work must be done by the children as they
use forces to move inwards. This increases RKE.
9. 8.3 1028 Nm
APPENDIX D: SOLUTIONS TO EXERCISES • 671
Chapter 13 Waves
Chapter 14 Light
Chapter 15 Superposition
1. (a) 0 (b) (c) /2; (d) Amplitude 2, intensity 4; (e) 0;
(f) Amplitude 1/2 , intensity ¼ (g) 0.026 m, 13.1 kHz; (h) doubles;
(i) halves; (j) the phase difference is between /2 and
2. (a) –; (b) 9.8 mm; (c)(i) maxima closer together; (ii) minima not
completely dark; (iii) no effect; (iv) maxima farther apart; (v) maxima
farther apart; (vi) clarity of fringes varies—maximum at 0 and , but no
fringes at /2 and 3/2 when polarizations are perpendicular.
3. Same fringe separation but maxima 9/4 times brighter.
4. (a) 8.28, 8.97; (b) 1.44; (c) 6 orders
(d) 1st order diffraction minimum at 25.6 and third order grating
maxima at 25.6 and 27.9 so maxima absent or very dim here.
5. –
6. (a) 0.15; (b) –; (c) –; (d) Minima closer to centerof pattern.
7. –
8. (a) 1.2 104 rad; (b) about 8 m; (c) 2.4 104 light years; (d) Limited
by other factors—e.g. retinal cell density/sensitivity; optical quality of
cornea and lens; (e) Minimum angle resolved is inversely proportional to
diameter and dscope >> deye
9. (a) Assume: separation 2 m, dmax 20 km (in practice less than this
because of other factors); (b) Needs objective diameter 30 m, so no.
10. (a) 8.82 105 kgm1; (b) 543 ms1; (c) 362 Hz; (d) 362 Hz, 724 Hz,
1086 Hz
Chapter 16 Sound
1. –
2. (a) 30 dB; (b) doubles to 60 Db
3. (a) 0.283 m; (b) –; (c) h1 37.9 cm, h2 23.8 cm, h3 9.6 cm
4. (a) Otherwise most of the incident ultrasound would reflect off the
surface of the skin. Gel matches the impedance of tissue so most is
transmitted; (b) 31 %
APPENDIX D: SOLUTIONS TO EXERCISES • 673
1. (a) –; (b) –
2. 7.0 105 ms1
3. (a) 0.26 A; (b) 3.3 105 Am2; (c) 17 1m1; (d) 0.026 1
674 • FOUNDATIONS OF PHYSICS
4. (a) B lights normally, A does not light; (b) A lights almost normally and
B lights just below normal brightness.
ER
5. (a) –; (b) V ; (c) Plot 1/V against 1/R: gradient r/E and
R r
intercept 1/E; (d) –
6. (a) copper; (b) 0.62 , 0.39 , 2.22 ; (c) 44%, 54%, 91%; (d) 0.43 ;
(e) 4.6 A
7. (a) 4.0 V, 0.020 A; (b) 3.87 V, 0.019 A
8. (a) in series: R, 2R, 3R; in parallel: R/2, R/3; seriesparallel: 2R/3, 3R/2
9. (Hint: redraw equivalent circuit as 3 6 3 groups of parallel resistors)
6R/5
10. (a) Dark 5.9 V, Light 0.55 V; (b) For high values the range of voltages
becomes smaller. This makes the device less sensitive but more linear.
11. ammeter 1: 0.060 A, ammeter 2: 0.135 A, ammeter 3: 0.075 A; voltme-
ter: 3.0 V
Chapter 19 Capacitors
1. (a) 1.32 mC; (b) 0.00396 J; (c) charge doubles, energy quadruples
2. (a) 0.103 s; (b) 0.5 s; (c) no change, time constant is the same;
(d) 30.7 k, 330 A
3. (a) exponential decay from I0 V/R; (b) V1 constant, V2 exponential
decay, V3 growing from zero at decaying rate toward V.
4. (a) 0.00040 C, 0.0016 J; (b) Q1 0.00013 C, Q2 0.00027 C;
(c) 0.00053 J energy dissipated as heat in connecting wires as charge is
redistributed; (d) Q1 0.00040 C, Q2 0.00080 C, Qtot 0.00120 C;
(e) 0.0048 J, cell does extra work to charge both capacitors to same
voltage
5. Series: C, C/2, C/3; parallel: 2C, 3C; seriesparallel: 2C/3, 3C/2
6. Charge is halved; current forced back through cell as plates separate;
energy halved; voltage constant.
7. 0.0011 C, 2.34 V
APPENDIX D: SOLUTIONS TO EXERCISES • 675
1. (a) 9.8 N; (b) 2.72 103 N; (c) 3600 and 3602—i.e. the ratios are the
same so it is consistent.
(d) –; (e) –; (f) –
APPENDIX D: SOLUTIONS TO EXERCISES • 677
1. (a) –; (b) –; (c) –; (d) –; (e) 4.6 1014 m; (f) scattering is purely
electrostatic / Coulomb law is valid
2. Can control energy using an accelerator, Electrons do not feel the
strong nuclear force but alpha particle do so for close approach
electrons are only scattered by electrostatic forces.
3. (a) Element Symbol Atomic Nucleon Protons Electrons Neutrons
number number
1
1H
Hydrogen 1 1 1 1 0
12
Carbon-12 6C 6 12 6 6 6
13
Carbon-13 6C 6 13 6 6 7
14
Carbon-14 6C 6 14 6 6 8
16
Oxygen-16 8O 8 16 8 8 8
56
Iron-56 26 Fe 26 56 26 26 30
157
Gold- 79 Au 79 197 79 79 118
235
Uranium-235 92 U 92 235 92 92 143
235
Uranium-238 92 U 92 238 92 92 146
(b) E.g., carbon isotopes: same atomic number but different mass
number
(c) Electrostatic repulsion between protons is a cumulative long range
repulsion. Strong nuclear interaction between neutrons and protons
is short range. For a large nucleus more neutrons are needed to
stabilize the nucleus against the Coulomb force.
4. (a) 20 cpm; (b) radioactive decay is a random process so it fluctuates;
(c) 8 h; (d) alpha and gamma
5. (a) 1.22 104 y1 or 3.85 1012 s1; (b) 22 800 y assuming no
contamination with “younger” carbon; (c) 19,000 years
6. 2
APPENDIX D: SOLUTIONS TO EXERCISES • 679
7. Time / Years P Q
0 N 0
1000 N/2 N/2
2000 N/4 3N/4
3000 N/8 7N/8
5000 N/32 31N/32
11. Half-life approx. 185 s, decay constant approx. 3.7 103 s1
1. –
2. B.E. 470 MeV, B.E./A 8.3 MeV/nucleon
235
3. (a) 92 U → 231 4
90Th 2 ; (b) Use conservation of momentum:
KETh 81 keV; (c) 0.0052 u
16
4. (a) 6C → 167 N 01 00
(b) 8.0 MeV; (c) Energy is shared randomly with anti-neutrino
(d) Neutron rich nucleus. Large mass defect for decay. High probability
of decay and therefore short half-life.
5. (a) 137 N → 136 C 01 00 (1); 137 N → 138 O 01 00 (2)
(b) For (1) m 0.0024 u, for (2) m 0.019 u, so mass of products
in (2) is greater than mass of original nucleus so that reaction cannot
proceed spontaneously.
680 • FOUNDATIONS OF PHYSICS
(c) 7.5 MeV / nucleon, 7.7 MeV / nucleon—both are stable and
nitrogen-15 might be expected to be more abundant. In fact
nitrogen-14 is much more abundant because the odd-odd
configuration is much more stable than an odd-even configuration
of nucleons, allowing protons and neutrons to pair up.
(d) t1/2 (C 16) > t1/2 (C 17) > t1/2 (C 18). Neutron excess
destabilizes nucleus because neutrons are themselves unstable.
14
6. (a) –; (b)(i) 6C → 147 N 01 00 ; (ii) 01 n → 11 p 01 00 ;
(iii) 21/3 1/3 0 0
/3 d → 1/3 u 1 0
(c) Yes—proton (uud) changes to a neutron (udd) so an up quark
changes to a down quark.
8. –;
9. –
54 eV
10. En
n2
Chapter 28 Astrophysics
1. (a) 5.0 107 m; (b) 620 Wm2; (c) 7.0 108 m; (d) Peak of curve shifts to
shorter wavelengths. 2.90 108 m (UV)
2. (a) 3500 K; (b) 5.7 1011 m
3. (a) 0.05 c moving away; (b) 7.2 light years
4. (a) –; (b) 3.09 1016 m, 3.26 light years; (c) parallax angle becomes too
small to measure; (d) 19.2 parsecs
5. (a) 10700 light years; (b) 26 Mpc
6. –
7. (a) 9.7 1010 Hz; (b) 9.7 1010 Hz
(c) Shift for rocket is caused by Doppler Effect due to relative motion,
shift for galaxy is caused by cosmological expansion of space.
(d) 4.3 108 years
8. –
1. –; 2. –
2. (a) –; (b) 3.1 1011 m; (c) –
3. 4. –
4. (a) Photons have momentum. In the CM frame the electron and
positron have net zero momentum sop if only a single photon was
emitted this would violate the law of conservation of momentum. In the
CM frame a pair of identical photons are emitted in opposite directions.
5. 2.4 1012 m
GLOSSARY
would move) regardless of the sign(s) of the charge carriers that make up
the current.
Copenhagen Interpretation: mainstream interpretation of quantum
mechanics in which the state of a system is described by a wave function
and there is a collapse of the wave function when an observation or
measurement is made.
Copernican model: heliocentric (Sun at the center) model of the solar
system
Cosmic microwave background radiation: black body radiation
corresponding to a temperature of about 2.7 K present throughout the
universe. One of the key pieces of evidence supporting the idea of a Big
bang and the expanding universe.
Cosmology: the study of the origin nature and end of the entire universe.
QQ
Coulomb law: force law between electric charges: F 1 2 2 .
4 0 r
Couple: moment of a pair of forces of equal magnitude, acting in opposite
directions through different points in the same body: couple magnitude
of one force distance between lines of action of the two forces.
Creep: gradually increasing strain with constant stress.
Crystalline materials: having long range geometric microstructure,
e.g., metals.
CT (computed tomography): medical imaging technique using X-rays
to create detailed images of slices of the body.
Curie temperature: temperature above which thermal motions prevent
the permanent magnetization of a ferromagnetic material.
Damping force: force opposing motion causing the oscillator to do work
as it oscillates and resulting a decay of amplitude and energy.
De Broglie relation: fundamental relationship between the wave and
Planck constant
particle models of radiation and matter: wavelength = momentum .
Decibel scale: logarithmic scale of relative intensity used to compare
intensity levels with the level at the threshold of human hearing
I
(I0): intensity level dB 10 log10
I0
Degeneracy pressure: the Pauli Exclusion Principleprevents fermions
(half-integer spin particles) from existing in the same set of quantum states,
so that when a gas of fermions is compressed the lower energy states
become filled and it exerts an outward pressure.
GLOSSARY • 689
EMF: energy transferred from other forms to electrical energy per unit
charge passing through a power supply, measured in volts.
Energy availability: extent to which energy within a system can be
harnessed to do useful work.
Equilibrium: situation in which the resultant force and resultant moment
on a body are both zero.
Equipartition of energy: hypothesis that when energy is supplied to a
thermodynamic system each degree of freedom gets an average energy of
½kT.
Equipotential surfaces: surfaces perpendicular to electric field lines.
No work is done on or by a charged particle when it moves from one point
on an equipotential surface to another.
Equivalence principle: Einstein’s thesis that the laws of physics in a
freely falling reference frame are indistinguishable from the laws of physics
in a region of uniform gravitational field.
Error bars: drawn as a vertical or horizontal bar on either side of each
plotted point to indicate the range of uncertainty in that point. Error bars
can then be used to find the worst acceptable lines and the range in gradient
and intercept.
Escape velocity: minimum velocity that will allow an object to escape
from a point in a gravitational field to infinity (neglecting effects of non-
gravitational forces such as atmospheric friction).
Event horizon: surface surrounding a black hole such that no matter or
radiation can escape from within this surface and no event occurring inside
this surface can have an effect on an observer in the outside universe.
Exponential change: growth or decay that changes by a constant
proportion in a constant time—e.g., activity of a radioactive source has a
constant half-life. Described mathematically by an equation of the form:
y Aex .
Extension: difference between unstretched length and stretched length,
e.g., of a wire.
Faraday cage: a conducting box (often a metallic mesh box) enclosing a
region of space and preventing the transmission of electromagnetic waves
into or out of the box.
Faraday law: fundamental law of electromagnetic induction equating
the induced emf to rate of change of flux-linkage.
692 • FOUNDATIONS OF PHYSICS
Heat death: idea that, as entropy continues to increase, the far future of
the universe will be characterized by an equilibrium state in which energy
availability has fallen to zero.
Heat engine: engine designed to extract useful work from a heat
reservoir.
Heat pumps: system in which work is used to pump heat from a heat
reservoir at lower temperature to a heat reservoir at a higher temperature.
Heating: energy transfer as a result of a temperature difference.
Heisenberg’s uncertainty principle: principle that sets a limit on
how much can be known about certain pairs of variables, e.g., position
and momentum or energy and time. For example, the more precisely we
determine the location of an electron the smaller the uncertainty in its
position but the larger the uncertainty in its momentum.
Hertzsprung–Russell diagram: chart displaying luminosity against
spectral type (or inverse surface temperature) of stars, revealing several
distinct groups or bands including: the main sequence, red giants, and
white dwarf stars.
Homogeneous equation: an equation in which quantities, units, and
dimensions balance.
Hubble law: relationship between speed of recession and distance for
galaxies v H0d.
Hubble time: reciprocal of the Hubble constant, a rough approximation
of the age of the universe.
Hydrostatic pressure: pressure due to a stationary fluid.
Hysteresis: e.g., when the force to load a sample is different from force
as it is unloaded, a cycle of loading and unloading results in a closed loop on
a graph of force against extension. This is a hysteresis loop. The area of the
loop is related to the energy dissipated by the sample during the process.
Ideal fluid: incompressible inviscid fluid.
Ideal gas equation: equation of state for an ideal gas, incorporating all
three gas laws: pV nRT.
Ideal gas: theoretical model of a gas whose equation of state is pV nRT.
Impedance: ratio of the peak voltage to the peak current in an A.C.
system even though these values occur at different times. Resistance
and reactance are special cases of impedance when the phase difference
between voltage and current is 0 or /2 respectively. Measured in ohms.
GLOSSARY • 695
velocity, given by m r i i
2
and measured in kgm2. Its value depends on the
i1
axis about which the body rotates.
Moment: turning effect of a force in Nm. Also called a torque.
Monatomic gas: gas consisting of individual atoms acting as particles
with no internal degrees of freedom.
Monochromatic: light consisting of a single wavelength (single “color”).
Monoenergetic: particles having a single energy.
Motor effect: force on a current-carrying conductor when placed into a
magnetic field such that there is a component of the field perpendicular to
the current.
Mutual inductance: when two coils are close together, a changing
current in either coil creates a changing magnetic field that affects the other
coil and induces an emf in it. The strength of this effect is measured by the
mutual inductance of the system of two coils in henries (H).
Natural frequency: frequency of free oscillations when the oscillator is
displaced and released.
Neutral point: point in space where the fields caused by two or more
sources cancels out.
Neutron star: fate of a heavy star that has formed a planetary nebula. Its
core continues to collapse beyond the white dwarf stage until it is prevented
from further collapse by neutron degeneracy pressure.
GLOSSARY • 699
Stiff: material that has a large stress to strain ratio (large Young
modulus)—i.e., hard to stretch.
Strain energy: energy stored because of deformation, e.g., in a stretched
spring.
Strong: large breaking force (for a sample) or large breaking stress (UTS)
for a material.
Sum-over-histories: approach to quantum theory suggested by Richard
Feynman in which all possible paths contribute a phasor and the square of
the resultant phasor at each point represents the probability of the process
taking place.
Supernova: explosion of a massive star at the end of its life.
Superposition: when two or more waves are present at the same point
in space the resultant disturbance is the vector sum of the disturbances due
to each wave.
Symmetry principle: when an operation carried out on a system leaves
it unchanged.
Systematic error: measurement error that affects all measurements in
the same way—e.g., making them all too large or too small by the same
quantity or proportion. If the error is known it can be corrected for (e.g., by
subtracting a constant value from each measurement).
Tensile strain: ratio of extension to original length. Dimensionless.
Tensile stress: ratio of axial force applied to cross-sectional area of
sample perpendicular to the force, measured in Nm2.
Thermal conduction: transfer of heat as a result of particle to particle
interactions.
Thermal equilibrium: when two objects are at the same temperature
and, if placed in thermal contact, there is no net transfer of heat between
them.
Thermal radiation: emission of electromagnetic radiation with a
spectrum that depends on the temperature of the emitting body.
Thought experiment: an imagined experiment used to explore the
implications of theory, e.g., Schrodinger’s cat or the twin paradox.
Tidal forces: differential forces arising because of the difference in
gravitational force across the diameter of an orbiting body. Tidal forces
tend to distort the body along and perpendicular to the line joining it to the
body around which it orbits.
706 • FOUNDATIONS OF PHYSICS
nuclear fusion, 523, 529–530, 532, 535, 541, photon theory, 555, 558, 587
547, 592, 594, 596, 604, 701 Planck, 553
nuclear stability, 527 Planck constant
nuclear transformations, 511 measuring, 556
nucleon number, 499 planetary nebula, 592, 698
Nucleons, 523 plastic, 138
nucleus, 496 plutonium, 534–535
null result, 475 plutonium-239, 532, 535
number of ways Poincaré recurrence, 194, 700
entropy, 191 polarization, 256
polarization by reflection and
O scattering, 259
polarizing filters, 258
Oersted, Hans Christian, 390
polymers, 139
Ohm, 354
Popper, 635
Ohm’s Law, 354
positron, 487, 513, 541, 619, 621,
optical fibers, 255
681, 685
optical infinity, 274
Positron Emission Tomography (PET scans),
orbital motion, 454
619, 700
orbital quantum number, 574
potential dividers, 364
orbitals, 574
power transfer
Otto Cycle, 176
electrical, 363
precision, 638
P predictable rays for thin lenses, 270
parallel plate capacitor, 377 prefixes, 5
parity symmetry, 20 pressure law (Gay Lussac’s law), 160
past hypothesis, 200 pressurized water reactor, 534
Pauli Exclusion Principle, 575, 688 principal quantum number, 570
Penzias, Arno, 284 principle of moments, 61
Penzias and Wilson, 603 projectile motion, 41
period-luminosity relation protons, 542
for Cepheids, 599 Ptolemaic system, 455
Perlmutter, Saul, 546 Ptolemy, 455, 456
permanent magnets, 389 p-type semiconductor, 345
permeability of free space, 268, 397, 567 pulsar, 468
permittivity of free space, 4, 268, 333,
567, 692
phase and phase difference, 207
Q
phase velocity, 245, 248 quantization of angular
phasor, 228 momentum, 572
phasors, 219, 228, 287, 293, 296, 423–435, quantization of energy, 554
563, 582–583 quantum atom, 568
photocell, 556 quantum chromodynamics, 543
photoelectric effect, 552, 554–556, quantum electrodynamics, 4, 542–543
587, 613 quantum of energy, 553
photomultiplier tube, 615 quark flavors, 542
716 • INDEX
spectrometer, 299 T
spectroscopy, 297
spectrum telescope
absorption, 298 astronomical reflecting, 280
line, band and continuous, 298 astronomical refracting, 278
speed of light temperature
constancy, 269 defined, 194
measuring, 266 temperature scales, 144
speed of sound, 168, 181, 309, 313–314, tesla, 391
317–318, 320, 654, 683 thermal conductivities, 147
in a gas, 168 thermal energy, 143
spherical aberration, 277 thermal neutrons, 533
spin quantum number, 574 thermistor, 355
spontaneous process, 509 thermometer
spring tides, 462 types, 144
Sputnik 1, 458 thin lenses, 269
stability, 63, 65 thought experiment, 464, 478, 585, 588–589,
standard candles, 601 703, 706
Standard Model, 544–545 threshold frequency, 555
standing (stationary) waves, 305 time constant
standing waves in air charging or discharging
columns, 315 capacitors, 380
standing waves on a string, 306 time dilation
stars, 592 special relativity, 477
stars as black bodies, 594 time period, 244
statistical interpretation time reversal symmetry, 20
of quantum theory, 566 Torque, 233
statistical thermodynamics, 185 torque, meaning, 60
Stefan-Boltzmann law, 150 total internal reflection, 254
stellar spectra, 595 toughness, 138
Stoke’s law, 3 transformer equation, 418
stopping voltage, 558 transformers, 419
strain, 133 transverse waves, 245
stress travelling or progressive wave, 244
tensile, 133 triangle of forces, 57
strong nuclear force trigonometric parallax, 597
(color force), 543 Trinity Test, 623
sum-over-histories, 577, 581, 583 triple point of water, 145
superconductor, 368–369 tritium, 535
supernova, 592–593, 601, 605, 704 twin paradox, 480
superposition, 287
surface density, 503 U
suvat ultimate tensile strength, 137
equations derived, 39 ultrasound, 318
symmetry, 19 ultrasound (sonography), 607
Szilard, 530 ultraviolet catastrophe, 551, 554
718 • INDEX