Classical Mechanics (PDFDrive) 2
Classical Mechanics (PDFDrive) 2
1 Survey of the
Elementary Principles
The motion of material bodies formed the subject of some of the earliest research
pursued by the pioneers of physics. From their efforts there has evolved a vast
field known as analytical mechanics or dynamics, or simply, mechanics. In the
present century the term “classical mechanics” has come into wide use to denote
this branch of physics in contradistinction to the newer physical theories, espe-
cially quantum mechanics. We shall follow this usage, interpreting the name to
include the type of mechanics arising out of the special theory of relativity. It is
the purpose of this book to develop the structure of classical mechanics and to
outline some of its applications of present-day interest in pure physics. Basic to
any presentation of mechanics are a number of fundamental physical concepts,
such as space, time, simultaneity, mass, and force. For the most part, however,
these concepts will not be analyzed critically here; rather, they will be assumed as
undefined terms whose meanings are familiar to the reader.
Let r be the radius vector of a particle from some given origin and v its vector
velocity:
dr
v= . (1.1)
dt
The linear momentum p of the particle is defined as the product of the particle
mass and its velocity:
p = mv. (1.2)
In consequence of interactions with external objects and fields, the particle may
experience forces of various types, e.g., gravitational or electrodynamic; the vec-
tor sum of these forces exerted on the particle is the total force F. The mechanics
of the particle is contained in Newton’s second law of motion, which states that
there exist frames of reference in which the motion of the particle is described by
the differential equation
dp
F= ≡ ṗ, (1.3)
dt
1
or
d
F= (mv). (1.4)
dt
In most instances, the mass of the particle is constant and Eq. (1.4) reduces to
dv
F=m = ma, (1.5)
dt
where a is the vector acceleration of the particle defined by
d 2r
a= . (1.6)
dt 2
The equation of motion is thus a differential equation of second order, assuming
F does not depend on higher-order derivatives.
A reference frame in which Eq. (1.3) is valid is called an inertial or Galilean
system. Even within classical mechanics the notion of an inertial system is some-
thing of an idealization. In practice, however, it is usually feasible to set up a co-
ordinate system that comes as close to the desired properties as may be required.
For many purposes, a reference frame fixed in Earth (the “laboratory system”)
is a sufficient approximation to an inertial system, while for some astronomical
purposes it may be necessary to construct an inertial system (or inertial frame) by
reference to distant galaxies.
Many of the important conclusions of mechanics can be expressed in the form
of conservation theorems, which indicate under what conditions various mechan-
ical quantities are constant in time. Equation (1.3) directly furnishes the first of
these, the
Conservation Theorem for the Linear Momentum of a Particle: If the total force,
F, is zero, then ṗ = 0 and the linear momentum, p, is conserved.
L = r 3 p, (1.7)
where r is the radius vector from O to the particle. Notice that the order of the
factors is important. We now define the moment of force or torque about O as
N = r 3 F. (1.8)
The equation analogous to (1.3) for N is obtained by forming the cross product of
r with Eq. (1.4):
d
r3F = N = r3 (mv). (1.9)
dt
Equation (1.9) can be written in a different form by using the vector identity:
d d
(r 3 mv) = v 3 mv + r 3 (mv), (1.10)
dt dt
where the first term on the right obviously vanishes. In consequence of this iden-
tity, Eq. (1.9) takes the form
d dL
N= (r 3 mv) = ≡ L̇. (1.11)
dt dt
Note that both N and L depend on the point O, about which the moments are
taken.
As was the case for Eq. (1.3), the torque equation, (1.11), also yields an imme-
diate conservation theorem, this time the
Next consider the work done by the external force F upon the particle in going
from point 1 to point 2. By definition, this work is
! 2
W12 = F ? ds. (1.12)
1
For constant mass (as will be assumed from now on unless otherwise specified),
the integral in Eq. (1.12) reduces to
! ! !
dv m d 2
F ? ds = m ? v dt = (v ) dt,
dt 2 dt
and therefore
m 2
W12 = (v − v12 ). (1.13)
2 2
The scalar quantity mv 2 /2 is called the kinetic energy of the particle and is
denoted by T , so that the work done is equal to the change in the kinetic energy:
W12 = T2 − T1 . (1.14)
If the force field is such that the work W12 is the same for any physically
possible path between points 1 and 2, then the force (and the system) is said to be
conservative. An alternative description of a conservative system is obtained by
imagining the particle being taken from point 1 to point 2 by one possible path
and then being returned to point 1 by another path. The independence of W12 on
the particular path implies that the work done around such a closed circuit is zero,
i.e.:
"
F ? ds = 0. (1.15)
F ? ds = −d V
or
∂V
Fs = − ,
∂s
which is equivalent to Eq. (1.16). Note that in Eq. (1.16) we can add to V any
quantity constant in space, without affecting the results. Hence the zero level of V
is arbitrary.
For a conservative system, the work done by the forces is
W12 = V1 − V2 . (1.17)
T1 + V1 = T2 + V2 , (1.18)
is then no longer the total change in −V during the displacement, since V also
changes explicitly with time as the particle moves. Hence, the work done as the
particle goes from point 1 to point 2 is no longer the difference in the function V
between those points. While a total energy T + V may still be defined, it is not
conserved during the course of the particle’s motion.
(e)
where Fi stands for an external force, and F ji is the internal force on the ith
particle due to the jth particle (Fii , naturally, is zero). We shall assume that the
(e)
Fi j (like the Fi ) obey Newton’s third law of motion in its original form: that the
forces two particles exert on each other are equal and opposite. This assumption
(which does not hold for all types of forces) is sometimes referred to as the weak
law of action and reaction.
Summed over all particles, Eq. (1.19) takes the form
d2 # # (e) #
2
m i ri = Fi + F ji . (1.20)
dt
i i, j i
i̸= j
The first sum on the right is simply the total external force F(e) , while the second
term vanishes, since the law of action and reaction states that each pair Fi j + F ji
is zero. To reduce the left-hand side, we define a vector R as the average of the
radii vectors of the particles, weighted in proportion to their mass:
$ $
m i ri m i ri
R= $ = . (1.21)
mi M
The vector R defines a point known as the center of mass, or more loosely as the
center of gravity, of the system (cf. Fig. 1.1). With this definition, (1.20) reduces
to
d 2 R # (e)
M = Fi ≡ F(e) , (1.22)
dt 2
i
mi
Center of mass
ri R mj
rj
which states that the center of mass moves as if the total external force were
acting on the entire mass of the system concentrated at the center of mass.
Purely internal forces, if the obey Newton’s third law, therefore have no effect
on the motion of the center of mass. An oft-quoted example is the motion of
an exploding shell—the center of mass of the fragments traveling as if the
shell were still in a single piece (neglecting air resistance). The same princi-
ple is involved in jet and rocket propulsion. In order that the motion of the
center of mass be unaffected, the ejection of the exhaust gases at high veloc-
ity must be counterbalanced by the forward motion of the vehicle at a slower
velocity.
By Eq. (1.21) the total linear momentum of the system,
# dri dR
P= mi =M , (1.23)
dt dt
is the total mass of the system times the velocity of the center of mass. Conse-
quently, the equation of motion for the center of mass, (1.23), can be restated as
the
We obtain the total angular momentum of the system by forming the cross
product ri 3 pi and summing over i. If this operation is performed in Eq. (1.19),
there results, with the aid of the identity, Eq. (1.10),
# # d # (e)
#
(ri 3 ṗi ) = (ri 3 pi ) = L̇ = ri 3 Fi + ri 3 F ji . (1.24)
dt
i i i i, j
i̸= j
FIGURE 1.2 The vector ri j between the ith and jth particles.
The last term on the right in (1.24) can be considered a sum of the pairs of the
form
ri 3 F ji + r j 3 Fi j = (ri − r j ) 3 F ji , (1.25)
using the equality of action and reaction. But ri − r j is identical with the vector
ri j from j to i (cf. Fig. 1.2), so that the right-hand side of Eq. (1.25) can be written
as
ri j 3 F ji .
If the internal forces between two particles, in addition to being equal and
opposite, also lie along the line joining the particles—a condition known as
the strong law of action and reaction—then all of these cross products vanish.
The sum over pairs is zero under this assumption and Eq. (1.24) may be written in
the form
dL
= N(e) . (1.26)
dt
The time derivative of the total angular momentum is thus equal to the moment
of the external force about the given point. Corresponding to Eq. (1.26) is the
the internal forces in addition be central. Many of the familiar physical forces,
such as that of gravity, satisfy the strong form of the law. But it is possible to
find forces for which action and reaction are equal even though the forces are not
central (see below). In a system involving moving charges, the forces between
the charges predicted by the Biot-Savart law may indeed violate both forms of
the action and reaction law.* Equations (1.23) and (1.26), and their corresponding
conservation theorems, are not applicable in such cases, at least in the form given
here. Usually it is then possible to find some generalization of P or L that is
conserved. Thus, in an isolated system of moving charges it is the sum of the
mechanical angular momentum and the electromagnetic “angular momentum” of
the field that is conserved.
Equation (1.23) states that the total linear momentum of the system is the same
as if the entire mass were concentrated at the center of mass and moving with it.
The analogous theorem for angular momentum is more complicated. With the
origin O as reference point, the total angular momentum of the system is
#
L= ri 3 pi .
i
Let R be the radius vector from O to the center of mass, and let ri′ be the radius
vector from the center of mass to the ith particle. Then we have (cf. Fig. 1.3)
ri = ri′ + R (1.27)
FIGURE 1.3 The vectors involved in the shift of reference point for the angular
momentum.
*If two charges are moving uniformly with parallel velocity vectors that are not perpendicular to the
line joining the charges, then the net mutual forces are equal and opposite but do not lie along the
vector between the charges. Consider, further, two charges moving (instantaneously) so as to “cross
the T,” i.e., one charge moving directly at the other, which in turn is moving at right angles to the first.
Then the second charge exerts a nonvanishing magnetic force on the first, without experiencing any
magnetic reaction force at that instant.
and
vi = vi′ + v
where
dR
v=
dt
is the velocity of the center of mass relative to O, and
dri′
vi′ =
dt
is the velocity of the ith particle relative to the center of mass of the system. Using
Eq. (1.27), the total angular momentum takes on the form
% &
# # # d #
L= R 3 mi v + ri′ 3 m i vi′ + m i ri′ 3v + R3 m i ri′ .
dt
i i i i
$
The last two terms in this expression vanish, for both contain the factor m i ri′ ,
which, it will be recognized, defines the radius vector of the center of mass in the
very coordinate system whose origin is the center of mass and is therefore a null
vector. Rewriting the remaining terms, the total angular momentum about O is
#
L = R 3 Mv + ri′ 3 pi′ . (1.28)
i
In words, Eq. (1.28) says that the total angular momentum about a point O is
the angular momentum of motion concentrated at the center of mass, plus the
angular momentum of motion about the center of mass. The form of Eq. (1.28)
emphasizes that in general L depends on the origin O, through the vector R. Only
if the center of mass is at rest with respect to O will the angular momentum be
independent of the point of reference. In this case, the first term in (1.28) vanishes,
and L always reduces to the angular momentum taken about the center of mass.
Finally, let us consider the energy equation. As in the case of a single particle,
we calculate the work done by all forces in moving the system from an initial
configuration 1, to a final configuration 2:
#! 2 #! 2
(e)
#! 2
W12 = Fi ? dsi = Fi ? dsi + F ji ? dsi . (1.29)
i 1 i 1 i, j
1
i̸= j
#! 2 #! 2 #! 2 ' (
1 2
Fi ? dsi = m i v̇i ? vi dt = d m i vi .
1 1 1 2
i i i
Hence, the work done can still be written as the difference of the final and initial
kinetic energies:
W12 = T2 − T1 ,
1#
T = m i vi2 . (1.30)
2
i
1#
T = m i (v + vi′ ) ? (v + vi′ )
2
i
% &
1# 1# d #
= m i v2 + m i vi′2 + v ? m i ri′ ,
2 2 dt
i i i
and by the reasoning already employed in calculating the angular momentum, the
last term vanishes, leaving
1 1#
T = Mv 2 + m i vi′2 (1.31)
2 2
i
The kinetic energy, like the angular momentum, thus also consists of two parts:
the kinetic energy obtained if all the mass were concentrated at the center of mass,
plus the kinetic energy of motion about the center of mass.
Consider now the right-hand side of Eq. (1.29). In the special case that the
external forces are derivable in terms of the gradient of a potential, the first term
can be written as
# ! 2 (e) #! 2 # ))2
Fi ? dsi = − ∇i Vi ? dsi = − Vi )) ,
i 1 i 1 i 1
where the subscript i on the del operator indicates that the derivatives are with
respect to the components of ri . If the internal forces are also conservative, then
the mutual forces between the ith and jth particles, Fi j and F ji , can be obtained
from a potential function Vi j . To satisfy the strong law of action and reaction, Vi j
can be a function only of the distance between the particles:
Vi j = Vi j (| ri − r j |). (1.32)
∇i Vi j = ∇i j Vi j = −∇ j Vi j ,
and
The factor 12 appears in Eq. (1.35) because in summing over both i and j each
member of a given pair is included twice, first in the i summation and then in the
j summation.
From these considerations, it is clear that if the external and internal forces are
both derivable from potentials it is possible to define a total potential energy, V ,
of the system,
# 1#
V = Vi + Vi j , (1.36)
2
i i, j
i̸= j
such that the total energy T + V is conserved, the analog of the conservation
theorem (1.18) for a single particle.
The second term on the right in Eq. (1.36) will be called the internal potential
energy of the system. In general, it need not be zero and, more important, it may
vary as the system changes with time. Only for the particular class of systems
known as rigid bodies will the internal potential always be constant. Formally,
a rigid body can be defined as a system of particles in which the distances ri j
are fixed and cannot vary with time. In such case, the vectors dri j can only be
perpendicular to the corresponding ri j , and therefore to the Fi j . Therefore, in a
rigid body the internal forces do no work, and the internal potential must remain
constant. Since the total potential is in any case uncertain to within an additive
constant, an unvarying internal potential can be completely disregarded in dis-
cussing the motion of the system.
1.3 CONSTRAINTS
From the previous sections one might obtain the impression that all problems in
mechanics have been reduced to solving the set of differential equations (1.19):
(e)
#
m i r̈i = Fi + F ji .
j
One merely substitutes the various forces acting upon the particles of the system,
turns the mathematical crank, and grinds out the answers! Even from a purely
physical standpoint, however, this view is oversimplified. For example, it may be
necessary to take into account the constraints that limit the motion of the system.
We have already met one type of system involving constraints, namely rigid bod-
ies, where the constraints on the motions of the particles keep the distances ri j
unchanged. Other examples of constrained systems can easily be furnished. The
beads of an abacus are constrained to one-dimensional motion by the supporting
wires. Gas molecules within a container are constrained by the walls of the ves-
sel to move only inside the container. A particle placed on the surface of a solid
sphere is subject to the constraint that it can move only on the surface or in the
region exterior to the sphere.
Constraints may be classified in various ways, and we shall use the following
system. If the conditions of constraint can be expressed as equations connecting
the coordinates of the particles (and possibly the time) having the form
f (r1 , r2 , r3 , . . . , t) = 0, (1.37)
then the constraints are said to be holonomic. Perhaps the simplest example of
holonomic constraints is the rigid body, where the constraints are expressed by
equations of the form
(ri − r j )2 − ci2j = 0.
Constraints not expressible in this fashion are called nonholonomic. The walls
of a gas container constitute a nonholonomic constraint. The constraint involved
in the example of a particle placed on the surface of a sphere is also nonholo-
nomic, for it can be expressed as an inequality
r 2 − a2 ≥ 0
(where a is the radius of the sphere), which is not in the form of (1.37). Thus, in
a gravitational field a particle placed on the top of the sphere will slide down the
surface part of the way but will eventually fall off.
Constraints are further classified according to whether the equations of con-
straint contain the time as an explicit variable (rheonomous) or are not explicitly
dependent on time (scleronomous). A bead sliding on a rigid curved wire fixed
in space is obviously subject to a scleronomous constraint; if the wire is moving
in some prescribed fashion, the constraint is rheonomous. Note that if the wire
moves, say, as a reaction to the bead’s motion, then the time dependence of the
constraint enters in the equation of the constraint only through the coordinates
of the curved wire (which are now part of the system coordinates). The overall
constraint is then scleronomous.
Constraints introduce two types of difficulties in the solution of mechanical
problems. First, the coordinates ri are no longer all independent, since they are
connected by the equations of constraint; hence the equations of motion (1.19)
are not all independent. Second, the forces of constraint, e.g., the force that the
wire exerts on the bead (or the wall on the gas particle), is not furnished a pri-
ori. They are among the unknowns of the problem and must be obtained from the
solution we seek. Indeed, imposing constraints on the system is simply another
method of stating that there are forces present in the problem that cannot be spec-
ified directly but are known rather in terms of their effect on the motion of the
system.
In the case of holonomic constraints, the first difficulty is solved by the intro-
duction of generalized coordinates. So far we have been thinking implicitly in
terms of Cartesian coordinates. A system of N particles, free from constraints,
has 3N independent coordinates or degrees of freedom. If there exist holonomic
constraints, expressed in k equations in the form (1.37), then we may use these
equations to eliminate k of the 3N coordinates, and we are left with 3N − k inde-
pendent coordinates, and the system is said to have 3N − k degrees of freedom.
This elimination of the dependent coordinates can be expressed in another way,
by the introduction of new, 3N − k, independent variables q1 , q2 , . . . , q3N −k in
terms of which the old coordinates r1 , r2 , . . . , r N are expressed by equations of
the form
r1 = r1 (q1 , q2 , . . . , q3N −k , t)
..
. (1.38)
r N = r N (q1 , q2 , . . . , q3N −k , t)
its orientation. Yet we cannot reduce the number of coordinates, for the “rolling”
condition is not expressible as a equation between the coordinates, in the manner
of (1.37). Rather, it is a condition on the velocities (i.e., the point of contact is
stationary), a differential condition that can be given in an integrated form only
after the problem is solved.
A simple case will illustrate the point. Consider a disk rolling on the horizontal
x y plane constrained to move so that the plane of the disk is always vertical.
The coordinates used to describe the motion might be the x, y coordinates of the
center of the disk, an angle of rotation φ about the axis of the disk, and an angle
θ between the axis of the disk and say, the x axis (cf. Fig 1.5). As a result of the
constraint the velocity of the center of the disk, v, has a magnitude proportional
to φ̇,
v = a φ̇,
where a is the radius of the disk, and its direction is perpendicular to the axis of
the disk:
ẋ = v sin θ,
ẏ = −v cos θ.
d x − a sin θ dφ = 0,
(1.39)
dy + a cos θ dφ = 0.
Neither of Eqs. (1.39) can be integrated without in fact solving the problem; i.e.,
we cannot find an integrating factor f (x, y, θ, φ) that will turn either of the equa-
tions into exact differentials (cf. Derivation 4).* Hence, the constraints cannot be
reduced to the form of Eq. (1.37) and are therefore nonholonomic. Physically we
*In principle, an integrating factor can always be found for a first-order differential equation of con-
straint in systems involving only two coordinates and such constraints are therefore holonomic. A
familiar example is the two-dimensional motion of a circle rolling on an inclined plane.
can see that there can be no direct functional relation between φ and the other
coordinates x, y, and θ by noting that at any point on its path the disk can be
made to roll around in a circle tangent to the path and of arbitrary radius. At the
end of the process, x, y, and θ have been returned to their original values, but φ
has changed by an amount depending on the radius of the circle.
Nonintegrable differential constraints of the form of Eqs. (1.39) are of course
not the only type of nonholonomic constraints. The constraint conditions may
involve higher-order derivatives, or may appear in the form of inequalities, as we
have seen.
Partly because the dependent coordinates can be eliminated, problems involv-
ing holonomic constraints are always amenable to a formal solution. But there is
no general way to attack nonholonomic examples. True, if the constraint is nonin-
tegrable, the differential equations of constraint can be introduced into the prob-
lem along with the differential equations of motion, and the dependent equations
eliminated, in effect, by the method of Lagrange multipliers.
We shall return to this method at a later point. However, the more vicious cases
of nonholonomic constraint must be tackled individually, and consequently in the
development of the more formal aspects of classical mechanics, it is almost invari-
ably assumed that any constraint, if present, is holonomic. This restriction does
not greatly limit the applicability of the theory, despite the fact that many of the
constraints encountered in everyday life are nonholonomic. The reason is that the
entire concept of constraints imposed in the system through the medium of wires
or surfaces or walls is particularly appropriate only in macroscopic or large-scale
problems. But today physicists are more interested in atomic and nuclear prob-
lems. On this scale all objects, both in and out of the system, consist alike of
molecules, atoms, or smaller particles, exerting definite forces, and the notion of
constraint becomes artificial and rarely appears. Constraints are then used only
as mathematical idealizations to the actual physical case or as classical approxi-
mations to a quantum-mechanical property, e.g., rigid body rotations for “spin.”
Such constraints are always holonomic and fit smoothly into the framework of the
theory.
To surmount the second difficulty, namely, that the forces of constraint are
unknown a priori, we should like to so formulate the mechanics that the forces
of constraint disappear. We need then deal only with the known applied forces.
A hint as to the procedure to be followed is provided by the fact that in a particular
system with constraints, i.e., a rigid body, the work done by internal forces (which
are here the forces of constraint) vanishes. We shall follow up this clue in the
ensuing sections and generalize the ideas contained in it.
the forces and constraints may be changing. Suppose the system is in equilibrium;
i.e., the total force on each particle vanishes, Fi = 0. Then clearly the dot product
Fi ? δri , which is the virtual work of the force Fi in the displacement δri , also
vanishes. The sum of these vanishing products over all particles must likewise be
zero:
#
Fi ? δri = 0. (1.40)
i
As yet nothing has been said that has any new physical content. Decompose Fi
into the applied force, Fi(a) , and the force of constraint, fi ,
(a)
Fi = Fi + fi , (1.41)
We now restrict ourselves to systems for which the net virtual work of the
forces of constraint is zero. We have seen that this condition holds true for rigid
bodies and it is valid for a large number of other constraints. Thus, if a particle is
constrained to move on a surface, the force of constraint is perpendicular to the
surface, while the virtual displacement must be tangent to it, and hence the virtual
work vanishes. This is no longer true if sliding friction forces are present, and we
must exclude such systems from our formulation. The restriction is not unduly
hampering, since the friction is essentially a macroscopic phenomenon. On the
other hand, the forces of rolling friction do not violate this condition, since the
forces act on a point that is momentarily at rest and can do no work in an infinites-
imal displacement consistent with the rolling constraint. Note that if a particle is
constrained to a surface that is itself moving in time, the force of constraint is
instantaneously perpendicular to the surface and the work during a virtual dis-
placement is still zero even though the work during an actual displacement in the
time dt does not necessarily vanish.
We therefore have as the condition for equilibrium of a system that the virtual
work of the applied forces vanishes:
#
Fi(a) ? δri = 0. (1.43)
i
Equation (1.43) is often called the principle of virtual work. Note that the coef-
ficients of δri can no longer be set equal to zero; i.e., in general Fi(a) ̸= 0, since
the δri are not completely independent but are connected by the constraints. In
order to equate the coefficients to zero, we must transform the principle into a
form involving the virtual displacements of the qi , which are independent. Equa-
tion (1.43) satisfies our needs in that it does not contain the fi , but it deals only
with statics; we want a condition involving the general motion of the system.
and, making the same resolution into applied forces and forces of constraint, there
results
# (a) #
(Fi − ṗi ) ? δri + fi ? δri = 0.
i i
We again restrict ourselves to systems for which the virtual work of the forces of
constraint vanishes and therefore obtain
# (a)
(Fi − ṗi ) ? δri = 0, (1.45)
i
which is often called D’Alembert’s principle. We have achieved our aim, in that
the forces of constraint no longer appear, and the superscript (a) can now be
dropped without ambiguity. It is still not in a useful form to furnish equations
of motion for the system. We must now transform the principle into an expression
involving virtual displacements of the generalized coordinates, which are then
independent of each other (for holonomic constraints), so that the coefficients of
the δqi can be set separately equal to zero.
The translation from ri to q j language starts from the transformation equations
(1.38),
ri = ri (q1 , q2 , . . . , qn , t) (1.45′ )
(assuming n independent coordinates), and is carried out by means of the usual
“chain rules” of the calculus of partial differentiation. Thus, vi is expressed in
terms of the q̇k by the formula
dri # ∂ri ∂ri
vi ≡ = q̇k + . (1.46)
dt ∂qk ∂t
k
Similarly, the arbitrary virtual displacement δri can be connected with the virtual
displacements δqi by
# ∂ri
δri = δq j . (1.47)
∂q j
j
Note that no variation of time, δt, is involved here, since a virtual displacement
by definition considers only displacements of the coordinates. (Only then is the
virtual displacement perpendicular to the force of constraint if the constraint itself
is changing in time.)
In terms of the generalized coordinates, the virtual work of the Fi becomes
# # ∂ri
Fi ? δri = Fi ? δq j
∂q j
i i, j
#
= Q j δq j , (1.48)
j
where the Q j are called the components of the generalized force, defined as
# ∂ri
Qj = Fi ? . (1.49)
∂q j
i
Note that just as the q’s need not have the dimensions of length, so the Q’s do
not necessarily have the dimensions of force, but Q j δq j must always have the
dimensions of work. For example, Q j might be a torque N j and dq j a differential
angle dθ j , which makes N j dθ j a differential of work.
We turn now to the other other term involved in Eq. (1.45), which may be
written as
# #
ṗi ? δri = m i r̈i ? δri .
i i
In the last term of Eq. (1.50) we can interchange the differentiation with respect
to t and q j , for, in analogy to (1.46),
' ( # ∂ 2 ri
d ∂ri ∂ ṙi ∂ 2 ri
= = q̇k + ,
dt ∂q j ∂q j ∂q j ∂qk ∂q j ∂t
k
∂vi
= ,
∂q j
by Eq. (1.46). Further, we also see from Eq. (1.46) that
∂vi ∂ri
= . (1.51)
∂ q̇ j ∂q j
and the second term on the left-hand side of Eq. (1.45) can be expanded into
, - % &. % & /
# d ∂ #1 ∂ #1
2 2
m i vi − m i vi − Q j δq j .
dt ∂ q̇ j 2 ∂q j 2
j i i
$
Identifying i 21 m i vi2 with the system kinetic energy T , D’Alembert’s principle
(cf. Eq. (1.45)) becomes
# 0* d ' ∂ T ( ∂ T + 1
− − Q j δq j = 0. (1.52)
dt ∂ q̇ j ∂q j
j
Fi = −∇i V.
which is exactly the same expression for the partial derivative of a function
−V (r1 , r2 , . . . , r N , t) with respect to q j :
∂V
Qj ≡ − . (1.54)
∂q j
The equations of motion in the form (1.55) are not necessarily restricted to conser-
vative systems; only if V is not an explicit function of time is the system conserva-
tive (cf. p. 4). As here defined, the potential V does not depend on the generalized
velocities. Hence, we can include a term in V in the partial derivative with respect
to q̇ j :
' (
d ∂(T − V ) ∂(T − V )
− = 0.
dt ∂ q̇ j ∂q j
L = T − V, (1.56)
Lagrange’s equations can be put in the form (1.57) even if there is no potential
function, V , in the usual sense, providing the generalized forces are obtained from
a function U (q j , q̇ j ) by the prescription
' (
∂U d ∂U
Qj = − + . (1.58)
∂q j dt ∂ q̇ j
In such case, Eqs. (1.57) still follow from Eqs. (1.53) with the Lagrangian given
by
L = T − U. (1.59)
F = q [E + (v × B)] . (1.60)
Both E(x, y, z, t) and B(x, y, z, t) are continuous functions of time and position
derivable from a scalar potential φ(x, y, z, t) and a vector potential A(x, y, z, t)
by
∂A
E = −∇φ − (1.61a)
∂t
and
B = ∇ × A. (1.61b)
The force on the charge can be derived from the following velocity-dependent
potential energy
U = qφ − qA ? v, (1.62)
so the Lagrangian, L = T − U , is
L = 12 mv 2 − qφ + qA ? v. (1.63)
The total time derivative of A x is related to the partial time derivative through
d Ax ∂ Ax
= + v ? ∇ Ax
dt ∂t
∂ Ax ∂ Ax ∂ Ax ∂ Ax
= + vx + vy + vz . (1.65)
∂t ∂x ∂y ∂z
where L contains the potential of the conservative forces as before, and Q j rep-
resents the forces not arising from a potential. Such a situation often occurs when
frictional forces are present. It frequently happens that the frictional force is pro-
portional to the velocity of the particle, so that its x-component has the form
F f x = −k x vx .
where the summation is over the particles of the system. From this definition it is
clear that
∂F
F f xi = − ,
∂vxi
or, symbolically,
F f = −∇v F. (1.68)
We can also give a physical interpretation to the dissipation function. The work
done by the system against friction is
2 3
dW f = −F f ? dr = −F f ? v dt = k x vx2 + k y v 2y + k z vz2 dt.
Hence, 2F is the rate of energy dissipation due to friction. The component of the
generalized force resulting from the force of friction is then given by
# ∂ri # ∂ri
Qj = F fi ? =− ∇v F ?
∂q j ∂q j
i
# ∂ ṙi
=− ∇v F ? , by (1.51),
∂ q̇ j
∂F
=− . (1.69)
∂ q̇ j
An example is Stokes’ law, whereby a sphere of radius a moving at a speed v,
in a medium of viscosity η experiences the frictional drag force F f = −6π ηav.
The Lagrange equations with dissipation become
' (
d ∂L ∂L ∂F
− + = 0, (1.70)
dt ∂ q̇ j ∂q j ∂ q̇ j
so that two scalar functions, L and F, must be specified to obtain the equations
of motion.
The previous sections show that for systems where we can define a Lagrangian,
i.e., holonomic systems with applied forces derivable from an ordinary or gen-
eralized potential and workless constraints, we have a very convenient way of
setting up the equations of motion. We were led to the Lagrangian formulation
by the desire to eliminate the forces of constraint from the equations of motion,
and in achieving this goal we have obtained many other benefits. In setting up the
original form of the equations of motion, Eqs. (1.19), it is necessary to work with
many vector forces and accelerations. With the Lagrangian method we only deal
with two scalar functions, T and V , which greatly simplifies the problem.
A straightforward routine procedure can now be established for all problems
of mechanics to which the Lagrangian formulation is applicable. We have only to
write T and V in generalized coordinates, form L from them, and substitute in
(1.57) to obtain the equations of motion. The needed transformation of T and V
from Cartesian coordinates to generalized coordinates is obtained by applying the
transformation equations (1.38) and (1.45′ ). Thus, T is given in general by
⎛ ⎞2
#1 #1 # ∂ri ∂ri⎠
T = m i vi2 = mi ⎝ q̇ j + .
2 2 ∂q j ∂t
i i j
It is clear that on carrying out the expansion, the expression for T in generalized
coordinates will have the form
# 1#
T = M0 + M j q̇ j + M jk q̇ j q̇k , (1.71)
2
j j,k
where M0 , M j , M jk are definite functions of the r’s and t and hence of the q’s
and t. In fact, a comparison shows that
# 1 ' ∂ri (2
M0 = mi ,
2 ∂t
i
and
# ∂ri ∂ri
M jk = mi ? .
∂q j ∂qk
i
Thus, the kinetic energy of a system can always be written as the sum of three
homogeneous functions of the generalized velocities,
T = T0 + T1 + T2 , (1.73)
x = r cos θ
y = r sin θ.
ẋ = ṙ cos θ − r θ̇ sin θ,
ẏ = ṙ sin θ + r θ̇ cos θ.
dr = r̂ dr + r û dθ + k̂ dz
for the differential position vector, dr, in cylindrical coordinates, restricted to the
plane z = 0 where r̂ and û are unit vectors in the r and u-directions, respec-
tively, the components of the generalized force can be obtained from the defini-
tion, Eq. (1.49),
∂r
Qr = F ? = F ? r̂ = Fr ,
∂r
∂r
Qθ = F ? = F ? r û = r Fθ ,
∂θ
since the derivative of r with respect to θ is, by the definition of a derivative, a
vector in the direction of θ̂ (cf. Fig. 1.6). There are two generalized coordinates,
and therefore two Lagrange equations. The derivatives occurring in the r equation
are
' (
∂T ∂T d ∂T
= mr θ̇ 2 , = m ṙ , = m r̈ ,
∂r ∂ ṙ dt ∂ ṙ
m r̈ − mr θ̇ 2 = Fr ,
the second term being the centripetal acceleration term. For the θ equation, we
have the derivatives
∂T ∂T d 2 2 3
= 0, = mr 2 θ̇, mr θ̇ = mr 2 θ̈ + 2mr ṙ θ̇ ,
∂θ ∂ θ̇ dt
d 2 2 3
mr θ̇ = mr 2 θ̈ + 2mr ṙ θ̇ = r Fθ .
dt
Note that the left side of the equation is just the time derivative of the angular
momentum, and the right side is exactly the applied torque, so that we have simply
rederived the torque equation (1.26), where L = mr 2 θ̇ and N (e) = r Fθ .
2. Atwood’s machine—(See Fig. 1.7) an example of a conservative system
with holonomic, scleronomous constraint (the pulley is assumed frictionless and
massless). Clearly there is only one independent coordinate x, the position of
the other weight being determined by the constraint that the length of the rope
between them is l. The potential energy is
T = 1
2 (M1 + M2 ) ẋ 2 .
L =T −V = 1
2 (M1 + M2 ) ẋ 2 + M1 gx + M2 g(l − x).
(M1 + M2 ) ẍ = (M1 − M2 ) g,
or
M1 − M2
ẍ = g,
M1 + M2
which is the familiar result obtained by more elementary means. This trivial prob-
lem emphasizes that the forces of constraint—here the tension in the rope—
appear nowhere in the Lagrangian formulation. By the same token, neither can
the tension in the rope be found directly by the Lagrangian method.
3. A bead (or ring) sliding on a uniformly rotating wire in a force-free space.
The wire is straight, and is rotated uniformly about some fixed axis perpendicular
to the wire. This example has been chosen as a simple illustration of a constraint
being time dependent, with the rotation axis along z and the wire in the x y plane.
The transformation equations explicitly contain the time.
While we could then find T (here the same as L) by the same procedure used to
obtain (1.71), it is simpler to take over (1.75) directly, expressing the constraint
by the relation θ̇ = ω:
8 9
T = 12 m ṙ 2 + r 2 ω2 .
m r̈ − mr ω2 = 0
or
r̈ = r ω2 ,
which is the familiar simple harmonic oscillator equation with a change of sign.
The solution r = eωt for a bead initially at rest on the wire shows that the
bead moves exponentially outwards. Again, the method cannot furnish the force
of constraint that keeps the bead on the wire. Equation (1.26) with the angular
momentum, L = mr 2 ω = mωr02 e2ωt , provides the force F = N /r , which pro-
duces the constraint force, F = 2mr0 ω2 eωt , acting perpendicular to the wire and
the axis of rotation.
DERIVATIONS
1. Show that for a single particle with constant mass the equation of motion implies the
following differential equation for the kinetic energy:
dT
= F ? v,
dt
while if the mass varies with time the corresponding equation is
d(mT )
= F ? p.
dt
2. Prove that the magnitude R of the position vector for the center of mass from an
arbitrary origin is given by the equation
# 1#
M 2 R2 = M m i ri2 − m i m j ri2j .
2
i i̸= j
(where θ, φ, and φ ′ have meanings similar to those in the problem of a single vertical
disk, and (x, y) are the coordinates of a point on the axle midway between the two
wheels) and one holonomic equation of constraint,
a
θ =C− (φ − φ ′ ),
b
where C is a constant.
6. A particle moves in the x y plane under the constraint that its velocity vector is always
directed towards a point on the x axis whose abscissa is some given function of time
f (t). Show that for f (t) differentiable, but otherwise arbitrary, the constraint is non-
holonomic.
7. Show that Lagrange’s equations in the form of Eqs. (1.53) can also be written as
∂ Ṫ ∂T
−2 = Q j.
∂ q̇ j ∂q j
These are sometimes known as the Nielsen form of the Lagrange equations.
8. If L is a Lagrangian for a system of n degrees of freedom satisfying Lagrange’s equa-
tions, show by direct substitution that
d F(q1 , . . . , qn , t)
L′ = L +
dt
also satisfies Lagrange’s equations where F is any arbitrary, but differentiable, func-
tion of its arguments.
9. The electromagnetic field is invariant under a gauge transformation of the scalar and
vector potential given by
A → A + !ψ(r, t),
1 ∂ψ
φ→φ− ,
c ∂t
where ψ is arbitrary (but differentiable). What effect does this gauge transformation
have on the Lagrangian of a particle moving in the electromagnetic field? Is the motion
affected?
10. Let q1 , . . . , qn be a set of independent generalized coordinates for a system of n
degrees of freedom, with a Lagrangian L(q, q̇, t). Suppose we transform to another
set of independent coordinates s1 , . . . , sn by means of transformation equations
qi = qi (s1 , . . . , sn , t), i = 1, . . . , n.
In other words, the form of Lagrange’s equations is invariant under a point transfor-
mation.
EXERCISES
U (r, v) = V (r ) + s ? L,
where r is the radius vector from a fixed point, L is the angular momentum about that
point, and s is a fixed vector in space.
(a) Find the components of the force on the particle in both Cartesian and spherical
polar coordinates, on the basis of Eq. (1.58).
(b) Show that the components in the two coordinate systems are related to each other
as in Eq. (1.49).
(c) Obtain the equations of motion in spherical polar coordinates.
16. A particle moves in a plane under the influence of a force, acting toward a center of
force, whose magnitude is
% &
1 ṙ 2 − 2r̈r
F = 2 1− ,
r c2
where r is the distance of the particle to the center of force. Find the generalized
potential that will result in such a force, and from that the Lagrangian for the motion
in a plane. (The expression for F represents the force between two charges in Weber’s
electrodynamics.)
17. A nucleus, originally at rest, decays radioactively by emitting an electron of momen-
tum 1.73 MeV/c, and at right angles to the direction of the electron a neutrino with
momentum 1.00 MeV/c. (The MeV, million electron volt, is a unit of energy used in
modern physics, equal to 1.60 × 10−13 J. Correspondingly, MeV/c is a unit of lin-
ear momentum equal to 5.34 × 10−22 kg ? m/s.) In what direction does the nucleus
recoil? What is its momentum in MeV/c? If the mass of the residual nucleus is
3.90 × 10−25 kg what is its kinetic energy, in electron volts?
18. A Lagrangian for a particular physical system can be written as
m8 2 9 K 8 2 9
L′ = a ẋ + 2b ẋ ẏ + c ẏ 2 − ax + 2bx y + cy 2 ,
2 2
where a, b, and c are arbitrary constants but subject to the condition that b2 − ac ̸= 0.
What are the equations of motion? Examine particularly the two cases a = 0 = c
and b = 0, c = −a. What is the physical system described by the above Lagrangian?
Show that the usual Lagrangian for this system as defined by Eq. (1.56) is related to L ′
by a point transformation (cf. Derivation 10). What is the significance of the condition
on the value of b2 − ac?
19. Obtain the Lagrange equations of motion for a spherical pendulum, i.e., a mass point
suspended by a rigid weightless rod.
20. A particle of mass m moves in one dimension such that it has the Lagrangian
m 2 ẋ 4
L= + m ẋ 2 V (x) − V 2 (x),
12
where V is some differentiable function of x. Find the equation of motion for x(t) and
describe the physical nature of the system on the basis of this equation.
21. Two mass points of mass m 1 and m 2 are connected by a string passing through a
hole in a smooth table so that m 1 rests on the table surface and m 2 hangs suspended.
Assuming m 2 moves only in a vertical line, what are the generalized coordinates for
the system? Write the Lagrange equations for the system and, if possible, discuss
the physical significance any of them might have. Reduce the problem to a single
second-order differential equation and obtain a first integral of the equation. What is
its physical significance? (Consider the motion only until m 1 reaches the hole.)
22. Obtain the Lagrangian and equations of motion for the double pendulum illustrated in
Fig. 1.4, where the lengths of the pendula are l1 and l2 with corresponding masses m 1
and m 2 .
23. Two masses 2 kg and 3 kg, respectively, are tied to the two ends of a massless,
inextensible string passing over a smooth pulley. When the system is released,
calculate the acceleration of the masses and the tension in the string.
24. A spring of rest length L a (no tension) is connected to a support at one end and has
a mass M attached at the other. Neglect the mass of the spring, the dimension of the
mass M, and assume that the motion is confined to a vertical plane. Also, assume that
the spring only stretches without bending but it can swing in the plane.
(a) Using the angular displacement of the mass from the vertical and the length
that the string has stretched from its rest length (hanging with the mass m), find
Lagrange’s equations.
(b) Solve these equations for small stretching and angular displacements.
(c) Solve the equations in part (a) to the next order in both stretching and angular
displacement. This part is amenable to hand calculations. Using some reasonable
assumptions about the spring constant, the mass, and the rest length, discuss the
motion. Is a resonance likely under the assumptions stated in the problem?
(d) (For analytic computer programs.) Consider the spring to have a total mass
m ≪ M. Neglecting the bending of the spring, set up Lagrange’s equations
correctly to first order in m and the angular and linear displacements.
(e) (For numerical computer analysis.) Make sets of reasonable assumptions of the
constants in part (a) and make a single plot of the two coordinates as functions of
time.