[Encyclopedia of Physical Science and Technology'', Physical Chemistry] Robert Allen Meyers - Encyclopedia of Physical Science and Technology, 3e, Physical Chemistry (2006, Academic Press) - Libgen.li
[Encyclopedia of Physical Science and Technology'', Physical Chemistry] Robert Allen Meyers - Encyclopedia of Physical Science and Technology, 3e, Physical Chemistry (2006, Academic Press) - Libgen.li
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
Chemical Kinetics,
Experimentation
Terence J. Kemp
University of Warwick
I. Introduction
II. Mathematical Manipulation of Rate Data
III. Computer Modeling of Reaction Systems
IV. General Methodology of Reaction Kinetics
V. Flow Systems
VI. Pulse and Shock Methods
VII. Relaxation Methods
VIII. Spectral Line-Broadening Methods
IX. Electrochemical Methods
721
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
Relaxation Process where by a molecule returns from a etc., indicate the concentrations of the various reactants;
high-energy state to its normal state of lower energy. α, β, etc. are the orders of reaction with respect to [A],
Relaxation time Time for the fraction 1/e of excited [B], etc., respectively; α + β + · · · represents the total re-
molecules to return by relaxation to their normal state. action order; and k is the rate constant. The units of k are
dependent on the total reaction order; from Eq. (3) it can
be seen that they are
EXPERIMENTATION IN CHEMICAL KINETICS is
the methodology of measuring the rates of chemical re- (mol dm−3 sec−1 )/(mol dm−3 )α+β+···
actions to yield rate constants, reaction orders, and acti- and for total reaction orders of zero, one, two, and three the
vation parameters with the ultimate goal of establishing units of k are mol dm−3 sec−1 , sec−1 , dm3 mol−1 sec−1 ,
the mechanism of a particular reaction. It covers classical and dm6 mol−2 sec−1 , respectively.
approaches based on conventional sampling and analyti- It is important to note that while the rate of reaction
cal methods, and the use of high pressures as well as the depends on the concentrations of the reactants, the rate
impressive array of fast reaction methods now available constant is independent of these and is the parameter com-
through developments in magnet technology, electronics, monly referred to in discussion of reaction kinetics. While
lasers, and small dedicated computers. These are grouped for normal reaction systems the rate constant is naturally
under flow systems, pulse and shock methods, relaxation independent of time, for systems featuring an initially in-
methods, analysis of spectroscopic line-broadening, and homogeneous distribution of reactants—as, for example,
electrochemical methods. along the track of an α-particle or laser pulse immediately
after discharge—the rate constant varies with time until
homogeneity is achieved.
I. INTRODUCTION
1. Series first-order reactions: where dozens of processes are involved, some chemical
k1 k2 and others, such as diffusion or turbulence, being physi-
A→B→C (18) cal. Here the task is not to express the situation exactly,
2. Reversible first-order reactions: as in Eqs. (18)–(26), but rather to approximate to the ob-
k1 served state by examining the effects of gradually intro-
A B (19) ducing particular component terms. The computer is used
k−1
to generate a numerical solution from a set of given rate
(A notable example is conformational interchange by equations, rate constants, and reaction concentrations for
NMR line-broadening techniques; see Section VIII.) different reaction times, and a screen graphical presen-
3. Reversible reactions involving second-order steps in tation is often employed. This procedure is illustrated in
both directions. The following examples are taken from Fig. 1 for the concentrations-time profiles for reactants
the many studies based on line-broadening analyses from and products on a photoirradiated mixture of trans-but-2-
nuclear magnetic resonance (NMR) and electron spin res- ene, NO, NO2 , and air. The relevant equations are given
onance (ESR) spectroscopy, respectively. in Fig. 2.
(a) Proton transfer: Computer modeling is also helpful in unravelling the
k1 temperature dependence of complex systems. Fitting to
A + H3 O+ AH+ + H2 O (20) the Arrhenius equation [Eq. (1)] is simple, but the number
k−1
of parameters involved increases, and therefore the pro-
(b) Electron transfer: grams become lengthier, with the complexity of the sys-
k1 tem. Well-known variants of the simple Arrhenius equa-
A + B± A± + B (21) tion to which data can be fitted by computer are as follows:
k−1
zone 1: R· + R· → RR
k1
(22) This is often recognized from a plot of ln k against
1/T , which, instead of exhibiting a single line as for
Eq. (1), will show two linear regions intersecting in a
k2 particular, narrow temperature range.
zone 2: R· + R· → RR (23)
2. For solution reactions investigated over very wide
5. Relaxation from two excited states in equilibrium temperature ranges, especially cryogenic
(both stages first-order): temperatures when molecular diffusion can be
“frozen” at a limiting temperature T0 :
∗A
k1 ∗B
k −1 k = Ae−[B/(T −T0 )] (28)
k2 k3
A B (24)
IV. GENERAL METHODOLOGY
6. Relaxation from two excited states in equilibrium OF REACTION KINETICS
(one stage second-order):
∗A
k1 ∗AB
In relatively simple systems, where reactants proceed di-
+ B
k −1 rectly to a small number of products without the interven-
k2 k3 tion of intermediate species (and in most cases even when
A A + B these do play some role), the main problem is that of de-
(25)
termining the concentration of one or more reactants or
7. Kinetics of photobleaching of a system of ab- products as a function of time. This becomes essentially
sorbance A under steady illumination (making allowance an exercise in analytical chemistry, in that virtually all
for increased transmission, and therefore reduced rate of methods of chemical and physicochemical analysis have
photolysis, as photoreaction proceeds): been applied at some time in determining reaction rates.
−d A/dt = k(1 − 10−A ) (26)
A. Conventional Methods
Computers prove invaluable in dealing with exceed-
ingly complex kinetic situations such as are found in The design of specialized kinetic apparatus is largely de-
combustion or the chemistry of planetary atmospheres, termined by the magnitude of the reaction rates concerned.
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
FIGURE 1 Experimental (upper) and computer predictions (lower) for the photoreaction of a mixture of trans-but-2-
ene, NO, NO2 , and air. [From Kerr, J. A., Calvert, J. G., and Demerjian, K. L. (1972). Chem. Br. 8, 252.]
When reactions are “slow,” i.e., with half-lives of the order of the entire reaction mixture with respect to one or
of minutes, hours, or longer, then conventional sampling more components is monitored throughout the reaction,
methods are very appropriate; thus a thoroughly mixed re- from time to time or even continuously, by some physical
action system is made up after thermostating the separate method such as spectrophotometry, conductivity, poten-
reactants to the desired temperature, and samples are taken tiometry, or NMR. Commercial spectrophotometers and
at suitable, measured time intervals and submitted to anal- magnetic resonance spectrometers are now widely used
ysis for a particular reactant or product. Because reaction in which a kinetic mode is available for automatically de-
continues within the sample while sampling is occurring, termining optical or radiofrequency absorbances at preset
it is usually the case that one needs to “quench” the re- band maxima as a function of time: the resulting data,
action either by cooling the sample rapidly to “freeze” sometimes running to thousands of points, can be stored
reaction or by mixing the sample with another reagents, and analyzed at leisure.
which halts reaction by consuming one reactant virtually Polymerization kinetics can be followed very simply
instantaneously. using a dilatometer (a reaction vessel consisting of a bulb
This type of procedure is still widely used because of its with a capillary side-arm) to measure total reaction vol-
simplicity, particularly when more sophisticated apparatus ume, which decreases gradually but markedly as reaction
is unavailable. The analysis itself is typically by titration, proceeds.
ultraviolet (UV)–visible or infrared (IR) spectrophotom- Many “slow” gas phase reactions have been studied
etry, or gas-liquid chromatography. simply by measuring the total pressure of a “static”
An important variant of this method is to use a non- reaction system at different times. Such an approach
destructive analytical method in which the composition can be dangerously simplistic, however, and it is best
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
FIGURE 2 Reaction schemes for the photoxidation of trans-but-2-ene in the presence of NO and NO2 . (Left) Initiation
of O atoms or ozone. (Right) Initiation by hydroxyl radicals. (These are incorporated into the computer model employed
in Fig. 1.) [From Kerr, J. A., Calvert, J. G., and Demerjian, K. L. (1972). Chem. Br. 8, 252.]
supplemented by an in situ analytical method such as spec- Kinetic processes occuring within solids are inherently
troscopy or by sampling tiny quantities into a mass spec- more difficult to monitor, but ESR spectroscopy provides
trometer, if necessary via a gas chromatograph. While the a nondestructive, highly sensitive method for investigating
static system has the advantage that it is well-defined in free radical processes.
terms of temperature, concentration, surface area, and ma- The development in NMR spectroscopy of magic-angle
terial of the containing vessel, it suffers from the compar- spinning (MAS) enables solid-state transformations to be
atively small amounts of material available for analysis. determined, conventional NMR having been limited pre-
Flow systems for gases provide plenty of material but are viously to bulk physical processes such as the transition
less well-defined otherwise. An often useful compromise through the glass temperature of polymers. One nice appli-
is found in the capacity-flow technique, in which reactants cation of MAS-NMR is the kinetics of the hardening of wet
are conducted into the vessel, efficient stirring is provided, cement by following the 29 Si resonances. Optical meth-
and the reacting mixture is withdrawn at such a rate as to ods are most easily applied to thin films of materials—
set up a steady state. for example, the kinetics of polymer degradation can be
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
The rate of formation of A∗ is given by The rate of a chemical reaction is sensitive not only to
temperature [Eq. (1)] but also to pressure. Simple thermo-
d[A∗ ]/dt = Ia (33) dynamic arguments lead to Eq. (42):
The rate of disappearance of A∗ is given by ln(k/k0 ) = − pV = /RT (42)
−d[A∗ ]/dt = (kF + kNR + kQ [Q])[A∗ ] (34) Here k and k0 refer to the rate constants of the reaction at
pressures of p and l atm, respectively, and V = denotes
In the steady-state situation, these two rates are equal, i.e., the volume of activation. Accuracy in values of V = re-
Ia = (kF + kNR + kQ [Q])[A∗ ] (35) quires the use of pressures of up to several thousand atmo-
spheres, as the effect is generally rather small (a few cubic
Now the fluorescence quantum yield is given by the ratio centimeters per mole), and the reaction chamber is usually
of the number of photons emitted per second to the number of thick stainless steel. The term V = is considered in the
of photons absorbed per second: case of solution reactions to be composite, i.e., the sum of
kF [A∗ ] (1) the change in volume of the reactants in forming the
ϕF = (36) transition state and (2) the change in volume of the solvat-
Ia
ing molecules during this process; the latter is particularly
Substituting Eq. (36) into Eq. (35) yields Eq. (37): important when ions are involved as the reactants. When
kF [A∗ ] a process of molecular dissociation is involved, V = will
ϕF = (37) be positive, while when molecules associate to form the
(kF + kNR + kQ [Q])[A∗ ]
transition state, V = is negative; these effects have been
In the absence of any quencher, the fluorescence yield, used diagnostically in solution kinetic studies of inorganic
denoted ϕF0 , is given by Eq. (38): complexes, e.g., by Henry Taube.
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
Recent technical developments of the method involve are used and observations are needed at different points in
the use of the NMR method (Section VIII) at high pres- the mixed stream. A useful variant of the continuous-flow
sures, both in steady-state and stopped-flow applications. method is that in which the stream of mixed reactants is
The sample is contained in an open glass sample tube sit- conducted into a quenching medium, which terminates
uated within a metallic pressure chamber; the metal has to reaction by eliminating one reactant virtually instanta-
be nonmagnetic in this case, and beryllium–copper alloy neously. An adaptation of this idea is to pass the reacting
or titanium are the favored materials. solution into a Dewar vessel containing liquid nitrogen at
77 K when the reaction is literally frozen, and the extent
of reaction, or the presence of some intermediate, is
V. FLOW SYSTEMS ascertained by a solid-state technique such as ESR spec-
troscopy.
These have been used to great effect in both solution and
gas-phase studies. B. Stopped-Flow Methods
The disadvantages of the continuous-flow method are
A. Continuous Flow (Solutions)
largely overcome in the stopped-flow technique. After the
The first successful use of continuous-flow systems in reactants have been forced from their “drive syringes”
solution was due to Hartridge and Roughton; the principle into the mixing chamber, adjacent to the optical cell
here is that if the reaction time is of the order of a few where an analyzing light beam is situated, the emerg-
milliseconds to a few seconds, then it is impossible to mix ing mixed solution runs into another vertically mounted
the reactants manually and then record optical changes. “stop syringe,” which rises as it fills until it is arrested
Instead, the two reactants are fed by gravity or under by a mechanical stop (Fig. 3). Flow then ceases, but
pressure through narrow-bore tubes to a mixing chamber, the fast reaction occurring in the optical cell continues
after which observations are made at various points re- to completion, normally in a time of between 10 msec
mote from the mixing point in order to ascertain the extent and 10 sec. In recent adaptations the analyzing light, in-
of reaction. Alternatively, the flow rate can be varied to stead of passing into a monochromator–photomultiplier
display varying time domains of the reaction at a fixed ob- tube–oscilloscope–microcomputer assembly, is passed to
servation point. The disadvantage of the continuous-flow a diffraction granting, which disperses the light over the
configuration is that considerable quantities of reactants faces of a series of photomultiplier tubes or photodiodes
FIGURE 3 Schematic diagram of stopped-flow apparatus. A, reservoir syringes; D, drive syringes; R, hydraulic ram;
1–5, valves; S, stopping syringe; L, light source; MO, monochromator; C, quartz windows; M, mixing point; PM,
photomultiplier tube; HT, power supply to PM; MS, microswitch; W, waste; MA, microammeter; CRO, cathode-ray
oscilloscope; TR, transient recorder; CR, computer.
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
FIGURE 4 Time-resolved spectra (0–2 sec after mixing, taken at 40 msec intervals) for the reaction of Zn2+ with
2,2 :6 ,2 –terpyridine (L) in methanol:water (80%:20% v/v) at 25◦ C (Zn2+ ] = 10−4 mol dm−3 , [L] = 10 − 5 mol dm−3 ).
[From Priimov, G. U. Ph.D. thesis, University of Warwick, 1999.]
(in the so-called photodiode array), each of which is sepa- with only a small fraction entering the very low pressure
rately powered and from which the individual outputs feed (∼10−6 torr) mass spectrometer. One example of the type
into a multiplexer. The output from the letter is presented of reaction that can be examined is that between ethyl
on an oscilloscope screen as a complete optical spectrum, radical ( C2 H5 ) and oxygen, for which the stages are
x
FIGURE 5 (Top) Side and (bottom) plan views of a quenched- A. Flash Photolysis
flow apparatus. Key: RES, reservoirs (three); DS, drive syringes
(three); MS, microswitch; R, rack; P, pinion; M, motor; TI, thermo- In the technique of flash photolysis, later developed to
stating fluid inlet; TO, outlet; M1, first mixer; M2, second mixer; laser flash photolysis, an intense flash of light is absorbed
Q, quartz windows (two); W, waste outlet; BN, brass nut (three); by molecules in the system to give excited states:
PS, motor power supply; L, lamp; MO, monochromator; FL, focus-
hν
ing lens; PM, photomultiplier tube; MA, milliammeter; CB, control
M → M∗ (49)
box; CRO, cathode-ray oscilloscope; DR, data recorder; CR, chart
recorder; 1–6, taps; shaded area, PTFE blocks. [From Goodman, These then lose energy either by light emission [Eq. (50)],
P. D., Kemp, T. J., and Pinot de Moira, P. (1981). J. Chem. Soc.,
by nonradiative decay [Eq. (51)], by dissociation to give
Perkin Trans. 2, 1221.]
radical species [Eq. (52)], or by attacking the solvent, de-
noted SH [Eq. (53)]:
H + hvF −→ ∗ H x
(48)
M∗ → M + hν1 (or hνp ) (50)
The concentration of H atoms in the reaction cell can thus ∗
M → M. (51)
be monitored by their fluorescence in a repeat of Eq. (47),
and their reactions with the various molecules in the reac- M∗ → A· + B· (52)
tion cell determined. ∗ · ·
M + SH → MH + S (53)
D. Crossed Molecular Beams By monitoring the optical or ESR absorption of the species
A· , B· , MH· , or S· , it is possible to determine their kinet-
Extremely detailed kinetic information for gas-phase pro- ics for such processes as dimerization, as in Eqs. (54)
cesses is gained by allowing two streams of molecules and (55).
emanating from small ovens to intersect at right angles
and by examining the scattering intensity and velocity S· + S· → S—S (54)
distribution of the resulting fragments as a function of MH· + MH· → HM—MH (55)
the scattering angle. The experiment is carried out at high
or reaction with oxygen,
vacuum (10 − 6 to 10 − 7 torr), and the molecules in one
or both of the streams are placed in as narrow a velocity S· + O2 → SO·2 (56)
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
FIGURE 6 Schematic diagram showing combination of gas-flow tube with mass spectrometric detection. [Repro-
duced with permission from Pilling, M. J., and Seakins, P. W. (1995). “Reaction Kinetics,” Oxford University Press,
Oxford.]
While many types of experimental layout have been into the femtosecond (fs) regime, enabling, for example,
used successfully, that shown in Fig. 9 is quite typical. observation of a transition state in the breaking of the I-C
A pulsed excimer laser generates light at a wavelength bond in ICN. The concept underlying this work is schema-
that can be controlled by using different halogens or rare tized in Fig. 10. The laser system generates a pulse which
gases. The uv pulse, of duration that can also be controlled, is split into two components, a pump pulse which insti-
can be used either directly or to pump a dye laser, giving gates chemical processes and a probe pulse which mon-
extensive control of the wavelength range. The emerging itors them. The probe pulse is delayed behind the pump
pulse is then fed by mirrors, prisms, or an optical fiber to pulse by a few femtoseconds by lengthening its light path
the sample cell where it excites the molecule of interest (Fig. 10). The probe pulse causes fragments generated by
(in fluid solution or in gaseous form). the pump pulse to emit light, the characteristics of which
Species generated by Eq. (49)–(56) are monitored by provide dynamical information. An example of the funda-
means of an analyzing beam from a high-pressure point- mental character of this type of investigation is illustrated
source lamp, the output of which passes through the sam- in Fig. 11. Ion pairs Na+ I− are excited by the pump pulse
ple cell and thence through a preset monochromator to a to the excited form [Na I]∗ . The bond distance is very
photomultiplier tube or spectography or diode array. The short at the moment of formation, and the excited molecule
signal from the former, which is related to the intensity of has a covalent character. As the molecule vibrates, charge
the incident light, is taken to a cathode-ray oscilloscope moves toward the I atom, and at the furthest point of the
where it is displayed and stored as a function of time imme- stretching vibration the nuclei are 10–15 Å apart, with the
diately following the firing of the laser. The stored signal is bonding now becoming fully ionic [Na+ · · ·I− ]. The atoms
then either photographed for manual analysis or taken via continue to vibrate between these two forms:
an analog-to-digital converter to a microcomputer, where
it is stored and subsequently has largely been processed [Na − l]∗ [Na+ . . . l− ]∗ (57)
to yield rate constants.
The other main instrumental development involving At the critical distance of 6.9 Å, the energies of the ex-
lasers has been the evolution of lasers pulsed to picosec- cited and ground states are very close and there is a 20%
ond or even femtosecond time intervals by mode-locking. chance that the excited state will dissociate to give sepa-
These yield rather weak pulses and are normally used rate Na and I atoms. The femtosecond technique detects
in association with devices enabling data accumulation. a decaying signal from the excited state and an increasing
The Nobel prize for chemistry for 1999 was awarded to signal as bursts of Na atoms appear from each vibration
Ahmed Zewail for his work in extending flash photolysis (Fig. 11).
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
FIGURE 7 Schematic diagram illustrating the various processes occurring during reaction of H atoms with resonance
fluorescence detection. [Reproduced with permission from Pilling, M. J., and Seakins, P. W. “Reaction Kinetics,” Oxford
University Press, Oxford.]
Lasers are particularly well suited for the time- PM 1 is conducted to a time-to-amplitude converter (TAC),
resolution of light emission [Eq. (50)], especially in the which is triggered or “started” on its arrival. The photon ar-
form of time-correlated single-photon counting (Fig. 9). In riving at the sample cell excites a potentially luminescent
this technique, the weak output from a repetitively pulsed molecule [Eq. (49)], which, after some time, fluoresces
lamp (pulse width ∼1 nsec) or from a sub-nanosecond [Eq. (50)] or phosphoresces [Eq. (50)]. The photons due
pulsed laser is divided such that part of the pulse is taken to luminescence activate the second photodetector (PM 2
directly to one photodetector (PM 1 in Fig. 9) while the in Fig. 9), from which a pulse is fed to the TAC, which
other part is taken to the sample cell. The signal detected by is then “stopped.” The time interval between “start” and
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
B. Pulse Radiolysis
The technique of pulse radiolysis is closely related to that
of flash photolysis: the optical monitoring system is the
same, but the source of light activation is replaced by one
of high-energy radiation in the form of a short pulse of a
FIGURE 8 Schematic diagram illustrating laser flash photoly-
sis. Laser pulses pass through the light guide to the sample
few nanoseconds or microseconds of fast electrons (typi-
cell. Optical monitoring of post-pulse events can be kinetic or cally 3 MeV). The effect of such high-energy radiation is
spectrographic. to excite and ionize the material in the sample cell, which
may be in liquid, gaseous, or even (transparent) solid form.
In the case of water, the initial act of radiolysis is given by
“stop” is stored in the multichannel pulse-height analyzer Eqs. (58)–(60):
and the process is repeated for many thousands, if not hun- H 2 O → H 2 O+ + e −
x
(58)
dreds of thousands, of times per second. The accumulated
+ · +
data yield the averaged luminescent response of the sam- H2 O + H2 O → OH + H3 O (59)
ple molecule and, after computer processing, can lead to e− + H2 O → eaq
−
(60)
lifetimes with very small standard deviations.
In the more traditional form of flash photolysis, origi- Optical studies reveal the presence of the solvated electron
−
nated by R. W. Norrish and G. Porter, the flash is generated eaq as a broad, intense absorption maximizing at ∼700 nm.
FIGURE 9 Block diagram of time-correlated single photon counting apparatus. Key: LS, lamp; LH, lamp housing; S,
slits; HRM, high-radiance monochromator; C, sample cell; SH, sample housing; L, lenses; PM 1, PM 2, photomultiplier
tubes; DISC 1, DISC 2, discriminators; TAC, time-to-amplitude converter; MCAPH, multichannel pulse-height analyzer.
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
FIGURE 10 Schematic diagram illustrating layout of femtosecond flash photolysis technique. The pump and probe
pulses are separated in time by adjusting the light path of the latter. Beams of molecules in the sample tube are
excited or dissociated by the pump pulse, and the fragments monitored by the probe pulse on its path to the detector.
Reproduced with permission from Scientific American (see Bibliography).
Addition of materials capable of reacting with electrons Other developments include extension of the optical
−
reduces the lifetime of eaq , enabling kinetics of its fast re- range into the near IR and the use of cryogenic equipment
actions to be determined. The powerful oxidant ·OH (the to examine radiolysis at temperatures as low as 4.2 K.
hydroxyl radical) has only a weak absorption in the ultra-
violet, and its reactivity is best measured by a competition
C. Shock Tubes
method based on its very fast oxidation of thiocyanate ion
−
CNS− to yield the intensely absorbing (CNS)2 ion (λmax
x
(CNS)2 will be systematically reduced as [X] is increased, is generated by breaking a diaphragm that separates the
enabling a rate constant to be derived. “driver” gas, usually H2 or He at several atmospheres pres-
While optical methods remain the favored means of sure, and the reactant gas diluted in argon at a few torr
analysis in both flash photolysis and pulse radiolysis, other (1 torr = 10−6 mm Hg). The normal configuration for the
methods of detection have been used with great effective- apparatus is a hollow tube (Fig. 10), typically measur-
ness from time to time, including conductivity and ESR ing 15 cm diameter and 6 m in length, which contains,
spectroscopy. The latter technique, in association with in the reactant chamber, sensors for measuring the ve-
flash photolysis in particular, has led to the observation locity of the shock wave, and an observation point. The
of ESR signals with anomalous intensities, for example, shock front generated on rupture of the diaphragm travels
appearing totally in emission, a phenomenon described at supersonic speed (several Mach) toward the reactant
as chemically induced dynamic electron polarization or gas zone, compressing this and heating it to very high
CIDEP. temperatures (which can be calculated), in much less than
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
FIGURE 11 Femtosecond flash photolysis of gaseous sodium iodide. As the excited ion-pair vibrates, it gradually
decays (lower curve) and the resulting free atoms are detected (upper curve).
1 msec. Detection of the species in the reaction zone is duration of the pulse should be much shorter than the re-
normally optical, especially as many of these emit light laxation time τ of the system and should approximate to
from their excited states, although absorption spectra can a step-function as in Fig. 11. Interestingly, if the relative
also be measured. Strong shocks in CH4 and CH4 − NH3 change in concentrations of the reactants is very small,
yield, respectively, the emission bands of C2 and CN, while then, whatever the rate law, the reaction curve will follow
Al2 O3 dispersed as a dust yields AlO. Alternatively, the first-order kinetics described by the relaxation time τ , the
reaction zone can be sampled by allowing materials to leak time for A (the change in absorbance at time t) to fall to
through a pinhole into a mass spectrometer. a value (l/e)(A0 ) (where A0 is the final total change in
absorbance). The mathematics of rather complex systems
undergoing relaxation have been established and are given
VII. RELAXATION METHODS by Hague.
While the single-pulse methods described above are
These were developed initially by Eigen and depend on widely used, another approach is to apply the perturba-
the application of a small disturbance to a chemical sys- tion as a sine wave, in which the response will be in the
tem at equilibrium, normally by the dissipation of a pulse form of another sine wave lagging consistently behind
of energy in the temperature-jump method but also on oc- the perturbation to a degree related to the relaxation time
casion by a sudden change in pressure (pressure-jump) of the system. The frequency of the applied perturbation
or electric field. The system then adapts (Fig. 11) to its is critical: if it is too slow, then the chemical response
new situation, normally a slightly higher temperature and will “catch up” and no lag will be detected, but if it is
hence a changed equilibrium constant, at a rate that can too fast the chemical response will never attain a mea-
be measured either optically or conductimetrically. The surable amplitude before it must change direction. Only
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
in the all-important intermediate situation will the am- (υA ) CH3 H3C
O O
plitude of the chemical response be sufficient to enable
N C N C
determination of the phase lag. Perturbations can take the
form of high-frequency oscillating electric fields, but the (υB) CH3 H H3C H (62)
use of sound waves (to give the ultrasonic method) is that
most widely adopted. The attenuation of the sound wave and will resonate at different frequencies νA and νB giv-
at a particular frequency is related to the chemical relax- ing two lines. Rotation about the N C bond is slow at
ation processes occurring in the solution. The excitation room temperature, because of the partial double-bonding
is via a quartz crystal generator, and detection is by mea- [Eq. (62)], but as the temperature is increased rotation
suring optical dispersion of light transmitted through the becomes faster and finally so fast that the signals coa-
sample. lesce to a single line. Between these two extreme situa-
tions (known as slow and fast exchange, respectively) lies
a continuum of intermediate situations with line shapes
VIII. SPECTRAL LINE-BROADENING given by Eq. (63),
METHODS
g(ν) = [(1 + τ π )P + QR]/(4π 2P 2 + R 2 ) (63)
If the lifetime of a molecule is very short, say δt, then the where
uncertainty principle predicts a broadening of its absorp-
tion line δν (in hertz) given by Eq. (61). P = [0.252 − ν 2 + 0.25δν 2 )τ + /4π (64)
δν ≈ 1/(2π δt) (61) Q = [−ν − 0.5( pA − PB )δν]τ (65)
If δt is ∼0.1 sec, then δν is ∼1 Hz, i.e., in the region R = 0.5( pA − pB )δν − ν(1 + 2π τ ) (66)
of NMR spectroscopy. Accordingly, if a molecule is un-
The symbols are defined as follows: τ = τA τB /(τA + τB ),
dergoing rapid (∼0.1 sec) interchange between two con-
and τA and τB are the average lifetimes of the nuclei in
formations, or is participating in a fast exchange reaction
positions A and B, respectively; pA and pB are the mole
such as proton-transfer, then its NMR spectral linewidth
fractions of A and B, respectively; δν = νA − νB ; is the
provides a unique source of kinetic information.
width (at half-height) of the signal in the absence of ex-
We cannot develop the theory of NMR spectroscopy
change; and ν is the variable frequency. The lineshape
here, but state simply that the nuclei of a sample in a
calculations are exemplified in Fig. 12.
strong magnetic field are excited by radiofrequency radi-
The ESR spectrum of a dilute solution of a radical an-
ation of a particular frequency for each type of nucleus
ion such as naphthalene consists of many lines. Addi-
(called the Larmor frequency) into excited spin states,
tion of further naphthalene results in electron exchange
from which they return to lower spin states on losing spin
[Eq. (67)], which broadens the lines:
energy by nuclear relaxation. Various factors contribute
to such relaxation, especially the motion of surrounding k2
solvent molecules designated (even in liquids) as the “lat- C10 H÷
8 + C10 H8 C10 H8 + C10 H÷
8 (67)
k−2
tice”; this process is called spin–lattice relaxation, and the
time for l/e part of the excited nuclei to relax this way is This is a consequence of reducing the lifetime of a partic-
denoted T1 . The additional process titled “spin–spin re- ular spin state: an analysis of the lineshape yields the rate
laxation” is denoted T2 . Both of these are normally of sev- constant k2 , which equals k−2 . Radicals able to undergo
eral seconds duration in liquids, and, in accordance with conformational change show a spectral phenomenon
Eq. (61). NMR lines are accordingly very narrow in liq- called the alternating linewidth effect. A temperature-
uid samples. Chemical pathways for reducing the lifetime dependence study of this effect will yield activation pa-
reduce T2 and hence broaden the lines. Analysis of such rameters [energy and entropy, Eq. (2)] for this first-order
broadening leads to values for the kinetics of the chemical process.
pathway.
Another factor influencing the appearance of NMR
spectra is the exchange of nuclei (usually protons) be- IX. ELECTROCHEMICAL METHODS
tween positions with different Larmor frequencies. In the
case of dimethylformamide, the protons in the two methyl Electrochemical methods depend on a competition be-
groups in the planar conformation are inequivalent. tween an electrode reaction, e.g., Eq. (68),
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
FIGURE 13 Steady-state concentration distribution (reaction layer) in the case of a chemical reaction preceding an
electrode reaction. [Adapted from Koryta, J., and Dvorak, J. (1987), “Principles of Electrochemistry,” Wiley, Chichester,
England.]
P1: GJC Revised Pages
Encyclopedia of Physical Science and Technology EN002C-98 May 17, 2001 20:49
wave. The relation between the diffusion current, i d and and Medicine,” Pergamon, Oxford.
the reaction current i r provides the kinetic information via Billing, G. D., and Mikkelsen, K. V. (1996). “Molecular Dynamics and
Eq. (70) Chemical Kinetics,” Wiley, New York.
Demas, J. N. (1983). “Excited State Lifetime Measurements,” Academic
1/2 Press, New York.
ir Ck1 t 1/2
= (70) Espenson, J. H. (1995). “Chemical Kinetics and Reaction Mechanisms,”
id − ir K 1/2 2nd ed; McGraw-Hill, New York.
where C is a constant of approximately unity and t is the Greef, R., Peat, R., Peter, L. M., Pletcher, D., and Robinson, J.
(1985). “Instrumental Methods in Electrochemistry,” Ellis Horwood,
drop time.
Chichester.
Hammes, G. G., ed. (1974). “Techniques of Chemistry,” 3rd ed., Vol. VI.
Part II, Wiley (Interscience), New York.
SEE ALSO THE FOLLWING ARTICLES House, J. E. (1997). “Principles of Chemical Kinetics,” W. C. Brown,
Dubuque, Iowa.
ELECTROCHEMISTRY • KINETICS (CHEMISTRY) • NU- Lewis, E. S., ed. (1974). “Techniques of Chemistry,” 3rd ed., Vol. VI.
CLEAR MAGNETIC RESONANCE Part I, Wiley (Interscience), New York.
Ng, C.-Y., Baer, T, and Powis, I. (1994). “Unimolecular and Bimolecular
Ion-Molecule Reaction Dynamics,” Wiley, Chichester.
Pilling, M. J., and Seakins, P. W. (1995). “Reaction Kinetics,” Oxford
BIBLIOGRAPHY University Press, Oxford.
Sandström, J. (1982). “Dynamic NMR Spectroscopy,” Academic Press,
Andrews, D. L. (1992). “Lasers in Chemistry,” 2nd ed., Springer–Verlag, New York.
Berlin. Steinfield, J. I., Francisco, J. S., and Hase, W. L. (1989). “Chemical Ki-
Ashfold, M. N. R., and Baggott, J. E. (1989). “Bimolecular Collisions,” netics and Dynamics,” Prentice-Hall, Englewood Cliffs, New Jersey.
Royal Society of Chemistry, London. Wilkins, R. G. (1991). “Kinetics and Mechanism of Reactions of Tran-
Baxendale, J. H., and Busi, F. (1981). “The Study of Fast Pro- sition Metal Complexes,” VCH, Weinheim, Germany.
cesses and Transient Species by Electron Pulse Radiolysis,” Reidel, Wright, M. R. (1999). “Fundamental Chemical Kinetics,” Horwood Pub-
Dordrecht. lishing, Chichester.
Bensasson, R. V., Land, E. J., and Truscott, T. G. (1983). “Flash Photol- Zewail, A. (1990). “The Birth of Molecules,” Scientific American, De-
ysis and Pulse Radiolysis. Contributions to the Chemistry of Biology cember issue, 40–46.
P1: FYD Revised Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN002-101 May 17, 2001 20:51
Chemical Thermodynamics
J. Barthel and R. Neueder
University of Regensburg
I. Fundamentals
II. Fundamental and Caloric
Equations of Thermodynamics
III. Partial Molar Quantities
IV. Fugacities and Activities
V. Thermodynamic Equilibrium
767
P1: FYD Revised Pages
Encyclopedia of Physical Science and Technology EN002-101 May 17, 2001 20:51
A. Thermodynamic Systems and Properties According to the International Union of Pure and Applied
Chemistry (IUPAC) Commission on Symbols, Terminol-
A chemical system is an accumulation of chemical materi- ogy and Units, a mole is the amount of substance of a
als limited by a boundary. The physical world outside this system that contains as many elementary entities as there
boundary constitutes the surroundings of the system. The are carbon atoms in 0.012 kg of 12 C. The elementary enti-
system may (open system) or may not (isolated system) ties must be specified and may be atoms, molecules, ions,
exchange mass and energy with its surroundings. A sys- electrons, other particles, or specified groups of such par-
tem that exchanges energy but not mass is called a closed ticles, as shown in the following tabulation:
system.
A macroscopic property of a system may (for extensive Elementary Mass of
properties) or may not (for intensive properties) depend on entity 1 mol (g)
its mass. Examples of extensive properties are the volume
of the system or its heat capacity; examples of intensive Hg 200.59
properties are permittivity, density, and specific heat. A HgCl 236.04
Hg2+
2 401.18
system is considered homogeneous if all intensive prop- e− (electrons) 5.460 × 10−4
erties are continuous functions of position throughout the Fe0.91 S 82.88
system; otherwise it is considered heterogeneous. In a het- Mixture of 78.09 mol % N2 , 28.964
erogeneous system the discontinuities of properties are sit- 20.95 mol % O2 , 0.93 mol % argon,
uated on surfaces enclosing homogeneous regions, which and 0.03 mol % CO2
are called phases of the heterogeneous system, with the
surfaces being the phase boundaries. Phases are charac- If the complete set of variables ( p, T, n 1 , . . . , n k ) of a
terized throughout this article by Greek superscripts (e.g., k-component mixture is known, every property P of the
V (α) , V (β) ), phase boundaries by symbols such as V (α/β) . system in its equilibrium state is unequivocally defined:
A homogeneous system or a phase of a heterogeneous
P = P( p, T, n 1 , . . . , n k ) (2)
system can be a gaseous, liquid, or solid system.
Systems showing electrical potentials (Galvani poten- that is, an identical system can be prepared with the same
tials) of their phases are called electrochemical systems. properties by the specified set of variables. This statement
Galvani potentials are the result of movable charges (ions, is the basis of analytical chemistry.
electrons, etc.) within the phases. For electrochemical sys- Infinitely small changes in the variables of state entail
tems electrical potential differences are observed between infinitely small changes in properties P:
electrically conducting phases (e.g., batteries).
A system is in a state of thermodynamic equilibrium ∂P ∂P
dP = dp + dT
if during the period of observation (1) no measurable ∂ p T,ni ∂ T p,ni
changes of its intensive properties occur and (2) no mea-
k
surable flow of matter or energy takes place. The system is ∂P
in a steady state if only the first condition is fulfilled. Elec- + dn i (3)
i=1
∂n i p,T,n j =n i
trochemical systems in equilibrium show constant Galvani
potentials throughout each phase. In Eq. (3) the partial derivatives indicate the changes in
Chemical thermodynamics uses observable quantities P with one of the variables of state, the others being
for the definition of the state of a system. The state of unchanged; dP is an exact differential as a consequence of
P1: FYD Revised Pages
Encyclopedia of Physical Science and Technology EN002-101 May 17, 2001 20:51
the unequivocally defined, continuous, and continuously tional composition variables, molality (m i , moles of so-
differentiable function P( p, T, n 1 , . . . , n k ). lute Yi per kilogram of solvent), demality (also called
Applying to Eq. (3) the cross-differentiation identities molonity, m̃ i , moles of solute Yi per kilogram of solution),
known as Schwarz relations yields the relations and molarity (ci , moles of solute Yi per cubic decimeter
of solvent).
∂ 2 P/∂ T ∂ p = ∂ 2 P/∂ p ∂ T (4a) If in a mixture component Y1 is considered to be the
∂ 2 P/∂ n i ∂ p = ∂ 2 P/∂ p ∂n i solvent, Y1 = S, and Y2 , . . . , Yk are the solutes, the com-
(4b)
position variables of the solutes are defined by the rela-
∂ P/∂ n i ∂ T = ∂ P/∂ T ∂n i
2 2
(4c) tions
m i = n i /n s Ms (8a)
where (i = 1, 2, . . . , k). As a consequence of Eqs. (3) and
(4) every finite change in property P is independent of the k
process applied (path of integration) when passing from m̃ i = n i n i Mi (8b)
an arbitrary intial state I to an arbitrary final state II, i=1
II ci = n i /V (8c)
dP = PII − PI (5)
I In Eq. (8c) the volume V of the solution is given in cubic
and equals zero for a cyclic process, decimeters. Conversion formulas for the frequently used
II I scales of composition variables are given in Table I. If in
a mixture the solvent itself is a mixture of components
dP = dP + dP = 0 (6)
I II (mixed solvent), the quantities n s and Ms can be defined
as “solvent quantities” when using the relations
n s = n s1 + n s2 + · · · ,
B. Composition Variables
(9)
In technological calculations the amounts of substance n i Ms = xs1 Ms1 + xs2 Ms2 + · · · ,
in the basic set defined in the preceding section are often re- where s1, s2 denote the components making up the mixed
placed by other composition variables such as the weights solvent.
of substance wi which are also extensive variables, or Electrochemistry uses mean composition variables
by intensive variables such as weight percent (wt. %), based on the mole numbers of the ions in the solution pro-
volume percent (vol %), or mole percent (mol %). A com- duced by the electrolyte compounds that are the solutes.
monly used intensive composition variable in fundamen- If the set of extensive composition variables (n 1 , . . . ,
tal and applied research on mixed systems is the mole n k ) is replaced by a set of intensive composition variables,
fraction xi : for example, (x1 , . . . , xk ), where xi = 1, only the in-
k k tensive properties of the system investigated are defined;
xi = n i ni ; xi = 1 (7) extensive properties then are converted to appropriate in-
i=1 i=1 tensive properties, which are the corresponding molar
Mixed systems distinguishing solvent and solute compo- quantities (i.e., the amount of the extensive property per
nents are called solutions. Solution chemistry uses addi- mole of substance).
To x2 c2 m2 m̃˜ 2
M 1 c2 M1 m 2 M1 m̃ 2
x2 = x2
d + (M1 − M2 )c2 1 + M1 m 2 1 + (M1 − M2 )m̃ 2
d x2 dm 2
c2 = c2 d m̃ 2
M1 + (M2 − M1 )x2 1 + M2 m 2
x2 c2 m̃ 2
m2 = m2
M1 (1 − x2 ) d − M 2 c2 1 − M2 m̃ 2
x2 c2 m2
m̃ 2 = m̃ 2
M1 + (M2 − M1 )x2 d 1 + M2 m 2
a Subscript 1 denotes solvent; subscript 2 denotes solute; d denotes density of the solution;
C. Forms of Energy tion of silver nitrate where silver deposition and zinc dis-
solution are observed. The chemical energy of the same
Energy can be transferred to and from a system and can
process is used to a maximum as electrical energy by the
be stored in a system in different forms:
help of an appropriate galvanic cell if the electrical current
in the cell is kept infinitely small. This example illustrates
1. Mechanical energy, W mech
the following features of reversible processes:
2. Electrical energy, W el
3. Electromagnetic energy, W elm 1. They are ideal limiting cases.
4. Radiant energy, W rad 2. They are infinitely slow, passing from the initial to the
5. Heat, W heat = Q final state through a series of equilibrium states.
6. Latent heat, W lat. heat = Q lat
7. Chemical energy, W chem
D. The Fundamental Laws of Thermodynamics
These can be converted totally or partially from one to
Thermodynamics is an empirical science. The quintes-
another. By definition, any contribution that increases the
sence of all practical knowledge is condensed in three
energy of a system is counted positive.
fundamental laws (axioms), which are valid because so
Any differential element of work, DW, is not an exact
far they have not been found to be false. (Some textbooks
differential; that is, Schwarz relations are not applicable,
quote an additional “zero law” concerning the definition
and integration from the initial to the final state of the sys-
of thermodynamic temperature.)
tem depends on the path of integration (equation defining
the particular process).
If a system is subjected to a uniform, normal pressure 1. First Law
p, the work executed on this system by an infinitesimal
increase dV of its volume is In 1847 Helmholtz formulated his statement concerning
the conservation of energy and the equivalence of work
DW mech = − p d V (10) and heat: “Although energy may be converted from one
form to another, it cannot be created or destroyed.” As a
If in a galvanic cell the infinitesimal increase of charge
consequence,
dq is subject to a Galvani potential , the electrical work
is (dU )isolated system = 0 (14)
DW = el
dq (11) where
The amount of heat passing to a system during a process dU = DW i ; i = mech, el, . . . (15)
is calculated from the change in temperature dT with the
help of the heat capacity C of the system The differential dU of the total or internal energy U is an
exact differential, in contrast to the differential elements
D Q = C dT. (12) DW i of the different forms of energy. The change in in-
Heat applied to a system for a change of phases (e.g., ternal energy depends only on the initial and final states
melting) without an increase in temperature is called latent and not on the process path.
heat. Energy produced in chemical reactions is chemical
energy. Electromagnetic energy, radiant energy, as well as
2. Second Law
nuclear energy will not be discussed here.
Since all forms of energy do not yield exact differen- The second law of thermodynamics as stated by Clausius
tials, balances of energy forms require the specification of in 1854 concerns the spontaneous evolution of a system
the path of the process studied—for example, isothermal subject to processes. It defines the entropy of a system,
process (T = const) or isobaric process ( p = const). which is related to the change in heat during the process
A process in which the maximum of usable work is and the temperature by the relation
produced is called a reversible process, in contrast to irre-
d S = D Q rev /T (16)
versible processes, which waste usable energy. The usable
chemical energy of the reaction Clausius’s formulation
Zn + 2AgNO3 (aq) ←
→ Zn(NO3 )2 (aq) + 2Ag (13) (d S)isolated system ≥ 0 (17)
is wasted by transforming it to heat (unusuable energy) is the most appropriate formulation of the second law for
when a rod of solid zinc is inserted into an aqueous solu- chemical thermodynamics.
P1: FYD Revised Pages
Encyclopedia of Physical Science and Technology EN002-101 May 17, 2001 20:51
3. Third Law All the symbols in Table II have been explained in Sec-
tion I, except µi (i = 1, . . . , k), which is the chemical
The third law is a summary of conclusions concerning
potential of component Yi . In Table II electrical energy
the zero-temperature behavior of systems. Its formula-
is separated from mechanical, heat, and chemical forms
tion, that the zero-point entropy of perfect crystalline sub-
of energy, since its extensive variable q (charge) is not
stances is zero,
independent of mole numbers n i and its introduction as
S0 = 0 (18) a variable of state would require the use of compatibil-
ity equations. To summarize, the set of variables of state
as announced in 1923 by Lewis and Randall, indicates by
( p , T , n 1 , . . . , n k ) or any other set obtained by replacing
simple application of thermodynamic equations that the
one of the variables by its conjugate variable can be used
heat capacity of such systems vanishes at the zero point
to express the internal energy of the system as a function
of the thermodynamic temperature scale. Consequently,
of its state. Only two functions are relevant in thermody-
the zero point of temperature cannot be attained experi-
namics:
mentally since the thermodynamic functions expressing
maximum reversible work of a system also vanish with U = U (S , V , n 1 , . . . , n k ) (fundamental equation)
horizontal tangent. The internal energy and its Legendre
(20)
transforms (see Section II) are not determined at absolute
zero temperature. U = U (T , V , n 1 , . . . , n k ) (caloric equation) (21)
The other functions can be obtained by appropriate trans-
formations of Eqs. (20) and (21).
II. FUNDAMENTAL AND CALORIC
EQUATIONS OF THERMODYNAMICS
B. Fundamental Equation
Thermodynamics generally is based on two equations, the 1. Homogeneous Systems
fundamental and the caloric equations.
Equation (20), in which all variables of state are extensive
variables, can be written using the notation of Table II,
A. Variables of State
U = U (ξi ); ξi = V, S, n i , . . . , n k (22)
The basic set of variables of state can be extended by re-
flecting on the forms of energy, each of which is given by yielding the exact differential of the internal energy
the product of an intensive (η) and an extensive (ξ ) vari- k+2
∂U
able [see Eqs. (10) and (11)]. Forms of energy yield differ- dU = dξi (23)
ential elements DW i that are not exact differentials, but i=1
∂ξi ξ j =ξi
mathematics postulate that for such elements there exists
at least one integrating factor that converts the nonexact which entails the relations
differential form to an exact differential; for example, the (∂U/∂ V ) S,ni = − p (24a)
integrating factor of heat is T −1 , yielding entropy as the ex-
act differential d S, d S = T −1 D Q rev . Using this concept, (∂U/∂ S)V,ni = T (24b)
every differential element DW i can be expressed with the
help of the conjugate intensive and extensive variables of (∂U/∂n i ) S,V,n j =ni = µi (24c)
the corresponding form of energy (see Table II), where
because the combination of Eqs. (15) and (19) yields the
DW i = ηi d ξi . (19) relation
Extensive ξi V S n1 , . . . , nk q
Intensive ηi –p T µ1 , . . . , µ k
P1: FYD Revised Pages
Encyclopedia of Physical Science and Technology EN002-101 May 17, 2001 20:51
k +2 TABLE III Partial Derivatives of Energy Functions and
dU = ηi d ξi (25) Maxwell Relations
i =1 (∂U /∂ S)V ,ni (∂G /∂ T ) p,ni
to which Eq. (23) can be compared. = (∂ H /∂ S) p,ni = T = (∂ A /∂ T )V ,ni = −S
Because the internal energy U [Eq. (22)] is a homo- (∂U /∂ V ) S ,ni (∂G /∂ p)T ,ni
= (∂ A /∂ V )T ,ni = − p = (∂ H /∂ p) S ,ni = V
geneous function of degree 1 of the variables ξi , Euler’s
(∂ T /∂ V ) S ,ni = −(∂ p /∂ S)V ,ni (∂ T /∂ p) S ,ni = (∂ V /∂ S) p,ni
theorem of homogeneous functions yields
(∂ S /∂ V )T ,ni = (∂ p /∂ T )V ,ni (∂ S /∂ p)T ,ni = −(∂ V /∂ T ) p,ni
k +2
k +2
∂U
U= ξi = ξi ηi , (26)
i =1
∂ξi ξ j =ξi i =1
which, as compared with Eq. (25), yields the Gibbs– µi = (∂U /∂n i ) S ,V ,n j =ni = (∂ H /∂n i ) S , p,n j =ni
Duhem–Margules equation:
k = (∂ A/∂n i )T,V,n j =ni = (∂G/∂n i )T, p,n j =ni (35)
ξi d ηi = −V dp + S dT + n i d µi = 0 (27)
i =1 The exact differentials dU [Eq. (23)] and dU (m) [Eq. (30)]
The replacement of an arbitrary number of extensive vari- entail Schwarz relations. Maxwell relations and Schwarz
ables ξi (i = 1, . . . , m) by their conjugate intensive vari- relations are summarized in Table III.
ables ηi yields new energy functions U (m) of the system
U (m) = U (m) (η1 , . . . , ηm , ξm +1 , . . . , ξk +2 ). (28) 2. Heterogeneous Systems
These functions can be understood as the Legendre trans- The description of the state of a heterogeneous system
forms of Eq. (25): requires a set of variables of the type given in Table II for
every phase α. As a consequence, the internal energy of a
m
U (m) = U (ξi ) − ηi ξi (29) heterogeneous system made up by ν phases is
i =1
U = U (α) ξi(α) = U (ξ j ) (36)
The exact differential of Eq. (28) is
m when numbering the variables ξ either per phase, ξi(α) (i =
∂U (m) 1, . . . , k + 2; α = 1, . . . , ν) or throughout the system,
dU =(m)
d ηi
i =1
∂ηi ξi ,η j =ηi ξ j [ j = 1, . . . , ν(k + 2)]. The exact differential
k +2
∂U (m)
ν
k+2
ν(k+2)
+ d ξi (30) dU = ηi(α) dξi(α) = η j dξ j (37)
i =m +1
∂ξi η j ,ξ j =ξi
α=1 i=1 j=1
and entails the Maxwell relations is of the type given by Eq. (25). Hence, all equations
(m)
∂U for homogeneous systems can be used for the phases of
= −ξi (31a)
∂ηi ξ j ,η j =ηi heterogeneous systems. Equations (36) and (37) neglect
(m) the contributions of the phase boundaries U (α/β) to the
∂U total energy U of the heterogeneous system.
= ηi (31b)
∂ξi η j ,ξ j =ξi
The Legendre transforms currently used in thermodynam- 3. Electrochemical Systems
ics are enthalpy, Helmholtz energy, and Gibbs energy:
The internal energy of phase α in an electrochemical sys-
H = U + pV ; d H = V dp + T d S + µi dn i (32) tem depending on the charge q (α) of phase α and the Gal-
A = U − T S; d A = − p d V − S dT + µi dn i (33) vani potential (α) ,
G = U + pV − T S; dG = V dp − S dT + µi dn i U (α) = U (α) S (α) , V (α) , n (α) (α)
1 , . . . , nk , q
(α)
(38)
The replacement of all extensive by intensive variables dU (α) = − p (α) d V (α) + T (α) d S (α)
again yields the Gibbs–Duhem–Margules equation. + µi(α) dn i(α) + (α)
dq (α) (39)
The chemical potentials µi introduced as the conjugate
variables of mole numbers n i are given by Eqs. (24c) and in which dq (α) is dependent on mole numbers n i(α) and
(31b): charges F z i of the single species Yi . In Eq. (40) F is the
P1: FYD Revised Pages
Encyclopedia of Physical Science and Technology EN002-101 May 17, 2001 20:51
Faraday number (charge of 1 mole of electrons) and z i is intensive variable. The partial derivatives in Eq. (43) are
the electrical valence of particle Yi : material properties such as
dq (α) = F z i dn i(α) (40) (∂U/∂ T )V = Cv (heat capacity at constant volume)
A. Definitions
Extensive thermodynamic quantities Z ( p, T, n 1 , . . . , n k )
of the type given by Eq. (47), such as volume V , entropy S, FIGURE 1 Determination of partial molar quantities Z2 and
apparent molar quantities z from measurements of extensive
enthalpy H , heat capacity C p , and Gibbs energy G, yield
thermodynamic quantities Z p,T,n (n2 ) (also Z2 from Z; see
partial molar quantities: Section III.B).
Besides partial molar and apparent molar quantities, mix Z (c2 )/n 2 − mix Z (c2 )/n 2 = (c2 ) − (c2 ) (65)
thermodynamics makes use of relative quantities such as
(Z i − Z i∗ ), (Z i − Z i∞ ), or ( Z − ∞
Z ), resulting from mix- C. Thermodynamic Relations
ing or solution processes.
Using partial molar quantities Z i , the differential d Z of
the thermodynamic quantity Z at arbitrary pressure and
B. Mixing Processes temperature is
Formally, property Z before and after mixing of the com-
∂Z ∂Z k
ponents Y1 and Y2 can be written as dZ = dp + dT + Z i dn i (66)
∂ p T,ni ∂ T p,ni i=1
Z init = n 1 Z 1∗ + n 2 Z 2∗
(58) entailing Schwarz relations of the type given by Eqs. (4a)–
Z fin = n 1 Z 1 + n 2 Z 2 (4c), where P ≡ Z . Hence, the relations existing for the
or extensive quantities Z can also be used for their partial
molar quantities: for example,
Z fin = n 1 Z 1∗ + n 2 Z (59)
(∂G i /∂ T ) p = −Si (67a)
yielding the balances for the mixing process
(∂G i /∂ p)T = Vi (67b)
mix Z = n 1 Z 1 + n 2 Z 2 ; Z i = Z i − Z i∗ (60a)
(∂(G i /T )/∂ T ) p = − Hi /T 2 (67c)
or
(∂ Hi /∂ T ) p = C pi (67d)
mix Z = n 2 Z − Z 2∗ (60b)
Equations (67a)–(67d) show the particular role of partial
Here, mix Z is the integral effect of mixing: Z i = Z i − molar Gibbs energies G i . By their definition, quantities G i
Z i∗ are the differential effects. For enthalpies and Gibbs are both partial molar quantities and chemical potentials as
energies, Z = H or G, only the differential effects Z i defined by the fundamental equation of thermodynamics:
can be determined by experiments and not the partial molar
G i ( p, T ) = (∂G/∂n i ) p,T,n j =ni = µi ( p, T ) (68)
quantities Z i themselves, in contrast to volumes and heat
capacities, where both quantities are available.
Since the mixing process may be accompanied by a D. Molar Quantities of Pure Compounds
change of state of component Y2 (e.g., solid or gas state and Molar Quantities at Infinite Dilution
of pure Y2 may change to liquid state in the mixture), it is
Molar volumes of pure compounds Vi∗ and their temper-
advantageous to separate the differential effect Z 2 into
ature and pressure dependence are available from density
two steps:
measurements:
Z 2 = Z 2 − Z 2∞ + Z 2∞ − Z 2∗ (61a)
Vi∗ = Mi di∗ (69a)
and ∗
∂ Vi ∂ ln di∗
− Z 2∗ = − ∞
+ Z 2∞ − Z 2∗ (61b) = αi∗ Vi∗ = −Vi∗ (69b)
Z Z Z ∂T p ∂T p
In Eqs. (61a) and (61b) the quantity ∗ ∗
∂ Vi ∂ ln di
= −βi∗ Vi∗ = −Vi∗ (69c)
Z 2∞ = Z 2∞ − Z 2∗ (62) ∂p T ∂p T
P1: FYD Revised Pages
Encyclopedia of Physical Science and Technology EN002-101 May 17, 2001 20:51
Molar heat capacities C ∗pi are known from calorimetric [H+ (aq)] = 0, thus avoiding the ambiguity resulting from
measurements; their temperature and pressure dependence electrolyte dissociation.
is tabulated for practical use.
Molar entropies Si∗ are tabulated at the standard state
◦ ◦
as standard entropies S298 (Yi ), S298 (Yi ) = Si∗ (1 atm, IV. FUGACITIES AND ACTIVITIES
298.15 K). They are defined by the relation (◦ , standard)
298.15
◦ A. Chemical Potential of Pure Gases
S298 (Yi ) = c∗◦
pi d(ln T ) (70)
0 On the basis of Eq. (68), the chemical potential of an
Molar enthalpies and Gibbs energies cannot be determined ideal pure gas is obtained by integration of Eq. (73b) after
by experiments (third law of thermodynamics); only dif- replacing Vi∗ with the help of the equation of state of an
ferences with respect to some reference state can be de- ideal gas, Vi∗id = RT / p:
fined. The concept of formation reactions is frequently
used to replace molar enthalpies Hi∗ and Gibbs energies µi∗id ( p, T ) = µi∗id ( p + , T ) + RT ln( p/ p + ). (74)
G i∗ in thermodynamic balances by the standard enthalpies In Eq. (74) p + is an arbitrary reference pressure.
and standard Gibbs energies of formation, f H ◦ [Yi ] and The chemical potential of a real gas is obtained when
f G ◦ [Yi ], respectively. using an appropriate equation of state of a real gas. Vi∗real ,
The formation reaction at standard conditions (1 atm, such as the van der Waals, the Redlich–Kwong, or the
298.15 K) of a pure chemical compound Yi (e.g., CH3 OH) virial equation:
requires the formation of Yi from chemical elements in p
their most stable state at standard conditions; here, RT
µi∗real ( p, T ) = µi∗id ( p, T ) + Vi∗real − d p (75)
C(graphite) + 2H2 (g) + 0.5 O2 (g) = CH3 OH(1) (71) p+ p
In Eq. (75) the reference pressure p + is chosen so low
According to this definition, the formation of an ele-
( p + → 0) that the reference potentials µi∗ ( p + , T ) of ideal
ment in its most stable state at standard conditions equals
◦ ◦ and real gases are equal. The integral expression in Eq. (75)
zero. Hence, f H298 [H2 (g)] = 0: f H298 [O2 (g)] = 0;
◦ ◦ is related to the so-called fugacity coefficient φi∗ by the
f H298 [C(graphite)] = 0; but f H298 [C(diamond)] =
help of the relation
1.90 kJ mol−1 , this quantity being the heat of phase transi- p
tion, C(graphite) → C(diamond), at standard conditions. RT
RT ln φi∗ = Vi∗real − d p, (76)
Standard heats of formation and phase transition are tabu- 0 p
◦
lated as values f H298 (Yi ). They can be determined with permitting Eq. (75) to be written in the form
high precision by combustion of compounds Yi in special
calorimeters. Tabulated standard Gibbs energies of for- µi∗real ( p, T ) = µi◦ (T ) + RT ln pφi∗ p ◦ . (77)
mation f G ◦298 (Yi ) are based on the same concept of a Fugacity coefficients of numerous pure gases are tabulated
formation, reaction at standard conditions. from low (φi∗ = 1) to high pressures for practical use.
The temperature and pressure dependence of Hi∗ and In Eq. (77) p ◦ is the standard pressure, p ◦ = 1 atm;
∗
G i are given by the relations quantity pφi∗ is called the fugacity of the pure gas Yi .
∗
∂ Hi ∂ T p = C ∗pi Comparison of Eqs. (74) and (77) shows that ideal and
∗ (72) real gases can be treated as formally equal when pressures
∂ Hi ∂ p T = Vi∗ 1 − αi∗ T are replaced by fugacities. Using an appropriate equation
of state of a real gas, for example, the virial equation,
∂G i∗ ∂ T = −Si∗ (73a)
pVi∗ = RT + Bi∗ (T ) p + · · ·
p
∗ (78)
∂G i ∂ p = Vi∗ (73b)
T
the fugacity coefficient φi∗ can be approximated with the
Quantities Z i∞
at standard conditons as used in solution help of the relation
chemistry are also tubulated. Standard heats of formation
◦
of ions in aqueous solutions, f H298 [Yi (aq)], include the φi∗ = exp Bi∗ p RT (79)
heat of formation of the pure compound Yi under standard
conditions and the heat of transfer of pure compound
B. Chemical Potential in Gas Mixtures
Yi from its pure state to infinite dilution in solvent S,
that is, the quantity Hi∞ − Hi∗ . The tables of single-ion In a gas mixture of ideal gases Y1 , . . . , Yk , at pressure p
quantities in aqueous solutions are based on the additional and temperature T , component Yi is at partial pressure pi
assumption that f H ◦ [H+ (aq)] = f G ◦ [H+ (aq)] = S◦ (Dalton’s law) and Eq. (74) can be appropriately used to
P1: FYD Revised Pages
Encyclopedia of Physical Science and Technology EN002-101 May 17, 2001 20:51
yield the chemical potential µi ( p, T ) of Yi in this mixture, The superscript “real” is omitted in Eq. (87) because
since µi ( p, T ) = µi∗ ( pi , T ): this equation is also valid for ideal mixtures where
f i = 1 (φi = φi∗ ). The activity coefficient of a pure phase
µiid ( p, T ) = µi∗id ( p, T ) + RT ln( pi / p)
(xi = 1, φi = φi∗ ) equals unity, in contrast to its fugacity
coefficient [φi∗ ; see Eq. (76)].
= µi◦ (T ) + RT ln pi p ◦ (80)
Equation (80) exhibits the dependence of µi on the phase
composition ( pi = xi p, where xi is the mole fraction): C. Chemical Potentials in Condensed Phases
µiid ( p, T ) = µi∗id ( p, T ) + RT ln xi (81) The knowledge of equations of state for gas phases permits
the calculation of activity coefficients via fugacity coef-
Equations of state of real gas mixtures, for example, the
ficients. Equations of state for general practical use such
virial equation of state of a binary gas mixture,
as the virial equation (and others) are not known for con-
p V̄ = RT + B̄ p + · · · (82a) densed phases (liquids and solids). However, as shown
by Planck and Schottky, the passage from the gaseous
B̄ = x12 B1∗ + x22 B2∗ + 2x1 x2 B12 (82b) to the liquid or solid state does not change the structure
permit the calculation of fugacities φi of components Yi of Eq. (87) and leads to the general formulation for the
in the real mixture chemical potentials,
p
RT µi ( p, T ) = µi∗ ( p, T ) + RT ln xi + RT ln f i (88a)
RT ln φi = Vireal − dp (83)
0 p
µi∗ ( p, T ) = lim [µi ( p, T ) − RT ln xi ] (88b)
with the help of Eq. (56), which yields the partial molar xi →1
quantities needed for estimating the integral expression lim f i = 1 (88c)
in Eq. (83). In Eq. (82b) B1∗ and B2∗ are the second virial xi →1
coefficients of the pure gases Y1 and Y2 ; the cross term which can be used in gaseous, liquid, and solid phases.
B12 , resulting from the interaction of unequal particles in For solutions, the chemical potentials of the solutes
the gas phase, can be obtained from statistical models. A (Y2 , . . . , Yk ), when referred to infinite dilution instead
rough approximation is due to Lewis: B12 = (B1∗ + B2∗ )/2. of pure phase, can be written
The chemical potential of component Yi in a real gas
mixture is related to that in the ideal mixture by the relation µi ( p, T ) = µi∞ ( p, T ) + RT ln xi
∗
µireal ( p, T ) = µiid ( p, T ) + RT ln φi (84) µ − µi∞
+ RT ln f i exp i (89)
Combination of Eqs. (75), (76), (81), and (84) yields the RT
appropriate expression of the chemical potential µi in real entailing the definition of an activity coefficient:
gas mixtures:
f 0i = f i exp µi∗ − µi∞ RT . (90)
µireal ( p, T ) = µi∗real ( p, T ) + RT ln xi + RT ln φi φi∗
(85) The superscript (∗ ) in Eqs. (89) and (90) indicates that the
solute Yi and the solution are considered in equal states
for the dependence of the chemical potential on the com- of aggregation, which is not necessarily the stable state of
position and molecular interactions. Equation (85) relates the pure compound Yi with chemical potential µi∗ .
the chemical potential µireal ( p, T ) of component Yi in the The general definition of the chemical potential of a
gas mixture to that of its pure state, µi∗real ( p, T ); the quo- solute Yi in solution (Y1 is the solvent S) is based on the
tient of the fugacities φi and φi∗ is called the activity activity coefficient of the type given by Eq. (90).
coefficient f i ; the product of activity coefficient f i and
mole fraction xi is the activity ai of component Yi in the µi ( p, T ) = µi∞ ( p, T ) + RT ln xi
mixture:
+ RT ln f 0i (i = 2, . . . , k) (91a)
f i = φi φi∗ ; ai = xi f i (86)
µi∞ ( p, T ) = lim [µi ( p, T ) − RT ln xi ] (91b)
xs →1
Combining Eqs. (85) and (86) yields the generally used
expression lim f 0i = 1 (91c)
xs →1
µi ( p, T ) = µi∗ ( p, T ) + RT ln ai
In solutions, Eqs. (88a)–(88c) are used for the solvent and
= µi∗ ( p, T ) + RT ln xi + RT ln f i (87) Eqs. (91a)–(91c) for the solutes. Identity within restricted
P1: FYD Revised Pages
Encyclopedia of Physical Science and Technology EN002-101 May 17, 2001 20:51
concentration ranges for Eqs. (88a)–(88c) if f i = 1 and for Equation (96b) must be integrated from infinite dilution
Eqs. (91a)–(91c) if f 0i = 1: of component Y2 (x2 = 0, f 02 = 1) to the arbitrary com-
position x2 to yield
µiid ( p , T ) = µi ( p , T ) + RT ln xi . (92) x2
According to IUPAC rules the superscript indicates an (ln f 02 )x =x2 = d ln f 02
x2 =0
arbitrary reference state, here ∗ or ∞. Quantities x1 =1−x2
x1
µi ( p , T ) − µiid ( p , T ) = µiE ( p , T ) = RT ln f i (93a) =− d ln f 1 (98)
x1 =1 1 − x1
µi ( p , T ) − µiid ( p , T ) = µiE ( p , T ) = RT ln f 0i (93b) If the activity coefficient of component Y1 is known as
are referred to as excess chemical potentials. a function of composition, the integrals in Eqs. (97) and
According to the Gibbs–Duhem equation the chemical (98) can be evaluated graphically or analytically to yield
potentials and hence the activity coefficients of a mixture the activity coefficients f 2 and f 02 , respectively.
or solution are not independent; for a binary system at
constant pressure and temperature we have D. Conversion of Reference Potentials
and Activity Coefficients
x 1 d µ1 + x 2 d µ2 = 0 (94)
The preceding considerations are based on the use of the
In Eq. (94) the chemical potentials can be expressed mole fraction scale. Chemists and chemical engineers who
with the help of Eqs. (88a)–(88c) for mixtures and with use other scales for the composition of mixtures and so-
Eqs. (91a)–(91c) solutes in solutions. Since lutions, for example, weight percent and mole percent for
d µi∗ = 0, d µi∞ = 0, mixtures or molality, molonity, and molarity for solutions
and (see Section I.A) must convert chemical potentials and
activity coefficients to these scales. Conversion is based
x1 d ln x1 + x2 d ln x2 = 0 (95)
on the fact that changes in composition scales do not
the following equations are obtained for mixtures and so- change the chemical potential, for example, conversion
lutions, respectively: from the mole fraction scale (µi∞ , f 0i ) to the molality scale
x1 (µi∞(m) , γi ):
d ln f 2 = − d ln f 1 (96a)
1 − x1 µi∞ + RT ln xi + RT ln f 0i
and = µi∞(m) +RT ln m i + RT ln γi (99)
x1
d ln f 02 = − d ln f 1 (96b)
1 − x1 Using Eq. (91c) and the conversion formula from xi to m i
Integration of Eq. (96a) from the pure component Y2 (x2 = (Table I) entails
1, f 2 = 1) to an arbitrary composition x2 yields the activity
µi∞(m) = µi∞ + RT ln Ms ; γi = xs f 0i (100)
coefficient at this composition:
x2
Yi (i = 2, . . .) being the solutes and S the solvent. For
(ln f 2 )x =x2 = d ln f 2
x2 =1 frequently needed conversion formulas of mole fractions
x1 =1−x2
x, molarities c, molalities m, and molonities m̃ and corre-
x1 sponding activity coefficients v, y, γ , and β, see Tables V
=− d ln f 1 (97)
x1 =0 1 − x1 and VI.
To µ∞
2 µ∞(c)
2 µ∞(m)
2 µ2∞(m̃)
To f 02 y2 γ2 β2
d + (M1 − M2 )c2
f 02 = f 02 y2 (1 + M1 m 2 )γ2 [1 + (M1 − M2 )m̃ 2 ]β2
d1
d1 (M1 x1 + M2 x2 ) (1 + M2 m 2 )d1 d1
y2 = f 02 y2 γ2 β2
d M1 d d
d − M 2 c2
γ2 = x1 f 02 y2 γ2 (1 − M2 m̃ 2 )β2
d1
M1 x 1 + M2 x 2 d
β2 = f 02 y2 (1 + M2 m 2 )γ2 β2
m1 d1
a Subscript 1 denotes solvent; subscript 2 denotes solute; d denotes density of pure solvent; d denotes density of
1
solution; M1 and M2 denote molar masses.
a chemical compound Y in two immiscible solvents in Mathematics provide a general criterion for stability.
contact, S1 and S2 , n (1) (2)
Y and n Y moles of Y being the The quadratic form δ 2 U is positive for all considerable
amounts of Y dissolved in S1 and S2 , respectively. Then variations of the parameters ξi if, and only if, all the roots
the internal variable ζ is represented as the ratio of the λ of the equation
mole numbers, ζ = n (1) (2)
Y /n Y . If the state of equilibrium
is attainable from both sides, ζ > ζ eq and ζ < ζ eq , the U11 − λ U12 ... U1,k+2
tangent at ζ eq is horizontal, (∂ S/∂ζ )eq = 0, and δS equals U21 U22 − λ . . . U2,k+2
.
zero, δS = (∂ S/∂ζ )eq δζ = 0. If, however, compound Y is . .. = 0 (107)
. .
insoluble in one of the two solvents, the tangent at ζ eq is
Uk+2,1 ... Uk+2,k+2 − λ
∼
defined only from one side by the limit of (∂ S/∂ζ ), which
is a negative quantity, and δS ≤ 0 (Gibbs’ equilibrium con- are greater than zero (i.e., if the quadratic form is positive
dition). definite). Variations of order higher than 2 generally must
Gibbs criteria of equilibrium, of course, are in agree- not be considered.
ment with the second law of thermodynamics, which gives
evidence of the variation of entropy in spontaneous pro-
D. Equilibria at Phase Boundaries
cesses (entropy increase) but gives no explicit evidence on
the state of equilibrium itself. A heterogeneous system made up of two phases, α and β,
and k components Yi is in equilibrium [Eq. (103b)] if
C. Stability of Equilibrium k
(α) (α)
− p δV + T δS +
(α) (α) (α) (α)
µi δn i
Eqs. (103a) and (103b) do not provide information on the i=1
stability of equilibrium. Stability of equilibrium is rec-
k
(β) (β)
ognized only when taking into account the variations of + − p δV (β) (β)
+T (β)
δS (β)
+ µi δn i ≥0
degrees greater than unity, δ 2 S, δ 3 S, . . . or δ 2 U , δ 3 U , . . . i=1
in the total variations, (108)
S = δS + δ S + δ S + · · ·
2 3
(104a) All variations in Eq. (108) are under the constraints
U = δU + δ U + δ U + · · ·
2 3
(104b) δS = 0; δV = 0; δn i = 0; i = 1, . . . , k (109)
where yielding the relations
∂U
δU = dξi (105a) δS (α) = −δS (β) ; δV (α) = −δV (β) ;
i
∂ξi (β)
(110)
δn i(α) = −δn i ; i = 1, . . . , k
1
δ2U = Ui j dξi dξ j (105b) which permit us to write Eq. (108) in the form
2 i j
− p (α) − p (β) δV (α) + T (α) − T (β) δS (α)
∂ ∂U
Ui j = (105c)
k
∂ξ j ∂ξi ξ j =ξi + µi(α) − µi
(β)
δn i(α) ≥ 0 (111)
ξi =ξ j
i=1
and so on. The equilibrium position is a position of stable
Equation (111) is the basic equation for the discussion
equilibrium if
of equilibrium conditions at phase boundaries. For this
δ2 S < 0 or δ2U > 0 (106a) purpose, three sets of variations are studied:
α = 1, 2, . . . , v. (116)
In Eq. (116) Vi(α) and Si(α) are the partial molar volume and
entropy of component Yi in phase α. The set of v equations FIGURE 5 Stable (solid lines) and metastable (dashed lines)
(116) is the basis for the theory of phase diagrams. phase diagrams of sulfur. For explanation see text.
P1: FYD Revised Pages
Encyclopedia of Physical Science and Technology EN002-101 May 17, 2001 20:51
internal variable ζ of chemical reactions is given by the be taken into account with the help of the corresponding
relation transfer quantities, H (α→β) and V (α→β) , respectively.
dζ = dn i /ωi (123)
yielding the expression 3. Heat Balance of Chemical Reactions
the terminal phases are materially identical (here copper) used for the determination of activity coefficients from
they differ only in their Galvani potentials (α) . vapor-pressure measurements:
(1)
µ̃(1)
e − µ̃e = F
(v)
− (v) = −F E. (135) f i(1) = pi xi(1) pi∗ (140)
In Eq. (135) E is the measured potential difference of the Starting with the equilibrium condition for a solution and
cell at zero current, the electromotive force of the cell. its gaseous phase,
From Eqs. (134) and (135) follows the wellknown ◦(g)
Nernst equation: d µi∞(1) + RT ln ai(1) = d µi + RT ln pi φi (141)
v−1 k
(α) ωi and equivalent approximations yield the relation
◦ RT
E=E − ln ai (136a)
nF α=1 i=1
f 0i(1) = pi xi(1) K (142)
v−1
k where K is Henry’s constant.
−n F E ◦ = ωi vi(α) (136b) An ideal mixture ( f i = 1) or dilution ( f 0i = 1) reduces
α=1 i=1 Eqs. (141) and (142) to
pi = xi(1) pi∗ and pi = xi(1) K (143)
H. Use of Equilibria for the Determination
of Activity Coefficients known as Raoult’s and Henry’s laws, respectively.
Figure 7 illustrates the calculation of f i and f 0i according
Fugacity coefficients and hence activity coefficients can
to Eqs. (140) and (142) as the deviations from ideality.
be calculated with the help of appropriate equations of
state (see Section IV). This is possible, however, only for
the gas phase (van der Waals equation, Redlich–Kwong 2. Liquid–Liquid Equilibria
equation, virial equation); for condensed phases no use- In osmotic pressure measurements a solution is separated
ful general equations of state are available. Experimental from the pure solvent S by a nondeformable membrane
determination of activity coefficients in condensed phases permeable only to the solvent. The pressure of the pure
is based on the study of equilibria. There are numerous
methods, but only typical examples will be given.
1. Vapor–Liquid Equilibria
Equilibrium of a gaseous and a liquid phase in contact
(permeable, deformable, and heat-conducting boundary)
entails the conditions
◦(g)
d µi∗(1) + RT ln ai(1) = d µi + RT ln pi φi (137)
which at constant temperature can be transformed with
the help of Eq. (67b), showing the pressure dependence
of the chemical potentials, to yield the relation
xi(1) f i(1)
RT d ln = −Vi∗(1) d p (138)
pi φi∗
solvent is p, while in osmotic equilibrium the solution is account, and the integral of Eq. (151) can be solved ana-
subject to an additional pressure , the osmotic pressure, lytically to yield the activity coefficients f s from freezing
yielding the equilibrium condition point depressions.
µs ( p + , T ) = µ∗s ( p, T ) (144) Another method using liquid–solid equilibria deter-
mines solute activity coefficients from temperature-
Using the pressure dependence of the chemical potential dependent solubility data. The pure solute Yi is in equi-
and assuming that the partial molar volume is independent librium with the saturated solution. With reference to the
of pressure yields the relation state of the infinitely dilute solution [Eqs. (91a)–(91c)],
p+
the equilibrium condition is given by the relation
µs ( p + , T ) = µs ( p, T ) + Vs d p
p
µi∗(s) ( p, T ) = µi∞ ( p, T ) + RT ln(xi f 0i )sat (152)
= µs ( p, T ) + Vs (145)
The differential of Eq. (152) can be expressed with the help
By combining Eqs. (144) and (145), one can determine of the temperature dependence of the chemical potentials,
the activity coefficient of the solvent from the osmotic yielding at constant pressure
pressure
xs f s = exp(−Vs /RT ) (146) Hi∞ − Hi∗(s) Hi∞
d ln(xi f 0i )sat = 2
dT = dT (153)
The solvent activity xs f s can be expressed with the help RT RT 2
of the molal osmotic coefficient which is defined as In Eq. (153) the quantity Hi∞ is the change in enthalpy
ln xs f s for the transfer of component Yi from the pure solid state to
=− (147)
m Ms the infinitely dilute solution. This quantity can be obtained
where m is the molality of the solute and Ms is the molar from heat of solution experiments.
mass of the solvent. Using Eq. (146) yields the follow- If the temperature-dependent solubility measurements
ing relationship between the osmotic coefficient and the are started at a temperature T where the component Yi
osmotic pressure: is very slightly soluble (xi ) and the activity coefficient f 0i
Vs approaches unity, Eq. (153) can be integrated to yield
= (148)
m Ms RT
xi T
Hi∞
The measurement of osmotic coefficients combined ( f 0i )sat,T = exp 2
dT (154)
with the Gibbs–Duhem–Margules equation is a well- xi sat T RT
established method for the determination of the activity
coefficients of solutes.
4. Electrochemical System Equilibria
3. Liquid–Solid Equilibria Electromotive force (emf) measurements are frequently
In freezing point experiments an equilibrium between liq- used to determine activity coefficients of electrolyte so-
uid solution and its pure solid solvent S is achieved. The lutions. Equation (136a) relates the emf to the activi-
equilibrium condition at constant pressure is ties of the reacting cell components. From concentration-
dependent measurements the standard potential E ◦ of the
µ∗(s) ( p, T ) µ ∗(1)
( p, T )
d s
=d s
+ d(R ln xs f s ) cell reaction and the activity coefficients can be obtained.
T T As an example, according to Eq. (136a), the emf of the
(149) Galvanic cell
The temperature dependence of the chemical potential
Pt(s) | K(Hg) | KCl (conc. in S) | AgCl(s) | Ag(s) | Pt(s)
[Eq. (67c)] can be used in Eq. (149) to yield
(155)
Hs∗(1) − Hs∗(s) fus Hs
d(ln xs f s ) dT = dT (150) can be written as
RT 2 RT 2
where fus Hs is the molar enthalpy of fusion of the pure 2RT
E = E ◦ − ln(cy± ) (156a)
solvent at temperature T . Equation (150) is integrated F
from xs = 1, where T = T ∗ (freezing temperature of pure RT
E ◦ = E ◦ + ln aK(Hg) (156b)
solvent) to an arbitrary concentration xs with freezing tem- F
perature T to yield Here, y± is the mean activity coefficient of the electrolyte
T on the molar scale.
fus Hs
ln xs f s = dT (151) In Eq. (155) K(Hg) is a potassium amalgam electrode
T∗ RT 2 connected to a solution of KCl at concentration c in
Using the appropriate molar heat capacities, the tempera- solvent S; AgCl(s)/Ag(s) is the silver/silver chloride ref-
ture dependence of the enthalpy of fusion can be taken into erence electrode.
P1: FYD Revised Pages
Encyclopedia of Physical Science and Technology EN002-101 May 17, 2001 20:51
Computational Chemistry
Matthias Hofmann Henry F. Schaefer III
Ruprecht-Karls-Universität Center for Computational Quantum Chemistry
487
P1: GNB/MAG P2: FQP Final
Encyclopedia of Physical Science and Technology EN003G-129 June 13, 2001 22:42
Of course, the strict separation of theoretical, compu- phisticated enough to be useful for applications in chem-
tational, and experimental chemistry is of an academic istry. Molecular mechanics emerged in the mid-1960s
nature. In practice, theoreticians often not only develop a and has become more sophisticated and more useful with
new method but also need to design more efficient algo- time.
rithms to make the method applicable. Before computa- Due to the number of various approximations, early
tional results can be interpreted, computational chemists computations performed to try to reproduce experimen-
need to undertake benchmark studies to determine the lim- tal findings yielded varying degrees of success. Compu-
itations of a method. Without the knowledge about the tational chemistry could become a recognized scientific
accuracy of the applied mathematical model, any compu- discipline only after a real predictive power was estab-
tational study is without scientific significance. Likewise, lished. Perhaps the first case in which theory proved to be
many experimentalists use computer programs to support accurate enough to challenge experiment was the structure
or complement their experimental studies. determination of methylene (CH2 ). From a spectroscopic
investigation the ground state of this molecule was first
concluded to be linear. However, this interpretation had to
I. HISTORY OF COMPUTATIONAL be revised after reliable computations predicted a signifi-
CHEMISTRY cantly bent structure in 1970.
The development of different methods and their effi-
As early as 1929, only 3 years after Schrödinger’s formu- cient implementation is only one reason for the success
lation of the fundamental equation that bears his name, of computational chemistry. Another factor is the dra-
Dirac stated correctly, matic development of computer technology (i.e., com-
putational speed as well as the amount of core memory
The underlying physical laws necessary for the mathematical and of disk storage). Today’s personal desktop computers
theory of a large part of physics and the whole of chemistry are provide many times the computer power of early “super-
thus completely known, and the difficulty is only that the exact computers” at a fraction of the price. The combined de-
application of these laws leads to equations much too compli- velopment of both software and hardware allowed com-
cated to be soluble. putational chemistry to become for chemical research an
indispensable tool which allows one to plan experiments
Hence, the further development of quantum chemistry more carefully and hence to optimize the use of labora-
was aiming at approximate solutions of the Schrödinger tory resources. The importance of computational chem-
equation by simplifying the required mathematical istry was honored when the 1998 Nobel Prize in Chem-
treatment. istry was awarded to two pioneers of the field, J. A. Pople
In the 1930s the basics for a wide range of computa- and W. Kohn.
tional methods based on quantum mechanics were laid
by the development of the Hartree-Fock method. In 1951
Roothaan, for the first time, considered molecular or-
bitals as a linear combination of analytic atomic one- II. METHODS USED IN
electron functions, shifting the mathematical task from COMPUTATIONAL CHEMISTRY
the numerical solution of coupled differential equations
to the evaluation of integrals over basis functions. Intro- The methods used in computational chemistry can be clas-
duction of approximations for the most difficult integrals sified according to the sophistication of the underlying
through the use of suitable parameters led to the develop- model (Fig. 1). Molecular mechanics methods are based
ment of semiempirical methods beginning in the 1950s. on classical mechanics and are computationally the fastest.
The more rigorous ab initio methods benefited from the Semiempirical methods are based on a wave function de-
use of Gaussian- instead of Slater-type basis functions, scription, in which some integrals are approximated by
as pointed out by Boys in 1950 but generally accepted means of parameters and many others are neglected to re-
only two decades later. Configuration interaction (CI) was duce the computational cost. Ab initio methods use only
the first theoretical level used to include electron corre- fundamental physical constants but no further experimen-
lation and was widely applied during the 1970s. In the tal results; Hartree-Fock (H F) theory is the starting level,
late 1970s many-body perturbation theory (Møller-Plesset which can be improved upon by accounting for electron
methods), and during the 1980s coupled cluster meth- correlation in various ways. Density functional methods
ods, became more popular because they are more eco- are also quantum mechanical but are based on the electron
nomical and more rapidly convergent, respectively, than density to describe chemical systems. They are often con-
CI. The 1990s can be considered the decade of density sidered “ab initio” although some empirical parameters
functional theory, which by that time had become so- enter the energy functionals.
P1: GNB/MAG P2: FQP Final
Encyclopedia of Physical Science and Technology EN003G-129 June 13, 2001 22:42
FIGURE 1 Classification of computational chemistry methods. AMBER, assisted model building with energy refine-
ment; CHARMM, chemistry at Harvard molecular mechanics; GROMOS, Groningen molecular simulation; CNDO,
complete neglect of differential overlap; INDO, intermediate neglect of differential overlap; NDDO, neglect of diatomic
differential overlap; MNDO, modified neglect of differential overlap; AM1, Austin model 1; PM3, parametric method
number 3; HF, Hartree-Fock; MP2, Møller-Plesset, second order; MP3, Møller-Plesset, third order; MP4, Møller-
Plesset, fourth order; CISD, configuration interaction singly and doubly excited; CISD(T), configuration interaction
singly, doubly, and triply (estimated) excited; CCSD, coupled cluster singly and doubly excited; CCSD(T), coupled
cluster singly, doubly, and triply (estimated) excited; LDA, local density approximation; GGA, generalized gradient
approximation; BLYP, Becke/Lee, Yang, and Parr; B3LYP, Becke three-parameter/Lee, Yang, and Parr.
A. Force Field Methods frared (IR) spectroscopy associated with a certain bond
stretch or angle deformation also fall in narrow ranges,
A force field (FF) is a set of equations describing the po-
which demonstrates that the variation of force constants
tential energy surface of a chemical system. A molecular
is also relatively small. The existence of an increment
mechanics (MM) method uses a force field based on a
system for heats of formation, for example, shows that
classical mechanical representation of molecular forces
the energy behaves additively as well. Hence, in MM the
to calculate static properties of a molecule (e.g., structure
energy is expressed classically as a function of geometric
and energy of an energy minimum structure). Molecular
parameters.
dynamics (MD) also implements a force field but gener-
ates dynamic properties (e.g., evolution of an structure
1. Energy Terms
in time) by calculating forces and velocities of atoms.
In MM methods atoms are treated as “balls” of different Advanced force fields distinguish several atom types for
masses and sizes, and bonds are “springs” connecting the each element (depending on hybridization and neighbor-
balls without an explicit treatment of electrons. The main ing atoms) and introduce various energy contributions to
advantage of this simple classical approach is the small the total force field energy, E FF :
computational cost, which allows one to treat very large
E FF = E str + E bend + E tors + E vdW + E elst + · · · ,
molecules. FFs are typically constructed to yield experi-
mentally accurate structures and relative energies. Some where E str and E bend are energy terms due to bond stretch-
FFs are generated to accurately compute other properties ing and angle bending, respectively; E tors depends on tor-
such as vibrational spectra. sional angles describing rotation about bonds; and E vdW
The observation that properties of chemical functional and E elst describe (nonbonded) van der Waals and elec-
groups are normally transferable from one compound to trostatic interactions, respectively (Fig. 2). In addition to
another validates the MM approach. The most basic com- these basic terms common to all empirical force fields
ponent in a FF is the atom type and one element usually there may be extra terms to improve the performance
contributes several atom types. Each bond is character- for specific tasks. Each term is a function of the nuclear
ized by the atom types involved and has a “natural” bond coordinates and a number of parameters. Once the pa-
length since the variation with the chemical environment rameters have been defined, the total energy, E FF , can be
is relatively small. Similarly, bond angles between atom computed and subsequently minimized with respect to the
types have typical values. The energy absorptions in in- coordinates.
P1: GNB/MAG P2: FQP Final
Encyclopedia of Physical Science and Technology EN003G-129 June 13, 2001 22:42
D(1 − e k/2D(R 0 −R ) 2
) .
with
θ = θ0ABC − θ ABC .
If higher accuracy is desired (e.g., for computing IR fre-
quencies), a third-order term can be included with an an-
harmonicity constant set to be a fraction of k ABC .
FIGURE 2 Most basic energy terms included in empirical force
field (FF) methods. c. Torsion energy. The torsional potential, due to the
rotation of bonds A–B and C–D about bond B–C, is peri-
a. Stretch energy. The harmonic approximation odic in the torsional angle ω, which is defined as the angle
gives the stretch energy of a bond between atom types between the projections of A–B and C–D onto a plane
AB perpendicular to B–C. The torsional energy therefore is
A and B, E str , as
expressed as a Fourier series:
AB
E str (R) = k AB R 2 ,
ABCD
E tors (ω) = Vn cos(nω),
where k AB is the force constant and R = R0AB − R AB is n
the bond length deviation from the natural value, R0AB , for
AB
which E str is defined to be zero. Further improvement can which allows the representation of potentials with various
be achieved by including higher anharmonic terms to the minima and maxima (Fig. 4). Three terms are enough to
equation. While these expressions describe the potential model the most common torsional potentials.
well for R close to R0 , the energy goes to infinity for large
distances (Fig. 3). In contrast, a morse potential allows
the energy to approach the dissociation energy, D, as R
increases:
√
E Morse (R) = D(1 − e k/2DR 2
) ,
but it is much more expensive in terms of computational
cost.
d. van der Waals energy. The van der Waals term, needs to be considered out to very long distances. Fast
E vdW , covers nonelectrostatic interactions between non- multipole methods (FMMs) can be applied to reduce the
bonded atoms. The van der Waals energy is positive (re- computational cost of evaluating E elst .
pulsive) and very large at short distances, zero at large
distances, but slightly negative (attractive) at moderate f. Other energy contributions. So that the perfor-
distances due to temporarily induced multipole attraction mance can be improved, force fields include further pa-
(dispersion force), the most important attractive contribu- rameters to take care of special cases. For example, cross
tion of which (dipole–dipole interaction) has an R −6 de- terms account for the interplay between different contri-
pendence. The Lennard-Jones potential for E vdW includes butions (e.g., longer bonds for small angles). Correction
a repulsive term, which is set proportional to (R −6 )2 to terms may be introduced to describe substituent effects
grow faster than R −6 : (e.g., anomeric effect). Additional terms may be intro-
12 6 duced to adequately treat special cases like pyramidaliza-
AB
AB R0 R0AB tion of sp2 hybridized atoms. Hydrogen bonding may be
E LJ (R) = ε −2 .
R R treated explicitly (in addition to the electrostatic interac-
tion) with a special set of van der Waals interaction pa-
εAB determines the energy depth of the minimum. Alter- rameters. Pseudo atoms maybe introduced to model lone
natively, a Buckingham or Hill potential can be used that pairs. In addition, atoms in unusual bonding situations
employs an exponential function for the repulsive term. (three-membered rings, molecules with linearly conju-
For each atom type a van der Waals radius, R0 , and the gated π -systems, aromatic compounds, etc.), which are
atom softness, ε, have to be determined, from which the not described adequately by the normal parameters, can
diatomic parameters are calculated according to be defined as new atom types.
√ The force field energy, E FF , corresponds to the en-
R0AB = R0A + R0B and ε AB = ε A ε B .
ergy relative to a molecule with noninteracting fragments.
e. Electrostatic energy. The electrostatic energy, Therefore, only energies for molecular structures built
E elst , is due to the electrostatic interactions arising from from the same fragments (conformers) can be compared
polarized electron distributions based on electronegativity directly. So that energy between different molecules (iso-
differences. It can be modeled by Coulomb interactions of mers) can be compared, the energy scale is converted to
point charges associated with individual atoms: heats of formation by adding bond increments (estimated
from bond dissociation energies minus the heat of forma-
QA QB tions of the atoms involved) and possibly group increments
E elst (R AB ) = ,
εR AB (e.g., methyl group):
ε being a dielectric constant, which can be used to model
bonds
groups
the effect of the same or other molecules present (e.g., Hf = E FF + H AB + H G .
solvent). The atomic charges, Q, are commonly obtained
by fitting to the electrostatic potential as calculated by an
2. Parametrization
electronic structure method. An E elst description based on
dipole–dipole interactions between polarized bonds can Determining the parameters for a force field is a substan-
alternatively be employed. tial task. In general, not all necessary data are available
Hydrogen bonds are nonbonded interactions between from (accurate) experiments. Modern electronic structure
a positively charged hydrogen atom and an electronega- computations can provide unknown data relatively easily
tive atom with lone electron pairs (mostly oxygen or ni- and with sufficient accuracy. Another problem is the large
trogen) and can be adequately modeled by appropriately number of parameters: for a force field with N atom types,
chosen atomic charges. Although a single hydrogen bond the number of different types of bonds, bond angles, and
is a very weak interaction, the large number occurring in dihedral angles scales as N 2 , N 3 , and N 4 , respectively,
biomolecules (e.g., proteins) makes hydrogen bonding a each requiring several parameters. So that the number of
very important factor. parameters can be reduced, the atom dependency can be
In the large size limit, the bonded interactions increase reduced (e.g., the torsional parameters may be treated as
linearly with the system size, but the nonbonded interac- dependent on the B–C central bond only and not on the
tions show a quadratic dependence and determine the com- atom types A and D). The parametrization effort can be re-
putational cost. The van der Waals interactions quickly duced further by defining “generic” parameters to be used
fall off with the distance (R −6 dependence) and may be for less common bond types or when no reference data are
neglected for large separations. The electrostatic interac- available. This, of course, reduces the quality of a calcula-
tion (proportional to R −1 ) is much more far reaching and tion. By deriving the di-, tri-, and tetra-atomic parameters
P1: GNB/MAG P2: FQP Final
Encyclopedia of Physical Science and Technology EN003G-129 June 13, 2001 22:42
from atomic data (atom radii, electronegativities, etc.), to reproduce experimental results and are specific to the
universal force fields (UFFs) allow one to include basi- chosen combination of QM and MM methods.
cally all elements. The performance, however, is relatively Alternatively, the total energy, Etot , may be extrapolated
poor. from QM and MM calculations on a small part and on
The kind of energy terms, their functional form, and the whole of a suitably partitioned system (IMOMM-
how carefully (number, quality, and kind of reference integrated molecular orbital, molecular mechanics
data) the parameters were derived determine the quality method)
of a force field. Accurate force fields exist for organic
Etot = EQM (small) + EMM (whole) − EMM (small)
molecules (e.g., MM2, MM3), but more approximate
force fields (e.g., with fixed bond distances) optimized
for computational speed rather than accuracy [e.g., B. Wave Function
AMBER (assisted model building with energy refine- Quantum-Mechanical Methods
ment), CHARMM (chemistry at Harvard molecular me-
The explicit treatment of electrons in atoms and molecules
chanics), GROMOS (Groningen molecular simulation)]
requires quantum mechanics, which invokes a wave func-
are the only practical choice for the treatment of large
tion, , to describe the system of electrons and nuclei. The
biomolecules. The type of molecular system to be studied
square of the wave function represents the probability of
determines the choice of the force field.
a particle’s being at a given position. The central goal be-
One limitation of force field methods is that they can
comes the solution of the (time-independent) Schrödinger
describe only well-known effects that have been observed
equation,
for a large number of molecules (this is necessary for
the parametrization). The predictive power of these meth- H =E ,
ods is limited to extrapolation or interpolation of known which relates the wave function, , to the energy, E, of
effects. the system. The Hamiltonian operator, H, consists of the
kinetic (T) and the potential energy (V) operators:
3. Quantum-Mechanical and H = T + V.
Molecular-Mechanical (QM/MM) Method
The fact that electrons instantly adjust to changes in nu-
Another limitation of MM is the inability to investigate clear positions due to the much greater masses of the nu-
reactions. While force field methods are capable of de- clei allows the motions of electrons and nuclei to be sepa-
scribing conformational changes, for which all bonds re- rated (Born-Oppenheimer approximation). The electronic
main intact along the reaction coordinate, they are by con- wave function depends on only the nuclear position, not
struction not capable of treating reactions in which bonds on the nuclear momenta. The electronic Hamiltonian, He ,
are broken and/or formed. The classical model is not de- in atomic units is given by
signed to describe the electronic rearrangement associ-
1 Elec.
ated with bond breaking and bond formation. Such prob- He = Te + Vne + Vee + Vnn = − ∇i2
lems are better treated by electronic structure methods 2 i
discussed below. For large systems, a combined quantum-
Elec.
Nucl. Za Elec.
Elec. 1
mechanical and molecular-mechanical (QM/MM) method + +
can be applied. In this approach the reactive part of a i
|Ra − ri | i j>i
|ri − r j |
the molecule to be studied is described by a quantum-
mechanical (semiempirical, ab initio, or DFT) method Nucl.
Nucl. Za Zb
+ ,
while the rest of the system is treated by a force a b>a
|Ra − Rb |
field. The problem with this approach is the “communica-
tion” between classical and quantum-mechanical poten- where r and R represent the electronic and nuclear coor-
tial (i.e., how the atoms close to the QM/MM border are dinates, respectively, and the Laplacian is defined as
treated). ∂2 ∂2 ∂2
The total energy, Etot , may be computed as follows: ∇i =
2
+ 2+ 2 .
∂ xi2 ∂ yi ∂z i
Etot = EQM + EMM + EQM/MM ,
The nucleus–nucleus repulsion, Vnn , is constant for a given
where the quantum-mechanical contribution, EQM , and the geometry, and the kinetic energy, Te , and the electron–
molecular-mechanical contribution, EMM , are defined by nucleus attraction, Vne , are easy to evaluate. The electron–
a QM method and a MM method, respectively. The cou- electron repulsion, Vee , however, depends on the distances
pling term, EQM/MM , includes parameters that can be fitted between electrons and is the reason why the Schrödinger
P1: GNB/MAG P2: FQP Final
Encyclopedia of Physical Science and Technology EN003G-129 June 13, 2001 22:42
equation cannot be solved exactly for systems with more For example, a Slater determinant for the ground state
than one electron. of the hydrogen molecule can be written as follow:
The energy can be computed as the expectation value
1 φg (1) φg (1)
of the Hamiltonian operator: (1, 2) = √
∗
2 φg (2) φg (2)
H dτ |H|
E= = , 1
∗ dτ | = √ [φg (1)φg (2) − φg (2)φg (1)] = −(2, 1),
2
where the common bra-ket notation is used. The varia-
where φg represents the bonding molecular orbital (MO),
tional principle states that any trial wave function will
the 1σg orbital.
give an energy equal to or higher than the exact value be-
The electronic Hamiltonian can be written as sums of
cause the real system will adopt the best possible wave
one-electron (hi ) and two-electron (gi j ) operator plus the
function (which corresponds to the exact energy). Thus,
constant nuclear–nuclear repulsion:
a trial wave function constructed in terms of a number
of parameters can be improved by minimizing the energy H= hi + gi j + Vnn
with respect to the parameters (MO coefficients). i i j>1
A meaningful trial wave function should approach zero with
as r goes to infinity; it should be normalized, that is, 1
Nucl.
Za
hi = − ∇i2 −
| =1 2 a |Ra − ri |
and
(meaning the probability that the system is located some-
where in space is one); and it should comply with the 1
gi j = .
Pauli principle. The latter states that two electrons must |ri − r j |
differ in at least one quantum number. Furthermore, the The Hartree-Fock energy expression becomes
wave function should be antisymmetric (i.e., it should
N
1 N N
change sign when two electrons are interchanged). This is E= φi |hi |φi + ( φi φ j |gi j |φi φ j
a characteristic property of electrons. Antisymmetry can i
2 i=1 j=1
be ensured by using Slater determinants with one-electron
N
functions (orbitals) φi in columns and electrons (1, 2, . . .) − φi φ j |gi j |φ j φi ) + Vnn = hi
in rows. i
1 N N
+ (Ji j − K i j ) + Vnn ,
1. Hartree-Fock Method 2 i=1 j=1
Hartree-Fock theory employs a single Slater determinant. or
In the restricted Hartree-Fock (RHF) method, one spatial
N
1 N N
function φi is multiplied by an α (representing spin up, E= φi |hi |φi + ( φ j |Ji |φ j
2 i=1 j=1
spin quantum number m s = + 12 ) or β (representing spin i
1M
M
M
M
+ Dαβ Dγ δ ( χα χγ |g|χβ χδ
2 α β γ δ
− χα χγ |g|χδ χβ ) + Vnn ,
which introduces the density matrix elements, Dγ δ , as
occ.MO
Dγ δ = c γ j cδ j .
j
to avoid the storage bottleneck and the slow input–output a. Complete neglect of differential overlap. The
operations. It is also possible to effectively screen for in- complete neglect of differential overlap (CNDO) approx-
tegrals which contribute only negligibly and thus can be imation is the most rigorous: only the one- and two-center
discarded. The use of symmetry, if present, also reduces Coulomb terms among the two-electron integrals survive:
the computational cost considerably.
µA νB | λC σD = δAC δBD δµλ δνσ µA νB | µA νB .
2. Semiempirical Methods µA νB | µA νB are independent of the orbital type (to guar-
The HF method represents a point of departure in elec- antee rotational invariance) and there are only two param-
tronic structure theory. One direction involves improve- eters, µA νA | µA νA = γAA and µA νB | µA νB = γAB ,
ment of the accuracy by including electron correlation for the two-electron integrals. The γAB depends only on
(see Section II.B.3.). Semiempirical methods, however, the nature of the atoms A and B and the distance between
try to provide moderate accuracy, but at much lower cost them and can be interpreted as the average electrostatic
than that of ab initio methods. Therefore, only valence repulsion of one electron at center A and one electron at
electrons are treated explicitly and core electrons are re- center B. The integral γAA is the average repulsion of two
placed by an effective core (covering nucleus plus core electrons at one atom.
electrons) and a minimal basis of orthogonal Slater-type The one-electron integrals are
orbitals (usually only s and p types) is chosen to describe
Nucl.
the valence electrons. µA |h|νA = −δµν µA |Va |µA .
The two-electron integrals require the main compu- a
tational effort in a HF calculation and their number is
The Pariser-Pople-Parr (PPP) method is a special case
significantly reduced in semiempirical methods by the
of CNDO, restricted to the treatment of π electrons.
zero differential overlap (ZDO) approximation. This basic
semiempirical assumption sets products of functions for
b. Intermediate neglect of differential overlap. In
one electron but located at different atoms equal to zero
the intermediate neglect of differential overlap (INDO)
(i.e. µA (1)νB (1) = 0, where µA and νB are two differ-
approximation the two-electron integrals are limited to
ent orbitals located on centers A and B, respectively). The
the Coulomb integrals. One-electron integrals involving
overlap matrix, S, is set equal to the unit matrix, Sµν = δµν ,
different orbitals of one center and Va operator from an-
and the two-electron integrals µν | λσ are zero, unless
other have to disappear to guarantee rotational invari-
µ = ν and λ = σ , that is,
ance. The one-electron integrals are the same as in the
µν | λσ = δµν δλσ µµ | λλ , CNDO approximation and the two-electron integrals are
given by
where δi j = 0 for i = j and δi j = 1 for i = j.
All three-and four-center two-electron integrals vanish µA νB | λC σD = δµA λC δνB σD µA νB | µA νB
automatically. One-electron integrals involving three cen- and parametrized as γAB and γAA .
ters are also set to zero. The remaining integrals are han- INDO is comparable to CNDO in computational cost
dled as parameters which partly compensate the errors but has the advantage that electronic states of different
introduced by the ZDO approximation. The parameters multiplicities can be distinguished.
are derived from experimental data on atoms or are fitted MINDO/3 (modified intermediate neglect of differen-
to reproduce experimental results for molecules. The var- tial overlap) was the first successful semiempirical method
ious semiempirical methods introduce different approxi- to give reasonable predictions of molecular properties. The
mations for the one- and two-electron parts of the Fock main improvement over earlier methods was the use of
matrix elements, molecular data rather than atomic data for the parametriza-
AO
AO tion. However, the number of parameters to be determined
Fµν = µ|h|ν + Dλσ ( µν | λσ − µλ | νσ ), in MINDO/3 increases with the square of the number of
λ σ atoms included because one parameter depends on the
type of bonded atoms.
with the one-electron operator
1 Z 1 c. Neglect of diatomic differential overlap. Many
h = − ∇2 − a
= − ∇2 − Va ,
a |Ra − r|
2 2 of the shortcomings of MINDO/3 are corrected in the ne-
a
glect of diatomic differential overlap (NDDO) approxi-
where Z a denotes the charge resulting from the nucleus mation, which includes no further approximations beyond
plus the core electrons. ZDO. Thus, all integrals involving any two orbitals on one
P1: GNB/MAG P2: FQP Final
Encyclopedia of Physical Science and Technology EN003G-129 June 13, 2001 22:42
center with any two orbitals on another center are kept, 3. Electron-Correlated Methods
which increases the number of integrals dramatically.
In the Hartree-Fock approach the real electron–electron
The one-electron integrals are
interaction is replaced by an interaction with an averaged
µA |h|νB = µA − 12 ∇ 2 − VA − VB νB field. This means HF suffers from an exaggeration of
electron–electron repulsion. The difference between the
and
energy obtained at the HF level and the exact (nonrela-
µA |h|νA = δµν µA − 12 ∇ 2 − VA µA − µA |Va |νA . tivistic) energy (for a given basis set) is defined as the cor-
a=A relation energy. The name reflects that this energy differ-
The two-electron integrals are given by ence is connected to the correlated movement of electrons
which is not considered in the HF method and which re-
µA νB | λC σD = δAC δBD µA νB | λA σB . duces the electron–electron repulsion. The HF description
typically allows electrons to be unrealistically resident in
d. Modified NDDO. The more successful semiem- the internuclear region. This leads to an underestimated
pirical methods, MNDO (modified neglect of differential nuclear–nuclear repulsion and to bond lengths that are too
overlap), AM1 (Austin model 1), and PM3 (parametric short. As a consequence, stretching force constants and
method number 3), are all based on NDDO but differ harmonic stretching frequencies computed at the HF level
in the treatment of core–core repulsion and how the pa- are too large. Likewise, the polarity of bonds is overesti-
rameters are assigned. There are only atomic parameters, mated (the more electronegative atom tolerates a higher
no diatomic parameters as in MINDO/3. The “modified” electron density in the HF picture) and computed dipole
NDDO methods calculate the overlap matrix, S, explicitly moments are often too large.
rather than using the unit matrix. Dynamic electron correlation, which is connected to the
The MNDO method tends to overestimate the repulsion correlated movement of electrons, can be distinguished
between atoms separated by approximately the sum of from static (near-degeneracy) electron correlation, which
their van der Waals radii. To correct for this deficiency, deals with the insufficiency of the one-determinant ap-
AM1 modifies the core–core term by Gaussian functions. proach. HF usually provides a suitable description of
PM3 is essentially equivalent to AM1 but uses (automated) closed-shell molecules in their electronic ground state.
full optimization of the parameter set against a much larger However, the homolytic dissociation of such a molecule
collection of experimental data while the AM1 parameters generates two electronic states which are very close in en-
are tuned by hand. PM3 therefore on average gives results ergy. This situation requires a description by more than
in somewhat better agreement with experiment. one Slater determinant (i.e., at least a two-configuration
Extending the basis set to include d functions as in method). The energy difference between the HF method
MNDO/d and PM3(tm) raises the number of integrals and a multiconfigurational method is the static correlation
(i.e., the number of parameters) tremendously but allows energy. Accounting for electron correlation is essential
a larger variety of applications, for example, those includ- for quantitative answers from electronic structure calcula-
ing transition metal compounds (albeit with variable ac- tions. Different post-HF methods which attempt to recover
curacy) or hypervalent molecules. all or part of the correlation energy are discussed in the
Semiempirical programs usually report heats of forma- following text.
tion calculated from the electronic energies less the cal- Within the closed-shell HF picture, molecular orbitals
culated energies for the atoms plus the experimental heat are occupied by either exactly two or exactly zero electrons
of formations for the atoms: represented by the variationally best one-determinant
atoms
wave function. Correlated levels give a different electron
Hf = E calc. (molecule) − E calc. (atom) density which cannot be represented by a single Slater
atoms determinant. A logical starting point to account for elec-
+ Hf (atom). tron correlation is to expand a multideterminantal wave
function with the HF wave function as a starting point:
The semi ab initio model 1 (SAM1) is another modified
NDDO method, but it does not replace integrals by param-
= a0 HF + ai i ,
eters. The one- and two-center electron repulsion integrals i=1
are explicitly calculated from the basis functions [em-
ploying a standard STO-3G (Slater-type orbital from three where a0 usually is close to 1. Because this is analogous
Gaussian functions) Gaussian basis set] and scaled by a to expanding one MO in terms of AOs, one speaks of
function which has to be parametrized. SAM1 calculations the basis set as the one-electron basis (responsible for
take about twice as long as AM1 or PM3 calculations do. the one-electron functions, the MOs) while the number
P1: GNB/MAG P2: FQP Final
Encyclopedia of Physical Science and Technology EN003G-129 June 13, 2001 22:42
is not trivial. One easy way to construct an MCSCF is the Because this equation has to be true for all values of λ, the
complete active space self-consistent field, or CASSCF terms connected to the same power of λ can be separated:
(also called full optimized reaction space, or FORS). In-
λ 0 : H0 0 = E0 0,
stead of choosing configurations, one must select a set
of “active” (occupied and unoccupied) orbitals and all λ1: H0 1 + H1 0 = E0 1 + E1 0,
possible (symmetry-adapted) configurations within this
“active space” are automatically included in the MCSCF. λ2: H0 2 + H1 1 + H2 0 = E0 2 + E1 1 + E2 0,
The method is called restricted active space self-consistent
and so forth, which gives the zeroth-, first-, second-, etc.,
field, or RASSCF, when subsets of the active orbitals are
order perturbation equations. If one chooses the interme-
restricted to have a certain (minimum or maximum) num-
diate normalization condition,
ber of electrons to reduce the computational cost.
0| = 1; 0| = 0, i > 0,
The MCSCF provides a good first-order description 0 i
covering the static electron correlation due to degener-
acy problems. Dynamic electron correlation should be simple energy expressions are obtained:
addressed with the MCSCF wave function as a reference.
E0 = 0 |H0 | 0 ,
The multireference configuration interaction, or MRCI,
generates excited determinants from all (or selected) de- E1 = 0 |H | 0 ,
terminants included in the MCSCF. The complete active
space perturbation theory, second order (CASPT2) is a E2 = 0 |H | 1 ,
more economical approach. Both methods can be applied and so forth. Knowledge of wave function corrections
to compute excited states. up to order i allows calculation of the energy up to or-
der (2i + 1). This relationship is known as the Wigner
c. Many-body perturbation theory. Perturbation theorem.
theory assumes that somehow an approximate solution In Møller-Plesset (MP) perturbation theory the unper-
to a problem can be found. The missing correction, which turbed Hamiltonian, H0 , is taken as the sum over n Fock
should be small, is then considered as a perturbation of operators (n = number of electrons) giving a total of twice
the system. When the perturbation is to correct for the ap- the average electron–electron repulsion energy and the
proximation of independent particles the method is called perturbation operator becomes the difference between
many-body perturbation theory, or MBPT. In electronic the exact electron–electron repulsion and twice the av-
structure theory the Hamiltonian operator, H, is written as erage electron–electron repulsion. With this choice of H0
a combination of a reference Hamiltonian, H0 , which can the zeroth-order energy is just the sum of MO energies and
be solved for, and a perturbation H : the first-order energy equals the Hartree-Fock energy. The
H = H0 + λH , second-order correction, E(MP2), is the first to contribute
to the electron correlation energy and can be calculated
λ being the perturbation parameter (0 ≤ λ ≤ 1) which de- from the two-electron integrals over MOs:
termines the strength of the perturbation. The energy, E,
and wave function, , are expanded as Taylor series in λ:
occ virt
[ φi φ j | φa φb − φi φ j | φb φa ]2
E(MP2) = .
E = E 0 + λE 1 + λ2 E 2 + λ3 E 3 + · · · i< j a<b
εi + ε j − εa − εb
TABLE I Formal Scaling of Various Methods The expansion coefficients, t, are called amplitudes. Sub-
with the Size of the Molecular System, N, and stituting the exponential function by a series,
Size-Extensive and Variational Properties
Size
∞
1 k
eT = 1 + T + 12 T2 + 16 T3 + 1 4
T + ··· = T
Methoda Scaling extensive? Variational? 24
k!
k=0
N4
HF Yes Yes = 1 + T1 + T2 + 12 T21 + T3 + T2 T1 + 16 T31
MP2 N5 Yes No
MP3 N6 Yes No + T4 + T3 T1 + 12 T22 + 12 T21 T2 + 24 T1 + · · ·
1 4
MP4SDQ N6 Yes No
MP4SDTQ N7 Yes No
shows that due to the exponential ansatz in CC,
MP5 N8 Yes No
i.) a given excitation level in general is not due to just
MP6 N9 Yes No
one excitation operator (there are “disconnected” terms in
CCSD N6 Yes No
addition to the “connected” term; for example, for doubles,
CCSD(T) N7 Yes No
T21 and T2 , respectively).
CCSDT N8 Yes No
ii.) restricting T generates not only excitations up to
CISD N6 No Yes
a given level, but also higher ones (quartets, etc.) up to
CISDT N8 No Yes
infinity. For example, for T = T2 ,
CISDTQ N 10 No Yes eT2 = 1 + T2 + 12 T22 + 16 T32 + · · · .
a HF, Hartree-Fock; MP, Møller-Plesset (numbers 2–
i< j a<b
CC = eT 0 ,
× ( φi φ j | φa φb − φi φ j | φb φa ).
where the cluster operator T is defined as
CCSD is the only pure CC method that can be used in rou-
T = T1 + T2 + T3 + · · · T N . tine applications. Explicit treatment of triples (CCSDT)
The excitation operators, Ti , generate all ith excited Slater is usually too expensive. However, the contribution of
determinants from the reference: triples can be estimated perturbatively in the CCSD(T)
occ
virt method. Brueckner (B) theory is a variation of CC the-
T1 0 = tia ia ; ory which uses orbitals that make the singles contribu-
i a tion vanish. The accuracy and computational cost of BD
is comparable to that of CCSD. Excited electronic states
occ
virt
T2 0 = j i j ;
tiab ab
··· may be treated within the CC formalism by the equation-
i< j a<b of-motion (EOM-CC) approach.
P1: GNB/MAG P2: FQP Final
Encyclopedia of Physical Science and Technology EN003G-129 June 13, 2001 22:42
correlated wave functions even more than one set is essen- enthalpies of formation, bond energies, ionization po-
tial if high accuracy is desired (e.g., one d and one p set for tentials, electron affinities, and proton affinities. Starting
H, He; one f set and two d sets for first-row elements). An- from the MP4/6-311G(d,p)//MP2(fu)/6-31G∗ + 0.8929
ionic species or molecules with many lone pairs have very ZPE(HF/6-31G∗ ) level, corrections for diffuse functions,
diffuse electron distributions and require an additional set higher polarization functions, and a more complete elec-
of diffuse functions (often denoted by a “+”). The basis tron correlation treatment, as well as a “higher level cor-
sets by Pople and coworkers are very popular: STO-3G rection” depending on the number of α and β valence elec-
is a minimum basis with three GTOs contracted to rep- trons and an empirical factor, are included to extrapolate
resent one atomic orbital. 3-21G is a split valence basis to the QCISD(T)/6-311 + G(3df,2p) level. This procedure
with core orbitals contracted from three GTOs and the va- gives a mean absolute deviation from experimental data
lence region described by one basis function contracted of 1.21 kcal mol−1 for the “G2 test set,” a large number of
by two primitives and another uncontracted function. 6– various types of relative energies accurately known from
31G∗ is constructed analogously but includes one set of experiment. Variations to the G2 method including the
polarization d functions on heavy atoms (other than hy- use of DFT methods have been proposed either to further
drogen). Dunning’s correlation consistent basis sets (cc-p increase the accuracy (e.g., G3) or to reduce the compu-
VXZ; X = D, T, Q, 5, 6, . . .) represent a series of basis tational expense.
sets converging to the basis set limit. This allows extrap-
olation to the infinite basis set limit. The cc-pVDZ (cor-
relation consistent polarized valence double zeta) basis C. Density Functional Theory
set has [3s, 2p, 1d/2s, 1p] contracted from (9s, 4p, 1d/4s, The Hohenberg-Kohn theorem provides the inspiration for
1p) for first-row and hydrogen atoms, respectively. The density functional theory (DFT): all ground-state proper-
next-better basis set, cc-p VTZ, has one basis function ties of a system are functionals of the charge density. In
more of each type and adds one next-higher angular mo- particular the correct energy can be derived from the cor-
mentum function: [4s, 3p, 2d, 1f/3s, 2p, 1d]. Augmenta- rect charge density. Conversely, an incorrect density will
tion of one extra diffuse function for each type of angular give an energy above the correct energy. In a DFT calcu-
momentum (1s, 1p, 1d for cc-pVDZ) is denoted by the lation the energy is optimized with respect to the density.
prefix aug-. It should be much simpler to handle the total electron den-
sity (which depends on three coordinates) than to treat all
5. Optimizing the Performance/Cost Ratio electrons explicitly (involving one spin plus three spatial
coordinates per electron). Although Hohenberg and Kohn
In any application a compromise between computational proved that the electron density determines the electronic
cost and accuracy has to be made. Accurate geometries ground-state energy, the functional to convert the electron
are relatively easy to compute; MP2 with a polarized density function into an energy value is unknown.
double zeta basis set usually gives satisfactory results. Each contribution to the total energy—the kinetic
Accurate energetics require a better theoretical treatment energy, E T [ρ]; the nucleus–electron attraction, E ne [ρ];
and therefore are generally determined from a “single en- and the electron–electron repulsion, E ee [ρ]—can be ex-
ergy point calculation” at a higher level [e.g., CCSD(T) pressed as a functional of the total electron density:
with a TZP basis set]. Whereas a single slash (/) is
used to separate method and basis set specification, a E[ρ] = E T [ρ] + E ne [ρ] + E ee [ρ].
double slash (//) means “at the geometry optimized at”
In analogy to HF theory, E ee [ρ] can be divided into a
(e.g., CCSD(T)/TZP//MP2/DZP). Zero-point vibrational
Coulomb (E J [ρ]) part and an exchange part. The formula
energy (ZPE) corrections to relative energy are usually
for E ne is exact:
applied but need not be derived from a frequency calcula-
tion at the highest level of optimization. ZPEs are usually
Nucl.
ρ(r)
scaled by an empirical correction factor (depending on the E ne [ρ] = − Za dr,
a |r − Ra |
theory level) to account for the overestimation of vibra-
tional frequencies. and
To achieve chemical accuracy (i.e., ±1 kcal mol−1 ) for 1 ρ(r1 )ρ(r2 )
relative energies, for molecules of chemical interest, in- E J [ρ] = dr1 dr2
2 |r1 − r2 |
vestigators devised interpolation schemes based on an ad-
ditivity assumption. Most popular is the G2 method, a holds true for electrons moving independently in the field
general procedure based on ab initio theory for the ac- caused by all electrons—approximations which are hoped
curate prediction of energies of molecular systems, like to be corrected by a combined exchange and correlation
P1: GNB/MAG P2: FQP Final
Encyclopedia of Physical Science and Technology EN003G-129 June 13, 2001 22:42
term, E XC . The major task of DFT is to develop approxi- assessed by comparison with experimental data and there
mate but accurate functionals for E T and E XC . is no consistent way to improve the quality of a given func-
For the noninteracting uniform electron gas, the kinetic tional. This is in contrast with wave function methods, in
and exchange energies can be derived: which a more complete treatment of electron correlation
E TTF [ρ] = 3
(3π 2 )2/3 ρ(r)5/3 dr (and a more flexible basis set) means a closer approach to
10
the exact solution.
and
3 3 1/3
E KD [ρ] = 4 π
ρ(r)4/3 dr. 1. Local Density Approximation
In Thomas-Fermi theory the total energy is expressed as The local density approximation (LDA) assumes varia-
E TF [ρ] = E ne [ρ] + E J [ρ] + E TTF [ρ], tions of the density to be slow and treats the local density
while the Thomas-Fermi-Dirac expression adds the ex- as a uniform electron gas:
1/3
change expression E kD [ρ]. The uniform electron gas is a E XLDA [ρ] = − 34 π3 ρ 4/3 (r) dr.
model too crude to describe molecules: neither TF nor
TFD gives bonding between atoms. The X α method is an example of LDA in which the cor-
As a way to improve on the expression for the kinetic relation energy is neglected and the exchange energy ex-
energy, Kohn-Sham theory calculates the kinetic energy pression is multiplied by a parameter α. A fairly accurate
for noninteracting electrons and corrects for the error rel- expression for the correlation energy of the uniform elec-
ative to the real kinetic energy by means of the exchange tron gas, the VWN (Vosko, Wilk, Nusair) functional, was
correlation term E XC [ρ]: derived by fitting it to Monte Carlo results.
E DFT [ρ] = E TS [ρ] + E ne [ρ] + E J [ρ] + E XC [ρ],
2. Generalized Gradient Approximation
where E TS [ρ] can be computed from a Slater determinant:
As a way to better treat the nonuniform electron dis-
N
E TS [ρ] = φi − 12 ∇ 2 φi , tribution of molecules, the exchange and correlation
i functionals were modified to include derivatives of the
and E XC [ρ] is usually separated into an exchange contri- density in the gradient-corrected approximation, or gener-
bution (E X [ρ]) and a correlation contribution (E C [ρ]). alized gradient approximation (GGA). Gradient-corrected
In the Kohn-Sham implementation of DFT, the den- exchange functionals were developed [e.g., by Perdew and
sity, ρ, is derived from a single Slater determinant with Wang (PW86) and Becke (B)]. Popular correlation func-
orthonormal orbitals, φi : tionals are those of Lee, Yang, and Parr (LYP), Perdew
(P86), Perdew and Wang (PW91), and Becke (B91). Hy-
occ
ρ(r) = |φi (r)|2 . brid methods (such as the popular Becke three-parameter
i functional, B3) use part of the exchange as computed by
The energy is then optimized by solving a set of one- the HF method. The three parameters in B3 determine
electron equations, the Kohn-Sham equations, but with the mixing of LDA and exact exchange, as well as the
electron correlation included: gradient-corrected contributions to the exchange and cor-
relation terms. The parameters are fitted to experimental
hKS φi = εi φi , thermodynamical data. Exchange and correlation func-
where tionals can freely be combined to give an arsenal of DFT
1
Nucl.
Za ρ(r ) methods (e.g., BLYP, BP86, B3LYP, etc.), but B3LYP
hKS = − ∇ 2 + + dr + VXC (r). is the most popular because of its consistently good
2 a |Ra − r| |r − r |
performance.
The (Kohn-Sham) orbitals, φi , which are used to represent Recently, the DFT formalism has been extended to treat
the electron density, ρ, can be determined numerically or excited electronic states through the implementation of
variationally as an expansion of basis functions. time-dependent DFT (TD-DFT).
The Kohn-Sham equations have to be solved iteratively
because the Coulomb term depends on the density (i.e.,
the orbitals to be determined). The main advantage of DFT III. APPLICATIONS OF
methods is that they include some treatment of electron COMPUTATIONAL CHEMISTRY
correlation at a computational cost equivalent to that of
the HF method. The main disadvantage of DFT, however, When a chemical problem is to be studied computation-
is that there is no hierarchy of increasingly better func- ally, an appropriate level of theory must first be chosen.
tionals. The performance of a given functional must be Simple qualitative concepts such as the frontier molecular
P1: GNB/MAG P2: FQP Final
Encyclopedia of Physical Science and Technology EN003G-129 June 13, 2001 22:42
orbital (FMO) theory or orbital symmetry (Woodward- tive eigenvalue describes the transition mode. Following
Hoffman) rules, which were developed on the basis of the minimum energy path in mass-weighted coordinates
primitive computations, can often successfully predict rel- (intrinsic reaction coordinate, IRC) allows one to confirm
ative reactivities and selectivities. Qualitative concepts which minima are connected by a transition state. Har-
are very useful as they can provide chemical insight and monic vibrational frequencies can be derived from the
understanding. For quantitative answers more sophisti- force constants and allow one to compute entropy and en-
cated computational methods have to be employed and a thalpy values.
compromise between desired accuracy and computational The molecular geometry input can be provided in the
cost has to be made. In any case, it is essential to know form of a Z-matrix [i.e., be defined through internal coor-
about the possibilities and limitations of the method to be dinates (bond length, bond angles, and dihedral angles)].
applied. Due to improved optimization algorithms (handling re-
dundant internal coordinates) and the size of molecules
that can now be computed routinely, the input is mostly
A. The Potential Energy Surface provided as Cartesian coordinates, often generated with
The methods described in Section II provide expressions the help of a graphical user interface to the computational
for the energy as a function of the atomic coordinates [i.e., chemistry program package.
they describe the potential energy surface (PES) for a given The computational speed of the more approximate
molecular formula] (Fig. 8). This allows one to search for methods allows one not only to treat larger molecular
an atomic arrangement which makes the energy a mini- systems, but also to address new problems. The func-
mum (geometry optimization). More efficient optimiza- tion of a protein is determined by its three-dimensional
tion algorithms can be employed when the forces and the structure. Hence, there is considerable interest in solv-
Hessian [first and second derivatives, respectively, of the ing the protein folding problem by predicting the three-
energy with respect to (w.r.t.) the nuclear coordinates] are dimensional (secondary and tertiary) structure of a protein
also known. One global and usually many additional local based on the amino acid sequence (primary structure).
minima exist for a given formula and relative energies for The binding of a substrate (or inhibitor) is crucial for the
isomeric structures can be obtained. The transition struc- catalytic activity of an enzyme. Possible binding modes
ture, which is the highest point on a minimum energy path are investigated by molecular docking, in which various
connecting two minima, can also be localized. This reveals structures of intermolecular complexes are generated and
information about chemical reactions, such as mechanistic evaluated.
details, activation barriers, and so forth, which is difficult
or impossible to deduce from experimental investigations.
Minima and transition states are stationary points (i.e., the B. Analyzing the Wave Function
forces are zero). All eigenvalues of the Hessian are posi- The wave function, , which describes the electron dis-
tive for minima while transition states have one negative tribution around a given nuclear arrangement, not only
eigenvalue. The eigenvector corresponding to the nega- can be used to compute the energy of the system, but also
offers useful interpretation opportunities.
to determine the number of electrons ρA associated with orbital (NBO) analysis has some advantages (e.g., has no
atom A. The atomic charge, Q A , is given as the difference strong basis set dependence, is computationally inexpen-
between the nuclear charge Z A and ρA : sive, and can also be applied to electron correlated meth-
ods). In a first step, natural atomic orbitals (NAOs) are
Q A = Z A − ρA .
generated by a diagonalization of the atomic blocks of
Bond orders bAB can be defined on the basis of the sum of the density matrix giving pre-NAOs which are orthog-
electrons shared between atoms A and B: onalized in several steps applying occupancy-weighted
and normal orthogonalization procedures. Diagonal val-
bAB = (DS)αβ (DS)βα . ues of the density matrix in the resulting NAO basis cor-
α∈A β∈B
respond to orbital populations which can be summed up
It is arbitrary how the electron density arising from ba- to give atomic charges. Off-diagonal blocks define bonds
sis functions located on different atoms is divided. The between atoms. The resulting localized MOs give a de-
Mulliken procedure can give unphysical charges and scription in agreement with chemical intuition (core or-
shows a strong basis set dependence. Because of these bitals, lone pairs, and bonds) and can be analyzed in terms
shortcomings an atom definition which does not depend of bond polarity, bond bending, hybridization, and so
on the basis set is desirable. forth. Effects such as hyperconjugation can be investigated
by analyzing the interactions between formally occupied
2. Wave Function Analysis Based and formally empty orbitals within a localized orbital
on the Electron Density picture.
FIGURE 9 Some chemical information that can be gained from a computational investigation. NMR, nuclear magnetic
resonance; IR, infrared.
predicting molecular properties and chemical behavior. Kryspin, K. N. Kirschner, P. v. R. Schleyer, and A. Y. Timoshkin. M.H.
The era foreseen by R. S. Mulliken in his acceptance is grateful to Professor R. Krämer for support.
speech for the 1966 Nobel Prize has arrived:
I would like to emphasize strongly my belief that the era of SEE ALSO THE FOLLOWING ARTICLES
computing chemists, when hundreds if not thousands of chemists
will go to the computing machine instead of the laboratory for MECHANICS, CLASSICAL • MOLECULAR ELECTRONICS •
increasingly many facets of chemical information, is already at PHOTOELECTRON SPECTROSCOPY • PROTEIN STRUCTURE
hand. • QUANTUM CHEMISTRY • QUANTUM MECHANICS
Dynamics of Elementary
Chemical Reactions
H. Floyd Davis
Hans U. Stauffer
Cornell University
697
P1: GLM/GJK P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004J-189 June 8, 2001 19:4
beam is characterized by a velocity distribution much presence of a third body (M), which may be a molecule
narrower than a Boltzmann distribution. or the wall of the container. Note that the two elementary
Potential energy surface Schematic two- or three- propagating reactions may be added like mathematical
dimensional representation of the total potential en- equations, yielding the overall chemical reaction (1).
ergy of a chemical system as a function of internuclear For an overall reaction such as that in Eq. (1) involving
coordinates. a sequence of elementary steps, the overall rate of forma-
Transition state Region of the PES corresponding to the tion of products may be a complex function of reactant
critical geometry through which a reacting system must concentrations, because products are formed by several
pass for reactants to become products. different elementary processes. In the previous example,
the HCl products are formed by reactions (3) and (4), each
of which has its own rate law and rate constant. Thus, for
AN ELEMENTARY CHEMICAL reaction is any process a complex multistep process such as reaction (1), the rate
involving bond fission and/or bond formation following a law can only be determined through experiment.
single collision between two reactants. Chemical reactions For an elementary bimolecular reaction A + B → C +
occur in all three phases of matter (gas, liquid, and solid), D, the reaction rate is proportional to the concentrations
and at their interfaces. Under the experimental conditions (denoted by [A], [B], etc.) of the reactants:
most commonly used to carry out reactions, the overall d[C] d[D] d[A] d[B]
reaction usually consists of a sequence of two or more el- = =− =− = k[A][B]. (5)
dt dt dt dt
ementary reactions. For example, the reaction of gaseous
hydrogen with chlorine forming hydrogen chloride is rep- The proportionality constant, k, is called the reaction rate
resented by the following balanced chemical equation: constant. Since an elementary reaction involves a single
bimolecular collision between A and B, the maximum
H2 + Cl2 → 2HCl. (1) possible rate constant is usually the frequency of collisions
between reactants. A few simple atom-transfer reactions
This reaction proceeds by a chain mechanism involving
(e.g., F + H2 → HF + H) actually do occur on nearly every
a repetitive sequence of elementary reactions involving
collision, and are said to proceed at or near the “gas kinetic
the three stable molecules listed in Eq. (1), as well as
limit.”
two short-lived free radical intermediates, i.e., chlorine
atoms (Cl) and hydrogen atoms (H). The most important
elementary steps in the overall reaction mechanism are:
I. KINETICS AND COLLISION THEORY
Initiation : Cl2 → 2Cl. (2)
In order to estimate the frequency of collisions between
gaseous A and B molecules, consider a beam of molecules
Propagation : Cl + H2 → HCl + H (3a) of incident flux I A (molecules/cm2 · s) impinging on a
static cell containing molecules at a concentration [B]
H + Cl2 → HCl + Cl. (3b) (molecules/cm3 ). The particles interact in a volume el-
ement V . The collision rate per unit time, Z , is given by
Cl + Cl + M → Cl2 + M (4b) Here, σ is the collision cross section, which may be es-
timated using a simple hard sphere model for colliding
H + H + M → H2 + M. (4c) particles (Fig. 1). Two particles collide with a relative ve-
The chain reaction is initiated by dissociation of Cl2 , a sta- locity vector, g, the magnitude of which is denoted by
ble molecule, to form two highly reactive chlorine atoms g, and impact parameter b, also known as the “aiming
(Cl). Since chemical bond fission requires the input of error” of the collision. A hard sphere collision will occur
energy, initiation may be achieved by heating the sample provided 0 ≤ b ≤ (r A + r B ). The collision cross section is
(see Lindemann mechanism) or by ultraviolet irradiation therefore the area of a circle of radius d AB = r A + r B , i.e.,
(photodissociation). Following initiation, the two elemen- σh.s. = πd AB
2
. The incident flux, I A = [A]g A , may then be
tary bimolecular propagating reactions will continue until substituted into Eq. (6). If the rate of reaction between A
either or both of the reactants (H2 and Cl2 ) are consumed, and B is simply the collision rate, then
at which time the termination steps end the chain reac-
tion. Termination typically involves termolecular recom- d[A]
− = Z = σh.s. g A [A][B]. (7)
bination of two radicals to form a stable molecule in the dt
P1: GLM/GJK P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004J-189 June 8, 2001 19:4
making it convenient to conceptualize an effective poten- to the empirical Arrhenius expression found to satisfacto-
tial, rily model a large number of chemical reactions:
(µgb)2 k = Ae−Ea /k B T , (23)
Veff (r ) = + V (r ), (17)
2µr 2
where A is the Arrhenius preexponential factor, and E a is
that governs the radial motion of the particles. At large the Arrhenius activation energy. These quantities are most
internuclear distances, the total energy of the system, readily determined by plotting ln k vs. 1/T , which should
E = 1/2 µg 2 , is simply the radial kinetic energy, since be linear with a slope −E a /k B and intercept ln A. Note that
lim
V (r ) = 0. However, at smaller values of r , where
r →∞ eff this purely empirical relationship often holds for elemen-
Veff (r ) > 0, some of the initial kinetic energy is converted tary as well as multistep reactions. The obvious similarity
into energy associated with the effective potential. The dis- between Eqs. (21) and (23) suggests that E a is at least
tance of closest approach, or turning point, rt. p. , is reached loosely related to the height of the potential energy bar-
when the magnitude of the effective potential is equal to rier for the rate-limiting step in the reaction. However, the
the initial radial kinetic energy, and the radial velocity Arrhenius parameters are only phenomenological quanti-
becomes zero: ties derived from the temperature dependence of reaction
(µgb)2 rate constants. In fact, Arrhenius plots are in many cases
E = 1/2 µg 2 = + V (rt. p. ). (18) found to be markedly nonlinear, suggesting the occurrence
2µrt.2 p.
of a multistep reaction mechanism or a mechanism that
This may be rearranged to solve for b = bmax , the max- changes at different temperatures.
imum impact parameter for a given relative velocity, g, The critical distance model can be used to derive an
that allows the collision partners to reach a fixed critical explicit formula for the temperature dependence of the re-
distance for reaction, rc : action rate constant for charge transfer reactions of the
form A+ + B− → A + B. Such interactions are subject
2V (rc ) 1/2 to long-range Coulomb attractions of the form V (r ) =
bmax (g) = rc 1 − , (19)
µg 2 −q 2 /4π ε0r , where q is the charge of an electron. Tak-
ing the critical distance, rc , to be the ionic–covalent curve
which holds for g ≥ gmin , where, in order to be physically
crossing radius (R), which corresponds to the distance
meaningful,
at which the Coulomb attraction between ions balances
gmin = [2V (rc )/µ] /2
1
if V (rc ) > 0 the energy required for electron transfer, one obtains by
substitution into Eq. (22) the following expression for the
=0 if V (rc ) ≤ 0. (20) charge exchange rate constant:
The reaction rate constant may in both of these cases be 8k B T 1/2 q2
determined analytically by integration over all relative ve- k(T ) = πR 2 1+ . (24)
πµ 4πεo Rk B T
locities exceeding gmin :
This reaction rate constant expression bears some similar-
∞ ity to a hard-sphere rate constant; however, an additional
k(T ) = π {bmax (g)}2 g f (g) dg term (q 2 /4π εo Rk B T ) results from the long range attrac-
gmin
tive interaction, and is in general the dominant contribution
1/2 to the reaction rate constant for reactions of this type.
8k B T − Vk (rcT)
= πrc2 e B if V (rc ) > 0 For attractive potentials of the form V (r ) = −a/r s , Veff
πµ
is given by
(21)
1/2 L2 a (µgb)2 a
8k B T V (rc ) Veff (r ) = − = − s. (25)
= πrc2 1− if V (rc ) ≤ 0. 2µr 2 r s 2µr 2 r
πµ kB T
Provided s ≥ 3, Veff has a local maximum for a given im-
(22) pact parameter, b, at a radial distance, rmax , determined
In cases where V (rc ) is positive, the critical distance rate by
constant expression [Eq. (21)], is similar to the hard sphere 2−s
sa
collision rate constant [Eq. (10)]; however, an additional rmax = . (26)
µg 2 b2
exponential term is present. This term represents the frac-
tion of molecules at temperature T having sufficient en- For close approach required for reaction, the two parti-
ergy to react. This temperature dependence is thus similar cles must overcome this maximum, Vmax = Veff (rmax ), as
P1: GLM/GJK P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004J-189 June 8, 2001 19:4
FIGURE 5 Relationship between Veff (r ) and V(r ) for a given impact parameter, b. Close approach necessary for
reaction requires E >Vmax , where Vmax is the maximum in Veff (r ).
illustrated in Fig. 5. Classically, if the energy of the col- van der Waals interactions (i.e., s = 6) are expected to
liding reactants is exactly equal to Vmax , all of the initial show a small positive temperature dependence (k ∝ T 1/6 ).
kinetic energy is converted into Veff when r = rmax , and The profound effect of the exact form of the internu-
“orbiting” will occur; i.e., the interacting pair will rotate clear potential on the interaction between particles can be
together for an infinite amount of time since both the radial observed in elastic scattering experiments. These studies
kinetic energy and the centrifugal force acting on the parti- allow determination of the angle of deflection of a parti-
cles (−d Veff /dr ) are zero. The orbiting impact parameter, cle from its original direction upon interaction with the
bor b , defined as the impact parameter for a given initial col- second particle. Conceptually, the scattering process can
lision energy, E = 1/2 µg 2 , at which orbiting occurs, can be be understood by considering the effect of a particle of
determined by setting Vmax = E and rearranging to arrive mass µ colliding with an infinitely massive particle fixed
at in space. The deflection angle, χ , defined as the angle be-
tween the initial and final relative velocity vectors of the
a(s − 2) 2/s s colliding pair, will depend on the form of potential, V (r ),
bor b =
2
. (27)
µg 2 s −2 and, based on the impact parameter at which a given col-
lision occurs, the region of V (r ) that is sampled by the
Since smaller impact parameter collisions will result in a colliding pair.
smaller value of Vmax , only collisions occurring with im- Recall from Eq. (17) that the effective potential, Veff ,
pact parameters less than bor b will lead to a close collision governs the radial motion of the colliding particles, and
and reaction. Thus, the reaction cross section, σ = π bor 2
b, therefore determines the radial turning point, rt. p. , for a
which depends on the relative velocity g, may be inte- given magnitude of initial collision energy, E. Figure 6
grated over all relative velocities to derive the rate constant shows three effective potentials (for impact parameters
temperature dependence: b = 0, b1 , b2 , and b3 , where 0 < b1 < b2 < b3 ) for an in-
∞ ternuclear potential, V (r ), with both long-range attractive
k(T ) = π bor
2
b g f (g) dg and short-range repulsive components. If the two particles
0 collide with a fixed collision energy (e.g., that denoted by
1/2
3s−4 π s−4 s −2 E 1 ), the radial turning point, and therefore the regions of
=2 2s (s − 2) a (k B T ) 2s
2/s 2/s
, V (r ) accessed during the collision, will depend strongly
µ s
on the magnitude of b. For very large values of b, (b ∼ b3 ),
(28) the turning point lies at very large r , and the particle
where the gamma function, , is available in mathemat- experiences little deflection upon approach (χ ∼ 0). For
ical tables. This equation predicts that reactions involv- smaller values of b (b ∼ b2 ), the turning point moves to
ing quenching of an electronically excited state, which smaller values of r , and the interacting particles therefore
can be modeled using s = 3, will show weak inverse tem- sample more of the long-range attractive part of V (r ), re-
perature dependence, k(T ) ∝ T 1/6 . Ion–molecule reac- sulting in a deflection toward more negative angles. As
tions, having s = 4, are predicted to have rate constants b becomes even smaller, however, the effects of the re-
independent of temperature. Reactions dominated by pulsive part of the potential begin to play a role, and χ
P1: GLM/GJK P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004J-189 June 8, 2001 19:4
FIGURE 6 Dependence of Veff on impact parameter, b, where 0 < b 1 < b 2 < b 3 . Larger magnitudes of b result in
larger values of the radial turning point for a given collision energy.
reaches a minimum value, called the “rainbow” angle, χr , imum of χ = π for direct head-on collisions (b = 0). A
by analogy to the optical rainbow resulting from scattering pictorial depiction of the dependence of χ on the impact
in small water droplets. The impact parameter that results parameter, known as the deflection function, is shown in
in this most negative degree of deflection is referred to Fig. 7, where particle trajectories during the course of a
as the rainbow impact parameter, br . For impact parame- collision are shown for a wide range of impact parameters.
ters less than br (b ∼ b1 ), short-range repulsion begins to Note that this figure depicts the dependence of the deflec-
dominate, and the deflection angle becomes less negative, tion angle, χ , on b∗ = b/re , where re is the internuclear
reaching zero at the impact parameter at which the attrac- separation where V (r ) reaches a minimum.
tive and repulsive forces during the collision exactly off- Although collision theory has provided considerable in-
set, the so-called “glory” impact parameter, bg . For impact sight beyond simple hard spheres, it cannot properly ad-
parameters smaller than bg , repulsion dominates, and the dress questions such as what fraction of collision geome-
particle is scattered to positive angles, reaching a max- tries are likely to lead to reaction. Such issues cannot be
FIGURE 7 Deflection angle, χ , dependence on impact parameter. Trajectories depict the degree of deflection of
impinging particle resulting from attractive and repulsive components of the interaction potential. (From Levine, R. D.,
and Bernstein, R. B. (1987). “Molecular Reaction Dynamics and Chemical Reactivity,” Oxford University Press, New
York.)
P1: GLM/GJK P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004J-189 June 8, 2001 19:4
properly accounted for unless the details of the molecu- According to transition state theory, a fast equilibrium
lar structure of reactants are considered. Often, the rate exists between reactants and molecules at the transition
constant calculated from collision theory must be reduced state, denoted by AHB‡ :
by inclusion of an ad hoc “steric factor” which represents k1 ω
the fraction of collisions that have proper geometry for A + HB AHB‡ → AH + B. (31)
k−1
reaction. This additional factor is an empirical factor used
The equilibrium concentration of molecules at the transi-
to bring experimental observation into line with collision
tion state is given by
theory, and accounts for those important factors not ad-
dressed by collision theory. [AHB‡ ] = K ‡ [A][HB], (32)
where K = k1 /k−1 . The overall rate of product formation
‡
where v is the velocity through the transition state region The overall rate of product formation is
and δ is the “width” of transition state along the reaction d[P1 ] d[P2 ] d[A] k1 k2 [A][M]
coordinate. = =− = k2 [A∗ ] = .
dt dt dt k−1 [M] + k2
The contribution to the partition function of the acti-
vated complex corresponding to motion along the reaction (41)
coordinate, in this case translational motion, is factored out At high pressure, k−1 [M] k2 , and d[P1 ]/dt = (k1 k2 /
of the transition state partition function, Q AHB‡ , to yield: k−1 )[A], yielding first-order kinetics. Under low-pressure
(2πµk B T ) /2 δ
1 conditions, k−1 [M] k2 and d[P1 ]/dt = k1 [A][M] result-
Q AHB‡ = qtrans,AHB‡ Q AHB‡ = Q AHB‡ . ing in second-order behavior. The important result is that
h
(36) reactions that appear to be unimolecular, exhibiting first-
order kinetics at high pressures, actually involve bimolec-
Note that the width parameter δ appears again in Eq. (36). ular processes.
This parameter ultimately factors out in the transition state An important class of elementary bimolecular reactions
theory rate constant, k TST , which is given by: are those that involve formation of persistent collision
complexes, denoted AB∗ , that may ultimately form prod-
k B T Q AHB‡ −E/k B T
k TST = ωK ‡ = e . (37) ucts C + D, or decay back to reactants:
h Q A Q HB
k1 k2
A + B AB∗ → P1 + P2 . (42)
Although we have derived this equation assuming that k−1
the reaction coordinate for atom transfer corresponds Following formation of the AB∗ intermediate by bimolec-
to translational motion, the same expression is obtained ular collision, the reaction dynamics are somewhat analo-
if the reaction coordinate is assumed to be vibrational gous to those in the Lindemann mechanism: an internally
motion. According to Eq. (37), the reaction rate constant excited molecule may decay to products or reform reac-
may be calculated using the relevant molecular partition tants. In the absence of collisions, the dynamics are solely
functions, known from statistical mechanics, remember- determined by the unimolecular dynamics of the complex.
ing that Q AHB‡ does not include the translational motion However, at high pressures, particularly if the lifetime of
contribution to the transition state partition function. the AB complex is long, it may undergo collision with
another body, possibly carrying away excess energy re-
sulting in formation of stable AB:
k3
III. UNIMOLECULAR VERSUS AB∗ + M → AB + M. (43)
BIMOLECULAR REACTIONS
Many early experiments showed that thermal decompo- IV. STATISTICAL THEORIES OF
sition of a molecule A, forming products P1 and P2 , UNIMOLECULAR DECOMPOSITION
often exhibits first order kinetics at high pressure, and
second–order kinetics at low pressure. As first proposed The Lindemann mechanism as well as reactions occurring
by Lindemann, the mechanism involves collisional exci- via formation of long-lived complexes involve participa-
tation of the reactant A: tion of highly internally excited intermediate species that
may ultimately dissociate by one or more chemical chan-
k1 k2 nels. For example, the intermediate complex AB∗ in reac-
A + M A∗ + M, A ∗ → P 1 + P2 . (38)
k−1 tion (42) may form new products P1 + P2 , or decay back
to reactants, A + B. The total rate constant for decay of
Under steady-state conditions, the concentration of the AB∗ is the sum of the two rate constants, k−1 + k2 , and
collisionally activated molecule, A∗ is constant, i.e., the the relative importance of these competing processes is
rate of its formation is exactly balanced by the rate of its defined as the product branching ratio k2 /k−1 . Of key im-
destruction: portance in understanding these reactions are the reaction
d[A∗ ] rate constants k−1 and k2 . A number of theories have been
= k1 [A][M] − k−1 [A∗ ][M] − k2 [A∗ ] = 0. (39) developed to quantitatively predict rate constants for uni-
dt
molecular reactions.
Rearranging: Rice, Ramsperger, and Kassel developed a simple the-
ory, now known as RRK theory, which contains many
k1 [A][M] fundamental elements underlying most modern theories
[A∗ ] = . (40)
k−1 [M] + k2 of unimolecular reaction. According to RRK theory, the
P1: GLM/GJK P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004J-189 June 8, 2001 19:4
energized molecule produced by bimolecular collision With the development of computers, accurate calcula-
may be considered to consist of a group of s identical har- tions using theoretical models better able to represent the
monic oscillators, each of frequency ν. If any oscillator behavior of real molecules has become widespread. A very
accumulates sufficient energy E o = mhν, where m is an important extension of the original theory, due to Marcus,
integer and h is Planck’s constant, the energized molecule is known as RRKM theory. Here, the real vibrational fre-
will dissociate. An underlying assumption of RRK (as well quencies are used to calculate the density of vibrational
as other related theories) is that energy may flow freely be- states of the activated molecule, N (E). The number of
tween the oscillators in the molecule. The total energy of ways that the total energy can be distributed in the acti-
all of the oscillators is denoted by E = nhν. The number of vated complex at the transition state is denoted W (E ).
ways that n quanta can be placed in a molecule consisting Note that the geometry of the transition state need not be
of s oscillators is: known, but the vibrational frequencies must be estimated
(n + s − 1)! in order to calculate W (E ). In calculating the total number
wn = . (44) of available levels of the transition state, explicit consid-
n!(s − 1)!
eration of the role of angular momentum is included. The
The number of ways that n quanta can be placed in a RRKM reaction rate constant is given by:
molecule such that at least m quanta are in one oscillator
W (E )
is given by kRRKM = , (49)
h N (E)
(n − m + s − 1)!
wm = . (45) where h is Planck’s constant.
(n − m)!(s − 1)!
The probability, P, of dissociation is the ratio of these
V. REACTIONS IN SOLUTION
quantities:
wm (n − m + s − 1)!n! The density of molecules is substantially higher in liquids
P= = . (46)
wn (n − m)!(n + s − 1)! than in the gas phase. However, for reactions carried out in
solution under relatively dilute conditions, the concentra-
If n and m are large, then Sterling’s approximation tions of reactants are not appreciably different from in the
(x! ≈ x x /e x ) may be applied, and, if s is small relative gas phase. Since reactant molecules A and B must undergo
to (n − m), then (n − m + s − 1) ≈ n − m. The probability collision in solution in order to react, many of the same
of decomposition then reduces to principles developed for gas-phase reactions also apply
in solution. However, the presence of solvent molecules
n − m s−1
P= . (47) leads to important differences between reactions in solu-
n
tion and in the gas phase. In solution, the rate of diffusion
The rate constant for unimolecular decomposition is the often limits the rate of approach of molecule A to within
probability P multiplied by a frequency factor ν: a sufficient distance to B for reaction to occur. Once an
encounter pair AB is formed, however, the solvent may act
n − m s−1 E − E o s−1 as a “cage,” effectively holding them in close proximity,
kRRK = ν =ν . (48)
n E thereby increasing the probability of reaction.
In solution, the overall reaction may again be broken
For a given value of s, the reaction rate constant increases
down into a sequence of elementary steps:
with increasing energy E above threshold, E o . If E E o ,
the reaction rate constant approaches the frequency factor kd kr
A + M AB, AB → P, (50)
ν. On the other hand, for reactions involving two simi- k−d
lar but different-sized molecules having the same E and where, AB is an encounter pair, kd and k−d are rate
E o , since the larger molecule has a greater number of os- constants for approach and separation of the reactant
cillators, s, the reaction rate constant k is smaller (since molecules by diffusion, and kr is the rate of conversion
(E − E o )/E < 1). In practice, to obtain agreement with of encounter pairs to products. Applying the steady-state
experiment, it is necessary to use values of s in Eq. (48) approximation to the concentration of encounter pairs, we
which are approximately one half of the actual number obtain
of vibrational modes in the molecule. Of course, because
d[P] k d kr
RRK theory treats all oscillators as having the same vi- = [A][B] = k[A][B]. (51)
brational frequency, the theory employs very simple equa- dt k−d + kr
tions that represent qualitatively but not quantitatively the If the activation energy for the reaction is large, kr kd
behavior of real molecules. and the reaction rate constant k ≈ kd kr /k−d . Alternatively,
P1: GLM/GJK P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004J-189 June 8, 2001 19:4
if kr kd , then k ≈ kd , and the reaction rate constant is the A. Cross Section Measurements
rate of diffusion of reactants to sufficiently close proximity
The reaction cross section may be determined experimen-
to facilitate reaction. The rate of a diffusion controlled
tally as a function of collision energy using a variety of
reaction is determined by the magnitude of the diffusion
methods. For the reaction H + D2 → HD + D, the reaction
coefficients, D, for the reactants A and B:
has been carried out in a flow cell containing mixtures of
k = 4π(DA + DB )R. (52) D2 and a stable H atom precursor such as HBr. The H
atoms are produced by photolysis of HBr at various UV
The diffusion coefficients are related to the viscosity of the wavelengths. Reaction cross sections are determined by
solvent, η, and hydrodynamic radius, r , of the diffusing measuring the concentration of products relative to reac-
species by the Stokes–Einstein Law: tants directly. In the present case, this involves monitoring
the relative concentrations of H and D atoms (i.e., reactants
kB T and products) as a function of time following photodisso-
D= , (53)
πβηr ciation of reactant precursor. Both H and D atoms may
be detected by laser induced fluorescence (LIF) excita-
where β is a constant typically ranging from 4 to 6. Note tion spectroscopy near 121 nm (Lyman-α). By choosing
that this derivation assumes that diffusing particles are different photolysis wavelengths and H atom precursor
equally likely to move in any direction. However, if the molecules, reaction rate constants may be determined for
reactants are oppositely charged ions, Coulomb attraction different collision energies. In this case, reaction cross sec-
substantially increases reaction rates relative to the rate of tions are determined directly from Eq. (8), since the rela-
diffusion. tive velocity is well defined using photolytic reactants. In
Fig. 9, experimental data from several different laborato-
ries are shown as solid points surrounded by rectangles,
and theoretical values are connected by a solid line. The
VI. EXPERIMENTAL TECHNIQUES
reaction cross section is zero below 0.4 eV due to the
presence of a potential energy barrier for reaction, as dis-
It is well known from the empirical Arrhenius expression
cussed in detail in Section VI.B. The reaction cross section
that reaction rate constants often increase with reactant
increases with increasing energy above threshold, with be-
temperature. One goal of molecular reaction dynamics is
havior qualitatively similar to that predicted by Eq. (15).
to understand the roles that different forms of reactant en-
ergy (translational, electronic, vibrational, and rotational)
play in chemical reaction. Also, if a reaction is successful,
how is the total available energy channeled into the various
available degrees of freedom of the product molecules?
Many of the microscopic details of a chemical reac-
tion are related to the nature of the transition state. Most
insight into the transition state region has been obtained
through experiments focusing on the asymptotic limits
of the reaction; i.e., reactants and products. Four gen-
eral categories of experiments will be discussed here, all
focusing on elementary gas-phase reactions. In the first
type of experiment, the total cross sections for various
chemical reaction channels are measured, usually as a
function of collision energy. In the second type, the an-
gular and velocity distributions of products from single
reactive encounters are measured. In the third type of ex-
periment, the quantum state distributions (vibrational and
rotational) of products are measured using spectroscopic
techniques. In this latter approach, velocity distributions
may also be obtained using Doppler or velocity imag-
FIGURE 9 Cross section for H + D2 → HD + D reaction vs. colli-
ing methods. Finally, a relatively recent development is
sion energy. Solid points surrounded by rectangles are experimen-
“transition state spectroscopy,” which focuses directly on tal data and open points connected by solid line is theoretical cal-
the transition state itself, usually through spectroscopic culation. (From Gerlach-Meyer, U., Kleinermanns, K., Linnebach,
measurements. E., and Wolfrum, J. (1987). J. Chem. Phys. 86, 3047–3048.)
P1: GLM/GJK P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004J-189 June 8, 2001 19:4
FIGURE 18 Experimental apparatus for studies of H + D2 reaction in crossed molecular beams. (From Schnieder, L.,
Seekamp-Rahn, K., Wrede, E., and Welge, K. H. (1997). J. Chem. Phys. 107, 6175–6195.)
P1: GLM/GJK P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004J-189 June 8, 2001 19:4
FIGURE 21 Two-dimensional contour map for AB + C reaction. Figure on left denotes reaction involving “early”
transition state, in which translation is more effective in promoting reaction than is vibration; figure on right denotes
reaction having “late” transition state, where vibrational energy promotes reaction.
P1: GLM/GJK P2: FYK Final Pages
Encyclopedia of Physical Science and Technology EN004J-189 June 8, 2001 19:4
Electrolyte Solutions,
Thermodynamics
J. Barthel
W. Kunz
R. Neueder
University of Regensburg
219
P1: GJC Final Pages
Encyclopedia of Physical Science and Technology EN005D-208 June 15, 2001 20:28
Ionophore Electrolyte compound that in its pure phase (e.g., NaHSO4 ) or still more complex
electrolyte com-
exists as an ionic crystal and forms ions in solution by pounds. Electroneutrality requires νi z i = 0.
decomposition to the preformed ions.
1. Ionophores
Medium activity coefficient Measure for the change in
Gibbs energy when an electrolyte is transferred from Ionophores are substances that already exist as ionic crys-
one solvent to another solvent at infinite dilution. tals in their pure state, such as alkali metal halides. When
Osmotic pressure Additional pressure applied to a solu- dissolved in a solvent, ionophores are initially completely
tion to maintain equilibrium with a pure solvent that is dissociated in the solution and their ions are solvated.
in contact with the solution through a membrane per- However, association to ion pairs and higher ion aggre-
meable only to solvent molecules. gates with and without inclusion of solvent molecules may
Partial molar quantity Increase in any extensive ther- occur in many solvents depending on electrolyte concen-
modynamic property of a system at constant pressure tration and solvent permittivity. Equation (1) shows the
and temperature when 1 mol of component Yi is added ionic species in an aqueous CdSO4 solution—solvated
to an infinite amount of the system; the chemical po- ions and ion pairs—which can be detected by appropriate
tential of component Yi is also the partial molar Gibbs methods and which determine the physical properties of
energy of compound Yi . this solution:
Protic (amphiprotic) solvent Solvent apt to react as a CdSO4 (s) −−−−→ Cd2+ (aq) + SO2− →
4 (aq)←
proton donor or acceptor (Brönsted acid or base). H2 O
Solvation Process leading from the ions of an electrolyte, → [Cd(OH2 )SO4 ]0 ←
→ [CdSO4 ]0 .
[Cd(OH2 )2 SO4 ]0 ←
after complete dissociation in a vacuum, to the ions in
solution surrounded by oriented solvent molecules. (1)
Structure-making (-breaking) ion Ion that by a solva-
Aqueous solutions of alkali metal salts exhibit almost no
tion process increases (decreases) the structure of a
association, in contrast to solutions of 2,2 electrolytes,
pure solvent.
such as CdSO4 . Solutions of low permittivity contain ion
pairs (equilibrium constant K A ), triple ions (equilibrium
constant K T ), and higher associates, such as
ELECTROLYTE SOLUTIONS are solutions of elec-
trolyte compounds in pure or mixed solvents such that 1,2-dimethoxyethane
the solutions become electric conductors in which current LiBF4 (s) −−−−−−−−−−→ Li+ + BF−4
is carried by the movement of ions. They exhibit specific KA
←→ Li+ BF− 4 ,
0
(2a)
properties due to the more or less complete dissociation
of their solutes into ions. Aqueous electrolyte solutions + − 0 KT
− −
Li BF4 + BF− → − +
4 ← BF4 Li BF4 , (2b)
are involved in numerous biological, biochemical, geo-
logical, and technical processes. Nonaqueous electrolyte and
0 KT
+ +
solutions are intensively studied owing to unique proper- Li+ BF−
4
→ Li+ BF−
+ Li+ ← 4 Li . (2c)
ties for their application in various new technologies such
as high-energy batteries, electrodeposition, nonemissive
displays, solar cells, phase-transfer catalysis, or electroor- 2. lonogenes
ganic synthesis. Ionogenes, such as mineral or carboxylic acids, form ions
only by a chemical reaction with solvent molecules, for
I. ELECTROLYTES AND SOLVENTS example,
KI
HClO4 + CH3 COOH ← → CH3 COOH+ 2 ClO4
− 0
A. Classification of Electrolytes
KD
→ CH3 COOH+
←
−
2 + ClO4 . (3)
Electrolytes can be classified into two categories,
ionophores and ionogenes, independent of their stoi- The solution of perchloric acid in glacial acetic acid
chiometry. Stoichiometry entails formal arrangement in [Eq. (3)] shows the typical two-step mechanism of ion for-
classes according to the valency of their cations and an- mation of ionogenes. An ionization (equilibrium constant
z+ z−
ions. An electrolyte of type Cν+ Aν− composed of a z + - K I ) process is followed by a dissociation (equilibrium con-
valent cation C and a z − -valent anion Az− is referred
z+
stant K D ) process. The overall constant K is given by the
to as a binary z + , z − electrolyte, for example, HCl (1,1), relationship
CdCl2 (2,1), and CdSO4 (2,2). Binary electrolytes contain
only one type of cation and anion, in contrast to ternary K = K I K D /(1 + K I ). (4)
P1: GJC Final Pages
Encyclopedia of Physical Science and Technology EN005D-208 June 15, 2001 20:28
Solvents of type LH (amphiprotic solvents) such as wa- capacity to interact with other species. Four main groups of
ter, alcohols, and carboxylic acids themselves behave as solvents with typical donor–acceptor properties generally
ionogenes in producing their cations LH+2 (lyonium ions) are distinguished:
and anions L− (lyate ions) by autoprotolysis reactions:
→ LH+ − 1. Protic solvents (e.g., water, alcohols, amides, car-
LH + LH ← 2 +L (5)
boxylic acids)
→ H3 O+ + OH− ). The solution of acids and
(e.g., 2H2 O ← 2. Aprotic solvents (e.g., aprotic amides, nitriles, ke-
bases in amphiprotic solvents in generally followed by tones)
a protolysis reaction as a consequence of the following 3. Low-permittivity electron donor solvents (e.g.,
equilibria: ethers)
4. Inert solvents (e.g., hydrocarbons, chlorinated
→ A− + LH+
AH + LH ← (6)
2 hydrocarbons)
and
Despite such limitations as the overlapping of solvent
→ BH+ + L− ,
B + LH ← (7) classes or possible interactions evading the unambiguous
where LH represents the amphiprotic solvent, AH the acid, classification of a solvent, such classifications are useful
and B the base. When these equilibria are shifted markedly for understanding the properties of electrolyte solutions
toward ionization (e.g., HCl + H2 O ←→ H3 O+ + Cl− ), the and for rationalizing the choice of appropriate solvents
acids or bases are almost completely replaced by the lyate and solvent mixtures for particular investigations.
or lyonium ions of the solvent (leveling effect).
Two aspects determine the role of the solvent: its bulk A. Fundamentals
properties and its proton or electron donor or acceptor ca-
Electrolyte solutions commonly are one-phase systems of
pabilities. A solvent S may act by a basic center β (accep-
k components. The internal energy U of such systems
tor: βS), an acidic center α (donor: Sα), or both (βSα).
expressed as a function of its extensive variables S (en-
Solvation of an electrolyte, Y = C+ A− , then occurs ac-
tropy), V (volume), and n i (amount of substance of com-
cording to the scheme
ponent Yi , i = 1, 2, . . . , k) yields the fundamental equa-
→ C+ (βS)m + A− ,
Y + m(βS) ← (8a) tion of thermodynamics,
→ C + (Sα)n A ,
Y + n(Sα) ← + − (8b)
k
dU = T dS − p d V + µi dn i , (9a)
or i=1
→ C+ (βSα)m + (βSα)n A− .
Y + (m + n)(βSα) ← (8c) and its Legendre transforms,
Various attempts have been made in the literature to where T (temperature), p (pressure), and µi (chemical po-
classify solvents according to their acid–base properties, tentials, i = 1, 2, . . . , k) are the intensive variables of the
which can be used as a general chemical measure of their system, which are conjugated to the extensive variables. H
P1: GJC Final Pages
Encyclopedia of Physical Science and Technology EN005D-208 June 15, 2001 20:28
is the enthalpy, A the Helmholtz energy, and G the Gibbs to rationalize thermodynamic properties for chemical en-
energy. Equation (9e) is the Gibbs-Duhem equation. gineering. Graphs of the Gibbs excess energy and ther-
For use in thermodynamic calculations the chem- modynamic functions derived from it such as the excess
ex ex
ical potential at constant temperature and pressure enthalpy, H = −T 2 (∂( G /T )/∂ T ) p , or excess en-
ex ex
µi = (∂G/∂n i ) p,T,n j =ni , is split into two parts, tropy, S = −(∂ G /∂ T ) p , as a function of mole frac-
tion x can be used for an easy determination of the partial
µi ( p, T ) = µ− −
i ( p, T ) + RT ln ai /ai ,
(10) molar quantities Z i = (∂ Z /∂n i ) p,T,n j=i [Eq. (38)] by the
which relates µi ( p, T ) to the reference potential µ− method of intercepts. The partial molar excess Gibbs en-
i for
which the activity is a−
ergies are the chemical potentials µiex ( p, T ) of the studied
i .
The commonly chosen reference states for solutions are solution.
the pure solvent µ− ∗
S ( p, T ) = µS and for the solute Y the
−
state of infinite dilution µY ( p, T ) = µ∞
xS →1 µY ( p, T ) = ν+ µ+ ( p, T ) + ν− µ− ( p, T ), (15)
lim f i = 1. (11b) where
xS →1
The choice of concentration scales other than the mole trolyte compounds are components. As a consequence,
fraction scale entails changes in the reference potentials there is no rigorous method for measuring the chemical
and activity coefficients: potential or other thermodynamic properties of a single
ionic species.
Molality scale:
Despite this drawback there is an interest in science
µY ( p, T ) = µ∞(m)
Y ( p, T ) + ν RT ln m ± γ± . (19) and technology in single-ion quantities for rationalizing
the discussion of electrolyte solution properties. Various
Molarity scale:
methods have been developed for their estimation based
µY ( p, T ) = µ∞(c)
Y ( p, T ) + ν RT ln c± γ± . (20) on extrathermodynamic assumptions, such as the follow-
ing: (1) The contributions of cation and anion are set equal
In Eqs. (19) and (20) µ∞(m)
Y and µ∞(c)
Y are the reference for a salt composed of ions of equal charge and approx-
potentials at infinite dilution, m ± and c± are the mean imately equal radii; (2) the results of measurements on a
molality (moles per kilogram of solvent) and the mean series of homologous electrolytes are extrapolated with
molarity (moles per cubic decimeter of solution), and γ± regard to ionic radii or ionic volumes to zero ion size or
and y± are the corresponding activity coefficients in the zero reciprocal radii; (3) the differences in conventional
molality and molarity scales. All quantities are defined ac- ionic properties are used for theoretically rationalized ex-
cording to Eqs. (18c). The transfer from one concentration trapolations; and (4) the properties of ions are compared
scale to another is executed with the help of conversion with those of isoelectronic neutral molecules of similar
formulas: chemical constitution and size.
∞(m)
µ∞
Y ( p, T ) = µY ( p, T ) − ν RT ln MS Besides the chemical potentials of single ions, electro-
∞(c) chemical potentials are defined in electrochemistry. They
= µY ( p, T ) − ν RT ln 103 MS dS , (21) are useful in electrode kinetics and related topics but are
MS of only limited interest in the discussion of the thermody-
x± = m± namic properties of electrolyte solutions. Electrochemical
1 + MS νm ± /Q
potentials µ̃i ( p, T ) are related to chemical potentials with
103 MS the help of the relationship
= c± , (22)
d + 103 (ν MS − MY )c± /Q
µ̃i ( p, T ) = µi ( p, T ) + F z i , (24)
and
f ± = (1 + MS νm ± /Q)γ± where F is the Faraday number, the Galvani poten-
tial of the liquid phase, and z i the valency of the charged
d + 10 (ν MS − MY )c± /Q
3
= y± , (23) species. They are equal to the corresponding chemical
dS potentials for electrically neutral species, molecules, and
where Q = (ν+ν+ ν−ν− )1/ν ; MS and MY are the molecular neutral electrolyte compounds (z i = 0). Their definition
weights of solvent and solute, respectively; and dS and results from formally
introducing the electrical charge of
d are the densities of the solvent and the solution, re- the phase, q = n i F z i , as a further extensive variable
spectively. These conversion formulas are valid if SI units into Eq. (9a), which yields an additional term, dq.
are used for all quantities except molar concentration c± ,
which traditionally is given as moles per cubic decimeter.
At a pressure of 1 bar and a temperature of 298.15 K D. Ionic Equilibria
the chemical potentials µ−
The equilibrium between species X and “free” cations to the related osmotic coeffcients, used as molal osmotic
→ X, yields the equilibrium
and anions, ν+ Cz+ + ν− Az− ← coefficient and the rational osmotic coefficient g:
condition
= −(ln aS )/νm MS (31a)
µx ( p, T ) − ν+ µ+ ( p, T ) − ν− µ− ( p, T ) = 0. (26) and
Combining Eqs. (25) and (26) and taking into account the g = (ln aS )/(ln xS ). (31b)
definitions of c± and µ ∞(c) , c± = αc± and µY∞(c) = µ∞(c)
Y , The Gibbs-Duhem equation [Eqs. (14)], when combined
leads to the chemical potential of the electrolyte Y in the with Eqs. (11a) and (19), yields the relationship
form 1
d(ln aS ) + νm d(ln mγ± ) = 0, (32)
µY ( p, T ) = µ∞(c)
Y ( p, T ) + ν RT ln αc± y± , (27) MS
which can be easily transformed with the help of Eq. (31a)
which is identical to Eq. (20) if we set to yield the basic equation for calculating solute activity
y± = αy± . (28) coefficients from solvent activities and vice versa:
d(m ) = dm + m d(ln γ± ). (33)
The importance of Eq. (28) is that both degree of disso-
ciation and activity coefficient of the “free” ions are parts Either or ln γ± is obtained by integration of Eq. (33) to
of the excess Gibbs energy. yield
m
The equilibrium condition [Eq. (26)] yields the 1
=1+ m d(ln γ± ) (34a)
relationship m 0
or
µ∞(c)
x ( p, T ) − ν+ µ∞(c) ∞(c)
+ ( p, T ) − ν− µ− ( p, T ) m
1−
c y ln γ± = ( − 1) − dm. (34b)
= −RT ln x x ν . (29) 0 m
(c± y± )
The peculiar behavior of electrolyte solutions that follow
The left-hand side of Eq. (29) is the molar Gibbs energy a limiting law of type 1 − ∝ Am 1/2 entailing divergency
G∞A of formation of the species X from the initially of the integrand in Eq. (34b) is taken into account by trans-
infinitely separated ions; the right-hand side shows the forming this equation into the relationship
equilibrium constant as the association (K A(c) ) or dissocia- m
1−
tion (K D(c) ) constant [the superscript (c) indicates the mo- ln γ± = ( − 1) − 2 dm 1/2 . (35)
0 m 1/2
lar concentration scale], which for symmetric electrolytes
(ν+ = ν− = 1) takes the form Equations (34) are valid for both completely and incom-
pletely dissociated electrolytes except that for incom-
−1 1 − α yx pletely dissociated electrolytes γ± is replaced by αγ± ,
K A(c) = K D(c) = 2 . (30)
α c y±2 the degree of dissociation being related to the association
constant in the molal scale (dilute solution, γx = 1):
The quantity yx is the acitivity coefficient of the species
X; the other symbols are explained in the foregoing text. 1−α 1
K A(m) = = K A(c) dS . (36)
In dilute solutions yx equals unity. α 2 m γ±2
Incomplete dissociation and partial association of sym-
metric electrolytes are given here as simple but important F. Partial and Apparent Molar Quantities
examples of ionic equilibria in solution. Other equilibria,
such as those resulting from the formation of solvent- All extensive thermodynamic properties Z (volume, en-
separated and contact ion pairs, triple-ion formation, mul- thalpy, entropy, Gibbs energy, heat capacity) when defined
tistep equilibria of unsymmetric electrolytes, or simulta- as functions of the set of the variables p, T , and n i ,
neous complex ion formation, in the investigated solution Z = Z ( p, T, n i ), i = 1, . . . , k, (37)
can be treated in a similar way. They require in Eq. (25)
the consideration of all identified species in the solution. yield the partial molar (or molal) quantities
∂Z
Zi = (38)
∂n i p,T,n j=i
E. Solvent Activity and Osmotic Coefficient
and
In dilute solutions (xS ∼ = 1) both the activity and the ac-
k
tivity coefficient of the solvent differ only very little from Z= ni Z i , (39)
unity with varying electrolyte concentration, in contrast i=1
P1: GJC Final Pages
Encyclopedia of Physical Science and Technology EN005D-208 June 15, 2001 20:28
∞
= Z Y∞ − Z Y∗ . and
sol Z Y (45b)
∞
∞ ∞ ∂ sol HY
The molar quantity sol Z Y
corresponds to the transfer sol C pY = . (46e)
of 1 mol of electrolyte compound from its pure state to ∂T
p,n i
P1: GJC Final Pages
Encyclopedia of Physical Science and Technology EN005D-208 June 15, 2001 20:28
i (47) (51)
Since the mole fraction xi is independent of temperature Then the Gibbs energy of transfer can be written at any
and pressure, the derivatives of Eq. (47) yield concentration:
∂(ln f i ) Hi − Hi−
(S) (W)
=− (48a) tr G Y = RT ln m γY + RT ln aY aY . (52)
∂T p,n j RT 2
At distribution equilibrium of Y between the two solvents,
and (S) (W)
tr G Y = 0, the distribution coefficient aY /aY yields the
∂(ln f i ) Vi − Vi−
transfer activity coefficient.
= . (48b)
∂p T,n j RT Separation into ionic medium activity coefficients is
executed with the help of the equation
Quantities of type Z i − Z−
and in the solution are equal and the transfer of the un- for the molecule and the ion. The reference electrolyte
charged ion does not affect the Gibbs energy of solvation. method proposes the partition of the measured electrolyte
For an electrolyte component Y, the overall energy is solvation into equal parts for the ions of the electrolyte;
found by summing over the ionic contributions: that is, the electrostatic and nonelectrostatic contributions
∞ ∞ ∞ of cations and anions are assumed to be equal. Also, un-
solv G Y = ν+ solv G + + ν− solv G − . (56) equal partition of solvation quantities between the Ph4 As+
The Born model is only a rough approximation. Im- cation and the BPh− 4 anion is used in the literature.
provements of the method take into account a local per-
mittivity ε∗ and effective ionic radii ai(eff) = ai + δi , where
δi is the distance between an ion and an adjacent solvent III. STATISTICAL APPROACHES
dipole. More elaborate models include in the calculation
the energy of formation of a spherical cavity in the pure A. Hamiltonian Models
solvent into which an ion and its solvation shells can be
Establishing the relationships between the macroscopic
transferred from the vacuum. Further interactions that can
properties of an electrolyte solution and the properties
be taken into account result from ion–quadrupole, ion-
of its particles, ions, and solvent molecules is the task
induced dipole, dipole–dipole, dispersion, and repulsion
of statistical mechanics. Efficient statistical mechanical
forces. For nonaqueous electrolyte solutions most of the
models are based on the Hamiltonian H , which expresses
molecular and structural data needed for this calculation
the total energy of the N -particle system as a function of
of the solvation energy are unknown, and ab initio calcu-
the momenta pi and coordinates ri of the particles:
lations have not so far been very successful. Actual infor-
mation on ion solvation in nonaqueous solutions is based H (r1 · · · r N , p1 · · · p N )
almost exclusively on semiempirical methods and/or the
N
pi2 N N
extrathermodynamic assumptions quoted in Section II.C. = + u i j (ri , r j ). (58)
Extrapolation methods are based on the representation i=1
2m i=1 j≥i
of the electrostatic ion–solvent interactions as a series ex-
pansion of type b j (ai )− j , where ai is one of the ionic radii. In Eq. (58) limitation is made to models omitting molec-
For example, in a series of electrolytes with a common ular orientation. The first term on the right-hand side of
cation and differing anions, the relationship the equation represents the kinetic energy, and the second
term the potential energy U N (r1 · · · r N ), approximated by
∞ ∞ ∞
solv Z Y = solv Z + + solv Z − a sum of pairwise direct interaction potentials u i j (ri , r j ).
n Hamiltonian models are classified according to their
∞
= solv Z + + b j (a− )− j level of approximation. The features of Schroedinger (S),
j=1 Born-Oppenheimer (BO), and McMillan-Mayer (MM)
∞ level Hamiltonian models are exemplified in Table I by
+ solv Z − (neutral) (57)
a solution of NaCl in H2 O. The majority of investigations
is used for determining single-cation solvation quanti- on electrolyte solutions are carried out at the MM level.
ties from the measured solv Z Y∞ values. The extrapola- BO-Level calculations are a precious tool for Monte Carlo
tion of a− toward infinite values, a− → ∞, yields the and molecular dynamics simulations as well as for integral
∞ equation approaches. However, their importance is widely
quantity solv Z + because the electrostatic contribution
vanishes for the anion. The nonelectrostatic contribution limited to structural investigations. They, as well as the
∞ S-level models, have not yet obtained importance in elec-
solv Z − (neutral) is usually estimated from neutral ref-
erence molecules that are similar in size and chemical trochemical engineering. S-Level quantum-mechanical
constitution. calculations mainly follow the Car-Parinello ab initio
Other extrathermodynamic assumption methods are molecular dynamics method.
based on reference systems, such as large ion couples
and their corresponding uncharged molecules (e.g., fer-
B. Distribution Functions and
rocinium/ferrocene) or systems of large cations and an-
Mean Force Potential
ions of similar structure [e.g., (Ph4 As)+ /(BPh4 )− or
(iAm3 BuN)+ /(BPh4 )− ]. The reference ion–molecule Statistical mechanics when based on Liouville’s theorem
method assumes that the ion–molecule pair shows equal yields a hierarchy of equations (BBGKY hierarchy) that
solvation in the investigated solvent; that is, the electro- makes use of the s-particle distribution function ρ (s) giving
static contribution of the ionic species is neglected, and the the probability of finding s particles, i = 1 . . . s, out of the
nonelectrostatic contribution is considered to be the same N particles in the system in the positions r1 · · · rs and
P1: GJC Final Pages
Encyclopedia of Physical Science and Technology EN005D-208 June 15, 2001 20:28
Schroedinger (S) Coordinates and momenta of all the nuclei Quantum Approximating the wave function
and electrons of all the H, O, Na, and Cl atoms
Born-Oppenheimer (BO) Coordinates and momenta of the water Classical Simulation: MD, MC; perturbation:
molecules and the Na and Cl ions PY, HNC
McMillan-Mayer (MM) Coordinates of the Na and Cl ions only Classical MD, MC, MSA, PY, HNC, ChM
a Abbreviations: HNC, hypernetted chain equation; MC, Monte Carlo simulation; MD, molecular dynamics simulation; MSA, mean spherical
with momenta p1 · · · ps , regardless of the position and The one-particle distribution function is independent of
momentum coordinates of the remaining particles: particle position and equals the number of particles per
unit of volume (particle density):
ρ (s) (r1 · · · rs , p1 · · · ps , t)
f (1) (ri ) = ρi = Ni /V. (64)
=σ (s)
ρ(r, p, t) drs+1 · · · dr N dps+1 · · · dp N . (59)
The expression for gi j (r ) is easily obtained from Eqs. (61)
In Eq. (59) the coefficient σ takes into account whether
(s) and (63). The equilibrium distribution around a spherically
identical or different particles are observed. ρ(r, p, t) is symmetric ion depends only on the distance r between
the density function of the N -particle system in a 6N - particles i and j, r = |ri j | = |r j − ri |:
dimensional space, the so-called space. The symbol r
stands for r1 · · · r N , and an analogous definition holds 2 exp(−U N /kT ) dr3 · · · dr N
gi j (r ) = V . (65)
for p. The density function ρ(r, p, t) tends toward a time- exp(−U N /kT ) dr1 · · · dr N
independent equilibrium distribution, Values of gi j (r ) greater than 1 indicate attraction, and less
exp[−H (r, p)/kT ] than 1 repulsion, between a pair of particles. The particles,
ρ (eq) (r, p) = , (60) when represented as hard spheres of mutual minimum
exp[−H (r, p)/kT ] dr dp
distance a = ai j , require the boundary condition
where H (r, p) is the Hamiltonian of Eq. (58), k the
Boltzmann constant, and T the temperature. lim gi j (r ) = 0, lim gi j (r ) = 1. (66)
r →a r →∞
It can be shown with the help of the BBGKY hierarchy
of equations that a knowledge of the ρ (1) and ρ (2) functions, A related function of fundamental interest is the potential
together with the pertinent principle of superimposition, of mean (or average) force, which is defined such that its
is sufficient to produce ρ (3) functions and so on. derivative is the force on the selected particle i averaged
Molecular distribution functions f (s) (r1 · · · rs ) at equi- over the position coordinates of all the remaining particles
librium, indicating the probability of finding s particles in the system except those of particle j:
in positions r1 , r2 , . . . , rs , are obtained by integrating the
∂ Wi j
functions ρ (s) themselves with respect to the momentum = K i j (r )
coordinates p1 , p2 , . . . , ps : ∂r
(∂U N /∂r ) exp(−U N /kT ) dr3 · · · dr N
σ (s) UN = . (67)
f (s) (r1 · · · rs ) = exp − drs+1 · · · dr N exp(−U N /kT ) dr3 · · · dr N
Q (N ) kT
(61) Comparison of Eqs. (65) and (67) yields the link between
and functions gi j (r ) and Wi j (r ),
UN Wi j (r ) = −kT ln gi j (r ), (68)
Q (N ) = exp − dr. (62)
kT
defining the potential of mean force, which is different
The distribution function of interest is the pair correlation from the interaction potential u i j (r ) [Eq. (58)] as a conse-
function gi j (ri , r j ), which is defined by the relationship quence of the averaging procedure by Eq. (67).
The pair correlation functions gi j (r ) can be experi-
f (2) (ri , r j ) mentally determined. They are obtained by using scat-
gi j (ri , r j ) = . (63)
f (1) (ri ) f (1) (r j ) tering techniques such as X-ray or neutron scattering.
P1: GJC Final Pages
Encyclopedia of Physical Science and Technology EN005D-208 June 15, 2001 20:28
Wide-angle neutron scattering yields them at the BO-level, the MM system are then (ρ, T, P0 ) and the solution is un-
whereas small-angle neutron scattering usually cannot de- der pressure P = + P0 . The osmotic coefficient in the
tect the structure of the solvent and therefore may be re- MM system is found to be
garded as the experimental analogue of the theoretical MM
= /ρkT. (71)
approach. The vanishing scattering angle in a so-called
contrast experiment is even directly related to the osmotic Thermodynamic functions, however, are defined with the
coefficient. help of the independent variables (m, T, P1 ), where P1
There are two usual routes for determining the thermo- is generally the standard pressure of 1 atm. Hence it is
dynamic properties of the N -particle system in equilib- necessary to convert experimental data to the MM sys-
rium from its pair correlation function gi j (r ): tem when comparison is needed for model calculations at
this level. For dilute solutions the resulting corrections are
1. The pressure or virial equation: negligible.
P ∂u i j (r ) At the MM level any plausible model for the ion–ion
=1− ρi ρ j 4πr 3 gi j (r ) dr. (69) interactions can be described by the direct potential u i j ,
ρkT i j
∂r
which can be interpreted as the potential of mean force at
2. The energy equation: infinite dilution:
E 3 1 u i j (r ) = u i∗j (r ) + z i z j e02 4π ε0 εr. (72)
= ρkT + ρi ρ j 4πr 2
V 2 2 i j In Eq. (72) the second term on the right-hand side is a
coulombic potential, ε0 is the permittivity of vacuum, z i e0
∂u i j (r )
× u i j (r ) − gi j (r ) dr, (70) and z j e0 are the charges on ions i and j, and ε is the rela-
∂(ln T ) tive permittivity of pure solvent. The first term, u i∗j (r ), is
where ρ = ρi , P is the pressure, E the internal energy, the short-range noncoulombic part of the direct potential,
V the volume, and the sum is over all particles of interest including repulsion, dispersion, or induction forces.
in the system. In analogy to Eq. (72) the mean force potential, related
to the pair correlation function at finite concentration by
Another, less frequently used relationship, the so-called Eq. (68), can be split into two parts,
compressibility equation, is an alternative to the pressure
Wi j (r ) = Wi∗j (r ) + Wielj (r ), (73)
equation, but the virial equation is more convenient to use.
where Wielj (r ) and Wi∗j (r ) are again the contributions due to
coulombic and noncoulombic interactions, respectively.
C. McMillan-Mayer Solution Theory
The evaluation of the pair correlation functions of both
the solvent molecules and the solutes is feasible but trou- D. The Chemical Model
blesome. For electrolyte solutions, however, averaging 1. Fundamentals
over the solvent effects yields reliable approximations, as
shown by McMillan and Mayer, who considered the solu- The chemical model is an MM-level Hamiltonian model
tion in osmotic equilibrium with the pure solvent (Fig. 2). taking into account both long- and short-range forces. It
They stressed that solutes can be treated as an imperfect has been used for investigating and calculating the prop-
gas, provided that one uses the potential of mean force erties of dilute electrolyte solutions of numerous salts in
at infinite dilution. The calculation yields the osmotic a great variety of solvents and has proved to be successful
pressure = P − P0 (see Fig. 2) from the virial equa- insomuch as all of the properties of an electrolyte solu-
tion [Eq. (69)] in terms of the forces among the particles tion investigated can be understood from the same set of
for the solution state (ρ, T ). The independent variables of interaction parameters.
The chemical model subdivides the space around an ion
into three main regions (Fig. 3):
2. Ion-Pair Concept
Solution chemists usually think of short-range interactions
in terms of ion-pair formation. Models of the electrolyte
solution allow the introduction of the ion-pair concept if
a critical distance around the central ion can be defined
within which the paired states of oppositely charged ions
are considered ion pairs.
The ion-pair concept for symmetric electrolytes can be
FIGURE 3 Chemical model of electrolyte solutions. O, Observer:
i , ion Xi ; j , ion X j in an arbitrary position r21 with regard to the ion easily introduced into the chemical model, assuming that
Xi ; special positions (contact, separation by one or two oriented the cutoff distance parameter R of the short-range forces
solvent molecules) are sketched with dashed lines; r, a, R, dis- equals the upper limit of ion association. The relationship
tance parameters; Wi j , mean force potentials; v j i and vi j relative (see Table II)
velocities of ions Xi and X j .
(I) 2κqκ T 2κqκ T ∗
W+− (r ) = − + + W+− ;
r 1 + κR
3. r ≥ R, the region of long-range interactions.
e02 |z + z − |
Region 1 is controlled by a hard-sphere potential. The po- q= (76)
8π ε0 εκ T
tentials of mean force of regions 2 and 3, Wi(I) (II)
j and Wi j ,
yields the integral (a . . . R) of Eq. (75) in the form
are split into two parts according to Eq. (73). The elec-
trostatic parts Wielj are obtained as solutions of Poisson- R (I)
W i j 2κq
Boltzmann differential equations and appropriate bound- r exp −
2
dr = exp −
a kT 1 + κR
ary conditions; the nonelectrostatic parts can be chosen
as step potentials, that is, Wi∗(I) ∗(II)
j = const, Wi j = 0. The R
W ∗
2q +−
total mean force potentials are compiled in Table II. The × r 2 exp − dr. (77)
parameter κ is the reciprocal Debye length as defined by a r κT
the relationship At low electrolyte concentrations the activity coefficient of
e02 2 ion pairs may be set equal to unity and Eq. (77) is linked to
κ2 = ρi z i . (74)
ε0 εkT the ion-pair association constant K A and the mean activity
coefficient y± of the free ions by the relationships
The link between the pair correlation function gi j and the
experimentally determined property of the solution—with 1−α 1
= K A = 4000π NA
the exception of scattering experiments, which yield the α 2 c y±2
R ∗
2q W+−
TABLE II Mean Force Potential for the Chemical × 2
r exp − dr (78a)
Model a r kT
Region Mean force potential and
r ≤a ∞ κq
y± = exp − . (78b)
e02 z i z j e02 z i z jκ 1 + κR
a≤r ≤ R − + Wi∗j
4π ε0 εr 4π ε0 ε 1 + κ R
The distance parameter R can either be determined by
e02 z i z j 1 exp[κ(R − r )] experiment or set by chemical evidence; α is the degree
r≥R
4π ε0 ε r 1 + κR
of dissociation.
P1: GJC Final Pages
Encyclopedia of Physical Science and Technology EN005D-208 June 15, 2001 20:28
The chemical association concept is based on the equi- association constant and its appropriate activity coefficient
∗
librium between “free” ions and ion pairs in the solution are obtained by setting R = q and W+− = 0 in Eq. (78a).
(see Section II.D).
Equation (78a) can also be derived as an equilibrium
3. Debye-Hückel Theory and Extensions
constant in the framework of the statistical mechanical
equilibrium theory based on the molecular partition func- A multitude of MM-level models can be found in the lit-
tions Q + , Q − , and Q P of cations, anions, and ion pairs, erature differing from one another by the underlying as-
QP − E0 sumptions of short-range interactions. The oldest one is
K A = NA V exp (79a) the Debye-Hückel theory, which does not recognize short-
Q+ Q− κT range interactions and association. The Debye-Hückel the-
and ory yields the limiting laws of thermodynamic and trans-
(2π m i kT )3/2 port properties based on the potential of mean force,
Q i = Q itrans Q iint ; Q itrans = V, (79b)
h3
e02 z i z j exp(−κr )
when the cations and anions are represented as charged Wi j (r ) = ; 0 < r < ∞, (80)
hard spheres of masses m + and m − without internal 4π ε0 ε r
degrees-of-freedom Q int + = Q − = 1, and the internal
int which can be deduced from the data in Table II by assum-
molecular partition function of the ion pair is made up by ing a = R = 0 and Wi∗j = 0.
integration over all configurations a ≤ r ≤ R of the paired By adding pairwise short-range interactions β+− be-
states of a cation and an anion with variable distance pa- tween cation and anion, Guggenheim obtained an activity
∗
rameter r . E 0 = 2q/r − W+− /kT is the corresponding coefficient which for 1,1 electrolytes is given by the rela-
distance-depending activation energy per molecule of the tionship (dS : solvent density)
reaction of ion-pair formation. The internal partition func- √
Aγ m
tion thereby is based on an internal degree of translation ln γ± = − √ + 2β+− m;
of a particle of reduced mass m̃ = m + m − /(m + + m − ) rep- 1+ m
3/2
resenting the ion pair P(r ). e02
Figure 4 shows the family tree of some theories which Aγ = (2π NA dS ) 1/2
. (81a,b)
4π ε0 εkT
assume association. For example, the often-used Bjerrum
FIGURE 4 Family tree of association constants of the chemical model. For explanations see the text.
P1: GJC Final Pages
Encyclopedia of Physical Science and Technology EN005D-208 June 15, 2001 20:28
It may be used in aqueous solutions up to molalities of ticle. This feature is reflected by the Ornstein-Zernicke
0.1 mol/kg. equation:
Among a variety of extensions of the Guggenheim equa-
h i j (r ) = gi j (r ) − 1
tion, those of Pitzer are the most successful ones. For a
single electrolyte in solution they read ∞
= ci j (r ) + ρs cis (r )h s j (r ) drs . (84)
ln γ± = f (γ ) + m B (γ ) + m 2 C (γ ) , (82a) s 0
TABLE III Thermodynamic Data of Ion-Pair Formation if the solute is a nonvolatile compound such as a salt. The
in Aqueous Solutions symbols are explained in Section II.A. The integration of
KA ∆H ∞
A Rexp Rcalc Eq. (103) gives the solvent activity aS as a function of the
Electrolyte (dm3 mol−1 ) (J mol−1 ) (nm) (nm) vapor pressure p of the solution at temperature T :
MgSOa4 161 5780 0.93 0.88
p VS∗(l) − BS
CaSOa4 192 5570 0.95 0.92 ln aS = ln ∗ + ( p ∗ − p)
p RT
CdSOa4 239 8390 0.96 0.91
CdSOb4 245 7900 — 0.91 ∼ p
= ln ∗ . (104)
a From heats of dilution.
p
b From temperature dependence of conductance.
In Eq. (104) p ∗ is the vapor pressure of the pure solvent.
The nonideality of the gas phase is taken into account
A comprehensive investigation of solution properties with the help of the second virial coefficient of the solvent
in various solvents and with a multitude of electrolytes vapor; VS∗(l) is the molar volume of the pure solvent in
has yielded an important result. Distance parameters can the liquid phase. Neglect of the nonideality and of the
be obtained for all property equations from chemical ev- pressure dependence of the activity yields the well-known
idence: R = a + ns, where a is the center-to-center dis- approximation aS = p/ p ∗ .
tance of closest approach of cation and anion in the so- Vapor pressure measurements can be carried out by
lution, s the dimension of an oriented solvent molecule, static, dynamic, or isopiestic methods. Static methods are
and n = 0, 1, or 2. An example is given in Table III, where based on direct measurements with the help of manome-
Rexp is the distance parameter from heat of dilution mea- ters. They require thorough outgassing of the solution,
surements and Rcalc the quantity calculated from a config- prevention of vapor condensation in any part of the mea-
uration M 2+ (H2 O)2 SO2−
4 .
suring device, and highly precise temperature constancy.
Differential measurements on solution and pure solvent
B. Electrolyte Solution Data Collections are advantageous.
and Databases The dynamic boiling point method (ebulliometry) stud-
ies the solution in a steady-state equilibrium of boiling and
Critically revised data of various electrolyte solution
condensing under a constant pressure of an inert gas. The
properties help scientists and engineers to overcome
difference in boiling points of the solution and the pure
the time-consuming procedure of searching for reliable
solvent, T − T ∗ , at pressure p,
data for technical applications. Special knowledge-based
databases undertake the interpolation, estimation, or sim- vap H
ln aS = (T − T ∗ ), (105)
ulation of data by theory-founded procedures differing RT T ∗
fundamentally from those for nonelectrolytes. The rea-
where vap H is the enthalpy of vaporization, is con-
son for the difference is the essentially different reference
verted to a change in pressure with the help of the
states of electrolyte solutions which are the infinitely di-
Clausius-Clapeyron equation. The boiling point method
lute solutions with at least three interacting components,
at constant pressure and the static vapor pressure method
namely solvent molecules, cations, and anions. In con-
at constant temperature are equivalent methods.
trast, databases for nonelectrolytes always use the pure
The dynamic gas saturation method is based on the de-
substances as the references.
termination by weight of the amount of solvent transported
by a current of an initially dry inert gas after passage
C. Vapor Pressure Measurements
through the solution where the gas is saturated. The con-
Vapor pressure measurements yield the activity and/or tent of solvent per unit volume of gas is proportional to
osmotic coefficients of electrolyte solutions. The equi- the partial pressure of the solvent.
(g)
librium condition µ(l)
S ( p, T ) = µS ( p, T ) for the solvent The isopiestic method is a highly accurate but time-
in the liquid (l) phase and gaseous (g) phase yields the consuming relative method. In a constant-temperature en-
relationship closure the solvent distills isothermally from a reference
∗(g) ∗(l)
solution of known activity to a solution of unknown activ-
1 ∂ µS − µS ity or vice versa, which entails a change in concentration.
d ln aS = dp
RT ∂p At equilibrium the solvent activities of both samples are
T
equal and (R denotes reference solution)
∗(g)
VS − VS∗(l)
= dp (103) = (νR m R /νm) R. (106)
RT
P1: GJC Final Pages
Encyclopedia of Physical Science and Technology EN005D-208 June 15, 2001 20:28
membrane separating solvent and solution is maintained at m = 0.001 mol kg−1 . Extrapolation with the help of in-
by the osmotic pressure . The osmotic equilibrium appropriate methods would yield seriously erroneous val-
condition, where ues of ∞ V . This example illustrates the necessity of using
theoretically well founded extrapolation methods.
p+
µS ( p + , T ) = µS ( p, T ) + VS d p Thermodynamics yields the relationship
p
∞ ν RT m
∂(ln γ± )
= µS ( p, T ) + VS , (107) V = + dm, (112)
V
m 0 ∂p T
yields the desired relationship
from which the limiting law of apparent molar volumes
ln aS = −VS /RT. (108) ∞
V = V + SV c1/2 , (113a)
Equation (108) is an approximation assuming a pressure- 1/2
independent molar volume of the solvent. In practice, this NA2 e03 |z i z j | 1000 νi z i2
method is limited to large solute particles such as polymers SV =
4π ε03 ε 3 RT
or biocolloids since it is difficult to find an impermeable
membrane for small ions. ∂(ln ε) β
× − , (113b)
∂p T 3
E. Molar Volumes is obtained by introducing the activity coefficient of the
Molar volumes of solute and solvent, VY and VS , are de- infinitely dilute solution (κ R 1) of statistical thermo-
termined from apparent molar volumes V (Section II.F) dynamics. Equations (113) are valid for a completely dis-
by density measurements. In terms of the measured den- sociated electrolyte.
sities of the solution and the pure solvent, d and d ∗ , the The treatment of partially associated electrolytes subdi-
apparent molar volume is given by (molarity and molality vides the space around every ion into a region populated by
scale, respectively) ion pairs (a ≤ r ≤ R) and a region of free ions (r ≤ R) (see
Section III.D); the volume V of the electrolyte solution is
d∗ − d MY d∗ − d MY then given by the relationship
V = + = + . (109)
1000cd ∗ d∗ mdd ∗ d
V = n S VS∗ + n Y [α V + (1 − α)VA ], (114)
Then the molar volumes VY and VS are obtained with the
help of the relationship (molality scale; see Section II.F) where VA is the concentration-independent molar volume
MS m 3/2
∂ V of the ion pair. Equation (114) can be transformed to yield
VS = VS∗ − (110a) the density equation for associated electrolytes:
2 ∂m 1/2 p,T
and d = d ∗ + MY − ∞ ∗ ∗
V d αc − SV d (αc)
3/2
m 1/2 ∂ V + MY − VA d ∗ K A y±2 (αc)2 . (115)
VY = V + . (110b)
2 ∂m 1/2 p,T
Equation (115) is a property equation of the type given
The main problem in calculating molar volumes from den- by Eq. (98) and is consequently combined with Eqs. (99)
sity measurements is the difficulty of measuring reliable and (100) for data analysis. Even measurements of the
densities at low electrolyte molalities. An estimation of highest obtainable precision (σd < 10−7 g cm−3 ) are not
the error of apparent molar volumes sufficiently precise for the simultaneous evaluation of
P1: GJC Final Pages
Encyclopedia of Physical Science and Technology EN005D-208 June 15, 2001 20:28
rel
H =α rel
H (FI) + (1 − α) rel
H (IP),
G. Electromotive Force
(120)
rel
H (IP) = HA∞ . For a galvanic cell,
Heats of dilution dil HY∞ are determined by the measure- M(Hg) | MX(c in S) | AgX(s) | Ag(s), (122)
ments of intermediate heats of dilution (see Fig. 1), which involves the chemical reaction
rel
H (m 2 ) − rel
H (m 1 ) = α2 rel
H (FI, m 2 ) − α1 rel
H (FI, m 1 ) → MX(solv) + Ag(s),
M(Hg) + AgX(s) ← (123)
+ (α1 − α2 ) HA∞ , (121) emf is given by the relationship
an appropriate extrapolation to infinite dilution. Heats of
ν
k
(β) (β)
dilution yield precise information on association constants −n FE = ωi µi . (124)
K A , heats of ion-pair formation HA∞ , and cutoff dis- β=1 i=1
tances of the chemical model R. Entropies of ion-pair
formation SA∞ can be calculated from K A and HA∞ by In Eqs. (122) and (123), M(Hg) is an alkali metal amalgam
the usual thermodynamic relationships. Table III shows electrode, MX the solvated halide of the alkali metal M at
the agreement of HA∞ values from heat of dilution concentration c in a solvent S, and AgX(s)/Ag(s) a silver
and temperature-dependent conductance measurements; halide–silver electrode. Equation (124) is the general ex-
Fig. 8 shows the dependence of heat of dilution on the pression for the electromotive force E of a galvanic cell
concentration of a partially associated electrolyte. without liquid junction in which an arbitrary cell reaction
Heats of solution, sol HY = HY − HY∗ , are obtained by ω1 Y1 + ω2 Y2 + · · · ←→ ωi Yi + · · ·, takes place between
direct measurements and can be combined with the ap- k components in ν phases. In Eq. (124) n is the number
propriate heats of dilution to yield the quantities sol HY∞ of moles of electrons transported during this process from
(Fig. 1), which in turn yield the enthalpies of solvation the anode to the cathode through the outer circuit, F the
(β)
∞
solv HY when combined with lattice energies lat HY .
Faraday number, and µi the chemical potential of com-
Single-ion enthalpies of solvation are obtained with the ponent Yi in phase β. Cells with liquid junctions require
help of extrathermodynamic assumptions. the electromotive force E in Eq. (124) to be replaced by
the quantity E − E D , where E D is the diffusion potential
due to the liquid junction. The standard potential E ◦ for the
cell investigated by Eq. (122) is given by the relationship
FE ◦ = − µ∗Ag + µ∞ ∗
MX − µAgX − µM(Hg) .
sat
(125)
and yields the standard potential E ◦ when the activity for biopolymers. The effectiveness of anions for salting-
aM(Hg) is known. Hence the Gibbs energy of transfer of in and salting-out follows the so-called Hofmeister series.
the electrolyte MX from the pure to the infinitely dilute Despite numerous efforts a satisfactory explanation of this
state in the solvent S is available: series by proper models is still unavailable.
µ∞ ∗
MX −µMX =
∞
sol G MX = −FE ◦ +µ∗AgX −µ∗MX . (128)
I. Phase Separation by Electrolytes
Measurements of emf as a function of temperature yield Specific electrolyte solutions have been discovered and in-
∞ tensively studied which can produce a phase with a higher,
sol HMX in agreement with calorimetric data.
Measurements of emf are also widely used for cal- and a phase with a lower salt concentration. The reasons
culating activity coefficients at concentrations higher for phase separation may be different. For aqueous solu-
than the limit of validity of the model calculations. For tions of large organic ions (e.g., tetra-n-pentylammonium
this purpose Eq. (126) is used for the determination of bromide), the phenomenon is ascribed to hydrophobic in-
y± , y± = αy± , after previous determination of E ◦ from teraction; for large ions in low permittivity solvents (e.g.,
dilute-solution data. tetra-n-pentylammonium picrate in 1-chloropentane), it is
due to the long-range electrostatic interactions (coulombic
phase separation).
H. Solubility
Making solubility measurements is another method for J. Nernst’s Distribution Law
the determination of Gibbs energies. A precipitated elec-
trolyte when studied in equilibrium with its saturated An important technical application of liquid–liquid equi-
solution libria uses Nernst’s law of phase distribution of a solute
Y between two nonmiscible solvents to make up the two
µ∗Y (S) = µY (sat sol) = µ∞
Y + ν RT ln a±
sat
(129) phases in contact, α and β. The equilibrium condition,
(β)
µ(α)
Y ( p, T ) = µY ( p, T ) yields the relationship
sat ∞(β)
yields the mean activity at saturation a± and hence of- aY(α) KY
∞ = = N, (132)
fers the possibility of calculating sol G Y with the help of
Eq. (129).
(β)
aY K Y∞(α)
Solubility measurements may advantageously replace where aY means the activity of the solute and N is Nernst’s
emf methods for the determination of sol G ∞ Y values and
distribution coefficient which is the ratio of the Henry
∞(β)
related quantities of sparingly soluble electrolytes, such constants, K Y∞(α) and K Y , for the solution of Y in the
as the Gibbs energy of transfer from a solvent S to a two separate solvents. Distribution coefficients of water–
solvent S: octanol are widely used in toxicological and ecological
studies.
∞
tr G Y (S → S) = µ∞ ∞
Y (S) − µY (S )
sat
a± (S ) SEE ALSO THE FOLLOWING ARTICLES
= ν RT ln sat . (130)
a± (S)
CHEMICAL THERMODYNAMICS • ELECTROCHEMISTRY •
Frequent use of the solubility method is made for inves-
ELECTROLYTE SOLUTIONS, TRANSPORT PROPERTIES •
tigating activity coefficients of ternary systems where the
LIQUIDS, STRUCTURE AND DYNAMIC • PERMITTIVITY OF
variation of the activity coefficient γY of the electrolyte
LIQUIDS • SOLVENT EXTRACTION
is studied as a function of the concentration of a second
solute Z. Thermodynamics provides the relationship
BIBLIOGRAPHY
∂(ln γY ) ∂(ln γZ )
= . (131)
∂m Z mY ∂m Y mZ Barthel, J., Krienke, H., and Kunz, W. (1998). “Physical Chem-
istry of Electrolyte Solutions—Modern Aspects,” Steinkopf,
If the solute Z is a nonelectrolyte, the sign of the par- Darmstadt/Springer-Verlag, New York.
tial derivative (∂(ln γZ )/∂m Y )m determines whether Z is Blum, L. (1980). Primitive electrolytes in the mean spherical approxi-
mation. In “Theoretical Chemistry: Advances and Perspectives,” pp.
salted-in (−) or salted-out (+) by the electrolyte. 1–66, Academic Press, New York.
Salting-in and salting-out of electrolytes is an important Bockris, J. O., Reddy, A. K. N. (1998). “Modern Electrochemistry”
effect with regard to charged polymers and particularly (Vol. 1, Ionics), 2nd ed., Plenum, New York.
P1: GJC Final Pages
Encyclopedia of Physical Science and Technology EN005D-208 June 15, 2001 20:28
DECHEMA Datenbank DETHERM (1989). Elektrolytdatenbank Barthel, J. et al., Electrolyte Data Collection (12 partial volumes),
ELDAR, DECHEMA, Frankfurt a.M. DECHEMA, Frankfurt, to be continued.
Friedman, H. L., and Dale, W. D. T. (1977). Electrolyte solutions in equi- Pitzer, K. S. (1991). “Activity Coefficients in Electrolyte Solutions,” 2nd
librium. In “Statistical Mechanics,” Part A, Equilibrium Techniques ed., CRC Press, Boca Raton, FL.
(B. J. Berne, ed.), pp. 85–135, Plenum, New York. Robinson, R. A., and Stokes, R. H. (1970). “Electrolyte Solutions,” 2nd
Kreysa, G., ed. (1992 ff). DECHEMA Chemistry Data Series, Vol. XII, rev. ed., Butterworth, London.
P1: LDK/GLE P2: FJU/LPB QC: FYD Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN005E-209 June 15, 2001 20:29
Electrolyte Solutions,
Transport Properties
J. Barthel P. Turq
W. Kunz O. Bernard
University of Regensburg University Pierre et Marie Curie
243
P1: LDK/GLE P2: FJU/LPB QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-209 June 15, 2001 20:29
orthogonal coordinate system of the 3N momentum conductance according to Ohm’s law. Temperature gra-
and 3N position coordinates of the N particles of the dients yield thermal conductance according to Fourier’s
system. law. Inhomogeneities of drift velocity entail Newton’s
Relaxation effect Hindrance of the undisturbed move- law. Gradients of electric potential, temperature, and drift
ment of the ions in the solution, produced in an exter- velocity are observed in both pure phases and mixtures.
nal electric field by the nonspherical symmetry of the Coupled transport phenomena obey Onsager’s laws of ir-
charge distributions around the ions. reversible processes, yielding the appropriate cross co-
Self-diffusion Transport of matter caused by Brownian efficients. However, only diffusion, viscosity, migration,
motion. electric conductance, and related topics will be considered
s-Particle distribution function Probability of finding in this article; little or no information on the other trans-
s particles out of the total number of particles of a port processes in electrolyte solutions can be found in the
system at time t with given momenta at given positions. literature.
Steady state State of a system in which, during the obser- The formulation of transport processes requires the
vation time, measurable flows of matter and/or energy use of time and space variables t and ri , in contrast
may take place without changing the intensive proper- with equilibrium thermodynamics which does not recog-
ties of the system. nize time as a variable. The object of investigation is a
phase with local inhomogeneities, composed of particles
X 1 , X 2 , . . . , X i , . . . , molecules or ions, at particle num-
AN ELECTROLYTE SOLUTION that is not in ther-
ber densities ρi (ri ; t) and particle velocities vi (ri ; t), the
modynamic equilibrium is subject to generalized forces
generalized forces being also functions of time and space.
that control the irreversible processes taking place in such
If particle X i is an ion of valency z i , the charge density at
systems. Processes of this type are chemical reactions,
point ri would be e0 NA z i ci ; e0 is the proton charge, NA
including relaxation processes, and transport properties.
Avogadro’s number (e0 NA = F, Faraday constant), and ci
Homogeneous chemical reactions do not presuppose spa-
the concentration of particle X i (ci = ρi /NA ).
tial inhomogeneities, in contrast with transport phenom-
The amount of particles X i passing per interval of time
ena requiring a gradient of concentration, electric poten-
dt in a perpendicular direction through an elementary sur-
tial, temperature, or drift velocity. Transport phenomena in
face dS of a volume V is given by the quantity ci vi dSdt
electrolyte solutions can involve both types of irreversible
if the solution does not move with regard to the chosen
processes, taking place simultaneously, since ion-pair for-
reference system, the “laboratory frame.” If the solution
mation in electrolyte solutions and formation of higher
moves at velocity s v with regard to its reference system,
aggregates are related to chemical reactions.
this quantity is charged to ci (vi − s v) dSdt.
The models of electrolyte solutions and the fundamen-
In solutions the reference velocity s v is given by the
tals of thermodynamics and statistical thermodynamics
relationship
are given in Electrolyte Solutions, Thermodynamics.
Just as in the case of equilibrium thermodynamics, ther- s
v= gi vi ; gi = 1, (1)
modynamics of irreversible processes yields phenomeno- i i
logical equations which in principle are independent of the where gi is the weight factor for particle X i . Various refer-
molecular nature of the system. In contrast, statistical me- ence systems are used in the literature. The most important
chanics derives transport properties from the microscopic examples are the Hittorf reference system, in which the
picture of the solution. Considerable progress has been velocity of solvent particles v0 is chosen as the reference,
made in this latter type of approach during recent years. and the Fick reference system, in which the mean volume
velocity of the system is chosen.
I. THERMODYNAMICS OF IRREVERSIBLE Investigations on transport properties commonly are
PROCESSES based on diffusion current densities Ji (also, flux or
flow):
A. Definitions
Ji = ci vi − s v . (2)
Gradients of properties produce flows that at low concen-
trations are proportional to these gradients. Concentration Then the total diffusion current φi of particles X i through
inhomogeneities entail chemical diffusion as the trans- an observed surface S enclosing volume V is given by the
port process. Concentration presupposes a system of at relationship
least two components for its definition. The law linking ∂ci
diffusion and flow is Fick’s law. Electric potential gradi- − dV = φi = Ji dS. (3)
∂t
ents produce ion migration and are manifest by electric (V ) (S)
P1: LDK/GLE P2: FJU/LPB QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-209 June 15, 2001 20:29
Equation (3) presupposes that there is no source of parti- K c , as usual in chemical kinetics. The determination of
cles X i within volume V . Transformation of the surface quantities σi is straightforward:
integral to a volume integral by the help of Gauss’s theo-
σ+ = σ− = −k1 c+ c− + k2 cp (8a)
rem yields the relationship
and
∂ci
−
= div Ji . (4) σp = +k1 c+ c− − k2 cp . (8b)
∂t
In the theory of electrolyte conductance, reference is made The quantities c+ , c− , and cp are the molarities of the
to the density of charges e0 NA ci z i (C m−3 ) rather than to “free” cations, anions, and ion pairs in the solution which
the density of particles. Then Eqs. (2) and (3) are trans- are related to electrolyte concentration cE by the degree of
formed to dissociation α of the ion pair:
ji = e0 NA z i Ji = e0 NA z i ci vi − sv , (5a) c+ = c− = αcE , (9a)
cp = (1 − α)cE , (9b)
Ii = ji dS, (5b)
and
j= ji = e0 NA z i ci vi − sv , (5c) (1 − α)
Kc = . (9c)
i i
α 2 cE
and This simple case of a nonconserved system is chosen for
I = j dS. (5d) exemplifying the fundamental concept. Electrolyte solu-
tions exhibiting more complex equilibria can be treated in
In Eqs. (5), ji is the partial electric current density of ions the same way, the quantities σi always being functions of
X i , j the total current density, and Ii and I the partial and rate constants and concentrations of free particles in the
total electric currents (A). Electroneutrality within each solution.
elementary volume, With the sources taken into account, Eqs. (3) and (4)
must be transformed to
ci z i = 0, (6)
∂ci
i − − σi d V = Ji dS (10)
∂t
entails that the electric current density j is independent of (V ) (S)
the reference system, in contrast with the partial current and
densities ji .
∂ci
+ div Ji − σi = 0. (11)
∂t
B. Electrolyte Solutions in Irreversible Equations (10) and (11) apply to a system removed from
Thermodynamic Processes equilibrium as well as to a system in equilibrium under-
The application of Eq. (3) or Eq. (4) to electrolyte so- going spontaneous fluctuations. For this purpose, super-
lutions would require a solution made up of completely position is assumed of the average values ci , Ji , and
dissociated electrolytes. This restriction can be overcome σi and the fluctuations δci , δJi , and δσi :
by the introduction of sources that do not produce spatial ci (ri ; t) = ci + δci (ri ; t), (12a)
inhomogeneities in the solution. A simple but important
example of such sources is the chemical reactions of ion- Ji (ri ; t) = Ji + δJi (ri ; t), (12b)
pair formation and decomposition, which produce particle and
densities per unit of time σi at rate constants k1 and k2 :
σi (ri ; t) = σi + δσi (ri ; t). (12c)
k1
C z+ −→ [Cz+ Az− ]0
+ A ←− z−
(7a) Introduction of expressions (12) into Eq. (11) permits the
k2
discussion of
and
the relaxation to equilibrium if macroscopic hetero-
k1
= Kc. (7b) geneities are imposed on the initially equilibrated system;
k2 systems in equilibrium undergoing fluctuations:
In Eqs. (7), Cz+ , Az− , and [Cz+ Az− ]0 are the z + -valent Ji = 0, σi = 0; and
cation, the z − -valent anion, and the electrically neutral ion systems in steady states: ci (ri ; t) is constant in time
pair (z + = |z − |). The ratio of the rate constants k1 and k2 but not in space, ci (ri ; t) = ci (ri ); Ji and σi are also
yields the concentration-dependent equilibrium constant constant in time but may have nonzero values.
P1: LDK/GLE P2: FJU/LPB QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-209 June 15, 2001 20:29
and RuNO(NO− )
3 2
+
in 2-M HNO 3 at various temperatures. Low Equation (30a) shows that the relevant peak undergoes a
temperature: two peaks, separation of the two complexes. High simultaneous migration and diffusion process but is free
temperature: one peak, no separation of the two complexes.
from kinetic attenuation. Equation (30b) shows a kineti-
x Axis: broadness of the Ru zone after migration (relative dis-
tance scale); y axis: Ru distribution after migration (relative con- cally fast-attenuated mode which yields a negligible con-
centration scale). [Orcil, L., Fatouros, N., Truq, P., Chemla, M., tribution to the electrophoretic pattern.
and Barthel, J. (1983). Zeitschrift Phys. Chem. N. F., Oldenberg
Verlag, München, 138, 129–136.]
The first term on the right-hand side of Eq. (32) represents ions i and t, and χ ti is the hydrodynamic coupling tensor
the kinetic energy; the second term is the potential en- of ions t and i. The forces F(N )
k (k = i or t) are given by
ergy U N +N0 (r1 , . . . , r N +N0 ; t) approximated by a sum of the relationship
pairwise direct interaction potentials Ui j (ri , r j ; t); both
momenta pi and coordinates ri are functions of time. F(N
k
)
= e0 z k E − kT gradk (ln D N )
The Liouville equation [cf. Eq. (4)],
N
−1
+ ωk n k χ ik gik Fk drk . (41)
= DN vi (r1 , . . . , r N +N0 ; p1 , . . . , p N +N0 ; t) k=1
ρ N +N0 dr N +1 · · · dr N +N0 dp1 · · · dp N +N0 , (35) In Eq. (41), gik is the two-particle correlation function g (2)
of ions i and k (see Fig. 2); the force Fk , depending on both
and the Liouville equation yields the continuity equation the action of the external field E and the ion–ion interaction
∂ DN forces in the solution, is given by the relationship
=− divi D N vi(N ) . (36)
∂t i Fk = e0 z k E − kT gradk gik − gradk Uik
The ion velocity vi(N )depends on the applied external elec- N
gikm
tric field E (which is independent of the position vari- − nm gradk Ukm drm , (42)
gik
ables), the ionic interactions, and the hydrodynamic cou- m=1
pling effects of ion i with the remaining ions: where gikm is the three-particle correlation function g (3)
N of ions k, i, and m; and n k and n m are the one-particle
vi(N ) = v0 (ri ) + ωi Fi(N ) + ωt χ ti F(N
t .
)
(37) distribution functions of ions k and m, which are the par-
t=1
ticle densities of Section I in the molecular scale, ρi (Sec-
In Eq. (37), v0 (ri ) is the velocity of solvent at position ri , tion I) = n i /NA . The velocities vi(1) and v0 of Eq. (41) are
and ωi and ωt are the generalized ion mobilities of ions i the particle velocity vi and the Hittorf reference velocity
and t [see Eq. (15)]. Fi(N ) and F(N
t
)
are the forces acting on v0 as defined in Section I.
P1: LDK/GLE P2: FJU/LPB QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-209 June 15, 2001 20:29
e02 z i z j
Ui j = Uicj + Ui∗j = + Ui∗j . (47)
FIGURE 2 Explanation of the two-particle distribution and cor- 4π ε0 εri j
relation functions, f i j and gi j . The probability of simultaneously
finding an ion i in dV1 and an ion j in dV2 is given by the relation- Less elaborated theories neglect the short-range potentials.
ship. The theory of electrolyte conductivity is based on the
f i j (r1 , r21 ) = ni (r1 )n j (r2 )gi j (r1 , r21 ) two-particle continuity equation
= n j (r2 )ni (r1 )g j i (r2 , r12 ) = f j i (r2 , r12 ).
The elementary volumes dV1 and dV2 , with ion- and solvent- divi ( f i j vi j ) + div j ( f ji v ji ) = 0. (48)
permeable walls, are immovably fixed in relation to the observer
O at positions r1 and r2 in the solution. The quantities ni and n j where vi j and v ji are the relative velocity of ion X j in
are the concentrations of ions i and j in the solution (number of the vicinity of ion X i and vice versa. Equation (48), often
ions per unit of volume). called the Onsager continuity equation, is obtained from
the general continuity equation (36) by the usual averaging
processes, that is, integration over all momenta and N − 2
B. Single-Ion and Electrolyte Conductivity
space coordinates of the ions. The resulting two-particle
In a homogeneous and isotropic medium (solvent), the ion continuity equation yields Eq. (48) for the stationary case,
velocity vi is independent of position and yields the molar ∂ f i j /∂t = 0.
single-ion conductivity The boundary conditions for the continuity equation
|vi (t) − v0 | are
λi = e0 |z i |NA . (43)
|E| lim gi j = 0 (49a)
|r12 |→∞
Equation (43) is the basic relationship of electrolyte con-
ductivity. An electrolyte C z+ A z− shows the molar elec- and
trolyte conductivity
ri j
(vi j − v ji ) = 0, (49b)
m = λ+ + λ− . (44) ri j ri j=R
Both single-ion and electrolyte conductivity depend on requiring attenuation to zero of the perturbation function
three effects: at infinite ion distance, Eq. (49a), and zero normal compo-
unperturbed ion migration [second term in Eq. (41)], nent of the relative ion velocity at a distance R, Eq. (49b);
relaxation effect [third term in Eq. (41)], and only the tangential component is different from zero at R.
electrophoretic effect [fourth term in Eq. (41)]. For the chemical model, distance R is the cutoff distance of
the short-range forces, which is also assumed to be the up-
Relaxation and electrophoretic effects are calculable with per limit of ion-pair association. Paired states of unequally
the help of Kirkwood’s superposition approximation for charged ions of symmetric electrolytes (z + = |z − |) at dis-
the three-particle correlation function gikm : tances lower than R are considered to be nonconducting
gikm = gik · gmk . (45) ion pairs.
The integration of the continuity equation is a laborious
The two-particle correlation functions gi j do not show the and time-consuming task, even for simple models of vi j
spherical symmetry of equilibrium functions as a conse- and gi j .
quence of the applied external field. The calculation of gi j Onsager’s limiting law is based on the correlation func-
functions requires a further approximation: the superpo- tion gi j of the Debye-Hückel theory which is the first cor-
sition of the unperturbed equilibrium function gi0j and a rect chemical model. It neglects the noncoulombic interac-
perturbation function gi j of axial symmetry, tion potentials. Since it was developed many attempts have
been made to put the concentration limit to higher values.
gi j (r1 , r12 ; t) = gi0j (r ) + gi j (r1 , r12 ; t). (46)
During the last 20 years, a quantitative description of
The perturbation term equals zero for correlation functions conductance and self-diffusion up to 1-M solutions has
of equal ions, gii = g j j = 0. been achieved by the use of modern gi j functions coming
The hydrodynamic coupling tensor χ ik is given by the from integral equation techniques such as the hypernetted
Oseen or the Navier-Stokes equations for Newton’s law chain (HNC) equation or mean spherical approximation
P1: LDK/GLE P2: FJU/LPB QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-209 June 15, 2001 20:29
applied external field is a high-frequency field. Finite re- C. Conductivity Equations of Dilute
laxation times of the ions surrounding the moving central Electrolyte Solutions
ion as a cloud of axial symmetry more or less prevent the
Onsagar’s limiting law is obtained from Eq. (57) when in
ion cloud from taking on the configuration that it would
Eq. (56) the passage to zero frequency is made, ν → 0.
have in a static external field, which thus diminishes the
Improved approximations in the calculation of pair cor-
conductance-hindering relaxation effect.
relation functions gi j and the introduction of short-range
According to the discussion in the preceding section,
potentials have led to improved conductivity equations of
molar conductivity can be expressed by an equation of
the type given by Eq. (56) or its truncated series develop-
type
ments of type
m = ∞
m + m + m ,
rel el
(56) = ∞ − Sc1/2 + Ec ln c + J1 c − J2 c3/2 . (61)
showing the three contributions to conductivity: In Eq. (61) every coefficient, S, E, J1 , and J2 , contains
unperturbed ion movement in the external field at infi- the contributions of the electrophoretic and relaxation ef-
nite dilution of the electrolyte compound, ∞ fects, el and rel . The coefficients of the series-developed
m;
relaxation effect, rel ; and Fuoss-Hsia equation, including the so-called Chen effect
m
electrophoretic effect, el (see Section III.D), are given in Table I. Onsager’s limit-
m.
ing law in terms of Eq. (61) would be given by the series
At the lowest meaningful level of theory (limiting law at development
very low electrolyte concentrations), the relaxation term at
frequency ν of the external field is given by the relationship = ∞ − Sc1/2 . (62)
Quantity κ is the reciprocal radius of the unperturbed S = S1 ∞ + S2 ; S1 = 0.82043 × 106 z 3 (εT )−3/2
Debye ion cloud; it is given by the relationships (22). S2 = 82.484 × 10−5 z 2 (εT )−1/2 η−1
The electrophoretic term is given at this level by the E = E1 ∞ − 2E 2 ; E 1 = 6.7749 × 1012 z 6 (εT )−3
relationship E 2 = 0.99750 × 103 z 5 (εT )−2 η−1
2 2b2 + 2b − 1 κR
2 NA e0 J1 = σ1 ∞ + σ2 ; σ1 = 2E 1 + 0.9074 + ln 1/2
el
m = −(ν+ z + + ν− z − )
2
κ. (60a) b3 c
6πη
35 2 κR
For symmetric 1,1 electrolytes (ν+ z + = ν− |z − | = 1) in the σ2 = E 2
3b
+ 2 − 2.0689 − 4 ln 1/2
b c
concentration range of the chemical model, Eq. (60a) is
changed to κR 4.4748 3.8284
J2 = σ3 ∞ + σ4 ; σ3 = 1/2 E 1 0.6094 + +
c b b2
NA e02 κ
el
m = . (60b) κR 34 2
3πη 1 + κR σ4 = 1/2 E 2 −1.3693 + − 2
c 3b b
The frequency dependence of conductivity was postulated κR
where = 0.50290 × 1012 z(εT )−1/2 ;
in 1928 by Debye and Falkenhagen from theoretical con- c1/2
siderations; a satisfactory experimental proof of this effect b = 16.709 × 10−6 z 2 (εT )−1 R −1 ; E 1 = 0.43429E 1 ;
E 2 = 0.43429E 2
is still lacking.
P1: LDK/GLE P2: FJU/LPB QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-209 June 15, 2001 20:29
effect is small and does not change the coefficients of the c+ = c− = αc are conducting species leads to an expres-
conductivity equation to a significant extent. Conductivity sion of the equivalent conductance:
equations for unsymmetric electrolytes, such as MgCl2 or
Na2 SO4 , are also given in the literature. /α = ∞ − S(αc)1/2 + Eαc ln(αc)
The chemical model of electrolyte solutions introduces + J1 (R)αc − J2 (R)(αc)3/2 , (65a)
short-range interactions by means of potentials of mean
1−α 1
force Wi∗j , which can be considered as contributions to KA = ; y±2 = y+ y− , (65b)
ion-pair formation. α 2 c y±2
For electrolyte conductivity, ion pairs of symmet- and
ric electrolytes are considered to be nonconducting,
κq
electrically neutral species. The appropriate expression − ln y± = . (65c)
of the thermodynamic equilibrium constant of ion-pair 1 + κR
formation, The coefficients J1 (R) and J2 (R) depend on the cutoff dis-
−→ [Cz+ Az− ]0 , tance R and thus include the influence of the short-range
C z+
+ A ←−
z−
(63)
forces on the transport phenomenon; for the activity coeffi-
introduced ad hoc into conductance theory, is given by the cient y± of the chemical model, see Electrolyte Solutions,
relationship Thermodynamics.
R Figure 4 shows the typical concentration and tempera-
2q Wi∗j
K A = 4000π NA 2
r exp − dr, (64a) ture dependence of the conductivity of a moderately asso-
a r kT ciated electrolyte in a nonaqueous solvent. The calculated
e02 |z + z − | conductivity curve [Eqs. (65)] reproduces the measured
q= . (64b) conductivities (open circles) with a precision of better
8πε0 εkT
than 0.02%. The broken line of 25◦ C is the limiting law,
The upper limit of ion-pair association and the cutoff dis- Eq. (62). There is no doubt that the Onsager limiting law
tance of short-range interactions in electrolyte solutions can be confirmed by highly precise measurements, but its
are considered identical in the framework of the chemi- validity range is limited at very low concentrations.
cal model. Association constants are independent of the Data analysis of conductivity measurements yields the
electrolyte property, thermodynamic property, or trans- limiting conductivities ∞ and the association constants
port property for which they are determined. Table II KA.
proves this statement by comparison of association con-
stants obtained from conductance measurements and mea-
surements of the heat of dilution.
The inclusion of the ion-pair concept in the statistical
thermodynamic theory is made by setting the distance R
of the boundary condition, Eqs. (49), equal to the upper
limit R of the association integral. Taking into account
that only the “free” cations and anions at concentrations
D. Theory of Transference Numbers The first is the contribution to the electrophoretic part of
of Dilute Solutions conductance, and the second results from the negligibly
small space-dependent part of the interionic two-particle
The definition of transference numbers, Eqs. (52), entails
force, λel(2) ∼
= 0 (Chen effect).
for 1, 1 electrolytes the relationships
Combining Eqs. (66)–(68) yields the transference-
Ii λi number equation in the form
ti = = , (66a)
I ti − 0.5 ∞
λi∞ = . (69)
ti∞ = ∞, (66b) ti∞ − 0.5 ∞ + el
Equation (69) indicates that the relaxation effect does not
and influence transference numbers of symmetric electrolytes.
= λ+ + λ− , (66c) Using Eq. (60b) for el yields the relationships
√
where λi and are the single-ion and the electrolyte ti − 0.5 ∞ ∞ S2 αc −1
= − + Bαc, (70a)
conductivities and λi∞ and ∞ the corresponding lim- ti∞ − 0.5 1 + κR
iting values. Single-ion conductivities contain relaxation 1−α 1
(λirel = λi∞ × E/E) and electrophoretic (λiel ) contribu- KA = , (70b)
α 2 c y±2
tions, just as electrolyte conductivity does; E is the
change in the electric field caused by the ion charges in and
κq
the solution: − ln y± = . (70c)
1 + κR
E
λi = λel + λi∞ 1 + . (67) In Eq. (70a) the coefficient B is not completely calculable;
E
B results from the terms of improved theories that are
The electrophoretic effects on anions and cations are neglected in Eq. (70a) and might include the consequence
equal. The electrophoretic contribution consists of two of unknown corrections needed in the experiments; S2 and
parts: κ R are given in Table I.
E Figure 5 shows the features of transference numbers.
λel = λel(1) 1 + The symmetry of Eq. (70a) is obvious. If t+∞ > 0.5, the
E
transference numbers increase with increasing concen-
E 2λel tration and decreasing temperature, and vice versa if
+ λel(2) 1 + + ∞ . (68) t+∞ < 0.5. Transference-number measurement yield the
E
FIGURE 5 Temperature dependence and concentration dependence of cationic transference numbers of methanol
solutions of Me4 NSCN ( ❤) and KSCN (♦) at various temperatures. The full lines are the computer plots according to
Eqs. (70). [From Barthel, J., Ströder, U., Iberl, L., and Hammer, H. (1982). Ber. Bunsenges. Phys. Chem. 86, 636–645.]
P1: LDK/GLE P2: FJU/LPB QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-209 June 15, 2001 20:29
quantities ti∞ by extrapolation as the only character- where commonly the two formation constants K T+ and K T−
istic quantities of the electrolyte solution; the small are supposed to be equal. The maximum results from the
concentration dependence of the transference num- competition of ion aggregates of various types.
bers for symmetric electrolytes does not permit the The conductivity equation of Fuoss and Kraus,
determination of association constants, in contrast with
conductance measurements. The most important applica- y± c1/2
tion of transference-number measurements for symmetric 1 − S(∞ )−3/2 [c(1 − /∞ )]1/2
electrolytes is the determination of precise single-ion con- ∞ KT
ductances. Transference numbers were also successfully = 1/2
+ λ∞
T 1/2
(1 − /∞ )c, (72)
KA KA
applied in investigations of ion aggregates, which con-
tribute to ion mobility by their charges, such as ion pairs is the appropriate equation for reproducing the conductiv-
of nonsymmetric electrolytes and triple ions of symmetric ity curve up to concentrations near the conductivity min-
electrolytes. imum (cmin = 2.14 × 10−2 mol dm−3 at 25◦ C in Fig. 6).
In Eq. (72), y± is the mean activity coefficient of the free
ions; S the limiting slope (see Table I); λ∞ T the limiting
E. Triple-Ion and Higher-Ion
value of the triple ions C+ A− C+ and A− C+ A− ; and K A
Aggregate Formation
and K T = K T+ = K T− are the equilibrium constants of ion-
For low-permittivity solutions, the highest concentration pair, Eq. (63), and triple-ion, Eqs. (71), formation.
for which pairwise additivity of the potential functions is At concentrations far below the conductivity minimum
reasonable in MM-level Hamiltonian models is found at (c 10−3 mol dm−3 in Fig. 6), triple-ion formation can
very low concentrations, for example, 10−4 M in Fig. 6. be neglected. Data analysis is possible with the help of
Figure 6 shows the dependence on concentration Eqs. (65), in agreement with pairwise additive potential
and temperature of the molar conductivity of 1,2- functions, and yields the values of ∞ in Table III. The
dimethoxyethane solutions of LiBF4 from infinite dilution ion-pair association constants K A(1) of these plots agree
to saturation. The plots of versus c1/2 show a minimum well with the K A(2) determined independently at higher
at moderate concentrations and a maximum at high con- concentrations with the help of Eq. (72), which takes into
centrations. Although the minimum is only weakly de- account both ion-pair and triple-ion formation. No method
pendent on temperature, the maximum exhibits a strong is known for the determination of the values of λ∞ T ; data
displacement. The minimum is a general feature of bilat- analysis yields only the product λ∞ T K T , in which the quan-
eral triple-ion formation: tity λ∞
T is commonly estimated to be 2 ∞
/3. Both ion-pair
−→ [C+ A− C+ ]+ ; K +
[C+ A− ]0 + C+ ←− (71a) and triple-ion formation decrease with decreasing temper-
T
ature in accordance with increasing solvent permittivity.
and The conductivity equation for electrolytes undergoing
−→ [A− C+ A− ]+ ;
A− + [C+ A− ]0 ←− K T− , (71b) unilateral triple-ion formation, Eq. (71a) or Eq. (71b), is
given by the relationship
Triple-ion formation is commonly restricted to low- are rigorous statistical mechanical approaches. Only a few
permittivity solvents (ε < 15), but it is also known in high- of those that have attracted the interest of applied research
permittivity solvents as a consequence of noncoulombic and engineering science are treated here.
interactions. Three classes of transport equations can be found in the
literature: molten salt approaches, empirical extensions of
the equations for dilute solutions, and empirical equations
IV. VISCOSITY just for fitting measured data.
Molten salt approaches such as the Vogel-Fulcher-
The concentration dependence of the viscosity η of elec- Tamman (VFT) equation have been used repeatedly for
trolyte solutions up to moderate electrolyte concentrations analyzing the temperature dependence of transport prop-
is commonly represented by the equation erties W (T ) such as diffusion, conductance, and fluidity,
η(c) √ or of relaxation processes:
− 1 : A c + Bc. (74)
ηs B
W (T ) = A exp − . (78)
In Eq. (74), ηs is the viscosity of the solvent, η(c) is the R(T − T 0 )
viscosity of electrolyte concentration c, and A and B are Equation (78) can be deduced from the equilibrium dis-
constants. Coefficient A is available from the theoretical tribution of an isothermal, isobaric ensemble of coop-
z+ z−
limiting law for Cν+ Aν− electrolytes: eratively rearranging domains in the liquid, which can
√ undergo a transition to a new configuration without con-
1 e02 NA p̄ 2 1 − p ∗
η(c) − η = κ p̄ + 4 √ , figurational change at and outside its boundary.
480π ν+ z +2
ν− z −
2 p̄ 1 + p ∗ At the glass transition temperature T 0 of the system
(75) the configurational part of entropy vanishes. It is assumed
that the transition of the supercooled melt to the glass is a
where type of second-order transition to obtain Eq. (78), where
2 2 B is a temperature-independent energy term of transport,
3 3 2 2
z+ z− z+ z−
p̄ = ν+ ∞ − ν− ∞ ; p̄ = ν+
2
∞ + ν− ∞
R is the gas constant, and A is a temperature-independent
λ+ λ− λ+ λ− quantity, depending on the composition of the solution.
(76a,b) Equations based on empirical extensions of the equa-
and tions for dilute solutions use the fact that the viscosity
1 ν+ z + λ∞ ∞
+ − ν − z − λ−
of the system is the most important effect on the transport
p∗ = . (76c) properties and introduce appropriate viscosity corrections.
ν+ z +
2
+ ν− z −
2
λ∞ ∞
+ /z + − λ− /z −
δµiel kT
∞ =− ∞ , (84)
µi 3π ηDi 1 + σ
where µi∞ is the electrophoretic velocity, Di∞ is the ionic
diffusion coefficient at infinite dilution [deducible from
λi∞ ; see Eq. (25)], and is Blum’s screening parameter
of the MSA:
κ
2 = . (85)
FIGURE 9 Specific conductivity κ versus molarity m at 25◦ C
(1 + σ )
for Bu4 NClO4 in mixed solvents propylene carbonate–acetonitrile
In Eq. (84) σ is an average ionic diameter calculated from
(mole fraction of acetonitrile is indicated on the right side of each
curve). Points: measured data; full lines: Eq (79); dashed lines: the ionic diameters σi and the ion concentrations ci :
MSA equation. 2
i z i σi ci
σ = . (86)
feature that the temperature coefficients of conductivity z i2 ci
for all electrolyte solutions at infinite dilution in a given κ is Debye’s parameter [Eqs. (22)].
solvent, and of viscosity, are equal at every temperature; The first-order relaxation effect is obtained by the solu-
that is, infinitely dilute solutions are corresponding states tion of the continuity equation at this level, which yields
in terms of transport energies. Suffice it to note that the
values extrapolated toward zero concentration of transport δE 1 β 2 e02 |z i z j |
=−
energies from highly concentrated solutions based on the E 4π ε0 ε 6kT σ (1 + σ )2
VFT equation equal those obtained from the conductivity
1 − exp(−2βσ )
equations based on MM-level Hamiltonian models. × , (87)
β2 + 2β + 2 2 [1 − exp(−βσ )]
B. Statistical Mechanical Approaches where
2 ∞ 2 ∞
Extended laws are available for the variation with con- e02 NA ci z i Di + c j z j D j
centration of the transport coefficients of strong and as- β2 = . (88)
ε0 εkT Di∞ + D ∞j
sociated electrolyte solutions at moderate to high concen-
trations. Like the CM calculations, this work is based on Association can be included in the extended theory with
the Fuoss-Onsager transport theory. The use of MSA pair the help of the CM, which yields the association con-
distribution functions leads to analytical expressions. Ion stant given by Eq. (65b). For the calculation of the portion
association can be introduced with the help of the chemical (1 − α) of ion pairs, Eq. (65b) is subjected to an iteration
method. A simplified version of the equations, by taking process beginning with yIP = y± = 1:
average ionic diameters, reduces the complexity of the 1/2
original formulas without really reducing the accuracy of yIP 1 2
yIP 4yIP
the description and is therefore recommendable for prac- 1−α =1+ − + .
2K A cy±2 2 K A2 c2 y±4 4K A cy±2
tical use for up to 1-M solutions.
(89)
C. Conductivity Equations The single-ion and the ion-pair activity coefficients of
MSA are made up by an electrostatic part and a hard-
For a completely dissociated electrolyte, the
appropriate sphere contribution:
equivalent conductivity expression, = λi , follows
from the ionic conductivities ln yi = ln yiel + ln yihs ; i = +, − or IP. (90)
P1: LDK/GLE P2: FJU/LPB QC: FYD Final Pages
Encyclopedia of Physical Science and Technology EN005E-209 June 15, 2001 20:29
The electrostatic part yiel is given by the expression Ebeling and Kraeft developed a statistical theory for
ion–dipole solutions (physical model) with the aim of tak-
1 e02 z i2 ing into account ion–solvent interactions.
ln yiel = − , (91)
4πε0 ε kT 1 + σ Computer simulations such as molecular dynamics
el
that is, yIP = 0 for a symmetric electrolyte. For the hard- (MD) and Brownian dynamics (BD) permit the study of
sphere contribution, only the ratio transport properties. Self-diffusion coefficients can easily
be obtained by differentiation of mean-square displace-
hs 2
y± 1 − 0.5ξ πσ 3 NA ments or by integration of the velocity self-correlation
= ; ξ = cj (92) functions of the ion. In contrast, the evaluation of con-
hs
yIP (1 − ξ )3 6
ductivity by means of cross-correlation functions is cum-
is needed for the calculation of the association constant. bersome and computer-time-consuming and can only
The data analysis with the help of Eqs. (83), (84), (87), scarcely be executed.
and (88) where association constants are used with MSA The advantage of computer simulations is the possi-
activity coefficients [Eqs. (90), (91), (92)] yields a good bility of obtaining transport data that cannot or can only
reproduction of experimental data up to molar concen- barely be measured. It is possible in this way to simulate
trations. As an example, Fig. 9 shows the conductivity diffusion coefficients of solvent molecules in the ionic sol-
of Bu4 NClO4 in the mixed solvent system acetonitrile– vation shells and to compare them with those of the bulk
propylene carbonate. Comparison is made of measured solvent molecules and with those of the ions, or to study
data with MSA and the Amis-Casteel equation, which both transport coefficients at different time scales.
exactly reproduce the conductivity maximum at every sol-
vent composition.
SEE ALSO THE FOLLOWING ARTICLES
I. Introduction
II. Qualitative Dynamics
III. Dynamics: Theory
IV. Statistical Approximations and Dynamics
V. Experimental Studies
VI. Control of Dynamics
VII. Summary
477
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
on the dynamical flow of energy among subcomponents dance with quantum mechanics. Third, the time-evolved
of the molecule as the primary property of interest. state of the system is analyzed, that is, measured. Ex-
plicit emphasis in a measurement (and sometimes prepa-
ration) on the flow of energy among subcomponents of
I. INTRODUCTION the molecule (e.g., chemical bonds or rotational degrees
of freedom) constitutes the study of intramolecular energy
Naturally occurring systems in the gaseous or liquid transfer.
phase at typical temperatures are composed of vast It is advantageous to distinguish two qualitatively dif-
numbers of atoms and molecules in constant motion. The ferent types of intramolecular energy flow, which we
well-defined system temperature is a reflection of the shall term reversible and irreversible. In the former, en-
stored energy content, both in the form of intermolecular ergy flows from one part of the molecule to another,
features such as molecule–molecule interactions and but then subsequently returns, reforming the initial state.
translational motion and in the form of motion internal to That is, the system shows no long-term trend toward a fi-
individual molecules. The latter, the motion of isolated nal redistribution of energy among subcomponents of the
molecules, is termed intramolecular dynamics. Included molecule. In the latter, there is a transfer of energy within
within this definition are both dynamics at energies below the molecule, with a general trend toward a stationary final
the dissociation energy of the molecule, in which case state. The latter behavior is that associated with relaxation,
it remains perpetually bound, and dynamics at energies or statistical, dynamics and has historically been assumed
above dissociation, in which case the molecule can break to occur in highly excited molecules. Conditions under
up into different chemical products. which molecular systems appear to display reversible vs
Intramolecular energy transfer is the subset of in- irreversible energy transfer are discussed in detail later.
tramolecular dynamics in which the focus of attention is Intramolecular dynamics and intramolecular energy
on the flow of energy within the isolated molecule. As the transfer have been, and continue to be, areas of intense
simplest of models, one may imagine a linear molecule scientific interest. Such interest falls into two categories,
A B C as being three mass points coupled by springs. loosely termed “practical” and “fundamental.” From the
Initiating an oscillation of the model by stretching the A B practical viewpoint, note that the outcome of a molec-
spring and subsequently observing the time-dependent al- ular process is heavily linked to the flow of energy in
teration of lengths of the A B and B C springs correspond the molecular participants. That is, an understanding of
to a simple model of an experiment on vibrational energy intramolecular energy transfer proves central to the in-
transfer. Indeed, modern experimental techniques allow terpretation of chemical processes and their dependence
for the creation of collision-free environments in which on system conditions. For example, unimolecular decay,
such isolated intramolecular dynamics may be studied and in which an isolated energized molecule dissociates into
possibly externally influenced. chemical products (e.g., ABC → A + BC, where ABC, A
The isolated molecule, comprised of a set of N atoms and BC are arbitrary molecules), occurs via the concen-
bound by interatomic forces, is a complex physical sys- tration of sufficient energy in the A BC bond. Further
tem. It possesses 3N degrees of freedom related to nuclear practical interest arises from important developments in
motion, three being associated with center-of-mass trans- laser technology that permit the introduction of energy
lation, three with rotation (or two if the molecule is linear), into molecules in a variety of controlled ways. This opens
and the vast remainder with vibration. The molecule also the possibility of externally influencing the outcome of
has electronic degrees of freedom associated with the con- a molecular process by varying the initial mechanism of
figuration of its electrons. Additional degrees of freedom, preparation (e.g., producing AB + C, rather than A + BC,
related to internal nuclear composition, are not readily from ABC by judicious preparation of the initial state).
altered at the energies of interest in molecular chemistry Indeed, theoretical developments over the past three years
and physics and can therefore be neglected. Thus, even have led to several proposals for controlling chemical re-
the simplest description of the dynamics of a typical small actions in this manner (see Section VI).
molecule such as benzene (C6 H6 ), which regards it as be- From a fundamental viewpoint, intramolecular energy
ing composed of 12 atoms, involves the complex motion transfer links to three basic scientific issues: (1) rever-
of 33 interacting degrees of freedom. sible vs relaxation phenomena, (2) quantum vs classical
It is convenient to visualize intramolecular dynamics in chaos, and (3) quantum/classical correspondence. These
terms of three steps, not necessarily independent. First, the are briefly introduced in Section II, where it becomes clear
molecule is prepared in a time-dependent state by any of a that intramolecular energy-transfer studies provide a use-
variety of means (e.g., collisions, chemical reaction, laser ful laboratory for the study of fundamental questions in
excitation). Second, the molecule evolves in time in accor- these areas.
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
Experimental and theoretical studies of intramolecular separate steps. In the first step, the forces between the
energy transfer and intramolecular dynamics have a long atoms in the molecule are determined or modeled, while
history. This article, designed as an introduction rather in the second step one considers the dynamics determined
than as a survey, focuses on recent directions in this field of by these forces. This dynamics is done either quantum
research. Of specific interest are insights gained into gen- mechanically, which is correct but difficult, or via
eral rules for understanding and measuring intramolecular classical mechanics, which is often a good approximation
dynamics. For this reason we cite only a few sample com- to the quantum result. In either case, the forces describing
putations to illustrate relevant general features and provide the dynamics are sufficiently complex to necessitate
only a brief qualitative discussion of experimental meth- numerical computer solutions. As an introduction to
ods. References to more historical interests in the field are the nature of intramolecular dynamics and to issues of
provided at the end of this article. interest in this area, we consider two examples.
The organization of this article is as follows. Section II
is designed to provide qualitative insight into intramolec-
ular energy transfer via two subsections. The first, Sec- A. Two Model Calculations
tion II.A, discusses selected computational results on two As a first example, consider nuclear motion in a four-
molecular models, and the second, Section II.B, quali- atom system, A B C D. Two qualitatively different en-
tatively introduces the fundamental problems alluded to ergy ranges are possible. In the first, the system is pro-
above. The reader who is interested in a qualitative pic- vided with sufficient energy to induce vibrational and ro-
ture is urged to first focus on these sections. Section III tational motion but insufficient energy to break any of
contains a description of isolated molecule dynamics from the bonds. This is bound-state intramolecular dynamics.
both the quantum and the classical viewpoints. Emphasis In the second regime, there is sufficient energy to allow
here is on several general features of classical and quan- molecular dissociation to one or more of the molecular
tum intramolecular dynamics. Finally, two brief sections, products (e.g., A B + C D). Typically, each of the inter-
Sections IV and V, discuss statistical approximations to atomic bonds will have a different dissociation energy, the
intramolecular energy transfer and the nature of modern energy required to break the bond. Thus, several dissoci-
experiments designed to probe molecular motion. ation “channels” are possible, such as
Space limitations, coupled with the author’s intention
to provide a useful introductory treatment, have led to A B C D → A B+C D E1
restrictions on the material that can be covered. Thus, A B C D → A+B C D E2
we focus throughout this article on adiabatic processes,
that is, dynamics that take place without change in the A B C D → A+B+C+D E3
electronic configuration of the molecule. When this is where the lowest energy required for each of the particular
not the case, a remark to this effect is made. In addi- processes is arbitrarily labeled E 1 , E 2 , etc.
tion, we assume throughout that the radiation field, be Consider the specific case of NaBrKCl for which the-
it associated with radiative absorption or emission, is suf- oretical, classical dynamics studies are available at en-
ficiently weak to be treatable as a perturbation. Further, ergies where two dissociative channels are energetically
the field of intramolecular dynamics is replete with model accessible. (Questions as to the validity of the classical
approaches that, albeit reasonable, have not been justi- picture are relegated to later sections.) This system pos-
fied either theoretically or experimentally—the latter due sesses attractive forces between the atoms such that the
principally to technological limitations. The modern focus bound NaBrKCl species lies at an energy of approxi-
on accurate dynamics is emphasized in this article, with mately 40 kcal/mol below NaBr + KCl or NaCl + KBr.
the consequence that such simple models are necessarily Specifically, consider the case where energized NaBrKCl
slighted. is formed by the collision
NaBr + KCl → NaBrKCl
II. QUALITATIVE DYNAMICS with sufficient energy to dissociate as
Information regarding intramolecular dynamics is avail- NaBrKCl → NaCl + KBr
able from three sources: experimental studies on specific
NaBrKCl → NaBr + KCl.
molecular systems, theoretical computations on spe-
cific systems or models, and formal studies of typical The initial collision between NaBr and KCl is here
(“generic”) systems. At present, the latter two provide regarded as the preparatory step to the subsequent in-
considerably greater detail than the first and involve two tramolecular dynamics and decay of the intermediate
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
FIGURE 2 Similar to Fig. 1 but at a lower energy, E = 0.0227 a.u. [From Brumer, P. (1972). Ph.D. dissertation,
Harvard University.]
main themes in intramolecular energy transfer. Such sta- role of the relative rates of intramolecular energy trans-
tistical models have both advantages and disadvantages. fer and other competitive processes. That is, the degree of
They are advantageous in that they considerably simplify intramolecular energy flow within the molecule depends
the description of the dynamics. They are disadvantageous on the length of time the system exists as a bound en-
in that they imply that the final state of the system is rela- tity, as well as on some (as yet undefined) “rate of in-
tively insensitive to the initial state. That is, the outcome tramolecular energy flow.” In cases where competitive
of a chemical event is not readily influenced by altering processes such as dissociation (or the ever-present radia-
initial conditions. tive emission) are possible, effective intramolecular en-
The example discussed here displays a number of rele- ergy flow requires a larger rate of energy transfer than of
vant features. Clearly noticeable is the complexity of indi- competitive processes. (Note, however, that in the case
vidual trajectories modeling long-lived dynamics, as well of competitive dissociative or other nuclear rearrange-
as the possibility of simplifications that result if a statistical ment processes, both the energy transfer and the compet-
description that includes relaxation applies. The partici- itive process are governed by the same set of underlying
pation of rotations as well as vibrations in the dynamics dynamical equations for the nuclear motion. This leads to
is also evident. In addition, it makes clear the important a natural difficulty associated with attempting to partition
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
FIGURE 3 Vibrational energy in each of the four alkali–halide bonds, as a function of time, for the trajectory shown
in Fig. 2. [From Brumer, P. (1972). Ph.D. dissertation, Harvard University.]
the dynamics into distinct parts labeled intramolecular dy- mechanics, where the initial state is a mixed state (see
namics and dissociation.) Section III.C). One useful measure of system dynamics is
The computation described above is completely classi- the probability of the system returning to the state from
cal: the nuclear motion is assumed to be well described which it started. Figure 7 shows the time dependence of
by Newton’s equations. The extent to which classical me- this probability for one case, where the quantum and clas-
chanics provides a useful description of intramolecular en- sical results are seen to be in excellent agreement. In sharp
ergy flow is another focus of current research in this area. contrast is the comparison shown in Fig. 8, which cor-
As one example of the validity of classical mechanics, responds to the probability of return to another, higher
consider the bound-state dynamics of a three-atom sys- energy initial mixed state where classical–quantum dis-
tem confined to a line, that is, A B C. Computations on agreement is substantial. The effect has also been seen
the case where the A B and B C bonds are anharmonic for the higher energy decay of A B C to AB + C as
have been performed using both classical and quantum shown in Figs. 9 and 10. Specifically, the probability of the
FIGURE 4 Total rotational energy associated with the NaClKBr system during the trajectory shown in Fig. 2. [From
Brumer, P. (1972). Ph.D. dissertation, Harvard University.]
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
FIGURE 6 Average energy in each of the four alkali–halide bonds as a function of time. Here, the average values
are obtained from a set of NaBr + KCI trajectories. “Error bars” show the range of values associated with the set of
trajectories incorporated in the calculation. [From Brumer, P. (1972). Ph.D. dissertation, Harvard University.]
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
of the system, and we shall assume this reduction to the phase space, with (q, p) as coordinates. A trajectory is a
internal molecular coordinate system has been carried out; curve in this space parametrized by the index t.
thus, q denotes 3(N − 1) coordinates. Modeling a realistic system necessitates producing a set
Central to the nature of dynamics is the notion of cou- of trajectories with varying initial conditions and looking
pled vs uncoupled degrees of freedom. Consider, for ex- at averages of system properties over this set. It is this
ample, a system whose Hamiltonian is the sum of two trajectory technique, where V (q) is a model or realistic
terms, such that potential and where Hamilton’s equations are solved nu-
merically on a computer, that has been extensively used
H (q, p) = h 1 + h 2 , (2) to study intramolecular dynamics in small molecules. Re-
where h 1 depends on a different set of coordinates and sults typical of those obtained were shown in Section II.A.
momenta than h 2 . Under these circumstances, the dynam- The flexibility of Hamilton’s approach lies in the ap-
ics of the degrees of freedom in h 1 are independent of pearance of generalized canonical coordinates (p, q). As
those in h 2 . That is, the total system is composed of two a consequence of their generality, one may seek out the set
independent subsystems. Introduction of a coupling term, of coordinates and momenta within which the dynamics
to give is most easily performed and understood. For example,
a conservative Hamiltonian system may have, along any
H = h 1 + h 2 + V (1, 2), (3) trajectory, a constant value of total angular momentum J.
Such a quantity is said to be a constant of the motion and
where V (1, 2) is a term dependent on the coordinates and
can prove useful as a momentum, since the equation of
momenta of both h 1 and h 2 , induces energy exchange
motion for J is particularly simple, dJ/dt = 0. Indeed, the
and interrelated dynamics among the entire system. Major
idea of seeking constants of motion for use as coordinates
qualitative changes in the dynamics can result from small
or momenta is the central goal of the Hamilton–Jacobi
perturbations or couplings.
approach to classical dynamics. The essential approach is
It is important to note that although Eq. (3) is conceptu-
simple. One seeks a set of M constants of the motion A of
ally pleasing, there is no unique division into subsystems
the system via a systematic procedure. Once found, these
and into intersubsystem coupling for a given physical sys-
momenta are known to be constant along any trajectory,
tem. Rather, such a division must be motivated by an ex-
and the time dependence of the conjugate coordinates is
perimental or theoretical interest in a particular property,
generally simple. With the constants of the motion known,
such as the energy flow between subsystems h 1 and h 2 .
along with the procedure for generating them, it is a rel-
Central to the nature of intramolecular energy transfer is
atively straightforward algebraic problem to express the
an understanding of the effect of the coupling V (1, 2) on
desired q(t), p(t) in terms of these constants and their
the energy flow among the subsystems represented by h 1
conjugate coordinates. That is, we have
and h 2 .
p(t) = A = const 1
(5)
B. Classical Mechanics q(t) = q(t, A).
There are a number of formulations of classical mechan- Although we shall not deal with Hamilton–Jacobi theory
ics, each providing different insights into its nature. For here, the concept of a set of constants of the motion is vital
example, Hamilton’s method, used here, describes dynam- to an understanding of the issue of intramolecular energy
ics in terms of trajectories in generalized coordinates and flow and statistical vs nonstatistical behavior. In essence,
momenta. Consider an M degrees of freedom system with the number of global constants of the motion provides a
system Hamiltonian H (q, p), where (q, p) is a complete method for grouping systems into general categories.
set of M conjugate generalized coordinates and momenta. Consider an M-degrees-of-freedom system. How many
The time evolution of the system is given by Hamilton’s constants of the motion can we identify for a given tra-
equations, jectory? The answer is clearly 2M, with the simplest set
dqi /dt = ∂ H/∂ Pi d pi /dt = −∂ H/∂qi (4) consisting of the initial values of the coordinates and mo-
menta that define the trajectory and that surely constitute
with i = 1, . . . , M. Specifying the state of the system at a set of 2M equations for q(t), p(t) in terms of constants.
t = 0 via the initial conditions q(t = 0) = q0 , p(t = 0) = p0 That is,
then leads to a solution, a trajectory p(t), q(t), from which
q(t) = q(q0 , p0 ; t) p(t) = p(q0 , p0 ; t).
the time dependence of all system properties along that
trajectory can be computed. The motion may be best visu- These constants of the motion, however, are of little inter-
alized as taking place in a 2M-dimensional space, termed est. Although knowing them does identify the trajectory
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
uniquely, it provides no reduction of the complexity of Note then that trajectories in such systems come arbitrarily
solving the full problem for q(t), p(t). Clearly, what we close, during the course of their dynamics, to their orig-
seek is a set of global constraints that define smooth sur- inal starting position. For this reason, such dynamics is
faces in phase space on which the trajectories lie. Members termed “quasiperiodic.” The vast majority of “textbook”
of such a set of functions are called global constants of the problems dealt with in elementary and advanced analyti-
motion. The number of global constants of motion that a cal mechanics treatises are of this type. Examples include
system possesses provides a means of generally classify- the hydrogen atom or the small-vibrations Hamiltonian,
ing the system behavior. although these systems tend to be, in addition, separable,
that is, of the form H = H (K 1 ) + H (K 2 ) + · · · .
1. Integrable Systems Trajectories in integrable systems are stable with
respect to small changes in initial conditions. In
Consider first the case in which an M-degrees-of-freedom particular, consider a trajectory [q(t), p(t)] emanat-
system possesses M independent global constants of the ing from initial conditions q(0) = q0 , p(0) = p0 and an
motion, denoted K. Selecting these as the M generalized initially close trajectory [q (t), p (t)] originating from
momenta K, with conjugate momenta Q, allows us to write q (0) = q0 + δ Q, p (0) = p0 + δ P with δ Q and δ P very
the Hamiltonian as H (K). Hamilton’s equations then be- small. Then define d(t) as the time-dependent “distance
come in phase space” between these two trajectories,
d K i /dt = −∂ H/∂ Q i = 0 d(t) = [ p1 (t) − p1 (t)]2 + [ p2 (t) − p2 (t)]2
(6)
d Q i /dt = ∂ H/∂ K i = ωi (K), + · · · + [q1 (t) − q1 (t)]2 + · · ·]1/2 . (8)
This quantity measures the rate at which two nearby tra-
where the last equation defines the frequencies ωi (K). Sys-
jectories separate as a function of time.
tems allowing this description are termed integrable, or
4. Then, for regular systems, d(t) grows linearly with
regular, and possess a number of important properties:
t, a relatively slow rate of separation characteristic of
stability.
1. Trajectories lie on M-dimensional surfaces in phase
space whose topology is that of a torus. The tori are la-
All these features are, in a qualitative sense, indicative
beled by the values of K. For example, in a two-degrees-
of essentially predictable motion in integrable systems.
of-freedom system, the motion lies on the surface of a
doughnut (see Fig. 11).
behavior is repetitious, or at least describable by a simple
2. The frequencies of motion about the independent
set of frequencies. The linear growth of d(t) suggests that
directions on the torus are given by ω(K).
knowledge of the behavior of a single trajectory allows
3. The time dependence of the coordinates and mo-
prediction, over a fair length of time, of the behavior of its
menta for a given trajectory are given by the Fourier series
initially nearby neighbors.
(where n is a vector of M integers)
As a simple example of regular dynamics, to be em-
q(t) = qn (K)exp[in · ω(K)t] bellished later, consider a system of two uncoupled
n oscillators,
(7)
H = H1 ( p1 , q1 ) + H2 ( p2 , q2 ), (9)
p(t) = pn (K)exp[in · ω(K)t].
n where the individual oscillators, with Hamiltonians
Hi (qi , pi ) = T ( pi ) + V (qi ), have potentials terms of the
Morse form,
V (qi ) = Di [exp(−qi /ai ) − 1]2 . (10)
Here, Di is the oscillator dissociation energy and ai is
a system parameter. Equation (10) might, for example,
model three atoms on a line where the coupling poten-
tial between the oscillators has been eliminated. The in-
dividual Hamiltonians H1 and H2 are then the conserved
integrals, and their numerical values remain constant at
FIGURE 11 Sample quasiperiodic trajectory in a two-degrees-
of-freedom system as it moves on the surface of a torus in phase their initial values throughout the dynamics. Since this is
space. The trajectory shown is actually periodic; in general, the a two-degrees-of-freedom system and since two constants
trajectory will fill the entire torus surface. of the motion exist, the system is integrable.
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
Consider a measurement on the dynamics of this sys- to actual experimental techniques and measurements are,
tem. If an experiment were able to begin with an initial however, not well formulated.
state consisting of a single trajectory, then the time evolu- The regular system constitutes one major category of
tion would be on a torus in phase space and there would observable dynamical behavior. Systems that are inte-
be no energy transfer between the subsystems H1 and H2 . grable are well known and have been experimentally ob-
Sufficiently long-time observations on the system would served, as discussed later. A second major category of
reveal repetitive quasiperiodic motion. Two remarks re- Hamiltonian systems emerges from formal ergodic the-
garding such measurements are, however, important. First, ory, which defines a set of increasingly idealized statistical
if one is interested in the dynamics of a subcomponent systems. Such systems (in terms of increasing statistical
other than H1 and H2—say one interrogates the time de- characteristics) are termed ergodic, mixing, K-systems,
pendence of the harmonic oscillator p12 /2m 1 + q12 —then and Bernoulli systems. Each category imposes additional
the measurement would reveal energy flow into and out conditions, leading to requirements difficult to verify for
of this subsystem. Quasiperiodicity would still be evident, realistic systems. Thus, they are to be regarded as idealized
however, after a suitable time. Second, if the time scale of models of statistical motion.
the experiment is short compared to the relevant system
frequencies, then this quasiperiodicity will not be man- 2. Ergodic
ifest. The essential point then is that the nature of the
Consider first the integrable system where each trajectory
measurement of interest determines whether the regular
lies on the surface of a torus. Two conditions are possible.
system behavior is fundamental to, or observable in, the
In the first, the trajectory wraps about the torus and closes
particular experimental study.
on itself without covering the torus completely. An exam-
There is another feature of integrable systems that is
ple is shown in Fig. 11, where it is clear that this property
important. Specifically, consider the concept of statistical
arises if the frequencies of motion about the torus are re-
behavior in dynamics. The fundamental features of such
lated to one another by the relation n 1 ω1 + n 2 ω2 . Such a set
behavior are that a trajectory at energy E fills the entire
of frequencies is said to be rationally related and results in
volume of phase space associated with that energy E, that
the trajectory returning exactly to its original position. On
a set of trajectories relaxes to a long-time limit that no
the other hand, the frequencies on the torus may not be ra-
longer varies with time, and that the final state is depen-
tionally related, in which case the trajectory fills the entire
dent solely on the energy of the system. It is clear that
surface of the torus. Under such conditions the dynamics
the first of these properties is not satisfied by a regular
is said to be ergodic on the torus. This formal terminol-
system, since a trajectory lies on the surface of a torus
ogy does not correspond to the historical use of the term
of dimensionality M, whereas the constraint to constant
ergodic as found in the physics literature. There, ergodic
energy would confine dynamics to a larger surface of di-
tends to mean ergodic on the energy hypersurface, that is,
mension 2M−1. It would also appear, from the list of pro-
on the (2M−1)-dimensional surface in 2M-dimensional
perties above, that a regular system does not relax. This is,
phase space that results from constraining the system to
in fact, not the case. That is, properties (1)–(4) constitute
constant energy. For clarity, we shall term this E-ergodic.
features of the trajectories of a regular system. As al-
Thus, the characteristic of an E-ergodic system is the
ready remarked, however, typical comparisons with phys-
existence of a single trajectory at each energy E that comes
ical systems require information on the average behavior
arbitrarily close to all points on the energy hypersurface.
of the time development of a collection, or ensemble, of
It is important to note, however, that this property does
trajectories. It is therefore important to note that despite
not ensure that the system displays irreversible relaxation
the quasiperiodic behavior of integrable systems, an en-
during the course of the dynamics. A pictorial analog of
semble of trajectories in a regular system can relax to a
possible motion of an ergodic system is provided by imag-
long-time, stationary distribution. The final relaxed state
ining a speck of carbon in a continuously stirred fluid. The
of the system is, however, intimately related to the initial
carbon speck, representative of the system in phase space,
conditions of the dynamics. This is clear from the simplest
moves throughout the fluid without constraint, but does
of considerations. That is, each of the trajectories in the
not settle down to some long-time stationary state.
set of trajectories retains its original values of the con-
served quantities. Thus, the final state of the system will
3. Mixing
depend on more than just the total overall energy of the
system. A system that is ergodic but has the rudimentary prop-
Each of properties (1)–(4) gives rise to useful computa- erties associated with statistical irreversible behavior is
tional tools for the theoretical identification of integrable the mixing system. Such a system displays the following
behavior in models of molecular motion. Relationships properties.
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
Consider the system at energy E. Denote the average tween nearby trajectories that grows exponentially, that is,
over the energy surface by the expression f , where f (q, d(t) = d(0)exp(kt), indicative of trajectory instability.
p) is any dynamical property. Then, This set of properties gives rise to useful theoretical
indicators of irregular motion, but connections with actual
1. lim f [q(t), p(t)] = f t → ∞. experimental observables are not well established.
2. The correlation between any two dynamical From the viewpoint of measurement, if one were able
properties, that is, to prepare a single trajectory as the initial state of an ir-
regular system, then the subsequent measurement of any
g[q(t), p(t)], f [q(0), p(0)] property, other than energy, would show continual varia-
− g[q(t), p(t)] f [q(0), p(0)] , tion with time. The trajectory would, in addition, display
no tendency to return to the original state over any fi-
goes to zero as t → ∞. nite time. If one prepared an ensemble of trajectories, it
3. Subdivide the total phase space into regular regions of would approach a long-time stationary distribution depen-
particular volume. Then the probability of going from dent solely on energy.
region i to region j in the long-time limit depends
only on the size of the phase space regions i and j.
4. Typical Molecular Systems
Thus, a mixing system satisfies a number of simple
properties that are in qualitative agreement with statistical Both regular and irregular motions are extremes of behav-
relaxation dynamics. A particle of soluble colored material ior, and their relation to the dynamics of realistic systems
stirred into water provides a pictorial analog of mixing dy- has principally been established through numerical com-
namics. Once again, the fluid models the phase space. The puter studies. These studies indicate that many, but cer-
system evolves over time to reach a final macroscopically tainly not all, molecular systems display behavior charac-
invariant distribution of uniformly colored fluid through- terizable as regular at low energies and irregular at higher
out the container. energies. The example of carbonyl sulfide, OCS, is shown
It is unfortunate that the formal definitions of ergodic, in Fig. 12, where the percentage of phase space not show-
mixing, etc. systems involve the infinite time limit. As a ing exponential divergence is shown. The system is seen
consequence, a system may, for example, still be mixing to display a transition to chaotic motion at an energy of
even if relaxation is not observed in the finite time asso- approximately 14,000 cm−1 . By 20,000 cm−1 , close to
ciated with a realistic measurement. This limitation sig- dissociation, almost all of the phase space is irregular.
nificantly reduces the practical utility of formal concepts To appreciate the origin of the regular behavior at
such as mixing behavior. low energies, we note two common approximations
A host of other formal systems with additional, and
hence stricter, requirements have been defined. Here, we
only mention the C-system, which is ergodic and mixing
and which possesses the important characteristic that the
distance d(t) between any two initially close trajectories in
phase space grows exponentially in time. This trajectory
instability leads to the rapid parting of trajectories from
one another and, hence, the inability to predict the dynam-
ics of trajectories, even for a relatively short time period,
from knowledge of the dynamics of their neighbors.
A system that displays characteristics of mixing as
well as exponential divergence of adjacent trajectories
is termed irregular or chaotic. In contrast with the char-
acteristic properties of a regular system, an irregular
system displays (1) trajectories that lie upon the (2M−1)-
dimensional energy hypersurface in phase space (addi-
tional simple constants of the motion such as angular mo-
mentum may also be incorporated), (2) and (3) trajectory
FIGURE 12 Percent regular trajectories as a function of energy
dynamics that cannot be written in terms of a Fourier se- for model OCS. The symbol D denotes the molecular dissociation
ries involving a simple set of discrete frequencies and their energy. [From Carter, D., and Brumer, P. (1982). J. Chem. Phys.
overtones and combinations, and (4) a distance d(t) be- 77, 4208.]
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
in low-energy molecular motion. The first, rotation– The relationships in Eqs. (15) and (16) allow us to
vibration decoupling, assumes that the rotational and vi- rewrite V2 (P, Q) as
brational motions are essentially uncoupled at low ener-
gies, that is, that the Hamiltonian is the sum of vibrational V2 = A(I1 I2 /ω1 ω2 )1/2 [cos(θ1 −θ2 )−cos(θ1 +θ2 )], (18)
and rotational parts: where A is a constant.
H = Hvib + Hrot . (11) Hamilton’s equations of motion [Eq. (4)] then provide
expressions for d Ii /dt that are nonzero due to the cou-
Second, we recall the standard normal-mode procedure pling V2 . In the event that the coupling is small, one may
for small-amplitude vibrational motion wherein, at suffi- approximate the solution for the time dependence of the
ciently small energy, the vibrational Hamiltonian is of the angles as that of the time dependence in the absence of
form the perturbation. This approach, a classical perturbation
Hvib = H0 theory, gives the following result for the time dependence
of Ii (t):
1 2
= Pi + λi Q i2 + V (P, Q), (12) 1/2
2 i I1 I2
I1 (t) = I1 (0) − A
with V (P, Q) sufficiently small to be negligible. The Q, ω1 ω2
P are called normal coordinates and momenta. Thus, low- × cos (ω1 − ω2 )t + θ10 − θ20 (ω1 − ω2 )
energy vibration is well approximated by a sum of M
harmonic-oscillator Hamiltonians. In some instances an − cos (ω1 + ω2 )t + θ10 + θ20 (ω1 + ω2 ), (19)
alternative separable Hamiltonian, composed of the sum where θ10 , θ20 are the initial values of the angles. The quan-
of bond Hamiltonians, provides a superior separable rep- tity I2 (t) is similar, but out of phase. Thus, the action
resentation. In either case, the low-energy vibrational mo- variables oscillate about their unperturbed values with
tion is regular and separable. frequencies (ω1 − ω2 ) and (ω1 + ω2 ). Since ω1 − ω2 is as-
The situation changes dramatically with increasing en- sumed large, the total variation of I1 and I2 as a function
ergy as V (P, Q) becomes larger and the system begins of time is small.
to exchange energy between the decoupled harmonic os- The result is quite different if the system is resonant,
cillators. The subsequent dynamics, as observed in the that is, ω1 = ω2 . In this case, the effect of the perturbation
measurement of the energy in a normal mode, depends in- is more drastic, and energy can be exchanged completely,
timately on the nature of the coupling, which is typically albeit periodically, between the two harmonic oscillators.
expandable in the form There are several reasons why the example treated
V (P, Q) = Vn (P, Q), (13) above is a gross oversimplification of the situation in
n molecules. First, the unperturbed system is assumed har-
where Vn (P, Q) denotes polynomial terms of the form monic, that is, linear in I. Second, the perturbation has been
Q ik P jm with k + m = n. assumed to be composed of a single term. Third, only one
As a simple example of the effect of coupling, consider type of perturbation has been included. We now qualita-
a two-degrees-of-freedom system with tively examine the important effects associated with the
breakdown of these simplifying assumptions.
V (P, Q) = V2 (P, Q) = AQ 1 Q 2 . (14)
It is convenient to first identify the constants of the motion a. Anharmonicity of H0 . In general, H0 is not har-
in the harmonic Hamiltonian and use them as the new monic, but is rather of the general anharmonic form
momenta. Consider then the momenta H (I). As a result, the zero-order system frequencies
ωi (I) = ∂ H0 /∂ Ii are no longer independent of the actions
Ii = (4λi )−1/2 Pi2 + λi Q i2 (15)
I; that is, the frequencies depend on the energy content
and conjugate coordinates of the oscillators. As a consequence, anharmonic systems
1/2 will display regions of the I1 , I2 space where ω1 (I) and
θi = cot−1 −λi Q i Pi . (16)
ω2 (I) are resonant as well as other regions where they are
In these coordinates, H0 assumes the form not. Thus, in the course of the dynamics, the zero-order
system can go into and out of resonance as the energy of
H0 = ω1 I1 + ω2 I2 , (17)
the oscillator varies. Regions of I space where the system
1/2
where ωi = λi . We shall assume ω1 and ω2 to be unequal. is resonant are called resonance zones. Note that despite
These specific types of momenta and coordinates I are the coupling, the dynamics within this resonance region
termed action-angle variables. is regular.
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
C. Quantum Dynamics
Molecules are, of course, properly described by quantum
mechanics, and classical mechanics is recognized as a par-
ticular approximation. Nonetheless, an introductory de-
FIGURE 14 Coordinates defining the C H bond distance (s).
C C bond distances s6 , s1 ; and wag angle β in benzene. [From scription of intramolecular energy transfer via classical
Sibert, E. L., Hynes, J. T., and Reinhardt, W. P. (1984). J. Chem. mechanics has proven useful since it contains few con-
Phys. 81, 1135.] cepts that are truly unfamiliar to the macroscopic world.
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
rate of decay of such a system then entails an analysis of ple, as is the case in some of the available techniques, the
the rate at which the relaxed uniform distribution crosses initial state of the molecule is in itself highly randomized
over into the dissociation region. Such a computation is energetically, then a subsequent measurement that shows
straightforward in both classical and quantum mechanics, that the energy is statistically distributed among the sys-
if a number of approximations on the structure of the tem modes is of little consequence. That is, this particular
molecule are made, and leads to statistical theories, such observation would be a consequence of the preparation,
as RRKM theory, discussed in the bibliography. as distinct from the evolution, of the molecule.
The essence of such theories is, then, that the system The dynamics of molecules with low energy content is
undergoes memory erosion during the dynamics so that usually well described by a Hamiltonian that is the sum
information on initial conditions is lost; only knowledge of independent degrees of freedom, and absorption spec-
of E and J remains. Similar theories have been proposed troscopy has provided considerable information on the
for a wide variety of processes that involve the participa- nature of such systems. The situation with respect to ener-
tion of long-lived molecular intermediates. These include gized molecules is far more complex. A number of exper-
chemical reactions that proceed via collision complexes, imental tools have been utilized over the past 30 years to
photodissociation where molecular preparation is through examine the nature of energy transfer in such systems, as
controlled laser excitation, molecules adhering to surfaces discussed next. Many suffer from the inaccurate knowl-
where detachment is induced via a variety of means, etc. edge of the initial system state. In our brief description
Such theories have the advantage of yielding rather gen- of several experimental methods it will become clear that
eral results, which are amenable to both theoretical and experimental tools have been rapidly developing over the
experimental analysis. For example, in the case of uni- past few years and that an explosion of highly informative
molecular decay, the rate constant for dissociation is found experimental data is now underway.
to increase with increasing energy and decrease with the First and foremost, note that interest is in the nature of
number of participating degrees of freedom in the system. intramolecular dynamics of molecules in isolation. That
Experimental studies on the validity of such theories is, observations must be made over a time scale where
have been ongoing for many years, as discussed briefly in the molecule does not collide with others in the reaction
the next section. The theoretical examination of the valid- vessel. Modern techniques allow very low pressures un-
ity of such approaches is more recent and links directly der which such measurements can be made. Most desir-
to the issues discussed in Section III. At present, classi- able among these methods are beam techniques in which
cal mechanical studies have shown the possibility of both molecules are studied in a low-density beam produced,
statistical and nonstatistical decays, depending on the de- for example, by vaporizing molecules in an oven. Exper-
gree and extent of exponential divergence of trajectories iments prior to this “beam age” (circa 1960) often in-
in phase space. The greater the degree and extent of expo- ferred information about intramolecular dynamics from
nential separation, the closer the agreement with statistical bulk data, which contained effects due to collisions, with
approaches. Similarly, quantum-mechanical studies have resultant loss in accuracy.
shown that model systems can display unimolecular rate Measurement techniques in typical experiments can be
constants whose energy dependence is inconsistent with subdivided into two categories. The first and most modern
that predicted by simple statistical theories. It is fair to probes the bound molecular dynamics directly, for exam-
say, however, that a clear understanding of the interrela- ple, by observing radiation emitted or absorbed during
tionship between molecular properties and the validity of the course of the dynamics. The second infers informa-
statistical theories is in its early stages of development. tion about the nature of bound molecular dynamics by
indirect means, typically by analyzing the outcome of a
process that involves the molecule of interest as a long-
V. EXPERIMENTAL STUDIES lived intermediate. These latter types of measurement are
readily clarified by considering the NaCl + KBr example
The ideal experiment on intramolecular energy transfer, discussed in Section II. Specifically, this particular reac-
as yet unachieved, entails a number of simple features. tion proceeds via the bound NaC1KBr intermediate to
Specifically, the molecule is prepared in a well-defined yield two different sets of products, either NaCl + KBr
and well-characterized state and evolves for a known time or NaBr + KCl. The probability of observing these two
interval, after which the state of the system is precisely de- product “channels” depends on the nature of the NaCIKBr
termined via a high-finesse experimental probe. The im- dynamics. That is, the product ratio will provide insight
portance of each of these components to the resolution of into whether intramolecular energy transfer in NaCIKBr
even the most qualitative of questions (e.g., is the energy is rapid or not. More extensive measurements will entail
transfer statistical or not) should be clear. If, for exam- analysis of the internal states of the product molecules
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
(see Fig. 5), as well as of the relative velocities of the information to the molecule upon which they impinge. As
products. Thus, these kinds of measurements allow one to a consequence, by controlling the phases of these laser
infer information on the characteristics of the dynamics of sources, one can affect the nature of the molecular dy-
the intermediate without directly observing it. Other ex- namics. More careful examination indicates that the ex-
amples of this kind include measurements on the products perimentalist must control the relative phases of two laser
of unimolecular decay, photodissociation, etc. sources, rather than the absolute phase of a single source,
Similarly, most experiments on intramolecular energy a far more feasible prospect.
transfer fall into one of two categories with respect to These proposals for the control molecular dynamics and
preparation of the molecule. In the crudest, the system chemical reactions rely heavily on quantum interference
is prepared by a “coarse” technique where little detail phenomena similar to that seen in the famous “double-
about the initial molecular state is available. Such, experi- slit” experiment. As a consequence they herald a new age
ments include preparation via collision that is, where the in molecular reaction dynamics, one in which quantum
molecule of interest, A, collides with, and absorbs energy aspects of molecular motion are utilized to alter molecular
from, another molecule B and preparation via reaction, dynamics.
where the molecule A is the product of a “precursor” re-
action. Although the bulk of early work on intramolecular
dynamics was carried out with these techniques, far greater VII. SUMMARY
insight emerges from modern experiments in which the
molecule A is prepared by the absorption of radiation. Understanding the nature of intramolecular energy flow
The optimum experiment would therefore proceed by in isolated molecules is of great practical and fundamen-
preparing the molecules in a precise state using beam tal interest. Early developments, both theoretical and ex-
methods and laser excitation, followed by measurement perimental, were hampered by a number of technolog-
of the radiative emission as well as other properties ical problems now being overcome. As a consequence,
of the bound molecule. Such experiments are, in fact, general features that determine the rate and extent of in-
underway on a variety of molecules in several labora- tramolecular energy transfer are slowly emerging. These
tories around the world. In addition, information on the include generic features of classical Hamiltonian systems
bound-state dynamics of molecules has emerged from and the way in which coupling terms influence the na-
pump-probe techniques in which two lasers are utilized, ture of the dynamics, the dependence of observed energy
one to prepare the molecule in the desired state and the transfer on the zero-order system, the interaction between
second to interrogate the dynamics. Along with these state preparation and state measurement on the qualitative
experimental developments, we note a need for reliable interpretation of the dynamics, and the differences and
theories to understand the interrelationship between the similarities between the quantum and classical views of
observed features and the nature of the dynamics. Such dynamics. Nonetheless, general rules regarding the rates
developments are in progress. and extent of intramolecular energy flow have yet to be
established. Similarly, a number of fundamental issues
arising in the study of intramolecular energy flow have
VI. CONTROL OF DYNAMICS yet to be resolved. Rapid technological developments in
computational and experimental tools hold great promise
We noted, in Section I, that the study of intramolecular en- for substantial developments over the next decade.
ergy transfer is linked to the practical goal of controlling
the dynamics of molecules. Since the 1980s theoreticians
have made giant strides which make control over mol- ACKNOWLEDGMENT
lecular dynamics feasible. Space restrictions prevent any-
thing other than a brief comment; the reader is referred to We are grateful to the donors of the Petroleum Research Fund, admin-
the Accounts of Chemical Research reference for further istered by the American Chemical Society, for partial support of the
details. research on which this overview is based.
Equation (27) makes clear that the dynamics of a molec-
ular process is intimately linked to the phases of the system
preparation, contained in the di, j coefficients. It is clear, SEE ALSO THE FOLLOWING ARTICLES
then, that if one were able to control the phases and mag-
nitude of these terms, the subsequent system dynamics ATOMIC AND MOLECULAR COLLISIONS • CHAOS • CHEM-
would also be controlled. Recently, a number of theoreti- ICAL KINETICS, EXPERIMENTATION • CHEMICAL PHYSICS
cians have noted that coherent laser sources transfer phase • COLLISION-INDUCED SPECTROSCOPY • DYNAMICS OF
P1: ZBU Final Pages
Encyclopedia of Physical Science and Technology EN005H-225 June 15, 2001 20:36
ELEMENTARY CHEMICAL REACTIONS • MECHANICS, Gruebele, M., and Bigwood, R. (1997). Int. Rev. Phys. Chem. 17, 91.
CLASSICAL • QUANTUM MECHANICS • STATISTICAL ME- Gruebele, M. (2000). Adv. Chem. Phys. 114, 193.
Lehmann, K. K., Scoles, G., and Pate, B. H. (1994). Annu. Rev. Phys.
CHANICS
Chem. 45, 241.
Levine, R. D. (1969). “Quantum Mechanics of Molecular Rate
Processes,” Oxford Univ. Press, London.
BIBLIOGRAPHY Noid, D. W., Koszykowski, M. L., and Marcus, R. A. (1981). Annu. Rev.
Phys. Chem. 32, 267.
Brumer, P. (1981). Adv. Chem. Phys. 47, 201. Rice, S. A. (1975). “Excited States” (E. C. Lim, ed.), Vol. 2, Academic
Brumer, P., and Shapiro, M. (1988). Adv. Chem. Phys. 70, 365. Press, New York.
Brumer, P., and Shapiro, M. (1989). Accts. Chem. Res. 22, 407. Rice, S. A. (1981). Adv. Chem. Phys. 47, 117.
Faraday Discuss. Chem. Soc. (1983). 75. Stechel, E. B., and Heller, E. J. (1984). Annu. Rev. Phys. Chem. 34,
Felker, P. M., and Zewail, A. M. (1988). Adv. Chem. Phys. 70, 265. 563.
Forst, W. (1973). “Theory of Unimolecular Reactions,” Academic Press, Wyatt, R. E., Iung, C., and Leforestier, C. (1995). Accts. Chem. Res. 28,
New York. 423.
P1: FJU Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
Hydrogen Bond
Krzysztof Szalewicz
University of Delaware, Newark, Delaware
I. Historical Perspective
II. Quantum Mechanical Description of Hydrogen
Bond
III. Analytic Representations of Potential Surfaces
IV. Nature of Hydrogen Bond
V. Properties of Hydrogen Bonds
VI. Types of Hydrogen Bonds
VII. Hydrogen Bonds in Clusters
VIII. Hydrogen Bonds in Solids
IX. Hydrogen Bonds in Liquids
X. Proton Transfer
XI. Hydrogen Bonds in Biological Structures
505
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
FIGURE 2 A generic one-dimensional interaction potential. The equilibrium separation Re and the separation Rσ ,
where the potential changes from repulsive to attractive, are marked. De denotes the potential depth at the equilibrium.
The position of the ground vibrational level and the ground vibrational wave function are indicated. D0 is the dissociation
energy.
energy, is of quantum mechanical origin. It results from minimum on the potential energy surface. Thus, the use-
correlations of the fluctuations of the electronic charges fulness of the hydrogen bonding concept may simply re-
on monomers A and B. The exchange energy is also of sult from our ability to predict structures of dimers and
quantum origin. This component is the consequence of the larger clusters. A further reason for recognizing hydro-
electrons tunneling through the potential barrier between gen bonding as a phenomenon worth a separate treatment
the monomers. The name follows from the fact that the are spectroscopic properties of the X-H stretching mo-
electrons are exchanged between monomers during this tion when the hydrogen atom participates in a hydrogen
process. Sometimes the exchange energy is interpreted bond. The fundamental frequency of this vibration is sig-
in terms of the “repulsion of the electron clouds.” This nificantly lowered (red shifted) in the dimer compared to
picture—which derives from the fact that the exchange that in the isolated monomer. The shift is very sensitive to
energy is proportional to the overlap of electronic charge the molecular environment and therefore provides a ma-
distributions—does not reflect the physics of the interac- jor tool to investigate the structure of hydrogen-bonded
tion as well as the tunneling picture. clusters, liquids, and solids.
Hydrogen bonds result from the same physical forces as The hydrogen bond has been the subject of numerous
all other intermolecular interactions. Thus, from this point review papers and monographs. It is always treated in
of view there would be no need to define hydrogen bond- works devoted to intermolecular interactions [see, e.g.,
ing as a distinct process. Therefore, the phenomenon of Hobza and Zahradnik (1988), Stone (1996), Jeziorski and
hydrogen bonding is related to the structural characteris- Szalewicz (1998), Müller-Dethlefs and Hobza (2000), as
tics of hydrogen-bonded dimers rather than to the physical well as a special issue of Chemical Reviews (1994)].
nature of the interaction. As we will discuss below, if two Monographs restricted to the hydrogen bond include
monomers can be brought together to form a hydrogen- Schuster (1984), Jeffrey (1997), Scheiner (1997), and
bonded structure, this structure will likely be close to the Desiraju and Steiner (1999).
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
I. HISTORICAL PERSPECTIVE date back to the 1950s). The method of neutron diffrac-
tion has an advantage over the X-ray diffraction—it gives
The subject of hydrogen bond (or bonding) is certainly a information about the positions of hydrogen atoms. In the
popular one. The ISI Science Citation database lists more 1980s high-precision near-infrared spectra of clusters in
than 26,000 papers that use this phrase in the title, ab- molecular beams gave reliable information on the frequen-
stract, or as a keyword. In the year 1999 alone there were cies of intermonomer vibrations in hydrogen-bonded clus-
about 3,000 such papers published. The importance of the ters. In the 1990s it became possible to measure the same
hydrogen bond stems mainly from the fact that for certain frequencies directly using the techniques of far-infrared
types of molecules formation of such bonds determines spectroscopy. Very recently these methods have enabled
the structure of the dimers or of larger molecular clusters. a rather complete spectroscopic characterization of small
Thus, hydrogen bonds are characteristic features of many clusters such as the water dimer.
clusters, biological aggregates, and of condensed phases.
Chemical structures containing what we now call hy-
drogen bonds were considered already in the early 1900s.
II. QUANTUM MECHANICAL DESCRIPTION
The concept itself appeared around 1920 in works of
OF HYDROGEN BOND
Huggins and of Latimer and Rodebush. The term hydro-
gen bond was used for the first time by Pauling in the early
The hydrogen bond can be completely described from
1930s. In the same period it became clear that anomalous
first principles by solving the Schrödinger equation for
properties of bulk water are due to the formation of hydro-
a set of molecules. In practice such solutions involve sev-
gen bonds. An important paper on this issue was published
eral approximations. Despite these approximations, with
in 1933 by Bernal and Fowler (despite their not using the
the current computer capabilities the solutions predict
term hydrogen bond). The idea was then extended to other
properties of small clusters with accuracy approaching,
“associated” fluids. The hydrogen bonding concept was
in some cases, experimental accuracies and provide ex-
popularized by Pauling’s 1939 book Nature of Chemical
tremely useful information on hydrogen-bonded systems,
Bond.
including information on systems and properties that can-
While initially the experimental evidence of hydrogen
not be measured. In addition to numerical information,
bonding was coming from thermodynamic measurements
quantum mechanics provides the framework necessary to
of anomalous properties and from X-ray measurements
understand the hydrogen-bond phenomenon. The theoreti-
of crystal structure, in the 1930s it was realized that the
cal approach most useful for an analysis of hydrogen bond-
formation of a hydrogen bond has a profound effect on
ing is symmetry-adapted perturbation theory (SAPT). For
the frequency of the X-H stretch. This started the in-
a more detailed presentation of SAPT and references to the
frared investigations of hydrogen bonds, which became
original papers, see Jeziorski and Szalewicz (1998). This
the most sensitive and the most widely applied exper-
approach serves four main purposes: (a) it provides the
imental method of studying this phenomenon in clus-
basic conceptual framework within which intermolecular
ters and in the liquid and solid phases. In the condensed
interactions—including hydrogen bonds—are discussed;
phase the vibrational spectra are often determined with
in particular it provides the standard division of the inter-
the Raman techniques, which detect lines corresponding
molecular interaction into four fundamental components:
to vibrational transitions in scattered radiation of visi-
electrostatic, induction, dispersion, and exchange; (b) it
ble frequencies. Since the early 1950s the nuclear mag-
is the source of models used in the construction of em-
netic resonance (NMR) spectroscopy has been applied to
pirical potentials (empirical force fields); (c) it provides
hydrogen-bonded systems. This technique is, however,
asymptotic constraints on any potential energy surface de-
utilized less often than the infrared spectroscopy due to
rived either from experiment or from theory; and, finally,
complexity of the spectra. In the 1960s the microwave
(d) it can accurately predict the complete intermolecular
spectra in gas phase for some fairly strongly bound clus-
potential energy surfaces for hydrogen-bonded molecu-
ters were measured and gave precise information about
lar complexes. The last goal can also be obtained using
structures of the clusters. Starting in the 1970s, molecular
the so-called supermolecular approach [Chalasinski and
beam techniques provided a major tool for investigating
Szczesniak (1994), van Duijneveldt et al. (1994)].
small clusters. Spectroscopic measurements in molecular
beams produced—via the rotational spectra—a wealth of
information on geometries of clusters, including even very
A. Interaction Energy
weakly bound ones. Also in the 1970s, neutron scattering
techniques were applied to the condensed phase contain- To study hydrogen bonding it is sufficient to use the
ing hydrogen bonds (although the earliest such studies time-independent, nonrelativistic Schrödinger equation.
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
As usual for most of chemistry, the Born-Oppenheimer dinates. The set of equilibrium coordinates, which will be
approximation is assumed. This approximation relies on denoted by re , describe the geometry of the monomer at its
the observation that the electronic motion is orders of mag- potential energy minimum. Another reasonable choice of
nitude faster than the nuclear motion. Therefore, the mo- intramonomer coordinates is the geometry averaged over
tion of electrons is computed by solving the electronic monomer vibrations, r 0 . The two geometries will be dif-
Schrödinger equation with nuclei clamped in space. The ferent since the monomer vibrations are always to some
solution of this equation provides the potential energy sur- extent anharmonic. However, intuitively one can expect
face governing motion of the nuclei. For investigations of that the best effective potential can be obtained by av-
intermolecular interactions, this surface is conveniently eraging of the complete, monomer geometry-dependent
divided into the energies of monomers and the interaction potential over an appropriate vibrational wave function of
energy E int . In the dimer case the monomer. Computation of such an averaged potential,
E int 0 , although as expensive as the computation of the
E AB (R, ω A , ω B ; Ξ A , Ξ B ) = E A (Ξ A ) + E B (Ξ B ) complete potential energy surface, represents a useful task
+ E int (R, ω A , ω B ; Ξ A , Ξ B ), (1) because its availability simplifies dramatically the spectro-
scopic, scattering, and bulk phase computations. For atom-
where E AB is the total energy of the dimer, E X is the total diatom complexes the E int 0 potentials predict spectral
energy of monomer X, ω X denotes the Euler angles defin- quantities that are only about 0.1% different from those
ing the orientation of monomer X in a dimer-embedded obtained from full three-dimensional nuclear dynamics
coordinate system, and Ξ X is the set of internal coordi- calculations. Recently it has been shown [Jeziorska et al.
nates of each monomer. The energies entering Eq. (1) are (2000)] that the r 0 geometry is the optimal choice if
solutions to the electronic Schrödinger equation. It should only a single monomer geometry can be considered. The
be noted that in definition (1) the energies of the dimer spectra of atom-diatom complexes computed using r 0
and of the monomers correspond to the same values of the potentials exhibit deviations from the spectra computed
internal coordinates Ξ X . with three-dimensional potential, which are about four
In most cases of hydrogen-bonded systems, the mono- times smaller than the analogous deviations produced by
mers are nearly rigid compared to the dimer (i.e., the am- the re potentials.
plitudes of intramonomer motions are much smaller than The rigid-monomer approximation will not work well
those of the intermonomer ones). This is reflected in the if the monomers are too floppy and for complexes involv-
monomer vibrational frequencies being much higher than ing charged monomers. In the latter case the reason is
the intermolecular ones. Thus, one can expect that the de- that the monomers may be fairly significantly distorted
pendence of the interaction energy on the intramonomer upon the formation of the complex. For biopolymers, cer-
coordinates can be neglected and that a large body of spec- tain intramolecular coordinates do vary significantly, and
troscopic, scattering, and bulk phase experiments can be these coordinates have to be included in the description of
interpreted in terms of effective potentials depending on hydrogen bonds in such systems.
intermolecular coordinates only. This approximation leads In the supermolecular method the interaction energy is
to a dramatic simplification in studies of hydrogen-bonded computed by subtraction of the individual energies E AB ,
complexes. For general complexes consisting of N atoms, E A , and E B . These energies are in practice calculated us-
there are 3N nuclear degrees of freedom, (i.e., 3N coor- ing finite basis sets. As a result, the monomer part of the
dinates are needed to describe the position of the system dimer energy is improved by utilizing the basis functions
in space). However, different locations of the center of of the interacting partner. This leads to a spurious lower-
mass of the complex in space as well as rotations of the ing of the interaction energy, referred to as the basis set
whole system do not change interaction potentials. Be- superposition error (BSSE). This error can be removed us-
cause of these translational and rotational invariances, the ing the counterpoise (CP) technique introduced by Boys
energy surface of a system containing N atoms depends and Bernardi. Denoting by E Sσ (G) the electronic energy
on 3N − 6 coordinates (3N − 5, i.e., one coordinate for of system S at geometry G computed with basis set σ , the
a diatomic dimer like Ar2 ). Even for relatively small sys- definition of the CP correction takes the following form
tems like the water dimer 3N − 6 = 12 is a large number β α∪β α∪β
δ E CP = E αA (AB) + E B (AB) − E A (AB) − E B (AB),
of degrees of freedom to treat. The “frozen” monomer ap-
proximation reduces the number of degrees of freedom to (2)
only 6 for any dimer consisting of two general molecules. where α and β stand for the basis sets used for monomers
The simplest and apparently most natural way of ob- A and B, respectively. The use of AB geometry for a
taining rigid-monomer potentials is to perform interaction monomer means that the nuclei of this monomer are at the
energy calculations assuming equilibrium monomer coor- same relative positions as in the dimer AB, and the basis
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
functions originating from the interacting partner (often and ρ B (r ) of the monomers, showing clearly its relation
called the “ghost” functions) are located at the same spatial to the Coulomb law
positions as in the dimer. The SAPT approach by definition
(1) 1
does not include BSSE, since it calculates the interaction E elst = ρ A (r 1 ) ρ B (r 2 ) d 3r 1 d 3r 2 . (6)
energy directly rather than by subtraction of monomer |r 1 − r 2 |
energies from the energy of the dimer. This feature made The total electric charge distribution ρ X (r ) for monomer
it possible to prove the correctness of the CP method by X is the sum of the electronic contribution—which can
comparisons of supermolecular and SAPT calculations be obtained from the wave function 0X —and the nuclear
[van Duijneveldt et al. (1994)]. contribution.
The electrostatic interaction plays a major role in de-
B. Perturbation Theory of Hydrogen Bond termining the structure of dimers consisting of polar
molecules, in particular hydrogen-bonded systems. The
In the SAPT method, the total Hamiltonian H of the dimer evaluation of the electrostatic interaction energy for such
is partitioned as H = H A + H B + V , where H X , X = A or systems is often performed by approximating the elec-
B, is the total Hamiltonian for monomer X , and V is the trostatic potential of a molecule by that resulting from
intermolecular interaction operator. The operator V col- a set of point charges or from a single-center or multi-
lects Coulomb interactions of all the particles of monomer center distribution of multipole moments. However, one
A with those of monomer B. This partition means that the should emphasize that the electrostatic energy contains
unperturbed operator is chosen as H0 = H A + H B and V also important short-range terms due to the mutual pen-
is the perturbation. The interaction energy is then obtained etration (charge overlap) of monomers’ electron clouds.
directly in the form of a perturbation series in V , This short-range part of the electrostatic energy, neglected
(1)
E int = E pol (1)
+ E exch (2)
+ E pol (2)
+ E exch + ···, (3) both in the monocentered and distributed multipole expan-
sions, makes significant contributions to the stabilization
with each term of the sum having a well-defined physical energy of hydrogen-bonded systems. The electrostatic in-
meaning. The polarization components, denoted by the teraction can be either attractive or repulsive. This property
subscript “pol,” are determined by the standard Rayleigh- is best illustrated by large-R interactions of two neutral
(1)
Schrödinger perturbation expansion. The correction E pol molecules possessing dipole moments. The electrostatic
is the classical electrostatic (Coulomb) interaction energy dipole– dipole term dominates then the interaction energy.
of two unperturbed charge distributions and will be written This term can be either attractive or repulsive, depending
(1)
as E elst . The remaining components, denoted by the sub- on the mutual orientation, with the maximum and mini-
script “exch,” are the exchange corrections accounting for mum of equal magnitudes.
the effect of resonance tunneling of electrons between the Often the electrostatic interaction energy alone, or
interacting systems. These contributions originate from rather its asymptotic form, is used to determine the approx-
the antisymmetrization of the polarization corrections to imate equilibrium orientation (but not the equilibrium sep-
the wave function. The second-order corrections can be aration) of hydrogen-bonded clusters. Due to the impor-
further divided into induction and dispersion components: tance of this interaction, such predictions are frequently
(2)
E pol (2)
= E ind (2)
+ E disp , (2)
E exch (2)
= E exch−ind (2)
+ E exch−disp . correct, and a few examples will be given in section V.
The electrostatic predictions are not correct when other
(4) components of the interaction energy have anisotropies
The polarization energies through second order have a significantly different from the anisotropy of the electro-
simple physical interpretation and can be rigorously ex- static component (cf. section V). In fact, those anisotropies
pressed through monomer properties. are always to a smaller or larger extent different so that
the exact minimum structure of a hydrogen-bonded cluster
1. Electrostatic Interaction can be found only by taking into account all of them.
The electrostatic energy, the lowest order polarization
component, is defined as 2. Induction Interaction
(1)
E elst = 0A 0B V 0A 0B , (5) (2)
The second-order polarization energy E pol is given by the
where 0X is the expression
unperturbed wave function of monomer X
and f | g = f ∗ gdτ with the integration extending over
0A 0B V KA LB 2
all electron coordinates. The electrostatic energy can be (2)
E pol = , (7)
expressed in terms of the total charge distributions ρ A (r ) KL E 0A + E 0B − E KA − E LB
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
where K and L label excited states of the monomer A and ponent any further. In contrast, the difference between the
B, respectively, and the prime over the summation sign in- dimer electron density and the sum of the densities of the
dicates that the term with K = 0 and L = 0 is excluded. two monomers is perfectly well defined and can be com-
Eigenfunctions M X
and eigenenergies E M X
are solutions puted to interpret experimental observations related to the
of the Schrödinger equation, with the Hamiltonian H X for charge density, like, for example, NMR spectra.
monomer X describing the ground and excited states of
this system. The induction energy is obtained when the
sum over states in Eq. (7) is restricted to functions ex- 3. Dispersion Interaction
(2)
cited on only one monomer (i.e., to the functions of the The second-order dispersion energy E disp is defined as
(2)
form 0A LB and KA 0B ). The corresponding part of E pol , the difference between the second-order polarization and
(2) (2) (2) (2)
denoted by E ind , is given by E ind = E ind (A) + E ind (B), (2)
induction energies, E disp (2)
= E pol (2)
− E ind . Therefore it can
where be written as
(2)
E ind (A) = 0A B (1)
ind (A) (8) (2)
0A 0B V KA LB 2
E disp =
(2)
and a similar expression holds for E ind (B). The induction K =0 L =0 0
E A + E 0B − E KA − E LB
(1)
function ind (A) is defined by
= 0A 0B V (1)disp , (10)
(1)
KA B 0A
ind (A) = KA . (9) where
E A
− E A
K =0 KA LB V 0A 0B
0 K
(1) = KA LB , (11)
The operator B = 0B |V 0B is the electrostatic poten- disp
K =0 L =0
E 0A + E 0B − E KA − E LB
tial generated by the unperturbed monomer B.
Equation (8) has the form of the second-order en- is the “dispersion function” representing the leading in-
ergy correction arising when monomer A is perturbed termolecular correlation contribution to the dimer wave
by the static electric field generated by the (unperturbed) function. This function is a sum of products of wave func-
monomer B. Notice that Eqs. (8) and (9) include only tions that are describing the electronic excitations on both
the coordinates of electrons belonging to system A, all the monomer A and B; therefore the dispersion inter-
the effects of system B entering via the potential B . The action can be viewed as the stabilizing energetic effect
second-order induction energy results, thus, from the po- of the correlation of instantaneous multipole moments of
larization of the monomers by the static electric fields the monomers. Since the dispersion energy is a correlation
of unperturbed partners (in older literature the induction effect, it cannot be reproduced at the Hartree–Fock level of
contribution is sometimes referred to as the polarization theory. The dispersion force results from the dependence
energy). Asymptotically, at large intermolecular distances of electrons of system B on the position of an electron in
R, this effect is fully determined by the permanent multi- molecule A; therefore it goes beyond the averaged charge
pole moments and static multipole polarizabilities of the distributions characteristic of the Hartree–Fock method.
monomers. At finite R, additional monomer information The dispersion energy is usually the most isotropic com-
is needed to account for the short-range, penetration part ponent of the interaction in hydrogen-bonded clusters.
(2)
of E ind . Although it does not have a large effect on the cluster’s
Because monomers that form hydrogen bonds almost equilibrium orientation, it contributes significantly to the
always are very polar, the induction energy for hydrogen- energy of hydrogen bonds, being of similar size to the in-
bonded clusters makes a very significant contribution to duction energy for medium-size systems but playing an
the total interaction energy. It is also quite anisotropic al- increasingly larger role as the system size grows.
though less than the electrostatic energy. Some authors
proposed to identify within the induction energy a term
4. Exchange Interaction
called the charge transfer energy. This term was assumed
to describe the interactions due to the transfer of a part The sum of low-order polarization corrections, even when
of the electronic charge from monomer A to monomer evaluated exactly without the use of the multipole ex-
B or vice versa. Although such a transfer certainly takes pansion, is not able to predict the existence of the van
place to some extent, a quantitative determination of this der Waals minimum or the repulsive wall at shorter inter-
component proved to be not possible thus far. Methods molecular separations. The repulsive contributions are due
proposed by various authors lead to dramatically differ- to the electron exchange (i.e., to the physical process of
ent estimates of the size of this effect. Therefore, we will the [resonance] tunneling of electrons between interacting
not discuss the hypothetical charge-transfer energy com- systems). The unperturbed function 0 = 0A 0B as well
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
1 ∞
l<
(−1)l B (l A + l B )!
= √
|r 1 − r 2 | l A ,l B =0 m=−l< (2l A + 1)(2l B + 1)(l A + m)!(l B + m)!(l A − m)!(l B − m)!
the potential minimum these effects are, however, always where ri , θi , φi are the polar coordinates of ith parti-
very important. In the case of hydrogen-bonded dimers cle and l< denotes the smaller of l A and l B . The coor-
near the potential minimum, the electrostatic, induction, dinates of particle 1 are measured in system A, while
and dispersion effects are all negative (the latter two are those of particle 2 are measured in system B. The two
in fact always smaller than zero). The positive exchange coordinate systems have their z axes along the same line,
energy cancels a large part of the attractive effect. In many and the other axes of one system are parallel to the
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
respective axes of the other system. The functions proximation to the interaction energy. The distance where
Ylm (θ, φ) are the standard spherical harmonics. The ex- this starts to happen is significantly larger than the equi-
pansion is valid only when R > r1 + r2 . Combining the librium separation. The quantitative measure of the devi-
equivalents of Eq. (13) for other Coulombic terms in the ation can be obtained by comparing the results of SAPT
intermolecular interaction operator V , one gets the asymp- calculations and of the multipole expansion for a given
totic expansion of V . Individual terms in this expansion configuration. At large enough distances SAPT agrees
are interpreted as arising from the interaction of a 2l A - with the asymptotic expansion arbitrarily well. In addi-
pole moment on monomer A (i.e., an instantaneous dipole, tion, SAPT electrostatic, induction, and dispersion en-
quadrupole, etc., moment formed by an electron) with a ergy components can be directly compared with analo-
2l B -pole moment on monomer B. Therefore the expansion gous components of the asymptotic expansion, and the
in Eq. (13) is often called the multipole expansion of the agreement is reached for each component separately. As
potential. the distance decreases, the exchange effects are becoming
Equation (13) shows that after the multipole expansion important so that the total interaction energies predicted
of the potential is performed, the coordinates of electrons by SAPT and by the asymptotic expansion begin to dif-
of molecule A are separated from those of molecule B. fer. The individual components begin to differ as well,
Therefore, if this expansion is substituted in Eqs. (5), (8), although these differences are smaller than the exchange
(10), and (11), the asymptotic interaction energy is ex- effects. The discrepancies in the components appear be-
pressible in terms of integrals each involving only coor- cause the terms neglected when assuming the multipole
dinates of one monomer. Thus, this energy is expressible expansion of the interaction potential are not negligible
in terms of monomer properties only. This fact makes the anymore. These terms are related to the overlap of electron
calculations in the asymptotic region much easier than for charge distributions and therefore are called the overlap
the finite distances, where this approximation cannot be or penetration effects. When analytic forms of the po-
applied. It is important to realize that the multipole ex- larization components are developed using the multipole
pansion has built-in dependence on the mutual orientation expansion, the penetration effects have to be taken into
of two monomers. Thus, asymptotically the anisotropy of account. Most often this is done by multiplying the 1/R
the polarization part of the interaction energy is precisely terms by the so-called damping functions, decreasing the
predicted by this expansion. Although for finite separa- magnitude of these components for smaller R. In addition
tions this anisotropy is modified by the penetration effects to damped 1/R terms, the polarization energies contain
discussed below, the asymptotic prediction remains useful also purely exponential components that have to be in-
at finite R. cluded for an accurate modeling of these energies. The
The lowest power of 1/R appearing in the asymp- type of analysis of the interaction energy presented here is
totic expansion determines the behavior of the energy very useful for obtaining analytic fits to potential energy
for very large separations. Monomers forming hydrogen surfaces.
bonds almost always have nonzero dipole moments. The
electrostatic dipole–dipole interaction becomes then the
dominating long-range term at very large R, because it de- C. Many-Body Effects
cays as 1/R 3 . The consecutive electrostatic terms, dipole–
quadrupole and quadrupole–quadrupole, decay as 1/R 4 The interaction energy of a system consisting of N
and 1/R 5 , respectively. For neutral monomers with non- molecules can be defined similarly to the dimer energy
vanishing dipole moments, the induction energy decays of Eq. (1) as
as 1/R 6 , whereas the dispersion energy always decays as
E int (ξ 1 , . . . , ξ N ) = E tot (ξ 1 , . . . , ξ N ) − E i (Ξi ), (14)
1/R 6 (i.e., both components decay faster than the elec-
i
trostatic interaction of polar monomers). Thus, the elec-
trostatics strongly dominate the long-range interaction where E tot is the total energy of the N -mer, E i is the energy
energy for hydrogen-bonded systems. For charged sys- of the ith monomer, and ξi = (Ri , ω i , Ξi ) stands for the
tems the role of electrostatics is even more important: set of all coordinates needed to specify the spatial position
the charge–charge, charge–dipole, and charge-induced Ri , orientation ω i , and the internal geometry Ξi of the ith
dipole terms decay as 1/R, 1/R 2 , and 1/R 4 , respectively. monomer. The N -mer interaction energy can be expressed
Notice that for polar monomers at some angular orien- as a sum over 2, 3, . . . , N -body interactions:
tations the interaction energy will be positive rather than
negative for large enough R due to the positive dipole– E int = E int [2, N ] + E int [3, N ] + · · · + E int [N , N ], (15)
dipole contribution.
When the intermolecular distance gets sufficiently where K -body contributions to the N -mer energy,
short, the multipole expansion fails to provide a good ap- E int [K , N ], can be written as the following sums:
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
III. ANALYTIC REPRESENTATIONS large exponent would do—except that it speeds up calcula-
OF POTENTIAL SURFACES tions since (R 6 )2 = R 12 . The potentials of Eq. (18) or (19)
can represent isotropic interactions of apolar systems. For
Knowledge of the interaction potential for a hydrogen- modeling the hydrogen-bonded interactions, the potentials
bonded system on a grid of points would not be convenient have to be supplemented by at least the multipole inter-
for most applications. Therefore, analytic representations actions, dipole–dipole, dipole–quadrupole, etc. Alterna-
of this potential are needed (i.e., reasonably simple math- tively, one may assign to chosen sites on each monomer—
ematical expressions depending on the intermolecular and typically atoms but other choices are possible—charges
possibly intramolecular coordinates). Parameters in such simulating the multipoles of the monomer. Most of the
an analytic potential can be fitted to the computed inter- popular potentials fitted to thermodynamic properties of
action energies. Alternatively, the parameters can be ad- liquids are a combination of an isotropic term of the form
justed by comparing predictions of a trial potential to the of Eq. (18) or (19) and of terms describing the Coulombic
observed quantities. The existing potentials for hydrogen- interactions of a set of distributed charges. Examples in-
bonded systems range from very simple functions con- clude the SPC potentials of Berendsen et al. and the TIP
taining only a few terms to rather elaborate functionals potentials of Jorgensen et al. developed in the 1980s to de-
with dozens of parameters. The simplest forms of the in- scribe liquid water. More accurate representations of the
termolecular potentials are the Lennard–Jones potential interaction potentials, in particular those fitted to ab ini-
A B tio calculations or to spectroscopic data, have to assume
E int (R) =12
− 6 (18) more complicated forms. An example of such a potential
R R
can be the SAPT-5s potential for water [Groenenboom et
and the so-called “exp-6” potential al. (2000)] which is of the form:
C
E int (R) = A e−α R − , (19) qa q b
R6 E int = f 1 δ1ab , Rab + Aab g ab (Rab )e−βab Rab
where A, A , B, C, and α are adjustable parameters. The a∈A,b∈B
R ab
where the sum extends over eight sites (five of them sym- better than the effective two-body potentials. Applications
metry distinct) a(b) in molecule A (B). Three of the sites of the former potentials in condensed phase simulations is
are on the atomic nuclei, two are in the region of lone a few times more time consuming than calculations with
pairs, and the three remaining are on the bisector plane the latter ones. Obviously, the many-body effects have also
between the hydrogens. Not all sites are involved in all to be taken into account if ab initio two-body potentials
interactions. The first term represents the Coulombic in- or the potentials fitted to dimer spectra are to be used in
teractions between charged sites. The exponential terms condensed phase simulations.
are multiplied by polynomials
3
g(R) = 1 + am R m IV. NATURE OF HYDROGEN BOND
m=1
To see how the interplay between the four fundamental
for increased flexibility. In addition to the leading induc-
physical forces creates the hydrogen bond, we will con-
tion and dispersion 1/R 6 terms, also the terms with inverse
sider a number of hydrogen-bonded systems: N2 · · ·HF,
powers of 8 and 10 are included (odd powers are nonzero
hydrogen fluoride dimer, HF· · ·ClF, water dimer,
in an exact expansion but are very small). All 1/R terms
dimethylnitramine (DMNA)· · ·methyl alcohol complex,
are damped using the Tang-Toennies function:
and ammonia dimer. The first molecule represents a
n relatively weak hydrogen bond, with the interaction
f n (δ, R) = 1 − e−δ R (δ R)m /m! (21) energy at the potential minimum of only −2.3 kcal/mol.
m=0
The water dimer, certainly the most important hydrogen-
The potential of this form is capable of fitting the computed bonded system, is bound twice as strongly, by
points to within about 0.1 kcal/mol. 5.0 kcal/mol, and the bond in the HF dimer is of similar
The potentials become still more complicated if the strength. The complex of DMNA with methyl alcohol,
rigid monomer approximation is abandoned. Unfortu- bound by about 6 kcal/mol, is discussed to show a
nately, at the present time very little reliable information system for which a very plausible hydrogen-bonded
is available about flexible-monomer potentials. What is configuration is not the global minimum structure.
often done is to assume that the flexible potential can be The components of the interaction energy of N2 · · ·HF
obtained by simply letting the sites centered on atoms to as functions of the orientation of HF denoted by the angle
move with atoms, but this assumption has not been truly θ2 are shown in Fig. 4. For θ2 = 0◦ (180◦ ) the configuration
tested so far. is linear, of the form N-N· · ·F-H (N-N· · ·H-F). The pref-
The potentials discussed above are pairwise or two- erence for the hydrogen-bonded minimum at θ = 180◦ is
body potentials (i.e., potentials describing dimers). Yet, clearly due to the electrostatic interaction. The negative
in many cases such potentials are fitted to thermodynam- dipole–quadrupole term dominates this component (the
ics data for liquids and solids. In such media the pairwise magnitude of this term is maximized at the N-N· · ·H-F
nonadditive effects are usually quite important. Therefore, geometry), but the next term in the multipole expansion,
potentials of this type are called “effective” two-body po- the quadrupole–quadrupole interaction, is not negligible,
tentials since they approximate the many-body effects by contributing close to 50% of the electrostatic energy at
an unphysical distortion of the two-body potential rela- some configurations. At the minimum this term is attrac-
tive to the exact two-body potential. As a consequence, tive and contributes about 28%. A rotation of N2 can only
the effective two-body potentials perform poorly in pre- make both terms less attractive, so that it is clear that we do
dicting pure dimer properties such as dimer spectra or not need to consider other positions of this monomer. As
second virial coefficients. In fact, the effective two-body Fig. 4 shows, the minimum configuration is also preferred
potentials perform poorly also in predicting trimer prop- by the induction energy and (slightly) by dispersion. Even
erties (although the three-body component dominates the the rather small term denoted by δ E intHF
—which contains
nonadditive effects, cf. section III.C). the induction and exchange-induction effects of the order
A partial solution to the problem is offered by the so- higher than second computed at the Hartree–Fock level
called polarizable potentials. As discussed in section III.C, of theory (these effects cannot at present be separated,
the major component of the three-body nonadditivity for therefore δ E int
HF
is not included in the four fundamental
hydrogen-bonded systems are the induction effects. These interaction energy components)—favors the same config-
effects can be modeled by computing the electric field on uration. On the other hand, the N-N· · ·H-F configuration
a given monomer due to other monomers and finding the is strongly disfavored by the exchange effects. This is to be
dipole moments induced by this field. The polarizable wa- expected, since the size of these effects is proportional to
ter potentials are able to predict the dimer properties much the overlap of electronic clouds and when HF rotates from
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
FIGURE 4 Components of the interaction energy for N2 · · ·HF shown as functions of the angle θ2 formed by HF with
the intermolecular axis (θ2 = 0◦ for the N–N· · ·F–H geometry) at R = 3.6 Å, close to the minimum separation, and the
N2 molecule lying on the intermolecular axis. SAPT, symmetry-adapted perturbation theory. (Data from Jankowski
et al., 2000.)
0 to 180◦ around its center of mass, close to the F atom, the ical van der Waals radii of atoms. Since hydrogen bonds
overlap strongly increases. The increase of the exchange are shorter than other van der Waals bonds, the radius of
repulsion is in fact larger than the absolute increase of the hydrogen equal to 1.2 Å is conventionally neglected
the magnitude of electrostatic interaction. Only the inclu- when adding van der Waals radii. For example, for water
sion of the induction and dispersion effects leads to the the sum of oxygen’s van der Waals radii is 2.8 Å, reason-
hydrogen-bonded minimum. N2 · · ·HF is a good example ably close to the accurate value of Re = 2.9 Å. A similar
of quite a typical situation in hydrogen-bonded complexes. calculation including also the hydrogen atom gives too
The prediction based on electrostatics alone gives the cor- large a distance of 3.6 Å.
rect minimum angular configuration, which is then con- Some authors describe an even simpler model of the
firmed by a more detailed analysis, despite the fact that the hydrogen bond than the purely electrostatic model. This
other physical components (and their anisotropies) may be model is based on the observation that the X-H bond is
of similar size or larger than in the case of the electrostatic polarized such that there is a net positive charge on the
component. However, there are also cases where predic- hydrogen and negative on the atom X. The positive charge
tions based on electrostatics fail (see below). Of course, on H is attracted to a negative charge on the electroneg-
the electrostatic model cannot predict the radial position ative atom Y. Since both X and Y are negatively charged,
of the minimum since the electrostatic interaction only in- the hydrogen bond has to be linear to attain the minimum
creases in magnitude as the separation decreases (in most repulsion of these charges. This model is too simple to
cases this is true, but for some combinations of dipole and completely predict the minimum geometry in most cases.
quadrupole moments the electrostatic energy may change Let us see how this model works for N2 · · ·HF. Although
sign at short R). Therefore this model assumes a hard due to symmetry there is obviously no charge flow from
sphere repulsion at a distance determined from the empir- one nitrogen atom to another in N2 , there is an excess
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
electron density on the molecular axis outside the nuclei, Consider now the water dimer. The minimum structure,
where the lone pairs are located. Thus, the positive charge shown in Fig. 5, can be rationalized in a simplistic way as
on the hydrogen is attracted to this place and the negative follows. The water molecule has two lone electron pairs in
charge on F is repelled, leading to a linear hydrogen bond. a tetrahedral arrangement with the hydrogen atoms. Since
However, this model is not able to determine the angle there is an excess negative charge on the lone pair, the
between the N-N and N· · ·H-F axes. The linear structure O-H· · ·O line should lie along the position of one of the
of the whole dimer cannot be explained by repulsion of pairs. If one uses an additional argument that the accep-
the negative charge on F by the negative charge on the tor’s hydrogens should maximize the distance from the
other side of the N2 molecule, as there are regions of free hydrogen of the donor to minimize the repulsion of
either sign between the nitrogen nuclei. positive charges, an almost correct dimer structure is ob-
Our next example will be the hydrogen fluoride dimer. tained (cf. Fig. 5). Now let us see what structure would
It is a good example to show that the electrostatics re- be predicted taking into account the electrostatic interac-
stricted to the dipole–dipole interaction (or the equivalent tion only. Although for interactions involving diatoms the
charge interaction model) is not sufficient to determine dipole–dipole model predicting a linear dimer may some-
the minimum orientation even for interactions of diatomic times be correct, it is rather clear that it will rarely work for
monomers. Such a model predicts the linear F-H· · ·F-H general monomers. Indeed, for the water dimer the elec-
structure dictated by the optimal orientation of the dipoles, trostatic picture limited to the dipole–dipole interaction
which disagrees with the observed structure in which the would predict a head-to-tail geometry without specifying
hydrogen acceptor is tilted away from the linear geometry the dihedral angle between the planes of the monomers. If
by about 60◦ we use an additional argument taking into account the re-
pulsion of the charges on the hydrogens, a bifurcated struc-
H ture is predicted, far from the true minimum (this structure,
#
F H· · · ·F however, is a saddle point on the potential energy surface,
see section VIII.A). That this approach cannot be suffi-
The electrostatic model gets the right geometry if one in- cient can be seen by recalling the charge distribution of
cludes the dipole–quadrupole and quadrupole-quadrupole the water monomer. The simplest representation of this
interactions. The former interaction is zero for linear ge- distribution is two positive charges on the hydrogens and
ometry, as the dipoleA –quadrupoleB term cancels with a twice larger negative charge on the oxygen—quite a dif-
the quadrupoleA – dipoleB term, but the sum of the two ferent distribution than a dipole along the symmetry axis.
becomes negative for a tilted acceptor. The quadrupole– The description of the water electrostatics limited to the
quadrupole interaction also prefers nonlinear structures. In dipole–dipole term does work at very large intermonomer
fact, a simple electrostatic calculation involving only the separations, as this term decays slower than other electro-
dipole and quadrupole moments predicts the tilting angle static terms. At distances near the minimum, the dipole–
of 68◦ , in reasonable agreement with the observed value. quadrupole and the quadrupole–quadrupole interactions
The overall stabilizing electrostatic effect between the lin- are, however, very substantial. Taking them into account
ear and the tilted configurations is about 2 kcal/mol. As for a fixed R of 3 Å gives a geometry very close the true
in the case of N2 · · ·HF, the exchange interaction acts op- minimum shown in Fig. 5: the hydrogen bond nonlinear-
posite to the electrostatics, but the destabilizing exchange ity angle α is only 8◦ too large and the angle β between
effect when going from the linear F–H· · ·F–H to the non- the O-O axis and the bisector of the hydrogen acceptor is
linear configuration is in this case smaller than electro- 9◦ too large compared to the values computed using the
static stabilization. The induction and dispersion energies SAPT method. The complete, ab initio computed elec-
both favor the nonlinear configuration but only by a frac- trostatic energy (including the charge penetration effects)
tion of kcal/mol. Experiment and ab initio calculations predicts the minimum parameters even more accurately:
also predict that the bonding hydrogen is somewhat tilted both α and β are only 2◦ too small.
away from the F–F axis, by about 10◦ down on the diagram The minimum structures of the DMNA complex with
above. methyl alcohol are shown in Figs. 6 and 7. The first struc-
An example where electrostatics does fail to predict the ture is the global minimum and as one can see it is not
correct minimum structure is the complex formed by HF hydrogen bonded. The local minimum shown in Fig. 7 is
and ClF. This complex is experimentally found to exist hydrogen bonded but it is located 1.1 kcal/mol above the
in the “anti-hydrogen bonded” HF· · ·ClF configuration global minimum of 5.8 kcal/mol. It is also at 1.1 Å larger
rather than in the hydrogen-bonded structure ClF· · ·HF separation than the global minimum. Sure enough, the
predicted by electrostatic arguments analogous to those electrostatic prediction favors the hydrogen-bonded struc-
discussed for the HF dimer. ture. The peculiar shape of the global minimum structure
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
FIGURE 5 Water dimer equilibrium geometry. The system has the Cs symmetry. The angles are computed from the
symmmetry-adapted perturbation theory (SAPT)-5s potential (Mas et al., 2000) and the angles measured by Odutola
and Dyke (1980) are given in parentheses. The equilibrium separation is a mixed theoretical/experimental value.
results from competing tendencies to maintain a reason- all the components of the interaction energy at the global
ably favorable electrostatic interaction and to maximize minimum are fairly close to the corresponding compo-
the dispersion attraction while at the same time keeping nents at the local minimum, except for the dispersion en-
the methyl groups of both monomers as far apart as possi- ergy, which is more than 2 kcal/mol larger in magnitude in
ble to reduce the exchange repulsion. It is interesting that the former configuration. Thus, the dispersion component
plays here a decisive role. The dispersion energy is not par-
ticularly anisotropic for this complex, and the increase in
FIGURE 6 The global minimum structure of the dimethyl- FIGURE 7 The secondary (local) minimum structure of the
nitramine complex with methyl alcohol. The carbon atoms are gray, dimethylnitramine complex with methyl alcohol. The carbon atoms
hydrogens—white, nitrogens—blue, and oxygens—red. (From are gray, hydrogens—white, nitrogens—blue, and oxygen—red.
Bukowski et al., 1999.) (From Bukowski et al., 1999.)
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
related to the physical picture of hydrogen bonding involv- different. As is shown in Fig. 9, the difference in transition
ing the electrostatic, induction, dispersion, and exchange energies is equal to the difference of the dimer dissociation
forces. Such a picture does not apply to covalently bound energies D0 in the interactions potentials corresponding to
compounds. Only the exchange interactions (related to the ground and excited X-H vibrations
electron tunneling) can be identified in covalently bonded
E monomer − E dimer = D0 (v = 1) − D0 (v = 0).
compounds. However, the contribution of these interac-
tions to the chemical bond energy is stabilizing, in contrast Depending on the sign, this difference is called either
to intermolecular interactions where it is always repulsive. the red or blue shift. For a hydrogen-bonded system the
Therefore, it is better to reserve the descriptor “covalent” excited-state potential is almost always deeper; therefore
to the chemical bonds. In fact, Muller-Dethlefs
¨ and Hobza the frequency shift is to the red. This deepening of the po-
(2000) use the term noncovalent interactions for inter- tential is usually attributed to an increase in magnitude of
molecular interactions (including hydrogen bonds). the electrostatic and induction components, although re-
cent ab initio calculations have shown that the change of
V. PROPERTIES OF HYDROGEN BONDS the dispersion component also plays a major role. The in-
crease of the former two components can be related to the
The hydrogen-bonded complexes are characterized by lengthening of the X-H distance upon vibrational excita-
several structural and spectroscopic properties that pro- tion. The longer distance results in larger magnitudes of
vide means of identifying the presence of such bonds. multipole moments, which increases both the electrostatic
The most typical hydrogen bonds would exhibit all those and induction energies. This fact explains also why the
properties. Some authors require that all the characteris- magnitudes of the red shift are correlated with strengths of
tics listed below should be present to classify dimers as hydrogen bonds. The bond strength is measured by its dis-
hydrogen-bonded ones, but recently the more “relaxed” sociation energy D0 . Generally one may expect that if D0
definition of the hydrogen bond given in section I, which is getting larger, also the difference D0 (v = 1) − D0 (v = 0)
does not include this requirement, has become generally (i.e., the red shift) will get larger. There exist, in fact, em-
accepted. pirical formulas relating the red shift to the strength of hy-
drogen bond. The red shift is accompanied by an increase
(a) The H· · ·Y distances in hydrogen-bonded systems in the intensity of the X-H stretch due to the appearance
are shorter than generally expected of intermolecular of the induced dipole moments, as discussed next. Upon
bonds. This fact is a simple consequence of the strength formation of a hydrogen-bonded dimer, most of the other
of hydrogen bonds and has been historically important for monomer vibrational frequencies are also shifted, either
the formation of the concept of hydrogen bonding. The to red or to blue, but these shifts are typically a few times
distances between atoms in molecular crystals have been smaller in magnitude than the red shift of the hydrogen-
known for a long time. Based on these data, Pauling de- bonded X-H stretch.
fined the van der Waals radii of elements such that the sum (c) The dipole moment of a hydrogen-bonded complex
of such radii gives the interatomic distance. For example, is larger than the vector sum of the monomer’s moments.
the radii for oxygen, nitrogen, and fluorine are 1.4, 1.5, This effect is due to the presence of the induced dipoles.
and 1.35 Å, respectively. Also, somewhat more arbitrar- Although a dipole moment of a cluster is well defined,
ily, a radius of about 1.1–1.3 Å was assigned to hydrogen attempts to determine the dipole moments of the individ-
(other authors use even 1.0 Å). The hydrogen bonds are ual monomers within the cluster encounter difficulties in
much shorter than the sum of van der Waals radii of hydro- the partition of the total quantity. For large water clusters
gen and atom Y. In the case of the water dimer discussed and for ice, the literature values vary from about 2.3 to
above, the H· · ·O distance is about 2.0 Å, whereas the sum 3.1 Debye, depending on the method of partition, com-
of H and O van der Waals radii gives 2.5 Å. pared to the monomer’s value of 1.86 Debye.
(b) The fundamental frequency of the X-H bond stretch (d) The length of the X-H bond is often increased upon
is lowered upon the formation of a complex resulting in the formation of the cluster. This increase is almost negli-
a red shift, which is a very sensitive probe of hydrogen gible for the dimers. In the case of the water dimer, recent
bonds in clusters and in the condensed phase. The origin ab initio calculations have found that the O-H distance
of this effect can be understood as follows. In an iso- increases by only 0.006 Å compared to its equilibrium
lated monomer the fundamental frequency of the X-H vi- value of 0.958 Å [Klopper et al. (2000)]. However, for ice
bration is proportional to the energy difference between some experiments indicated that this bond can be length-
the ground (v = 0) and the first excited (v = 1) vibrational ened to about 1.000 Å, a factor of seven more than for
states of the monomer. When the transition takes place in the dimer. These experimental observations remain con-
a dimer, it involves also the ground van der Waals inter- troversial and can be alternatively explained by disorder-
molecular vibrations and therefore its frequency will be ing of oxygen atoms off the hexagonal axis. Another often
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
FIGURE 9 An illustration of the origin of the red shift in vibrational frequencies upon formation of hydrogen bond.
quoted experimental result is 0.973 Å [Kuhs and Lehmann nucleic acids, and typical hydrogen bonds forming within
(1986)], only 0.015 Å more than the monomer equilibrium or between proteins. The strong hydrogen bonds involve
value. An increase of this order is predicted by ab initio ionic species. Examples include Cl− · · ·H2 O, F− · · ·H2 O,
calculations for clusters of size of about six molecules. H3 O+ · · ·H2 O, and F− · · ·HF with interaction energies of
about 15, 30, 35, and 40 kcal/mol, respectively. In the
case of the last complex, the bond is so strong that this
VI. TYPES OF HYDROGEN BONDS complex could also be classified as a chemically bonded
one. Furthermore, the H· · ·F− distance of about 1.3 Å
The hydrogen bonds are classified based mainly on the is closer to distances typical for chemical bonds (about
strength of interaction as measured by the depth of the in- 1 Å) than to H· · ·Y hydrogen bond distances (about 2 Å).
teraction potential De at the minimum of the complex. Strong hydrogen bonds result in a significant distortion of
Usually three classes are distinguished: weak, moder- the monomer’s structure. The H-F bond in F− · · ·HF is of
ate, and strong bonds, with energetic boundaries at about the same length as the F− · · ·H hydrogen bond (i.e., the
2 and 15 kcal/mol. The weak hydrogen bonds involve complex is centrosymmetric). Thus, the F-H distance is
less polar X-H groups in proton donors, like C-H or
P-H groups, or less polar acceptors, like the N2 molecule
in the N2 · · ·HF complex discussed above. Also, the hydro-
gen bonds where X-H attaches to a π bond on the acceptor
belong to this group (examples of such bonds are given
in Fig. 10). The weakest hydrogen bonds considered in
the literature are about 0.5 kcal/mol. Most of hydrogen-
bonded complexes of interest form the group of moderate
hydrogen bonds. Water dimer or hydrogen fluoride dimer FIGURE 10 Examples of hydrogen-bonded compounds where
are typical examples for this group. Other well-known an X-H group attaches to a π bond. (a) Complex of chloroform
dimers in this group involve carboxylic acids, base pairs of with benzene. (b) Complex of hydrogen chloride with ethylene.
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
increased from 0.93 Å to 1.30 Å upon the dimer forma- of the two monomers agrees very well with results of mod-
tion. This is, however, an extreme case. For F− · · ·H2 O the ern ab initio calculations. The equilibrium O-O distance
hydrogen-bonded O-H distance increases by about 0.1 Å cannot be directly measured, but spectral data provide the
from its equilibrium-isolated monomer value of 0.96 Å, value of R −2 0 . This value together with the difference
and this size of elongation is typical for ionic systems. The between R −2 0 and Re from dynamical calculations in
hydrogen bonding between ions and water is the principal the SAPT-5s potential (Groenenboom et al. 2000) gave a
mechanism of the hydration processes. mixed experimental/theoretical estimate of 2.91 ± 0.005
Most of hydrogen-bonded molecules exhibit to a larger Å (Mas et al., 2000), which agrees to all digits with the
or lesser extent all of the properties described in section VI. value obtained in large-scale ab initio calculations [Klop-
There exist cases, however, when some of these features per et al., 2000]. The depth of the water dimer potential
are not present or even the trend is opposite. An example has been accurately determined by ab initio calculations
are complexes in which the X-H vibrations are blue shifted to be 5.0 kcal/mol (Klopper et al., 2000). Knowledge of
(see Hobza et al., 1999, and Gu et al., 1999). Some au- the limit value of De allowed scaling of the dissociation
thors questioned if such complexes should be considered energy computed from the SAPT-5s potential (Groenen-
hydrogen bonded. According to the definition adopted in boom et al., 2000) to obtain an estimate of D0 equal to
this article, the hydrogen-bonded classification will be ap- 1165 ± 54 cm−1 , in excellent agreement with the value
propriate if the equilibrium geometry has a nearly linear derived from experiments equal to 1168 ± 245 cm−1 (Cur-
X-H· · ·Y shape, and the bond is reasonably strong (i.e., tiss et al., 1975). The water dimer comparison represents
not much weaker than about 1 kcal/mol). an unprecedented level of agreement between theory and
experiment on the equilibrium parameters for a hydrogen-
bonded system.
VII. HYDROGEN BONDS IN CLUSTERS As an example, Fig. 11 presents the intermonomer
vibrational-rotational-tunneling (VRT) energy levels of
Small clusters offer substantial advantages in investiga-
tions of hydrogen bonds compared to the condensed phase,
where one can observe only statistically averaged proper-
ties. Clusters can be detected in bulk gas phase or in molec-
ular beams, the latter method being presently the method
of choice. Various spectroscopic techniques applied to in-
vestigations of hydrogen bonds have been discussed in
section II. Most information about clusters comes from
infrared techniques, both near and far infrared. The former
method operates in the range of several thousand inverse
centimeters and excites one of the vibrational modes in a
monomer with a simultaneous transition between various
van der Waals vibrations. The van der Waals transitions
have much lower frequencies because the intermonomer
motion is much slower than the intramonomer one. The
far infrared method operates in the range from a few to a
few hundred inverse centimeters and therefore can be used
to observe pure transitions between the intermolecular vi-
brational modes. Another technique used to investigate
hydrogen-bonded clusters are matrix isolation methods,
where clusters are included in solid matrices made mostly
of rare gases. Such spectra, however, give much less pre-
cise information due to relatively strong interactions of
polar molecules with the matrix.
TABLE II Vibrational Frequencies of Water (in cm−1 ) DT, AW, and AT in Fig. 11 denote the ground state (A ),
Monomer a Bend 1595 donor torsion (A ), acceptor wag (A ), and acceptor twist
Symm. stretch 3657 (A ) modes. The solid lines refer to A symmetry and
Asymm. stretch 3756 the dashed lines to A symmetry with respect to the point
Dimer inter b Donor torsion 86 group Cs of the equilibrium structure. Experimental levels
Acceptor wag 112 not shown have not been measured to date. The K = 0 and
Acceptor twist 121 1 columns describe the rotational ground and first excited
Stretch 187 states, respectively. The agreement between the spectra
In-plane bend 370 measured and calculated from the SAPT-5s potential is
Out-of-plane bend 565 very good for most levels. Some of the existing discrep-
Dimer intra Acceptor H-O-H bend 1593c ancies may result from misassignments of the measured
Donor H-O-H bend 1611c lines.
Donor bridge O-H stretch 3530d The intramonomer vibrational frequencies change sig-
Donor free O-H stretch 3730d nificantly upon the formation of a hydrogen-bonded dimer
Acceptor symm. stretch 3600d compared to the isolated monomers’ frequencies. Table II
Acceptor asymm. stretch 3745d presents the fundamental vibrational frequencies for the
Liquid watere Intermonomer stretch 65, 162 water monomer, dimer, and condensed states of water.
Librational 430, 650, 795 The monomer frequencies are related to the characteristic
H-O-H bend 1581, 1641 vibrational types of motion called normal modes. There
O-H stretch 3051, 3233, 3393 are three normal modes in the water monomer: a bending
3511, 3628 mode and the symmetric and asymmetric stretches. In the
Ice I f Intermonomer stretch 50, 164, 229 water dimer there are six intramonomer modes. We expect
Librational 840 that the vibrational modes of the acceptor will be closer to
H-O-H bend 1650 those in the monomer than those of the donor. Since the
O-H stretch 3220 dimer bending modes are available only from measure-
ments in matrices, we should use monomer’s frequency
from matrix measurements, which is equal to 1590 cm−1
aFland et al. (1976).
bGroenenboom et al. (2000). The three lower frequencies are from
in argon matrix. Thus, as Table II shows, the acceptor’s
exact calculations, whereas the three higher ones are harmonic.
c Bentwood et al. (1980), in argon matrix. bend is blue shifted by 3 cm−1 , while the donor’s bend is
d Huang and Miller (1989). blue shifted by 21 cm−1 . We also expect that the stretch of
e Carey and Korenowski (1998), measured at temperature of 24◦ C
the O-H bond containing the bridge hydrogen will be af-
and pressure of 256 bar. fected most. This motion cannot be anymore classified as
f Franks (1972), p. 133–135.
a symmetric or asymmetric stretch, and instead we distin-
guish just the bridge O-H and donor’s free O-H stretches.
Compared to the frequency of the monomer symmetric
the water dimer. The numerical values of the inter- stretch, the bridge O-H stretch is red shifted by as much
monomer vibrational frequencies are also presented in as 127 cm−1 . The free O-H stretch is shifted 26 cm−1 to
Table II. The water dimer potential has eight equivalent the red compared to the monomer’s asymmetric stretch.
minimum structures differing only by a permutation of For the acceptor molecule we still can recognize the sym-
hydrogen atoms. The monomers can tunnel between these metric and antisymmetric stretches, which are red shifted
minima through the separating barriers (this tunneling by 57 and 11 cm−1 , respectively.
should not be confused with the electron tunneling lead-
ing to the exchange forces). The major tunneling processes
are shown in Fig. 12. The tunneling results in a splitting
B. Trimers and Larger Clusters
of the vibrational energy levels. The presence of tunnel-
ing splittings explains the name VRT for these spectra. The structures of hydrogen-bonded clusters have been the
The donor-acceptor interchange splittings are of the or- subject of extensive research in the past 20 years [see, e.g.,
der of 1 cm−1 and would not be visible on the scale of Xantheas and Dunning (1998); Buck et al. (1998); Lee
Fig. 11. The levels shown, denoted as 1 and 2, are av- et al. (2000)]. The hydrogen-bonded N -mers with N = 3,
erages of sets of three levels split by the donor-acceptor 4, and 5 tend to be of cyclic form. The preference for the
interchange tunneling. The distance between the levels 1 cyclic form can be understood from simple arguments.
and 2 is the measure of the acceptor tunneling splitting. Consider water clusters as an example. Various isomers
The bifurcation tunneling does not lead to additional split- of water clusters from trimer to hexamer are displayed in
tings but to small shifts of levels. The abbreviations GS, Fig. 13. A chain of three water molecules can be formed
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
FIGURE 12 Water dimer tunneling paths. The numbers below transition states are barrier heights computed from
the symmetry-adapted perturbation theory (SAPT)-5s potential.
such that the two hydrogen bonds are in dimer equilibria resulting in a trimer bound by about 16 kcal/mol. Thus,
configurations. Neglecting the three-body effects this con- the trimer is more strongly bound than the dimer in the
figuration would be bound by about 11 kcal/mol, including sense that the separation of the equilibrium dimer into
about 1 kcal/mol energy from the “second-neighbor” inter- monomers requires about 5 kcal/mol of energy (neglecting
action. If instead a cyclic trimer is formed, three hydrogen zero-point energy) while separation of one monomer from
bonds are present. Of course, the energy of each bond is the trimer’s equilibrium structure requires 11 kcal/mol.
smaller than dimer’s minimum energy of 5 kcal/mol; how- Often the binding energy of a cluster is divided by the
ever, this decrease turns out to be small and the two-body number of monomers, defining the average binding energy
binding effect at the trimer minimum is about 14 kcal/mol. as
Three-body nonadditive energy adds about 2 kcal/mol, N = −E int /N ,
FIGURE 13 Water clusters from trimer to hexamer. The top row shows the global minima isomers, the structures in
the bottom row are local minima isomers. The global minimum structure of the hexamer is called “cage,” whereas the
noncyclic local minima hexamers in the bottom row are known under names “prism,” “book,” and “bag,” respectively.
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
where E int is the interaction energy at the minimum of gen bonds and the interaction energy per molecule, ∞ , is
the cluster (global or local). For the smallest clusters this about 14 kcal/mol). An interesting question is how large a
quantity changes rapidly (it is equal to 2.5 kcal/mol for cluster one has to consider to get close to the ice-binding
the water dimer and 5.3 kcal/mol for the cyclic trimer). energy. Ab initio calculations for water decamers gave 10
This rapid change is mainly due to two-body cooperative of only about 9 kcal/mol [Lee et al. (2000)]. Unfortunately,
effects. Even for the chain-like trimer the number of hy- for clusters with N larger than 10 reliable ab initio results
drogen bonds increases by a factor of two, whereas the are very difficult to obtain. In addition, the potential en-
number of monomers increases by a factor of 1.5 com- ergy surfaces for clusters of this size contain enormous
pared to the dimer, and for the cyclic trimer the number of numbers of local minima and therefore finding the global
hydrogen bonds increases by a factor of three. In the in- minimum is a highly nontrivial task.
crease of N between the dimer and the cyclic trimer about The infrared spectra of water clusters provide infor-
0.7 kcal/mol is due to nonadditive effects. The number of mation which in conjunction with ab initio calculations
hydrogen bonds per monomer grows further in larger clus- can be used to determine clusters’ structures. As an ex-
ters, reaching the maximum value of four bonds per water ample, spectra of water clusters with N = 8, 9, and 10
molecule in ice. measured by Buck et al. (1998) are shown in Fig. 14.
Similar considerations to those for the trimer explain The octamer and decamer are of particular interest due to
the greater stability of the cyclic tetramer and pentamer their possible importance in liquid water. Let us consider
over the possible chain structures. In these two cases the octamer first. One of possible octamer structures is
the number of hydrogen bonds is four and five, respec- displayed in Fig. 14. One can see that this structure in-
tively. Unlike in the step from the dimer to trimer, there is cludes exactly two types of water molecules: those that
no increase of N resulting from increase of the number are double-proton donors but accept only a single proton
of hydrogen bonds per monomer. However, N does in- (and will be denoted as DDA monomers) and those that
crease quite substantially, by nearly 2 kcal/mol, between are single-proton donors and double acceptors (DAA). All
the trimer and the tetramer. This increase is due to the hydrogen bonds are between a DDA and a DAA molecule
“second-neighbor” two-body interactions and to the in- but are of two kinds, depending whether the O-H group
creased number of three-body interactions: there are four is on DDA or on DAA monomer. Therefore, one may
trimers in a cyclic tetramer. However, the increase in N expect that the spectrum will consist of three groups of
between tetramer and pentamer is less than a half kcal/mol lines: two connected with the two discussed types of hy-
despite the number of three-body interactions growing to drogen bonds and the third one due to the vibrations of
10. One of the reasons is the “frustrated” structure of the free hydrogens. Indeed, Fig. 14 shows that this is the
pentamer with two free hydrogens on the same side of case. Based on calculations using an empirical model,
the ring, which leads to an increased repulsive contribu- the lower frequency was assigned to bonds with the O-
tion. For the pentamer one can construct various pyra- H group belonging to DAA. This frequency is red shifted
midal structures like the one shown in Fig. 13 and some by as much as about 550 cm−1 from the monomer’s sym-
of them are local minima; however, all of them are ener- metric stretch, even more than the largest red shifts in ice
getically significantly above the cyclic pentamer despite or liquid water (comparing peak values). The peak cor-
having more hydrogen bonds. This is because the dimer responding to the O-H stretch in a DDA monomer is red
configurations in those structures are far from the dimer’s shifted by only about 110 cm−1 (i.e., even less than in
minimum structure, which reduces the magnitude of the the water dimer). The free O-H stretch is at 3700 cm−1 ,
two-body component. midway between the two stretches in the monomer. The
In contrast to smaller clusters, there exist several en- splittings of the peaks are related to the symmetric and
ergetically close isomers for the water hexamer. The six asymmetric types of motion. Although the octamer struc-
hydrogen bonds in the cyclic structure may provide less ture of D2d symmetry shown in Fig. 14 agrees with the
stabilization than eight bonds that are possible in a cage- spectrum, it is not known at this point whether it is in-
like structure. Indeed, the latter structure has been shown deed the minimum structure since a similar isomer of S4
both by ab initio calculations and by experiments to be symmetry provides an equally satisfactory interpretation
the lowest one. However, the cyclic hexamer as well as of the spectrum. Quantum mechanical calculations [Lee
several other isomers shown in Fig. 13 are energetically et al. (2000)] predict that the two isomers have energies
very close, with only about 1% or smaller differences in close to within 0.1 kcal/mol. Buck et al. assumed that both
the binding energies. This process of forming more and isomers might be present in the beam. Calculations of Lee
more hydrogen bonds per molecule continues as the size et al. show that (harmonic approximation) spectra of the
of the cluster increases and at the limit of infinite clus- two isomers do differ significantly enough to distinguish
ter (i.e., in ice, each molecule participates in four hydro- between them, provided that higher resolution spectra are
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
TABLE III Intramonomer Vibrational Frequen- nonadditivity in HCl compared to HF (assuming monomer
cies of Hydrogen Halides (in cm−1 ). separations in crystal unchanged from the dimer separa-
HF HCl HBr tions). Overall, one can expect that the stabilization due
to the cooperative effects will be an order of magnitude
Monomer 3961a 2886a 2556b smaller for HCl than for HF crystals.
Dimer 3868a 2857a 2496b Let us now discuss the solid phase of water. Ice exists
3931a 2880a 2550b in several polymorphic phases. The one most commonly
Solidc 3065 2699 2395 investigated is the hexagonal form (ice Ih). The structure
3275 2715 2406 of ice Ih obtained from simulations with a SAPT potential
3404 2744 2431 is shown in Fig. 15. The oxygen atoms form a hexago-
3585 2754 2440 nal lattice, and each water molecule is in the center of a
a Pine and Howard (1986). tetrahedron, participating in four hydrogen bonds. There
b Maillard et al. (1979), in argon matrix. are several possible arrangements of the hydrogen atoms
c Kittelberger and Horning (1967) for HF and Savoie
in such a structure, and these are realized stochastically.
and Anderson (1966) for HCl and HBr. There are four different nearest neighbor water dimer con-
figurations in ice Ih.
An interesting property of ice is its response to an ex-
hydrogen-bonded chains, while the structure of HI is not ternal electric field. The dielectric constant of any polar
precisely known. The chains are loosely packed, the dis- medium is a measure of the ability of the molecules to
tance between chains being larger than the distance be- reorient in the electric field. It is believed that in ice the
tween the nearest halide atoms in a chain. The triples of
nearest heavy atoms form the angle of 116◦ for HF and 90◦
for HCl and HBr, close to the halide-halide-free hydrogen
angles in the corresponding dimers. The most striking dif-
ferences are in the red shifts of X-H vibrations displayed
in Table III. The HF crystal exhibits an extremely large
shift of 896 cm−1 , whereas for HCl and HBr the shifts
are only 187 and 161 cm−1 , respectively. The differences
are much smaller in the dimers, where the corresponding
shifts are 93, 29, and 60 cm−1 . The main reason for these
differences appears to be the significantly larger X-Y sep-
arations in HCl (and HBr) than in HF clusters, which lead
to much smaller nonadditive effects in the former case. In
the dimers the relevant separations are 3.8 and 2.7 Å. The
bond is longer for HCl simply because the chlorine atom
is larger than fluorine and charge-overlap repulsive effects
are larger for a given separation. Also the dipole moment
of HCl is about twice as small as that of HF—therefore the
electrostatic interaction is about a factor of four smaller in
magnitude at a given distance—but this loss of attractive
interaction is partly made up by the dispersion interaction,
which is larger for HCl. As a result, the strength of the hy-
drogen bond is only twice as small in HCl than in HF.
The increase in the heavy atom separation has, however, a
more significant impact on nonadditive effects. The lead-
ing nonadditive term is proportional to the product of the
square of the dipole moment and polarizability. Because
the polarizability of HCl is about twice that of HF, this
product is roughly twice as small for HCl than for HF at
a given separation. However, because the intermonomer
distance in HCl is almost 1 Å larger than in HF, the 1/R 6 FIGURE 15 The structure of ice Ih. The top view is along the
decay of the induction nonadditivity means that there will hexagonal axis, and the bottom one is from a perpendicular
be an additional sixfold decrease of the magnitude of the direction.
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
FIGURE 16 The oxygen–oxygen pair correlation function for liquid water at ambient conditions.
The vibrational spectrum spectrum of liquid water is may actually not be due to an O-H stretch vibration but
presented in Fig. 17 and the centers of the peaks are listed may originate from the overtone of the H-O-H bend. In
in Table II. The most striking feature of this spectrum the dimer this overtone is around 3180 cm−1 . In addition
is that it consists of very broad bands. The band in the to the broad range of red-shifts of O-H stretching, there
region between 3000 and 3800 cm−1 corresponding to are a number of other factors contributing to the smooth-
the O-H stretching excitations is about 400 cm−1 wide at ness of the spectral band. One of them is the fact that an
half height and shows little structure. This lack of struc- excitation of the intramonomer transition is in most cases
ture is not due to insufficient spectral resolution (which coupled to some excitation of the intermonomer modes.
is about 0.5 cm−1 ) but is an inherent feature of this spec- The smallest of the latter excitations are of the order of
trum. The major reason for the large width of the band is 100 cm−1 , which is comparable to the width of the band.
the broad distribution of environments of the O–H bonds in Finally, the lines are broadened by thermal effects (i.e., by
liquid water. Some O–H bonds are in ice-like clusters and collisions with other water molecules).
therefore are strongly red shifted. The strongest peak at In addition to the O-H stretch band, Fig. 17 shows also
3233 cm−1 , red shifted by 424 cm−1 , is usually attributed the H-O-H bend region and the intramonomer vibration
to this type of environment. It is quite close to the strongest region. The frequency ranges not shown in Fig. 17 are rel-
peak at ice I at 3220 cm−1 . The Gaussian deconvoluted atively featureless. In the H-O-H bend region one may dis-
peak at about 3628 cm−1 , red shifted by 79 cm−1 com- tinguish two frequencies likely corresponding to bendings
pared to the average of the two stretching modes in the in molecules with the prevailing donor versus acceptor
monomer, corresponds likely to a water molecule playing character. Compared to the monomer, one of the frequen-
the role of a donor to only one other molecule or being cies is blue and another red shifted. The blue-shifted fre-
only an acceptor. In both cases such a molecule possesses quency is very close to the H-O-H bend frequency in ice I.
a “free” O-H bond whose frequency can be only little The intramonomer frequencies have been classified
shifted. One may also mention that the peak at 3051 cm−1 as either corresponding to stretches (translations) or to
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
FIGURE 17 Raman vibrational spectrum of liquid water at temperature of 24◦ C and pressure of 128 bar. (From Carey
and Korenowski, 1998.) The figure shows also the Gaussian deconvoluted peaks.
librations. Comparison with the dimer frequencies may tions, which show a significant disorder of the hydrogen-
suggest that two of the librational peaks, at 430 and at bond network. The picture based on such simulations de-
650 cm−1 , may indeed be related to dimer’s in-plane and fines the current model of liquid water. While ice-like
out-of-plane bends. The frequency at 795 cm−1 may be an structures prevail, some molecules form less than four
overtone or may result from collective motions like in ice. and some do not form any hydrogen bonds at a given
One of the stretches, at 162 cm−1 , is close to the dimer instant. Neutron scattering measurements coupled with
stretch at 187 cm−1 . The other frequency, at 65 cm−1 , clas- Monte Carlo simulations indicate that on the average each
sified usually as a stretch, may also result from torsional water molecule participates in about 3.5 hydrogen bonds.
motions of donors, which have 86 cm−1 frequency in a During the thermal motion the hydrogen bonds are bro-
dimer. ken and made; thus a given molecule can move back and
Several models for the structure of liquid water have forth from an ice-like cluster environment with four hydro-
been proposed in the past. Some of them, like the bent- gen bonds, through intermediate hydrogen bond coordina-
bond model with all water molecules four-coordinated as tions, to a monomer not forming any hydrogen bonds. This
in ice, can be discarded based on the molecular simula- model explains the essential features of the pair correlation
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
functions, the vibrational spectra of water, the decrease of hydrogen bond. At sufficiently low dilutions, spectral line
volume on melting, and several other anomalous proper- shifts characteristic of dimers can be observed. NMR tech-
ties of water. For example, the time constant of the di- niques are often used in such measurements because the
electric relaxation in water has been measured to be about small number of dimers in fairly well-defined geometries
six orders of magnitude shorter than in ice and much less produce spectra that are easy to interpret.
temperature dependent. This indicates that the response to Molecules of all nonpolar and certain polar compounds
the electric field is achieved in liquid water by partial ro- form aggregates in water. Because most of these molecules
tations of “free” water monomers and of small clusters— do not form aggregates in other solvents, the process is
processes that do not involve activation barriers. An open clearly related to the specific structure of water. Aggrega-
question about the structure of liquid water is whether it tion of molecules in water is commonly called hydropho-
contains one or a few predominant cluster sizes and struc- bic interaction. This aggregation is a process with interest-
tures. If there are a few such clusters, water should ex- ing thermodynamic properties. In some cases the change
hibit some characteristic properties of a multicomponent of energy due to the aggregation is positive; therefore,
liquid, and there is some experimental evidence for this. from this point of view the aggregation should not take
Among other structures, octamers and decamers shown place. Thus, the entropic change has to be the driving force
in Fig. 14 have been postulated. Other authors believe of the process. The entropy that is the measure of the disor-
that more likely structures are cyclic hexamers, as these der of a system is increased upon aggregation compared to
naturally result from ice Ih and have been the predomi- dispersion. If nonpolar solute particles are dispersed, wa-
nant structures observed in some molecular simulations. ter molecules tend to orient themselves around the solute
A snapshot from a Monte Carlo simulation with a SAPT molecules in an orderly fashion. Formation of aggregates
potential at ambient conditions is shown in Fig. 18. This leads then to a loss of order.
figure suggests that liquid water is a collection of a variety
of cluster structures.
Structures and properties of other strongly associated X. PROTON TRANSFER
liquids like hydrogen fluoride or carboxylic acids share
many features with liquid water. Therefore, a separate Proton transfer from one molecule to another is one of the
discussion of other pure liquids is not needed here. Spec- most studied phenomena in chemistry. It is the essence
troscopic techniques can also be applied to solutions of of acid–base chemistry. This process is related to hydro-
hydrogen-bonded species in solvents that do not form hy- gen bonding, as a hydrogen-bonded complex can be an
drogen bonds. This method is particularly useful for inves- intermediate step:
tigations of solutions, with two solutes forming a strong +
A-H + B A− · · · H-B+
A-H · · · B A− + H-B ,
(22)
where A-H· · ·B is a hydrogen-bonded complex and
A− · · ·H-B+ is an ionic pair, bound mainly by the Coulom-
bic attraction between the charges. Proton transfer requires
breaking of the chemical bond, a highly energetic transi-
tion compared to energies of hydrogen bonds. Therefore,
the proton transfer processes are very distinct from the
processes considered in the rest of this article, which in-
clude breaking and making of hydrogen bonds only.
Almost all neutral dimers form only one stable com-
plex in the gas phase, the neutral complex A-H· · ·B. The
energy of the two ions is much higher than that of the
two neutral molecules, and the barrier along the proton
transfer path between the two minima may also be high.
The proton transfer will therefore be a rare event. The
situation changes in solution, where both complexes are
stabilized by solvation. Some solvents will stabilize the
ionic complex more than the neutral one, up to the point
FIGURE 18 A snap shot from a Monte Carlo simulation of liquid that it becomes the global minimum on the proton transfer
water at ambient conditions using a symmetry-adapted perturba- path. The extent of proton transfer is determined then by
tion theory (SAPT) potential. the acidity of A-H and the basicity of B. Experimentally
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
the transfer from the neutral to the ionic form can be ob- hydrogen bonds is just right for biological applications.
served by a very significant increase of the dipole moment The energy needed to break such a bond is somewhat
of the complex. The neutral forms have dipole moments larger than the thermal energy at ambient temperatures;
in the range of 1–3 Debye, whereas the ionic ones have therefore, hydrogen-bonded structures are well-defined.
dipole moments around 10 Debye. The ionic complex can At the same time, in proper environments these bonds can
also be dissociated into free solvated ions. The whole pro- dissociate and associate quickly, allowing dynamical bi-
ton transfer process is a complicated many-step reaction. ological processes. The chemical bonds could not have
It involves more degrees of freedom than just the proton been used for this purpose because the energies required
motion because the reorientation of the solvent molecules to break them are much larger.
is critical. The rate of the proton transfer process depends Proteins are polypeptides (i.e., polymers formed from
on the barrier height on the reaction path. There is little amino acids). About 20 different amino acids appear in
reliable information available on the barrier heights for proteins. The sequence of amino acids determines the pri-
the proton transfer in solutions. In some solvents (e.g., mary structure of a protein. The peptide link between the
in water), the process is simplified since the solvent ef- adjacent amino acids; in other words, the group
fects are so strong that neither the neutral complex nor
O
the ionic complex are formed, except as very short-lived #
intermediates. Then the process can be well described as C#
❝
N
a single-step reaction. #
H
There are some charged dimers that exhibit an in-
trinsic symmetry on the proton transfer coordinate. All
those systems are strongly bound, typically by more than can act both as proton donor and acceptor (at the oxy-
10 kcal/mol. The best known example is F− · · ·HF (bound gen atom). The polypetide chains are connected by hydro-
by about 40 kcal/mol). Clearly, the structure FH· · ·F− gen bonds formed between different peptide link groups
must possess, the same energy. Early electronic structure as shown in Fig. 19. A single chain can form a helical
calculations were predicting that the intermolecular po- structure, the so-called α helix, where peptide link groups
tential for the proton motion along the transfer path is that are above each other form hydrogen bonds. Another
a symmetric double-well potential. This could give rise chain arrangement are the pleated sheets (β sheets), where
to interesting tunneling phenomena. However, more ac- chains of polypeptides placed next to each other are con-
curate calculations have found that there is only a single nected by hydrogen bonds. These spatial arrangements
minimum on this path. Thus, the structure of the system is are called secondary structures. All polypeptides have one
better described as [F· · ·H· · ·F]− . The potential is similar structure or the other and often have alternating regions of
for [H3 O2 ]− , [HCl2 ]− , and [HBr2 ]− . Only for weak pro- either one. The spatial arrangement of fragments consist-
ton donors like CH4 there probably is a double minimum ing of α helices and β sheets is called the tertiary structure
well with a small barrier. Also the cationic complexes like (see Fig. 19), and hydrogen bonds contribute to its stabil-
[H5 O2 ]+ usually have near-centrosymmetric potentials, ity as well. Enzyme molecules are a class of polypeptides
with barriers smaller than 1 kcal/mol. An exception could whose function in an organism is chiefly determined by
be [H7 N2 ]+ , where the barrier could be around 1 kcal/mol. their shape. In particular, the shape is critical near the cat-
alytic center of the enzyme (key and lock mechanism). Be-
XI. HYDROGEN BONDS IN cause these shapes are determined by hydrogen bonds, it
BIOLOGICAL STRUCTURES follows that hydrogen bonds are centrally important to the
functions of life. In some cases hydrogen bonds can also
Hydrogen bonds are of great significance in determin- directly participate in a catalytic reaction of an enzyme.
ing the structure of the biologically most important com- Of similar importance are hydrogen bonds in nucleic
pounds like proteins and nucleic acids. Since virtually acids, although they play a different role there than in pro-
all biological processes take place in aqueous solution, teins. These bonds are responsible for the specific pair-
equally important to the hydrogen bonds between biopoly- ing of the nucleic bases. There are only a few bases that
mers are hydrogen bonds between biopolymers and water are the building blocks of nucleic acids, and their spe-
and between water molecules surrounding biopolymers. cific sequence contains all the genetic information. Only
Water stabilizes biopolymers and, as discussed above, is certain pairs of bases, called complementary pairs, form
often the medium stimulating the formation of various strong hydrogen-bonded complexes binding the double
kinds of aggregates essential in living matter. Most bio- helix structure of nucleic acids. For example, cytosine con-
logical compounds simply do not exist in their biologically nects with guanine and thymine with adenine as shown in
functional form in nonaqueous solutions. The strength of Fig. 20. The double helix is stabilized by two backbone
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
FIGURE 19 The structure of proteins. (a) The primary structure is the determined by the sequence of amino acids
in the polypeptide chain. Four of such acids shown are valine (Val), glycine (Gly), serine (Ser), and leucine (Leu).
(b) The secondary structures: α helix and pleated β sheet. Hydrogens or side chains connected to the carbon atoms
situated between the peptide link groups (Cα carbons) are not shown. The hydrogen bonds are indicated by dotted
lines. Most of the hydrogen bonds in proteins are N-H· · ·O = C bonds, but C-H· · ·O = C bonds are also present, and
one such bond is marked on the α helix. (c) Fragments of proteins built of pleated sheets and α helices connected
by additional hydrogen bonds (not shown) are creating the tertiary structure. (d) Well-determined units such as the
one shown in (c) may repeat a number of times in a large protein. Mutual arrangements of such units determine the
quaternary structure of a protein. The units can be linked to each other by hydrogen bonds.
strands on the outside consisting of sugar molecules to and uracil (U). DNA uses thymine (T) instead of uracil.
which the bases are attached and of phosphate groups Thus, a given nucleic acid alphabet contains to a very
linking the sugar molecules together, as shown in Fig. 21. good approximation just four letters. Only very exception-
Two such strands form a double helix displayed in Fig. 22. ally do some other bases appear, mainly methylated forms
The strands are connected by hydrogen bonds between the of major bases found in ribosomal RNA. The hydrogen-
nucleic bases. There are two classes of nucleic acids: ri- bonded pairs GC and AT (AU in RNA) are more stable
bonucleic acid (RNA) and deoxyribonucleic acid (DNA). than the other combinations and better fit into the geom-
RNA plays a function in the cellular synthesis of proteins, etry of the backbone, and for this reason they are called
whereas DNA is the carrier of the genetic information, complementary pairs. Their hydrogen-bonded configura-
except for some viruses that use RNA for that purpose. tions shown in Fig. 20 were proposed by Watson and Crick.
There are two main differences between RNA and DNA: The complementarity is the basis of the molecular mech-
the sugar of the backbone is ribose and 2-deoxyribose, re- anism of nucleic acid replication as well as transcription
spectively, and one of the bases is different. There are four from DNA to RNA and translation from RNA to proteins.
major bases appearing in either RNA or DNA. In the for- The genetic information coded by the sequence of the nu-
mer case these are guanine (G), adenine (A), cytosine (C), cleic bases is replicated when DNA separates into two
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
signments have been made assuming that the observed complexes cor-
respond to the minimum structures determined in ab initio calculations.
FIGURE 21 A single strand of DNA. The sugar molecules are connected to each other by the phosphate groups
forming the DNA backbone. A nucleic base is attached to the other end of each sugar molecule.
P1: FJU Final Pages
Encyclopedia of Physical Science and Technology EN007B-322 June 29, 2001 20:6
Groenenboom, G., Mas, E. M., Bukowski, R., Szalewicz, K., Wormer, Lee, H. M., Suh, S. B., Lee, J. Y., Tarakeshwar, P., and Kim, K. S. (2000).
P. E. S., and van der Avoird, A. (2000). Phys. Rev. Lett. 84, p. 4072. J. Chem. Phys. 112, 9759.
Groenenboom, G., Wormer, P. E. S., van der Avoird, A., Mas, E. Mas, E. M., and Szalewicz, K. (1996). J. Chem. Phys. 104, 7606.
M., Bukowski, R., and Szalewicz, K. (2000). J. Chem. Phys. 113, Mas, E. M., Bukowski, R., Szalewicz, K., Groenenboom, G., Wormer,
0000. P. E. S., and van der Avoird, A. (2000). J. Chem. Phys. 113, 0000.
Gu, Y., Kar, T., and Scheiner, S. (1999). J. Am. Phys. Soc. 121, 9411. Maillard, D., Schriver, A., Perchard, J. P., and Girardet, C. (1979).
Hobza, P., Spirko, V., Havlas, Z., Buchold, K., Reinmann, B., Barth, J. Chem. Phys. 71, 505.
H.-D., and Brutschy, B. (1999). Chem. Phys. Lett. 299, 180. Müller-Dethlefs, K., and Hobza, P. (2000). Chem. Rev. 100, 143.
Hobza, P., and Sponer, J. (1996). Chem. Phys. Lett. 261, 379. Nelson, D. D., Fraser, G. T., and Klemperer, W. (1987). Science 238,
Sponer, J., Leszczynski, J., and Hobza, P. (1996). J. Phys. Chem. 100, 1670.
1965, 5590. Odutola, J. A., and Dyke, T. R. (1980). J. Chem. Phys. 72, 5062.
Hobza, P., and Zahradnik, R. (1988). “Intermolecular Complexes—The Olthof, E. H. T., van der Avoird, A., and Wormer, P. E. S. (1994).
Role of Van der Waals systems in Physical Chemistry and Biodisci- J. Chem. Phys. 101, 8430, 8443.
plines,” Elsevier, Amsterdam. Pine, A. S., and Howard, B. J. (1986). J. Chem. Phys. 84, 590.
Huang, Z. S., and Miller, R. E. (1989). J. Chem. Phys. 91, 6613. Rybak, S., Jeziorski, B., Szalewicz, K. (1991). J. Chem. Phys. 95, 6576.
Jankowski, P., Tsang, S., Klemperer, W., and Szalewicz, K. (2000). Savoie, R., and Anderson, A. (1966). J. Chem. Phys. 44, 548.
J. Chem. Phys., submitted. Scheiner, S. (1997). “Hydrogen Bonding. A Theoretical Perspective,”
Jeffrey, G. A. (1997). “An Introduction to Hydrogen Bonding,” Oxford Oxford Univ. Press, Oxford and New York.
Univ. Press, Oxford and New York. Schuster, P. (ed.) (1984). “Hydrogen Bonds,” Top. Curr. Chem. 120,
Jeziorski, B., and Szalewicz, K. (1998). “Intermolecular Interactions Springer, Berlin.
by Perturbation Theory”, in “Encyclopedia of Computational Chem- Soper, A. K. (2000). Chem. Phys. 258, 121.
istry,” edited by P. von Ragué Schleyer, N. L. Allinger, T. Clark, Stone, A. J. (1996). “The Theory of Intermolecular Forces,” Clarendon
J. Gasteiger, P. A. Kollman, H. F. Schaefer III, and P. R. Schreiner, Press, Oxford.
Wiley, Chichester, UK, 1998, vol. 2, pp. 1376–1398. van Duijneveldt, F. B., van Duijneveldt-van de Rijdt, J. G. C. M., and
Jeziorska, M., Jankowski, P., Jeziorski, B., and Szalewicz, K. (2000). van Lenthe, J. H. (1994). Chem. Rev. 94, 1873. Xantheas, S. S, and
J. Chem. Phys. 113, 2957. Dunning, T. H., Jr. (1998). In “Advances in Molecular Vibrations and
Kittelberger, J. S., and Horning, D. F. (1967). J. Chem. Phys. 46, 3099. Collision Dynamics,” edited by J. M. Bowman and Z. Bacic, p. 365,
Klopper, W., van Duijneveldt-van de Rijdt, J. G. C. M., and van JAI Press, Stamford.
Duijneveldt, F. B. (2000). Phys. Chem. Chem. Phys. 2, 2227. Yanson, I. K., Teplitsky, A. B., and Sukhodub, L. F. (1979). Biopolymers
Kuhs, W. F., and Lehmann, M. S. (1986). Water Sci. Rev. 2, 1. 18, 1149.
P1: GSR/MAG P2: GLM/GKP QC: FXY Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN008I-352 June 29, 2001 12:38
I. History
II. Ion Energetics
III. Ion Kinetics
65
P1: GSR/MAG P2: GLM/GKP QC: FXY Final Pages
Encyclopedia of Physical Science and Technology EN008I-352 June 29, 2001 12:38
PEPICO Form of photoionization mass spectrometry in nuclei and are called cations, while negatively charged
which ions produced by photoionization are detected species are referred to as anions.
in coincidence with the accompanying energy-selected One of the early triumphs of the theory of atomic struc-
photoelectrons. ture was the correct description of the quantized energy
PFI Pulsed field ionization—the application of a short level structure of atoms formed by the Coulomb attraction
electric field pulse that induces long-lived Rydberg of a single electron to a positively charged nucleus. The
states of atoms or molecules just below the ionization 1/n 2 dependence of the energy levels on integer quantum
continuum to decay by tunneling through the electric number n in one-electron systems yielded an exact fit to
field-induced barrier. the emission spectra of such atoms that had been cataloged
Photoionization Production of atomic and molecular at the end of the nineteenth century by Balmer, Rydberg,
ions from precursor neutrals by absorption of photons. Bohr, and Ritz. The convergence of such Rydberg series
Proton affinity (PA) Negative of the enthalpy change of at large n to energy levels of the corresponding ion pro-
the protonation reaction: vided a sharp definition of the ionization energy of an
atom.
M + H+ −→ MH+ , Hr×n = −PA.
The PA is a quantity defined at a finite temperature,
A. Ion Energetics
usually 298 K.
Quasi-Equilibrium theory (QET) Theory of mass spec- Once the fundamental principle was established that mat-
tra based on rapid internal conversion of excited elec- ter exists as a collection of positively charged nuclei
tronic states produced by electron impact to the ground surrounded by negatively charged electrons, many ex-
electronic state. Statistical redistribution of energy on perimental observations confirmed that ionized matter is
the-ground state surface, followed by vibrational pre- present throughout the Universe. In addition to “cathode
dissociation, determines the rate of fragmentation. rays” produced by thermionic emission from metals or by
REMPI Resonance-enhanced multiphoton ionization— the photoelectric effect, “positive rays” were also gener-
the ionization process whereby an atomic or molecular ated in gases. The identification of the latter species as
species absorbs two or more photons through a tran- positive ions from the fact that their deflections in elec-
sition from an intermediate resonant bound state to a tric and magnetic fields were opposite to those of cathode
bound state. rays occurred quickly. The same experimental techniques
R2PI Resonance two-photon ionization—a variant of of focusing and deflection of charged particles in elec-
REMPI in which two photons are required to produce tric and magnetic fields, in conjunction with improved
ionization, with the first photon accessing a resonant methods for producing a high vacuum, led to the devel-
bound intermediate state. opment of mass spectrometers, devices that sorted these
ZEKE Zero-kinetic energy electrons, produced by field charged particles according to mass. The early history of
ionization of excited Rydberg states just below the ion- mass spectrometry focused on the determination of atomic
ization limit to a particular internal energy state of the and molecular masses of ions formed by electron impact-
resultant ion. induced removal of one (or more) electrons from a stable
molecule. The use of incident electrons with reasonably
well-characterized energies also led to systematic infor-
I. HISTORY mation on the threshold ionization energies of atoms and
molecules, forming the basis for the determination of en-
At the beginning of the twentieth century, ground- thalpies of formation of gaseous ions. In the case of sim-
breaking experiments by leading scientists of the day, ple electron removal to form the parent ion, the resul-
most notably J. J. Thomson and Ernest Rutherford, de- tant energy is the ionization energy, while bond cleavage
termined that the electrical nature of matter is fundamen- occurring in conjunction with ionization, so-called dis-
tally a property associated discrete particles. Thomson’s sociative ionization, yields the appearance energy of the
discovery of the “electron” as the particle that carries neg- corresponding ionic fragment. The systematic study of
ative charge in matter and Rutherford’s determination that ionization and appearance energies for parent and frag-
the positive charge in atoms is localized in a very small ment ion formation to yield enthalpies of formation for
volume called the “nucleus” provided the foundations of gaseous ions forms the heart of the subject of “ion en-
the modern atomic theory. In an electrically neutral atom ergetics.” The electron volt (eV) is the most frequently
or molecule, the number of electrons balances the posi- employed unit of energy in studies of ionization phenom-
tive charge of the constituent nuclei. Positively charged ena and corresponds to the energy acquired by a single
species have a deficiency of electron charge relative to the fundamental charge when accelerated by an electrostatic
P1: GSR/MAG P2: GLM/GKP QC: FXY Final Pages
Encyclopedia of Physical Science and Technology EN008I-352 June 29, 2001 12:38
potential of one volt. The definition of the electron volt is reactions with ionic products observed in the radiation
as follows: 1 eV = 96.4845 kJ/mol or 23.06036 kcal/mol. chemistry of hydrocarbons. Pioneering work by Hamill
at the Notre Dame Radiation Laboratory suggested that
the ionic products formed in hydrocarbon mixtures irra-
B. Ion Kinetics
diated by ionizing radiation from the γ -emission of ra-
Concurrently with these studies of primary atomic and dioactive nuclei could be explained on the basis of simple
molecular ionization mechanisms in atoms and molecules ion-neutral collisions. It was the observation in 1958 by
under collision-free conditions, mass spectrometrists be- Field and Lampe that hydrocarbon cations were capable
gan to observe pressure-dependent phenomena in the ion of abstracting hydride (H− ) ions from neutral hydrocar-
sources of their instruments. One of the first such obser- bon collision partners that led Futrell to connect chem-
vations goes back to the beginning of the twentieth cen- istry occurring in γ -radiolysis with simple ion–molecule
tury, when J. J. Thomson discovered that operating his reactions. The determination of ionization potentials and
positive ray parabola apparatus in a hydrogen atmosphere appearance energies with instrumentation that developed
produced signals at a mass-to-charge ratio of 3. Although rapidly in the 1950s and 1960s led to additional studies of
he correctly attributed this signal to the species H3 , it was the reactivity and stability of gas-phase ions by pioneers
only in later high-pressure ion source mass spectrometer such as Franklin and Ausloos.
experiments by Hogness and Lunn that it was demon- The 1960s brought two major advances in the discipline
strated unequivocally that this species was produced by a of ion energetics and structure. The development of high-
reaction between the primary ionization product H+ 2 and pressure mass spectrometer ion sources led by Kebarle and
molecular hydrogen. This point of view concerning the co-workers allowed the measurement of equilibrium con-
rapidity of reactions between gaseous ions and molecules stants for gas-phase ionic systems at well-specified tem-
in the gas phase was supported by other pioneers in the peratures and, from temperature-dependent studies, the
emerging field of mass spectrometry, including Aston and enthalpy and entropy changes associated with gas phase
Dempster. equilibria. These concepts provided the foundation for cur-
Although theoretical work by Eyring, Hirschfielder, and rent studies of molecular recognition in the gas phase.
Taylor in the mid-1930s predicted that the rate of the ion– The development of the discipline of elementary re-
molecule reaction action dynamics as the foundation of chemical kinetics
also occurred in the 1960s. This subject addresses the
H+ +
2 + H2 −→ H3 + H (1)
fundamental question of energy disposal in a chemical
was limited only by the frequency with which the reactants reaction and the microscopic details of the collisions that
collide, a picture that was claimed to describe the rates of lead to reaction product formation by probing the features
many such ion–molecule reactions, systematic studies of of the potential energy surface (or surfaces) that governs
such rates were not carried out for another 15 years. The the conversion of reactants into products. The applica-
pioneering work of Tal’rose in Moscow on ion formation tion of the methods of collision kinematics and dynamics
in methane gas irradiated by electrons initiated the study exemplified in the molecular beam experiments of Datz
of ion–molecule reactions as a distinct discipline within and Taylor was a natural development, as the techniques
the field of chemical kinetics. In those studies, electron of mass spectrometry form the ionic reagents into a di-
impact ionization in methane produced CH+ 4 as a parent rected beam. The measurement of product energy and an-
ion and CH+ 3 by dissociative ionization. The bimolecular gular distributions that such an experimental geometry af-
reaction fords yielded a microscopic view of the kinds of collisions
that lead to chemical reaction. Some of the pioneering
CH+ +
4 + CH4 −→ CH5 + CH3 , (2)
applications of these techniques were accomplished by
occurring by a mechanism in which proton transfer from Mahan and co-workers at Berkeley, Herman in Prague,
CH+ +
4 to CH4 and hydrogen transfer from CH4 to CH4 are and Wolfgang at Yale, on collisions of atomic and diatomic
implicated, was observed to occur on every collision. In ions with hydrogen molecules.
addition, the carbon–carbon bond formation reaction By exploiting the state selectivity that ionization of
small molecules by ultraviolet radiation provided, ionic
CH+ +
4 + CH4 −→ C2 H5 + H2 (3)
collision dynamics made a central contribution to the
was also observed to occur at the collision frequency. emerging field of chemical dynamics. A key question in
Reactions (2) and (3) occurring in methane were also chemical dynamics is how energy that is placed in spe-
observed by Stevenson and co-workers in 1955 in the ion cific modes of reactant motion affects the outcome of a
source of a laboratory mass spectrometer and launched chemical reaction. The first example of a chemical reac-
an important era that connected elementary ion–molecule tant prepared in a quantized excited vibrational state and
P1: GSR/MAG P2: GLM/GKP QC: FXY Final Pages
Encyclopedia of Physical Science and Technology EN008I-352 June 29, 2001 12:38
the observation that energy in this form allowed the system of the ionization threshold to cause fragmentation of the
to surmount a potential energy barrier occurred in an ionic parent ion. The threshold energies associated with frag-
system. By using a monochromatic light source in the ul- ments A+ formed through the reaction
traviolet region of the spectrum, Chupka and co-workers
AB −→ A+ + B + e− (5)
were able to produce H+ 2 ions in selected vibrational states
by photoionization of molecular hydrogen. The use of are the appearance energies, AE, for a given fragment.
photoionization methods to produce vibrationally state- Appearance energy values result in thermodynamic infor-
selected reagents both for bimolecular collision studies mation only when the reaction coordinate has no barrier
and for unimolecular decay of energy-selected ions played in excess of the endothermicity for dissociative ioniza-
a central role in contemporary developments in ionic re- tion, and the products appear promptly on the time scale
action dynamics. Along with more sophisticated methods of product detection in a particular instrument, i.e., there
for reagent preparation and product state-resolved detec- is no “kinetic shift.”
tion came advances in the production of chemically more The terminology introduced above for the formation
complex reagents. The field of ion kinetics has provided of cations may also be applied to anions, which may be
benchmark data for testing theories of chemical reaction formed either through heterolytic bond cleavage in AB,
dynamics in elementary atom–diatom systems. The field
AB −→ A+ + B− , (6)
has also provided a wealth of studies on more complex pro-
cesses at the microscopic level, including organometallic or by electron attachment to a suitable neutral precursor
chemistry related to catalysis, cluster ion studies relevant A having a positive electron affinity,
to atmospheric and materials chemistry, the ion chemistry
A + e− −→ A− . (7)
of electrical discharges and plasmas (relevant to the micro-
electronics industry), and reactions of biological species Some classes of ionic decomposition processes occur
that provide useful information on the nature of confor- via pathways requiring that the system surmount a barrier
mation as a function of solvation. on the potential surface in excess of the endothermicity.
The majority of ionic species are formed by the removal Simple bond cleavage processes generally occur without
(or the addition) of an electron from (or to) a stable atom or such a barrier, but rearrangements often occur with such
molecule. As a result, ionic species are highly reactive. Be- a barrier. The determination of accurate thermochemical
cause the environment in which ionic species are created information from fragmentation threshold energies is al-
is often chemically complex, special techniques for the ways subject to the uncertainty that the enthalpy of reac-
preparation and handling of such transients are required tion may differ from the observed threshold energy. The
for reliable determination of ionization and appearance presence of a potential barrier for a fragmentation process
energies, energetic thresholds for chemical reactions, and may result in excess product translational energy release,
unambiguous measurements of chemical reaction cross and measurement of the kinetic energy release distribution
sections and rates. The general techniques of mass spec- (KERD) often allows an estimate of the barrier height.
trometry form the basis for experimental methods that pro- The term “kinetic shift” refers to experimentally
vide information on ion energetics and kinetics. observed ionization thresholds that are higher than true
thermodynamic threshold energies because of the finite
lifetime of the decaying species. Most experimental appa-
II. ION ENERGETICS ratuses sample ions at a specific, instrument-dependent
time after ionization has occurred, typically about
A. Spectroscopic and Ionization Methods 10−5 sec, so that ions decaying on a time scale long com-
in Isolated Atoms and Molecules pared to this characteristic observation time will not be
detected at threshold. This situation results in an apparent
1. Threshold Determination
threshold for dissociation that is shifted to higher values
The ionization threshold energy for the process of the internal energy of the decaying parent ion by the
magnitude of the “kinetic shift.” The kinetic shift thus de-
M −→ M+ + e− (4)
pends on the time scale for observation and the sensitivity
is the minimum internal energy change accompanying the of the instrument for detecting low signal levels for ion
removal of an electron from a stable neutral precursor M. formation. Measuring the decay rate as a function of the
The term ionization energy generally refers to the adia- parent ion internal energy provides a way to correct experi-
batic ionization energy, denoted IEa . The species M+ is mental data for this kinetic shift. Time-resolved ionization
called the parent ion. When M is a molecular species, the methods employing pulsed ion production methods also
ionization process may impart sufficient energy in excess provide a way to estimate the kinetic shift.
P1: GSR/MAG P2: GLM/GKP QC: FXY Final Pages
Encyclopedia of Physical Science and Technology EN008I-352 June 29, 2001 12:38
The most widely used technique for determination of Doppler broadening. These refinements make the deter-
ionization and appearance energies measures the mini- mination of ionization energies quite precise, in the fa-
mum energy required to form a parent or fragment ion vorable case where the neutral and ionic ground states
from its precursor neutral or to detach an electron from a do not involve significant geometric rearrangements. As
negative ion. Ionization of a given species may be effected with electron impact ionization, the Franck–Condon prin-
by absorption of a photon, by collision with electrons hav- ciple predicts that a large change in geometry upon single-
ing kinetic energy in excess of the ionization threshold, or photon absorption will result in an ionization threshold
by interaction with excited atoms or molecules having ex- that is gradual, making the exact ionization threshold dif-
citation energies in excess of the ionization energy of the ficult to locate accurately. In the case where ionization
neutral. Such measurements can be carried out by deter- is direct, the narrow bandwidth of modern photoioniza-
mining the coincidence between the ions and the electrons tion sources often provides a means to achieve the pro-
formed. duction of molecular ions with vibrational state selection.
The venerable technique of electron impact is the most At the lowest ionization threshold, ions are formed in the
widely employed in mass spectrometric determination of ground vibrational state. As the frequency of the photoion-
ionization and appearance energies. The development in ization radiation is increased, thresholds for the formation
the 1960s of electron monochromators, in which electron of molecular ions with vibrational excitation are encoun-
beams with kinetic energy spreads of as little as 0.005 eV tered, and the photoionization efficiency curve shows step-
are generated by passing the beam through deflection- function increases in ion production as excited vibrational
type electron energy selectors, led to important advances states become accessible. In the favorable case of pho-
in measuring ionization and appearance potentials. The toionization of H2 , rotational state selection of the resul-
incident electron excites the molecular precursor through tant molecular ions can also be achieved. At the threshold
an energy transfer collision in a vertical Franck–Condon for the formation of a molecular ion in a given excited
transition to excited electronic states. The Franck–Condon vibrational state, the product ion distribution will contain
principle states that the most probable ionizing transition contributions from all lower vibrational states that are en-
will be the one in which the positions and momenta of the ergetically accessible, degrading the quantum state purity
nuclei do not change. When the equilibrium geometries of the parent ions. In addition, the simple step-function
of a neutral precursor and its corresponding parent ion are behavior that accompanies direct ionization may be com-
similar, the energy dependence of the ionization threshold plicated by autoionization processes, in which the initial
will be a sharp step function, forming the ground vibra- optical excitation is to an excited Rydberg state of the neu-
tional state of the ion. However, when the equilibrium ge- tral molecule. As discussed below, energy analysis of the
ometry of the ion is significantly different from that of the electrons that accompany the ionization process can sort
neutral, e.g., is accompanied by a significant change in one out the details of such complex ionization processes.
or more bond lengths or angles, the transition to the lowest The rapid development of intense laser sources has
vibrational level of the ion is no longer the most proba- led to the development of ionization methods based on
ble one. The vertical ionization energy corresponding to the absorption of multiple photons. Ionization energies
the maximum transition probability will populate excited for molecules have been measured using resonant two-
vibrational levels of the ion preferentially. The geometry photon ionization (R2PI) or resonance-enhanced multi-
change upon ionization can be sufficiently large that ion- photon ionization (REMPI) of rotationally cooled species
ization to produce the lowest vibrational level of the ion in a molecular beam. In such studies, the internally cooled
at the adiabatic ionization energy will not be observed. beam of molecules is excited to a specific quantum state
In the mid-1960s, the availability of tunable ultraviolet with a tunable laser; while holding this excitation energy
radiation laboratory sources led to the replacement of elec- constant, a second, independently tunable laser ionizes the
tron beams with photon sources. Rare gas resonance lamps beam of excited molecules. By tuning the first excitation
producing continuum sources of radiation that could then laser to different transitions, the entire Franck–Condon
be passed through monochromators allowed the field of accessible region of the intermediate electronic state is
photoionization mass spectrometry to develop. The deter- probed, yielding the adiabatic ionization energy. The ex-
mination of photoion yield as a function of ionization en- periment thus contains an internal consistency check that
ergy, the photoionization efficiency curve, led to determi- leads to more reliable values of ionization threshold. The
nation of ionization potentials with accuracies exceeding production of ions by multiple photon absorption is com-
those of electron impact methods. Modern photoioniza- plicated by the loss of information on the Franck–Condon
tion experiments often utilize laser or synchrotron light factors for the initial photon absorption steps. However, in
sources with narrow bandwidths and may employ colli- cases where the ionization is an (n + 1)-photon process,
mated molecular beam sources that reduce the effects of with the first n photons providing access to a Rydberg
P1: GSR/MAG P2: GLM/GKP QC: FXY Final Pages
Encyclopedia of Physical Science and Technology EN008I-352 June 29, 2001 12:38
state of the precursor neutral, the final photon absorption spectral resolution of conventional photoelectron spec-
process often occurs from a specific vibrational state of troscopy. When the full energy of the photon is ab-
the neutral to an ion with the same vibrational quantum sorbed in the optical transition leading to ionization, the
number. Under such conditions, vibrational state selection ejected electron has zero kinetic energy. As the photon
can be achieved by multiphoton absorption. The method energy is increased beyond an ionization threshold, the
has been applied successfully in the production of NH+ 3 atom or molecule continues to absorb light and electrons
cations with specific excitation in the ν2 umbrella-bending are ejected with nonzero kinetic energy. The resulting
mode. “hot” electrons can be discriminated from the zero ki-
The photoionization process is accompanied by the netic energy electrons by the fact that they are moving
emission of electrons. The technique of photoelectron in the center of mass coordinates, and the electrons with
spectroscopy is based on a measurement of the kinetic zero kinetic energy are stationary in this frame of ref-
energies of those electrons, either independently of the erence. The zero-kinetic energy-threshold electrons re-
ion formation process or in coincidence with it. In photo- main in the interaction region, and a small electric field
electron spectroscopy, the precursor atom or molecule is extracts them through a small aperture or a restricted
irradiated with light of a known energy sufficient to eject channel.
an electron from the material through the process An important variant of the threshold photoelectron
spectroscopy technique arises from a consideration of the
M + hν −→ M+ + e− (KE), (8)
nature of the Rydberg states of the precursor neutral that
where converge to the ground state of the ion, defining the ion-
ization energy. Additionally, each electronic, vibrational,
KE(e− ) = hν − I − E ∗ (vib,rot). (9)
and rotational (rovibronic) state of the ion has a unique
In this expression, I is the binding energy of the electron Rydberg series converging to that state of the ion. While
and E ∗ (vib,rot) is the internal energy of the product ion conventional TPES focuses on the zero-kinetic energy
M+ . The technique may also be employed to determine electrons in the ionization continuum, i.e., KE(e− ) → 0+ ,
the electron affinities of atomic or molecular anions. the technique of ZEKE spectroscopy is concerned with
Conventional photoelectron spectroscopy, invented by the detection of Rydberg states in a very narrow band just
Turner and Siegbahn, developed through the application below the ionization limit of each robronic state of the
of the helium or neon resonance lines (21.218 and 40.813 ion. These long-lived states are separated from the fast-
or 16.848 and 16.671 eV, respectively) from discharge decaying low-n Rydberg states by a simple time delay in
lamp photon sources. The development of synchrotron the detection process. The usual method for carrying out
radiation sources as well as tunable vacuum ultraviolet this delayed ionization is to apply a pulsed electric field of
laser sources has also enhanced the utility of photoelec- magniture ∼1 V/cm approximately 10 µsec after the pri-
tron spectroscopic methods. In an experiment, the ejected mary excitation step and after the “prompt” electrons from
electrons will be produced with a distribution of kinetic short-lived states have moved away. The small applied
energies degermined by the distribution of energy levels in field then ionizes the long-lived Rydberg states by field
the M+ ion product. The energy distribution of the emit- ionization. The lifetimes of such high-n Rydberg states are
ted electrons is called the photoelectron spectrum. The observed to be nearly two orders of magnitude longer than
photoelectron band shapes reflect not only the vibrational expected and arise from external perturbations that operate
energy level structures in the ground and excited elec- only on the Rydberg states lying extremely close to the ion-
tronic states of the ion, but also the M+ ← M transition ization threshold. The acronym “ZEKE,” referring to the
probabilities dictated by Franck–Condon factors. When detection of these zero-kinetic energy electrons, is com-
the equilibrium geometries of the ion and the correspond- monplace, although the term “ZKE” has also been used
ing neutral are similar, the observed threshold of the first in the literature, more correctly referring to the zero-field
photoelectron band generally yields the adiabatic ioniza- production of threshold electrons. The use of a pulsed elec-
tion energy. The ionization energies are thus determined tric field is referred to as “pulsed field ionization” (PFI).
with an energy resolution characteristic of the resolving ZEKE spectroscopy differs from other methods of pho-
power of deflection-type energy analyzers, or as low as toelectron spectroscopy because it is a multistep process.
0.005 eV. On an energy scale appropriate to optical mea- The successive steps are required to isolate the narrow
surements, this resolution corresponds to 40 cm−1 , signif- band of long-lived high-n Rydberg neutrals from the back-
icantly poorer than the subcentimeter value afforded by ground hot electrons and their counterions from the low-n
narrow-bandwidth optical sources. Rydberg states. The ions produced in coincidence with
Threshold photoelectron spectroscopy (TPES) was de- ZEKE electrons can also be detected, forming the basis for
veloped to overcome the inherent limitations in the mass-analyzed threshold ionization (MATI) spectroscopy.
P1: GSR/MAG P2: GLM/GKP QC: FXY Final Pages
Encyclopedia of Physical Science and Technology EN008I-352 June 29, 2001 12:38
Owing to the kinematic restriction that results from by summing the enthalpy of formation of the precursor
detecting the ion, this method has an intrinsically lower molecule to the adiabatic ionization energy (IEa ) and sub-
energy resolution than detecting the corresponding ZEKE tracting the enthalpy of formation of the electron,
electron. However, that disadvantage is offset by the de-
f H 0 (M+ ) = f H 0 (M) + IEa − f H 0 (e− ). (10)
sirability of producing mass-selected ions with a known
vibrational excitation. The analogous expression for an anion requires the value
For the purpose of producing ions with well- for the electron affinity:
characterized levels of vibrational excitation, as well as
for studying the thermochemistry of ionic fragmentation f H 0 (M− ) = f H 0 (M) − EA + f H 0 (e− ). (11)
processes, the coincident detection of photoions and their Similarly, the enthalpy of formation of a positive fragment
corresponding electrons yields a powerful combination. ion A+ with appearance energy AE formed by dissociative
The technique known as photoelectron–photoion coinci- ionization of precursor molecule AB is given by
dence (PEPICO) correlates ejected electrons formed with
zero kinetic energy with their corresponding positive ions. f H 0 (A+ ) = f H 0 (AB) − f H 0 (B) − f H 0 (e− ) + AE.
Below the dissociation threshold, this method provides a (12)
way to produce parent ions with vibrational state selec- As we have already noted, this expression assumes that
tion. At energies where parent ions undergo dissociation the dissociation reaction coordinate has no barrier in ex-
to form one or more fragment ions, one measures the rel- cess of the dissociation energy and that the dissociation
ative probabilities for the formation of the daughter ions lifetime is rapid compared with the time scale of measure-
from parent ions with a well-specified internal energy, a ment in the specific instrument. The molecular quantities
quantity called the “breakdown diagram.” The ions can be of ionization energy, electron affinity, and appearance en-
detected at differing times after the ionization event for ergy refer to differences in zero-point energy levels of the
determination of the time dependence of the dissociation appropriate potential energy surfaces of reactants or prod-
process. By measuring the time-of-flight distributions for ucts and are, thus, internal energies. Their conversion to
state-selected ions, a detailed analysis of the lineshapes the corresponding enthalpies at finite, nonzero absolute
can yield the kinetic energy release distributions for state- temperatures requires correction factors that depend on
selected fragmentation processes occurring on the time how the free electron enthalpy is accounted for. The en-
scale of transit through the instrument. thalpy of formation of a chemical species is defined as
Photoelectron spectroscopy can also be carried out by the difference between the enthalpy of the compound and
measuring the distribution of flight times of photoemitted the sum of the enthalpies of the elements comprising it.
electrons. This method is useful in systems in which the In the case of a cation or anion, the balanced chemical
absorbing species are produced by a pulsed source and the reaction corresponding to its formation from the elements
photolysis radiation is also pulsed. These electron time of will also contain the electrons that are liberated or cap-
flight methods have been used to elucidate structures of tured in the ionization process. The enthalpy of forma-
transient species such as free radicals and clusters pro- tion of the free electron must therefore also be accounted
duced in pulsed photolysis sources and in assessing the for in the ionic enthalpy of formation. Although the nu-
vibrational-state purity of ions produced in multiphoton merical value of the enthalpy of formation of a given ion
ionization processes, particularly those in which the final should be independent of the convention whereby one ac-
photon absorption process is from a Rydberg state whose counts for the free electron enthalpy, two distinct pro-
geometry is similar to that of the ion. cedures for dealing with this issue have evolved in the
literature.
The computation of an enthalpy change at a finite tem-
2. Ion Enthalpies of Formation perature relative to the 0 K value requires the evaluation
from Threshold Energies of the following expression:
The methods for determining ionization and appear- T
ance energies under collision-free conditions as described H 0T (K) = H00K + CP (T ) dT . (13)
above must be employed according to well-documented 0
procedures of chemical thermodynamics to yield the ap- The evaluation of this expression for an ionization pro-
propriate enthalpies. The conversion of the internal en- cess, in which an electron appears explicitly as a reactant
ergy changes for these primary ionization processes to or product, requires a consistent approach. In the “Electron
enthalpies of formation requires that attention be paid to Convention,” used predominantly in the thermodynamics
a number of important principles and conventions. The community, and the “Ion Convention,” employed in the
enthalpy of formation of a molecular cation is obtained ion physics/chemistry communities, important differences
P1: GSR/MAG P2: GLM/GKP QC: FXY Final Pages
Encyclopedia of Physical Science and Technology EN008I-352 June 29, 2001 12:38
arise in the manner by which the integrated heat capacities Convention, the enthalpies of formation of cations and
of the elements are used to compute enthalpies at nonzero anions reduce to
absolute temperatures. The more widely used Electron
Convention defines the enthalpies of formation of ele- f H 0 (M+ )298 K = f H 0 (M)298 K − f H 0 (e− )298 K
ments in their standard states to be zero at all tempera-
tures. Under such a convention, the integrated heat ca- + IEa + C, (18)
pacities of the elements must be accounted for implicitly = f H 0 (M)298 K + IEa , (19)
in other parts of the calculation. The Ion Convention is
based on the definition of the enthalpy of formation of a and
given element at temperatures above absolute zero to be
equal to the integrated heat capacity of that species. At f H 0 (M− )298 K = f H 0 (M)298 K + f H 0 (e− )298 K
the zero of absolute temperature, the enthalpies of forma-
tion of a given ion are identical in both conventions. Be- − EA + C, (20)
cause standard tabulations of thermochemical quantities = f H 0 (M)298 K − EA. (21)
have historically used both conventions, there is a need
to define the conventions with clarity and caution users Note that in both expressions, the equality of C and the
of such data to be aware of the conventions. In the Elec- enthalpy of formation of the electron at temperature T
tron and Ion conventions, at 0 K the enthalpy of formation (=298 K) results in expressions that have no dependence
of the electron is zero and the enthalpies of formation of on the integrated heat capacity of the electron.
the ions are exactly equal to the 0 K enthalpy of forma- The calculation of C, the integrated heat capacity of
tion of the precursor molecule plus the ionization energy the electron, is based on treating the free electrons as an
of M. The following equations treat anions and cations ideal gas and computing the heat capacity according to
consistently: Boltzmann statistics. At 298 K, the heat capacity at con-
f H 0 (M+ ) 0 K = f H 0 (M) 0 K + IEa , (14) stant pressure is that of an ideal gas, C = 5/2 RT, or
−
6.197 kJ/mol. This computation allows data tabulated un-
f H (M ) 0 K = f H (M) 0 K − EA.
0 0
(15) der the Electron Convention to be rationalized with Ion
At temperatures above 0 K, e.g., 298 K, where many Convention data as follows:
standard thermochemical data are reported, the Electron
and Ion conventions differ in the following manner: in the f H 0 (M+ )298 K (IC) = f H 0 (M+ )298 K (EC)
Electron Convention, the electron is treated like a stan-
− 6.197 kJ/mol, (22)
dard chemical element, and its enthalpy of formation is
− −
constrained to be zero at all temperatures. However, the f H (M )298 K (IC) = f H (M )298 K (EC)
0 0
B. Measurement of Equilibrium Constant and viscous flow, diffusion of neutral species leads to
Methods, Keq mass discrimination effects. Other systematic errors arise
from clustering of neutral molecules to the ions at low
The preceding methods may be characterized as collision-
temperatures and pyrolysis of the ions at high tempera-
less, i.e., they determine the properties of isolated atoms
tures. The high level of development of ab initio quantum
or molecules in the gas phase. In contrast, ion/molecule
chemistry calculations has resulted in reliable estimates
equilibrium processes maintained by collision and char-
of equilibrium geometries of polyatomic molecular ions,
acterized by a specific temperature
from which vibrational frequencies and rotational con-
−−−→
A± + B ←−−− C± + D (24) stants yield absolute entropies that rival or surpass the
accuracy of experimental determinations or provide valu-
may be established in a high-pressure mass spectrometer, able consistency checks.
flow tube, or ion cyclotron resonance spectrometer. By Thermochemical scales derived from equilibrium con-
determining the mass action equilibrium constant through stant determinations are relative values, and absolute as-
measurement of the relative abundances of the two ions A± signments for appropriate quantities require reliable cal-
and C± after equilibrium has been established by a large ibration values of ionization energy, electron affinity, or
number of collisions, the thermodynamic state variables proton affinity, for example. Moreover, because of the in-
for chemical reaction may be obtained. K eq is defined by terrelationships among the thermochemical data for struc-
the following expression: turally similar molecules, the scale must be evaluated as a
K eq = [C± ][D]/[A± ][B]. (25) whole, not just for individual molecules, and must demon-
strate internal consistency among the variables G, H ,
The neutral reactants B and D are present in great excess and S.
compared to the ionic reactants, yielding a ratio [D]/[B] A particularly important application of equilibrium
that does not change as equilibrium is established. A single methods in mass spectrometry arises in the phenomenon
measurement of K eq yields a value for the Gibbs free en- of “chemical ionization.” In analytical mass spectrom-
ergy change of reaction at the temperature of the measure- etry, the extensive fragmentation that accompanies
ment, while a series of measurements at different temper- the ionization of large molecules in electron impact
atures permits an experimental evaluation of the entropy ionization is a serious problem. The distribution of ions
and enthalpy changes associated with the reaction through in a high-pressure ion source at thermal equilibrium is
the relation illustrative of the so-called “soft” ionization techniques
that have been developed to circumvent the problem of
−RT ln K eq = G 0 = H 0 − T S 0 . (26)
fragmentation when a Franck–Condon distribution of
Accurate knowledge of the temperature is essential for species is created by electron impact or photoionization.
a correct application of equilibrium methods. Equilib- The chemical ionization (CI) method employs a chemical
rium measurements carried out in widely different pres- reaction between ions generated from a reactant gas and
sure regimes have been able to reproduce relative spectro- the species to be ionized to generate primary ions. The
scopic ionization energies quite reliably, demonstrating reactant gas in CI is usually chosen to be a species such
that equilibrium may be achieved readily in such experi- as CH4 or H2 which may be ionized by electron impact
ments. Such measurements produce only relative thermo- followed by reactions with the neutral gas that lead to
chemical data; consequently, the resulting thermochem- protonated species such as CH+ +
5 and H3 . These species
ical “ladders” must be calibrated to reliable comparison may then undergo proton transfer reactions with the
standards to yield absolute values. species M to be ionized. Such reactions generally lead
Extracting the more useful H 0 values from G 0 mea- to MH+ ions in thermodynamic equilibrium at the ion
surements requires determination of Third Law entropies, source temperature with very little fragmentation.
through either temperature-dependent methods, i.e., van’t
Hoff plots,
1. Proton and Electron Transfer Equilibria
d ln K eq −H 0
= , (27) Three important classes of gas-phase ion–molecule re-
d(1/T ) R action schemes demonstrate the value of thermochemi-
or applications of statistical mechanical methods. Sys- cal data deduced from equilibrium systems. The first of
tematic errors in such estimates arise from inaccurate these concerns the derivation of extensive scales of relative
determination of the reactive neutral pressure in pulsed proton affinities, gas-phase acidities, and electron affini-
high-pressure mass spectrometers. When such an instru- ties. These results derive primarily from measurements of
ment operates in the transition regime between molecular enthalpy changes for proton or electron transfer reactions:
P1: GSR/MAG P2: GLM/GKP QC: FXY Final Pages
Encyclopedia of Physical Science and Technology EN008I-352 June 29, 2001 12:38
interest because of the fact that it binds K+ selectively over 1. Ion-Molecule Bracketing Experiments
Rb+ , Cs+ , and Na+ (in that order) in an aqueous environ-
Equilibrium in the ion source of a mass spectrometer
ment. In contrast, the gas-phase bond energies reported by
cannot be achieved in certain ion–molecule systems be-
Armentrout, using the thermokinetic method of collision-
cause of rapid competitive reactions or because one of the
induced dissociation, for the various alkali metal cations
species in the equilibrium of interest is an unstable species
with 18c6 clearly indicate that Na+ is most strongly bound,
such as a free radical. In such cases, it is sometimes pos-
as one expects from charge density models of electrostatic
sible to obtain an experimental estimate of the enthalpy
binding. A consideration of the competition between gas-
change of a particular reaction (charge transfer, proton
phase solvation of the cations by water and complexation
transfer, etc.) by use of a technique known as “bracket-
by the crown ether,
ing,” in which the ion of interest is reacted with a series of
M+ (H2 O)x + 18c6 −→ M+ (18c6) + xH2 O, (33) molecules selected to provide a range of values for the rele-
+ vant thermochemical parameter of interest, e.g., ionization
shows that complexation of Na by 18c6 is favored until
energy, electron affinity, gas-phase basicity, or acidity. Re-
the number of displaced water molecules corresponds to a
action is presumed to occur for exothermic processes and
complete solvent shell. At this point, K+ selectively binds
not to occur for endothermic processes.
to the crown ether, as observed in solution. In addition, for
For example, the proton affinity of a species B can be
all metal cations, the displacement reaction that liberates
“bracketed” between the know proton affinity of reference
six water molecules but binds a single 18c6 species is en-
compound A and that of reference compound C if the
tropically driven. Overall, this comparison demonstrates
following observations are made:
that the selective binding of K+ by 18c6 in aqueous me-
dia is a subtle balance of the solvation and complexation no reaction
BH+ + A −−−−→ AH+ + B, (34)
enthalpies, entropic effects, and the solvation of the resul-
fast reaction
tant complex. The results are consistent with a picture in BH+ + C −−−−−→ CH+ + B. (35)
which such selectivity is not an intrinsic property of the
metal cation–crown ether complex but depends critically The notation “no reaction” means either that the reaction
on solvation. This point of view makes it clear that a proper products are not observed or that the reaction efficiency,
understanding of molecular recognition in such systems i.e., the rate relative to the Langevin collision rate, is low.
must include details of the solvent itself. The observations lead to the relative ordering PA(A) <
PA(B) < PA(C).
C. Thermokinetic Methods
2. The Kinetic Method
In addition to the two general methods described above for
determination of thermochemical parameters, i.e., ioniza- The kinetic method is an approximate scheme to determine
tion under collision-free conditions and free energy mea- relative thermochemical properties based on the rates of
surements in systems at equilibrium at a well-specified competitive dissociation of mass-selected cluster ions. As
temperature, a third set of methods exists for determina- an example, consider the proton-bound dimer system:
tions of reaction enthalpies and enthalpies of formation. k1
These methods are collectively referred to as “thermoki- B1−H+−B2 −→ B1 H+ , (36)
netic” measurements and are based on the relative rates or k2
−→ B2 H+ , (37)
cross sections for the formation of reaction products under
single-collision conditions. The “ion bracketing method” with rate constants for competitive dissociation as indi-
puts constraints on the enthalpy of formation for a spe- cated. In its simplest form, the kinetic method deals with
cific species by observing the occurrence or nonoccur- systems in which the structures of B1 and B2 are suf-
rence of reactions of that species with known collision ficiently similar that it is reasonable to assume that the
partners. The collision-induced dissociation method deter- entropic factors in both decay processes are equal. With
mines bond energies in polyatomic ions by observing the the additional assumption that the dissociation processes
threshold energy at which collisions with a rare gas atom occur with no barriers in excess of the endothermicities,
fragments the parent ion. The “kinetic method” evaluates the following relationship holds:
the difference in enthalpies of reaction of two competi-
k1 [B1 H+ ] (H )
tive reaction channels arising from decay of a common ln = ln +
≈ , (38)
precursor, usually a cluster ion. Each of these classes of k2 [B2 H ] RTeff
thermokinetic methods is based on underlying assump- where Teff is the effective temperature of the activated
tions that restrict the validity of a given method. proton-bound dimer. Perhaps the most difficult issue
P1: GSR/MAG P2: GLM/GKP QC: FXY Final Pages
Encyclopedia of Physical Science and Technology EN008I-352 June 29, 2001 12:38
associated with the application of this method is the es- cessibility of methods for performing accurate ab initio
timation of the effective temperature, which describes a quantum chemical calculations has led to reliable esti-
Boltzmann distribution of activated dimer ions which frag- mates for the vibrational frequencies and rotational con-
ments to give the same fragment ion abundance ratio as stants of decaying molecules, as required for evaluation
observed for the non-Boltzmann population sampled in of rate constants computed according to statistical Rice–
the experiment. The assignment of Teff is made especially Ramsperger–Kassel–Marcus (RRKM) theory.
difficult because the distribution it describes is time depen- Threshold CID measurements have proven to be valu-
dent, a function of the observation time characteristic of able in determining thermodynamic information. By vary-
a particular instrument. Since the method determines dif- ing the kinetic energy of the AB+ ions in reaction (39) and
ferences in enthalpies of reaction, calibration data based observing A+ products as a function of incident energy, the
on the ion ratios of several species with energetics known threshold energy for CID can be determined, yielding the
from other methods, such as ion/molecule equilibria, are bond energy of AB+ . The accuracy of this method requires
necessary for absolute data determinations. The method that the probability for transferring all translational energy
may be applied to reactions other than relative proton affin- into the internal energy of the AB+ molecule is finite and
ity determinations, including electron affinities, ionization nonzero and that there is no activation energy in excess of
energies, gas-phase acidities, and cation affinities. In fa- the bond energy along the reaction coordinate for dissoci-
vorable cases, entropy changes associated with attachment ation. Accurate threshold determinations require attention
of an ion to a molecule may be determined from kinetic to the role of multiple collisions with the Rg collision gas,
method data, yielding information on bonding and frag- the effect of the internal energy content of the AB+ re-
mentation mechanisms in cluster ions, especially those of actant, and the metastability of the activated AB+ species
biological origin. on the timescale for detection. The generality of the
methods has been demonstrated through its application
to systems such as small organic molecules, solvated hy-
3. Collision-Induced Dissociation
dronium ions and metal ions, organometallic complexes,
This collisional method obtains bond energies by mea- atomic clusters, and metal–crown ether complexes.
suring the kinetic energy threshold for the dissociation of
AB+ ions, represented schematically as follows:
D. Sources of Evaluated Data
AB+ + Rg −→ A+ + B + Rg. (39)
The foregoing discussions of methods for determining
Formally, this process may be thought of as a superpo- thermochemical properties of gas-phase ions have focused
sition of bimolecular activation and unimolecular decay. on specific experiments that yield a particular observ-
The bimolecular step occurs in the collisional activation able parameter. New experimental methods that improve
of an AB+ ion above one (or more) of its dissociation the accuracy and precision of thermochemical determina-
thresholds; A+ fragment ions appear as products of the tions, as well as the fact that conventions for expressing
unimolecular decay of the activated ion. In the process such information change with time, make up-to-date criti-
noted above, Rg is often a rare gas atom. cal data compilations of prime importance for consumers.
A consideration of collisional dissociation near thresh- The National Institutes of Standards and Technology
old requires an understanding of the amount of energy (NIST) has continued its historical tradition of maintain-
transferred in a single encounter, the role of angular mo- ing a critical compilation of data, estimating error lim-
mentum in the decay of activated molecules, and the life- its in appropriate cases. The NIST compilations focus on
times for decay of such species. Accurate determination experimental determinations, but the importance of high-
of apparent thresholds has been achieved by modeling the quality ab initio calculation of enthalpies of formation of
threshold behavior of the system with a cross-sectional ionic compounds should not be ignored.
form as follows: As an example of this last point, the understanding of
the structure of the seven-membered cyclic tropyl ion,
σ (E) = σ0 gi (E − E i − E 0 )n E, (40)
c-C7 H+ 7 , is particularly illustrative of the value of com-
where the summation extends over internal states labeled putations. Neither the tropyl ion, nor the isomeric ben-
by the index i and having degeneracy gi , σ0 and n are zyl, C6 H5 CH+ +
2 , and tolyl, C6 H4 CH3 , ions are stable in
adjustable parameters, E is the relative kinetic energy, the neutral precursor forms, forming in the gas phases
and E 0 is the 0 K reaction threshold. only through ionic dissociation reactions. Numerous re-
The dissociation rates of activated molecules close to actions produce the c-C7 H+ 7 ions, among them hydro-
threshold may be long in comparison with the observa- gen atom loss from toluene and halogen atom loss from
tion time window, leading to large kinetic shifts. The ac- benzyl halides and halotoluenes. The tolyl and benzyl ion
P1: GSR/MAG P2: GLM/GKP QC: FXY Final Pages
Encyclopedia of Physical Science and Technology EN008I-352 June 29, 2001 12:38
structures can be produced at their thermochemical thresh- microcanonical equilibrium is achieved, and subsequent
olds, yielding well-established heats of formation. How- vibrational predissociation leads to fragmentation. The
ever, the tropyl ion is formed only through a process with breakdown diagram, the graph of fragment ion abundances
a substantial barrier. Thus, the only means for obtaining as a function of the internal energy of the ion, is computed
its energy directly is through ab initio quantum chemical from QET rate constants, and the mass spectrum is the
calculations, which have been carried out with the Gaus- convolution of the energy deposition function for the ion-
sian G2 methodology. The most current online source of ization process, e.g., the Franck–Condon factors for the
thermochemical data may be found in the NIST Web- primary ionization, with the breakdown diagram. QET has
book (http://webbook.nist.gov). In particular, the web site enjoyed considerable success in interpreting mass spectra.
http://webbook.nist.gov/chemistry/ion-ser.htm provides a The QET is formally identical to the Rice–Ramsperger–
searchable database for enthalpies of formation, electron Kassel–Marcus (RRKM) theory of unimolecular decay, in
affinities, ionization energies, gas-phase acidities and ba- which the rate constant for dissociation to reaction prod-
sicities, and appearance energies. The NIST data collec- ucts of an energized species with total angular momentum
tion includes only data derived from experimental deter- J and internal energy E ∗ over a barrier of E 0 is given by
minations, although the evaluation of the reliability of the the following relation:
data has been enhanced by using computational results,
σ N + (E ∗ − E 0 , J )
particularly for the proton affinity tabulation. k(E ∗ ) = . (41)
hρ(E ∗ , J )
species. The sum of the long-range attractive electrostatic below the LGS limit. The interpretation of such slow re-
potential and the repulsive 1/r 2 centrifugal potential re- actions has been the subject of numerous investigations.
sults in a barrier as the reactants approach. For every col- Brauman and co-workers have proposed a model of ionic
lision for which the relative kinetic energy exceeds this reactions that proceed on potential energy surfaces with
centrifugal barrier, reaction is assumed to occur. For an two minima separated by a barrier lower than the entrance
ion of charge e approaching a neutral collision partner channel energy. A schematic surface with these features
with polarizability α, the long-range potential behaves as is shown in Fig. 1. As the reagents approach under the
1/r 4 and the Langevin capture cross section is given by influence of the long range attraction, they enter the first
1/2
well, the electrostatically bound encounter complex. For
2α reaction to occur, this complex must undergo an atom or
σLangevin = πq , (42)
µE T functional group transfer over a barrier to a second bound
where the relative collision energy is E T and the reduced species that is a precursor to products. When this barrier
mass of the approaching reactants is µ. The thermal aver- height is below the energy of the approaching reagents,
age of this cross section over the relative speed distribution the barrier is not an energetic bottleneck to reaction, but
at a fixed temperature yields the Langevin–Gioumousis– the low density of states for crossing the barrier may re-
Stevenson (LGS) rate constant expression, duce the rate for this process below that for decomposition
of the initial encounter complex back to reagents. When
1/2
α the branching ratio for decay of this encounter complex
kLangevin = 2πe , (43) favors production of the reactants, the reaction efficiency
µ
drops significantly below the LGS prediction. By model-
a result that is independent of temperature. For neutral col- ing complex formation and decomposition reaction rates
lision partners with permanent dipole moments, the long- using statistical RRKM theory and computing reaction ef-
range potential also has a charge-dipole term that behaves ficiencies as a parametric function of the intermediate bar-
as r −2 cos θ, where θ is the angle between the relative rier height, one can determine barrier energies for many
velocity vector and the axis of the dipole. Several mod- potential surfaces, correlating them with structural fea-
els have been proposed to account for interaction of the tures of the reactants.
charge with the dipole, the most successful of which is the A particularly important reaction that has been exam-
average dipole orientation (ADO) model. This model pos- ined in the context of the double minimum potential model
tulates that the electric field of the approaching ion does is the gas-phase nucleophilic substitution reaction, written
not fully quench the rotational motion of the dipole and symbolically as
results in a partial locking of the dipole orientation. The
ADO cross section is given by the following expression: X− + CH3 Y −→ Y− + CH3 X, (45)
FIGURE 1 Double-minimum reaction coordinate showing formation of complex A with rate k1 . The sparse level
density for crossing the intermediate isomerization barrier to B with rate constant k2 makes this rate low in comparison
with nonreactive decay to reactants, with rate constant k−1 .
of internal energies and is injected into a reverse geometry arly important testing ground for such concepts. Exper-
sector mass spectrometer. To be detected in the time win- imental methods employing liquid helium-cooled drift
dow of these experiments, the metastable ion Cl− ·CH3 Br tubes, Penning traps, and supersonic expansions such as
must be formed with internal energies between about 0.4 the CRESU (Cinetique de Reaction en Ecoulements Su-
and 0.5 eV. The lower limit corresponds to the height of personiques Uniformes) wind tunnel instrument have al-
the isomerization barrier, while ions produced with inter- lowed studies of ion-neutral interactions at temperatures
nal energies of more than 0.5 eV decay on a time scale as low as 3 K. At these temperatures, radiative association
very fast in comparison with the 10-µsec transit time to and three-body association reactions become important,
the second field-free region of the mass spectrometer. Ions as well as bimolecular processes. Recent low-temperature
with this distribution of internal energies yield the prod- studies of such collisional processes include the reaction
ucts Br− + CH3 Cl. The data also allow a determination of N+ with molecular hydrogen and its isotopomers. Al-
of the kinetic energy release distribution (KERD) for the though the magnitudes of the rotational energies of the
reaction, thus providing a test of the statistical assump- neutral reactants, the spin-orbit electronic energies of N+ ,
tion inherent in RRKM or phase-space treatments of the and the zero-point energy differences among H2 , HD, and
dynamics. D2 are fairly small, they are significant compared to the
magnitude of the collision energy at low temperatures
and provide sensitive tests of quantum theories of capture
2. Low-Temperature Dynamics
processes.
At very low temperatures, quantum mechanical effects Such quantum capture theories in three dimensions have
that classical electrostatic capture models fail to account been developed to solve the Schrödinger equation for the
for play a critical role in determining reaction rates. Ion long-range attractive entrance channel potential in a cou-
chemistry at temperatures below 80 K is especially im- pled rotational-states formalism. A rotationally adiabatic
portant in elucidating the role of such quantum effects; approximation to this theory has been developed by con-
molecule formation in the interstellar medium is a particul- structing potential curves that describe the evolution of
P1: GSR/MAG P2: GLM/GKP QC: FXY Final Pages
Encyclopedia of Physical Science and Technology EN008I-352 June 29, 2001 12:38
isolated rotational states of molecular reagents into hin- v + = 3 and 4, agreement with the Langevin result is better.
dered rotor states of the ion–molecule complex. The re- At these levels of vibrational excitation, Franck–Condon
action probability of that reagent rotational state is unity factors make the charge transfer process less favorable,
when the collision energy lies above a particular adia- and the total cross section is more closely described by the
batic curve and zero when the curve rises above the col- N+2 + H2 long-range potential. Above a collision energy of
lision energy. The cross section is computed by summing 0.09 eV, the reactive cross section rises above the Lange-
over capture probabilities for the rotationally adiabatic vin value as endothermic charge transfer channels open.
curves. These theories yield improved agreement with ex- The most detailed probe of an ion–molecule reaction
perimental data on ion–dipole collision processes, at low is the state-to-state cross section. The measurement of
temperatures. angularly resolved state-to-state cross sections has been
achieved only for reactants in their ground state, but true
state-to-state total cross sections have now been measured
3. Vibrational-State Resolution
in a few favorable cases. A particularly novel method for
in Ion–Molecule Reactions
product state determination with reactant state selection
Since the pioneering experiments of Chupka and co- has been developed by Ng. In the differential reactivity
workers, which employed single-photon VUV ioniza- method, product vibrational states are distinguished by
tion of molecular hydrogen to prepare H+ 2 with a spe- their differing cross sections for charge transfer with se-
cific amount of vibrational excitation, the field of study lected molecules. The charge transfer reaction
of ion–molecule reaction dynamics with vibrationally
state-selected reactants has experienced a number of H+ + +
2 (v ) + H2 −→ H2 + H2 (v ), (48)
advances. Another important system in the history of in which H+ +
2 is prepared in v = 0 and 1 by VUV photoion-
ion–molecule reactions is the hydrogen atom transfer ization provides a particularly good example of its power.
process (47): The H+ 2 charge transfer products, formed in v = 0–3, are
N+ +
2 + H2 −→ N2 H + H. (47) first mass analyzed, accelerated to 10 eV, and then passed
through a collision cell containing N2 , Ar, or CO, where
A recent study of this system, in which the N+ reactant charge transfer occurs once again. The N+ + +
2 , Ar , or CO
2
is produced in the ground electronic state in a single ro- products are mass analyzed and the cross sections σm for
tational state (J = 2) and with vibrational quantum num- forming these ions by charge transfer from the H+ 2 reaction
ber v + varied between 0 and 4 has been performed by products are measured. The charge transfer cross sections
the group of Kompa. The reactant ions are produced by for H+2 with these gases have different dependences on v ,
(2 + 1) REMPI on nitrogen gas in a pulsed supersonic and therefore the gases can be used to probe the product
expansion. The spectroscopic method for ion production states of H+2 formed in the symmetric charge exchange
is a one-color scheme that employs high vibrational lev- reaction. Letting X v denote the fraction of H+ 2 formed in
els, v =10−14, of the two-photon allowed a 1 g state the vibrational state v , the following set of simultaneous
of N2 having a lifetime of 120 µsec, followed by absorp- equations can be solved for X v , since the cross sections
tion of a third photon to produce the desired N+ 2 reactant σv are known and the σm s are measured:
ions. This state-preparation scheme requires photons in the
wavelength range near 237 nm and is employed in con- X 0 + X 1 + X 2 + X 3 = 1, (49)
junction with octupole ion guides and a scattering gas cell
to determine total cross sections for chemical reaction of X 0 σ0 (N+ + +
2 ) + X 1 σ1 (N2 ) + X 2 σ2 (N2 )
N+ +
2 formed in vibrational states v = 0–4 over the trans-
lational energy range from 0.01 to 2 eV in the center of + X 3 σ3 (N+ +
2 ) = σm (N2 ), (50)
mass coordinates.
An unexpected result of this study was the observa- X 0 σ0 (Ar+ ) + X 1 σ1 (Ar+ ) + X 2 σ2 (Ar+ )
tion of a small activation barrier to reaction of magnitude
+ X 3 σ3 (Ar+ ) = σm (Ar+ ), (51)
0.01 eV. The data show a rise from this threshold and fol-
low an E −1/2 dependence over the collision energy range
from 0.02 to 0.09 eV. The cross sections appear to be larger X 0 σ0 (CO+ ) + X 1 σ1 (CO+ ) + X 2 σ2 (CO+ )
than the Langevin prediction, calculated with the polariz- + X 3 σ3 (CO+ ) = σm (CO+ ). (52)
ability of H2 , particularly for reactant ions with vibrational
excitation v + ≤ 2, and in fact are in much better agreement An extensive set of data at collision energies from 2 up
with the cross section computed for the attraction of the to 16 eV has been measured. When the reactant ions are
charge-transferred reactants H+ +
2 + N2 . For N2 ions with in the vibrational ground state, at low kinetic energies the
P1: GSR/MAG P2: GLM/GKP QC: FXY Final Pages
Encyclopedia of Physical Science and Technology EN008I-352 June 29, 2001 12:38
of umbrella excitation (denoted ν1 = 1, ν2 = 2, or 11 22 ), attenuation of ions with a particular mobility and therefore
with 0.63 eV of internal energy, and the state with no in a particular electronic state.
breathing excitation and five umbrella quanta, denoted The reaction of Fe+ with propane, C3 H8 , is a particu-
10 25 , with a total internal energy of 0.60 eV. Because larly interesting system with rich chemistry. In ion chro-
the two state preparations have essentially the same in- matography experiments, the Fe+ ions are prepared by
ternal energy, their differential reactivities can address the electron impact on Fe(CO)5 , and the electronic state dis-
question of whether the magnitude the internal energy is tribution is controlled with the ionizing electron beam en-
sufficient to effect a particular reaction or if the form of the ergy. The 6 D (4s3d6 ) ground state of Fe+ exhibits a high
internal motion in the reactants plays an important role. rate for formation of the adduct Fe+ · C3 H8 without addi-
All three of the reactive channels of NH+ 3 with ND3 are
tional reaction, while the 4 D (4s3d6 ) and 4 F (3d7 ) excited
direct processes that do not involve the intermediacy of a states exhibit subsequent elimination of H2 and CH4 . The
transient complex and, thus, represent good opportunities system is particularly interesting in that it is one of a very
for demonstrating mode-specific dynamics. A compari- few that have been studied with all three methods of elec-
son of the reactive cross section for the isoenergetic states tronic state preparation, and agreement among the differ-
11 22 and 10 25 shows that deuterium abstraction and charge ing studies is excellent.
transfer are enhanced, and proton transfer is suppressed
by the 10 25 excitation, i.e., the state with enhanced um-
brella excitation. In addition to comparing the reactivity
SEE ALSO THE FOLLOWING ARTICLES
of different isoenergetic vibrational state preparations, the
CHEMICAL KINETICS, EXPERIMENTATION • DIELECTRIC
study also compares the effect of vibration with equiva-
GASES • ION TRANSPORT ACROSS BIOLOGICAL MEM-
lent amounts of translational energy, demonstrating that
BRANES • KINETICS (CHEMISTRY) • MASS SPECTROM-
ν2 umbrella excitation is four times more effective in pro-
ETRY • POTENTIAL ENERGY SURFACES
moting deuterium transfer than an equivalent amount of
translational energy.
BIBLIOGRAPHY
5. Electronic State Selection
Armentrout, P. B. (1990). “Electronic state-specific transition metal ion
Electronic excitation of atomic ions provides numerous chemistry,” Annu. Rev. Phys. Chem. 41, 313.
opportunities to observe dramatic effects in both the over- Armentrout, P. B., and Baer, T. (1996). “Gas phase ion dynamics and
chemistry,” J. Phys. Chem. 100, 12866.
all rates of reaction of such species and the distributions of Baer, T., Ng, C. Y., and Powis, I. (eds.) (1996). “The Structure, Energetics,
products in these reactions. Three fundamental methods and Dynamics of Organic Ions,” Wiley, New York.
have been applied to the production of electronically state- Bowers, M. T. (ed.) (1979, 1980). “Gas Phase Ion Chemistry,” Vols. 1–3,
selected ions. The first method compares the reactivities of Academic, Press, New York.
different distributions of electronic states resulting from Cooks, R. G., Koskinen, J. T., and Thomas, P. D. (1999). “The kinetic
methods of making thermochemical determinations,” J. Mass Spec-
ions produced in different ways, e.g., electron impact ion- trom. 34, 85.
ization and thermal ionization. The second method uses Farrar, J. M. (1995). “Ion reaction dynamics,” Annu. Rev. Phys. Chem.
REMPI for atoms whose Rydberg states are unperturbed 46, 525.
by configuration interaction and autoionization. The Farrar, J. M., and Saunders, W. H., Jr. (eds.) (1988). “Techniques for the
final method, referred to as “ion chromatography,” uses Study of Ion-Molecule Reactions,” Wiley–Interscience, New York.
Lias, S. G., and Bartmess, J. E. (1997). “Gas-Phase Ion Thermochem-
the differing mobilities of metal ions with different elec- istry,” National Institute of Standards and Technology, Washington,
tronic configurations in He. Particularly large differences DC. http://webbook.nist.gov/chemistry/ion/.
are observed when vacancies are created in valence s or- Ng, C. Y., and Baer, M. (eds.) (1992). “State-Selected and State-to-State
bitals. By injecting a short pulse of ions into a helium-filled Ion-Molecule Reaction Dynamics, Part 1: Experiment,” Vol. LXXXII
cell with a uniform drift field, the ions separate spatially in Advances in Chemical Physics, Wiley–Interscience, New York.
Ng, C. Y., Baer, T., and Powis, T. (eds.) (1994). “Unimolecular and
and temporally as they diffuse through the cell. Electronic Bimolecular Ion-Molecule Reaction Dynamics,” Wiley, New York.
state-specific chemistry can be examined by introducing Thomson, J. J. (1913). “Rays of Positive Electricity,” Longmans Green,
small amounts of a reagent gas and observing the selective New York.
P1: ZCK/GJP P2: GLM Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
Kinetics(Chemistry)
Keith J. Laidler
University of Ottawa
Sometimes, but not always, it is possible to define a α + β + · · · , is known as the overall order and is often
quantity, known as the rate of reaction, which is inde- given the symbol n.
pendent of the reactants and products. This can only be A simple case is when the rate equation is given by
done if the reaction is of known stoichiometry; for some
v = k[A] (6)
reactions there are numerous minor products and the sto-
ichiometry is uncertain. Another condition for defining a An example of such a first-order reaction is the conversion
rate of reaction is that the stoichiometric equation must re- of cyclopropane into propylene
main the same throughout the course of reaction: for some
reactions intermediates are formed in significant amounts,
and the stoichiometry varies as the reaction proceeds. If
these two conditions are satisfied (i.e., if the stoichiometry
is known and is time independent), the rate of reaction is A second-order reaction can involve a single reactant;
given by any of the expressions that appear in Eq. (3). In
v = k[A]2 , (7)
other words, the rate of reaction is the rate of consump-
tion or formation divided by the appropriate coefficient or two reactants,
that appears in the stoichiometric equation. In the case of
v = k[A][B]. (8)
products these coefficients are called the stoichiometric
coefficients; in the case of reactants the stoichiometric co- In the latter case the partial orders are unity, and the overall
efficients are the negatives of the coefficients in the rate order is two. The process
equation. As seen in Eq. (3), this division has made the
H2 + I2 2HI
four rates equal to one another, so that the rate of reaction
is unique for the reaction under the particular conditions is an example, being second order in both directions. Re-
of the experiment. action orders are not necessarily integral; in the case of
It is important to note that rate of reaction, unlike rates the acetaldehyde decomposition, for example,
of consumption and formation, only has meaning with
CH3 CHO → CH4 + CO
reference to a specified stoichiometric equation for the
reaction. Thus, if the stoichiometric equation had been the order is :
3
2
written as
v = k[CH3 CHO]3/2 . (9)
3
2 A+ B→Y+ Z
1
2
1
2
This failure of the kinetics to correspond to the stoichiome-
the rate of reaction would be try indicates that the reaction occurs by a composite mech-
anism, the nature of which is considered in Section IX.
v = 23 vA = 2vB = vY = 2vZ (4)
The constant k that appears in the preceding equations
and would therefore be twice the rate specified with refer- is known as the rate constant. Its units depend on the order
ence to the first form of the equation. of reaction. Thus, if a reaction is first order [Eq. (6)] and
Note also that some kineticists define rates with refer- v is expressed as mol dm−3 s−1 and [A] as mol dm−3 ,
ence to the rates of change of amounts of substances rather the unit of k is s−1 . Similarly, for the second-order reac-
than their concentrations; the S.I. unit is then mol s−1 , there tion the unit of k is usually dm3 mol−1 s−1 . Just as rates
being no division by volume. of consumption and formation depend, in general, on the
In general, rates depend on concentrations of reactants reactant or product under consideration, so do the cor-
and sometimes on concentrations of products and other responding rate constants. The rate constant that derives
substances. For some reactions the rate of reaction v can from the rate of reaction, for a specified stoichiometric
be expressed by an equation of the form equation, is unique.
By no means do all equations have rate equations of the
v = k[A]α [B]β , (5)
type shown in Eq. (5). Sometimes reactant concentrations
where k, α, and β, are independent of concentration and appear in the denominator of the rate equation, and this is
time. Similar equations apply to rates of consumption and indicative of a composite mechanism.
formation. The exponent α is known as the order of reac- If a chemical reaction is sufficiently slow it is possible
tion with respect to A. The term order was introduced in to mix reactants together or to raise the temperature of the
1887 by F. W. Ostwald (1853–1932), but the concept of or- reaction system very rapidly so as to start the reaction,
der had been used in 1884 by J. H. van’t Hoff (1852–1911). and then to measure concentrations of reactants or prod-
Similarly the exponent β is the order with respect to ucts at various times. Such concentrations may be mea-
B. When there is more than one such order, as in this sured by chemical methods or by physical methods such
example, each one is called a partial order, and their sum, as spectrophotometry. Two procedures are then available
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
for determining orders of reactions and rate constants. One called catalysis, this name having been coined in 1836
method is the method of integration, which involves inte- by J. J. Berzelius (1779–1848). Catalysis can be classi-
grating the differential equation for the rate and obtaining fied as homogeneous catalysis (Section XI) in which only
expressions for concentrations of reactants and products one phase is involved, and as heterogeneous or surface
as a function of time. This integrated equation, which de- catalysis (Section VIII) in which the reaction occurs at
scribes the time course of the reaction, is then fitted to the an interface between phases. In biological systems catal-
experimental results. ysis is brought about by enzymes, which are proteins: the
A second method, the differential method, involves de- action is sometimes homogeneous and sometimes hetero-
termining rates from the slopes of concentration–time geneous. A special type of catalysis, brought about by the
curves. An analysis is made of the way in which the rates product of a reaction, is known as autocatalysis. For ex-
depend on the concentrations of reactants, and the order ample, ester hydrolysis is catalyzed by acids, which are
and rate constant can then be deduced. products of the reaction; the process may therefore first
For rapid chemical reactions special methods must be accelerate. The term catalyst is sometimes applied to a
used. For reactions that are not extremely rapid, flow substance that is consumed but which in other ways acts
methods are useful. Reactants may be mixed together like a catalyst: such substances, however, are better called
in specially designed vessels and then passed along a pseudocatalysts or activators. Mechanisms of catalysis are
tube, concentrations being measured at various positions considered in Sections VIII and XI.
by physical techniques such as spectrophotometry which An inhibitor is a substance that reduces the rate of a
allow instantaneous determinations to be made. A com- chemical reaction. Such substances were formerly called
monly used technique is the stopped-flow method, in negative catalysts, but this terminology is not recom-
which reactants are mixed and the flow suddenly stopped, mended since their action is quite different from that of a
so that the mixture is trapped in a vessel in which, by phys- true catalyst. Inhibitors, in fact, act either by interacting
ical methods, concentrations can be studied as a function with a catalyst and rendering it less effective or by remov-
of time. The results can be analyzed by suitable adapta- ing active intermediates such as free radicals. The term
tions of either the method of integration or the differential inhibitor is often applied to substances that are consumed
method. during the course of reaction. If a reaction in the absence
Some reactions, however, are so rapid that mixing can- of an inhibitor proceeds with a rate v0 , and in the presence
not be achieved sufficiently rapidly. For such reactions of inhibitor with rate v, the degree of inhibition is defined
the relaxation methods, such as the temperature-jump as
(T-jump) method may be used. They were developed in v0 − v v
1954 by M. Eigen (b. 1927). In this technique a reaction εi = =1− . (10)
v0 v0
system at equilibrium is subjected to a very rapid rise in
temperature which causes the equilibrium to shift (relax)
to a new position of equilibrium. Physical methods are II. MOLECULARITY
available for following the concentration changes during
this relaxation, and the results can be analyzed to give the Chemical reactions can be classified as either elementary
rate constants and the order of reaction. or composite. An elementary reaction is one which, as far
A quantity that has proved useful for comparing rates as can be determined, goes in a single stage; the reactants
of different reactions is the half-life, or half-period. The pass smoothly through an intermediate state and then be-
half-life of a given reactant is the time that it takes for come products. If a reaction is elementary no specific inter-
half of it to be consumed during the reaction. The way mediates can be detected or need to be postulated in order
in which the half-life depends on reactant concentrations to explain the kinetic behavior. Composite reactions, also
varies with the order of the reaction; for the special case of known as complex or stepwise reactions, occur in more
a first-order reaction there is no dependence of half-life on than one stage, and therefore involve two or more ele-
reactant concentration. For a reaction of any order the half- mentary reactions; they are considered in more detail in
life is inversely proportional to the rate constant; half-lives Section IX.
are therefore useful in giving an inverse measure of the rate For an elementary reaction, but not for a composite
of a reaction and can be used for comparing reactions of reaction, the term molecularity can be employed. The
different orders. molecularity of an elementary reaction is the number of
Sometimes the rate of a chemical reaction is affected reactant particles (atoms, molecules, free radicals, or ions)
by the addition of a substance that is not consumed in the that are involved in each individual chemical event. For
process and does not affect the equilibrium constant for example, the cyclopropane isomerization appears to be
the reaction. When the rate is increased in this way the elementary, in that each chemical act involves a single cy-
added substance is known as a catalyst, and the effect is clopropane molecule: the molecularity is unity, and the
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
reaction is said to be unimolecular. For the elementary resolved in favor of equations for which there was a useful
reaction physical interpretation.
One of the earliest equations relating the rate constant
Br + H2 → HBr + H,
k to the absolute temperature T was
the molecularity is two, and the reaction is said to be bi-
k = Ae DT or ln k = ln A + DT. (11)
molecular. A reaction having molecularity of three, such
as This equation was proposed in 1862 by M. Berthelot
2NO + Cl2 → 2NOCl (1827–1907). The parameters A and D are empirical quan-
tities, and this equation requires that a plot of ln k against
is said to be trimolecular. Reactions of higher molecu- the absolute temperature T will be linear, as is approxi-
larities are unknown: it is unlikely for more than three mately true in many cases. It is of interest that recently
molecules to come together in a single chemical event, (1982–1985) C. M. Hurd and co-workers have shown that
and instead the mechanism is composite. this equation is obeyed when a process occurs by quantum-
It is important to distinguish clearly between the molec- mechanical tunneling (see Section IV), and they have pro-
ularity and the order. The latter is a purely experimental vided an interpretation in terms of atomic vibrations.
quantity, which is concerned with how the rate depends In 1884 J. H. van’t Hoff presented the equation, derived
on reactant concentrations; the concept of order applies to from thermodynamics, for the temperature dependence of
some composite reactions. The molecularity of an elemen- an equilibrium constant K :
tary reaction, on the other hand, is arrived at by inference
d ln K H◦
from all of the evidence available about the reaction. One = , (12)
such piece of evidence is the order. If a reaction in the dT RT 2
gas phase appears to be elementary and has an order of where R is the gas constant and H ◦ the standard change
one, it is reasonable to conclude that it is unimolecular. in heat constant (enthalpy) during the overall process. He
However, as will be seen in Section VI, unimolecular gas then noted that an equilibrium constant is the ratio of rate
reactions become second order at low pressures, and it is constants for the reaction in forward and reverse direc-
therefore unsafe to conclude that a second-order gas reac- tions, his argument being as follows: If a reaction
tion is bimolecular: it may be a unimolecular reaction in
A+B Y+Z
its second-order region.
With reactions in solution the question arises as to is elementary in both directions, the rate from left to right
whether the solvent should be included in the molecularity. is k1 [A][B] and that from right to left is k−1 [Y][Z]. At
It is usually considered that it counts in the molecularity equilibrium the two rates are equal so that
if it enters into the overall reaction but not if it exerts only
[Y][Z] k1
an environmental effect. Thus, a process in aqueous solu- = = K. (13)
tion in which a compound was being hydrolyzed, with the [A][B] eq k−1
reaction of a water molecule with each solute molecule, This being so, van’t Hoff argued, k1 and k−1 must show the
would be regarded as bimolecular. The isomerization of same kind of temperature dependence as K , and therefore
cyclopropane in solution, however, is unimolecular. for a rate constant k
Similar conventions are applied to reactions on surfaces. d ln k E
If individual molecules undergo decomposition or isomer- = , (14)
dT RT 2
ization on a surface the reaction is usually described as
unimolecular, even though the surface atoms are also in- where E is some energy term. van’t Hoff did not as-
volved. This, however, would not be done if the material sume that E itself is temperature independent but consid-
of the surface entered into the final products. ered several possibilities. If E is temperature independent
Eq. (14) integrates to
E
III. TEMPERATURE AND ln k = ln A −
RT
REACTION RATES or
k = ze−E/RT . (17) face has been calculated. If it has not, the parameters can
usually be estimated sufficiently well to lead to useful ap-
A particularly clear formulation of this kind was presented proximate rate constants.
in 1918 by W. C. McC. Lewis (1885–1956), and on its basis The second procedure is to employ a thermodynamic
he calculated a value of the pre-exponential factor for the formulation. The equilibrium constant K ‡ can be ex-
reaction 2HI → H2 + I2 that was in excellent agreement ‡
pressed as e− G/RT , where ‡ G, known as the Gibbs en-
with experiment. ergy of activation, is the change in Gibbs energy when the
However, later work showed that rather large devia- reactants become activated complexes. The Gibbs energy
tions from experiment are obtained for reactions in which of activation can, in turn, be expressed as ‡ H − T ‡ S,
the reacting molecules are more complicated. This col- where ‡ H is the enthalpy of activation and ‡ S the en-
lision theory is evidently too simple and unlikely to be tropy of activation. The rate constant k can thus be formu-
generally reliable. One weakness is the assumption that lated as
molecules are hard spheres, which implies that any col- kT − ‡ G/RT
lision with sufficient energy will lead to reaction; if the k= e
h
molecules are more complicated, this is not the case. A
more fundamental objection to the treatment is that when KT ‡ S/R − ‡ H/RT
= e e . (19)
applied to forward and reverse reactions it cannot lead to h
an expression for the equilibrium constant that involves This equation is of the same form as Eq. (16); the factors
‡
the correct thermodynamic parameters. More recent work e− H/RT corresponds to e−E/RT , while the pre-exponential
‡
has involved a similar approach but has treated molecular factor (kT / h)e S/R shows dependence on temperature.
collisions in a more realistic and detailed way. ‡
However, H is not quite the same as the activation en-
The most successful of the later statistical treatments ergy E, but adjustments can easily be made for different
has been transition-state theory, first formulated simulta- types of reactions.
neously and independently in 1935 by H. Eyring (1901– Much effort has gone into testing the validity of the
1981) and by M. G. Evans (1904–1952) and M. Polanyi original formulation of transition-state theory (now often
(1891–1976). Transition-state theory treats the rates of referred to as conventional transition-state theory) and to
elementary reactions as if there were a special type of improving it or proposing alternative theories. Useful ex-
equilibrium, having an equilibrium constant K ‡ , between tensions of the theory have been suggested, some of them
reactants and activated complexes. The rate constant is involving locating the activated complex at a position other
then given by than at the col in the potential energy surface. Improve-
k = (kT / h)K ‡ , (18) ment has been achieved, but only at the cost of making very
extensive calculations and making assumptions the justi-
where k is the Boltzmann constant (the gas constant per fication for which is not always entirely clear. In addition
molecule) and h is the Planck constant. This assumption to these extensions of conventional transition-state theory
of equilibrium can be justified in most cases, but it breaks there have been some entirely different approaches, such
down for certain types of potential energy surfaces, and as the dynamical treatments to be considered in Section V.
particularly when the barrier is low so that the reaction is Imperfect as it is—as are all scientific theories—
rapid. An important and simplifying feature of transition- conventional transition-state theory is of importance in
state theory is that it focuses attention on the activated providing a conceptual framework with the aid of which
complex. How the system reaches the transition state and much insight is gained into how chemical reactions occur.
how it behaves after reaching the transition state, is not It is possible without even making any numerical calcu-
considered to affect the rate of reaction. lations to make qualitative predictions of many important
The equilibrium constant K ‡ can be treated in two kinetic effects. So far no alternative treatment has provided
different but equivalent ways. In the first procedure, the any such insight.
methods of statistical mechanics are used. Equilibrium Sometimes chemical reactions occur to some extent
constants can be calculated reliably in terms of partition by quantum-mechanical tunneling. Usually systems must
functions for the molecules involved, and the same can surmount the col in the potential energy surface (Fig. 3),
be done for the particular equilibrium constant K ‡ . What but quantum-mechanical theory allows the possibility of
are required are the masses of the reactant molecules and going from the initial state A + BC to the final state
the activated complex, their moments of inertia, and their AB + C without passage over the col. This arises when
vibrational frequencies. For the reactant molecules this is very light species, such as electrons and hydrogen atoms,
usually straightforward. These parameters are also known are involved in the reaction. For example, in a reaction
for an activated complex if a reliable potential energy sur- such as,
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
Tl2+ + Fe2+ → Fe3+ + Tl+ , the probability of reaction and d, the collision diameter, is
the distance between the centers of the reactant molecules
there is simply the transfer of an electron, which tunnels when they collide. In practice the reaction cross section,
through the potential energy barrier without going over it. given the symbol σ , is the quantity that has to be postu-
Tunneling is also important in a reaction such as lated in using collision theory to interpret calculated or
D + H2 → DH + H, experimental rates.
Much of the theoretical work in molecular dynamics has
where D is a deuterium atom. Here there is a transfer of a been based on potential energy surfaces that have been cal-
hydrogen atom H, and the reaction involves some passage culated for individual reactions. Dynamical calculations
over the energy barrier but also some tunneling. Since the are then carried out for various initial states of the re-
rate of tunneling increases much less with temperature acting molecules. For a reaction A + B—C, for example,
than the passage over the barrier, quantum-mechanical one chooses a particular vibrational and rotational state
tunneling is relatively more important at low temperatures, for the molecule B–C, and particular translational states
and it is at such temperatures that much of the evidence for A and B–C. Other details of the collision between the
for tunneling has been obtained. two are also selected. One then calculates, on the basis of
Quantum-mechanical tunneling is an important factor dynamics, the path that the system takes on the potential
that must be considered in connection with kinetic-isotope energy surface. A diagram or mathematical description
effects. Suppose, for example, that the reactions that describes the motion of a reaction system over a po-
D + H2 → DH + H tential energy surface is known as a trajectory. Ideally the
dynamical calculations are based on quantum mechanics,
and but this presents difficulty and more often classical calcu-
D + D2 → D2 + D lations are made; there is good reason to conclude that not
much error is then introduced. Even when the trajectories
are compared. The ratio of rate constants depends on cer- are obtained classically, the initial states of the reactants
tain factors that enter into the partition functions for the are usually selected on the basis of quantum theory.
reactants and activated complexes but also on the fact that A large number of trajectories have been calculated,
quantum-mechanical tunneling is important for the first and the results have provided considerable insight into
reaction but not for the second. The deuterium atom D how reactions proceed. Calculations by Karplus and co-
is twice as heavy as the H atom, and tunneling is quite workers on the reaction
unimportant when D, or any heavier atom, is transferred
H + H 2 → H2 + H
in a chemical process.
have revealed a number of interesting features. One is that
not all of the vibrational energy of H2 can contribute to
V. REACTION DYNAMICS allowing the system to surmount the potential energy bar-
rier. Another is that, as assumed in transition-state theory,
It is important to have knowledge, on the molecular or the system passes directly through the col and does not
microscopic level, of the elementary act that occurs dur- remain there and perform a number of vibrations. On the
ing the course of chemical change. Work in this field, basis of calculations made for a variety of initial condi-
referred to as-reaction dynamics or molecular dynamics, tions it was possible to compare calculated rates with those
deals with the intermolecular and intramolecular motions obtained from transition-state theory, and the agreement
that occur in a chemical reaction and the quantum states of was very good. For the reaction
the reactant and product molecules. There are two main
Br + H2 → HBr + H,
reasons for studying chemical dynamics. One is to test
the validity of the statistical theories that were outlined in the agreement is not as good. Other trajectory calculations
the preceding section. The other is that there are impor- have been made for potential energy surfaces that do not
tant applications (e.g., lasers) in which it is necessary to have the shape shown in Fig. 3 but instead have a basin at
have information about the energy states of products of the col. It is then found that the activated complexes are
reaction, information that is not provided by the statistical trapped for a short period in the basin, and perform several
theories. vibrations before they become products.
The results of dynamical calculations and experiments A number of trajectory calculations have been made,
are frequently expressed in terms of reaction cross sec- particularly by J. C. Polanyi (b. 1929) and co-workers,
tions rather than rate constants. The reaction cross section with the object of exploring how the slopes of potential
for a bimolecular reaction is defined as Pr πd 2 , where Pr is energy surfaces affect the transfer of energy during the
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
course of a chemical reaction. Three questions of particu- with each other. The primary purpose of these molecular-
lar importance are: beam studies is to determine the fate of the reactants and
products after the beams have come together.
1. If energy is released in a chemical reaction (i.e., if 2. Experiments carried out in bulk systems in which
the reaction is exothermic), what is the distribution of the the energy states of the reaction products are determined
energy among the translational, vibrational, and rotational by spectroscopic techniques. The reaction itself may oc-
states of the product molecules? cur in a static or flow system, and often the reactant
2. What type of energy is particularly effective in lead- molecules have been put into particular energy states. The
ing to a successful chemical reaction? term chemiluminescence is applied to such studies, since
3. If a collision occurs with substantially more than they are concerned with radiation emitted by the products.
enough energy to surmount the barrier, in what form is the
excess energy released? Ideally, in both types of investigation, the reactant
molecules are put into selected translational, vibrational,
The answers to all three questions depend on the form and rotational states, and the corresponding states of the
of the potential energy surface. In the schematic potential reaction products are determined. When this objective is
energy surface shown in Fig. 3 the energy level of the more or less achieved the expression state-to-state kinetics
products, A—B + C, is lower than that of the reactants is applied.
(i.e., the reaction is exothermic). Also, the surface was Molecular-beam studies have only been carried out to
drawn in such a way that at the activated state there is any extent since the 1950s, and the techniques have devel-
not much change in the B–C distance, but that the A–B oped considerably since that time. A particular advance
distance is considerably greater than it becomes in the has been the use of mass spectrometers for determining
product A–B. This type of surface is called an attractive the nature, speeds, and direction of the molecules after
or an early downhill surface. In the trajectory shown in the collision has occurred. Analysis of the results yields
Fig. 3 there is not much vibrational energy in the reactant detailed information about the distribution of angular mo-
B–C, but much more in the product A–B, and this is typical mentum among the reaction products, the reaction cross
of this type of surface. If, on the other hand, an exothermic sections, the quantum states of the product molecules, and
reaction has a late downhill surface, or a repulsive surface, the lifetime of the collision complex. Many reactions fall
the calculations showed that not so much of the energy into one of two main classes:
released passed into vibrational energy of A–B, most of it
becoming translational energy of A–B and C. 1. Reactions having high reaction cross sections (i.e.,
The answer to the second question can be appreciated occurring at relatively high rates) and in which the reaction
by considering the reverse processes. Suppose that the products are scattered forward with respect to the center
surface is as shown in Fig. 3 but that the reaction is of mass of the system. This type of process is said to occur
by a stripping mechanism.
A—B + C → A + B—C.
2. Reactions having small cross sections, in which there
The reverse of the trajectory shown in Fig. 3 therefore ap- is backward scattering of the reaction products. The mech-
plies, and the reaction absorbs energy (i.e., it is endother- anism is then said to be a rebound mechanism.
mic). Vibrational energy is now much more important than
translational energy in surmounting the barrier. If there is It has been found that reactions for which the potential en-
only translational energy the system is likely to bounce ergy surface is attractive (Fig. 3) tend to occur by a strip-
against the inner wall of the surface and return to the en- ping mechanism, while when it is repulsive there is a re-
trance valley. bound mechanism. The reasons for this have been worked
With regard to the third question, the disposition of out.
excess energy, there is a tendency for excess translational When a reaction occurs either by a stripping or a re-
energy to appear as translational energy in the products bound mechanism the lifetime of the activated complex
and for excess vibrational energy to appear as vibrational must be short; if it were long enough for rotation to occur
energy. This effect has been referred to as adiabaticity, and the products would separate in random directions and the
theoretical reasons for it have been proposed. reaction is then said to occur by an indirect or complex-
These conclusions have been amply confirmed by many mode mechanism. When the lifetime of the complex is
experimental results. Broadly speaking the experimental short, as in the stripping and rebound mechanisms, the
investigations fall into two main classes: expression direct or impulsive is used. The occurrence of
a reaction by a complex-mode mechanism appears to be
1. Experiments in which narrow beams of reactant associated with the existence of a basin in the potential
molecules, in preselected states, are brought into contact energy surface.
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
The chemiluminescence investigations of chemical re- ular entities having configurations between those of the
actions have as their origin some pioneering investigations reactants and products of a chemical reaction. The term
carried out in the 1920s and 1930s by M. Polanyi. In the transition species covers a much broader range of config-
course of these studies it was found, for example, that in urations than does the term activated complex, which is
the reaction defined as existing in an arbitrarily small region of space.
It has long been known that spectral lines are broadened
Cl + Na2 → NaCl + Na
when a gas is at higher pressures, and this was interpreted
a good deal of the energy liberated resides in the prod- in 1915 by H. A. Lorentz (1853–1928) as being due to
uct NaCl molecule in the form of vibrational energy, as collisions between the molecules. It was only in 1980,
indicated by the prime. These investigations have been however, that J. C. Polanyi and co-workers were success-
extended, particularly by J. C. Polanyi and co-workers, ful in detecting similar effects from molecules that were
who have introduced new experimental techniques and undergoing reactive collisions. For example, they studied
have studied reactions of a variety of types. It was these the reaction
investigations, together with parallel theoretical studies,
that led to the classification of different types of potential F + Na2 → F . . . Na . . . Na → NaF + Na∗
energy surfaces. The product Na∗ is in an electronically excited state and
Both the molecular-beam and chemiluminescence tech- emits the familiar yellow D-line. On both sides of this line
niques have been applied to the investigation of state-to- there was “wing” emission, and the evidence indicated
state kinetics. In a molecular-beam experiment the trans- that this was due to the transition species F . . . Na . . . Na.
lational energies of the reactant molecules can be well Techniques have now been developed for studying the
controlled, and by laser excitation the molecules can be course of chemical change in great detail. In particular,
put into particular vibrational and rotational states. In the femtosecond flash photolysis has been used to observe
chemiluminescence experiments the reactant molecules transition species, notably by Philip R. Brooks and by
are sometimes put into particular vibrational and rotational Ahmed H. Zewail.
states by forming them in prereactions.
State-to-state studies have been carried out only since
the early 1970s, but they have already revealed much im-
portant information. By and large they have confirmed VI. REACTIONS IN THE GAS PHASE
and amplified the previous theoretical and experimental
studies in reaction dynamics. One general result that has Reactions in the gas phase are in some respects easier
been obtained is that when reactions are substantially en- to understand than those in solution or on surfaces, and
dothermic, their barriers are usually late ones, so that they what is learned about gas reactions is valuable in leading
are enhanced more by vibrational than by translational en- to an understanding of the other types of processes. The
ergy. On the other hand, when reactions are only slightly most straightforward of gas reactions are those that are
endothermic or are exothermic, the barriers are generally elementary and bimolecular. There are few bimolecular
earlier, and translational energy then plays a more im- reactions involving molecules, as opposed to atoms and
portant role in leading to reaction. State-to-state studies free radicals, since most second-order reactions involving
have also led to some clarification of the influence of rota- molecules occur in more than one stage, i.e., they are
tional energy on reactivity, although the matter is still not composite. Such composite reactions are dealt with in
entirely clear. When the translational and vibrational ener- Section IX.
gies are held constant and the rotational quantum state of The prototype of all bimolecular reactions is the process
a reactant molecule is steadily increased, the rates some- H + H 2 → H2 + H
times first decrease and then increase. As rotational speeds
increase, the time that a molecule spends in a favorable The measurement of the rate of this reaction is made possi-
orientation decreases, and this factor leads to a decrease ble by the fact that H2 exists in two forms, designated ortho
in reaction probability. At higher rotational speeds, how- (o) and para ( p). To explain the kinetics of this reaction
ever, the preferred orientations are obtained again at short a large number of theoretical studies have been made. It
intervals during the approach of the reactants, and this can is now possible to calculate, by pure quantum mechanics,
give rise to higher rates as the rotational energies increase. an accurate potential energy surface for this reaction. For
Also, at the higher rotational speeds there is a greater pro- this particular reaction, which has a symmetrical potential
portion of energy in the form of rotational energy, and this energy surface, conventional transition-state theory is in
again can contribute to the effects observed. good agreement with the best of the dynamical treatments.
An important development has been the spectroscopic One matter of considerable interest is the kinetic isotope
detection of transition species, which are defined as molec- effect observed when H atoms are replaced by deuterium
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
(D) or tritium (T) atoms in this system. These kinetic iso- suggestion made in 1921 by F. A. Lindemann, later Lord
tope effects still cannot be treated completely satisfac- Cherwell (1886–1957).
torily, largely because the theory of quantum-mechanical Lindemann’s hypothesis, as later interpreted and devel-
tunneling is by no means satisfactory. The effects are, how- oped by C. N. Hinshelwood (1897–1967) and others, is as
ever, broadly understood in a semiquantitative way. follows. In a unimolecular process an energized molecule
Among other bimolecular gas reactions that have been A∗ is first formed by a collision between two molecules
extensively investigated are so-called abstraction or meta- of the reactant A:
thetical reactions involving atoms and free radicals. An
A+A A∗ + A
example is
The energization process is second order, the rate being
CH3 + C2 H6 → CH4 + C2 H5 .
k1 [A]2 , and the rate of the reverse de-energization process
In this process the free methyl radical CH3 abstracts a is k−1 [A∗ ][A]. The energized molecule A∗ may undergo
hydrogen atom from the ethane molecule. There has also de-energization, but it may also undergo a process in which
been much study, by mass spectrometric techniques, of the reaction products are formed:
bimolecular reactions involving ions, such as the reaction
A∗ → products
+ +
O + N2 → NO + N.
The rate of this first-order process is k2 [A∗ ].
For most such reactions conventional transition-state the- If the pressure is high enough the de-energization pro-
ory provides a useful, but not always precise, interpretation cess will be more rapid than the reaction to form products.
of the results. This being so, the energized molecule A∗ is essentially in
A few trimolecular reactions in the gas phase have also equilibrium with normal molecules, and its concentration
been investigated; examples are [A∗ ] is proportional to the first power of the concentra-
tion [A]. Since the rate is proportional to [A∗ ] it is also
2NO + Cl2 → 2NOCl
proportional to [A]; i.e., the kinetics are first order.
and At sufficiently low pressures, on the other hand, the
time between successive collisions between A∗ and A be-
2NO + O2 → 2NO2 .
comes longer than the time that elapses before A∗ can
The interpretation of these reactions was a considerable become products. The removal of A∗ molecules by chem-
triumph for conventional transition-state theory. Simple ical change then seriously diminishes the concentration of
collision theory proved unsatisfactory for trimolecular re- activated molecules. At extremely low pressures the en-
actions, owing to the difficulty of defining a collision be- ergized molecules A∗ almost inevitably become products,
tween three molecules, and usually led to very serious as there are few collisions to inactivate them. The rate of
overestimations (by several powers of ten) of the rate con- reaction is then equal to the rate of energization and is thus
stants. Similar difficulties are encountered with dynami- proportional to the square of the concentration.
cal treatments, and these have still not been satisfactorily If this Lindemann–Hinshelwood hypothesis is correct,
resolved. Conventional transition-state theory, by regard- unimolecular gas reactions should be first order at high
ing the activated complex as being in equilibrium with pressures and should become second order at low pres-
the reactants, leads to a very simple formulation of the sures. This behavior has now been confirmed for a large
rate constant and to values in good agreement with ex- number of reactions. In its original form the hypothesis
periment. It also very neatly explains the rather marked had some difficulty in interpreting results quantitatively,
negative temperature dependence of the pre-exponential but a number of extensions of the original hypothesis have
factors for these reactions. been made, notably by R. A. Marcus whose treatment is
The elucidation of unimolecular gas reactions proved consistent with transition-state theory.
to be much more difficult than that of bimolecular and
trimolecular reactions. Whereas the rates of the latter re-
actions can be interpreted in terms of collisions between VII. REACTIONS IN SOLUTION
two and three molecules respectively, no such treatment
appears at first sight to be possible for unimolecular reac- Reactions in solution or in the liquid phase are intrinsi-
tions. At one time it was widely held that collisions are not cally more difficult to understand than reactions in the gas
at all involved in unimolecular reactions, and that instead phase because of the complications arising from the effect
the reactions occur as a result of the absorption of radia- of the liquid. When a solvent is present there are two pos-
tion emitted by the vessel walls. However, this proved to sibilities: the solvent may exert purely an environmental
be incorrect, and the problem was resolved as a result of a effect on the reaction, or it may also enter into the reaction.
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
The latter case is exemplified by the hydrolysis of an or- the pre-exponential factors of such reactions. In water at
ganic compound in aqueous solution; water molecules are 25◦ C, for example, the pre-exponential factor of a reaction
chemically involved in the reaction. of the type A+ + B+ is apt to be about two orders of mag-
Some understanding of solvent effects has been nitude lower than for the corresponding reaction between
provided by comparisons of the same reaction in the gas neutral molecules. By contrast, for a reaction in aqueous
phase and in solution. Some reactions, however, do not solution between singly charged ions of opposite signs,
occur at all in the gas phase, and one must then be con- A+ + B− , the pre-exponential factor at 25◦ C is usually
tent with comparing their rates in different solvents. When about 102 greater than that between neutral molecules.
such comparisons are made it is sometimes found that the Similar considerations allow useful predictions to be
solvent does not have much effect on the rate. When, on the made about the effects or different solvents on reaction
other hand, ions are involved as reactants or products, sol- rates. If a reaction is of the type A+ + B+ the forces are
vents usually have a much greater effect on rates, because repulsive and will be smaller the higher the dielectric con-
of the rather strong electrostatic interactions between ions stant of the solvent. The reaction therefore proceeds more
and solvent molecules. rapidly in aqueous solution than in alcohol, since water
Theoretical studies have also contributed to an under- has a higher dielectric constant than alcohol and reduces
standing of reactions in solution. For reactions between the repulsions to a greater extent. When on the other hand
neutral species the frequencies of collisions are of the same the reaction is of the type A+ + B− , the rate will be greater
order of magnitude in solution and in the gas phase. How- in alcohol than in water, since water greatly reduces the at-
ever, there are important differences between the distribu- tractions between the reactants. Quantitative formulations
tion of collisions. When two molecules collide in solution of these effects have been worked out.
they are “caged” in by surrounding molecules and within Another matter of considerable importance in connec-
a very short period of time they are likely to undergo one tion with ionic reactions in solution is the effect on rates
or more additional collisions before they separate. Such a of the ionic strength. This property, introduced in 1921 by
set of collisions, occurring in rapid succession, is known G. N. Lewis (1875–1946), is defined as
as an encounter. This tendency for collisions in solution to 1 2
occur in sets has no effect on reactions involving an activa- I = z ci , (20)
2 i i
tion energy, because reaction may occur on any collision
within the set. However, atomic and free-radical combi- where ci is the concentration of each ion in solution and
nations do not involve an activation energy and occur on z i is its charge number (e.g., +1 for Na+ , −2 for SO−24 ):
every collision. They therefore occur at the first collision the summation is made over all of the ions present in
in the set, so that the remaining collisions in the encounter the solution. Qualitatively the ionic strength has the same
do not contribute to the rate. The pre-exponential factor kind of effect as the dielectric constant; increasing I re-
is therefore not related to the frequency of collisions but duces the forces between ions. Thus for a reaction of the
rather to the frequency of encounters. type A+ + B+ , increasing the ionic strength increases the
This cage effect, also known as the Franck– rate by reducing the repulsion. For a reaction A+ + B− ,
Rabinowitch effect, has other important consequences. In increasing the ionic strength decreases the rate by reduc-
a photochemical reaction in solution, for example, a pair of ing the attraction. These ideas were put into quantitative
free radicals produced initially may, owing to their being form in 1922–1924 by J. N. Brønsted (1879–1947), N. J.
caged in by the surrounding molecules, recombine before Bierrum (1879–1958), and J. A. Christiansen (1888–
they can separate from each other. This effect is known as 1969).
primary recombination. Special effects arise for a solution reaction that is
When reaction occurs between ions in solution some extremely rapid, in which case the rate may depend on the
rather special effects become important. These can be rate with which the reactant molecules diffuse through the
understood in an approximate but very useful way by solvent. Two effects are to be distinguished, macroscopic
focusing attention on the dielectric constant of the solvent. diffusion control and microscopic diffusion control. If a
The forces between charges, whether they are attractive rapid bimolecular reaction in solution is initiated by mix-
or repulsive, are inversely proportional to the dielectric ing solutions of the two reactants, the observed rate may
constant of the medium. The dielectric constant of wa- depend on the rate with which the solutions mix, and one
ter, for example, is about 78 at room temperature, and the then speaks of mixing control or macroscopic diffusion
force of repulsion between two ions A+ and B+ in wa- control.
1
ter is only 78 of that in the gas phase. Even in aqueous Even if this effect has been eliminated the rate of a
solution the forces are still strong enough to have a sub- reaction may be influenced by the rate with which the
stantial effect on the collision frequencies and therefore on reactant molecules diffuse towards each other. This effect
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
is known as microscopic diffusion control or encounter sorbed molecules, so that 1 − θ is the fraction that is not
control. If the measured rate is almost exactly equal to the covered. If [A] is the concentration of gas molecules, then
rate of diffusion one speaks of full diffusion control. An the rate of adsorption, which can only occur on bare sur-
example of this is provided by the combination of H+ and face, is ka [A](1 − θ ), where ka is a constant. The reverse
OH− ions in solution, a reaction that is so fast that the desorption process is a unimolecular process and its rate
rate is almost entirely controlled by the diffusion of the is kd θ. At equilibrium the two rates are the same so that
ions towards each other. For some reactions the rates of ka [A](1 − θ ) = kd θ (21)
chemical reaction and diffusion are similar to each other,
and one then speaks of partial diffusion control. from which it follows that
Quantitative formulations have been worked out for the K [A]
θ= , (22)
rate constants for both full and partial diffusion control. 1 + K [A]
Diffusion rates do not vary greatly from one system to where K is equal to ka /kd .
another and for uncharged reactants in aqueous solution Equation (22) leads at once to kinetic equations that
at 25◦ C diffusional rate constants are approximately 7 × apply to unimolecular processes on surfaces. The rate is
109 dm3 mol−1 s−1 . If the chemical rate constant is sub- proportional to θ and is therefore given by
stantially greater than this there is therefore appreciable
k K [A]
diffusion control. Most reactions, however, have much v= (23)
smaller rate constants because of an energy barrier to re- 1 + K [A]
action, and are therefore not affected by diffusion. For If [A] is sufficiently small that k[A] 1, this reduces to
reactions between ions the diffusion rates and chemical v = k K [A] (24)
rates are increased if the ions are of opposite signs, and
they are decreased if they are of the same sign. and the reaction is first order. If [A] is sufficiently large
that K [A] 1 the rate is
v=k (25)
VIII. REACTIONS ON SURFACES and is now independent of concentration; the order of
the reaction is thus zero. One example of a reaction that
Many reactions are affected by the surface of the vessel in changes from zero-order kinetics at high concentrations
which they occur, with an increase in rate. The effect is of- to first-order kinetics at low concentrations is the decom-
ten a catalytic one, the surface remaining unchanged, and position of ammonia on the surface of a metal such as
substances are often deliberately introduced into reaction tungsten or iron.
systems with the object of increasing the rates. Surface When two substances are undergoing reaction at a sur-
catalysis is of very great importance in technical work. face there are two possibilities. The reaction may be an
The fact that the enhanced rates are due to adsorption interaction between two molecules that are adsorbed side-
at the surface has long been known. An important ad- by-side on a surface, or it may be an interaction between
vance was made in 1916 by I. Langmuir (1881–1957) who an adsorbed molecule of one kind and a gas molecule of
showed that in many cases of adsorption the gas molecules the other. The kinetics are satisfactorily interpreted by an
are held to the surface by bonds of the same character as extension of Langmuir’s isotherm [Eq. (22)], which for
covalent chemical bonds, and the term chemisorption has two gases A and B gives the following expressions for the
been applied to this type of adsorption. Langmuir also de- fractions of surface covered by A and B respectively:
veloped adsorption isotherms that relate the fraction of
K A [A]
surface covered to the pressure or concentration of a gas. θA = (26)
Later H. S. Taylor (1890–1974) emphasized the fact 1 + K A [A] + K B [B]
that surfaces are never smooth on the atomic scale, and K B [B]
θB = (27)
that surface sites are therefore of variable activity. Certain 1 + K A [A] + K B [B]
sites, which he called active centers, are particularly active Reactions involving interaction between two adsorbed
and it is on these that catalysis occurs for the most part. molecules are said to occur by Langmuir–Hinshelwood
Taylor also showed that the process of chemisorption itself mechanisms, and their rates are proportional to the product
is accompanied by an activation energy, and he referred θA θB :
to this type of adsorption as activated adsorption.
k K A K B [A][B]
The kinetic equations that apply to reactions on sur- v = kθA θB = (28)
faces are given to a good approximation on the basis of (1 + K A [A] + K B [B])2
the Langmuir isotherm, which can be derived as follows. If either concentration is held constant and the other
Let θ be the fraction of a surface that is covered by ad- one increased, the rate passes through a maximum. The
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
most favorable situation is for equal numbers of A and B described as composite, complex, or stepwise. One indica-
molecules to be on the surface, since then the number of tion that a reaction is composite is that the kinetic equation
A–B pairs is a maximum. If the concentration of either does not correspond to the stoichiometric equation. A sim-
is then increased, the other is displaced from the surface, ple example is the gas-phase reaction between nitric oxide
and the number of A–B pairs diminishes. and hydrogen, the stoichiometric equation for which is
Mechanisms in which a gas molecule reacts with
2NO + 2H2 → N2 + 2H2 O
an adsorbed molecule of the other kind are known as
Langmuir–Rideal mechanisms. If, for example, the inter- If the process were elementary it would be second order in
action is between gaseous A and adsorbed B the rate is NO and second order in H2 , with an overall order of four.
proportional to [A]θB and is therefore In fact, the rate is proportional to [NO]2 [H2 ]. The reason
k K B [A][B] is that the reaction occurs in two steps,
v= (29)
1 + K A [A] + K B [B] 2NO + H2 → N2 + H2 O2
There is therefore no longer a maximum in the rate, which H2 O2 + H2 → 2H2 O
now approaches a limiting value if the concentration of
one reactant is increased with the other held constant. The second reaction is rapid compared with the first, so
This difference in kinetic behavior allows a discrimination that the overall rate is that of the first reaction and is pro-
between the two mechanisms, both of which sometimes portional to [NO]2 [H2 ].
occur simultaneously. Because reactions can occur in steps, the order of a
Since about 1970 there have been many important reaction is often less than corresponds to the stoichiome-
advances in experimental techniques for the study of try. A reactive collision between three molecules is much
surfaces and adsorbed layers of molecules on surfaces. less likely than one between two, and one between four
Techniques that have been particularly valuable in the molecules is exceedingly unlikely; indeed it is doubt-
investigation of solid surfaces are field-ion microscopy ful whether elementary reactions between four molecules
(FIM) and low-energy electron diffraction (LEED). These ever occur. A reaction whose stoichiometric equation in-
techniques have shown that surfaces have different types volves more than three molecules always proceeds more
of surface sites, such as atoms at terraces, atoms at steps, rapidly by two or more processes of lower molecularity.
atoms at kinks, and adatoms which project out of the Even if the kinetics of a reaction does correspond to the
surfaces. stoichiometry it may still be the case that the mechanism
A technique that has provided valuable information is composite. For example, the reaction
about adsorbed films is infrared spectroscopy. As an ex- H2 + I2 → 2HI
ample of its use we mention some work on the adsorption and the reverse decomposition are both second order, and
of ethylene (C2 H4 ) on various surfaces. It has been found the processes were formerly thought to occur in a single
that the manner in which ethylene is adsorbed depends stage. Later work, however, has shown that reactions in-
on the availability of hydrogen. If no hydrogen is avail- volving I atoms are also involved. The decomposition of
able the adsorption tends to be of the dissociative type; ethane,
the ethylene splits into C2 H2 and 2H, all of which are
adsorbed separately. With hydrogen present, however, ad- C2 H6 → C2 H4 + H2 ,
sorbed H atoms add on to C2 H4 and ethyl radicals (C2 H5 ) is first order and was long thought to be elementary; how-
become attached to the surface. Evidence of this kind is of ever it occurs entirely by a free-radical mechanism, to be
great importance in understanding the kinetics and mech- considered later.
anisms of the hydrogenation of ethylene and many other Composite mechanisms can be classified in a number
processes. of different ways. First, there is the usual classification
Reactions on surfaces frequently undergo inhibition by according to whether the process occurs in the gas phase,
added substances, which themselves become adsorbed on the liquid phase, or on a surface. Another classification is
the surface and reduce its catalytic activity. This effect is according to whether the process is thermal, photochemi-
commonly referred to as poisoning of the surface. cal, or radiation chemical. A thermal reaction occurs sim-
ply by virtue of the heat energy present in the system;
molecules can pick up more energy on colliding with other
IX. COMPOSITE REACTION MECHANISMS molecules, and eventually an energetic collision will lead
to reaction. Photochemical and radiation-chemical reac-
Many of the reactions familiar to the chemist occur in tions are brought about as a result of radiation absorbed
more than one stage, and their mechanisms are therefore in the system.
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
(4) 2CH3 → C2 H6 . O + H2 → OH + H.
The radical CHO produced in reaction (1), the initiation In each of these reactions two carriers have been formed
reaction, breaks down into CO + H, and H atoms combine from one; if the two reactions are added together the result
to form H2 which is found as a minor product. The C2 H6 is
formed in the termination reaction is also a minor product. H + O2 + H2 → 2OH + H,
Reactions (2) and (3) are chain-propagating steps. The
steady-state equations are now, for CH3 : so that the H atom is regenerated but has produced two
OH radicals which can undergo further reactions. When
k1 [CH3 CHO] − k2 [CH3 ][CH3 CHO] + k3 [CH3 CO]
such chain branching occurs the number of chain carriers
−k4 [CH3 ]2 = 0 (35) in the reaction can increase extremely rapidly. The rate of
reaction thus increases, and the result may be an explosion.
and for CH3 CO:
Mixtures of oxygen with substances such as hydrogen,
k2 [CH3 ][CH3 CHO] − k3 [CH3 CO] = 0. (36) carbon monoxide, and various organic substances do in
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
fact explode under certain conditions, and the reason is In order for a photochemical reaction to occur the radi-
that chain branching is involved. ation must be absorbed, and with the advent of the quan-
Reactions in which substances react together with the tum theory it became possible to understand the relation-
formation of much larger molecules known as polymers ship between the amount of radiation absorbed and the
are frequently chain reactions. Thus, a substance contain- extent of the chemical change that occurs. It was first re-
ing a carbon–carbon double bond may be caused to poly- alized by A. Einstein (1879–1955) that electromagnetic
merize by introduction of a free radical R which adds on to radiation can be regarded as a beam of particles, which
the double-bonded compound to form another free radical: G. N. Lewis (1875–1940) later called photons: each of
these particles has an energy equal to hν, where ν is the
frequency of the radiation and h is the Planck constant.
In 1911 J. Stark (1874–1957) and independently in 1912
Einstein proposed that one photon of radiation is absorbed
by one molecule. This relationship, usually referred to
as Einstein’s Law of Photochemical Equivalence, applies
The resulting free radical adds on to another monomer
satisfactorily to electromagnetic radiation of ordinary in-
molecule and the process continues with the eventual for-
tensities but fails for lasers of very high intensity. The
mation of a much larger molecule.
lifetime of a molecule that has absorbed a photon is usu-
ally less than about 10−8 sec, and with ordinary radiation it
is unlikely for a molecule that has absorbed one photon to
X. PHOTOCHEMICAL AND absorb another before it has become deactivated. In these
RADIATION-CHEMICAL REACTIONS circumstances there is therefore a one-to-one relationship
between the number of photons absorbed and the number
There are two kinds of radiation, electromagnetic and par- of excited molecules produced. Because of the high in-
ticle. The former behaves in some experiments as if it were tensity of lasers, however, a molecule sometimes absorbs
a beam of photons, but these have no mass and can be re- two or more photons, and one then speaks of multiphoton
garded as packets of energy. Particle radiation, on the other excitation.
hand, consists of beams of particles having mass. Cath- Even with ordinary radiation it may appear that the law
ode rays and β radiation, for example, consist of beams of photochemical equivalence is not obeyed, and this arises
of electrons, and α radiation consists of a beam of helium because of two factors. One is that a molecule that has ab-
nuclei. Particle-generating machines such as cyclotrons sorbed a photon may become inactivated before it has had
produce beams of protons, deuterons, and other particles. time to enter into reaction. When this alone is the case
All of these types of radiation are capable of bring- the ratio of the number of molecules undergoing reaction
ing about chemical reaction provided that the energies are to the number of photons absorbed (a quantity known as
sufficiently high. Reactions induced in this way are re- quantum yield or photon yield) is less than unity. The sec-
ferred to as either photo-chemical or radiation-chemical ond factor is that the reaction may occur by a composite
reactions. The distinction between the two types is not mechanism. For example, in the photochemical decompo-
sharp and is sometimes made on the basis of whether ions sition of hydrogen iodide (HI) into hydrogen and iodine,
are produced in the reaction. With radiation of lower en- the quantum yield is 2; that is, one photon brings about
ergies, such as electromagnetic radiation in the visible the decomposition of two molecules of hydrogen iodide.
and near-ultraviolet regions of the spectrum, there is no The reason is that the mechanism is
ion formation, and the resulting reaction is referred to as
photo-chemical. With ultraviolet radiation of higher fre- (1) HI + hν → H + I
quencies, with X-rays and γ -rays, and with high-energy (2) H + HI → H2 + I
particle radiation, ions are usually formed, and the pro-
cess is then described as radiation-chemical.1 Sometimes (3) I + I → I2 .
a distinction is based on specificity. With radiation of lower In the first step one photon, designated by its energy hν,
energy the resulting reaction is often of simple stoichiom- breaks apart an HI molecule, and the H atom produced
etry and can then be called photochemical. Higher-energy interacts with a second HI molecule [reaction (2)]. The
radiation, however, breaks up molecules into a number of sum of reactions (1), (2), and (3) is
fragments which lead to a variety of products; the term
radiation-chemical can then be applied. 2HI + hν → H2 + I2 ,
1 They should not be called “radiochemical,” which would cause which explains the quantum yield of 2.
confusion with radiochemistry, which is concerned with radioactive This mechanism is not a chain reaction, since no cy-
substances. cle of reactions is repeated. When chain processes are
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
involved the quantum yields may be very large. In the ions, and free radicals which undergo further reactions.
photochemical reaction between hydrogen and chlorine, For example, if hydrogen is irradiated with α particles
quantum yields of over 106 have been reported, and it was the overall primary process is largely the production of
this fact that led W. Nernst (1864–1941) to propose, in hydrogen atoms, and the process is usually written as
1918, the following chain mechanism for the reaction: α
H2 2H.
(1) Cl2 + hν → 2Cl
Several elementary processes lead to this dissociation. One
(2) Cl + H2 → HCl + H is the ejection of an electron by the α particle, the hydro-
gen molecule becoming an H+ 2 ion which dissociates into
(3) H + Cl2 → HCl + Cl
H + H+ :
(4) 2Cl → Cl2 . α + H 2 → H+ −
2 +α+e
Reactions (2) and (3) are chain-propagating steps, and H+ +
2 → H + H.
since they occur rapidly they are repeated many times,
leading to high quantum yields. The H+ ion produced is then likely to pick up an electron
One important photochemical technique is flash photol- with the formation of a hydrogen atom.
ysis, in which an intense flash of radiation of very short du- In radiation chemistry the analog of flash photolysis is
ration initiates a reaction by producing excited molecules, pulse radiolysis. For example, linear electron accelerators
atoms, and free radicals. The method, first used in 1950 by have been used to give pulses of very high energy with
G. Porter and R. G. W. Norrish (1887–1978), has been ap- durations of a few microseconds, and with special tech-
plied extensively to the study of fast reactions, both in the niques much shorter pulses, of the order of a nanosecond
gas phase and in solution. Extremely rapid processes can (10−9 sec), have been achieved.
be studied by the use of pulsed lasers, which can have a The term chemiluminescence is used to describe the
duration of less than 1 psec (10−12 sec). This is sufficiently radiation emitted as a result of a chemical reaction.
short to allow the study of the fastest of chemical reactions, Some of the radiation emitted by flames is produced
and even molecular relaxation process in which molecules in this way, although some of it is blackbody radiation
are changing their vibrational and rotational states. resulting from the high temperature of the burnt gases.
Some molecules do not absorb radiation at convenient The radiation emitted by fireflies and by certain tropical
wavelengths, and the technique of photosensitization is fish is chemiluminescence.
then useful. This term applies to the absorption of radia-
tion by a substance known as a photosensitizer, which then
transfers energy to a colliding molecule, causing it to un- XI. HOMOGENEOUS CATALYSIS
dergo chemical change. For example, hydrogen does not
absorb in the near ultraviolet, but mercury vapor does so at Catalysis at surfaces, or heterogeneous catalysis, has been
a wavelength of 253.7 nm which corresponds to an energy outlined in Section VIII, and the present section deals with
of 469.4 kJ mol−1 . If hydrogen saturated with mercury homogeneous catalysis, where only one phase is involved.
vapor is irradiated at this wavelength, the mercury atoms, Substances whose reactions undergo homogeneous
normally in a 61 S0 state, are excited to the 63 P1 state: catalysis are commonly known as substrates. There are
several different types of homogeneous catalysis. Exam-
Hg 61 S0 + hν → Hg 63 P1 . ples in the gas phase sometimes involve chain mecha-
On collision with a hydrogen molecule an excited Hg nisms. Reactions in solution are commonly catalyzed by
atom can bring about dissociation into atoms: acids and bases, and many reactions in biological systems
are catalyzed by enzymes, which are proteins. Reactions
Hg 63 P1 + H2 → Hg 61 S0 + 2H. in aqueous solution are often catalyzed by ions of variable
valency.
The dissociation of a mole of H2 atoms requires 432 kJ
No single mechanistic pattern applies to all cases of
of energy, so that the excited Hg atoms have more than
catalysis, but several different types of catalysis occur ac-
enough energy for this process to occur. Other molecules,
cording to the following scheme:
such as hydrocarbons, can be decomposed by similar
photosensitization processes. C+S X+Y
Radiation-chemical reactions have mechanisms simi-
X + W → P + Z.
lar to photochemical reactions, the difference being in
the nature of the radiation-chemical primary process. Un- Here C is the catalyst and S the substrate, while X is a
like photochemical primary processes, these processes are reaction intermediate which forms the product P in the
usually composite, leading in a series of steps to atoms, second step. The species Y, W, and Z undergo further
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
processes that do not affect the kinetics. In the case of The protonated acetone then gives up a different proton to
a unimolecular surface-catalyzed reaction the scheme is a basic species B present in solution:
simplified to:
C+S X
X → P.
The intermediate X is now an addition compound formed
from C and S, and in the second step it forms products. This is a very common pattern in acid catalysis. In the first
This scheme also applies to many reactions catalyzed by step the acid transfers a proton to the substrate, which in
enzymes. For reactions catalyzed by acids and bases, how- the second step transfers another proton, the product being
ever, it is necessary to include the additional species Y, W, formed either simultaneously or in a subsequent step. In
and Z. basic catalysis (which also occurs with the iodination of
Suppose, for example, that the reaction of a substrate S acetone), the substrate molecule first transfers a proton to
is being catalyzed by an acid HA. A typical mechanism is the basic catalyst, and in a second step accepts a proton at
another position.
HA + S SH+ + A−
Sometimes when reactions are catalyzed by acids and
SH+ + B → P + BH+ . bases there is little sign of any effect other than catalysis
by hydrated hydrogen ions (usually written as H3 O+ ) or
In the first step a proton H+ is transferred to the substrate,
by hydroxide ions. One then speaks of specific acid–base
with the formation of the intermediate SH+ . In the second
catalysis, and the rate of reaction might be of the form
step this intermediate SH+ transfers a proton to a basic
species B (which may be identical with A− or may be a v = k0 [S] + kH+ [S] H3 O+ + kOH− [S][OH− ]. (39)
water molecule), and in this step the product P is formed.
The coefficient k0 relates to any uncatalyzed reaction, and
For example, the reaction between acetone and iodine,
kH+ and kOH− are catalytic coefficients.
CH3 COCH3 + I3 → CH3 COCH2 I + HI, It is sometimes found that acidic species other than hy-
drogen ions and basic species other than hydroxide ions
is catalyzed by acids, and the rate is proportional to the ace- are capable of catalyzing reactions. One then speaks of
tone concentration and also to the concentration of acid. general acid–base catalysis. For example, in an aqueous
The rate is, however, independent of the concentration of solution containing acetic acid and sodium acetate the rate
iodine, and indeed the rate remains the same if iodine is coefficient k(=v/[S]) can be expressed as
replaced by bromine. This suggests that the slow and rate-
determining step does not involve iodine at all, but is the k = k0 + kH+ H3 O+ + kOH− [OH− ]
conversion of the ordinary form of acetone (known as the + kHA [HA] + kA− [A− ], (40)
keto form) into another form (the enol form) which can
react with iodine or bromine very rapidly: where HA is acetic acid and kHA the corresponding cat-
alytic coefficient; A− is the acetate ion and kA− its catalytic
coefficient.
If an acid has a large dissociation constant, for its dis-
sociation into ions, it tends to have a large catalytic coef-
ficient; similarly the catalytic strength of a base is greater
the larger the base dissociation constant. In 1924 J. N.
Brønsted proposed the following relationship between ka ,
the catalytic coefficient of an acid, and K a , its acid disso-
ciation constant:
The way in which an acid HA catalyzes the formation ka = G a K aα , (41)
of the enol form is believed to be as follows. First the
acid transfers a proton to the oxygen atom on the acetone where G a and α are constants. A similar relationship ap-
molecule: plies to a base catalyst.
Catalysis by enzymes, the biological catalysts, is much
more specific than that by acids and bases. Some enzymes
show absolute specificity, in that they are only known to be
able to catalyze a single chemical reaction. Others show
a lower specificity, in that they are able to catalyze only a
P1: ZCK/GJP P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008C-358 June 29, 2001 12:55
the enzyme, and EA then reacts with the other substrate It is almost always the case that a catalyst exerts its
B to form a ternary complex EAB which then gives prod- action by reducing the activation energy, which means
ucts: this is the ordered ternary complex mechanism. Al- that it introduces the possibility of an alternative reaction
ternatively, in the random ternary complex mechanism the path involving less energy and therefore a higher rate of
ternary complex EAB may be formed either via EA or reaction.
EB. In another mechanism the reaction of enzyme with
one substrate A leads to one product of reaction, together
with an intermediate which reacts with B to give another SEE ALSO THE FOLLOWING ARTICLES
product. All of these mechanisms lead to similar types
of kinetic behavior, and discrimination between them re- ATOMIC AND MOLECULAR COLLISIONS • CATALYSIS,
quires careful investigation. INDUSTRIAL • CHEMICAL KINETICS, EXPERIMENTATION
In recent years there has been much interest in immobi- • ION KINETICS AND ENERGETICS • LASERS • MULTI-
lized enzymes, where the enzyme has been attached to a PHOTON SPECTROSCOPY • PHARMACOKINETICS • PHO-
solid support. This has the advantage in technical work of TOCHEMISTRY, MOLECULAR • POTENTIAL ENERGY SUR-
allowing an enzyme preparation to be used many times. FACES • RADIATION PHYSICS • STATISTICAL MECHANICS
The study of the kinetics of immobilized enzymes is also
valuable in leading to an understanding of how enzymes
behave in living systems, where they are often immobi-
lized. Kinetic studies have shown that in such systems the BIBLIOGRAPHY
rate is sometimes diffusion controlled (Section VII).
Other types of homogeneous catalysis do not follow Bamford, C. H., and Tipper, C. F. H., eds. (1969). “Comprehensive Chem-
a general pattern. Sometimes, for example, substances ical Kinetics,” Elsevier Publishing Co., Amsterdam. This series of
many volumes, begun in 1969 and still appearing, is a valuable source
bring about catalysis in gas reactions by bringing about of reference to all aspects of chemical kinetics.
chain-initiation processes, or by becoming involved in Laidler, K. J. (1987). “Chemical Kinetics,” 3rd ed., Harper & Row, New
chain-propagating processes. Ions of variable valency are York.
good catalysts for certain solution reactions. For example, Laidler, K. J. (1996). “Glossary of Terms used in Chemical Kinetics,
singly charged silver ions (Ag+ ) can bring about a reduc- Including Reaction Dynamics,” Pure Appli. Chem., 68, 149.
Pilling, M. J., and Seakins, P. W. (1995). “Reaction Kinetics,” 2nd edi.,
tion with the formation of Ag2+ , which then can bring Oxford University Press, Oxford, UK.
about an oxidation; the Ag+ − Ag2+ system can therefore Zewail, A. H. (1990). “The birth of molecules,” Scientific American,
act as a mediator in oxidation–reduction systems. November, p. 76.
P1: GNH/GLT P2: GNH Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN009B-418 July 31, 2001 18:58
Membranes, Synthetic
(Chemistry)
Michael E. Starzak
State University of New York, Binghamton
I. Membranes
II. Membrane Modifiers
III. Membrane Characterization
IV. Membrane Phenomena
GLOSSARY either of the two bulk phases, its utility often arises from its
selectivity, i.e., its ability to facilitate the transport of spe-
Amphipathic Having both hydrophilic and hydrophobic cific materials between the two bulk phases. The net flux
regions in distinct regions of the same molecule. through the membrane is characterized by a permeability
Ionophore An organic molecule that binds a polar ion to coefficient P with units of velocity. The net flux of a ma-
create a membrane soluble complex. terial through the membrane is often proportional to this
Casting Solvent is evaporated from solvated membrane permeability coefficient and the concentration gradient of
material on a support to produce a membrane. permeant between the two bulk phases.
Permeability coefficient The proportionality factor with Naturally occurring materials like β-alumina can con-
units of velocity that relates concentration gradient to duct ions like Na+ , Ag+ and H+ , and water. Sections of
flux through a membrane. this material can then be used as membranes to regulate
Selectivity Membrane ability to pass certain species the flow of these ions without modification of the basic
while rejecting others. solid structure. More often, the goal is the preparation of
Ultrafiltration Water permeation through a membrane— synthetic membranes with specific properties that permit
reverse osmosis. the membrane to be used in a particular application. For
example, a synthetic membrane permeable only to water
would be useful for reverse osmosis where pure water is
A MEMBRANE is a special structure or phase that sep- separated from an aqueous solution. The state of the art in
arates two macroscopic phases. The membrane normally membrane research is often defined by methods of prepa-
forms a thin barrier between the macroscopic phases so ration that give a membrane with the qualities desired.
that its total volume is small relative to the separated A membrane with few desirable permeability proper-
phases. Although the membrane is generally insoluble in ties can also be modified by adding molecules that impart
345
P1: GNH/GLT P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN009B-418 July 6, 2001 20:50
the proper selectivity or permeability. The membrane sur- surface layer toward a high pressure limit where the hy-
face might be modified to produce a useful membrane. drocarbon chains are forced parallel and normal to the
Special molecules might be added to or formed with the surface. This more ordered monolayer has some of the
membrane to produce the desired properties. For example, properties of a liquid-crystal. The hydrocarbon chains are
special protein channels can be inserted into membranes parallel but observation of the top surface of the monolayer
to produce a membrane selective to a specific ion. The would reveal a random liquidlike structure.
membrane is then permeable only to this species. In most This ordered monolayer is used as an air/solution bar-
cases, the ultimate goal is the development of a selective rier or as a medium for selective extraction of organic
membrane optimal permeability or adsorption. material from the aqueous phase. The ordered mono-
Membranes are available for a large number of research layer also serves as the building block multilayer mem-
and industrial applications including gas separation, fuel branes. A glass slide or other support is passed through
cells, reverse osmosis, dialysis, sensors, and purification. the monolayer. With each pass, a monolayer membrane is
They also serve as the support for special processes like the transferred from the surface layer to the support. If done
oxidation reduction processes in photosynthesis or cell- properly, this monolayer is stable and is not disturbed as
cell communication. additional monolayers are added with additional passes
The synthesis and production of membranes with struc- through the ordered surface monolayer.
tures that control material flow are presented in Section I. A biological membrane separates two aqueous phases.
Section II surveys special molecules incorporated into The outer surfaces of the membrane must be hydrophilic
membranes to impart specific properties to these mem- to stabilize the two membrane/solution interfaces. This is
branes. Section III describes some techniques and instru- accomplished with a bilayer membrane, in which the or-
mentation used to characterize membranes and pore struc- dered hydrocarbon chains from two monolayers merge to
ture. Section IV illustrates some special equilibrium and form a stable central hydrocarbon region approximately
kinetic physical phenomena most conveniently observed twice the length of the hydrocarbon chains. The opposed
with membrane systems. polar head groups form outer hydrophilic surfaces to sta-
bilize the structure and the solution/membrane interfaces
(Fig. 1).
I. MEMBRANES
The bilayer structure, which is the most common mem-
brane in biological systems, was originally revealed by
A. Monolayer Membranes
experiments that first determined the total membrane area
Biological systems are protected by membranes with in a specific number of red blood cells. The cells were
molecular structures that serve as models to illustrate then destroyed, phosopholipid was separated and spread
the structure and behavior of more complicated mem- on a surface balance. The area occupied by the lipid on the
branes. Monolayer membranes are made from amphi- surface balance was exactly twice the total surface area of
pathic molecules that have a polar, water-soluble, or hy- the red blood cells.
drophilic terminal region and a hydrocarbon, nonpolar or A lipid or soap membrane of large area is inherently
hydrophobic, region that usually contains long hydrocar- unstable. A soap bubble is a good example. Research
bon chains whose length dictates membrane thickness and membranes are often formed over very small pinholes
stability. Hydrocarbon chains of 16–20 carbons generate by applying solutions of the membrane material using a
a self-stabilizing hydrocarbon phase that is insoluble in
water. Soap molecules with a weak acid terminal group or
phospholipids with two hydrocarbon chains and a polar
phosphate-centered head group united on a glycerol base
can spontaneously form membranes.
If these amphipathic molecules are spread on a water
surface, the polar groups dissolve in water to “root” the
molecules. Since the polar heads are insoluble in hydro-
carbon, the molecules normally spread to form a mono-
layer membrane on the surface of the water. In this state,
the long hydrocarbon chains can be parallel to the wa-
FIGURE 1 Ion channels in the plasma membrane. The mem-
ter surface to create an extended, random monolayer. A
brane phospholipids are arranged in a bimolecular layers with their
more-ordered, liquid crystalline monolayer is generated polar heads on the outside and their hydrophobic tails inside. The
by applying surface pressure to the monolayer. A surface bilayer is about 30 Å thick. Sitting in it are various intrinsic proteins,
balance has floating barriers that move to compress the including the channels shown here.
P1: GNH/GLT P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN009B-418 July 6, 2001 20:50
pipette or brush. Bilayers are also created as hemispherical reagents, often enhances reaction rates. Zeolites with par-
bubbles at the end of a Teflon™ tube. Pressure is applied allel pores suitable for transport are formed by allowing
to the solution phase within the tube to expand membrane the zeolite structure to develop around regularly spaced
material painted across the tube outlet into a hemispheri- organic molecules that function as templates. The organic
cal bubble. In each case, solvent moves toward the support material is then purged to produce membrane with regular,
walls to form a Gibbs plateau, a ring of solvent and mem- parallel pores. Pore diameters can be controlled accurately
brane surrounding the bilayer membrane. With the proper by the choice of template molecule.
concentrations, purity, and overall cleanliness, such mem-
branes are quite stable. However, a biological cell must C. Silica Membranes
remain stable for the life of a cell. Biological lipid mem- Many glass, i.e., silicon dioxide, membranes have some
branes are believed to act as stabilizing filler between the intrinsic porosity and selectivity. The pH “glass” electrode
proteins scattered throughout the membrane. The proteins selectively absorbs H+ ions to give a potential proportional
provide the support while restricting the area needed for to the logarithm of the H+ concentration. Other mem-
an integral bilayer. Larger synthetic bilayers might be sta- branes have been developed as ion-specific electrodes
bilized using membrane molecules that crosslink to form where the membrane is selectively permeable to ions to
a more durable structure. give a potential difference proportional to the concentra-
Solvent-free bilayer membranes are formed by form- tion of that ion.
ing monolayers on water surfaces on either side of a Sensors function in a similar manner. A membrane is
Teflon™ sheet containing a pinhole. As the water (and formed directly on an electrode support so ions or so-
monolayer) level is raised slowly past the pinhole, mono- lute reaching the sensor can induce changes at the support
layers from opposite sides merge to form the solvent-free electrode that can be detected electrically. The basic prin-
bilayer across the pinhole. ciple remains the same; the membrane facilitates transport
A vesicle or liposome is a spherical bilayer that encloses of a selected material between two bulk phases and the
solution and separates it from an external solution. They transport is reduced to some observable signal that can be
might be considered synthetic cells with integral bilayer measured to establish the presence and concentration of
membranes. Vesicles form spontaneously when a solution the species selected.
is agitated with ultrasound. Vesicles formed in this man- Glass membranes can be formed with small pores that
ner are called small, unilamellar vesicles, (SUV). Larger permit separation of hydrogen from other gases. For ex-
vesicles (large unilamellar vesicles, LUV) are formed by ample, the naturally occurring pores in Corning’s Vycor
repeated freeze thaw cycles of a lipid solution. Vesicles of glass are filled with additional silicon dioxide to produce
a specific diameter are formed by high-pressure extrusion pores of a size suitable for gas separation using chemical
of the lipid solution through polycarbonate membranes. vapor deposition. SiH4 and oxygen gas, flowing across op-
The membrane pores must be homogeneous of relatively posite surfaces, diffuse into the pores and react to create
constant diameter. SiO2 with a structure containing pores suitable for the gas
A vesicle can fuse with another vesicle or a planar bi- separation.
layer to create a new homogeneous membrane. Lipid bi- Colloidal silica can also be cast, i.e., aggregated on a
layer fusion is often mediated by a polyvalent ion like Ca2+ suitable surface, to form silica membranes. The colloids
which reduces the Debye lengths of the charged double aggregate into sols on the substrate to form protomem-
layer to permit the two membranes to reach a separation branes that can be calcined to stable, gas-permeable glass
distance where attractive forces can mediate the fusion. membranes.
Membrane fusion might facilitate the transfer of drugs in
the vesicle interior to biological cell. It might also per- D. Capillary Membranes
mit drug delivery to specific cells or act as a time-release
transfer mechanism. Silica membranes with homogeneous pores are also made
The molecules within a bilayer can aggregate into dif- by heating and drawing bundles of capillaries. As the cap-
ferent phases with different degrees of order. Phase for- illary length increases, the inner capillary diameter de-
mation has been studied using fluorescence imaging. creases. The process is repeated to create capillaries with
the appropriate inside diameter. The bundles are then cut
laterally to produce planar membranes.
B. Zeolite Membranes
E. Chemical Etching
Zeolites are aluminum-silicon compounds that form struc-
tures with random pores and cavities. The environment Homogeneous channels with a narrow size distribution
within the cavities, which can be selective for specific are created by irradiating 10–20 µm polymer films such
P1: GNH/GLT P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN009B-418 July 6, 2001 20:50
as polyethylene terephthalate with heavy ions like Ar+9 the cooling rate, polymer solution concentration, and the
produced by a cyclotron. The ions create damage tracks composition of the polymer or polymers selected.
that can be expanded by chemical etching to pores of the Hollow fiber (tubular) membranes are made by spinning
proper diameter. The density of pores is dictated by the or extruding the polymers. The hollow fiber membranes
time and intensity or radiation while the pore diameter is provide a large surface area for transfer and are ideal for
determined by the etching time. separations in flow systems.
Membrane polymers include polypropylene, poly
(vinylidene difluoride), polysulfone, poly(ether sulfone),
F. X-Ray Lithography poly(ether ether ketone), polyvinyl alcohol, polyacryloni-
A surface covered with a photoresist and a gold mask can trile, polycarbonate, and poly(ethylene terephthalate).
be irradiated with high-intensity, well-collimated X-rays These “tortuous pore” membranes resemble porous
that degrade the exposed regions of the polymer. The de- sponges where permeant moves through a convoluted path
graded polymer is removed to create a primary template. as it passes between bulk phases. Membranes are consid-
Galvanoforming produces a secondary metallic template ered dense when they have less than 50% void volume.
that is used as a mold for a polymer membrane with ho- However, even dense membranes can be relatively porous
mogenous pores (Fig. 2). and might require additional treatment to produce a more
selective membrane. The membrane characteristics are of-
ten altered by treating the membrane surface to produce a
G. Cast Dense Membranes region with the requisite properties. The bulk of the mem-
Polymer membranes are also formed by casting. The or- brane then functions simply as a porous support.
ganic polymer that will constitute the membrane is dis- The properties of the membrane skin might depend on
solved in a solvent and spread on a support. The solvent the composition of the casting mixture. The surface might
evaporates leaving an integral membrane with the requi- be modified by high-temperature annealing or by plasma
site properties. Membranes are also formed without sol- treatment, where a discharge in an oxygen atmosphere
vent by spreading polymer melts on the surface to leave a produces the desired changes in membrane properties. The
membrane when the system cools to ambient temperature. surface might also be modified by covalently grafting new
The final membrane structure is controlled by factors like molecules with the desired properties to the surface. All
such treatments attack the surface rather than the interior
support region of the dense membranes.
H. Porous Membranes
A membrane is classified as porous when the void vol-
umes within the membrane exceed 50% of the membrane
volume. The porous membranes can be generated from
the dense membranes by addition of a solvent that in-
duces swelling. The solvent can be evaporated from some
membranes to leave large, stable voids within the porous
membrane. The surfaces of these porous membranes can
also be modified to produce a skin with the desired selec-
tivity and permeability properties.
The fibrous membranes are porous membranes based
on materials derived from cellulose. These fibrous mem-
branes often have pores that permit small biological en-
tities like small cells or viruses from aqueous solution.
Cellulose-based materials include cellulose acetate and
cellophane.
I. Liquid Membranes
A membrane can be formed by inserting a thin layer of in-
FIGURE 2 Schematic diagram showing the preparation of a soluble liquid between two phases. This liquid membrane
metal structure by galvanoforming. might be flowed onto one aqueous phase. The second
P1: GNH/GLT P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN009B-418 July 6, 2001 20:50
aqueous phase is then carefully flowed onto the membrane and solute with equal facility, i.e., the solution flows
phase to complete the system. A porous membrane might through the membrane.
also be impregnated with the membrane liquid to function Semipermeable membranes are used for reverse osmo-
as a support for this liquid membrane. Solutes that dissolve sis where an applied pressure selectively forces water from
in the membrane are absorbed and transported between the a solution through the membrane to separate it from solute.
bulk phases if there is a concentration gradient. Strong synthetic membranes are required for effective, i.e.,
high pressure, purification.
J. Ion Exchange Membranes Membranes with pores that permit a flow of small so-
lute molecules and ions as well as water can separate these
The beads of an ion exchange resin have fixed ionized sites
species from much larger protein or polymer molecules.
of common charge. The sites retain ions of opposite charge
Dialysis membranes with this property are used to purge
for variable times to permit a separation in the eluent. The
small ions from a protein solution, leaving a protein with
classical example is the separation of the lanthanide series
neutralized charge sites. The neutralized semipermeable
of ions with very similar chemical properties.
membrane can be dialyzed with a semipermeable mem-
An ion exchange membrane must have pores that con-
brane where water is transported from the protein solution
tain fixed surface charge. This fixed charge attracts coun-
to a concentrated solution leaving a more concentrated
terions into the pores for transport while rejecting co-ions,
protein solution.
ions with the same charge as those in the surface.
The ion exchange membranes are made for modify-
ing polymers to have ionizable groups that can function L. Gold-Sulfur Bonded Membranes
as the fixed charge. Anionic fixed charge is normally in-
Metal surfaces are difficult to clean to produce surfaces
troduced with sulfonic acid or carboxylic acid groups.
as supports for homogeneous membranes. Patches of ox-
Cationic fixed charge for an anion exchange membrane
idized surface can reject membrane binding to produce
uses positive groups like alkyl ammonium. For example,
gaps in the membrane barrier to spoil the effectiveness of
Nafion 117 has a polytetrafluoroethylene backbone with
the membrane. Membrane monolayers can be bound to
regularly spaced fluorovinyl ether pendant side chains that
intrinsically clean gold surfaces using a gold-sulfur link-
terminate in a sulfonic acid. The concentration of fixed
age. Long hydrocarbon chains with a terminal sulfur bind
charge is high and, when the pores and channels within
densely and homogeneously on the fold surface to produce
the membrane are filled with water, the system functions
an integral membrane. Since the gold substrate functions
as a high concentration electrolytic system.
as an electrode, the combination is particularly suited to
The ion exchange membranes are used in systems where
the development of chemical sensors. The monolayer is
ions of specific charge must be transported. They are used
impregnated with molecules that permit selected species
as separators in fuel cells and for electrodialysis where
to react to the electrode for oxidation or reduction and,
ions must move between phases.
through those chemical changes, detection.
K. Semipermeable Membranes
Semipermeable membranes that transport only water II. MEMBRANE MODIFIERS
while blocking solute flow were discovered by Nollet
200 years ago. They are relatively common in biological While both selectivity and high permeability are charac-
systems. For example, frog skin is often used as a semi- teristic of an ideal membrane, it might be worthwhile
permeable membrane. Synthetic membranes such as to sacrifice high permeability for excellent selectivity.
cellophane and membranes made with polyvinyl alco- This condition can be reached by impregnating an im-
hol, polyurethane, and polytrifluorochloroethylene sele- permeable membrane with molecules that provide highly
ctively transmit water. Their ability to transport water is selective pathways through the membrane. Biological evo-
quantified with their ultrafiltration coefficient, the pro- lution has produced a large number of protein molecules
portionality constant that relates the concentration gradi- that are highly selective to specific chemicals. Although
ent to the net water flux through the membrane. The reflec- their densities in natural cells are often quite low, molec-
tion coefficient measures the relative selectivity of the ular biology provides the means to produce large samples
membrane for water and solute. A reflection coefficient of such proteins. If a large number of such highly selective
of one represents the ideal semipermeable membrane proteins are incorporated into a synthetic membrane, both
where all solute is rejected and only water flows through high selectivity and permeability are possible. The pro-
the membrane. A reflection coefficient of zero signifies a teins transport material across the membrane by several
membrane with pores that permit the flow of both solvent distinct mechanisms.
P1: GNH/GLT P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN009B-418 July 6, 2001 20:50
A. Carrier Molecules central core lined with polar groups. An ion, approaching
this molecular channel, replaces some of its waters of hy-
A hydrophobic membrane like a lipid bilayer blocks the
dration with channel binding groups so that the channel
passage of both metal ions and water since they are in-
interior becomes a continuation of the aqueous solution.
soluble in the nonpolar central region formed from the
Transport across the membrane is then a series of transi-
hydrocarbon chains of the lipids. Ionic solubility is in-
tions between the charged intrachannel binding sites until
creased, however, if the metal ion is encased in organic
the ion is released on the opposite side of the membrane.
ligands which mitigate the effects of the ion charge in the
Biological protein channels can be extremely selective.
membrane. The tetraphenylborate anion, with the boron
A variety of special channels exist to transport sodium,
charge center surrounded by four bulky, nonpolar phenyl
potassium, and calcium ions, the most common biologi-
groups is soluble in the membrane and enters with a single
cal cations. Other channels exist to transport anions like
negative charge. The tetraphenylarsenate cation dissolves
carbonate and chloride.
in the membrane with a net positive charge. The ions are
The number of known ion-selective channels has grown
stabilized in sites near the membrane/solution interfaces.
exponentially with the development of the patch-clamp
Because the bilayer membrane is thin, a small electri-
technique. A specially polished pipette tip is brought care-
cal potential in the millivolt range can produce extremely
fully into contact with the membrane of a biological cell.
large electric fields (ca. 107 V/m) in the membrane. These
Suction is applied to the pipette to hold the cell to the
fields are sufficient to induce a transition in which the ion
pipette tip. Binding occurs between the glass of the tip and
passes through the hydrophobic central region to a site
the cell membrane to produce a high-resistance “gigohm”
at the opposite membrane/solution interface. The motion
seal. The seal is extremely strong and the pipette can be
of the ions produces an observable current in an external
retracted carrying a small section of the cell membrane
circuit. The current decay is exponential indicating inde-
across the tip. This patch membrane can be inserted into a
pendent transitions of the ions.
second solution to produce a tiny area of membrane sep-
Membrane-soluble carrier molecules can bind an ion
arating two aqueous phases. The small membrane area
from aqueous solution to create a charged ion/carrier com-
contains very few channels and the system is studied us-
plex within the membrane. This soluble ion then moves
ing single channel techniques which can monitor the small
down a concentration or electrical gradient to the opposite
currents through a single open channel. The properties of
interface where the ion is released to that solution. Carrier
the channel are deduced from the kinetic behavior of the
transport is modeled in four kinetic steps: (1) the carrier
channel.
absorbs an ion from solution 1; (2) the ion/carrier com-
Early studies on channels were restricted to a limited
plex moves down its gradient to the opposite interface;
number of biological systems that could be probed ex-
(3) the ion dissociates from the carrier or ionophore; and
perimentally with internal electrodes. For example, the
(4) the carrier (without ion) diffuses back to the first in-
squid axon, the portion of the nerve cell which transmits
terface to bind another ion and repeat the four step cycle.
a nervous impulse to the muscles that contract the ani-
For each cycle, one carrier transports one ion across the
mal’s mantle for flight, is roughly ten times larger than a
hydrophobic membrane.
human axon. Piggy-back electrodes with voltage-sensing
A carrier molecule can have structure or binding sites
and current-carrying electrodes glued together were in-
that make it suitable to transport a specific ion or class of
serted axially into the axon. A voltage clamp was then
ions. Valinomycin, for example, binds univalent K+ ion to
used to maintain a constant transmembrane potential while
initiate the four-step carrier/ion cycle. A class of macro-
current through channels in the axon membrane was mon-
cyclic molecules, the crown ethers, absorbs ions into the
itored. The patch-clamp technique has largely supplanted
central cavity of their crown-like structure to produce the
these more restricted membrane systems since a patch of
membrane-soluble ion complexes. Special ionophores like
membrane can be excised from almost any cell.
lasalocid and A23187 with judiciously placed carboxyl
The gramicidin channel, formed by dimerization of two
groups are selective for polyvalent ions such as Ca (II) or
molecules in a loose helical conformation (Fig. 3), has
lanthanide ions like Eu (III). The ionophore wraps around
been studied extensively as a model system. The helical
the ion as the carboxyl groups bind to produce the mem-
dimer spans the membrane to produce a channel through
brane soluble complex ion.
the hollow center of the helix with anionic sites or re-
gions to facilitate cation transport. The channel transmits
only univalent cations. Divalent calcium ion blocks ion
B. Channel Molecules
flow through the channel. The channel exhibits the anoma-
Ions can be transported across a hydrophobic membrane lous mole fraction effect. Both univalent thallous ion and
if this membrane is traversed by molecules with a hollow sodium ion are very permeable when gramicidin dimers
P1: GNH/GLT P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN009B-418 July 6, 2001 20:50
are possible. These wavelengths permit resolution of brane capacitance. The capacitance then determines the
structures on the nanometer scale. Transmission elec- membrane thickness if the area of the membrane is known.
tron microscopy (TEM), where the electrons penetrate If the membrane under study also contains channels or
through thin cross-sectional samples (50 nm) that have pores that transmit ions, the current also contains an ohmic
been freezedried and sectioned, uses the wave interference component and the electrical properties of the membrane
patterns to establish the structure of the sample. are modeled as a parallel resistor-capacitor combination.
Scanning electron microscopy (SEM) utilizes scattered Liposomes or vesicles in solution can be modeled
electrons to generate the image and has higher resolu- as nonconducting spheres embedded in a conduc-
tion than TEM. However, both techniques require sample tive medium. The measured capacity of the ensemble of
preparation which might alter the structure of the sample vesicles is converted to a capacitance per unit area of mem-
under observation. In addition, the high voltages required brane surface which, in turn, determines the membrane
to accelerate the electrons to the proper wavelength can thickness.
also cause damage to the surfaces.
opposite bulk phase to reestablish the equilibrium condi- because the fixed negative charge in membrane channels
tion. The net effect is a reduction of salt CA in Bath 1, the excludes anions from the pores so that only cations move
solution with the protein, and a CA concentration gradient with the electrial potential gradient; the flow of anions in
across the membrane. the opposite direction is reduced significantly by the fixed
charge. The waters of hydration on the moving cations are
carried through the pores as well to produce the observed
3. The Electrochemical Potential
water flow that characterizes electroosmosis.
An ion exchange membrane that can pass ions of one po-
larity while rejecting those of opposite polarity will pro- 3. Streaming Potential and Current
duce an electrical potential to balance the permeable ion
In electroosmosis, an applied voltage produces a flow of
concentration gradient. For example, channels with an-
neutral water, i.e., the ion flow and water flow are coupled.
ion charge will encourage the passage of cation while re-
The reciprocal process occurs when solution is forced
pelling anion to produce a charge separation and electrical
through the membrane with charged pores under a hy-
potential
drostatic pressure. Ions of charge opposite to that fixed in
V = V2 − V1 = −(RT )/(z F) ln[c2 /c1 ] the pore walls are carried along with the water while the
co-ions are repelled from the pores. The flow of one type of
where RT /F ∼ 25 mV and z is the charge on the permeant
charge produces a net current through the channels. This
ion.
streaming current would appear in a closed electrical cir-
A strong, cation permeable membrane might tap the the
cuit. If the electrical circuit is open, a streaming potential
Na+ concentration gradient as a fresh water river reaches
appears across the pore or channel.
the ocean to produce electrical power.
4. Oscillations
B. Dynamic Phenomena
Oscillatory behavior is the exception in homogeneous
1. Ultrafiltration phases although there are some special chemical reac-
tions that do produce oscillatory behavior. A membrane
A water concentration gradient or an externally applied
system is a multiphase system and oscillations can be gen-
hydrostatic pressure can force water through a semiperme-
erated with a judicious choice of external forces on the
able membrane. The flux of water through the membrane
membrane.
is directly proportional to the concentration gradient. This
One basic oscillatory system uses the kinetic behavior
is an example of Fick’s first law for a flux J
of protein channels for sodium and potassium ion in
J = P(c1 − c2 ) a nerve membrane. Before electrical excitation, the
sodium channels are closed to sodium ion flow and
where P is the permeability coefficient.
the electrochemical gradient of 110 mV possible with
A membrane permeable to both water and solute re-
the sodium ion gradient does not develop. The more per-
quires flux equations for both the water and the solute.
meable potassium channels tap the potassium ion gradient
The fluxes of water and solute are directly proportional to
to produce an internal electrical potential of −60 mV
their respective concentration gradients. In addition, the
relative to the external solution as ground. On excitation,
flows can be coupled so that a hydrostatic pressure also
the transient channels open and then close. As the sodium
produces a solute flux while a concentration gradient, pro-
channels open, the sodium electrochemical potential of
portional to osmotic pressure π , produces a water flux. The
110 mV appears to produce a peak internal potential
linear equations
of −60 + 110 = + 40 mV, The one-shot oscillation is
Jw = L 11 P + L 12 π completed as the sodium channel closes, the sodium-ion
induced potential is lost, and the internal potential returns
Js = L 21 P + L 22 π
to the potassium channel dominated potential of −60 mV.
obey the Onsager reciprocal relations The entire oscillation is controlled only by the sodium
and potassium channels and the sodium and potassium
L 12 = L 21
ion concentration gradients across the membrane.
2. Electroosmosis
SEE ALSO THE FOLLOWING ARTICLES
For a membrane with pores containing fixed negative
charge, the flow of ions through these pores produces a BATTERIES • CRYSTALLIZATION PROCESSES • DIS-
concomitant flow of water. This electroosmosis results TILLATION • ELECTROCHEMICAL ENGINEERING • ION
P1: GNH/GLT P2: GNH Final Pages
Encyclopedia of Physical Science and Technology EN009B-418 July 6, 2001 20:50
TRANSPORT ACROSS BIOLOGICAL MEMBRANES • Hiemenz, P. (1986). “Principles of Surface and Colloid Chemistry,”
MEMBRANES, SYNTHETIC, APPLICATIONS • SOLVENT Marcel Dekker, New York.
EXTRACTION • SURFACE CHEMISTRY • WASTEWATER Kesting, R. (1971). “Synthetic Polymer Membranes,” McGraw-Hill,
New York.
TREATMENT AND WATER RECLAMATION Lloyd, D. R., ed. (1985). Materials Science of Synthetic Membranes,
ACS Symposium Series No. 269, Am. Chem. Soc., Washington, DC.
Sorensen, T. S. (1999). “Surface Chemistry and Electrochemistry of
Membranes,” Marcel Dekker, New York.
BIBLIOGRAPHY Starzak, M. (1984). “The Physical Chemistry of Membranes,” Academic
Press, New York.
Aideley, D., and Stanfield, R. (1996). “Ion Channels,” Cambridge Univ. Starzak, M. (1994). “Kinetics of Permeation and Gating in Membrane
Press, Cambridge. Channels,” 46: 61.
P1: GPA/GJK P2: GPJFinal Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN09J-437 July 19, 2001 22:41
Micelles
Srinivas Manne L. K. Patterson
University of Arizona University of Notre Dame
I. Brief History
II. Micelle Formation
III. Micellar Structure
IV. Micellar Kinetics
V. Solubilization
VI. Micellar Catalysis
VII. Micelles at Interfaces
VIII. Surfactant Aggregates at Solid-Liquid
Interfaces
IX. Conclusion
661
P1: GPA/GJK P2: GPJFinal Pages
Encyclopedia of Physical Science and Technology EN09J-437 July 19, 2001 22:41
662 Micelles
Micelles 663
typical micellar surfactants are given in Fig. 2, in which consideration of the driving forces for molecular organiza-
the dual, or amphiphilic, nature of these molecules may tion in this type of aggregate. Two principal factors govern
be easily seen. For example, in the case of sodium dode- such organization: (1) tailgroup–tailgroup interactions, in
cylsulfate (SDS), one of the most widely used surfactants which the hydrophobic effect (see Section II.A) causes the
in micelle studies, the molecule consists of a charged sul- nonpolar entities to coalesce; (2) headgroup–headgroup
fate headgroup (hydrophilic) and a 12-carbon aliphatic and headgroup–water interactions which oppose the total
chain (hydrophobic). Such chains are commonly termed separation of surfactant into a nonpolar phase. The in-
hydrocarbon or hydrophobic tails. Also among the most terplay of these two factors determines the concentration
extensively studied groups of surfactants are the tertiary onset of micelle formation, the micelle size distribution,
amines (e.g., hexadecyltrimethylammonium bromide, and subsequent physical and chemical properties.
CTAB). For uncharged surfactants, the molecule may have
a hydroxyl headgroup, but normally it includes a region
A. The Hydrophobic Effect
of linked ethoxy groups that may undergo some hydration
to compensate for the lack of charge on the OH. See the The marked insolubility of nonpolar molecules in wa-
polyoxyethylene alcohol (or ethyleneglycolmonoether) in ter is essentially an entropic effect and arises, not from
the figure. energy of association between hydrocarbon molecules,
It was established early that surfactants dissolve as sin- but rather from the unique hydrogen-bond character
gle molecules in water up to a given concentration (de- of water structure. Simply stated, the hydrogen-bonded
pendent on temperature and ionic strength), beyond which structure of bulk water must be distorted to accommodate
they are no longer soluble in water as monomeric units but hydrocarbon molecules. H2 O forms an ordered, cagelike
form aggregates. The monomeric surfactant concentration
beyond which added surfactant forms micelles is termed
the critical micelle concentration (cmc). Micelle forma-
tion is not a matter of simple phase separation. There is a
region over which both monomer concentration increases
and aggregation takes place. Experimentally, this region
is determined by plotting measurements of some physical
or spectroscopic property that changes markedly with mi-
celle formation as a function of surfactant concentration.
Data from low and high concentrations are extrapolated
to an intersection point that is taken as the cmc. Some of
the properties that exhibit such behavior are illustrated in
FIGURE 3 Concentration-dependent behavior of some physi-
Fig. 3. cal properties for a micelle-forming surfactant in solution. [From
Any understanding of effects produced in aqueous so- Lindman, B., and Wennerström, H. (1980). Top. Curr. Chem. 87,
lution by the presence of micelles must be preceded by a 1–83.]
P1: GPA/GJK P2: GPJFinal Pages
Encyclopedia of Physical Science and Technology EN09J-437 July 19, 2001 22:41
664 Micelles
Micelles 665
TABLE I Effect of Physical and Structural Properties on the as well. The absence of highly repulsive headgroups in
Critical Micelle Concentration (cmc), Aggregation Number nonionic systems is expected to affect both cmc and micel-
(n), and Degree of Ionization (α)a
lar size. And as expected, it is found that the cmc is about
Surfactant cmc (mM) n α two orders of magnitude lower for a nonionic surfactant
than for an ionic one having the same-length aliphatic
Chain length
group. With the nonionic polyoxyethylene surfactants,
C10 N(CH3 )3 Br 63 52 0.18
micelle size exhibits a marked dependence on the size of
C12 N(CH3 )3 Br 15 73 0.16
the polyoxyethylene chain. With C12 H25 (OEt)8 OH, only
C14 N(CH3 )3 Br 3.6 107 0.15
small micelles with aggregation numbers up to about
Surfactant branching
120 are seen at room temperature, even at high concen-
C14 OSO3 Na 1.7 — —
tration, while with the (OEt)6 analog, size can grow to
C12 CH(CH3 )OSO3 Na 3.3 — —
quite large values.
C11 CH(CH2 CH3 )OSO3 Na 5.8 — —
Surfactant head size
C10 NH3 Br 16 1100 0.07
III. MICELLAR STRUCTURE
C10 NH2 CH3 Br 20 670 0.07
C10 NH(CH3 )2 Br 12 69 0.08
As a consequence of the interactions discussed previ-
C10 (CH3 )3 Br 18 48 0.13
ously, the micelle can be visualized as being composed
C14 N(CH3 )3 Br 3.5 71 0.14
of three regions: (1) the hydrocarbon core, (2) the polar
C14 N(n-Pr)3 Br 2.1 48 0.11
surface layer, and (3) the diffuse double layer. These may
Counterion type
be considered separately along with the tools that have
C12 N(CH3 )3 Br (0.5 M NaF) 8.4 59 0.21
been used to give us our present perception of the overall
C12 N(CH3 )3 Br (0.5 M NaCl) 3.8 62 0.20
structure.
C12 N(CH3 )3 Br (0.5 M NaBr) 1.9 84 0.17
C12 N(CH3 )3 Br (0.5 M NalO3 ) 5.1 38 0.24
Temperature A. The Hydrocarbon Core
C12 N(CH3 )3 Br (25◦ C) 15 43 0.17
In early work, Hartley suggested that the hydrocarbon in-
C12 N(CH3 )3 Br (35◦ C) 15 34 0.18
terior of the micelle must be liquidlike if one is to under-
C12 N(CH3 )3 Br (45◦ C) 17 28 0.19
stand its extensive solubilization properties. Very little has
C12 N(CH3 )3 Br (55◦ C) 17 24 0.20
happened that would alter that view. Since alkanes with
Salt concentration
structures analogous to the hydrocarbon tails found in sur-
C14 OSO3 Na (0.0125 M NaCl) 1.7 120 0.12
factants exhibit freezing points generally much lower than
C14 OSO3 Na (0.050 M NaCl) 1.1 144 0.11
the temperatures at which micelles are studied, the liquid-
C14 OSO3 Na (0.10 M NaCl) 0.32 153 0.11
like nature of the core is to be expected.
Surfactant concentration
Many studies have been directed at reaching some quan-
C12 OSO3 Na (0.012 M) — 83 0.26
titative view of the micellar interior. Thermodynamic data
C12 OSO3 Na (0.020 M) — 87 0.26
taken from partial molal properties indicate that these
C12 OSO3 Na (0.129 M) — 110 0.26
properties in micelles are comparable to those found in liq-
Nonelectrolyte additive
uid hydrocarbons. Several spectroscopic approaches have
C12 OSO3 Na (pure H2 O) 5.7 — —
been used including fluorescence depolarization and ESR
C12 OSO3 Na (6 M urea) 9.5 — —
measurements involving nitroxide spin labels. Both of
C12 OSO3 Na (0.06 mole fraction MeOH) 7.9 64 0.19
these measure the mobility of the probe within the micelle.
C12 OSO3 Na (0.12 mole fraction MeOH) 9.0 23 0.22
A somewhat lower mobility than in pure hydrocarbon is
a From Menger, F. M. (1977). In “Bioorganic Chemistry III. Macro- found for the fluorescence probes, and a lower mobility has
and Multimolecular Systems” (E. E. van Tamelen, ed.). Academic Press, been observed for the nitroxide than in aqueous solution.
New York. Such measurements have been criticized by suggesting
that the probes themselves significantly perturb the mi-
interactions, are summarized in Table I for various ionic celle and, in monitoring probe motion, do not give a true
systems. picture of the liquid character of the core. Further, some of
With nonionic surfactants, for example, the poly- these probes are, indeed, of a size significant on the scale
oxyethylene alcohol in Fig. 2, the solubility of the of a micelle. Additionally, there is evidence that some such
headgroup comes from hydration, and the headgroup probes tend to localize near the micelle surface where flu-
repulsions are much smaller. Steric factors may play a role idity can differ from that found near the micellar center.
P1: GPA/GJK P2: GPJFinal Pages
Encyclopedia of Physical Science and Technology EN09J-437 July 19, 2001 22:41
666 Micelles
An approach that involves no such perturbations is the is found, that the cmc is lowest for I− , increasing as one
use of 13 C NMR, from which one may gather information goes to the left in the series. For sulfates and sulfonates, the
about the conformation of the chains and chain motion. counter-cations bind in the order Li+ < Na+ < K+ < Cs+ .
It has been suggested that, on average, the chains tend to In both cases, smaller counterion size correlates with
be somewhat more extended in the micelle than in pure weaker binding to headgroups. This is because small and
liquid; the measurements of motion also give indication energetic ions such as Li+ and F− strongly bind dipolar
of a liquidlike structure. H2 O molecules, greatly inflating the (now hydrated) ion
size and shielding the charge within the hydration shell. If
the counterion is made hydrophobic, one has, in addition
B. The Micellar Surface Region
to electrostatic effects, hydrophobic interactions between
It is normal to think of the surface of the micelle as the re- surfactant and counterion. An example is the salicylate
gion in which one finds not only the surfactant headgroup anion, which binds very strongly to cationic surfactants,
but also bound counterions (in the case of ionic head- decreasing the cmc manyfold and α to near zero.
groups) and water molecules. This region is also referred
to as the Stern layer. The outer edge of this layer defines the
C. Hydration in the Micelle
shear surface of the micelle as it moves and is only a few
angstroms thick, roughly the diameter of the headgroup Because there is effectively a hydrocarbon–water inter-
(see Fig. 1). As stated above, the headgroups are always face at the micelle surface, several possibilities must be
partially associated with counterions in the Stern layer, considered for hydration in the micelle. These involve
which diminishes the surface potential. A number given counterions, headgroups, and the portion of the hydro-
for concentration local to the Stern layer is about 3 M. carbon chain near the Stern layer. The bound counterions
These counterions are subject to both coulombic and ther- themselves may differ considerably in extent of hydration;
mal forces, one serving to attract them to the headgroup the cations noted above maintain a higher degree of hy-
and the second to disperse them. The extent to which coun- dration than do the anions. There has been considerable
terions are unbound from headgroups in the Stern layer is evidence that much of the water of hydration is maintained
defined as the degree of ionization α and has a profound upon binding. From viscosity and diffusion studies, hydra-
effect not only on micelle structure but on the whole range tion numbers for micelles have been deduced for a variety
of properties that depend on electrostatic interactions. The of charged surfactants; these fall into the range of 5–10
region beyond the shear surface in which the dissociated water molecules per surfactant molecule. For uncharged
counterions lie is called the Gouy–Chapman double layer micelles with variable numbers of oxyethylene groups,
and may be several hundred angstroms thick. Here the the hydration number can vary considerably, as seen from
counterions may exchange with the bulk phase. Fig. 6.
The perception of bound versus dissociated counteri- The extent to which water interacts with the hydrocar-
ons, of course, depends on the method used to measure it. bon chains of the surfactant is a question yet unsettled;
Spectroscopic techniques, such as fluorescence quench- various workers have suggested complete, partial, or no
ing or NMR, measure changes in properties affected by water penetration into the micellar core. For aliphatic
the presence of ions in the surface region; thermodynamic
measurements are sensitive to ion concentrations well
away from the Stern layer. Transport techniques, on the
other hand, measure the quantity of ions that move through
solution with the micelle. These approaches give results
that agree only qualitatively; however, α generally falls
in the region 0.1–0.4. By and large, it is little changed by
temperature, total surfactant concentration, or alkyl chain
length for micelles of the same shape. (Exceptions are fatty
acid sodium salts, for which α decreases from 0.43 for C7
to only 0.26 for C12 .) For systems with amine and sul-
fate headgroups, a compilation of dependences on various
surfactant parameters is included in Table I.
The effectiveness of binding has been shown to be
FIGURE 6 Dependence of the number of water molecules per
dirrerent for different counterions. For alkyl ammonium polyoxyethylene chain on the number of oxyethylene units for mi-
headgroups, anions bind more effectively in the sequence celles of sodium dodecylpolyoxyethylene sulfates. [From Takiwa,
F− < Cl− Br− < No− −
3 I . It is to be expected, and F., and Ohki, K. (1967). J. Phys. Chem. 71, 1343–1349.]
P1: GPA/GJK P2: GPJFinal Pages
Encyclopedia of Physical Science and Technology EN09J-437 July 19, 2001 22:41
Micelles 667
surfactants, persuasive evidence has been given to sug- D. Surfactant Organization in Micelles
gest that the core of the micelle contains no substantial
Developing a three-dimensional model that accurately
amount of water and that if there is penetration it proba-
takes into account all the characteristics experimentally
bly involves no more than two carbons. Surfactants con-
defined for the micelle—surface structure, water pene-
taining oxyethylene moieties appear to be substantially
tration, fluidity of the core, solubilization, and dynamic
hydrated in the region of these groups. This is implied in
character—is not a trivial task. The initial picture given in
Fig. 6. A number of studies involving probes solubilized
Fig. 1, while providing an elementary concept with which
in the micelle or attached to the surfactant itself (e.g., car-
to work, does not reflect either the dynamic character of
bonyl groups) indicate the presence of water around such
micelles or the effect this should have on surface regu-
probes. It is possible that perturbation of the system by the
larity. Further, it does not reflect any randomness in the
included probe can lead to significant water–hydrocarbon
coiling of the hydrocarbon chains.
contact and/or that many such probes lie near the surface.
Attempts have been made to construct micelle struc-
Of course, as has been shown, the dynamic equilibrium
tures along the lines suggested by Fig. 1, with molecular
between surfactants in bulk phase and micelle assures con-
models in which some kinks are introduced to provide a
siderable motion of material across the micelle boundary.
much more rugged micelle, but one in which considerable
Additionally, it is quite likely that the micelle surface is
water penetration is suggested. The representation in Fig. 7
not a simple two-dimensional interface but involves some
approximates this approach. Other more geometrically
protrusion of surfactants into the bulk where hydrocarbon
simple models using square-lattice packing suggest al-
wetting could occur (Fig. 7). In summary, one should note
ternate ways of arranging surfactants: first using parallel
that whether there are 5–10 water molecules associated
correlation with headgroups separated at right angles and
with each surfactant of a simple micelle or whether there
then with the introduction of gauche conformations near
is penetration to several carbons depth in the presence of
the headgroup to release headgroup contact. The results of
solubilized material, a substantial fraction of material as-
such efforts are given in Fig. 8. Averaging arrangements
sociated with the micelle is, indeed, water.
of this type can yield an approximation to the ideal micro-
scopic oil drop shown in Fig. 8b.
Statistical theory employing a three-dimensional lattice
model has also been developed. The model emerging
from such an approach (Fig. 9) provides constant radial
668 Micelles
Micelles 669
670 Micelles
FIGURE 11 Solubility of decanol in solutions of (a) sodium octanoate (20◦ C), (b) sodium decanoate (40◦ C), (c)
sodium dodecanoate (40◦ C), and (d) sodium tetradecanoate (40◦ C). [From Ekwall, P. et al. (1969). Mol. Cryst. Liq.
Cryst. 8, 157–213.]
such substances in aqueous solution has been used as a of a solubilizate. As an illustration, solute near the surface
measure of the cmc. Because the extent of solubilization can produce a gain in energy by decreasing the ratio of
is dependent upon the quantity of micellar surfactant, such gauche to trans configurations in hydrocarbon chains in
behavior has been interpreted in terms of the phase sep- that region. Rigid molecules such as planar aromatic hy-
aration model where the surfactant is considered to be a drocarbons may perturb packing in the core, which may
separate or pseudo-phase. Indeed, various spectroscopic contribute to their unfavorable location in that domain.
observations indicate that many solubilizates move in a While various techniques, such as stopped flow, have
liquidlike environment, an observation that is consistent been used to follow substrate kinetics, many kinetic mea-
with the idea of a separate phase. One consequence of this surements have involved the photophysical properties of
approach has been to show the validity of a relationship solubilized probes. Because of the luminescent properties
analogous to Henry’s law: of their excited states, the aromatic hydrocarbons provide
opportunities for monitoring movement of such probes
cmc(Xa) = cmc(Xa = 0) − k · Xa (4)
across the micelle boundary. For example, long-lived
for micellar systems, where Xa is the mole fraction of the phosphorescence of aromatic hydrocarbons has been mon-
material to be solubilized. itored in micellar solutions containing ionic quenchers that
As described previously, the micelle exhibits a surface themselves are repelled by the surfactant head groups.
region with significant polar character, while the interior is Since quenching must take place in the aqueous phase,
essentially hydrocarbon in nature. It is to be expected that phosphorescence lifetimes may be interpreted to provide
materials associating with micelles would distribute them- rate constants for exit of the probe from the micelle. Some
selves between these two regions depending on polarity typical values obtained by this technique are given in
or polarizability. Indeed, those materials such as aliphatic Table III. Fluorescence data have also been used to obtain
hydrocarbons that have no surface activity are to be found such information.
in the micellar core while the more polarizable aromatic
hydrocarbons have been found to associate with the sur-
face region. Many such species give evidence of lying in an VI. MICELLAR CATALYSIS
alcohol-like environment. There are further energetic con-
siderations related to the effects of surfactant packing on It has been shown that a number of chemical processes can
the micellar structure itself that influence the distribution be kinetically altered in the presence of micelles. Either
P1: GPA/GJK P2: GPJFinal Pages
Encyclopedia of Physical Science and Technology EN09J-437 July 19, 2001 22:41
Micelles 671
672 Micelles
In systems where more than one reaction pathway is effects on reaction rates of excited states and radiolyti-
possible, localization of reactant by micelles may govern cally generated radicals. While these might appear to be
the kinetics and determine product yield. An example of highly specialized measurements, the time resolution that
this may be nicely seen in the photolysis of ketone: may be achieved provides insight into the catalysis of
many micelle-related events on a very short time scale.
O
For example, the movement of charged radicals, such as
CH2 C CH2 CH3 hydrated electrons, to reaction sites at positively charged
micellar surfaces has been shown to increase up to two
orders of magnitude over diffusion control. Dispropor-
For convenience, one may use the notation tionation of Br− 2 radicals has been shown to occur much
more rapidly on a CTAB micelle surface than in solu-
A CH2 tion. Initial steps in the radical processes governing lipid
peroxidation, a mechanism of particular biological inter-
est, have been studied in micelles to determine the effects
and of a membrane-mimic environment on radical behavior.
Micellar influence on kinetics of excited-state quench-
B CH2 CH3 ing by adsorbed anions, photoionization of sequestered
chromophores, and many other photoprocesses have been
investigated.
and write the reaction
h
ACB AA AB BB (8) VII. MICELLES AT INTERFACES
O
For all the structural and behavioral complexity that mi-
The distribution of products in homogeneous solution celles present, they are the simplest assembly of am-
is governed by simple statistics as shown in Fig. 13. How- phiphilic molecules with which to deal experimentally.
ever, with the addition of surfactant up to a concentration In most cases one merely dissolves the surfactant of inter-
that localizes each reactant molecule in a micellar “cage,” est in water, and the gods of thermodynamics do the rest.
the yield of AB is totally dominant. While only a limited number of biological amphiphiles ac-
A large number of studies have been carried out with tually aggregate in this way (bilesalts, fatty acid salts, and
fast-reaction kinetics techniques to characterize micellar lysolecithin), micelles do present accessible model sys-
tems with which to approach phenomena governing more
complex, extended assemblies such as biological mem-
branes. Although the limitations of micelle–membrane
comparisons must be kept in focus, a wide range of infor-
mation characterizing hydrophobic interaction and water–
lipid interfacial phenomena, which are highly relevant to
biological systems, has been built up from the collective
study of micelles.
Micelles 673
(hydrophilic
surfaces)
(hydrophobic
surfaces)
Adsorption density
CMC
Surfactant concentration
FIGURE 14 Schematic of “two-step” absorption isotherms for surfactants on hydrophobic surfaces (dashed line, right
vertical scale) and on hydrophilic surfaces (solid line, left vertical scale). The absorption models are inferred from the
measured amounts of surfactant absorbed per unit surface area.
Another is particulate detergency, in which surfactant in- ety of hydrophobic and hydrophilic surfaces. At the risk of
filtration between a soil particle and the substrate eventu- overgeneralizing, adsorption isotherms typically follow a
ally desorbs the soil and prevents its readsorption. A final two-step pattern (Fig. 14) for both hydrophobic and hy-
example is tertiary oil recovery, in which oil is dislodged drophilic surfaces. In both cases, a low-density plateau at
from the microchannels of porous rocks by competitive low concentrations gives way to a high-density plateau as
surfactant adsorption at the pore surfaces and by emul- the concentration approaches the cmc; this final adsorp-
sification of the oil. Despite the importance of these ap- tion density remains approximately constant up to very
plications, knowledge of surfactant behavior at interfaces high surfactant concentrations. The surfactant organiza-
has historically lagged far behind that in bulk solutions. tion at each plateau is inferred from the measured surface
Which morphologies surfactant aggregates assumed at in- density and from known interaction sites. Hydrophobic
terfaces, or even whether well-defined aggregates analo- surfaces, which interact with tailgroups via hydrophobic
gous to bulk micelles existed at interfaces, remained open interactions, display a low-density plateau consistent with
questions until fairly recently. a horizontal monolayer and a high-density plateau con-
The first evidence for hydrophobic association at in- sistent with a vertical monolayer, with tailgroups facing
terfaces came from adsorption isotherms—i.e., measure- the surface. This is in contrast with charged hydrophilic
ments of surface adsorption density as a function of surfaces, which interact electrostatically with oppositely
surfactant concentration in solution. In a landmark 1955 charged headgroups. Here the low-density plateau approx-
paper, A. M. Gaudin and D. N. Fuerstenau noted that the imately corresponds to a vertical monolayer (headgroups
surface density of SDS on alumina increased sharply as the facing the surface), whereas the high-density plateau is
concentration approached the cmc, and they interpreted consistent with a vertical bilayer above the cmc.
this as evidence for hydrophobic association into interfa- However, while these flat morphologies served as the
cial aggregates termed hemimicelles. standard models of interfacial aggregation, uncertain-
Confirming evidence for interfacial aggregation has ties in surface area determination (which can approach
since come from hundreds of adsorption studies on a vari- 30%) could not definitively exclude curved interfacial
P1: GPA/GJK P2: GPJFinal Pages
Encyclopedia of Physical Science and Technology EN09J-437 July 19, 2001 22:41
674 Micelles
Micelles 675
B
FIGURE 16 AFM images and schematics of alkyltrimethylammonium surfactant aggregates on (a) the hydrophobic
graphite surface and (b) the hydrophilic mica surface. In (a), the bottom row of tails are aligned parallel to a surface
symmetry axis, orienting the half-cylinder perpendicular to this axis. In (b), the smaller contact area between the
aggregate and surface allows the cylindrical aggregate to meander over the surface.
heavily on the crystalline anisotropy of the substrate, with micelles in bulk. The orientation of the aggregates with
the surfactant geometry itself playing a comparatively respect to the surface lattice suggests that the crystalline
minor role. Almost all surfactants—ionic, nonionic, and anisotropy of the substrate plays a central role in deter-
zwitterionic, with tail lengths ranging from 12 to 18 car- mining this curvature. This has been further confirmed
bon atoms—aggregate in the form of half-cylindrical ag- by control experiments on amorphous hydrophobic
gregates on cleaved crystals of graphite and MoS2 . This surfaces; these show half-spherical micelles above the
is evidenced by AFM images in the form of rigid, paral- cmc, in agreement with the spontaneous curvature in bulk
lel stripes, separated by a little over twice the surfactant solution.
length, with stripe axes running perpendicular to the un- The current understanding of the adsorption and aggre-
derlying lattice symmetry axes. Parallel half-cylinders are gation process is as follows. At very low concentrations,
consistent with the observed stripe spacing and with the hydrophobic attraction causes the tailgroup to adsorb hor-
known adsorption density (roughly equivalent to a vertical izontally on the surface, and an anisotropic interaction
monolayer) from isotherms. with the surface lattice causes the tailgroup to orient it-
The cylindrical curvature is, however, initially sur- self parallel to an underlying symmetry axis. Because the
prising considering that these surfactants form spherical tail-surface interaction for this configuration is typically
P1: GPA/GJK P2: GPJFinal Pages
Encyclopedia of Physical Science and Technology EN09J-437 July 19, 2001 22:41
676 Micelles
stronger than that between two tails, the molecule does not structure than close-packed spheres and a flatter curvature
gradually desorb to a vertical orientation (as in Fig. 14) than is found for bulk micelles. The mica surface has been
as the surfactant concentration is increased. Instead, each likened to a highly charged “planar counterion,” which in-
strip of horizontal molecules, arranged head-to-head and duces a sphere-to-rod transition at the interface, in a simi-
tail-to-tail along a symmetry axis, serves as a founda- lar way that multivalent counterions induce sphere-to-rod
tion for a half-cylindrical aggregate above the cmc (see transitions in bulk micelles. In cases where the surfac-
the schematic of Fig. 16a). Apparently, the original align- tant headgroups are highly repulsive or bulky, even the
ment of tails parallel to a symmetry axis not only deter- mica surface is unable to bring headgroups close enough
mines the direction of the final half-cylindrical aggregate, to effect a sphere-to-cylinder transition, and interfacial
but also the fact that half-cylinders are the favored ar- micelles remain spherical. This is the case for divalent
rangement. Where the lattice alignment is absent—i.e., surfactants, as shown in Fig. 15b.
for isotropic substrates such as hydrophobized silica—the In summary, the geometry of surfactant aggregates at
same surfactants form half-spherical aggregates. charged surfaces is highly sensitive to “charge density
Both graphite and MoS2 —despite marked differences matching” between the surface and free (unperturbed)
in lattice symmetry, spacing, and surface groups—give micelles. The interfacial aggregate can have a flatter
rise to similar half-cylindrical aggregates at the same per- curvature than free micelles in cases where the adsorp-
pendicular orientation relative to the underlying lattice. tion density becomes comparable to the micelle charge
This suggests that strong tailgroup alignment, leading to density.
half-cylindrical aggregates, may be a feature common to
all crystalline hydrophobic adsorbents. Since hydrophobic
surfaces interact with the entire length of the tailgroup, this IX. CONCLUSION
large interaction area evidently leads to a high degree of
surface control of the aggregate structure. In the technological realm, an application that has gen-
erated much excitement is the synthesis of mesoscopic
C. Aggregates at Hydrophilic Surfaces materials using surfactant micelles as templates, as first
reported in 1992 by Beck et al. This process relies on in-
Hydrophilic surfaces, on the other hand, interact with the organic polymerization at the interfacial region between
much smaller surfactant headgroup. It is therefore natural a surfactant aggregate and a solution in which the inor-
to expect “full” aggregate structures, whose curvature is ganic precursors (usually silicate ions) are initially dis-
controlled as much by intermolecular interactions as by persed. By restricting the polymerization reaction to the
the surface. This is exactly what is observed by AFM. micelle–solution interface, a complex nanocomposite is
The vast majority of experiments in this category have formed consisting of surfactant micelles embedded in an
investigated ionic surfactants on oppositely charged sur- ordered array within a continuous inorganic (e.g., silica)
faces. On the anionic surface of silica, single-chain “scaffold.” Pyrolyzing the surfactant finally results in a
cationic surfactants self-assemble into spherical aggre- mesoporous material, with pore sizes of order 5 nm, which
gates (resembling bulk micelles) above the cmc. These can serve as, for example, a molecular filter, catalytic
are thought to originate from the electrostatic binding be- support, or laser waveguide.
tween charge sites on the surface and individual surfac- However, aside from such applications and, of course,
tant molecules; the latter then serve as nucleation sites for the industrial interest in detergent action, one should note
micellar aggregation above the cmc. Similar results have that of the wide range of experimentalists and theoreticians
been observed with anionic surfactants on the cationic that have been drawn to investigation of micelles, most
surface of alumina. In both cases, the spherical interfacial have been attracted by the unique intellectual challenge
aggregates are consistent with the curvature found in bulk such systems offer. The literature provides ample evidence
solution. Micelle curvature is expected to be relatively that they have not been disappointed.
unperturbed as long as the substrate charge density (i.e.,
adsorption site density) falls short of the charge density
on the outer surface of the micelle. Most surfaces satisfy
this requirement. SEE ALSO THE FOLLOWING ARTICLES
A notable exception is the anionic cleavage plane of
mica, where exchangeable surface ions give rise to a far CHEMICAL KINETICS, EXPERIMENTATION • CHEMICAL
higher adsorption density than on silica. Alkyltrimethy- THERMODYNAMICS • ELECTROPHORESIS • HYDROGEN
lammonium surfactants on mica self-assemble into paral- BOND • KINETICS (CHEMISTRY) • MACROMOLECULES,
lel cylindrical aggregates (see Fig. 16b)—a higher-density STRUCTURE • PRECIPITATION REACTIONS • SILICONE
P1: GPA/GJK P2: GPJFinal Pages
Encyclopedia of Physical Science and Technology EN09J-437 July 19, 2001 22:41
Micelles 677
(SILOXANE) SURFACTANTS • SURFACTANTS, INDUSTRIAL Klafter, J., and Drake, J. M. (1989). “Molecular Dynamics in Restricted
APPLICATIONS • SURFACE CHEMISTRY Geometries,” Wiley, New York and Chichester.
Lindman, B., and Wennerström, H. (1980). Top. Curr. Chem. 87,
1–83.
Manne, S., and Gaub, H. E. (1995). Science 270, 1480–1482.
BIBLIOGRAPHY Manne, S., and Warr, G. G. (1999). In “Supramolecular Structure in
Confined Geometries” (S. Manne and G. G. Warr, eds.), pp. 2–23.
Beck, J. S., et al. (1992). J. Am. Chem. Soc. 114, 10834–10843. American Chemical Society, Washington DC.
Fendler, J. H. (1982). “Membrane Mimetic Chemistry,” Wiley, New Menger, F. M. (1977). In “Bioorganic Chemistry III. Macro- and Multi-
York. molecular Systems” (E. E. van Tamelen, ed.), pp. 137–152. Academic
Fendler, J. H., and Fendler, E. J. (1975). “Catalysis in Micellar and Press, New York.
Macromolecular Systems,” Academic Press, New York. Tanford, C. (1980). “The Hydrophobic Effect: Formation of Micelles
Gaudin, A. M., and Fuerstenau, D. W. (1955). Trans. AIME 202, 958– and Biological Membranes,” Wiley, New York.
962. Wennerström, H., and Lindman, B. (1979). Phys. Rep. 52, 1–86.
P1: GLQ/LSK P2: GPJ Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
Physical Chemistry
Douglas J. Henderson Charles T. Rettner
Brigham Young University IBM Almaden Research Center
I. Classical Mechanics
II. Quantum Mechanics
III. Statistical Thermodynamics
IV. Kinetics and Dynamics
59
P1: GLQ/LSK P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
Because of this, nuclei are slow moving, virtually mo- B. Wave Equation
tionless, on the timescale of electronic motions. For many
In the earliest formulation of quantum mechanics, classi-
chemical phenomena, the nuclei can be regarded as fixed
cal mechanics was assumed valid with the exception that
in space. Only the electronic dynamics need be consid-
some periodic variables were quantized (i.e., had discrete
ered. This simplification is called the Born–Oppenheimer
values). Their values could be determined by integrating
approximation.
the pi over their orbits in phase space, according to
At first it was thought that the electronic motions could
be described by classical mechanics. However, it is im- pi dqi = n i h, (7)
possible to describe the microscopic world by classical
mechanics. It became apparent, from for example the pho- where n i is an integer called a quantum number. The inte-
toelectric effect, that electromagnetic radiation was not al- gral in Eq. (7) over a closed path is called a phase integral.
ways wavelike, but, in some circumstances, consisted of However, as the implications of the duality of matter
discrete, particle-like, units of magnitude called quanta, and energy and the uncertainty principle were accepted,
it became apparent that one could refer only to the proba-
E = hν, (3) bility of finding the system in some configuration. Just as
the wave nature of radiation meant that there was a wave
where ν is the frequency of the radiation and h =
equation for radiation, the wave nature of matter implied
6.626 ×10−34 J sec is Planck’s constant. Conversely, it
the existence of a new wave equation. This wave equa-
became apparent, through for example the diffraction of
tion, called the Schrödinger equation, is formulated as
electrons, that matter was not always particle-like but, in
an eigenvalue equation (eigen ≡ characteristic or proper)
some circumstances consisted of waves of wavelength
where the Hamiltonian operator “operates” on the prob-
λ = h/ p. (4) ability function or wave function or eigenfunction, ψ, to
give the energy eigenvalue, E, times ψ. Thus, the wave
Such waves can be used to probe the structure of crystal equation is
surfaces, through low-energy electron diffraction (LEED) ψ = Eψ (8)
or atomic beam diffraction. The latter is usually confined
to He atoms, but even Ar atom diffraction can be discerned The wave function has the property that |ψ|2 gives the
in favorable cases. probability of the system having the eigenstate whose en-
In other words, there is a duality of matter and energy. ergy is E. The Hamiltonian operator is formed by re-
Whether the particle-like or the wavelike character of mat- placing p j in the classical Hamiltonian by √ the opera-
ter/energy is dominant depends on the experiment. In fact, tor −(h/i)(∂/∂q j ), where h = h/2π and i = − 1. The
the experiment itself interacts with the matter/energy and q j remain unchanged. Interestingly, the earlier phase in-
defines some aspect of the system at the cost of indefinite- tegral formulation [Eq. (7)] becomes the approximate
ness of some other aspect. This uncertainty principle was Wentzel–Kramers–Brillouin (WKB) method of solution
made precise by Heisenberg who showed that even under of Schrödinger’s equation and remains useful in many
the most ideal circumstances problems in the sense that differences between quantized
values of a phase integral are integral multiples of h except
pi qi = h/4π. (5) that there may be a zero point value of the phase integral
given by a fractional value of h.
If the experiment defines the particle character of the sys-
tem, the uncertainty of the positions, qi , is small and the
C. Hydrogen-Like Atom; Electronic Transitions
uncertainty in momenta, pi , or frequency is large. How-
ever, if the experiment defines the wave character of the One of the first systems to which quantum mechan-
system, the reverse is true. The momenta pi and positions ics was applied was the hydrogen-like atom consisting
qi are called conjugate variables. Energy and time are also of a single electron orbiting a nucleus of charge Z e0 .
conjugate variables so that The energy eigenvalues or levels are obtained by solving
Schrödinger’s equation and are given by
Et = h/4π. (6)
2π 2 m 0 Z 2 e04 hc R Z 2
E =− = − , (9)
Classical mechanics, where there is no uncertainty, is a n2h2 n2
limiting case in which the magnitudes of the variables are where e0 is the charge of an electron, n is an integer, c is
large compared to h. As a result, classical mechanics is the velocity of light, and R is called Rydberg’s constant.
appropriate for large macroscopic bodies. Strictly speaking we should not use the electronic mass m 0
P1: GLQ/LSK P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
is an active field of research. The potential energy terms and they are termed doublet states. The former three have
in the wave equation for the molecule involve not only zero spin, a multiplicity of 1, and are termed singlet states,
the Coulomb repulsions between the electrons and the while states with a multiplicity of 3 and 4 are termed
Coulomb attractions of the electrons and nuclei but also triplet and quartet states, respectively. Thus, the ground
the Coulomb repulsion between the nuclei. electronic state of NO is written as 2 1/2,3/2 , where the 2
Given the repulsion between the nuclei, we are inclined refers to the doublet nature of the state, the to = 1, and
to ask how atoms form a chemical bond in a molecule. the 12 and 32 refer to values of which arise from the two
A simple answer can be obtained by considering the one possible ways that the spin and orbital angular momentum
electron hydrogen molecule ion H2+ . The wave functions can add.
of each H atom separately are ψls (A) and ψls (B). An ap- In addition to electronic energy states, molecules posses
proximation to the H2+ wave function can be obtained by both rotational and vibrational energy levels. Assuming a
forming a molecular orbital from a linear combination of fixed distance between two atoms (rigid rotor approxi-
these two wave functions mation), the Schrödinger equation yields for the allowed
rotational energy levels of a diatomic molecule
ψb = ψls (A) + ψls (B)
h2
or Er ≈ J (J + 1), (12)
8π 2 I
ψa = ψls (A) − ψls (B). (11) where I is the moment of inertia of the molecule, J the
In the first (bonding) orbital, the electron is concentrated rotational quantum number and the quantity (h 2 /8π 2 I ) is
between the nuclei, and is simultaneously attracted by both termed the rotational constant for that particular electronic
nuclei resulting in a lower electronic energy which more state of the molecule (usually given the symbol B).
than offsets the repulsion of the nuclei. In other words Vibrational energy levels can be estimated by inserting
when the electron is between the nuclei it acts as a cement the Hooke’s law potential energy, U (r ) = 0.5k(r − re )2 ,
holding them together. There is zero probability of finding in the Schrödinger equation (harmonic oscillator approx-
the electron between the nuclei in the second (antibonding) imation). This yields eigenvalues, E v , for the permissible
orbital. As a result, a chemical bond is not formed by this energy levels, of
orbital.
E v = (h/2π ) k/µ v + 12 = hν0 v + 12 , (13)
where µ is the reduced mass of the system [µ =
F. Molecular Systems; Energy Levels m 1 m 2 /(m 1 + m 2 )] and ν0 is known as the fundamental
When atoms combine to form molecules, the individ- vibrational frequency. The smallest amount of vibrational
ual atomic energy levels give rise to discrete electronic energy a molecule can possess is thus ν0 /2, termed the
energy levels or states of the molecule. The number of zero-point energy. Rather than the simple Hooke’s law
these molecular electronic states far exceeds those of potential we may consider more realistic molecular po-
the individual atoms because of the many different ways tential energy curves such as a Morse potential given by
in which the atomic states can be combined. Electronic
U (r ) = De {1 − exp[−β(r − re )]}2 , (14)
states of molecules are classified in terms of several
molecular quantum numbers in a manner analogous to where De is the dissociation energy of the molecule and β
atomic electronic states. For a diatomic molecule these is related to De and ν0 . This leads to a similar expression
include the electronic orbital angular momentum, l, its for E v , but with an additional quadratic term in (v + 12 ),
component along the internuclear axis, λ, and the corre- which is negligible for low vibrational energies. Similarly,
sponding quantities for the molecule as a whole, L and an accurate treatment of molecular rotation leads to addi-
. Just as l = 0, 1, 2, . . . gives rise to s, p, d, . . . elec- tional terms in higher powers of the quantity {J (J + 1)}.
tron states and L = 0, 1, 2 . . . gives S, P, D, . . . atomic
states, so λ = 0, 1, 2 . . . yield σ, π, δ . . . electron states
G. Spectroscopy
and = 0, 1, 2 . . . correspond to , , . . . molecu-
lar states. The component of the total electronic angu- Atoms and molecules can adsorb and emit radiation
lar momentum along the internuclear axis, , is also of to change their internal energy states. The electronic
importance. transitions of the hydrogen-like atoms have already been
The ground electronic states of H2 , O2 , and N2 are mentioned. The quantization of the energy levels restricts
states, while those of OH and NO are states. These latter the possible wavelengths of the radiation to discrete
two molecules have open valence shells with net spin of 12 , spectral lines. Only certain transitions are allowed and
so that the multiplicity, S, of these states is 2 (S = 2S + 1), these are given by separate selection rules for electronic,
P1: GLQ/LSK P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
vibrational, and rotational transitions. For example, in radiation, while nuclear magnetic resonance (NMR) is a
atomic transitions l can only take values of ±1. In add- radio-frequency technique.
ition, each allowed transition has an associated transi- 5. Mössbauer spectroscopy, which involves the reso-
tion linestrength. In electronic spectroscopy involving nant absorption of a γ -ray photon by a nucleus. The res-
transitions between electronic states, these are termed onant condition is achieved via the Doppler effect, by
transition moments, Franck–Condon factors and Hönl– sweeping the velocity of a sample relative to the source.
London factors for electronic, vibrational, and rotational The chemical environment of the nucleus causes charac-
transitions, respectively. teristic frequency shifts.
The spectrum of absorption lines associated with a given
species can be used as a fingerprint for identification and
quantification in the process of absorption spectroscopy. III. STATISTICAL THERMODYNAMICS
An absorption spectrum is usually recorded with an ab-
sorption spectrometer or, more recently, with a laser. Sim- A. First and Second Laws of
ilarly, emission spectroscopy is concerned with the radia- Thermodynamics; Entropy
tion that is emitted by an excited atom or molecule. Here
The first law of thermodynamics states the conservation
radiation is generally spectrally resolved using a spectro-
of energy,
graph. In the various spectral regions, these techniques
may be referred to as vacuum ultraviolet (VUV), ultravi- δ Q = dU + δW, (15)
olet (UV), infrared (IR), and microwave spectroscopies.
Microwave photons have energies of less than 1 meV and where δ Q is the heat absorbed by the system, dU is the
are associated with rotational transitions, while infrared change in internal energy of the system, and δW is the
photons have energies of up to 1 eV and are involved with work done by the system. The second law of thermody-
vibrational transitions. More energetic radiation from the namics states that heat cannot pass from a cold reservoir
visible to VUV is usually associated with electronic tran- to a hot reservoir without the application of work.
sitions. In addition to identifying species, these and re- The change in entropy, d S, is just δ Q/T , where T
lated spectroscopic techniques are frequently employed is the temperature. The factor 1/T is an integrating factor
to obtain detailed structural information, such as bond that transforms δ Q into an exact differential just as 1/v 2
lengths and bond angles, and to study the flow of energy transforms vdu − udv into the exact differential d(u/v).
in chemical reactions. Because the change in entropy, d S = δ Q/T , is an exact
Other spectroscopic techniques include: differential, the change in entropy in a reversible cyclic
process is zero. The entropy of a thermodynamic state is
a well-defined single-valued function and the entropy is
1. Raman spectroscopy, which concerns the spectral
said to be a state function. An equivalent statement of the
analysis of radiation scattered by an atom or molecule.
second law of thermodynamics is
Recent developments are coherent anti-Stokes Raman
spectroscopy (CARS) and surface enhanced Raman spec- S ≥ 0, (16)
troscopy (SERS), both of which are sensitive laser-based
techniques, and examples of laser spectroscopy. where the change in entropy is zero for a reversible cyclic
2. Photoelectron spectroscopy, which is based on ki- process. The entropy increases in an irreversible process.
netic energy analysis of electrons ejected from an atom
or molecule by an enegetic photon and provides informa- B. Free Energy; Experimental Measurements
tion on the binding energies or ionization potentials of
the ejected electrons. Recent developments include X-ray The first and second laws of thermodynamics can be
photoelectron spectroscopy (XPS) of surfaces, and the use combined to give
of lasers as radiation sources. T d S = d E + δW. (17)
3. Auger electron spectroscopy (AES), which involves
the electron impact ionization of an atom to give an ex- If the only work done by the system results from an expan-
cited electronic state that decays by emission of a second sion d V or a change in the amount d Ni of the constituents,
electron whose energy is characteristic of the atom. This then Eq. (14) becomes
is most commonly used in surface analysis.
m
4. Spin resonance spectroscopy, which concerns the ap- T d S = dU + pd V − µi d Ni , (18)
plication of a magnetic field to split energy states associ- i=1
ated with electron or nuclear spin orientations. Electron where p is the pressure, V the volume, µi the chemical po-
spin resonance (ESR) involves absorption of microwave tential of constituent i, and Ni the amount or concentration
P1: GLQ/LSK P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
of constituent i. The chemical potential of a gas is the value the so-called gas constant (R) divided by Avogadro’s num-
of the Gibbs function for one mole. In a purely mechanical ber, N A = 6.022 × 1023 mol−1 , the number of molecules
system, equilibrium is achieved when the energy is a min- in a mole.
imum. In a thermodynamic system, entropy changes as The thermodynamic properties are related to the energy
well as energy changes must be considered. At constant levels E i through the partition function Z defined by
temperature, volume, and concentration, the Helmholtz
Z = e−β A = exp(−β E i ), (23)
free energy, i
A = U − T S, (19) where the sum is over all the energy levels of the system.
is a minimum at equilibrium, whereas at constant pressure Thus, to obtain the thermodynamics of a system, all that
and temperature the Gibbs free energy is required is that Schrödinger’s equation for the system
be solved and the partition function summed. For most
m
systems this is a difficult task, often impossible without
G = A + pV = µi Ni (20)
i=1
some approximations.
There is also a relation between the entropy and the
is a minimum at equilibrium.
microscopic configurations of the system. The entropy is
For a dilute gas, where the perfect gas law ( pV = n RT )
proportional to the logarithm of the number of accessible
applies, the value of µ per mole at a pressure p is
states, , of the system. Thus,
µ( p) = µ◦ + RT ln( p/atm), (21)
S = k ln . (24)
where µ◦ is the value of µ at 1 atmosphere, which is the
Equation (24) is called Boltzmann’s relation. At absolute
pressure at which the standard state is established. The
zero, the system is in its ground state, and the number
value of µ◦ is often taken as zero for the elements. In Eqs.
of accessible states is unity. Thus, the entropy of a system
(20) and (21) R( = 8.3144 J mol−1 K−1 ) is the so-called
tends to zero as the temperature goes to zero. This is called
gas constant. A mole of any substances is the amount with
the third law of thermodynamics.
a mass in grams equal to its molecular weight in atomic
The Boltzmann relation provides a statistical interpre-
mass units, so that a mole of molecular hydrogen has a
tation of the entropy. The greater the number of accessible
mass of 0.002 kg. The value of G or µ can be determined
states, the less our knowledge of the system and the more
for some pressure p by measuring the volume of the gas
randomness or disorder in the system. This entropy is a
as a function of pressure up to p and integrating. Thus,
measure of disorder. The tendency of the entropy to in-
1 p crease reflects the tendency of thermodynamic systems to
µ( p) = µ◦ + V ( p)d p, (22)
n p0 increase in disorder just as an initially ordered deck of
cards increases in disorder during a game of cards.
where p0 is the pressure of the standard state (usually 1
If the system is classical, the energy levels merge into
atm). The free energy of a condensed phase can be re-
a continuum and an important simplification results. The
lated to that of a dilute gas through the vapor pressure,
sum in the partition function becomes an integral. More-
the pressure of the gas in equilibrium with the condensed
over, if the kinetic energy degrees of freedom (i.e., the
phase. Once the free energy of the condensed phase has
momenta) are independent of the potential energy or in-
been established, values for other states can be obtained
ternal degrees of freedom (i.e., the generalized positions)
by measuring pressure or energy through a sequence of
then the momenta can be integrated immediately. For
states leading to the desired state.
the particular case in which there is only translational
motion.
C. Statistical Mechanics; Partition Function
λ− 3N
Z= exp(−β) dr1 · · · dr N , (25)
The thermodynamic properties of a system result from N!
the dynamics of its molecules. Since even a three-body
where λ = h/(2π mkT )1/2 , N is the number of molecules
system is difficult, statistical methods must be employed
in the system, and = (r1 , . . . , r N ) is the potential en-
to treat the large number of molecules in a thermodynamic
ergy. The factor N ! is required because states that differ
system.
only by an interchange of molecules are not distinguish-
The fundamental result in statistical mechanics is the
able.
fact that the probability of a system occupying the en-
From Eq. (20), it follows that the average kinetic energy
ergy level E i is proportional to the Boltzmann factor,
of the system is
exp(−β E i ), where β = 1/kT and k = 1.3804 ×10−23 J
K−1 is the Boltzmann constant. The Boltzmann constant is KE = 32 N kT (26)
P1: GLQ/LSK P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
In other words, there is a statistical relation between the computer simulations show that it will freeze and exist as
temperature and the average motion of the molecules. The a close-packed solid.
greater the temperature, the more rapidly the molecules
move and the greater their kinetic energy.
E. Liquids; van der Waals Theory; Critical
The problem of predicting the thermodynamic proper-
Point; Renormalization Group
ties of such a classical system becomes the problem of
evaluation of the configuration integral, the integral over In contrast to a gas, a liquid need not fill space but can exist
exp(−β). This is still a difficult task. In general, it can in equilibrium with its vapor with a surface separating the
be done only through computer simulations (Monte Carlo liquid and vapor. The pressure at which the equilibrium
and molecular dynamics methods). However, there are a occurs is called the vapor pressure. Below the vapor pres-
few simple approximations which are helpful. sure, liquid will evaporate until equilibrium is reached. For
pressures greater than the vapor pressure, there is no inter-
face between liquid and vapor. The liquid fills the container
D. Perfect and Imperfect Gases
and there is no clear distinction between liquid and gas.
The simplest system is the perfect gas in which the The liquid under pressure can be heated at constant volume
molecules do not interact, i.e., = 0. Thus, the config- to a temperature greater than the critical temperature (the
uration integral is just the volume raised to the power N . highest temperature at which liquid–vapor coexistence can
Using Stirling’s approximation, N ! = (N /e) N , occur), then allowed to expand and cool to the original
temperature and pressure without any transition from liq-
Z = λ− 3N (eV /N ) N (27)
uid to gas being observed. A continuity of states between
and liquid and gas is said to exist. This is illustrated in Fig. 2.
The liquid–gas phase can be referred to by the single
pV = N kT. (28)
term fluid. Thus, a theory of the liquid state is of necessity
If the molecules interact, then the problem is more com- also a theory of an imperfect gas. The earliest theory of
plex. The gas is called imperfect because there are devia- the liquid state is that of van der Waals. Although more
tions from the perfect gas result. These deviations can be than a century old, with slight modifications it is viable
written as a power series in the density, ρ = N /V , called a today. The idea of van der Waals was that a liquid behaved
virial series. For example, if the molecules are hard spheres as a hard sphere gas except that the pressure must include
such that the molecules collide elastically but exert no at- the internal pressure due to the attractive forces of the
tractive forces on each other, then molecules in the liquid. It is reasonable to assume that the
contribution of the internal pressure to the free energy is
βp /ρ = 1 + ρb + 58 (ρb)2 + · · · . (29)
proportional to the density. Thus,
For hard spheres, the coefficients of ρ , called virial co-
n
p = p0 − ρ 2 a , (30)
where p0 is the pressure of the hard-sphere gas and a is a
constant depending on the nature of the attractive forces.
In its original formulation, the van der Waals theory was
only qualitatively successful because van der Waals ap-
proximated p0 by the perfect gas expression with a re-
duced free volume, i.e.,
βp0 1
= . (31)
ρ 1 − ρb
This expression gives only the second hardsphere virial co-
efficient correctly and seriously overestimates p0 . Much
more satisfactory results can be obtained from the approx-
imation
βp0 1
= (32)
ρ (1 − η)4
where η = 14 ρb. The van der Waals theory predicts that
the equation of state of a liquid can be expressed in
a universal form if the following reduced variables,
T ∗ = bkT /a, p ∗ = b2 p/a, and ρ ∗ = ρb, are used. This is
called the law of corresponding states. As is illustrated in
Fig. 3, the theory also predicts that below the critical tem-
perature there is a first-order phase transition between the FIGURE 3 Typical pressure isotherms as a function of the vol-
liquid and vapor accompanied by a discontinuous change ume V in the van der Waals theory. The shaded region is not
in the density ρ. At the critical temperature the transition thermodynamically stable. Here Tc is the critical temperature.
becomes second order since the liquid and vapor have be-
come identical. For temperatures above the critical tem-
perature, there is no phase transition. In the van der Waals reaching is the renormalization group approach in which
theory, the critical point occurs when a group of successive transformations is applied to the
2 liquid, yielding ultimately a renormalized system in which
∂p ∂ p only the long-range correlations typical of the critical point
= = 0, (33)
∂ρ T ∂ρ 2 T remain. In this system the critical point properties can be
i.e., the pressure isotherms have a point of inflection at the examined.
critical point.
Modern theories show that the van der Waals theory F. Mixtures
is a first approximation to a systematic approach, called
perturbation theory, in which the pressure is obtained as a Mixtures of two gases or liquids can be treated by the
power series in 1/T . same techniques as liquids. The analog of a perfect gas is
In the van der Waals theory, the first two derivatives the ideal mixture, where molecules of the components are
of p at constant T with respect to the density vanish at very similar so that the partition function can be written
the critical point. This is not just a prediction of the van (for the two-component case) as
der Waals theory. Any theory in which the equation of N!
state is analytic at the critical point will yield this result. Z= Z1 Z2
N1 !N2 !
By analytic, it is meant that the pressure can be expanded
as a power series about the critical point. Experimentally, where N = N1 + N2 and Ni is the number of molecules
the equation of state is not analytic at the critical point. of species i. From this, it can be deduced that the par-
The exponents in an expansion near the critical point are tial pressures of the components are proportional to their
generally not integers. At least one, and possibly two, more concentrations. This result is known as Raoult’s law. The
derivatives of p with respect to the density at constant factor N !/N1 !N2 ! gives rise to the entropy of mixing
T vanish near the critical point. There has been a great S = −N k[x1 ln x1 + x2 ln x2 ], (34)
deal of work on the fascinating properties of the equation
of state in the vicinity of the critical point. The most far where xi = Ni /N .
P1: GLQ/LSK P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
For a nonideal mixture, the original approach of van der produce a diffraction pattern, which can be analyzed to re-
Waals is still useful. In this approach veal the crystal structure of the sample. All crystal lattices
can be classified into 14 Bravais lattices belonging to seven
N!
Z= Z (35) systems. For example, the simple cubic, face-centered cu-
N1 !N2 ! bic and body-centered cubic lattices are the 3 lattices of the
where Z is obtained from the van der Waals equation cubic system. Cubic and hexagonal close-packed struc-
of state of a liquid, preferably with the more satisfac- tures have the structure of tightly packed spheres where
tory expression for p0 , with a and b replaced by the each sphere touches 12 neighbors, 6 in the same plane and
concentration-dependent quantities 3 above and 3 below. These two close-packed structures
differ in the placement of successive planes or layers. For
a = xi x j ai j (36) the cubic case, a third layer is laid down to reproduce
ij
the first layer, so that the structure could be represented
and by ABABAB. . . . For hexagonal close packing, the third
layer is again displaced, corresponding to ABCABC. . . .
b = xi x j bi j (37) No theory of freezing exists. That is, there is no par-
ij tition function that encompasses both the solid and fluid
√ phases. However, separate theories of solids and fluids can
with ai j aii a j j taken as a parameter. Assuming that
the spherical cores of the molecules do not overlap, then be developed and their solid–fluid coexistence examined.
To that extent theories of melting or freezing exist. Since
1/3 1/3 3
bii + b j j freezing can occur in the hard-sphere system, no critical
bi j = . (38) point is expected for freezing. This transition is expected
2
to be first order at all temperatures, as illustrated in Fig. 1.
This approach is satisfactory for mixtures of nonelec- If a solid were classical, the heat capacity would be 3N k.
trolytes. The situation is a little more complex for mixtures This is indeed the case at high temperatures and is called
of electrolytes because of the long range of the Coulomb the law of Dulong and Petit. However, the experimental
potential. However, each ion in the mixture tends to be heat capacity goes to zero at low temperatures. This can be
surrounded by ions of opposite charge, which causes the explained by regarding the solid as a collection of quan-
potential to decay exponentially with a decay factor, κ, tized oscillators. The only difficulty is to determine the
the Debye parameter, which is proportional to the square spectrum of frequencies of the oscillators. For many pur-
root of the product of the density and T −1 . As a result, an poses, the solid can be regarded as an elastic continuum.
appropriate expansion parameter for electrolytes is κ, or The result is the Debye theory. If something more sophis-
T −1/2 , which is different from nonelectrolytes, where T −1 ticated is needed one must solve for the normal modes of
is the expansion parameter. These ideas become quantita- the crystal, i.e., the method of lattice dynamics.
tive in the Debye–Hückel theory. The conduction of electricity in a metal is due to the
presence of free or quasi-free electrons in the metal. Clas-
sically, free electrons would contribute 3nk/2 to the heat
G. Solids
capacity, n being the number of free electrons. However,
In contrast to the disorder of gases and liquids, there is experimental evidence indicates that the electrons do not
translational order in crystals. Disordered or amorphous contribute significantly to the heat capacity of a metal.
solids (i.e., glasses) exist which lack this order. However, The reason for this is the exclusion principle. Although
they are really highly viscous liquids. This translational the electronic gas is in its ground state, because of the ex-
order is such that the entire structure, or lattice, can be clusion principle the electrons can each occupy one energy
generated by repeated replication of a small regular figure, level. The electrons occupy the levels up to a maximum en-
termed the unit cell. The planes of any crystalline structure ergy, called the Fermi energy, εF . Only the small number
can be specified using Miller indices, which also serve to of electrons with energies near εF can be thermally ex-
identify single crystal faces. Miller indices are obtained cited and, as a result, the electronic heat capacity is small.
by determining the intercepts of the plane with the unit If the exclusion principle is taken into account, treating
cell axes in terms of the length of the cell in that direction, the conduction electrons as free describes many of the
taking the reciprocal, and normalizing so the indices are electronic properties of a metal. To treat metals in a more
all integers. sophisticated manner and to account for semiconductors,
The ordered structure, or lattice, of a solid can be deter- the structure of the solid must be included. If this is done,
mined by X-ray or neutron diffraction studies, in which a the electrons are not free but are restricted to bands of
beam of X-rays of neutrons is scattered from the sample to energy.
P1: GLQ/LSK P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
FIGURE 4 Image of a single atom of xenon sitting on the surface IV. KINETICS AND DYNAMICS
of a platinum single crystal at 4 K, obtained with a scanning tunnel-
ing microscope. Xenon has an atomic radius of ∼1.2 × 10−10 m.
Picture provided by Don Eigler of the Almaden Research The previous sections have dealt only with the equilib-
Center. rium properties of a system of molecules. Such properties
tell us nothing of the time required for equilibrium to be
reached or about the dynamical properties of these sys-
H. Interfaces tems. The rate at which change occurs is the province of
kinetics. The detailed manner in which chemical forces act
The study of interfaces is becoming an increasingly impor- to bring about atomic and molecular motion is the province
tant area of physical chemistry. Of particular interest are of chemical dynamics. This section deals with the motions
gas–solid and gas–liquid interfaces. Both of these can now of atoms and molecules and the processes associated with
be imaged directly with scanning tunneling microscopy chemical change.
(STM). Here the contours of a surface may be determined
by sensing the small current that tunnels across a vacuum
gap to an atomically sharp tip as it is scanned across it. A. Kinetic Theory of Gases
Figure 4 illustrates the remarkable resolution of this in-
The kinetic theory of gases assumes that molecules have
strument. It shows a single atom of xenon sitting on the
negligible size compared to their separation, are in contin-
surface of a platinum crystal at 4 K. For the case of the
uous random motion, and interact only via elastic scatter-
liquid–solid interface, STM can be used to study biolog-
ing. These postulates permit the calculation of molecular
ical samples or the electrodes of an electrochemical cell.
speed and velocity distributions. The probability that a
Other important interfaces are those between a liquid and
molecule has a speed between v and v + dv is found to be
its vapor and between two imiscible liquids.
Consider, for example, the physical adsorption of a gas d F(v) = 4π (m/2π kT )3/2 v 2 exp(−mv 2 /2kT )dv, (40)
by a solid. If the solid is regarded as a giant sphere, the
adsorption of the gas can be regarded as the interaction of where T is the gas temperature and m the molecular mass.
a gas with a single infinitely large molecule dissolved in This is the Maxwell distribution of molecular speeds.
that gas. If the simplest form of the van der Waals theory Figure 5 displays this distribution for nitrogen gas at 25
of mixtures is applied to that system, then the adsorption and 500◦ C. Notice that the velocities are in the range of
isotherm is just hundreds to thousands of meters per second, which are
typical of those for small molecules at ambient tempera-
a12 tures. Recent experiments using light pressure from lasers
= ρβ , (39)
β(∂ p /∂ρ)T to slow down atoms have resulted in atoms moving with
velocities comparable to walking speed (1 m/s) and be-
where a12 is a constant. At low densities β(∂ p /∂ρ)T = 1
low. Such slow species are ideal for spectroscopic stud-
and the adsorption is proportional to the density (Henry’s
ies, since their adsorption spectra are not blurred by the
law). However, at higher densities β∂ p /∂ρ is a function of
Doppler effect due to their motion.
the density, and deviations from Henry’s law are observed.
The Maxwell distribution of molecular speeds permits
Especially interesting is the region near the critical point
the evaluation of such important quantities as the pressure
of the gas where (∂ p /∂ρ)T → 0 and singularities in the
p exerted by a dilute gas and the collision frequency Z in
adsorption are observed.
the gas under given conditions. The pressure is then given
Interfaces between dissimilar materials may also be-
by
come electrically polarized, with a separation of charge
occurring at the interface. When a metal is placed into p = 13 ρmvrms
2
, (41)
P1: GLQ/LSK P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
B. Transport Properties
The kinetic motion of molecules may cause them to
change their spatial distribution through successive ran-
dom movements. This is the process of diffusion, which
is a transport property. Other transport properties include
viscosity, electrical conductivity, and thermal conductiv-
ity. While diffusion is concerned with the transport of mat-
ter, these are associated with the transport of momentum,
FIGURE 5 Maxwell distributions of speeds for molecular nitrogen
at 25◦ C (298 K) and 500◦ C (773 K). Arrows indicate v̄ and vrms for
electrical charge, and heat energy, respectively. Transport
each case. The most probable velocity has been arbitrarily scaled is driven in each case by a gradient in the respective prop-
to unity in each case. erty. Thus, the diffusion rate of species A is given by Fick’s
law,
while the collision frequency for a one-component gas is Jz (A) = −D[dρ(A)/dz] (45)
given by
√ where Jz (A) is the net flux of A molecules crossing unit
Z = 2σ v̄ρ, (42) area in the z direction and D is the diffusion coefficient;
simply kinetic theory leads to
where ρ is the number of molecules per unit volume and
σ is the collision cross section. The quantity vrms is the D ≈ 13 v̄λ (46)
average square speed and is related to the average kinetic
energy and thus to the temperature, so that Derivation of other transport properties follow from sim-
ilar relationships.
3kT
vrms = v 2 =
, (43) The viscosity coefficient or viscosity of a gas is given
m by
where m is the molecular mass. This quantity differs from
η ≈ 13 v̄ρmλ, (47)
the average speed,
while the thermal conductivity coefficient κ is given by
8kT
v̄ = v = .
πm κ ≈ 13 v̄ρλCv = ηCv /m, (48)
Equation (43) is equivalent to Eq. (26).
where Cv is the molar heat capacity of the gas at constant
Knowing the collision frequency and the molecular
volume. Notice that since λ is inversely proportional to ρ,
speed, it is possible to estimate the mean free path be-
both η and κ are independent of the gas density. This will
tween collisions, λ = v/Z , so that
√ be true so long as λ is small compared to the dimensions
λ = 1/ 2ρσ, (44) of the apparatus.
In solution, D is given by the Stokes–Einstein relation
which shows the expected behavior that λ must decrease which relates D to the viscosity coefficient of the solution,
as the diameter of the molecule increases or as the density η, and the effective hydrodynamic radius a, where
of the gas increases.
Helium and nitrogen have estimated self-collision cross D = kT /6π ηa (49)
sections of 0.13 and 0.31 nm2 and at a pressure of
1 Torr ( = 133.3 N m−2 = 1.32 × 10−3 atm) there are and by the Einstein–Smoluchowski relation:
about 3 × 1022 molecules m−3 , giving mean free paths D = d 2 /2τ, (50)
of ∼1.8 × 10−4 and 7.6 × 10−5 m for helium and nitro-
gen, respectively. At 25◦ C, these species have respective where τ is the characteristic time between jumps of dis-
velocities of v̄ = 1254 and 474 m/sec, giving collision fre- tance d.
quencies of 6.9 and 6.2 × 106 sec−1 per molecule. More elaborate theories of transport phenomena make
The kinetic theory of gases can also be applied to the use of the Boltzmann transport equation or computer sim-
free-electron gas to describe the transport properties of ulations.
P1: GLQ/LSK P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
C. Chemical Kinetics; Activated in a prior step. The species concerned is often a highly
Complex Theory reactive molecule with a vacancy in its outermost shell
of electrons, termed a free radical. Such processes are
The quantitative study of chemical reaction rates and the
termed chain reactions. Chain reactions are very important
factors on which these rates depend constitutes the field
in polymerization reactions, where a radical may add to
of chemical kinetics. Chemical reaction rates can be ex-
another reactant to form another (larger) radical. In cases
pressed in terms of the rate of production of any reaction
where more than one reagent species is formed as a product
product or as the rate of decrease in the concentration of
of a later step, the chain is said to be branched, and such
any reactant. The individual steps in a chemical reaction
branching chain reactions often lead to explosions. In other
sequence are termed elementary reactions. A number of
cases, explosions may occur as a result of a fast exothermic
consecutive elementary reactions may be responsible for a
reaction which yields a net excess of energy in the form of
given chemical change. If the observed rate is found to be
heat and in a time too short for the energy to be dissipated.
proportional to the concentration of a given reactant raised
The increase in temperature then causes an increase in
to some power, α then α is said to be the order of the re-
rate, and the cycle ends in a thermal explosion.
action with respect to that reagent. The sum of the orders
In some mechanisms a species may be consumed in one
over all reagents gives the overall order of the reaction. A
step of a reaction only to be regenerated in a subsequent
complex reaction, involving a number of elementary steps,
step. In cases where the presence of this species increases
may have a noninteger order. Thus the order should not be
the overall reaction rate, it is termed a catalyst, which is
confused with the molecularity of an elementary reaction,
defined as a species that increases the rate of a reaction
which is the number of reagent molecules involved in a
without being consumed or changing the reaction prod-
single reaction step. The sequence of elementary reactions
ucts. A catalyst must increase the rate of both forward and
by which a reaction proceeds is termed the reaction mech-
backward reactions in any system at equilibrium and can
anism, a term also used to describe the detailed process of
be thought of as lowering E 0 (see below).
bond breaking and formation in a single reactive collision.
An expression for bimolecular rate constants can be
By way of illustration, consider the formation of nitro-
obtained by observing that along a reaction coordinate
gen dioxide from nitric oxide and oxygen. This reaction
the energy surface consists of two wells, representing the
is found to be third order, corresponding to
reactants and products, separated by a saddle point, rep-
2NO + O2 = 2NO2 (51) resenting the maximum energy required to pass along the
minimum energy path between reactants and products. If
with a third-order rate law corresponding to
the height of this maximum, relative to the reactant well, is
d[NO2 ] E 0 then only collisions where the energy exceeds E 0
= keff [NO]2 [O2 ]. (52)
dt can lead to reaction. Integrating the Boltzmann distribu-
Here the constant keff is the reaction rate constant and tion of energies over all energies exceeding E 0 , shows
square brackets indicate concentrations. If concentrations that probability of a collision with energy in excess of
are given in moles per liter, the rate constants will have E 0 is proportional to exp(−βE 0 ). This is consistent
units of (mol/L)1−n sec−1 , where n is the order of the re- with the rate law of Arrhenius:
action. A likely mechanism for this process can be written k = A exp(−E 0 /kB T ), (54)
in terms of the elementary steps:
where A is known as the pre-exponential factor, and the
k1
NO + NO → N2 O2 (53a) Boltzmann constant is written as kB here to avoid con-
k−1 fusion; A can readily be estimated from collision theory,
N2 O2 → NO + NO (53b) using the expression for the collision frequency for one
k2 reagent with another, Z 12
N2 O2 + O2 → 2NO2 , (53c)
Z 12 = ρ1 ρ2 σ12 [8kB T /µπ ]1/2 (55)
which leads to the observed rate law if the first two steps
are assumed to come to equilibrium prior to the third reac- which leads to
tion, or if the steady-state approximation, which assumes
A = Pσ12 N A [8kB T /µπ ]1/2 , (56)
that the rate of change of all concentrations is zero, is in-
voked. Reaction (53a) has a molecularity of 2 and is a where NA is Avogadro’s number, which converts ρ to mo-
bimolecular reaction, while reaction (53b) is an example lar units, and P is the so-called steric factor, which ac-
of a unimolecular reaction, involving a single species. counts for the fact that not all collisions lead to reaction.
An important class of reaction mechanisms are those Alternatively, we can replace the product Pσ12 with σreac ,
in which a reaction product from one step is a reagent where σreact is termed the reactive cross section.
P1: GLQ/LSK P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
A more general treatment of detailed reaction rates is sis experiments, including discharge lamps, flash lamps,
available in the activated complex theory of Eyring, which and synchrotrons. However, most recent studies have con-
assumes that there is an intermediate state between the re- cerned laser photolysis. The photofragments are detected,
actants and the products, called the activated complex or for example, by emission or laser spectroscopy, which
transition state which can be regarded as at least some- provides information on the velocity and quantum-state
what stable and which is in thermodynamic equilibrium distribution of the fragments, with respect to rotational,
with the reactants, thus permitting thermodynamics to be vibrational, and electronic states. It is even possible with
applied. Instead of an energy, we must use the free energy the aid of femtosecond lasers to follow the photofragmen-
G (because the pressure is constant) in the exponential. tation process in real time. Such measurements can pro-
This treatment yields vide information on the shape of the excited state potential
energy surface. In the last decade, researchers have taken
kT G ‡0 these ideas one step further to use the coherent nature of
k=κ exp −
h RT laser light to control the outcome of a photochemical reac-
tion such as photodissociation. In this work, one or more
kT S0‡ H0‡
=κ exp − exp − , (57) pulse of laser energy is used to drive a reaction to a de-
h R RT sired outcome, opening up exciting possibilities for new
where R is the gas constant, κ is the transmission coef- methods of chemical synthesis.
ficient and G ‡0 S0‡ , and H0‡ refer to differences be- Bimolecular reactions are often studies by firing col-
tween the activated complex and the reactants. limated streams of reagents at each other in the form
Extensions of this statistical thermodynamical ap- of crossed molecular beams. The scattered reagents and
proach to estimating reaction rates include the RRK and products can be detected by a rotatable mass spectrome-
RRK M theories of unimolecular decay rates, and the ter in order to measure angular distributions. Such experi-
information theoretic formulation of reaction dynamics. ments have shown that many reactions occur in essentially
These theories are remarkably successful, although gen- a single encounter in a direct mechanism, while others pro-
erally more successful at interpreting experimental data ceed through a long-lived complex mechanism. In other
and correlating results than at deriving results a priori. experiments, spontaneous light emission from the unre-
laxed, or nascent, products, termed chemiluminescence,
has been analyzed to yield quantum-state distributions.
D. Reaction Dynamics; Inelastic Collisions
Lasers are often used to probe internal states of products,
Kinetic measurements and knowledge of reaction prod- for example, by inducing emission as in laser-induced flu-
ucts and yields can provide only rather limited insight into orescence (LIF) detection, and to prepare molecules in
the molecular dynamics of chemical reactions. To under- specific states and with chosen orientations. Vibrational
stand the detailed manner in which atoms and molecules energy is often found to be more efficacious in promoting
move together and come apart in the process of a chemical reaction than is translational or rotational energy, since it is
reaction, it is necessary to study the isolated elementary re- more strongly coupled to the reaction coordinate, or path
actions in as much detail as possible. Such isolation is most in phase space along which reaction takes place. Product
often provided by a dilute gas environment. Ultimately, the distributions are frequently observed to be far from equi-
hope is to understand reaction dynamics in terms of elec- librium. For example, in direct reactions, high vibrational
tronic structure and to be able to calculate this for a cho- levels are often found to be more populated than low ones.
sen system. The electronic structure or potential energy This so-called population inversion forms the basis of the
surface is the meeting ground between theory and exper- chemical laser.
iment. Currently most studies seek to probe those factors, Reaction rate constants cannot be used to describe
or states, which influence the rate of chemical reactions, such detailed processes. Instead the differential reac-
such as vibration and translational energy, and to examine tion cross section, σreact n 1 , n 2 , n 3 , . . . n 1 , n 2 , n 3 , is em-
the manner in which energy and angular momentum are ployed, where n i are various quantum numbers and the
disposed among the product states for various processes. primed quantities refer to reaction products. Such cross
This is the area known as state-to-state chemistry. sections represent the effective collision area for reagents
Molecular photodissociation is an ideal process for such with given n 1 , n 2 , . . . , to give specific products. Rate con-
studies and has been examined in considerable detail. This stants represent the effective average of the product of the
unimolecular event is sometimes considered as a “half cross section with the approach velocity taken over the
collision,” where the absorption of a photon excites the calculated distribution of reagent quantum states.
system to a repulsive state that flies apart. A number of Cross sections can be predicted from semiclassical tra-
radiation sources have been employed for such photoly- jectory calculations, in which equations of motion are
P1: GLQ/LSK P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
solved by numerical integration, or they can be ob- often catalyzed, for example, by protons in acid catalysis
tained from quantum calculations via the time-dependent and by enzymes in many biological systems. Moreover,
Schrödinger equation. Both approaches require a previ- interactions with solvent molecules may grossly alter the
ously calculated potential energy surface. However, accu- potential energy surface on which reaction occurs, com-
rate potentials are currently available only for the H + H2 pared to the isolated gas-phase system. Such interactions
reaction and its isotopic analogues, for which precise are strongest for polar reagents and solvents.
quantum calculations can be made. For other systems ap- Reactions in solution are often diffusion controlled,
proximate surfaces can be obtained either semiempirically where the limiting step is the rate at which reagents can
(e.g., using the LEPS or BEBO functions), or from approx- find each other. In the absence of strong interactions such
imate a priori calculations. Statistical theories are also em- as those between ions, the rate constant may be esti-
ployed. These are based on the assumption that different mated from Fick’s law [Eq. (45)] together with the Stokes–
reaction channels are populated in proportion to the vol- Einstein relation [Eq. (49)] giving:
ume of phase space with which they are associated, which
k = 8RT /3η, (58)
is consistent with conservation of energy and linear and
angular momentum. Dynamical factors may cause devia- where R is the gas constant. Since reactants are also slow
tions from such statistical behavior, providing information to drift apart, the time-averaged collision frequency per
on the reaction mechanism. This is recognized in the in- molecule may be close to the gas-phase value, so that re-
formation theoretic approach where the fully statistical actions with rate constants much smaller than given by
outcome, or prior distribution, is compared with observa- Eq. (58) may be relatively insensitive to diffusion effects
tions in so-called surprisal plots which indicate the degree (in practice this means E 0 ≥ 40 kJ/mol). Reaction prod-
to which the data deviate from statistical behavior. This ucts may also be slow to move apart, thus in liquid-phase
approach has proven very valuable in the correlation and photodissociation, where the adsorption of light causes a
extension of a wide body of data. molecule to fall apart, the surrounding solvent cage may
Since reaction rates can depend not only on reagent hold the products together long enough for recombination
energy, but also on the form in which it is available, a to occur.
full understanding of chemical behavior requires knowl- If two reactions are in equilibrium with an equilibrium
edge of the manner in which energy in various forms is constant K , and the back reaction is held constant at the
redistributed by collisions. This information must be ob- diffusion limit, kd , then the forward rate constant will be
tained by studies of energy transfer associated with in- equal to K kd . More generally, it is often found that for a
elastic collisions. Experimental studies vary from kinetic given reaction involving a series of similar reagents,
measurements of decay rates, to full state-to-state stud-
ies. It is found that rotational energy is readily trans- k ∝ K α, (59)
ferred from one molecule to another, occurring on al- log k = b + α log K . (60)
most every collision. Transfer from rotation to transla-
tion can be 102 times slower, while transfer to vibration This is the Brønstead equation, and is an example of a
may be 104 times slower. Transfer between translation linear free-energy relationship, since log K ∝ G 0 and
and vibration occurs only about once in a million colli- log k ∝ G ‡0 , then Eq. (60) could be written as
sions at room temperature. In general, the rate of energy G ‡0 = b + αG 0 (61)
transfer decreases rapidly as the amount of energy trans-
ferred increases, following an approximate exponential Related to the Brønstead equation is the Hammett equa-
gap rule. tion, which expresses the rate constant k of one of a series
of related reactions in terms of a specific reference reaction
with rate k0 , giving
E. Reactions in Solution
log(k/k0 ) = ρσ, (62)
In principle, reactions in solution occur in a similar man-
ner to those in the gas phase and in some favorable cases where ρ is a characteristic of the type of reaction and σ is
the observed rate constant is the same in both phases. For a characteristic of the specific system. Expressions such
example, the unimolecular decomposition of N2 O5 yields as the Brønstead and Hammett equations are particularly
similar A and E 0 values in the gas phase and in a large useful since the complex nature of the environment makes
range of solvents. However, there are many important dif- absolute rate theories such as the activated complex theory
ferences. Reactions of ionic species and of large molecules difficult or impossible to apply in solution.
such as proteins and polymers are rare in gas-phase stud- The rate constant for a bimolecular reaction in solution
ies but are common in solution. Reactions in solution are can be expressed in terms of the activity coefficients of
P1: GLQ/LSK P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
the reagents, γ (A) and γ (B), and of the transition state, (see above). In this method, the STM current shows fea-
γ (AB)‡ , by tures associated with the vibration frequencies of chemical
bonds. Figure 6(A) shows an STM image of the molecule
k = k0 [γ (A)γ (B)/γ (AB)‡ ], (63) HC2 D on a copper surface. Figure 6(B) shows an image
where an activity coefficient is defined as the ratio of the of the CD bond of this molecule.
effective concentration to the actual concentration, as re- The fraction of a surface covered by a gas, , is given
quired to obey Raoult’s law. The Debye–Hückel limiting in terms of monolayers (ML), where 1.0 ML represents
law gives γ in terms of the ionic charges z of the species: complete coverage by a single layer. The pressure depen-
√ dence of at a given temperature is termed an adsorp-
log γ (i) = −sz i2 I , (64) tion isotherm. If the rate of adsorption is ka p(1 − ), and
the rate of desorption is kd , we obtain the Langmuir
where I is the ionic strength: isotherm:
1 i
=∞
=
Kp
(67)
I = xi z i2 , (65) 1 + Kp
2 i =0
where K = ka /kd . Thus, when K p is small, is simply
in which xi indicates molar concentration, giving:
proportional to the pressure. An adsorption dependence on
√
log k = log k0 + 2sz A z B I . (66) 1 − arises in the ideal case in which each molecule ad-
sorbs at and occupies a single surface site. If two adjacent
This indicates that the rate constant for reactions in solu- sites are required for adsorption, a (1 − )2 dependence
tion depends on the ionic strength of the solution, behavior might hold. The probability of adsorption for a single col-
referred to as the kinetic salt effect. Thus, addition of inert lision is termed the sticking probability, which usually
ions will increase the rate of reaction between ions of like implies chemisorption and may range from unity, for say
charge and vice versa. oxygen on a clean metal surface, to close to zero for an in-
Recent advances in theory√ have shown that log k ceases ert system. At low temperatures, even inert gases may trap
to be a linear function of I at large ionic strength. The into a physisorption state, in proportion to their trapping
reaction rate continues to increase with increasing ionic probability.
strength, but less rapidly than that predicted by the above With a sticking or trapping probability of unity, a mono-
relation. A careful examination of the experimental data layer will form in about 1 sec at 10−6 Torr, which means
confirms this prediction. that studies of clean surfaces must be carried out under
conditions of ultra high vacuum (UHV), or below ∼10−9
Torr.
F. Reactions at Surfaces; Heterogeneous The ability of surfaces to promote chemical reactions
Catalysis; Corrosion stems largely from their ability to cause dissociation.
Atoms and molecules frequently adsorb on surfaces, Consider the decomposition of ammonia to nitrogen and
where they may decompose and/or react with other ad- hydrogen:
sorbed species. Modern technology is increasingly depen- 2NH3 → N2 + 3H2 (68)
dent on surface chemistry which underlies many industri-
ally important processes as well as destructive processes In the gas phase, this process has an activation energy of
such as corrosion. It is useful to distinguish two types ∼330 kJ mol−1 , whereas in the presence of on a tung-
of adsorption: physical adsorption, or physisorption, and sten surface this falls to ∼160 kJ mol−1 . By providing
chemical adsorption, or chemisorption. Physisorption is an alternative low-energy path for reaction, the surface
similar in nature to condensation and involves little chem- causes a large increase in the reaction rate. This is an ex-
ical interaction with the surface, being associated with van ample of heterogeneous catalysis. (See Section IV.C for
der Waals forces. Chemisorption involves a true chemical a definition of catalysis.) For such a reaction to occur at
interaction with the surface, with the formation of a chem- a surface, the ammonia must first diffuse to the surface,
ical bond. Thus, the enthalpy of physisorption is usually of it must dissociatively chemisorb, the hydrogen and ni-
the order of 20 kJ mol−1 , while for chemisorption values trogen atoms must then recombine, and they must desorb
are in the region of 200 kJ mol−1 . A chemisorbed molecule and diffuse away from the surface. The recombination and
may either remain intact in molecular chemisorption, or desorption may actually occur as one step, as the reverse
fall apart in dissociative chemisorption. In an important of dissociative adsorption. Either the adsorption or des-
recent development, it is now possible to identify individ- orption steps are rate-limiting in gas–surface reactions,
ual molecular bonds of adsorbed molecules using STM although fast liquid–surface reactions may be diffusion
P1: GLQ/LSK
Physical Chemistry
P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology
EN012A-573
July 25, 2001
17:10
FIGURE 6 Scanning tunneling microscope image of a singly deuterated acetylene molecule (HC2 D) molecule on a
Cu(100) surface. (A) An image of the C–D bond obtained by setting the second differential of the tunneling current
to 269 meV. [See Stipe, B. C., Rezaei, M. A., and Ho, W. (1999). Phys. Rev. Lett. 82, 1724; courtesy B. C. Stipe, with
permission.]
175
P1: GLQ/LSK P2: GPJ Final Pages
Encyclopedia of Physical Science and Technology EN012A-573 July 25, 2001 17:10
limited. This is consistent with the fact that dissociative SEE ALSO THE FOLLOWING ARTICLES
chemisorption and recombinative desorption are often ac-
tivated processes. ADSORPTION • ATOMIC SPECTROMETRY • CHEMICAL
Chemical reactions at the gas–surface interface can be THERMODYNAMICS • CRYSTALLOGRAPHY • KINETICS
followed by monitoring gas-phase products with, for ex- (CHEMISTRY) • LIQUIDS, STRUCTURE AND DYNAMICS •
ample, a mass spectrometer, or by directly analyzing the MECHANICS, CLASSICAL • PERIODIC TABLE (CHEMISTRY)
surface with a spectroscopic technique such as Auger • QUANTUM CHEMISTRY • QUANTUM MECHANICS • STA-
electron spectroscopy (AES), photoelectron spectroscopy TISTICAL MECHANICS • SURFACE CHEMISTRY
(PES), or electron energy loss spectroscopy (EELS), all
of which involve energy analysis of electrons, or by sec-
BIBLIOGRAPHY
ondary ionization mass spectrometry (SIMS), which ex-
amines the masses of ions ejected by ion bombardment.
Adamson, A. W. (1980). “Physical Chemistry of Surfaces,” Wiley, New
Another widely used surface probe is low-energy electron York.
diffraction (LEED), which can provide structural infor- Adamson, A. W. (1979). “A Textbook of Physical Chemistry,” 2nd ed.,
mation via electron diffraction patterns. At the gas–liquid Academic Press, New York.
interface, optical reflection ellipsometry and optical spec- Atkins, P. W. (1982). “Physical Chemistry,” 2nd ed., Freeman, San Fran-
cisco.
troscopies are employed, such as Fourier transform in-
Bernstein, R. B. (1982). “Chemical Dynamics via Molecular Beam and
frared (FTIR) and laser Raman spectroscopies. Laser Techniques,” Oxford Univ. Press, New York.
Elastic and inelastic collisions of atoms and molecules Berry, R. S., Rice, S. A., and Ross, J. (1980). “Physical Chemistry,”
at surfaces are also of importance. The scattering of hydro- Wiley, New York.
gen and helium from surfaces leads to diffraction patterns Eyring, H. (1944). “Quantum Chemistry,” Wiley, New York.
Eyring, H., Henderson, D., and Jost, W. (1967). “Physical
in the same manner as with LEED, but since the atoms
Chemistry—An Advanced Treatise,” 15 vols., Academic Press, New
penetrate the surface far less deeply than even low-energy York.
electrons, the structures obtained reflect the very surface of Eyring, H., Henderson, D., Stover, B. J., and Eyring, E. M.
the sample. The inelastic surface scattering of molecules (1982). “Statistical Mechanics and Dynamics,” 2nd ed., Wiley, New
can be examined in detail using laser and mass spectromet- York.
Herzberg, G. (1950). “Molecular Spectra and Molecular Structure: I.
ric detection for the scattered molecules. Such measure-
Spectra of Diatomic Molecules,” 2nd ed., Van Nostrand-Reinhold,
ments can be used to model the form of the gas–surface New York.
interaction potential, knowledge of which is a prerequisite Kauzmann, W. (1957). “Quantum Chemistry,” Academic Press, New
for any detailed picture of gas–surface reaction dynamics. York.
Not all surface chemistry is catalytic. In many cases Laidler, K. J. (1965). “Chemical Kinetics,” 2nd ed., McGraw-Hill, New
York.
the surface itself may be consumed, in processes such
Levine, R. D., and Bernstein, R. B. (1974). “Molecular Reaction Dy-
as etching and corrosion. Etching is employed to fabri- namics,” Oxford Univ. Press, New York.
cate devices where it provides for the controlled removal Levine, R. D., and Bernstein, R. B. (1987). “Molecular Reaction
of material. In the semiconductor industry, for example, Dynamics and Chemical Reactivity,” Oxford Univ. Press, New
discharges containing fluorine are used to etch silicon by York.
Linnett, J. W. (1960). “Wave Mechanics and Valency,” Methuen, Lon-
volatilization as silicon fluorides. Corrosion is generally
don.
an unwanted process whereby items are destroyed through McQuarrie, D. A. (1976). “Statistical Mechanics,” Harper and Row, New
dissolution and/or oxidation. Metals may corrode through York.
many different (usually electrochemical) processes. For Moore, W. J. (1983). “Basic Physical Chemistry,” Prentice Hall, New
example, in the presence of oxygen, a metal may displace York.
protons as water or reduce oxygen to OH− , in acid and Partington, J. R. (1954). “An Advanced Treatise on Physical Chemistry,”
5 vols., Wiley, New York.
alkaline environments, respectively. In principle, this pro- Rowlinson, J. S. (1982). “Liquids and Liquid Mixtures,” 3rd ed., But-
cess requires the additional presence of a second metal, terworths, London.
with a lower electrochemical potential. However, all sam- Smith, I. W. M. (1980). “Kinetics and Dynamics of Elementary Gas
ples have regions of high and low strain, which will have Reactions,” Butterworths, London.
Smith, R. A. (1961). “Wave Mechanics of Crystalline Solids,” Wiley,
slightly different potentials. A given metal can be pro-
New York.
tected by contact with another (sacrificial) metal with a Steinfeld, J. I. (1974). “Molecules and Radiation: An Introduction
more negative potential, which will be preferentially cor- to Modern Molecular Spectroscopy,” MIT Press, Cambridge, Mas-
roded. This is applied in the galvanizing of iron by zinc. sachusetts.
P1: GQT/GAM P2: GLM Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN013C-606 July 26, 2001 17:34
I. Introduction
II. Quantum Mechanical Basis for Adiabatic
Potential Energy Surfaces
III. Topology of Adiabatic Potential Energy Surfaces
IV. Breakdown of the Adiabatic Approximation
V. Shapes of Potential Energy Surfaces
9
P1: GQT/GAM P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN013C-606 July 26, 2001 17:34
electronic adiabatic potential energy surfaces. Under some II. QUANTUM MECHANICAL BASIS
circumstances, however, it is convenient to use more gen- FOR ADIABATIC POTENTIAL
eral definitions; this results in potential energy surfaces ENERGY SURFACES
variously known as diabatic or nonadiabatic, the latter ad-
jective being a useful double negative (it is convenient The Schrödinger equation for a system of N nuclei and n
to use this term because, except for a few small terms, electrons is
adiabatic surfaces may be defined uniquely but in gen-
eral diabatic surfaces are not; nonadiabatic has the useful (H − E)(r, R) = 0, (1)
connotation then of “anything except the adiabatic”). The where H is the Hamiltonian or energy operator of the
concept of potential energy surfaces may be generalized molecule:
to systems in liquids, in which case one obtains potentials h2 2 h2 2
of mean force. H =− ∇ − ∇ + V (r, R) + Hrel (r, R). (2)
2M R 2m r
In these equations, h is Planck’s constant divided by 2π,
and R denotes a 3N -dimensional vector of scaled nuclear
coordinates:
I. INTRODUCTION M1 1/2
R1 = X 1, (3a)
M
The separation of electronic and nuclear motions accord-
ing to time scales and the consequent introduction of M1 1/2
R2 = Y1 , (3b)
an effective potential energy surface for nuclear motion M
was first considered by Born and Oppenheimer. Although
..
their method is seldom used in modern work, the modern .
equivalents are still commonly called Born–Oppenheimer 1/2
MN
approximations and Born–Oppenheimer potential energy R3N = ZN, (3c)
surfaces. The modern form of the derivation, which is M
summarized below, dates to a later paper by Born and where M j , X j , Y j , and Z j are the mass and Cartesian co-
to work by Born and Huang. Occasionally the phrases ordinates of the nucleus j, M is any of the nuclear masses
Born–Oppenheimer and Born–Huang are used to specify or a convenient reduced nuclear mass, m is the electronic
whether certain small terms are included in the poten- mass, r is a 3n-dimensional vector of electronic Cartesian
tial energy surfaces, although, as mentioned above, it is coordinates {xk , yk , z k }nk=1 :
more common to refer to any adiabatic surfaces as Born–
r1 = x 1 , (4a)
Oppenheimer surfaces.
A potential energy surface is an effective potential en- r2 = y1 , (4b)
ergy function for the relevant nuclear degrees of freedom. ..
The latter are usually defined as all nuclear degrees of .
freedom minus three overall translations and two or three
r3n = z n , (4c)
rotations of the nuclear subsystem. If an atom has N nu-
clei, it is common to consider the potential energy as a V (r, R) is the sum of all coulomb forces between the par-
function of 3N − 5 coordinates for N = 2 (since two nu- ticles, Hrel (r, R) is the energy operator for relativistic ef-
clei always lie on a line and therefore, when considered as fects including mass-velocity and spin–orbit coupling, E
point masses, have only two rotational degrees of freedom) is the total energy, and ψ(r, R) is the wave function. Notice
and 3N − 6 coordinates otherwise. Thus for N = 2 we ac- that the first two terms in Eq. (1) represent the nonrelati-
tually have a potential energy curve (i.e., a function of one vistic kinetic energy of the nuclei and the electrons. Usu-
variable), whereas for N ≥ 3 we have a potential energy ally the change in ψ(r, R) is about the same order of mag-
hypersurface (i.e., a function of three or more variables). nitude when we move a nucleus a small amount as when
A potential energy surface would strictly speaking denote we move an electron the same small amount. When this
the potential energy as a function of two coordinates in is the case, ∇R2 ψ(r, R)/M is smaller than ∇r2 ψ(r, R)/m
a two-dimensional cut through the (3N − 6)-dimensional by a factor of order m/M, which is less than about 10−3 ;
internal-coordinate space. In this article, however, we fol- thus the first term in Eq. (2) may be neglected to a first
low the very common language by which any potential en- approximation.
ergy hypersurface or potential energy function is referred The physical interpretation of this is that because of
to as a surface. their larger masses, the nuclei move so slowly compared
P1: GQT/GAM P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN013C-606 July 26, 2001 17:34
with the electrons that the dependence of the wave function By the same argument given above for the variation of
on electronic coordinates is essentially the same as if the ψ(r, R) with respect to r and R, we expect that the terms
nuclei were completely stationary (i.e., as if their mass containing Fαβ (R) and G αβ (R) are usually much smaller
were infinite so that their kinetic energy was zero). Then than the term containing Uα (R). When this is so, we may
Eq. (1) becomes neglect the small terms, and the set of coupled Eqs. (8) sim-
plifies to a separate uncoupled equation for each χα (R),
h2 2
− ∇r + V (r, R) + Hrel (r, R) − Uα (R) namely,
2m
h2 2
× ψα (r; R) = 0, (5) − ∇ + Uα (R) − E χα (R) = 0. (11)
2M R
where Uα (R) and ψα (r; R) denote the eigenvalue and
eigenfunction, respectively, with index α. Notice that both This has the form
of these depend parametrically on the nuclear positions R (Hnuc − E)χα (R) = 0, (12)
and also that the ψα (r, R) form a complete set of functions
of r for any particular value of R. In general the spectrum where Hnuc is an effective Hamiltonian for nuclear motion
{Uα (R)} contains a continuous part as well as a discrete given by
part, but the discrete part is the important part for the h2 2
concept of potential energy surfaces. We shall therefore Hnuc = − ∇ + Uα (R). (13)
2M R
use only a discrete index α. This is adequate for almost
all practical work since the continuous part is usually ne- Since a Hamiltonian is usually the sum of a kinetic energy
glected in calculations. For a more rigorous derivation, operator and a potential energy operator, we may interpret
however, all sums over discrete α in the equations below Uα (R) as an effective potential for nuclear motion. In fact,
must be replaced by a sum and an integral. Uα (R) is the potential energy surface that we sought to
To obtain the complete system wave function we choose derive. Recalling the origin of Uα (R), we see that it rep-
a trial function resents the total energy of the electrons, both kinetic and
potential, plus all the rest of the potential energy, when
ψ(r, R) = ψα (r; R)χα (R). (6) the electrons are in state α. Alternatively, it represents
α
the entire (coulombic plus relativistic) potential energy of
Since the {ψα (r; R)} are complete, this trial function yields all particles plus the electrons’ kinetic energy. When the
the exact ψ(r, R) if we retain the complete set and solve equations for the χα (R) decouple, as in Eq. (12), the elec-
for {χα (R)} by the variation method. The best {χα (R)} by tronic state is preserved during the nuclear motion. The
this method satisfy the equation resulting quantized energy requirement of the electrons
plus the rest of the potential energy (given in the absence
dr ψβ∗ (r; R)(H − E) ψα (r; R)χα (R) = 0, of relativistic effects by the nuclear–nuclear coulombic in-
α
teraction energy) together constitute an effective potential
β = 1, 2, . . . , ∞. (7) for nuclear motion.
If we carry out the indicated operations, using Eq. (5) When nuclear motion can be approximated by classical
and the orthogonality of the {ψα (r; R)} in r at fixed R, mechanics (which is often reasonable, especially for atoms
Eqs. (7) become heavier than hydrogen), Eqs. (12) and (13) are replaced by
h2 M R̈ = −∇R Uα (R), (14)
− ∇R2 χα (R) + 2 Fαβ (R) · ∇R χβ (R)
2M β where R̈ is the nuclear acceleration, and the right-hand
side is the force on the nuclei. Since Uα (R) generates the
+ G αβ (R)χαβ (R) + [Uα (R) − E]χα (R) = 0, force function, it is sometimes called the force field.
β Inclusion of the spin–orbit and other relativistic terms
α = 1, 2, . . . , ∞, (8) in Eq. (5), as we have done, is, strictly speaking, the
most correct approach. This yields, as we have seen, a
where
set of nuclear wave functions χα (R) whose uncoupled
motion is governed by the potentials Uα (R) and which
Fαβ (R) = dr ψα∗ (r; R)∇R ψβ (r; R) (9)
are coupled only by the nuclear-derivative terms Fαβ (R)
and and G αβ (R). In practice, though, Hrel (R) is difficult to
treat on an equal footing with the coulombic terms in the
Gαβ (R) = dr ψα∗ (r; R)∇R2 ψβ (r; R). (10) Hamiltonian. Therefore one sometimes works with
P1: GQT/GAM P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN013C-606 July 26, 2001 17:34
In general, for systems of three or more nuclei, some adiabatic approximation is expected to break down in the
potential energy surfaces that intersect when one neglects vicinity of any avoided crossing or avoided intersection,
spin–orbit coupling avoid intersecting when one includes and also at actual intersections for which the symmetry of
it. the two states is the same at the intersection.
There are other circumstances under which the adiabatic
approximation may break down. We have considered the
IV. BREAKDOWN OF THE ADIABATIC case where ψ(r, R) varies rapidly with R because of the
APPROXIMATION factor ψα (r; R). A second case where the term containing
Fαβ (R) in Eq. (8) may be significant occurs when χα (R)
We are now in a position to understand the limitations of varies unusually rapidly, for instance, in highenergy col-
the adiabatic potential energy surface concept. First, how- lisions of Na+ with Cl− . When the nuclear speed is large,
ever, we should understand the physical origin of avoided χα (R) must vary significantly on the scale of a very small
crossings and avoided intersections. We begin by consid- de Broglie wavelength.
ering the diatomic molecule NaCl, and we let R denote We may summarize the two cases in a simple but ap-
the distance between the nuclei. At R = ∞, the energy proximate way as follows: When the nuclear kinetic en-
of two neutral atoms is lower than the energy of a Na+ ergy is much smaller than the spacings between the adi-
ion and a Cl− ion by the difference E of the ioniza- abatic electronic energy surfaces Uα (R), these surfaces
tion potential of Na and the electron affinity of Cl− . As serve as potential energy surfaces for nuclear motion.
R is decreased, however, the energy of the ionic state When the nuclear kinetic energy is comparable to or larger
decreases rapidly because of the long-range coulomb at- than the spacings between the Uα (R), the adiabatic ap-
traction, which may be represented by −e2 / R where proximation may, and often does, break down.
e is the electronic charge, while the energy of the neu- The adiabatic potential energy surfaces need not be
tral state stays approximately constant until much shorter abandoned completely when the adiabatic approxima-
distances where the covalent interaction becomes appre- tion breaks down, especially if the region of breakdown
ciable. Thus at some distance R ∗ given approximately is fairly localized, as it often is when the breakdown is
by due to an avoided or conical intersection. If the nonadi-
abatic behavior is localized to a small region, we often
e2
E∼ = (16) employ the model of surface hopping. In this model the
R∗ nuclear motion is assumed to be governed by an adia-
the hypothetical purely ionic state and the hypothetical batic potential energy surface until a nonadiabatic region
purely covalent state would have the same energy. Actu- is reached. In such a region there is a nonzero quantum
ally though, at this R the corresponding eigenfunctions mechanical probability that the system “hops” to another
of Eq. (5) have mixed character, partly covalent and partly surface. Based on this probability one portion of the quan-
ionic, with about 50% partial ionic character. Since both tum mechanical probability density exits the nonadiabatic
states have 1 + g character, their eigenvalues are different. region in one of the adiabatic electronic states, and the
We call the energies of the hypothetical states with pure other portion exits in the other one or more coupled adi-
valence characteristics U1d ( R) and U2d ( R), where d de- abatic electronic states. After this the nuclear motions
notes diabatic (or nonadiabatic). Although U1d ( R) and again proceed independently as governed by single po-
U2d ( R) cross, the adiabatic curves U1 ( R) and U2 ( r ) tential energy surfaces until another nonadiabatic region
avoid crossing, having the shapes shown in Fig. 1. For is reached. Although this model neglects certain coherency
this case α = 1 corresponds to a covalent state to the right effects that may be important for quantitative work, it is of-
of R ∗ but to an ionic state to the left of R ∗ and vice ten useful for qualitative discussions and semiquantitative
versa for α = 2. Now recall the argument given above for calculations.
neglecting F12 (R) and G 12 (R). At R = R ∗ , the wave Another concept often invoked for qualitative discus-
function is changing character very rapidly as a function sions and for calculations when the adiabatic approxima-
of R, and the action of ∇R on ψ(r, R) is unusually large; tion breaks down is that of diabatic potential energy sur-
this means that F12 ( R) and G 12 ( R) need not be negligi- faces. There are several nonequivalent ways of defining
ble. In such a case, the dynamics governed by the nuclear- such surfaces, each of which may be useful under some
motion wave functions χ1 (R) and χ2 (R) do not decouple circumstances. The simplest way is that already illustrated
into independent motions governed by effective potentials above in conjunction with the NaCl example: namely, a
(i.e., the adiabatic potential energy surface concept breaks diabatic state is the effective potential energy function for
down). Although we have given the argument for a par- nuclear motion when the electronic state is artificially con-
ticular diatomic molecule, the effect is general, and the strained to a state of prespecified pure valency.
P1: GQT/GAM P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN013C-606 July 26, 2001 17:34
A second way to define diabatic electronic states and po- h2 2
tential energy surfaces is more mathematical. Notice that − ∇r + V (r, R) − Uαn (R) ψαn (r, R) = 0, (22)
2m
the valency-character prescription leads to states coupled
by the operator where the superscript n denotes nonrelativistic. The
true adiabatic states are coupled only by −(h 2 /2M)∇R2 ,
h2 2 but these nonrelativistic adiabatic states are coupled by
Hel = − ∇ + V (r, R) + Hrel (r, R) (17)
2m r both this operator and Hrel (r, R). Because of the latter
and also by −(h 2 /2M)∇R2 . From the point of view of the coupling, the nonrelativistic adiabatic electronic states
nuclear motion, the former is sometimes called potential ψαn (r; R) and their associated potential energy curves
coupling (since it involves only multiplicative operators Uαn (R), which are the most widely employed states and
in the nuclear coordinates), and the latter is called deriva- potential energy surfaces in quantum chemistry, are ac-
tive coupling. Since electronically adiabatic states have tually diabatic. They are nevertheless usually called adi-
derivative coupling but no electronic coupling, a natural abatic although nonrelativistic is adiabatic is technically
question is whether useful diabatic states can be defined to more appropriate.
have potential coupling but not derivative coupling. Un-
fortunately, this leads to states that are completely inde-
pendent of R and are not useful. It is possible, however, to V. SHAPES OF POTENTIAL
make one component (in one or another coordinate sys- ENERGY SURFACES
tem) of the vector coupling operator Fαβ (R) vanish for all
α, β. Furthermore, if Fαβ (R) is approximated as the gradi- A. Diatomics
ent of a scalar (which can be a good approximation when
nonadiabatic effects are dominated by a narrowly avoided A schematic illustration of some typically shaped adia-
intersection), then it is possible to make all components of batic potential energy curves for a diatomic molecule is
Fαβ (R) vanish for all α, β. Both of these prescriptions are shown in Fig. 3. All five curves shown become large and
sometimes employed to obtain diabatic states. Consider, positive at small internuclear distance R. This repre-
for example, the case where Fαβ (R) is the gradient of a sents a repulsive force between the nuclei and is due to
scalar for all R; then it has zero curl. We define diabatic internuclear repulsion and the unfavorability of overlap-
electronic states by ping the atomic charge clouds of the two different centers.
All five curves tend to constants at large R. This is be-
φαd (r; R) = φβ (r; R)Tβα (R). (18) cause the atomic interaction energy eventually decreases
β to zero as the distance between the atoms is increased.
The states {φαd (r; R)χαd (R)} will be uncoupled by nuclear The constant spacings between the curves at large R are
derivative operators if we choose Tβα (R) for all β and α equal to the atomic excitation energies. Curves 1 and 2
such that are effective potentials for the interaction of ground-state
atoms, and curves 3–5 represent effective potentials for the
∇R Tαβ (R) = Fαγ (R)Tγβ (R), (19) case where at least one of the atoms is excited. The figure
γ shows an avoided crossing between curves 2 and 3 and
and this set of coupled partial differential equations does
have a solution if Fαβ (R) has zero curl. Furthermore, if
the state expansion α, β, . . . is truncated to a finite num-
ber of computationally important states, then the diabatic
electronic basis is not independent of R. In this way, one
can define a diabatic basis by a transformation from an
adiabatic one, and it spans the same space. The diabatic
potential surfaces are given by
Uααd
(R) = |Tγ α (R)|2 Uγ (R), (20)
γ
the center of mass of H2 , and the vertical axis is [(MCl + function of the solute coordinates by adding the free en-
2MH /4MCl ]1/2 times the Ha -to-Hb distance. Because of ergy of solution to the gas-phase potential surface. The
the mass scaling factor, the nuclear-motion Hamiltonian, resulting potential function may be used in Eq. (14), and
Eq. (13), in this coordinate system becomes it is called a potential of mean force.
h2 ∂ 2 ∂2
Hnuc = − + + Uα (x , y), (32)
2µ ∂ x 2 ∂ y2
SEE ALSO THE FOLLOWING ARTICLES
where
2MCl MH ATOMIC AND MOLECULAR COLLISIONS • ORGANIC
µ= . (33)
(MCl + 2MH ) CHEMICAL SYSTEMS, THEORY • QUANTUM MECHANICS
Since the coefficient of both derivative operators is the • SURFACE CHEMISTRY
same, this Hamiltonian is the same as that for a single par-
ticle in two dimensions under the influence of a potential
function Uα (x , y). This analogy is very helpful in mentally BIBLIOGRAPHY
visualizing the motion of a polyatomic system whose dy-
namics are governed by a multidimensional potential sur- Gao, J., and Thompson, M. A. (1998). “Combined Quantum Mechani-
face. In Fig. 6 the H + HCl reaction is initiated at the lower cal and Molecular Mechanical Models,” American Chemical Society,
right, and products are formed when the system, having Washington, DC.
Herzberg, G. (1966). “Molecular Spectra and Molecular Structure. III.
passed through or near the saddle point (denoted + in the
Electronic Spectra and Electronic Structure of Polyatomic Molecules,”
figure), reaches the top. Van Nostrand Reinhold, New York.
For solutes in the liquid phase (e.g., an organic molecule Kondratiev, V. N., and Nikitin, E. E. (1981). “Gas-Phase Reactions:
in aqueous solution), one can obtain an effective potential Kinetics and Mechanisms,” Springer-Verlag, Berlin.
Löwdin, P.-O., and Pullman, B., eds. (1983). “New Horizons of Quantum
Chemistry,” D. Reidel, Dordrecht, Holland.
Maitland, A., Rigby, M., Smith, E. B., and Wakeham, W. A. (1981).
“Intermolecular Forces,” Clarendon, Oxford.
Michl, J., and Bonačić-Koutecký. (1990). “Electronic Aspects of Or-
ganic Photo Chemistry,” John Wiley & Sons, New York.
Murrell, J. N., Carter, S., Farantos, S. C., Huxley, P., and Varandas,
A. J. C. (1984). “Molecular Potential Energy Functions,” Wiley,
Chichester.
Salem, L. (1982). “Electrons in Chemical Reactions: First Principles,”
Wiley-Interscience, New York.
Simons, J. (1983). “Energetic Principles of Chemical Reactions,” Jones
and Bartlett, Boston.
Smith, I. W. M. (1980). “Kinetics and Dynamics of Elementary Gas
Reactions,” Butterworths, London.
Truhlar, D. G., ed. (1981). “Potential Energy Surfaces and Dynamics
FIGURE 6 Contour map of a potential energy surface for collinear Calculations: For Chemical Reactions and Molecular Energy Trans-
H + HCl → H2 + Cl. x is the distance from H to the center of fer,” Plenum, New York.
mass of HCl and y the mass-scaled distance from Cl to its Truhlar, D. G., and Morokuma, K., eds. (1999). “Transition State
nearest H. Both axes are given in units of a 0, where 1a 0 = Modeling for Catalysis,” American Chemical Society, Washington,
1 bohr = 0.5292 × 10−10 m. DC.
P1: GTV Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN015D-710 August 2, 2001 15:22
Solid-State Chemistry
Bahne C. Cornilsen
Michigan Technological University
295
P1: GTV Final Pages
Encyclopedia of Physical Science and Technology EN015D-710 August 2, 2001 15:22
withstand extremely high temperatures, improved hetero- charge, and the number of lattice sites must be conserved,
geneous catalysts, improved electronic materials, optics unusual oxidation states can be introduced (in dopants
compatible with very high laser intensities, stabile nu- or predominant cations) and nonstoichiometric composi-
clear waste containment materials, and longer lasting elec- tions stabilized. For example, Ni(III) can be formed in
trodes for fuel cells and magnetohydrodynamic genera- NiO, which nominally contains Ni(II) ions, and nickel va-
tors. To solve such problems and to optimize performance, cancies (VNi ) are formed according to Eq. 1.
it is necessary to control the molecular-level structure and
Ni2+ 3+
Ni(1−3x) NiNi(2x) VNi(x) (1)
bonding and to understand the structure–property relation-
ship. The syntheses and fabrication processes themselves Point-defect ordering (e.g., vacancy-dopant pairs) leads to
are therefore critical for preparation of solid-state mate- interesting complications. Preparation conditions them-
rials which display the desired properties. Properties of selves (e.g., oxygen partial pressure and temperature in
interest include the chemical reactivity as well as the elec- oxides) thermodynamically define and control this defect
tronic, magnetic, optical, and mechanical properties. content and structure. It is important to realize that point
defects are thermodynamically allowed and defined; they
are not anomalous in the least. Therefore, undoped, high-
II. SOLID-STATE STRUCTURE AND purity compounds may exhibit sizable nonstoichiometry
STRUCTURAL CHARACTERIZATION due to intrinsic point defects. Doping (intentional addi-
tion of an impurity) allows one to precisely control the
Characterization of the detailed chemical structure and point-defect content and nonstoichiometry and, thereby,
bonding of a solid is a prerequisite to the understanding the properties. Transport properties are influenced by the
and control of the chemical and physical properties. In point defects. Electrical conduction (hole or electron trans-
general the properties of a solid are controlled by the port) and solid state diffusion of atoms generally vary with
macroscopic and the microscopic structures. The macro- the quantity and type of point defects.
scopic characteristics (e.g., dislocations) predominantly The determination of how nonstoichiometry is accom-
influence the mechanical properties, such as strength modated (i.e., by what type and amount of defect) is
for metals and ceramics. The microscopic structure an active research area. Nonstoichiometry can also be
(interatomic or molecular-level structure) is controlled accommodated by subtle changes in structure known as
by the chemical bonding. Solids are classified according extended defects or crystallographic shear. Crystallinity,
to their chemical bonding as metals, semiconductors, or impurity levels, point-defect structure, and nonstoichiom-
insulators. etry are each controlled by or influenced by the preparation
Complete structural characterization of a material in- method; therefore, it is discussed further in Section III.
volves not only the elemental composition for major com- Surface properties can differ from the bulk structurally,
ponents and a study of the crystal structure, but also the both as clean surfaces or because of products formed on re-
impurity content (impurities in solid solution and/or addi- active surfaces (physisorbed or chemisorbed). The former
tional phases) and stoichiometry. Noncrystalline materials can experience relaxation, that is, surface reconstruction
can display unique behavior, and noncrystalline second due to the distortion in bonding for surface atoms which
phases can alter properties. Both the long-range order and are lacking bonds. Impurity segregation at a surface can
crystal imperfection or defects must be defined. For ex- further alter properties, as can second phases formed on a
ample, the structural details which influence properties of surface. The activity of heterogeneous catalysts and cor-
oxides include the impurity and dopant content, nonstoi- rosion is controlled by such surface properties and by the
chiometry, and the oxidation states of cations and anions. bulk and surface point-defect structures.
These variables also influence the point-defect structure, Phases formed on semiconductor surfaces can change
which in turn influences chemical reactivity, and electri- the electrical properties in an uncontrolled, deleterious
cal, magnetic, catalytic, and optical properties. fashion. Oxide passivation layers on compound semicon-
Point defects are imperfections in the actual crystalline ductors (e.g., mercury cadmium telluride IR detectors or
architecture as compared to the ideal lattice in which each gallium arsenide solar cells) can be grown to impart pro-
atom site is filled with the appropriate element. They tection to the surfaces and to stabilize electrical properties
can influence properties at extremely low levels (ppb or by preventing uncontrolled reactions.
ppm). Typical point defects include crystal sites with miss- Interfaces between two bulk phases, between the bulk
ing atoms (vacancies), atoms positioned in sites that are and a surface, or at grain boundaries can further com-
not filled by the crystal structure in question (intersti- plicate the chemistry. Grain boundaries in polycrystalline
tials), crystal sites containing impurity atoms (dopants), materials can contain second phases (crystalline or non-
and cations with different oxidation states. Because mass, crystalline) and have significant width. This is termed an
P1: GTV Final Pages
Encyclopedia of Physical Science and Technology EN015D-710 August 2, 2001 15:22
interphase. On the other extreme, highly ordered (coher- ing varying degrees of order. Powders may be used as cata-
ent) interfaces can occur between two microcrystallites lysts or as precursors for crystal growing, glass formation,
(or grains). A second phase in a boundary can impart or ceramic formation. Numerous technologically useful
unique electrical properties, significantly influencing the components are produced by sintering powders, taking
characteristics of a capacitor, for example. Furthermore, advantage of the unique electrical, magnetic, optical, or
dopants may segregate to a grain boundary phase, affecting mechanical properties.
boundary and bulk phase properties. Positive temperature Particle size is also an important powder variable. Pro-
coefficient (PTC) resistors and boundary layer capacitor duction of powders with a controlled particle size distri-
operation are based on such effects. bution allows production of ceramics with uniform mi-
Characterization of the preceding structural variables crostructures, improving mechanical properties. Strength
is complex and challenges an extensive array of modern of zirconia and zirconia–alumina refractories can be im-
analytical instrumentation. Diffraction techniques (X-ray, proved another way. Phase transitions which involve
neutron, and electron) are basic to the study of crystal large atomic displacements lead to microcracking during
structures. Improved data analysis techniques make these heating and cooling, which implement fracture. Small-
methods even more powerful for the study of powders. particle-sized zirconia stabilizes a high-temperature phase
Nuclear magnetic resonance (NMR) spectroscopy has be- at lower temperatures. This behavior, called “phase trans-
come a powerful tool for the study of solids with the formation toughening,” has been explained on the basis
advent of magic angle spinning techniques. Neutron in- of higher surface free energies for these systems. Since a
elastic scattering, Raman scattering, and IR vibrational phase transition from this high-temperature phase is elim-
spectroscopic analyses have been traditionally used to inated, the structural properties are improved.
study lattice dynamics and solid-state phase transforma- Research has shown that novel chemical preparation
tions. They can also provide information about dopants methods allow the production of unique materials, demon-
and point-defect structures through studies of local modes strating properties unattainable through more traditional
as well as the extensive crystal structure information. methods. Traditional solid-state synthesis techniques re-
High-resolution electron microscopy lattice imaging has quire high temperatures to increase the kinetics and allow
proven to be a powerful tool for the study of crystal struc- reaction in reasonably short times. It is common to react
tures and extended defects. Electron spin resonance re- solid powders after mixing by grinding or ball-milling.
mains an effective tool for the study of paramagnetic To ensure complete reaction of two powdered reactants
solids, including impurities or low-level dopant structures. it may be necessary to carry out repeated grinding and
A variety of X-ray and electron spectroscopic techniques heating cycles. The grinding is necessary to reduce dif-
have been developed. These are particularly useful for pro- fusion distances and increase product homogeneity. Re-
viding information about elemental composition, surface peated grinding and high-temperature treatments intro-
structures, and cation oxidation states. duce undesirable impurities.
Low-temperature solid preparation methods (meaning
from ∼900◦ C to cryogenic temperatures) can produce
III. PREPARATION TECHNIQUES powders having fewer impurities, high surface areas,
AND PROPERTY VARIATION and other unique characteristics which have useful
applications, such as reactive surfaces for sintering or
The crystal structure of a solid can influence the properties for catalysis. Higher treatment temperatures can actually
of a material, for example, the structure must be noncentric reduce such activity by changing the bulk and/or surface
for a material to demonstrate antiferromagnetic, ferromag- structures.
netic, ferroelectric, or piezoelectric behavior. Rapid cool- Low-temperature syntheses can sometimes allow
ing of a sample from high temperature and/or high pres- unique surface phases to be stable. Tetragonal barium ti-
sure can quench in a structure that is not stable at room tanate, prepared at ∼700◦ C, has hexagonal barium titanate
temperature or atmospheric pressure. High-pressure ox- on the surface, which is stabilized by a higher surface free
ide polymorphs, which are more dense, have been studied energy. Normally this hexagonal phase is not formed be-
to model the earth’s interior. Furthermore, unique crystal low 1460◦ C. This hexagonal surface was also found to
structure characteristics of a material can allow structure– reversibly adsorb CO2 as a surface carbonate.
property variation, for example, insertion compound for- A variety of solution (water or organic solvent) tech-
mation in layered materials. niques have been devised to control composition (e.g.,
Solids can be prepared as single crystals, glasses, thin the Ca:Mn ratio in an oxide such as CaMnO3 ). Control
films, powders, or sintered powder compacts (ceramics). of this ratio is important in terms of the composition-
Powders may be noncrystalline or polycrystalline, exhibit- sensitive properties. This is especially true for transition
P1: GTV Final Pages
Encyclopedia of Physical Science and Technology EN015D-710 August 2, 2001 15:22
metal–containing systems because several oxidation states or other metal-containing species to the reaction site, and
are stable and available for most of them. Unusual ox- is used in the production and development of semiconduc-
idation states introduced by variation in cation ratio and tor devices.
oxygen nonstoichiometry can influence catalytic and elec- In some instances materials with potentially useful
tronic properties. The “solid-state precursor” method in- properties have not been exploited until prepared as pure
volves precipitation of a compound or solid solution (e.g., crystals and films. Examples of this include doped polythi-
oxalate or carbonate) with the desired stoichiometry which azyl, (SN)x , and polyacetylene, (CH)x , which have metal-
is then thermally decomposed to form the desired prod- lic properties, including electrical conductivity. The use
uct oxide. Atomic scale mixing of the components pre- of polymer precursors for ceramics (e.g., silicon carbide)
cludes long calcining times at high temperatures. Powders is another interesting solid-state preparation technique.
formed are extremely homogeneous and have high surface An exciting example, demonstrative of every aspect
areas. The composition of the products, however, does de- of solid-state chemistry, is the development of high-
pend on the solution solubilities, and the precision of these temperature superconducting oxides, which has followed
precursor methods is not always high. The so-called “liq- the 1986 discovery of superconducting YBa2 Cu3 O7−x .
uid mix technique” does allow better precision as well This oxide will conduct at temperatures much higher than
as homogeneity because no precipitation occurs; rather, previous superconducting metal alloys, thereby reducing
the solution is heated to a viscous, glasslike form which cooling expense. The synthesis, purification, character-
is then decomposed to the product oxide. No solution– ization, extension to other metal-oxide systems, and the
solid partitioning occurs, giving a composition equal to eventual application of these oxides in devices is certain to
that of the starting mixture. The latter can be weighed become a classic example of solid-state chemical science
to high accuracy (hundreds of ppm or better for major and technology.
components), providing precise control of product cation Control of the structure during preparation and process-
ratios. ing allows one to control the properties and to optimize
The sol–gel technique is a low temperature, solution- material performance for particular applications. Thor-
preparation method which has been applied extensively ough structural characterization is a prerequisite. Based
to produce glasses, fibers, coatings (protective or dielec- on the knowledge of how synthesis and processing influ-
tric), abrasive particles, and controlled-pore-size catalyst– ence structure and of how structure controls properties, the
substrates. This method is used to prepare glasses, for ex- structure can be tailored and materials can be designed for
ample, that cannot be obtained upon quenching from the optimum performance.
melt. Either colloidal or polymeric gels are formed by
gelation of a precursor solution, involving hydrolysis and
condensation of colloidal sols of metal salt or hydroxide SEE ALSO THE FOLLOWING ARTICLES
solutions or of metal alkoxides. Drying, solvent removal,
and firing conditions are then chosen to provide the desired ANALYTICAL CHEMISTRY • BONDING AND STRUCTURE IN
microstructures and properties. SOLIDS • CRYSTALLOGRAPHY • LASERS, SOLID-STATE •
Another important solution technique which should be MICROSCOPY (CHEMISTRY) • PHASE TRANSFORMATIONS,
mentioned is the “homogeneous precipitation technique.” CRYSTALLOGRAPHIC ASPECT • PRECIPITATION REAC-
It favors formation of a more ordered, crystalline prod- TIONS • SOLID-STATE IMAGING DEVICES • SUPERCON-
uct when two solutions are mixed to form an insoluble DUCTORS, HIGH TEMPERATURE • SURFACE CHEMISTRY
compound. The principle is slow precipitation, avoiding
instantaneous formation of a disordered product. The ben-
efits of all of the low-temperature solution techniques in- BIBLIOGRAPHY
clude homogeneity (atomic scale mixing) and minimal
introduction of impurities. Brinker, C. J., and Scherer, G. W. (1990). “Sol–Gel Science,” Academic
Press, Boston.
Other methods of preparation include chemical vapor Corbett, J. K., ed. (1985). Symposium on metal–metal bonding in solid-
deposition (CVD) and electrochemical methods. The lat- state clusters and extended arrays. J. Solid State Chem. 57(1), 1.
ter are used to form thin films and protective coatings as Etourneau, J. (1999). Novel synthesis methods for new materials in solid-
well as battery electrodes. Since this is generally a low- state chemistry. Bull. Mater. Sci. 22(3), 165–174.
temperature method, the structure can differ from that of Fischer, J. E. (1997). Fulleride solid-state chemistry: Gospel, heresies
and mysteries. J. Phys. Chem. Solids 58(11), 1939–1947,
the same material prepared at a higher temperature. It can Grasselli, R. K., and Brazdil, J. F., eds. (1985). “Solid State Chemistry
be disordered or amorphous. Chemical vapor deposition in Catalysis,” Series 279, American Chemical Society, Washington,
involves vapor phase transport of volatile organometallics D.C.
P1: GTV Final Pages
Encyclopedia of Physical Science and Technology EN015D-710 August 2, 2001 15:22
Holt, S. L., Milstein, J. B., and Robbins, M., eds. (1980). Snyder, R. L., Condrate, R. A., and Johnson, P. F., eds. (1985). “Advances
“Solid State Chemistry: A Contemporary Overview,” Advances in Materials Characterization II,” Vol. 19, Materials Science Research,
in Chemistry Series No. 186, American Chemical Society, Plenum, New York.
Washington, D.C. Sorensen, O. T., ed. (1981). “Nonstoichiometric Oxides,” Academic
Honig, J. M., and Rao, C. N. R., eds. (1982). “Preparation and Charac- Press, New York.
terization of Materials,” Academic Press, New York. State of the art symposium: Solid state. (1980). J. Chem. Educ. 57, 531–
Nelson, D. L., and George, T. F. (1988). “Chemistry of High-Temperature 590.
Superconductors II,” American Chemical Society, Washington, D.C. West, A. R. (1984). “Solid State Chemistry and its Applications,” Wiley,
Pimentel, G. C., and Coonrod, J. A. (1987). “Opportunities in Chem- New York.
istry, Today and Tomorrow,” National Academy Press, Washington, Zelinski, B. J., and Uhlmann, D. R. (1984). Gel technology in ceramics.
D.C. J. Phys. Chem. Solids 45, 1069.
P1: ZBU/GJP P2: GSS Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN015D-711 August 2, 2001 12:38
Solid-State Electrochemistry
Günter Holzäpfel
Dortmund University
I. Introduction
II. Disorder in Solids
III. Transport Processes—Diffusion, Mobility,
and Partial Conductivity
IV. Solid Electrolytes, Solid Ionic Conductors,
and Solid–Solution Electrodes
V. Galvanic Cells with Solid Electrolytes
VI. Technical Applications of Solid Electrolytes
301
P1: ZBU/GJP P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN015D-711 August 2, 2001 12:38
TABLE III Notation of the Lattice Molecule and Typical De- The concentrations of charged atomic defects—point
fects in an AB Lattice in Terms of Building Units Using the Old defects—follow the law of mass action. The considera-
and New Schottky Symbols Compared to the Corresponding
Structure Elements According to Kröger and Vink
tions of thermodynamic equilibria can be applied to dis-
order equilibria in solid crystalline compounds, the so-
Schottky Schottky called ordered mixtures. Point defects can be regarded as
old new Kröger/Vink
quasi-chemical species with which chemical reactions can
AB lattice molecule AB AB be formulated. This has led to the so-called imperfection
Neutral A particle on AO A Ai − Vi chemistry. As an example, the disorder equilibrium be-
interstitial site tween vacancies and interstitial particles—the so-called
Neutral A vacancy A |A| V A − AA Frenkel equilibrium—will be regarded.
Neutral B particle on A-site B•(A) B|A| BA − AA In this case a particle A moves from an A lattice site to
Neutral C particle on A-site C•(A) C|A| CA − AA an interstitial site whereby, for example, with respect to the
Quasi-free electron e e unperturbed lattice a single positively charged interstitial
.
Electron hole ⊕ |e· | h particle Ai is formed, a vacancy Vi in the interstitial lattice
is destroyed, and a negatively charged vacancy VA on an
A site is left. This exchange process can be written in the
excess of positive charge denoted with. Two unoccupied form of a chemical reaction, a so-called disorder reaction
.
sites can be seen; these vacancies are negatively charged A + |A| = 0 (2)
relative to the unperturbed lattice and are denoted with .
Structure elements with the corresponding symbols pro- using the new Schottky notation, we have
.
posed by Kröger and Vink are easy to memorize; this no- AA + Vi = Ai + VA . (3)
tation is now also widely used in the literature. However, it
is very important to know that the numbers of the various Here the reaction is formulated in terms of structure el-
structure elements in a crystalline compound are not inde- ements, which for a thermodynamical treatment must be
pendent of one another. This is due to the fixed ratio of the combined with building units as follows:
.
number of A and B sites in an AB lattice. It is therefore (Ai − Vi ) + (VA − AA ) = 0. (4)
generally impossible to change the concentration of only
one type of structure element in a crystal. Thus, to create a The concentration of the interstitial particles or vacancies
vacancy on an A site we must either remove an A particle as building units is identical with the concentration as
or simultaneously add a B particle to a new B site, so that structure elements.
the ratio of A to B sites remains unchanged. The gener- Symbolizing the concentrations by square brackets the
ation of an interstitial particle leads to the destruction of law of mass action corresponding to Eq. (4) can thus be
a vacancy in the interstitial lattice. Thus, if we increase formulated as follows:
.
the size of a crystal or change the number of defects con- [Ai ][VA ] = const. (5)
tained in it, one must either add or remove combinations
of structure elements, this means that in general building For all crystals there exists at equilibrium a constant prod-
units have to be used. Similarly, the use of structure ele- uct of the concentrations of interstitial particles and the
ments to describe reactions of defects generally involves corresponding vacancies, the so-called Frenkel equilib-
the use of building units. rium which depends on temperature and pressure.
For higher concentrations of defects the equation must
be written in terms of activities instead of concentrations.
Frenkel disorder occurs in the silver halides, for example,
in AgCl.
Another kind of disorder equilibrium exists between A
vacancies and B vacancies in a binary AB crystal—the
so-called Schottky equilibrium. In this case the exchange
of particles between A and B vacancies and the crystal
is considered. This means that either A or B particles are
transferred to the surface of the crystal, which is thus en-
larged, while A or B vacancies are generated, or vice versa.
For example, the A vacancies may be singly negative and
FIGURE 1 Possible defects in an AB crystal using Kröger the B vacancies singly positive. When the particles are
symbols. brought to the surface, two of the particles there will be
P1: ZBU/GJP P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN015D-711 August 2, 2001 12:38
j = −D dc/d x, (15)
where D is called Fick’s diffusion coefficient. If there is
a concentration coefficient in any direction in space then
we must use the general form of Fick’s first law:
dc dc dc
j = −D grad c = −D , , . (16)
d x dy dz
In what follows only gradients in one direction (the x
direction) will be regarded.
The time dependence of the concentration is given by
Fick’s second law:
FIGURE 3 Lattice of silver iodide with regions in which the silver
∂c ∂ 2c
ions can move. = D 2. (17)
∂t ∂x
the quasi-molten state (structural disorder), whereas the These laws are valid in the case of ideal behavior that is,
iodide partial lattice does not become liquid below the when the chemical potential µ of the particles holds:
actual melting point. Comparable crystals that do not c
show the special behavior of structural disorder before µ = µ0 + RT ln 0 , (18)
c
they melt have an entropy of fusion about as large as the
where µ0 is the chemical potential in the standard state, c is
sum of the transition entropy and entropy of fusion in
the concentration of the particles, c0 is their concentration
AgI, that is, about twice as large as the residual entropy
in the standard state, R is the general gas constant, and
of fusion in AgI.
T is the absolute temperature. Using this equation Fick’s
For completeness let us mention that besides the point
first law [Eqs. (15)] can be written as
defects discussed in this chapter there exist still other de-
fects in crystals. These include Dc dµ
jx = − . (19)
RT d x
1. one-dimensional defects such as edge of screw In the case of nonideal behavior, that is, for higher con-
dislocations, centrations of the mobile particles, the more general ex-
2. two-dimensional defects such as surfaces or grain pression
boundaries, and
3. three-dimensional defects such as cavities. µ = µ0 + RT ln a (20)
must be used for the chemical potential of the diffusing
These defects are not discussed here because they are not species, where a is the activity. A component diffusion
properties of the thermodynamic equilibrium. coefficient D K is then defined in such a manner that an
expression analogous to Eq. (19) is obtained:
III. TRANSPORT PROCESSES—DIFFUSION, D K c dµ
MOBILITY, AND PARTIAL j =− . (21)
RT d x
CONDUCTIVITY A relationship between D and D K can be reached in the
following way.
In this section the transport of ions in an electrical field Since dc is equal to c d ln c, Eq. (15) can be expressed as
and their diffusion in a concentration or activity gradient
d ln c
will be treated. The expressions derived are valid for the j = −Dc . (22)
fluxes of each type of ion or electron separately. From dx
these expressions equations for an interconnected trans- From Eqs. (21) and (20) an expression for the particle
port of different types of particles can be derived. In the flux which contains the component diffusion coefficient
following a phenomenological treatment and an outlook D K is obtained:
on the statistical treatment will be given. d ln a
j = −D K c . (23)
dx
A. Transport by Diffusion A comparison of Eqs. (22) and (23) gives the relationship
between D K and Fick’s diffusion coefficient D:
Fick stated an empirical relationship for the diffusion flux
j in a concentration gradient dc/d x in the x direction. d ln a
D = DK . (24)
This is known as Fick’s first law: d ln c
P1: ZBU/GJP P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN015D-711 August 2, 2001 12:38
particle in an isotropic crystal the diffusion coefficient can solid electrolytes reach these values or come near them.
be expressed by They are sometimes called super ionic conductors.
D K ,i = 16 νa 2 (38)
if there is no correlation between the jumps; ν denotes the IV. SOLID ELECTROLYTES, SOLID
jump frequency and a denotes the distance for each jump. IONIC CONDUCTORS, AND
Using Eqs. (38), (31), and (1) we obtain for the partial SOLID–SOLUTION ELECTRODES
conductivity
Having discussed in Sections II and III concentration and
σi = ci νa 2 z i2 F 2 6RT (39) mobility, which influence the conductivity of solid elec-
and for the electrical mobility: trolytes, a compilation of solid ionic conductors will be
given in this section. This compilation does not presume to
u i = z i F νa 2 6RT . (40) be complete because new solid electrolytes are discovered
From Eqs. (39) and (40) we can see that the important and developed continuously. In Fig. 4, the conductivities
quantity for the conductivity and the mobility is the prod- of some of the most important ones are shown as a function
uct of the jump frequency of an ion and the square of the of temperaure and reciprocal temperature. The conductiv-
jump distance. The jump distance is of the order of the ity of liquid sulfuric acid is included for comparison. In
lattice parameter. the following, several important solid electrolytes will be
It is possible to estimate an upper limit of the jump treated according to the type of mobile ions that cause the
frequency. The maximum jump frequency νmax , that is, ionic conductivity.
the highest frequency of change of a lattice site, results if
the particles move with thermal speed v between the lattice A. Silver Ion Conductors
sites without performing oscillations on such a site. Thus One of the first solid electrolytes exhibiting a very
νmax = v/a . (41) high ionic conductivity, found in 1914, is α-Agl. This
conducting α-phase is stable above 149◦ C and its high
For a given temperature this natural limit cannot be ex- conductivity is caused by structural disorder. A similar
ceeded. The maximum conductivity denoted as σi (max), disorder exists in RbAg4 I5 . This solid electrolyte exhibits
is then given according to Eqs. (40) and (41) as the highest silver ion conductivity at room temperature
ci vaz i2 F 2 at present. Therefore, it is of great technical interest. A
σi (max) = . (42)
6RT
Herewith we can calculate a maximum possible partial
conductivity for a substance, for example, silver iodide.
Assuming that the silver ions migrate with thermal veloc-
ity v from one lattice site to another without oscillating
at each lattice site, we get a jump frequency ν = v/a =
3.4 × 1012 s−1 at a temperature of 300◦ C, a diffusion co-
efficient of D K ,i = 5.6 × 10−5 cm2 /s, and the maximum
conductivity is σ(max) = 2.8 −1 cm−1 for a jump distance
of 1 Å. The measured conductivity is σ = 1.97 −1 cm−1 ,
which is not much less than the calculated value.
Many efforts have been made to improve this very sim-
ple but already good model. Besides jump and lattice gas
models continuous models have been made. These models
rely on the fact that the diffusion of an ion is not repre-
sented by instantaneous jumps from an equilibrium site to
another one but by a continuous motion in between.
From these considerations it can be seen that there is
a natural upper limit for the value of the ionic conduc-
tivity of solid compounds. This upper limit is between
1 and 10 −1 cm −1 corresponding to a component dif-
fusion coefficient of about 2 to 20 × 10−5 cm2 s. These FIGURE 4 Conductivity of some very common solid electrolytes;
values correspond to those in liquid electrolytes. Good H2 SO4 included for comparison.
P1: ZBU/GJP P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN015D-711 August 2, 2001 12:38
TABLE IV Conductivity of Several Silver Ion TABLE VI Conductivity of Several Sodium Ion
Conductors Conductors
Temperature σ Temperature σ
Compound (◦ C) ×102 /Ω cm)
(× Compound (◦ C) ×102 /Ω cm)
(×
Doped zirconia dioxide and thorium dioxide are impor- The first solid electrolytes with high copper ion conduc-
tant solid electrolytes that owe their conductivity to trans- tivity at room temperature were discovered in 1973. An
port of oxygen ions. They can be used between 600 and example is 7CuBrC6 H12 N4 CH3 Br, whose conductivity at
1600◦ C. They are also an interesting example of how high room temperature is 0.017 −1 cm−1 . Several other cop-
ionic conductivity can arise by processes other than struc- per ion conductors have since been described. One of these
tural disorder. The disorder centers responsible for the conductors represented by the formula Rb4 Cu16 I7 Cl13 has
ionic conductivity of zirconium dioxide are charged oxy- a conductivity of 0.34 −1 cm−1 at 25◦ C. This is the solid
gen ion vacancies. These are produced by dissolution of electrolyte with the highest conductivity at room temper-
CaO, MgO, or Y2 O3 in the zirconium dioxide. The cal- ature known at present. In Table VII the conductivity of
cium is incorporated at zirconium positions, since, how- some copper ion conductors are listed.
ever, only one oxygen ion is introduced with each cal-
cium ion, one oxygen position remains unoccupied for E. Proton Conductors
each calcium atom introduced. The amount of doping is
Solid-state proton conductors with high ionic conduc-
of the order of 10 mol%; in this way it can be seen why
tivity are eagerly sought because they could have im-
doping produces a very large number of vacancies in the
portant practical applications, for example, in fuel cells,
zirconium dioxide. In Table V the conductivities of several
water electrolyzers, and sensors. Substances with an ap-
oxygen conductors in given temperature ranges are listed.
preciable proton conductivity known today are hydrogen
uranyl phosphate tetrahydrate (HUP) HUO2 PO4 ·4H2 O—
C. Sodium Ion Conductors usable above 1◦ C—with an ionic conductivity of σ = 4 ×
The most important Na+ ion conductor is Na2 O 11Al2 O3 , 10−3 −1 cm−1 at 20◦ C and hydrogen uranyl arsenate
the so-called β-Al2 O3 . The mobile sodium ions are incor- tetrahydrate HUO2 AsO4 ·4H2 O—usable above 29◦ C—
porated into planes of the lattice and can therefore move with an ionic conductivity of σ = 6 × 10−3 −1 40◦ C.
only in two dimensions; polycrystalline material is used Another kind of solid proton conductor is the pro-
tonic β-alumina. It can be produced by exchanging the
TABLE V Conductivity of Several Oxygen Ion
Conductors TABLE VII Conductivity of Several Copper Ion
Conductors
Temperature σ
Compound (◦ C) ×102 /Ω cm)
(× Temperature σ
Compound (◦ C) ×102 /Ω cm)
(×
ZrO2 (10 mol% Sc2 O3 ) 600–1400 2–100
ZrO2 (10 mol% Y2 O3 ) 600–1400 0.3–50 Rb4 Cu16 I7 Cl13 10–110 28–62
ZrO2 (13 mol% CaO) 640–1400 0.2–30 7CuBr·C6 H12 N4 CH3 Br 10–130 1.5–14
ThO2 (7.5 mol% Y2 O3 ) 1000–1500 1.3–12 7CuCl·C6 H12 N4 HCl 20–110 0.4–5
Bi2 O3 (20 mol% Er2 O3 ) 270–730 0.001–45 17Cul·3C6 H12 N4 CH3 I 20–140 0.1–2.2
P1: ZBU/GJP P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN015D-711 August 2, 2001 12:38
whole sodium content of the solid sodium ion conductor electrochemistry of solids galvanic cells with solid elec-
Na–β-A12 O3 (see Section IV.C) for H+ , H+ (H2 O)n , or trolytes play a very important role. In analogy to galvanic
NH+ 4 ions. The ionic conductivity of these compounds is cells with liquid electrolytes, those with solid electrolytes
in the region of 10−4 –10−5 −1 cm−1 at room temperature. consist of at least two electrodes separated by an elec-
Various possible mechanisms of the conduction process in trolyte, which in this case is a solid ionic conductor. The
the different proton conductors are being discussed. important properties of such cells will be discussed. For
The longtime stability of these compounds, which is this purpose the following galvanic cell will be considered
especially important for possible technical applications, as an example:
has not been clearly established. There exist still other pO , Pt / ZrO2 (+Y2 O3 ) / Pt, pO
2 2
solid proton conductors but as their conductivity in general ←−−− ←−−−−−−− ←−−−.
is not as high as in the compounds mentioned earlier they 4e− 2O2− 4e−
the gas constant and T is the absolute temperature. Sub- One way to obtain a relation between the Gibbs energy
stitution of Eq. (46) into Eq. (45) gives: and the emf of the cell is to regard the so-called virtual
cell reaction. We assume that a certain amount of charge
RT pO (right electrode) RT pO
E= ln 2 = ln 2 . (47) is passed through the cell as a current. In this example, the
4F pO2 (left electrode) 4F p O2 cell reaction is the formation of 1 mole NiO from solid
This shows a relation existing between E, the emf of the nickel and oxygen by passing a flow of electricity of 2 f
galvanic cell, and the ratio of oxygen partial pressures at through the cell:
the two electrodes. If one oxygen partial pressure is known Ni + 12 O2 = NiO · · · G NiO .
the other one can be determined.
According to Eqs. (45)–(47) the emf of a galvanic cell This reaction does not take place under open-circuit
contains different thermodynamic information. The emf conditions. It would take place under current flow, but
makes it possible to determine the Gibbs energy of the cell then, in general, polarization effects will occur. The maxi-
reaction and the chemical potentials of the electrode com- mum possible electrical energy that we could obtain from
ponents or the partial pressures of gases. It shall be men- the cell for the virtual reaction is the measured emf under
tioned here that reaction enthalpies and reaction entropies open-circuit conditions multiplied by 2 f in this example.
can also be deduced from the temperature dependence of This electrical energy is related to the Gibbs energy G
the emf. More details on thermodynamic investigations of the cell reaction; in this case to the Gibbs energy of
will be given in Section V.A. formation of NiO from solid nickel and oxygen by:
The kinetic properties of galvanic cell I with doped
G 0NiO = −2EF. (49)
zirconia as solid electrolyte arise from the fact that the
flux of current through a cell such as cell I is a measure of Here G 0NiO is written because NiO as well as Ni and O2
the reaction rate by which oxygen is passed from one side are in their standard states. Similar investigations have
of the cell to the other. Only oxygen ions can flow through been carried out on many other systems, for exam-
the electrolyte when the electrical circuit is closed. For the ple, Cu2 O, FeO, PbO, In2 O3 , WO2 , ZnO, SiO2 , MoO2 ,
rate of transport of the O2− particles in moles per unit time NiCl2 O4 , FeCr2 O4 , NiAl2 O4 , MgF2 , ThF4 , UF3 , and AlF3 .
J through the electrolyte we can write: Furthermore, enthalpies and entropies of reaction as well
as partial molar enthalpies and entropies and partial molar
J = I /z F, (48)
volumina can be measured using similar cells.
where I is the electrical current; in the case of oxygen the
valence z is −2.
B. Kinetic Investigations
The transport of mass in the form of ions through the
electrolyte can often be attributed to a chemical reaction According to the electrochemistry of liquids kinetic in-
or a transport process at an electrode. In this way reaction vestigations using solid electrolytes can be carried out in
rates can be measured electrically. It is often possible to different ways.
analyze reaction mechanisms in detail by a combination
of rate measurements by means of the electrical current
with measurements of thermodynamic quantities—in par- 1. Measurements at Zero Current
ticular, chemical potentials—by means of the emf of the In the case of zero-current measurements the electrical
galvanic cell. More details on kinetic investigations using potential difference between the two end phases of the cell
galvanic cells will be given in Section V.B. is measured under open circuit conditions. Information
about thermodynamic quantities of reaction systems, for
example, about chemical potentials, activities, or partial
A. Thermodynamic Investigations
pressures, is obtained from such measurements. This was
As a typical example for thermodynamic investigations already described.
using solid electrolytes the determination of the Gibbs
energy G 0NiO at temperatures of 800–1000◦ C will be con-
sidered. The following cell with doped ZrO2 as solid elec- 2. Measurements under Steady-State Conditions
trolyte for oxygen ions can be used: In this case there is no time dependence of currents and
Pt, Ni, NiO / ZrO2 (+Y2 O3 ) / Pt, pO2 = 1atm
potentials. Therefore, it is experimentally unimportant
2f: ←−−−−− ←−−−−−−− ←−−−−−−−. whether the potentials or the currents are controlled. Here
2e− O2− 2e− the steady-state current that represents a reaction rate may
II be measured as a function of the potential or vice versa.
P1: ZBU/GJP P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN015D-711 August 2, 2001 12:38
3. Measurements under Controlled Potential TABLE VIII Determination of the Diffusion Coefficient
of Oxygen in Various Solid and Liquid Metals
In this case a constant potential difference is applied to the
cell and the resulting current is measured as a function of Solid Temperature Diffusion coefficient
Metal or liquid range (◦ C) (cm2 /s)
time. These investigations are called potentiostatic.
Ag s 760–900 1.5 × 10−5 –2.9 × 10−5
Ag l 970–1200 8.2 × 10−5 –1.7 × 10−4
4. Measurements under Controlled Current
Cu l 990–1220 1.4 × 10−4 –2.2 × 10−4
In this case a constant current is passed through the cell and Cu s 800–1030 9.3 × 10−6 –3.5 × 10−5
the resulting potential difference is measured as a function Sn l 730–930 4.5 × 10−5 –7.4 × 10−5
of time. These investigations are called galvanostatic. If Ni s 1393 1.3 × 10−6
galvanostatic or potentiostatic measurements or investi- Pb l 800–1100 1.0 × 10−5 –1.7 × 10−5
gations under steady-state conditions are carried out it is Fe l 1620 1.5 × 10−4
often preferable to use separate cells of the same type un- Sb l 750–950 1.4 × 10−6 –2.9 × 10−6
der zero current to measure potential differences, so that Bi l 750–950 8.6 × 10−6 –1.4 × 10−5
the reference electrode is not polarized by a current. In the
following, a few examples of kinetic investigations using
solid electrolytes will be discussed. tions. Several systems have been investigated in this way.
Some results are shown in Table VIII.
C. Electrochemical Measurements
D. Measurements of Chemical
of Oxygen Diffusion in Metals
Diffusion Coefficients
The principle of the electrochemical measurement of oxy-
The process of attaining a uniform composition, which
gen diffusion in a metal consists in bringing the metal
occurs in compounds where an existing gradient of sto-
from a well-defined state into another well-defined state
ichiometry is allowed to equalize, can be described by
and following the diffusion-controlled relaxation pro-
the chemical diffusion coefficient D̃ discussed in Sec-
cess electrochemically. For example, the metal sample is
tion III.D.
placed on one side of the solid electrolyte ZrO2 and func-
For such equilibration processes it is necessary, on the
tions as one electrode of a galvanic cell. On the other side
grounds of electrical neutrality, that both ions and elec-
of the electrolyte there is a practically unpolarizable elec-
trons or electron defects must migrate simultaneously
trode such as porous platinum in contact with air, or an
whereby the fluxes of ions and electrons are related to
Fe/FeO electrode, which has a fixed oxygen partial pres-
one another. The electrochemical method for the determi-
sure of about 10−19 atm at 800◦ C. The following cell may
nation of chemical diffusion coefficients D̃ will be shown
be used:
here as an example for the mixed conductor wüstite FeO.
Fe, FeO / ZrO2 (+Y2 O3 ) / Me + O(dissolved)
←−−− ←−−−−−−− ←−−−−−−−−− The basic element for the investigation of wüstite
2e− O2− O Fe1−δ O (δ = deviation from ideal stoichiometry) is the
←−−−−−−−−−. solid-state galvanic cell
2e−
pO 2 , Pt/ZrO2 (+Y2 O3 )/Fe1−δ O)/Pt
III
IV
In cell III there is an Fe/FeO electrode on one side and a
metal containing dissolved oxygen on the other side. The with doped ZrO2 as solid electrolyte, an electrode consist-
emf of the cell before beginning the experiment is a mea- ing of porous platinum in contact with air at one side and
sure of the initial activity or concentration of the dissolved the wüstite being investigated as electrode at the other side.
oxygen. At a certain time an emf is applied to the cell to The experimental setup is shown in Fig. 5. The principle
make the oxygen activity at the metal/electrolyte interface of the measurement is that, starting from a suitable initial
very small. Then oxygen diffuses out of the metal and is state, the potential difference E of cell IV or the current I ,
carried as an electrical current through the electrolyte to respectively, are varied systematically. The other variable
the other side of the cell. In this way the diffusion cur- I or E is measured as a function of time. From the obtained
rent is transformed into an electrical current and can be results the chemical diffusion coefficient D̃ of wüstite can
measured. be calculated. In a potentiostatic experiment a definite
From the time dependence of the current the diffusion value of the chemical potential of oxygen, correspond-
coefficient can be calculated using suitable diffusion equa- ing to a certain deviation δ from the ideal stoichiometry,
P1: ZBU/GJP P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN015D-711 August 2, 2001 12:38
E. Electrochemical Investigations of
Formation of Surface Layers on Metals
FIGURE 5 Experimental setup for the measurement of the
chemical-diffusion coefficient of wüstite: W, working electrode; R,
The formation of nickel sulfide on nickel will be discussed
reference electrode; C, counter electrode. as an example. The experimental arrangement is shown
in Fig. 7. Silver iodide was used as the solid electrolyte,
is set up in the wüstite before starting the measurement. being a pure Ag+ ionic conductor under the experimen-
Then a sudden change of the potential of the cell stipulated tal conditions. The negative pole of a power source was
potentiostatically causes a current flow that is measured connected to the left-hand side of the arrangement and
as a function of time. This current is primarily a measure the positive pole to the right-hand side. The electrolytic
of the addition or removal of oxygen at the phase bound- cell itself consisted of tablets pressed together in a glass
ary ZrO2 /wüstite. As a consequence of this, iron diffuses tube furnace flushed with nitrogen. An electrical current
within the wüstite to or from the phase boundary. Thus, passing through the cell is a measure of the rate at which
the current is equivalent to the diffusion current of iron, silver is removed from the silver sulfide, since silver ions
During this process the stoichiometry in the compound migrate through the AgI and electrons through the exter-
changes with time until a new δ value is attained. From nal circuit. In this case, however, the rate of loss of the
the solution of the diffusion equations for this problem the silver corresponds to the rate of the formation of nickel
chemical diffusion coefficient D̃ can be calculated. sulfide on nickel, since in this reaction nickel displaces
The result obtained for wüstite at 1000◦ C and a de- the silver from the Ag2 S. The experiments were carrid out
viation δ = 0.106 from ideal stoichiometry is D̃ = 3.2 × using the galvanostatic and potentiostatic methods. The
10−6 cm2 /s. Similar investigations with an improved ex- important quantities are the current, which is a measure
perimental setup allowed the determination of the chemi- of the reaction rate (here of the formation of nickel sul-
cal diffusion coefficient D̃ of Ag2+δ S over the total range fide), and the cell emf, which is not only a measure of
of stoichiometric composition of this compound. the chemical potential of the silver in silver sulfide, but,
FIGURE 6 Chemical-diffusion coefficients of Ag2+δ S as a function of the deviation δ from stoichiometry at 200 and
300◦ C.
P1: ZBU/GJP P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN015D-711 August 2, 2001 12:38
RT aAg (AgCl)
E= ln , (56)
F aAg (Ag)
where aAg (AgCl) denotes the activity of silver in the right-
hand AgCl-electrode, and aAg (Ag) denotes the activity of
silver in the left-hand silver electrode, respectively, which
has a value of one in this case because of the pure Ag metal.
The activity of silver in AgCl, which is in equilibrium
with the gas-atmosphere, having a certain chlorine partial
pressure pCl2 , is given by
−1/2
aAg (AgCl) = pCl2 /po exp G oAgCl RT , (57)
where p o = 1 atm and G oAgCl is the Standard-Gibbs en-
ergy of formation of AgCl. Inserting Eq. (57) into Eq. (56),
we obtain a relation between the chlorine partial pressure
to be determined and the measured emf E
RT
E= ln pCl2 /po − G oAgCl F . (58) FIGURE 11 Schematic diagram of a RbAg4 I5 accelerometer.
2F
Also, the partial pressures of other gases, for example,
B. Solid Electrolyte Batteries:
NO2 , O2 , or sulfur, can be measured by such sensors.
The Sodium–Sulfur Cell
Another method of determining the partial pressure of
a gas is given by measuring the current flowing through The principle of a sodium–sulfur cell is shown in Fig. 12.
a suitable galvanic cell with a solid electrolyte. This prin- The solid electrolyte is a Na+ ion conductor, consisting
ciple will be discussed in the following exemplified at an of β-Al2 O3 . It is generally used as a tube closed at one
oxygen sensor. In principle, cell I discussed in Section V end and filled with liquid sodium as the anode. An iron
can be used, where pO 2 is the oxygen partial pressure to sponge, which absorbs the liquid sodium, serves to im-
be measured. In this case, a current is passed through the prove the wetting of the electrolyte and to improve safety.
cell so that oxygen is transported from the right-hand side A metal wire leads out of the anode to carry the current.
of the cell to the left-hand side. For this an outer electri- The cathode consists of liquid sodium polysulfide and sul-
cal voltage has to be applied to the cell. To get a defined fur inserted in porous graphite. The working temperature
correlation between pO 2 and the electrical current flowing of the sodium–sulfur cell is around 300◦ C.
through the cell, a so-called “diffusion-barrier” has to be In the cell reaction sodium ions pass through the elec-
arranged in front of the electrode at the right-hand side. trolyte and electrons through the external circuit, so that
This may, for example, consist of a porous ceramic ma- sodium is dissolved in sodium polysulfide. In this way
terial. The external electrical voltage is chosen in such a electrical energy can be liberated. The energy density of
way that each oxygen molecule reaching the surface of the sodium–sulfur cell is many times greater than that
the right-hand electrode immediately reacts electrochem-
ically to an oxygen ion, which is then transported through
the solid electrolyte. Under these conditions, the flowing
current is proportional to the partial pressure pO 2 and so
can be used to measure pO 2 .
Another type of sensor—an accelerometer—can be
constructed by applying the principles used in the case
of the investigations on the forces of inertia of the mobile
ions in solid ion conductors, described in Section V.F. As
shown in Eq. (52) the voltage U measured at both ends of a
rod of a solid electrolyte is a direct measure of the acceler-
ation acting on this rod at every time; this means that such
an arrangement, consisting of a rod of RbAg4 I5 as solid
electrolyte with silver wires at both ends as electrodes for
measuring the voltage U , can be used as an accelerometer.
This is of interest for practical applications. The principle
of such an accelerometer is shown in Fig. 11. FIGURE 12 Sodium–sulfur cell.
P1: ZBU/GJP P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN015D-711 August 2, 2001 12:38
of the customary lead batteries, and the materials needed high-temperature fuel cell if required. This principle is be-
for the electrolytes and electrodes are available in large ing discussed in connection with large-scale energy stor-
quantities. The cell can be recharged by changing the di- age and transport of energy.
rection of the current. The sodium–sulfur cell is of great
interest for large-scale energy storage and for electrotrac- D. Chemotronic Components
tion for electric vehicles. Prototypes have already been
built. In addition to the sodium–sulfur cell other cell sys- Galvanic cells containing solid electrolytes, which find use
tems have been developed using other solid electrolytes. in electrical circuits, are often called chemotronic com-
ponents. Next we describe as an example coulometers
and time switches. However, there exist more chemotronic
C. Fuel Cells and Electrolyzers building units containing solid electrolytes such as analog
Figure 13 shows schematically a high-temperature fuel memories and capacitors. They will not be described here.
cell incorporating zirconium dioxide as the solid elec- The galvanic cell
trolyte. The zirconium dioxide in a tube or disk form Ag/RbAg4 I5 /Au
separates two electrode compartments, one containing air
or pure oxygen and the other a fuel gas (e.g., hydro- VI
gen). The zirconium oxide carries two porous electrodes: can be used as a coulometer or time switch. Here the
nickel can be used on the fuel side, and on the oxygen electrolyte RbAg4 I5 is a good Ag+ ion conductor even at
side lanthanum–nickel oxide or some other electron- room temperature. By passing a current through this cell
conducting oxide. with the negative pole at the silver side, silver is deposited
In a high-temperature fuel cell the oxygen takes up elec- on the gold electrode. The time switch is then in the loaded
trons at one electrode, then passes through the electrolyte state. The silver can be transported back to the original sil-
as ions and combines at the other electrode with H2 to give ver electrode by a current flowing in the reverse direction.
H2 O, whereby the electrons are given up and flow again In this stripping process the cell potential is determined
to the other electrode in the external current circuit. In this mainly by ohmic losses in the solid electrolyte that lie in
way energy is made available to the user. The advantages the millivolt range. When all the silver has been stripped
of the high-temperature fuel cell are that little or no polar- from the gold electrode the cell shows a sudden rise in po-
ization occurs at the electrodes and high current densities tential, which can be used as a signal. Such electrochemi-
can be achieved. cal switches are suitable for times in the region of seconds
By reversing the direction of the current flow in a high- to months. Cell VI can also be used as a coulometer; the
temperature fuel cell, that is, by supplying the cell with amount of a current flowing through the cell during charg-
electrical energy, steam can be decomposed and the cell ing is then determined by the discharge process.
can thus be used as an electrolyzer. The hydrogen pro-
duced can be stored or conducted by pipelines to remote
sites where it can serve for the production of energy in a SEE ALSO THE FOLLOWING ARTICLES
BIBLIOGRAPHY
Superacids
George A. Olah
G. K. Surya Prakash
University of Southern California
175
P1: GRB/MAG P2: GTY/GJP P3: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-745 July 31, 2001 14:33
176 Superacids
by Gillespie, all Brönsted (protic) acids stronger than A number of methods are available for estimating acid-
100% sulfuric acid are classified as superacids. Vari- ity of protic acids in solution. The best known is the direct
ous methods are available to measure protic superacid measurement of the hydrogen ion activity used in defining
strengths (vide infra). Lewis acids also cover a wide pH [Eq. (1)].
range of acidities extending beyond the strength of the
pH = log aH+ . (1)
most frequently used systems such as AlCl3 and BF3 .
Olah et al. (1985) suggested the use of anhydrous alu- This can be achieved by measuring the potential of a hy-
minum trichloride, the most widely used Friedel–Crafts drogen electrode in equilibrium with a dilute acid solu-
catalyst, as the arbitrary unit to define Lewis superacids. tion. In highly concentrated acid solutions, however, the
Lewis acids stronger than anhydrous aluminum trichloride pH concept is no longer applicable, and the acidity must
are considered Lewis superacids. There remain, however, be related very closely to the degree of transformation
many difficulties in measuring the strength of Lewis acid of a base with its conjugate acid, keeping in mind that
(vide infra). this will depend on the base itself and on medium effects.
The high acidity and the extremely low nucleophilicity The advantage of this method was shown in the 1930s by
of the counterions of superacidic systems are especially Hammett and Deyrup, who investigated the proton donor
useful for the preparation of stable, electron-deficient ability of the H2 O–H2 SO4 system over the whole concen-
cations, including carbocations. Many of these cations, tration range by measuring the extent to which a series of
which were formerly suggested only as fleeting metastable nitroanilines were protonated. This was the first applica-
intermediates and were detectable only in the gas phase tion of the very useful Hammett acidity function [Eq. (2)].
in mass spectrometric studies, can be conveniently stud-
BH+
ied in superacid solutions. New chemical transformations H0 = pKBH+ − log . (2)
and syntheses that are not possible using conventional B
acids can also be achieved with superacids. These include The pKBH+ is the dissociation constant of the conjugate
transformations and syntheses of many industrially im- acid (BH+ ) and BH+ /B is the ionization ratio, which is
portant hydrocarbons. The unique ability of superacids to generally measured by spectroscopic means [ultraviolet,
bring about hydrocarbon transformations, even to activate nuclear magnetic resonance (NMR), and dynamic NMR].
methane (the principal component of natural gas) for elec- Hammett’s “H0 ” scale is a logarithmic scale on which
trophilic reactions, has opened up a fascinating new field 100% sulfuric acid has an H0 value of −12.0.
in chemistry. Various other techniques are also available for acidity
measurements of protic acids. These include electrochem-
ical methods, kinetic rate measurements, and heats of pro-
I. ACID STRENGTH AND ACIDITY SCALE tonation of weak bases. Even with all these techniques it
is still difficult to measure the acidity of extremely acidic
The chemical species that plays the key role in Brönsted superacids, because of the unavailability of suitable weak
acids is the hydrogen ion, that is, the proton: H+ . Since the reference bases.
proton is the hydrogen nucleus with no electron in its 1s In contrast to protic (Brönsted) acids, a common quan-
orbital, it is not prone to electronic repulsion. The proton titative method to determine the strength of Lewis acids
consequently exercises a powerful polarizing effect. Due does not exist. Whereas the Brönsted acid–base interac-
to its extreme electron affinity, proton cannot be found tion always involves a common denominator—the proton
as a free “naked” species in the condensed state. It is al- (H+ ) transfer, which allows direct comparison—no such
ways associated with one or more molecules of acid or the common relationship exists in the Lewis acid–base inter-
solvent (or any other nucleophile present). The strength of action. The result is that the definition of “strength” has
protic acid thus depends on the degree of association of the no real meaning with Lewis acids.
proton in the condensed state. Free protons can exist only The “strength” or “coordinating power” of different
in the gas phase and represent the ultimate acidity. Due Lewis acids can vary widely against different Lewis bases.
to the very small size of a proton (105 times smaller then Despite the apparent difficulties, a number of qualitative
any other cation) and the fact that only 1s orbital is used relationships have been developed to characterize Lewis
in bonding by hydrogen, proton transfer is a very facile acids. Schwarzenbach and Chatt classified Lewis acids
reaction, reaching diffusion-controlled rates, and does not into two types: class a and class b. Class a Lewis acids
necessitate important reorganization of the electronic va- form their most stable complexes with the donors in the
lence shells. Understanding the nature of the proton is first row of the periodic table—N, O, and F. Class b acids,
important when generalizing quantitative relationships in on the other hand, complex best with donors in the sec-
acidity measurements. ond or subsequent row—Cl, Br, I, P, S, etc. Guttmann has
P1: GRB/MAG P2: GTY/GJP P3: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-745 July 31, 2001 14:33
Superacids 177
introduced a series of donor numbers (DN) and acceptor The complete description of the acidic properties of
numbers (AN) for various solvents in an attempt to quan- a solid requires the determination of the acid strengths
tify complexing tendencies of Lewis acids. Based on a as well as the number of acid sites. The methods that
similar premise, Drago came up with parameter E, which have been used to answer these questions are basically the
measures the covalent bonding potential of each series of same as those used for the liquid acids. Three methods
Lewis acids as well as bases. are generally quoted: (1) rate measurement to relate the
Pearson has proposed a qualitative scheme in which a catalytic activity to the acidity, (2) the spectrophotometric
Lewis acid and base are characterized by two parameters, method to estimate the acidity from the color change of
one of which is referred to as strength and the other as soft- adequate indicators, and (3) titration by a strong enough
ness. Thus, the equilibrium constant for a simple Lewis base for the measurement of the amount of acid. The above
acid–base reaction would be a function of four parameters, experimental techniques vary somewhat, but all the results
two for each partner. Subsequently, Pearson introduced the obtained should be interpreted with caution because of
hard and soft acids and bases (HSAB) principle to ratio- the complexity of the solid acid catalysts. The presence of
nalize behavior and reactivity in a qualitative way. Hard various sites of different activity on the same solid acid,
acids correspond roughly in their behavior to Schwarzen- the change in activity with temperature, and the difficulty
bach and Chatt’s class a acids. They are characterized of knowing the precise structure of the catalyst are some of
by small acceptor atoms that have outer electrons that the major handicaps in the determination of the strength
are not easily excited and that bear a considerable pos- of solid superacids.
itive charge. Soft acids, which correspond to class b acids,
have acceptor atoms of a lower positive charge and a large
size, with easily excited outer electrons. Hard and soft II. SUPERACID SYSTEMS
bases are defined accordingly. Pearson’s HSAB principle
states that hard acids prefer to bind to hard bases and soft Following Conant’s early work, the field of superacids,
acids prefer to bind to soft bases. The principle has proved which had been dormant till the late 1950s, started to un-
useful in rationalizing and classifying a large number of dergo rapid development in the early 1960s, involving the
chemical reactions involving acid–base interactions in a discovery of new systems and an understanding of their
qualitative manner, but it gives no basis for quantitative nature as well as their chemistry. As mentioned, superacids
treatment. encompass both Brönsted and Lewis types and their con-
Many attempts have been made in the literature to jugate combinations.
rate qualitatively the activity of Lewis acid catalysts in
A. Brönsted Superacids
Friedel–Crafts-type reactions. However, such ratings de-
pend largely on the nature of the reaction for which the Using Gillespie’s arbitrary definition, Brönsted superacids
Lewis acid catalyst is employed. are those with an acidity exceeding that of 100% sulfuric
Thus, the classification of Lewis superacids as those acid (H0 , −12). These include perchloric acid (HClO4 ),
stronger than anhydrous aluminum trichloride is only ar- fluorosulfuric acid (FSO3 H), trifluoromethanesulfonic
bitrary. Just as in the case of Gillespie’s classification of acid (CF3 SO3 H), and higher perfluoroalkanesulfonic acid
Brönsted superacids, it is important to recognize that acids (Cn Fn +2 SO3 H). Physical properties of some of the most
stronger than conventional Lewis acid halides exit, with commonly used superacids are listed in Table I.
increasingly unique properties. Studies by Gillespie have shown that truly anhydrous
Another area of difficulty is measuring the acid strength hydrogen fluoride (HF), which is extremely difficult to
of solid superacids. Since solid superacid catalysts are obtain in the pure form, has a Hammett acidity constant
used extensively in the chemical industry, particularly in (H0 ) of −15.1 rather than the −11.0 found for the usual
the petroleum field, a reliable method for measuring the anhydrous acid. However, traces of water impurity drop
acidity of solids would be extremely useful. The main the acidity to the generally observed value. Thus for prac-
difficulty to start with is that the activity coefficients for tical purposes, hydrogen fluoride, which always contains
solid species are unknown and thus no thermodynamic some water impurity, is not discussed here, as its acidity
acidity function can be properly defined. On the other of H0 = −11.0 is lower than that of H2 SO4 .
hand, because the solid by definition is heterogeneous, Teflic acid (TeF5 OH) has been suggested to have
acidic and basic sites can coexist with variable strength. an acidity comparable to that of fluorosulfuric acid.
The surface area available for colorimetric determinations However, no concrete acidity measurements are available
may have acidic properties widely different from those of to support such a claim. A number of carbocationic
the bulk material; this is especially true for well-structured salts bearing carborane anions [CB11 H6 Cl− 6 , etc.] have
solids such as zeolites. been studied. However, their parent Bronsted acids,
P1: GRB/MAG P2: GTY/GJP P3: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-745 July 31, 2001 14:33
178 Superacids
which can be considered as potential superacids, are still bond. It is prepared by the direct combination of sulfur tri-
unknown. oxide and dry hydrogen chloride gas. The reaction is very
exothermic and reversible, making it difficult to obtain
1. Perchloric Acid (HClO4 ) chlorosulfuric acid free of SO3 and HCl. On distillation,
Commercially, perchloric acid is manufactured by either even in a good vacuum, some dissociation is inevitable.
reaction of alkali perchlorates with hydrochloric acid or The acid is a powerful sulfating and sulfonating agent as
direct electrolytic oxidation of 0.5 N hydrochloric acid. well as a strong dehydrating agent and a specialized chlo-
Another commercially attractive method is the direct elec- rinating agent. Because of these properties, chlorosulfuric
trolysis of chlorine gas (Cl2 ) dissolved in cold, dilute per- acid is rarely used for its protonating superacid properties.
chloric acid. Perchloric acid is commercially available in a Gillespie and co-workers have measured systematically
concentration of 70% (by weight) in water, although 90% the acid strength of the H2 SO4 –ClSO3 H system using aro-
perchloric acid also had limited availability (due to its ex- matic nitro compounds as indicators. They found an H0
plosive hazard, it is no longer provided at this strength); value of −13.8 for 100% ClSO3 H.
for 70–72% HClO4 , an azeotrope of 28.4% H2 O, 71.6%
HClO4 , boiling at 203◦ C is safe for usual applications. It 3. Fluorosulfuric Acid (HSO3 F)
is a strong oxidizing agent, however, and must be handled Fluorosulfuric acid, HSO3 F, is a mobile colorless liquid
with care. Anhydrous acid (100% HClO4 ) is prepared by that fumes in moist air and has a sharp odor. It may be
vacuum distillation of the concentrated acid solution with regarded as a mixed anhydride of sulfuric and hydroflu-
a dehydrating agent such as Mg(ClO4 )2 . It is stable only oric acid. It has been known since 1892 and is prepared
at low temperatures for a few days, decomposing to give commercially from SO3 and HF in a stream of HSO3 F. It
HClO4 · H2 O (84.6% acid) and ClO2 . is readily purified by distillation, although the last traces
Perchloric acid is extremely hygroscopic and a very of SO3 are difficult to remove. When water is excluded,
powerful oxidizer. Contact of organic materials with an- it may be handled and stored in glass containers, but for
hydrous or concentrated perchloric acid can lead to violent safety reasons the container should always be cooled be-
explosions. For this reason, the application of perchloric fore opening because gas pressure may have developed
acid has serious limitations. The acid strength, although from hydrolysis.
not reported, can be estimated to be around H0 = −13 for
the anhydrous acid. HSO3 F + H2 O
H2 SO4 + HF
Although various cation salts can be prepared with per- Fluorosulfuric acid generally also contains hydrogen
chlorate gegen ions, the ionic salts tend to be unstable (ex- fluoride as an impurity, but according to Gillespie the
plosive) due to their equilibria with covalent perchlorates. hydrogen fluoride can be removed by repeated distillation
The main use of perchloric acid is in the preparation of under anhydrous conditions. The equilibrium HSO3 F
its salts, such as NH+ −
4 ClO4 , a powerful oxidant in rocket SO3 + HF always produces traces of SO3 and HF in stored
fuels and pyrotechniques. HSO3 F samples. When kept in glass for a long time, SiF4
and H2 SiF6 are also formed (secondary reactions due
2. Chlorosulfuric Acid (ClSO3 H)
to HF).
Chlorosulfuric acid, the monochloride of sulfuric acid, is Fluorosulfuric acid is employed as a catalyst and
a strong acid containing a relatively weak sulfur–chlorine chemical reagent in various chemical processes including
P1: GRB/MAG P2: GTY/GJP P3: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-745 July 31, 2001 14:33
Superacids 179
180 Superacids
The polymeric structure of the liquid SbF5 has been es- ciation, but it is a monomeric covalent compound with
tablished by 19 F NMR spectroscopy and is shown to have a high degree of coordinating ability. It is prepared by
the following frameworks: a cis-fluorine bridged structure reacting fluorine with arsenic metal or arsenic trifluo-
is found in which each antimony atom is surrounded by ride. As a strong Lewis acid fluoride, it is used in the
six fluorine atoms in an octahedral arrangement. preparation of ionic complexes and, in conjunction with
Brönsted acids, forms conjugate superacids. It also forms,
with graphite, stable intercalation compounds that show an
electrical conductivity comparable to that of silver. Great
care should be exercised in handling any arsenic com-
pound because of its potential high toxicity.
Superacids 181
+ −
their conjugate acids in HSO3 F solution. All the above HB(HSO4 )4 + H2 SO4
H3 SO4 + B(HSO4 )4
conjugate superacids were found to be highly conducting
and strongly ionizing over the entire conecentration range. The increase in acidity is, however, limited to H0 = −13.6
as a result of insoluble complexes that precipitate when
C. Conjugate Brönsted–Lewis Superacids the concentration of the boric acid approaches 30 mol%.
TABLE IV H0 Values for the H2 SO4 –SO3 System SbF5 + SbF5 (SO3 F)−
Sb2 F10 (SO3 F)−
Mol% SO3 H0 Mol% SO3 H0 Mol% SO3 H0 Due to these equilibria, the composition of the HSO3 F:
SbF5 system is very complex and depends on the SbF5 con-
1.00 −12.24 25.00 −13.58 55.00 −14.50
tent. Aubke and co-workers have investigated the struc-
2.00 −12.42 30.00 −13.76 60.00 −14.74
tures of complex anions in the Magic Acid system by
5.00 −12.73 35.00 −13.94 65.00 −14.84
modern 19 F NMR studies.
10.00 −13.03 40.00 −14.11 70.00 −14.92
The major reason for the wide application of the Magic
15.00 −13.23 45.00 −14.28 75.00 −14.90
Acid system compared with others (besides its very high
20.00 −13.41 50.00 −14.44
acidity) is probably the large temperature range in which
P1: GRB/MAG P2: GTY/GJP P3: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-745 July 31, 2001 14:33
182 Superacids
it can be used. In the liquid state, NMR spectra have been is the H3 F+ +
2 ions. The H2 F ion is observed only in highly
recorded from temperatures as low as −160◦ C (acid di- concentrated solutions (40–100 mol% SbF5 ), contrary to
luted with SO2 F2 and SO2 ClF) and up to +80◦ C (neat the widespread belief that it is the only proton-solvated
acid in a sealed NMR glass tube). Glass is attacked by the species in HF:SbF5 solutions.
acid very slowly when moisture is excluded. The Magic Ionization in dilute HF solutions of SbF5 (1–20% SbF5 )
Acid system can also be an oxidizing agent that results in is thus
reduction to antimony trifluoride and sulfur dioxide. On
− +
occasion this represents a limitation. SbF5 + 3HF
SbF6 + H3 F2
Superacids 183
10. HSO3 F–Au(SO3 F)3 and HSO3 F–Pt(SO3 F)4 tion studies indicate that the HF:TaF5 system is a weaker
superacid than HF:SbF5 .
These superacids based on gold and platinum have been
developed. They show a high acidity and good thermal
stability. However, the high cost of the metals involved 13. Hydrogen Fluoride–Boron Trifluoride
precludes their widespread use. (Tetrafluoroboric Acid)
Boron trifluoride ionizes anhydrous HF as follows:
−
11. Perfluoroalkanesulfonic Acid-Based Systems BF4 + H2 F+
BF3 + 2HF
a. CF3 SO3 H:SbF5 . CF3 SO3 H:SbF5 (n = 1) was in- The stoichiometric compound exists only in an excess
troduced by Olah as an effective superacid catalyst for iso- of HF or in the presence of suitable proton acceptors. The
merizations and alkylations. The composition and acidity HF:BF3 (fluoroboric acid)-catalyzed reactions cover many
of systems where n = 1, 2, 4 have been studied by Com- of the Friedel–Crafts type reactions. One of the main ad-
meyras and co-workers. The change in composition of the vantages of this system is the high stability of HF and BF3
triflic acid–antimony pentafluoride system depending on and their nonoxidizing nature. Both are gases at room
the SbF5 content has been studied. For the 1:1 composi- temperature and are easily recovered from the reaction
tion, the main counteranion is [CF3 SO3 SbF5 ]− , and for mixtures. Acidity measurements of the HF:BF3 system
the 1:2 composition [CF3 SO3 (Sb2 F11 )]− is predominant. have been limited to electrochemical determinations, and
With increasing SbF5 concentration, the anionic species a 7 mol% BF3 solution was found to have an acidity of
grow larger and anions containing up to 5 SbF5 units H0 = −16.6. This indicates that BF3 is a much weaker
have been found. In no circumstances could free SbF5 be Lewis acid compared with either SbF5 or TaF5 . Never-
detected. theless, the HF:BF3 system is strong enough to proto-
nate many weak bases and is an efficient and widely used
b. CF3 SO3 H:B(SO3 CF3 )3 . The acidity of triflic acid catalyst.
can also be substantially increased by the addition of boron
triflate B(OSO2 CF3 )3 as indicated by Engelbrecht and 14. Conjugate Friedel–Crafts Acids
Tschager. The increase in acidity is explained by the ion- (HBr:AIBr3 , HCl:AICl3 , Etc.)
ization equilibrium:
The most widely used Friedel–Crafts catalyst systems are
B(OSO2 CF3 )3 + 2HSO3 CF3 HCl:AlCl3 and HBr:AlBr3 . These systems are indeed su-
+
2HSO3 CF3 + − B(SO3 CF3 )4
peracids by Gillespie’s definition. However, experiments
directed toward preparation from aluminium halides and
The measurements were limited due to the lack of a hydrogen halides of the composition HAlX4 were unsuc-
suitable indicator base, and even 1,3,5-trinitrobenzene the cessful in providing evidence that such conjugate acids
weakest base used, was fully protonated (H0 ≈ −18.5) in are formed in the absence of proton acceptor bases.
a 22 mol% solution of boron triflate. The acid system
has found many synthetic applications, due mainly to the
efforts of Olah and co-workers. D. Solid Superacids
The acidic sites of solid acids may be of either the Brönsted
(proton donor, often OH group) or the Lewis type (electron
12. Hydrogen Fluoride–Tantalum Pentafluoride
acceptor). Both types have been identified by IR studies of
HF:TaF5 is a catalyst for various hydrocarbon conversions solid surfaces absorbed with pyridine. Various solids dis-
of practical importance. In contrast to antimony pentaflu- playing acidic properties, whose acidities can be enhanced
oride, tantalum pentafluoride is stable in a reducing en- to the superacidity range, are listed in Table V.
vironment. The HF:TaF5 superacid system has attracted
attention mainly through the studies concerning alkane
1. Immobilized Superacids
alkylation and aromatic protonation. Generally, heteroge-
(Bound to Inert Supports)
neous mixtures such as 10:1 and 30:1 HF:TaF5 have been
used because of the low solubility of TaF5 in HF (0.9% Ways have been found to immobilize and/or to bind su-
at 19◦ C and 0.6% at 0◦ C). For this reason, acidity mea- peracidic catalysts to an otherwise inert solid support.
surements have been limited to very dilute solutions, and These include graphite intercalated superacids. Graphite
an H0 value of −18.85 has been found for the 0.6% solu- possessing a layered structure can form intercalation com-
tion. Both electrochemical studies and aromatic protona- pounds with Lewis acids such as AsF5 and SbF5 . These
P1: GRB/MAG P2: GTY/GJP P3: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-745 July 31, 2001 14:33
184 Superacids
RHX ROH2
RH2 R2OH
ArH2 RCHCH3
RX ROH
RH R2O
ArH RCHCH2
R2CO RSH
R2C OH RSH2
HSO3F-SbF5
RCHO R2S
RCH OH or HF-SbF5 R2SH
(RO)2CO RCOOH
X
R RCOOH2
RCONR2 RCOOR′ R2CX2 X∆
(RO)2C OH R RCOH2O
RCONR2 X
H RCOOR′ R2CX R
H
R
FIGURE 1 Some ways of generating carbocations generated in superacids.
P1: GRB/MAG P2: GTY/GJP P3: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-745 July 31, 2001 14:33
Superacids 185
186 Superacids
Superacids 187
H
By similar procedures aromatics are also hydroxylated in
HF-SbF5 high yields at low temperatures.
O O
H H2O2
HSO3F/SO2ClF
HF-TaF5
isopentane CH
H2
5. Superelectrophilic Activation
Electrophiles such as NO+ + +
2 , CH3 CO , and H3 O can be
3. Carbonylation
further activated in strong protic acids to their respective
The reaction between carbocations and carbon monoxide dications: NO2 H2+ , CH3 COH2+ , and H4 O2+ . Such su-
affording oxocarbenium ions (acyl cations) is a key step perelectrophiles are responsible for the high electrophilic
in the well-known Koch–Haaf reaction for preparing car- reactivity in superacids. For example, acetyl cation is a
boxylic acids from alkenes. This reaction has been exten- poor acetylating agent for chlorobenzene in trifluoroacetic
sively studied under superacidic conditions. An example acid. However, in superacidic trifluoromethanesulfonic
is indicate below. acid medium, acetylation takes place with ease.
P1: GRB/MAG P2: GTY/GJP P3: GLQ Final Pages
Encyclopedia of Physical Science and Technology EN016A-745 July 31, 2001 14:33
188 Superacids
Many acid-catalyzed reactions can be advantageously car- Elemental sulfur, selenium, and tellurium give colored so-
ried out using solid superacids instead of conventional lutions when dissolved in a number of strongly acidic me-
acid systems. The reactions can be carried out in ei- dia. It has been shown that S2+ 2+ 2+ 2+ 2+
16 , S8 , S4 , Se8 , Te4 ,
2+
ther the gaseous or the liquid phase. Using the example and Te6 are present in such solutions. These cations
Nafion-H (a perfluoroalkane resin sulfonic acid, devel- are formed by the oxidation of elements by H2 S2 O7 or
oped by DuPont) solid acid, several simple procedures S2 O6 F2 ; for example,
−
were reported to carry out alkylation, transbromination, 4S + 6H2 S2 O7 → S2+
4 + 2HS3 O10 + 5H2 SO4 + SO2
nitration, acetalization, hydration, and so on.
Like the halogen cations, the sulfur, selenium, and
tellurium cations are highly electrophilic and undergo
J. Superacids in Inorganic Chemistry disproportionation in media with any appreciable basic
properties, although, as would be anticipated, the ease
1. Halogen Cations
of disproportionation increases in series tellurium <
It has often been postulated that the monoatomic ions I+ , selenium < sulfur.
Br+ , and Cl+ are the reactive intermediates in halogena-
tion reactions of aromatics and alkenes. The search for
4. Noble Gas Cations
the existence of such species has led to the discovery of I+
2
and other related halogen cations, which are stable in su- Noble gas cationic salts of xenon and krypton have also
peracids. The I+
2 cation may be generated by the oxidation been isolated from superacid medium. The examples
of I2 with S2 O6 F2 in HSO3 F solution, include XeF+ , Xe2 F+ + + +
3 , HCNXeF , XeOF3 , KrF , and
+
Kr2 F3 .
2I2 + S2 O6 F2 → 2I+
2 + 2SO3 F
−
Surface Chemistry
Simon R. Bare
G. A. Somorjai
University of California, Berkeley, and
Lawrence Berkeley Laboratory
. 373
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
3. Atomic-Beam Diffraction
Another technique that utilizes the principle of diffraction
is atomic- or molecular-beam diffraction. The deBroglie
wavelength λ associated with helium atoms is given by
the following:
h 0.14
FIGURE 2 Scheme of the low-energy electron diffraction experi- λ(Å) = 1/2
= , (1)
ment. (Courtesy of Lawrence Berkeley Laboratory.) (2ME) E(eV)1/2
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
FIGURE 3 Scheme of the low-energy electron diffraction apparatus employing the postacceleration technique.
FIGURE 4 LEED pattern from a Pt(111) crystal surface at (a) 51 eV, (b) 63.5 eV, (c) 160 eV, and (d) 181 eV incident
electron energy. (Courtesy of Lawrence Berkeley Laboratory.)
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
FIGURE 6 Scanning tunneling microscope picture of a clean (1 × 5) reconstructed Au(100) surface with monatomic
steps. Divisions on the crystal axes are 5 Å, with approximately 1.5 Å from scan to scan. The inset shows the LEED
pattern of the predominant (1 × 5) corrugation. (Courtesy of Lawrence Berkeley Laboratory.)
atoms in the solid, thus the beam senses a frozen lattice. If proximation, can be written analytically which leads to an
the target is amorphous each atom would sense a uniform estimate of the two-atom surface peak intensity I :
distribution of impact parameters of the ions and diffuse
scattering results. However, the scattering spectrum from a R2 −R
I =1+ 1+ 2 exp , (3)
single crystal aligned with a major symmetry axis parallel 2ρ 2ρ 2
to the beam is drastically modified from that of the amor-
phous target. The impact parameter distribution is also where ρ is the two-dimensional root mean square thermal
uniform at the first monolayer, but the first atom shad- vibrational amplitude. The first term represents the unit
ows the second from the beam, and small angle scattering contribution from the first atom in the string, the second
events determine the impact parameter distribution at the term represents the variable contribution from the second
second atom. This results in a unique (nonuniform) flux
distribution at the second atom. Figure 7 illustrates the ef-
fect of small-angle scattering in a two-atom model. Ions
incident at the smallest impact parameter undergo large-
angle scattering, those at large impact parameter suffer
small deflections which determine the flux distribution of
ions near the second atom. The closest approach of the
ion to the second atom, R, can be approximated assuming
Coulomb scattering as follows:
1/2
R = 2 Z 1 Z 2 e2 d E , (2)
C. Surface Relaxation
Generally, the surface unit cells of clean metal surfaces
have been found to be those expected from the projection
of the bulk X-ray unit cell onto the surface, referred to
as a (1 × 1) structure (in Miller index notation), and the
uppermost layer z spacing (spacing in the direction nor-
mal to the surface plane) is equal to the bulk value within
about 5%. Such surfaces include the (111) crystal faces
of face centered cubic aluminum, platinum, nickel, and
rhodium, and the (0001) crystal faces of hexagonal close
packed cadmium and beryllium. This information has al-
most exclusively been determined by a detailed intensity
analysis of the diffraction beams in LEED as a function
of incident electron energy, and the interatomic positions
in the surface layer are calculated to within 0.1 Å. The
Al (110) surface shows a 5–15% contraction, the Mo(100)
surface a 11–12% contraction, and the W(100) surface a
FIGURE 8 Schematic of the dependence of the intensity of the 6% contraction of the top-layer z spacing with respect
surface peak (SP) on different crystal surface structures. (a) The
to the bulk, while retaining the (1 × 1) surface unit cell.
ideal crystal SP from “bulklike” surface, (b) enhanced SP observed
in normal incidence for a reconstructed surface, (c) enhanced SP Generally, crystal planes whose atoms are less densely
observed in nonnormal incidence for a relaxed surface, and (d) re- packed [for example, bcc (100) and fcc (110) planes] will
duced SP observed in normal incidence for a registered overlayer. be more likely to show relaxation than the more densely
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
packed planes. In forming a surface of the less densely gold, platinum, and iridium. The ideal unreconstructed
packed planes it is necessary to remove a larger num- surfaces have a square net of atoms. Surface reconstruction
ber of nearest-neighbor atoms. Thus, to minimize the sur- produces a superlattice that is basically five times larger
face free energy a relocation of the surface atoms from in one direction than for the ideal surface. For Ir(100)
their bulk positions is quite likely. There are several ex- the superlattice is denoted (5 × 1), for Pt(100) by the ma-
planations as to why surface relaxation is prevalent on trix notation ( −15 14
1
), and for Au(100) by the superlattice
the more open surfaces. First, it can be imagined that the (5 × 20). From LEED I (V ) analyses, evidence indicates
electron cloud attempts to smooth its surface, thereby pro- that the nature of the reconstruction is similar on all three
ducing electrostatic forces that draw the surface atoms metals and consists of a close-packed hexagonal top layer
toward the substrate, the effect being stronger the less that is positioned in slightly different ways on the square
closely packed the surface. Second, with fewer neigh- net substrate. These reconstructions are consistent with
bors the two-body repulsion energy is smaller, allowing the knowledge that the (111) face of fcc metals is ener-
greater atomic overlap and therefore more favorable bond- getically the most favorable. It is worth noting here that
ing at shorter bond lengths. Third, for surface atoms the the adsorption of gases such as oxygen, carbon monox-
bonding electrons are partly redistributed from the bro- ide, or hydrogen, or the presence of impurities can inhibit
ken bonds to the remaining unbroken bonds, thereby the these surface reconstructions. On the other hand, the pres-
charge content of the latter is increased, reducing the bond ence of such adsorbates can also induce different surface
length. reconstructions.
The nature and cause of these surface phase transfor-
mations are not well established at present. The case of
D. Surface Reconstruction
structural change from metastable to stable on adsorption
Surface atoms in any crystal are in an anisotropic envi- or removal of adsorbates indicates the likelihood of elec-
ronment which is very different from that around bulk tronic transitions that accompany reconstruction. At the
atoms. The crystal symmetry that is experienced by each surface there are fewer nearest neighbors as compared to
surface atom is markedly lower than when the atom is in atoms in the bulk. The electronic structure that is the most
the bulk. This symmetry change and lack of neighbors in stable in this reduced-symmetry environment may be sub-
the direction perpendicular to the surface allows displace- stantially different from that of the bulk metal. Since the
ment of surface atoms in ways which are not allowed in surface atoms are surrounded by atoms only on one side
the bulk. Surface relaxation is one consequence of this, and there is vacuum on the other, they may change their co-
the other major consequence being surface reconstruc- ordination number by slight relocation with simultaneous
tion. Here the two-dimensional surface unit cell is differ- changes in the electronic structure. It is indeed surprising
ent from that given by the termination of the bulk structure that more surfaces do not show reconstruction.
on the plane of interest. Surface reconstruction can give
rise to a multitude of different structures depending on the
E. Stepped and Kinked Surfaces
electronic structure of a given substance. The phenomenon
is more frequent on semiconductor surfaces than on metal The close-packed faces of solids (low-Miller-index faces)
surfaces. While the geometry is readily observed in the have the lowest surface free energy, and therefore they
LEED pattern, the actual elucidation of the real-space re- are the most stable with respect to rearrangement on
constructed surface structure is often extremely difficult, disordering up to or near the melting point. However,
and in some cases even after years of study and many stepped and/or kinked surfaces (high-Miller-index
proposals of the structure, the true structure is still not faces), although of higher surface free energy, are very
known. Such a system is the Si(111) surface. Upon cleav- important. They are known to play important roles during
ing in UHV at room temperature, the surface exhibits a evaporation, condensation, and melting. Steps and kinks
(2 × 1) surface structure. On heating to about 700 K the are sites where atoms break away as an initial process
surface structure changes to one with (7 × 7) periodicity. leading to desorption, or where atoms migrate during
This (7 × 7) structure is then the stable structure of the condensation to be incorporated into the crystal lattice.
(111) face. While this surface has been studied by a mul- Theories of crystal growth, evaporation, and the kinetics
titude of techniques, including LEED, STM, and He atom of melting have identified the significance of these lower
diffraction there is still no generally accepted structure of coordination-number sites in controlling the rate processes
either the (2 × 1) or (7 × 7) reconstructions. associated with phase changes. In addition, studies of
One of the best known examples of reconstruction of chemisorption and catalysis using single-crystal surfaces
metallic surfaces is that for the (100) faces of three 5d have revealed different binding energies and enhanced
transition metals that are neighbors on the periodic table: chemical activity at steps and kinks on high-Miller-index
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
FIGURE 10 LEED patterns (left) and surface structures (right) of (a) flat, (b) stepped, and (c) kinked platinum surfaces.
(Courtesy of Lawrence Berkeley Laboratory.)
transition metal surfaces as compared to low-Miller- ognizable in LEED. Upon removing the impurities from
index surfaces. Adsorption of diatomic or polyatomic the surface, the original one-atom-height step ordered ter-
molecules frequently leads to dissociation with greater race surface structure is usually regenerated.
probability of steps and kinks than on flat atomic
terraces.
II. SURFACE STRUCTURE OF
The presence of steps on a single crystal surface is read-
ADSORBATES ON SOLID SURFACES
ily discernible by LEED. The LEED patterns differ from
those expected from crystals with low-index faces in that
A. Introduction
the diffraction beams are split into doublets. This splitting
(Fig. 10) is a function of ordered steps on the surface. The While the knowledge of the structure of clean solid sur-
distance between the split beams is inversely related to the faces is important in its own right for determining var-
distance between the steps, i.e., the terrace width. From ious properties of those surfaces, many phenomena are
the variation of the intensity maximum of the doublet associated with the presence of adsorbates on the sur-
spots with electron energy the step height can be faces. In fact, in the natural environment of our planet,
determined. surfaces are never truly free of adsorbates. On approach-
Many stepped surfaces exhibit high thermal stability. ing a surface each atom or molecule encounters a net at-
In particular the one-atom-height step periodic terrace tractive potential. This results in a finite probability that
configuration appears to be the stable surface structure it will be trapped on the surface. This trapping, adsorp-
of many high-Miller-index surfaces. While most of the tion, is always an exothermic process. At the low pressure
stepped surfaces are stable when clean in their one-atom- of 1 × 10−6 torr, approximately 1 × 1015 gas molecules
height step ordered terrace configuration, in the presence collide with each square centimeter of surface per sec-
of a monolayer of carbon or oxygen many stepped surfaces ond. Since the surface concentration of atoms is about
undergo structural rearrangement. The step height and ter- 1015 cm−2 , at this pressure the surface may be covered
race width may double, or faceting may occur. Faceting with a monolayer of gas within seconds; this is the ma-
occurs when the step orientation becomes as prominent as jor reason why surface studies are performed under ultra-
that of the terrace and new diffraction features become rec- high vacuum conditions (P < 1 × 10−8 torr). The very low
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
pressure is needed to maintain clean surface conditions B. The Ordering of Adsorbed Monolayers
for a time long enough to perform experimental measure-
ments. At atmospheric pressure the surface will be covered The ordering process in the adlayer is due to an interplay
within a fraction of a second. The constant presence of of the bonding with the substrate and the bonding between
the adsorbate layer influences the chemical, mechanical, the adatoms or admolecules. Once a molecule adsorbs it
and surface electronic properties. Adhesion, lubrication, may diffuse on the surface or remain bound at a specific
and resistance to mechanical or chemical attack or photo- site during most of its residence time. Thermal equilibra-
conductivity are just a few of the many macroscopic sur- tion among the adsorbate and between the adsorbate and
face processes that are controlled by various properties of substrate atoms (i.e., adsorption) is assured if Hads and
∗
monolayers. E D(bulk) , the activation energy for bulk diffusion, are high
Two macroscopic experimentally determinable param- enough as compared to kT (≥10kT ). However, ordering
eters characterize the adsorbed monolayer: the coverage primarily depends on the depth of the potential energy
and the heat of adsorption. The coverage is defined as barrier that keeps an atom or molecule from hopping to
the ratio of the number of adsorbed atoms or molecules a neighboring site along the surface. The activation en-
∗
to the total number of adsorption sites (usually taken as ergy for surface diffusion, E D(surface) , is an experimen-
the number of atoms in the surface plane). The heat of tal parameter that is of the magnitude of this potential
adsorption Hads , is implicitly linked to the strength of energy barrier. E D∗ can be experimentally determined
the adsorbate–substrate bond. Knowledge of both param- on well-characterized surfaces by field-ion microscopy,
eters often reveals the nature of bonding in the adsorbed for example, and for Ar and W adatoms and O atoms on
layer. tungsten surfaces has the value 2, 15, and 10 kcal mol−1 ,
∗
Atoms or molecules that impinge on the solid surface respectively. For small values of E D(surface) ordering is
from the gas phase will have a residence time τ on the sur- restricted to low temperatures, since as the temperature is
face. If the impinging molecules achieve thermal equilib- increased the adsorbate becomes very mobile. As the value
∗
rium with the surface atoms τ = τ0 exp Hads /RT , where of E D(surface) increases, ordering cannot commence at
τ0 is related to the average vibrational frequency associ- low temperature since the adsorbate atoms need to have
ated with the adsorbate. The heat of adsorption is always a considerable mean free path along the surface to find
positive and is defined as the binding energy of the ad- their equilibrium position once they land on the surface
sorbed species. A larger Hads and lower temperature in- at a different location. Naturally, if the temperature is too
crease the residence time. For a given incident flux, larger high, the adsorbate will desorb or vaporize.
Hads and lower temperature yield higher coverages. The binding forces of adsorbates on substrates have
It is conventional to divide adsorption into two cate- components perpendicular and parallel to the surface. The
gories: physisorption and chemisorption. Physisorption perpendicular component is primarily responsible for the
(or physical adsorption) systems are characterized by binding energy ( Hads ), while the parallel component of-
weak interactions ( Hads < 15 kcal mol−1 , accompanied ten determines the binding site on the surface. The binding
by short residence times) and require adsorption studies site may also be affected by adsorbate–adsorbate interac-
to be performed at low temperature and relatively high tions, which produce ordering within an overlayer. These
pressure (high flux). Adsorbates that are characterized by interactions may be subdivided into direct adsorbate–
stronger chemical interactions ( Hads ≥ 15 kcal mol−1 ), adsorbate interactions (not involving the substrate at all)
where near-monolayer adsorption commences even at and substrate mediated interactions. The latter are compli-
room temperature and at low pressures (≤10−6 torr), cated many-atom interactions, for example dipole–dipole
are called chemisorption systems. While the two names interactions.
imply two distinct types of adsorption, there is a grad- The adsorbate–adsorbate interactions may be repulsive;
ual change from the physisorption to the chemisorption they always are repulsive at sufficiently small adsorbate–
regime. adsorbate separations. At larger separations they may be
One of the most fascinating facts about the structure attractive, giving rise to the possibility of island formation.
of these physisorbed and chemisorbed overlayers in the They may also be oscillatory, moving back and forth be-
submonolayer to few monolayer regime is the preponder- tween attractive and repulsive as a function of adsorbate–
ance of the formation of long-range ordered structures. adsorbate separation, with a period of several angstroms
Well over 1000 two-dimensional unit cells have been doc- giving rise, for example, to non-close-packed islands.
umented in the literature. While only the shape, size, and Except for the strong repulsion at close separations, the
orientation of the cells are known for most of them, the adsorbate–adsorbate interactions are usually weak com-
adsorption site and bond lengths have been determined for pared to the adsorbate–substrate interactions, even when
about 500 of them. we consider only the components of the forces parallel
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
temperature or pressure at which one adsorbate surface ing a molecule such as CO, which has interactions strong
structure converts into another can also be affected by the enough to induce ordering in the overlayer. Similar phe-
presence of uncontrolled surface irregularities (surface de- nomena have been observed on Pt(111), and it is thought
fects). Other causes that could influence ordering are the that this coadsorbate-induced ordering may prove to be a
presence of small amounts of surface impurities that block very general phenomenon.
nucleation sites or interfere with the kinetics of ordering,
or impurities below the surface that are pulled to the sur-
C. Ordered Adsorbate Structures
face during adsorption and ordering.
The effect of surface irregularities on ordering can be As mentioned in the introduction to this section, well over
investigated in a more controlled way using stepped crys- 1000 ordered adsorbate structures have been observed
tal surfaces. In general, the smaller the ordered terrace with LEED. A full listing and discussion of these struc-
between the steps, the stronger the effect of steps on or- tures is outside the scope of this article. Instead, one ex-
dering. For instance, the ordering of small molecular ad- ample of each of the three following categories of ad-
sorbates on a high-Miller-index Rh(S)-[6(111) × (100)] is sorption are presented to give an illustrative indication of
largely unaffected by the presence of steps whereas on the the types of structures found: (i) an ordered monolayer of
Rh(S)[3(111) × (111)] the ordering is influenced by the atoms, (ii) an ordered organic monolayer, and (iii) an or-
higher step density. Steps can also affect the nucleation of dered molecular monolayer. First, a few generalities of the
ordered domains. It is frequently observed on W and Pt ordered adsorbate structures are discussed, based on the
stepped surfaces that when two or three equivalent ordered large number of LEED observations: the so-called “rules
domains may form in the absence of steps, only one of the of ordering”:
ordered domains grows in the presence of steps.
1. The rule of close-packing. Adsorbed atoms or
molecules tend to form surface structures characterized
3. The Effect of Coadsorbates on Ordering by the smallest unit cell permitted by the molecular dimen-
It has been found that although certain molecules may not sions and adsorbate–adsorbate and adsorbate–substrate
order when present on their own on a surface they can be interactions. They prefer close-packing arrangements.
induced to order by coadsorption of either carbon monox- Large reciprocal unit meshes are uncommon and the
ide or nitric oxide. For example Table I summarizes the most frequently observed meshes are the same size as the
ordered structures that have been observed by the coad- substrate mesh, i.e., (1 × 1) or are approximately
√ √ twice as
sorption of alkylidynes, acetylene, aromatics, and alkalis large, e.g., (2 × 2), c(2 × 2), (2 × 1), ( 3 × 3).
with CO on Rh(111). 2. The rule of rotational symmetry. Adsorbed atoms
At low temperature both Na and ethylidyne form (2 × 2) or molecules form ordered structures that have the same
overlayers on Rh(111) but with increasing temperature rotational symmetry as the substrate surface. If the surface
begin to disorder. If CO is coadsorbed then the adsorbates unit mesh has a lower symmetry than the substrate, then
can be reordered into a c(4 × 2) unit cell. domains of the various possible mesh orientations are to be
It is thought that the nature of this type of order- expected on different areas of the surface with a resulting
ing process is due to adsorbate–adsorbate interactions: increase in symmetry.
A molecule that might not otherwise order due to weak 3. The rule of similar unit cell vectors. Adsorbed atoms
adsorbate–adsorbate interactions is ordered by coadsorb- as molecules in monolayer thickness tend to form ordered
surface structures characterized by unit cell vectors closely
TABLE I Ordered Structures Induced by CO on Rh(111)
related to the substrate unit cell vectors. The surface struc-
ture bears a closer resemblance to the substrate structure
Type of molecule LEED pattern System than to the structure of the bulk condensate.
Alkylidynes c(4 × 2) CCH3 + CO
√ √ These are not hard-and-fast rules but rather are gener-
(2 3 × 2 3)R30◦ 3CCH2 CH3 + CO
Acetylene c(4 × 2) C2 H2 + CO
alizations of a great many systems.
(3 × 3) C6 H5 F + 2CO
Aromatics (3 × 3) C6 H6 + 2CO 1. Ordered Atomic Monolayers
√
c(2 3 × 4)Rect C6 H6 F + CO
√ Some important conclusions can be drawn from the known
c(2 3 × 4)Rect C6 H6 + CO
√ structures of atomic adsorbates on single-crystal surfaces.
Alkalis ( 3 × 7)Rect Na + 7CO
First, the adsorbed atoms tend to occupy sites where they
c(4 × 2) Na + CO
are surrounded by the largest number of substrate atoms
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
FIGURE 12 Top and side views (in top and bottom sketches of each panel) of adsorption geometries on various
metal surfaces. Adsorbates are drawn shaded. Dotted lines represent clean surface atomic positions; arrows show
atomic displacements due to adsorption.
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
2. Ordered Molecular Monolayers LEED pattern. A LEED structural analysis has not been
performed for this structure, but supporting evidence us-
Molecules adsorbed on surfaces may retain their ba-
ing infrared spectroscopy indicates that the CO molecules
sic molecular identity, bonding as a whole to the sub-
sit in the threefold hollow sites. As the coverage is in-
strate. They may dissociate into their constituent atoms,
creased to one-half of a monolayer the LEED pattern trans-
which bond individually to the substrate. Alternatively,
forms to a c(4 × 2). In this structure the CO molecules all
molecules may break up into smaller fragments which
sit in twofold bridge sites. If the adsorption takes place at
become largely independent or recombine into other con-
low temperature (90 K), incrcasing the CO coverage be-
figurations. There are also cases of intermediate character
yond = 0.5 leads to the appearance of a series of LEED
where relatively strong bonding distorts the molecule.
patterns arising from hexagonal superstructures, which by
One example of ordered molecular monolayers is CO
a continuous compression and rotation of the c(4 × 2) unit
on Pd(111). The (111) surface of fcc metals is the close-
cell lead to a (2 × 2) coincidence pattern at a coverage of
packed plane and shows similar ordering for adsorbed
√ √CO = 0.75. These transformations are also shown schemat-
for a variety of transition metals. That is the ( 3 × 3)-
ically in Fig. 14.
R30◦ structure, formed at a coverage of one-third of a
monolayer on the (111) faces of Pd, Ni, Pt, Ir, Cu, and
Rh. This similarity in ordering is probably due to their
3. Ordered Organic Monolayers
surfaces being rather smooth with respect to variations in
the CO adsorption energy. Smaller diffusion barriers be- The adsorption characteristics of organic molecules on
tween different adsorption sites are to be expected, and solid surfaces are important in several areas of surface
for large coverages repulsive interactions will be mainly science. The nature of the chemical bonds between the
responsible for the arrangement of the adlayer.
√ √Figure 14 substrate and the adsorbate and the ordering and orien-
shows a schematic representation of this ( 3 × 3)-R30◦ tation of the adsorbed organic molecules play important
CO structure on Pd(111) and the corresponding observed roles in adhesion, lubrication, and hydrocarbon catalysis.
FIGURE 14 Schematic representation of the CO on Pd(111) system. Structure models and observed LEED structures
for the various CO coverages are shown.
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
B. Surface Tension in a
One-Component System
Creating a surface involves breaking chemical bonds and
removing neighboring atoms, and this requires work.
Under conditions of constant temperature and pressure
at equilibrium, the surface work δW S is given by the
following:
δWT,P
S
= d(G S A), (6)
where A is the increase in the surface area. If G S is
independent of the surface area, surface work is as follows:
δWT,P
S
= G S d A. (7)
In a one-component system the specific surface free en-
ergy, G S , is frequently called the surface tension or surface
pressure and is denoted by γ . Here γ may be viewed as
a pressure along the surface opposing the creation of new
surface. It has dimensions of force per unit length (dynes
per centimeter, ergs per square centimeter, or newtons
per meter). The surface tension γ for an unstrained phase
is also equal to the increase of the total free energy of the
system per unit increase of the surface area as follows:
∂G
γ =G = S
. (8)
∂ A T,P
FIGURE 18 EELS spectra of the adsorption and decomposition
of ethylene (C2 H4 ) on Rh(111) at (a) 77 K, (b) 220 K, and (c) The free energy of formation of a surface is always
450 K. positive, since work is required in creating a new surface,
which increases the total free energy of the system. In or-
atoms are surrounded by fewer nearest neighbors than der to minimize their free energy solids or liquids assume
bulk atoms, and these neighbors are not distributed evenly shapes in equilibrium with the minimum exposed surface
around the surface atoms. An atom in the interior experi- area as possible. For example, liquids tend to form a spher-
ences no net forces, but these forces become unbalanced ical shape and crystal faces which exhibit the closest pack-
at the surface. Consequently the thermodynamic parame- ing of atoms tend to be the surfaces of lowest free energy
ters used to describe surfaces are defined separately from of formation and thus the most stable. Surface tension
those that characterize the bulk phase. The specific surface is one of the most important thermodynamic parameters
energy E S , the energy per surface area, is related to the characterizing the condensed phase. Table II lists selected
total energy E by the following equation: experimentally determined values of surface tensions of
liquids and solids that were measured in equilibrium with
E = NE 0 + AE S , (4)
their vapor.
where A is the surface area of a solid composed of N Comparing the surface tension values of metals and ox-
atoms, and E 0 is the energy of the bulk phase per atom. ides in Table II it can be seen that oxides have in general
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
TABLE II Selected Values of Surface Tension of related to the heat of vaporization, which is related to the
Solids and Liquids energy input necessary to break all the bonds of atoms in
Material γ (ergs cm−2 ) T (◦ C) the condensed phase. In fact, it has been found experimen-
tally that the molar surface free energy of a liquid metal
He (1) 0.308 −270.5 can be estimated by the following:
N2 (1) 9.71 −195
Ethanol (1) 22.75 20 γlm = 0.15 Hvap , (9)
Water 72.75 20 where Hvap is the heat of vaporization of the liquid, and
Benzene 28.88 20 the molar surface free energy of a solid metal is given by
n-Octane 21.80 20 the following:
Carbon tetrachloride 26.95 20
Bromine 41.5 20 γsm = 0.16 Hsub , (10)
W (s) 2900 1727
where Hsub is the heat of sublimation of the solid.
Nb (s) 2100 2250
For other materials, oxides, or organic molecules, such
Au (s) 1410 1027
a simple relationship does not work due to the complexity
Ag (s) 1140 907
of bonding and the rearrangement or relaxation of surface
Ag (l) 879 1100
atoms at the freshly created surfaces.
Fe (s) 2150 1400
Fe (l) 1880 1535
Pt (s) 2340 1311 C. Surface Tension of Multicomponent Systems
Cu (s) 1670 1047
In many important surface phenomena, such as heteroge-
Ni (s) 1850 1250
neous catalysis or passivation of the surface by suitable
Hg (l) 487 16.5
protective coatings, the chemical composition of the top-
NaCl (s) 227 25
most layer controls the surface properties and not the com-
KCl (s) 110 25
position in the bulk. Thus, investigations of the physical–
CaF2 (s) 450 −195
chemical parameters that control the surface composition
MgO (s) 1200 25
are of great importance. One of the major driving forces
SiO2 (s) 307 1300
for the surface segregation of impurities from the bulk and
Al2 O3 (s) 690 2323
for the change of composition of alloys and other multi-
Polytetrafluoroethylene 18.5 20
component systems is the need to minimize the surface
Polyethylene 31 20
free energy of the condensed phase system.
Polystyrene 33 20
The change of the total free energy of a multicomponent
Poly(vinyl chloride) 39 20
system can be expressed with the inclusion of the surface
term as follows:
a low surface tension. Therefore, a reduction in the total dG = S dT + V d P + γ d A + µi dn i , (11)
i
free energy of the system can be achieved by oxidation of
the surface and a uniform oxide layer covering the surface where µi is the chemical potential of the ith compo-
is expected under conditions near thermodynamic equilib- nent and dn i is the change in the number of moles of
rium. Similarly, deposition and growth of a metal film on a the ith component. At constant temperature and pressure,
metallic substrate of higher surface tension should yield a Eq. (11) can be rewritten as follows:
uniform layer that is evenly spread to completely cover the dG T,P = γ d A − µi dn i , (12)
substrate surface. Likewise, a very poor spreading of the i
film is expected on deposition of a metal of high surface
where the minus sign indicates the decrease of the bulk
tension on a low-surface-tension substrate. This latter con-
concentration of the ith component. Comparing this equa-
dition results in “island growth” and the deposited high-
tion with Eq. (8), the surface tension γ is no longer equal
surface-tension metal will grow as whiskers to expose
to the specific surface free energy per unit area for a mul-
as much of the low-surface-tension substrate during the
ticomponent system. Using simple arguments in which
growth as possible. These, of course, are surface thermo-
number of moles of the condensed phase are transferred
dynamic predictions and may be overidden by the presence
to the freshly created surface, the Gibbs equation can be
of impurities at the surface or difficulties of nucleation.
derived as follows:
Since atomic bonds must be broken to create surfaces,
it is expected that the specific surface free energy will be dγ = −S S dT = i dµi , (13)
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
TABLE III Surface Composition of Alloys: system. As a result, the surface composition may markedly
Experimental Results and Predictions of the change with changing ambient conditions.
Regular Solution Model
The mechanical properties of solids, embrittlement,
Segregating constituent and crack propagation, among others, depend markedly
on the surface composition. These studies indicate that
Alloy Predicted
system regular solution Experimental
the surface composition and the mechanical properties of
structural steels may change drastically when the ambi-
Ag–Pd Ag Ag ent conditions are changed from reducing to oxidizing
Ag–Au Ag Ag environments.
Au–Pd Au Au
Ni–Pd Pd Pd
D. Equilibrium Shape of a Crystal
Fe–Cr Cr Cr
or a Liquid Droplet
Au–Cu Cu Au, none, or Cu
depending on In equilibrium the crystal will take up a shape that corre-
composition
sponds to a minimum value of the total surface free energy.
Cu–Ni Cu Cu
In order to have the equilibrium shape, the integral γ d A
Au–Ni Au Au
over all surfaces of the crystal must be a minimum. Crystal
Au–Pt Au Au
faces that have high atomic density have the lowest sur-
Pb–In Pb Pb
face free energy and are therefore most stable. The plot of
Au–In In In
the surface free energy as a function of crystal orientation
Al–Cu Al Al
is called the γ plot.
Pt–Sn Sn Sn
Solids and liquids will always tend to minimize their
Fe–Sn Sn Sn
surface area in order to decrease the excess surface free
Au–Sn Sn Sn
energy. For liquids, therefore, the equilibrium surface be-
comes curved where the radius of curvature will depend
on the pressure difference on the two sides of the interface
the segregating components that were experimentally and on the surface tension as follows:
observed and also predicted by the regular solution
(Pin − Pext ) = 2γ /r, (18)
model. The agreement is certainly satisfactory. It appears
that for binary metal alloy systems that exhibit regular where Pin and Pext are the internal and external pressures,
solution behavior there are reliable methods to predict respectively, and r is the radius of curvature. In equi-
surface composition. librium a pressure difference can be maintained across
So far we have discussed the surface composition of a curved surface. The pressure inside the liquid drop or
multicomponent systems that are in equilibrium with their gas bubble is higher than the external pressure, because
vapor or in which clean surface–bulk equilibrium is ob- of the surface tension. The smaller the droplet or larger
tained in ultrahigh vacuum. In most circumstances, how- the surface tension, the larger is the pressure difference
ever, the surface is covered with a monolayer of adsor- that can be maintained. For a flat surface r = ∞, and the
bates that frequently form strong chemical bonds with the pressure difference normal to the interface vanishes.
surface atoms. This solid–gas interaction can markedly Let us now consider how the vapor pressure of a
change the surface composition in some cases. For exam- droplet depends on its radius of curvature r . We obtain
ple, carbon monoxide, when adsorbed on the surface of a the following:
Ag–Pd alloy, forms much stronger bonds with Pd. While
ln (P/P0 ) = 2γ Vin /RT r, (19)
the clean surface is enriched with Ag, in the presence of
CO, Pd is attracted to the surface to form strong carbonyl where Vin is the internal volume. This is the well-known
bonds. When the adsorbed CO is removed, the composi- Kelvin equation for describing the dependence of the va-
tion returns to its original Ag-enriched state. Nonvolatile por pressure of any spherical particle on its size. Small
adsorbates, such as carbon or sulfur, may have a simi- particles have higher vapor pressures than larger ones.
lar influence on the surface composition as long as their Similarly, very small particles of solids have greater solu-
bonding to the various constituents of the multicomponent bility than large particles. If we have a distribution of parti-
system is different. cles of different sizes, we will find that the larger particles
Adsorbates should therefore be viewed as an additional will grow at the expense of the smaller ones. Nature’s way
component of the multicomponent system. A strongly in- to avoid the sintering of small particles that would occur
teracting adsorbate converts a binary system to a ternary according to the Kelvin equation is to produce a system
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
with particles of equal size. This is the world of colloids ting ability of the liquid at the solid interface is so im-
where particles are of equal size and therefore stabilized portant in practical problems of adhesion or lubrication,
and are usually charged either all negative or all positive there is a great deal of work being carried out to deter-
or to repel each other by long-range electrostatic forces. mine the interfacial tensions for different combinations of
Milk and our blood are only two examples of systems that interfaces.
contain colloids. The usefulness of a lubricant is determined by the extent
to which it wets the solid surface and maintains complete
E. Adhesion and the Contact Angle coverage of the surface under various conditions of use.
The strength of an adhesive is determined by the extent to
Let us turn our attention to the interfacial tension, that which it lowers the surface free energy by adsorption on
is, the surface tension that exists at the interface of two the surface. The work of adhesion is defined as follows:
condensed phases. Let us place a liquid droplet on a solid
surface. The droplet either retains its shape and forms a WAs = γ1,0 + γs,0 − γsl , (21)
curved surface or it is spread evenly over the solid. These where γ1,0 and γs,0 are the surface tensions in vacuum of
two conditions indicate the lack of wetting or wetting of the liquid and solid, respectively. In general, solids and liq-
the solid by the liquid phase, respectively. The contact uids that have large surface tension form strong adhesive
angle between the solid and the liquid, to a large extent, bonds, i.e., have large work of adhesion. The work of ad-
permits us to determine the interfacial tension between hesion is in the range of 40–150 ergs/cm2 for solid–liquid
the solid and the liquid. The contact angle is defined by pairs of various types. Organic polymers often make ex-
Fig. 20. If the contact angle is large ( approaching 90◦ ), cellent adhesives because of the large surface area covered
the liquid does not readily wet the solid surface. If ap- by each organic molecule. The adhesive energy per mole
proaches zero, complete wetting of the solid surface takes is much larger than that for adhesion between two metal
place. For larger than 90◦ the liquid tends to form spher- surfaces or between a liquid and a solid metal because of
ical droplets on the solid surface that may easily run off, the many chemical bonds that may be formed between the
i.e., the liquid does not wet the solid surface at all. Re- substrate and the adsorbed organic molecule.
membering that the surface tension always exerts a pres-
sure tangentially along a surface, the surface free energy
F. Nucleation
balance between the surface forces acting in opposite di-
rections at the point where the three phases solid, liquid, Another important phenomenon that owes its existence to
and gas meet is given by the following: positive surface free energy is nucleation. In the absence
cos = (γsg − γsl )/γlg . (20) of a condensed phase, it is very difficult to nucleate one
from vapor atoms because the small particles that would
Here γlg is the interfacial tension at the liquid–gas inter- form have a very high surface area and dispersion and, as
face and γsg and γsl are the interfacial tensions between a result, a very large surface free energy. The total energy
the solid–gas and the solid–liquid interfaces, respectively. of a small spherical particle has two major components:
Knowing γlg and the contact angle in equilibrium at the its positive surface free energy, which is proportional to
solid–liquid–gas interface, we can determine the differ- πr 2 γ , where r is the radius of the particle, and its neg-
ence γsg − γsl but not their absolute values. Since the wet- ative free energy of formation of the particle with vol-
ume V . The volumetric energy term is proportional to
−r 3 ln (P/Peq ), where P is the pressure over the system
and Peq is the equilibrium vapor pressure:
4πr 3 P
G(total) = − RT ln + 4πr 2 γ , (22)
3Vm Peq
where Vm is the molar volume of the forming particle. Ini-
tially, the atomic aggregate is very small and the surface
free energy term is the larger of the two terms. In this cir-
cumstance a condensate particle cannot form from the va-
por even at relatively high saturation (P > Peq ). Similarly,
a liquid may be cooled below its freezing point without
FIGURE 20 Definition of the contact angle between a liquid and solidification occurring.
solid and the balance of surface forces at the contact point among Above a critical size of the spherical particles the
the three phases (solid, vapor, and liquid). volumetric term becomes larger and dominates since it
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
2
real situation. The Langmuir isotherm assumes that ad- 1
sorption is terminated on completion of one molecular P=
k 1−
adsorbed gas layer (monolayer) by asserting that any
or
gas molecule that strikes an adsorbed atom must reflect
from the surface. All the other assumptions used to de- (k P)1/2
= . (33)
rive Eq. (28) are maintained (i.e., homogeneous surface 1 + (k P)1/2
and noninteracting adsorbed species). If [n 0 ] is the sur- A clear weakness of the Langmuir model is the assumption
face concentration of a completely covered surface, the that the heat of adsorption is independent of coverage.
number of surface sites available for adsorption, after ad- Several other isotherms have been developed which are
sorbing [n a ]s molecules is [n 0 ] − [n a ]s . Of the total flux all modifications of the Langmuir model. For example, the
incident on the surface, a fraction ([n a ]s /[n 0 ])F will strike Temkin isotherm can be derived if a linearly declining heat
molecules already adsorbed and, therefore, be reflected. of adsorption is assumed, i.e., H = H0 (1−β), where
Thus, a fraction (1 − [n a ]s /[n 0 ])F of the total incident H0 is the initial enthalpy of adsorption. The isotherm is
flux will be available for adsorption. Equation (25) should
RT
then be modified as follows: = ln A P, (34)
β H0
[n a ]s
[n a ]s = 1 − Fτ, (30) where A is a constant related to the enthalpy of adsorption.
[n 0 ]
The possibility of multilayer adsorption is envisaged
which can be rearranged to give
in the Brunauer–Emmett–Teller (BET) isotherm. The as-
[n 0 ]Fτ [n 0 ]k P sumption is made that the first layer is adsorbed with a
[n a ]s = = (31)
[n 0 ] + Fτ [n 0 ] + k P heat of adsorption H1 and the second and subsequent lay-
from which ers are all characterized by heats of adsorption equal to
the latent heat of evaporation, HL . By considering the dy-
k P
= , (32) namic equilibrium between each layer and the gas phase
1 + k P the BET isotherm is obtained,
where k = k/[n 0 ]. p 1 c−1 p
Equation (32) is the Langmuir isotherm. The adsorp- = + . (35)
V ( p0 − p) Vm c Vm c p0
tion of CO on Pd(111) obeys the Langmuir isotherm, and
typical isotherms from this system are shown in Fig. 24. In this equation V is the volume of gas adsorbed, p
It can readily be shown that in the case of dissociative the pressure of gas, p0 the saturated vapor pressure of the
adsorption the Langmuir isotherm becomes liquid at the temperature of the experiment, and Vm the
volume equivalent to an adsorbed monolayer. The BET The last, and most common, method of determining
constant c is given by the following: the heat of adsorption is a kinetic method called tem-
perature programmed desorption (TPD). The method is
c = exp(H1 − HL )/RT. (36)
as follows. The sample is cleaned in ultrahigh vacuum
The BET equation owes its importance to its wide use and a gas is allowed to adsorb on the surface at known
in measuring surface areas, especially of films and pow- pressures while the surface is kept at a fixed temperature.
ders. The method followed is to record the uptake of an The sample is then heated at a controlled rate, and the
inert gas (Kr) or nitrogen at liquid nitrogen temperature pressure changes during the desorption of the molecules
(−195.8◦ C). A plot of p/V ( p0 − p) versus p/ p0 , usually are recorded as a function of time and temperature. The
for p/ p0 up to about 0.3, yields Vm , the monolayer up- pressure–temperature profile is usually referred to as the
take. This value is expressed as an area by assuming that desorption spectrum. The desorption rate F(t) is com-
the area per molecule for nitrogen is 16.2 Å and 25.6 Å for monly expressed as follows:
krypton.
E des
In general, the BET isotherm is most useful for de- F(t) = v f (σ ) exp − , (39)
scribing physisorption for which H1 and HL are of the RT
same order of magnitude while the preceding isotherms where v is the preexponential factor and f (σ ) an adsor-
are more useful for chemisorption. It is worth noting that bate concentration-dependent function. The various pro-
the BET isotherm reduces to the Langmuir isotherm when cedures for determining these parameters are well de-
H1 HL . scribed in the literature.
Assuming that v and E des are independent of the ad-
sorbate concentration σ and t, E des can be obtained for
I. Heat of Adsorption
zeroth-, first-, and second-order desorption, respectively,
An important physical–chemical property that character- as follows:
izes the interaction of solid surfaces with gases is the bond
E0 ν0 E
energy of the adsorbed species. The determination of bond = exp − (40)
R σa RTp
energy is usually made indirectly by measuring the heat
of adsorption (or heat of desorption) of the gas. The heat E1 v1 E1
of adsorption can be determined readily in equilibrium by = exp − (41)
RTp2 α RTp
measuring several adsorption isotherms. The Clausius–
Clapeyron equation
E2 v2 σ E2
= exp − , (42)
∂(ln P) Hads RTp2 α RTp
= (37)
∂T RT 2 where Tp is the temperature at which a desorption peak
can be integrated to give is at a maximum and σ is the initial adsorbate concentra-
tion. The subscripts 0, 1, and 2 denote the zeroth-, first-,
P1 − Hads 1 1
ln = − . or second-order desorption processes; α is a constant of
P2 R T1 T2
proportionality for the temperature rise with time, usu-
Measuring the adsorption isotherm at two different tem- ally T = T0 + αt; that is, the temperature of the sample
peratures, provided that proper equilibrium is established is raised linearly with time. As seen from the equations,
between the adsorbed and gas phase, yields the heat of Tp is independent of σ for the first-order desorption pro-
adsorption. cess. Alternatively, Tp is increased or is decreased with
The heat of adsorption can also be obtained by direct σ0 for the zeroth- and second-order process, respectively.
calorimetry. The method most commonly used consists of Equations (40)–(42) allow us to determine the activation
measuring the temperature rise caused by the addition of energy and the preexponential factor and also to distin-
a known amount of gas to a film of the metal prepared guish between zeroth-, first-, and second-order desorption
by evaporation in vacuo. This measurement will yield the processes from the measurements of the dependence of
differential heat of adsorption qd at the particular value the peak temperatures on initial adsorbate concentrations
of . The differential heat of adsorption is related to the and heating rate α. A typical TPD spectrum is shown in
isosteric heat of adsorption by the following: Fig. 25.
The bond energy Hbond is readily extracted from the
q = qd + RT ; (38)
heat of adsorption. In the case of the chemisorption of
the difference is only RT, which is within experimental a diatomic molecule X2 onto a site on a uniform solid
error. surface M the molecule may adsorb without dissociation
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
FIGURE 31 Energy-level diagram (a) in the absence of any FIGURE 32 Potential energy diagram illustrating the work func-
space charge and (b) with a surface space charge due to de- tion. EF is the Fermi energy, φ is the work function, and W is
pletion of electrons in the surface region. the potential well bonding the conduction band electrons into the
solid.
depleted of electrons it would require more energy to trans- insulators it can be regarded as the difference in energy
fer an electron to the conduction band from, for example, between an electron at rest in the vacuum just outside
the reference state E F , due to the space charge potential the solid (i.e., at the level of zero kinetic energy) and the
barrier. This is depicted schematically in Fig. 31. Con- most loosely bound electrons in the solid. Thus, the work
versely, it is now easier to transfer a hole to the surface function is evidently an important parameter in situations
since the difference between E F and E V becomes smaller. where electrons are removed from the solid.
It is very likely that there is accumulation or depletion A schematic energy level diagram assuming the free-
of charges at semiconductor or insulator surfaces under electron model of a metal showing the work function is
all ambient conditions. For surfaces under atmospheric depicted in Fig. 32.
conditions, adsorbed gases or liquid layers at the interface From the figure it can be seen that the value of φ de-
provide trapping of charges or become the source of free pends on W , the depth of the potential well bonding the
carriers. For clean surfaces in ultrahigh vacuum, there conduction electrons into the solid. Here W is a bulk prop-
are electronic surface states that act as traps or sources of erty determined by the attraction for its electrons of the
electrons and produce a space-charge layer of appreciable lattice of positive ions as a whole; it has an energy of the
height. Thus, the mobile carriers from the surface layer are order of a few electron volts.
swept into the interior or are trapped at the surface as the The origin of the work function itself can be considered
space-charge layer consists dominantly of static charges, as being due to the image potential of the escaping elec-
the one most frequently encountered in experimental tron. Electrostatic theory shows that a charge −e outside
situations. a conductor is attracted by an image charge +e placed at
We have so far considered the space-charge layer prop- the position of the optical image of −e in the conducting
erties only in the insulating solid, assuming that the surface plane. If −e is a distance x from the plane the image force
layer that acts as a donor or the electron trap is of mono- is e2 /16π ε0 x 2 . This force is experienced by the electron
layer thickness. However, considering the properties of escaping into the vacuum and is negligible beyond 10−6 –
solid–liquid interfaces or semiconductor–insulator con- 10−5 cm away from the surface.
tacts, it should be recognized that the space-charge layer The image potential is a specific surface contribution to
may extend to effective Debye lengths on both sides of the W , and a second surface contribution is the existence of a
interface. This is a most important consideration when we surface double layer or dipole layer. Surface atoms are in
investigate the surface properties of colloid systems or of an unbalanced environment, they have other atoms on one
semiconductor–electrolyte interfaces. side of them but not on the other; thus, the electron dis-
tribution around them will be unsymmetrical with respect
to the positive ion cores. This leads to the formation of
D. Work Function and Contact Potential
a double layer. Two important effects emanate from this;
The work function of a solid is a fundamental physical the work function is sensitive to both the crystallographic
property of the solid which is related to its electronic plane exposed and to the presence of adsorbates.
structure. It is defined as the potential that an electron The orientation of the exposed crystal face affects φ
at the Fermi level must overcome to reach the level of because the strength of the electric double layer depends
zero kinetic energy in the vacuum. In semiconductors and on the density of positive ion cores which in turn will vary
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
TABLE IV Work Functions Measured from a tungsten tip. The changes in φ produced by adsorbed
Different Crystal Faces of Tungsten and atoms or molecules can be followed in the FEM.
Molybdenum
The work function of a solid is also sensitive to the pres-
Work function (eV) ence of adsorbates. In fact, in virtually all cases of adsorp-
Crystal
tion the work function of the substrate either increases or
face Tungsten Molybdenum
decreases; the change being due to a modification of the
(110) 4.68 5.00 surface dipole layer. The formation of a chemisorption
(112) 4.69 4.55 bond is associated with a partial electron transfer between
(111) 4.39 4.10 substrate and adsorbate and the work function will change.
(001) 4.56 4.40 Two extreme cases are (i) the adsorbate may only be po-
(116) 4.39 — larized by the attractive interaction with the surface giving
rise to the build up of a dipole layer, as in the physisorption
of rare gases on metal surfaces; and (ii) the adsorbate may
from one face to another. The work function of various be ionized by the substrate, as in the case of alkali metal
crystal planes of tungsten and molybdenum are listed in adsorption on transition metal surfaces. If the adsorbate
Table IV. It can be seen that there is more than 0.3 eV is polarized with the negative pole toward the vacuum the
difference in work function values. This variation of work consequent electric fields will cause an increase in work
function from one crystal face to another can clearly be function. Conversely, if the positive pole is toward the vac-
demonstrated using a field emission microscope (FEM). uum then the work function of the substrate will decrease.
This microscope is identical in construction to the FIM The work function is a rather complicated (and not fully
described earlier. However, instead of having helium or understood) function of the surface composition and ge-
another imaging gas in the vacuum, no gas is admitted. ometry. Nevertheless, general systematic observations of
The potential on the sample tip is reversed so that electrons φ are quite helpful. For example, the sign of φ for
are accelerated out of it by a very high local electric field atomic adsorption is mostly that implied by the magni-
(∼ 4 × 107 V cm−1 ). The current emitted from the tip sur- tude of the ionization potential, electron affinity, or dipole
face where the work function is φ is approximately propor- moment of the adsorbates as one would expect. The most
tional to exp(−Aφ 3/2 ) and is a very fast function of φ. The common usage of work function changes in surface chem-
brightness observed on the fluorescent screen is a function istry is in the monitoring of the various stages of adsorp-
of the value of φ at that place on the tip, and the FEM image tion as a function of coverage. Often the work function
will consist of darker and brighter areas, the brightness de- change will go through a maximum or minimum at par-
pending on the work function of each crystal face exposed. ticular coverages corresponding to the completion of an
An image is shown in Fig. 33 which is the FEM image of ordered atomic arrangement.
Experimentally, the most accurate way of measuring
changes in work function is by the Kelvin method, which
uses a vibrating probe as a variable capacitor. A contact
potential difference is set up between two conductors con-
nected externally and the sample and a reference electrode
form a parallel plate condenser. The distance between the
two is periodically varied, thus generating an alternating
current in the connecting wire. If a voltage source is placed
in the connecting circuit just balancing out the contact po-
tential difference, no current will flow. Once this situation
has been achieved for the clean surface, a change in work
function due to adsorption is simply the additional voltage
which needs to be applied to compensate the change and
keep the current zero. Accuracies of φ to within ±1 meV
are obtainable.
Intimately linked to the concept of work function is the
process of thermionic emission. Thermionic emission is,
as the name suggests, the phenomenon whereby electrons
FIGURE 33 Field emission pattern of a tungsten tip. The are ejected from a metal when it is heated in vacuum.
(011) plane is in the center. (Courtesy of Lawrence Berkeley The electrons that require the least amount of thermal en-
Laboratory.) ergy to overcome their binding energy in the solid and
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
FIGURE 35 Universal curve for the electron mean free path as a function of electron kinetic energy. Dots indicate
individual measurements.
incidence. Photons have a much larger penetration depth 5000 eV) strikes the atoms of a material, electrons that
into the solid due to the much smaller scattering cross have binding energies less than the incident beam energy
section. However, electrons created by excitation below may be ejected from the inner atomic level. By this process
a few atomic layers from the surface cannot escape due a singly ionized, excited atom is created. The electron
to inelastic scattering within the solid. If a monoenergetic
beam of electrons of energy E p strikes a metal surface
then a typical plot of the number of scattered electrons
N (E) as a function of their kinetic energy E is shown in
Fig. 36. The curve is dominated by a strong peak at low
energies due to secondary electrons created as a result
of inelastic collisions between the incident electrons and
electrons bound to the solid. Other features in the spec-
trum include (i) the elastic peak at E p that is utilized in
LEED, (ii) inelastic peaks at loss energies of 10–500 meV
which provide information about the vibrational structure
of the surface–adsorbate complex utilized in EELS, (iii)
inelastic peaks at greater loss energies (plasmon losses)
which provide information about the electronic struc-
ture of surface atoms, and (iv) small peaks on the large
secondary electron peak due to Auger electrons which
provide information on the chemical composition of the
surface.
to break an atom from its neighbors and to move along square centimeters per second. If D is determined experi-
the surface. This bond breaking energy is of the order 15– mentally as a function of temperature, then a plot of ln D
50 kcal mol−1 for many metal surfaces. As the temperature versus 1/T will yield us the activation energy of the diffu-
of the surface is increased, more and more surface atoms sion process, provided that the diffusion occurs by a single
may acquire enough activation energy to break bonds with mechanism.
their neighbors and move along the surface. Such surface The rms distance X 2 1/2 can be expressed in terms of
diffusion plays an important role in many surface phe- the diffusion coefficient by substitution of Eq. (50) into
nomena involving atomic transport, e.g., crystal growth, Eq. (49) to give for b = 6 as follows:
vaporization, and adsorption. The migration of atoms or
X 2 1/2 = (4Dt)1/2 . (52)
molecules along the surface is one of the most important
steps in surface reactions and has proved to be the rate- From measurements of the mean travel distance of dif-
limiting step for many reactions that have been studied at fusing atoms the diffusion coefficient can be evaluated.
low pressures. Conversely, knowledge of the diffusion coefficient allows
A surface contains many defects on an atomic scale. us to estimate the rms distance or the time necessary to
Atoms in different surface sites have different binding en- carry out the diffusion. For example, the diffusion coef-
ergies. Surface diffusion can be considered as a multistep ficients of silver ions on the surface of silver bromide
process whereby atoms break away from their lattice po- can be estimated to be 10−19 and 10−13 cm2 /sec at 300
sition (e.g., a kink site at a ledge) and migrate along the and 100 K, respectively. Assuming that a rms distance of
surface until they find a new equilibrium site. 10−4 cm is required for silver particle aggregation (print-
The frequency f with which an atom will escape from out) to commence, of what duration are the light-exposure
a site will depend upon the height, E D∗ , of the potential times required? Using Eq. (52) we have t = 5 sec and
energy barrier it has to climb in order to escape as follows: t = 5 × 104 sec at 300 and 100 K, respectively. The ex-
ponential temperature dependence of D is, of course, the
E D∗
f = zν0 exp − , (48) reason that silver bromide photography cannot be carried
kB T out at low temperatures (much below 300 K) but is eas-
where z is the number of equivalent neighboring sites. For ily utilized at about room temperature. We can also see
a (111) face of an fcc metal, z = 6, the vibration frequency that at slightly elevated temperature (∼450 K) the ther-
is of the order of 1012 sec−1 . Assuming that E D is 20 kcal mal diffusion of silver particles should be rapid enough
mol−1 , at 300 K the atom makes one jump in every 50 sec, (D ≈ 3 × 10−7 cm2 sec−1 ) so that their aggregation will
while at 1000 K one in 10−8 sec. Thus, the rate of surface take place rapidly even in the dark (t ≈ 10−2 sec) in the
diffusion varies rapidly with temperature. This is the case absence of any photoreaction.
for a single jump to a neighboring equilibrium surface site. Surface diffusion has so far been discussed in terms of
What is of great importance is the long-distance motion a single surface atom. However, on a real surface many
of a surface atom. The result is derived from considering atoms diffuse simultaneously and in most diffusion ex-
a mathematical treatment of an atom executing a random periments the measured diffusion distance after a given
walk for a time t over a mean-square distance X 2 . diffusion time is an average of the diffusion lengths of a
For a sixfold symmetrical surface, we obtain the large, statistical number of surface atoms. A thermody-
following: namic treatment in terms of macroscopic parameters can
be followed to yield the following:
X 2 = f td 2 /3. (49)
D = D0 exp(−Q/RT ), (53)
The value of f d 2 is a property of the material that char-
acterizes its atomic transport. Its value provides informa- where Q is the total activation energy for the overall
tion about the mechanism of atomic transport, and it is diffusion process and only one diffusion mechanism is
customary to define the diffusion coefficient D as follows: involved.
Experimentally, the diffusion coefficient D is obtained
D = f d 2 /2b, (50) by using a relationship between the diffusion rate and cov-
where b is the number of coordinate directions in which erage gradient, namely Fick’s second law of diffusion in
diffusion jumps may occur with equal probability. one dimension:
Equation (50) can therefore be rewritten as ∂c/∂t = D(∂ 2 c/∂ x 2 ), (54)
D = D0 exp − E D∗ kB T , (51)
where c is the concentration of adatoms, t the time, and x
where D0 = (v0 d 2 /6), (v0 d 2 /4), for sixfold or fourfold the distance along the surface. In most surface diffusion
symmetry, respectively; D is usually given in units of studies the surface concentration of diffusing atoms, c, is
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
measured as a function of distance x along the surface, plane at 327 K. The field ion images are taken at 1-min
and Eq. (54) is solved by the use of boundary conditions intervals and the Rh atom can clearly be seen to have
that approximate the experimental geometry. These ex- diffused across the surface. Unfortunately because of the
periments are by no means trivial, and many novel ex- high field strengths employed, adsorbates such as O or N
perimental techniques have been applied to study surface tend to be stripped from the surface as ions, so their mi-
diffusion on single crystals. croscopic diffusion cannot be studied by this method. An
A technique which has been used to measure surface interesting result from FIM studies of metal adatoms on
diffusion rates is scanning Auger electron spectroscopy, metals is the recognition of clusters as important contri-
which can follow adsorbate diffusion. A particular Auger butions to material transport. It has been found that rhe-
transition of the adsorbate under investigation is used as a nium dimers diffuse more rapidly than single Rh atoms on
monitor of relative concentration versus distance scanned the W(112) plane, and Rh trimers diffuse at roughly the
across the surface. Profiles are recorded after heating pe- same rate as dimers. This is not a general trend, how-
riods to observe the change in concentration profile as a ever, as iridium dimers move much more slowly than
function of time and temperature. singles.
While this technique monitors mass transport, and val- While the single adatom diffusion technique gives us
ues of D and Q are averaged values, field ion microscopy detailed microscopic information, the mass transport tech-
can be used to follow the diffusion of individual atoms niques are of use as they help to give understanding of the
across a surface. To study diffusion, the metal is vapor technologically important processes such as sintering and
deposited onto the tip. The tip is then heated to remove creep.
evaporated adatoms until only one or two remain on the
surface plane of interest. The diffusion is then examined
C. Surface Reactions
by photographically recording the position of the adatom
at low temperatures, removing the applied field, and heat- Heterogeneous catalysis, corrosion, photosynthesis, and
ing to the desired temperature for a given time. The tip is adhesion are examples of chemical processes that are par-
then cooled, the field reapplied, and the field ion image tially or fully controlled by reactions at surfaces. For the
examined to see if the atom has moved to a neighboring case of gas–solid reactions the surface reactions can be
site. This process is then repeated many times to obtain divided into two major categories: (i) stoichiometric sur-
useful values of diffusion rates, and by examining the dif- face reactions where the solid surface participates directly
fusion over a temperature range, the activation barrier to in the reaction by compound formation and (ii) catalytic
surface diffusion can be determined. Figure 41 shows a surface reactions where the reaction occurs at the solid
series of field ion images of a Rh atom on the W(112) surface but the surface does not undergo any net chemical
change. In both cases gaseous molecules impinge on the
surface, adsorb, react, and form various intermediates of
varying stability, and then the products desorb into the gas
phase if they are volatile.
All surface reactions involve a sequence of elementary
steps that begins with the collision of the incident atoms or
molecules with the surface. As the gas species approaches
the surface it experiences an attractive potential whose
range depends upon the electronic and atomic structures of
the gas and surface atoms. A certain fraction of the incident
gas molecules is trapped in this attractive potential well
with a sticking probability given by the following:
S(, T ) = S0 (1 − ) exp(−E a /RT ), (55)
where S0 is the initial (zero coverage) sticking coeffi-
cient, the surface coverage (0 < < 1), and E a the
activation energy for adsorption. If this force attraction
is due to a van der Waals interaction, the trapping is due
to physical adsorption. If the attraction is much stronger,
FIGURE 41 Diffusion of rhenium atoms on W(211) at 327 K. Field having the character of chemical bonding then we have
ion images are taken after 60 sec diffusion intervals. (Courtesy of chemisorption and the process is known as sticking. The
Lawrence Berkeley Laboratory.) boundary between the two types of bonding is usually
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
set at a binding energy of 15 kcal mol−1 . Sticking by adsorbed species is given in terms of the gas pressure P
chemisorption is often preceded by trapping into a physi- by the Langmuir isotherms:
sorbed state, in which case the physisorbed state is known
= KP/(1 + KP), (56)
as a precursor state for chemisorption. The presence of a
precursor state is indicated by a sticking coefficient that where K is an equilibrium constant. Catalyzed surface
remains almost constant as surface coverage increases un- reactions usually take place between two or more coad-
til a saturation coverage is reached, when it rapidly falls to sorbed species which compete for adsorption sites on the
zero. This behavior arises because molecules in the rela- surface. When j gases adsorb competitively and associa-
tively mobile precursor state diffuse to parts of the surface tively, the surface coverage by species i is given by the
which are not covered by chemisorbed molecules. In di- following:
rect chemisorption the sticking coefficient varies strongely
with coverage and with ordering of the chemisorbed = K i Pi 1+ K j Pj . (57)
layer. j
The adsorbed species may also desorb from the surface Many catalyzed surface reactions can be treated as
if its energy overcomes the attractive surface forces. When a two-step process with an adsorption equilibrium fol-
a surface reaction occurs a certain proportion of the ad- lowed by one rate-determining step (diffusion, surface re-
sorbed species either decomposes (unimolecular reaction) action, or desorption). The surface reaction kinetics are
or reacts with a second adsorbed species (bimolecular re- usually discussed in terms of two limiting mechanisms,
action) before the product desorbs. the Langmuir–Hinshelwood (LH) and Eley–Rideal (ER)
During the initial interaction of the gas molecule with mechanisms. In the LH mechanism, reaction takes place
the surface as the incoming molecule falls into a potential directly between species which are chemically bonded
well the kinetic energy normal to the surface increases. (chemisorbed) on the surface. For a bimolecular LH sur-
Unless this energy is transferred to some other degree of face reaction. Aads + Bads → products, with competitive
freedom the molecule will simply bounce off; there will chemisorption of the reactants, the rate of reaction is given
be no trapping or sticking. In the case of physisorption en- by the following expression:
ergy transfer via phonons is usually most important while
Rate = kR A B
for chemisorption electronic excitation via electron–hole
pairs is thought to be important. = kR K A K B PA PB /(1 + K A PA + K B PB )2 . (58)
The exchange of translational energy T with the pho-
nons Vs is called T − Vs energy exchange. The gas mole- The reaction rate is proportional to the surface coverages
cule may also exchange internal energy, rotation R or vib- A and B and to the reaction rate constant kR . For non-
ration V with the vibrating surface atoms. In this case there competitive adsorption, the rate expression becomes the
are also R − Vs and V − Vs energy transfer processes. following:
In order to understand the dynamics of gas–surface in- kR K A K B PA PB
Rate = kR A B = . (59)
teraction, it is necessary to determine how much energy is (1 + K A PA )(1 + K B PB )
exchanged between the gas and surface atoms through the General rate expressions of the form given in equa-
various energy-transfer channels. In addition the kinetic tions and have been experimentally verified for many
parameters (rate constants, activation energies, and pre- types of LH reactions. Similar but more complicated
exponential factors) for each elementary surface step of rate expressions are easily derived assuming different
adsorption, diffusion, and desorption are required in order (non-Langmuir) isotherms, higher-order reaction steps,
to obtain a complete description of the gas–surface energy or dissociative chemisorption of the reactants. In the
transfer process. ER mechanism, surface reaction takes place between a
Most surface reactions take place at high pressures (1– chemisorbed species and a nonchemisorbed species, e.g.,
100 atm) either because of the chemical environment of Aads + Bg → products. The nonchemisorbed species may
our planet or to establish optimum reaction rates in chem- be physisorbed or weakly held in a molecular precursor
ical processing. Under these conditions, surfaces are usu- state. In this case, the rate expression for the surface
ally covered by at least one monolayer of adsorbed species. reaction becomes
Since activation energies for adsorption and surface dif-
fusion are generally small (a few kT ), equilibrium among Rate = kR A PB = kR K A PA PB /(1 + K A PA ). (60)
the different surface species, reactants, reaction interme- Presently no proven examples exist in which surface re-
diates, and products, is readily established. In the simplest action occurs by the ER mechanism.
(but not general and important) case of localized, associa- Surface reaction kinetics determined experimentally are
tive adsorption into a single state, the surface coverage by often expressed in the form of a power rate law as follows:
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
1. Molecular-Beam Scattering
The most powerful experimental technique for investigat-
ing the dynamics of the gas–solid interaction is molecular-
beam surface scattering (MBS). The experimental ar-
rangement is similar to that already described for helium
atom diffraction. Instead of using an atomic beam of a
light molecular weight gas and observing diffraction ef-
fects, a well-collimated beam of molecules strikes the ori-
ented, preferably single-crystal, surface, and the species
that are scattered at a specific solid angle are detected by
a mass spectrometer. The angular distribution of the scat-
tered molecules can be obtained by rotation of the mass
spectrometer about the sample. The velocity distribution
of the molecules after scattering is deduced by chopping FIGURE 42 Rectilinear plot displaying the (a) specular scattering
the scattered molecules and thereby measuring their time and (b) cosine angular distribution of scattered beams. The arrow
of flight to the detector. The surface residence times of the indicates the angle of incidence.
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
direct information on inelastic scattering than angular dis- Exchange of hydrogen and deuterium to form HD is one
tributions alone. Although considerable information can of the simplest reactions that can be catalyzed on clean
be gained from such studies it has been impossible to get metal surfaces at temperatures as low as 100 K. The same
state specific information and indeed it is often unclear reaction is immeasurably slow in the gas phase due to the
whether internal states present more efficient energy very high dissociation energies of the reacting molecules
transfer channels than phonons or vice versa. The diffi- (103 kcal mol−1 ). the H2 –D2 exchange reaction has
culties in studying internal energy transfer in molecular been studied over flat (111) and stepped (332) single
collisions with surfaces can be resolved by the applica- crystal surfaces of platinum. The Pt(332) surface contains
tion of state-specific detection techniques. Laser-induced high concentrations of periodic surface irregularities
fluorescence, multiphonon ionization, IR excitation with (steps) that are one atom in height. Reaction probabilities
bolometric detection, and IR emission techniques have averaged over the cosine HD angular distributions were
all been used to obtain state-resolved measurements of 0.07 on the (111) surface and 0.35 on the (332) surface
the internal energy distributions of molecules scattering under identical experimental conditions (Ts = 1100 K,
from surfaces. It has also been possible to separate exp- Tg = 300 K). The reaction probability on the stepped
erimentally direct inelastic and trapping-desorption scat- surface varied markedly with the angle of incidence
tering. In the direct inelastic scattering of diatomics, co- of the mixed H2 –D2 molecular beam. This is shown
upling to rotational energy has been found to be very in Fig. 43. The reaction probability was highest when
important and to exhibit several interesting phenomena: the beam was incident on the open edge of the step
rotational rainbows and the production of rotationally and lowest when the bottom of the step was shadowed
aligned molecules in scattering.
(curve a). When the H2 –D2 beam was incident parallel metals display heats of adsorption for oxygen that are com-
to the steps, the rate of HD production was independent parable to the heats of formation of the stable metal oxides.
of the angle of incident at all angles of crystal rotation Metals such as Ti, Zr, Mn, Al, Cu, and Fe dissolve oxy-
(curve b). These results indicate that the atomic step sites gen more readily and form stable oxide films even at room
are about seven times more active than the (111) terrace temperature. At low oxygen pressures these films often as-
sites for the dissociative chemisorption of hydrogen and sume a crystalline structure, whereas at higher pressures
deuterium molecules. Detailed analysis of the scattering (>10−3 atm) the films tend to be amorphous. At higher
data revealed a barrier height of 4–8 kJ mol−1 for temperatures (400–1000 K), oxide formation occurs read-
dissociative H2 chemisorption on the (111) surface. On ily on the surfaces of nearly all metals.
the other hand, this barrier did not exist (E a = 0) on the Growth of surface oxide films takes place only if
stepped surface. This difference in activation energy cations, anions, and electrons can diffuse through the
alone accounts for the different reaction probabilities of oxide layer. The growth kinetics of very thin films
the step and terrace sites. While the dissociation proba- (∼10–50 Å) often follow the Mott or Cabrera–Mott
bility of hydrogen molecules was higher on the stepped mechanisms in which electrons tunnel through the film
surface than on Pt(111), the kinetics and mechanism and associate with oxygen atoms to produce oxide ions
of HD recombination appear to be identical over both at the surface. A large local electric field (106 –107 V/cm)
surfaces once dissociation takes place. On both surfaces, results at the surface which facilitates cation diffusion
HD formation follows a parallel LH mechanism with from the metal–oxide interface to an interstitial site of the
one of the reaction branches operative over the entire oxide. The film thickness Z at time t is given by
temperature range of 300–1075 K. This branch has an
Z = α1 ln (α2 t + 1) (63)
activation energy and pseudo-first-order preexponential
factor of E a = 54 kJ mol−1 and A1 = 8 × 104 sec−1 or inverse logarithmic
for the stepped surface and E a = 65 kJ mol−1 and
1/Z = α3 − α4 ln t (64)
A1 = 3 × 105 sec−1 for the Pt(111) surface. A second
branch is observed for temperatures above 575 K, but the law of growth depending on whether electron tunneling
kinetic parameters for this pathway could not be accurately or cation diffusion is rate limiting. The constants α1 –α4
determined. are determined by the material, its structure, and the re-
action conditions. The electron field strength and rate of
growth decrease exponentially as the film thickens, result-
D. Stoichiometric Surface Reactions
ing in an effective limiting thickness for the surface oxide
Stoichiometric surface reactions are those in which the layer.
surface participates directly in the reaction by compound In addition to surface oxides, a vast array of surface
formation. Oxidation and corrosion are the two most im- compounds can be produced from the reactions of halo-
portant classes of such reactions. gens, chalcogenides, and carbon-containing molecules
Surface oxidation of metals encompasses a series of with metal surfaces. Chemisorption of chlorine near 300 K
at least three reaction steps that include (1) dissociative on Cu, Ti, W, Mo, Ta, Ni, Pd, and Au, for example, re-
chemisorption of oxygen on the metal surface, (2) rear- sults in the formation of stable surface compounds which
rangement of the surface atoms with dissolution of oxygen often evaporate as molecular chlorides upon heating at el-
into the near surface region, and (3) nucleation of oxide evated temperatures. Chemisorption of chlorine at 300 K
islands which grow laterally and eventually condense to on Ag(100) and Ag(111) produces chemisorbed chlorine
produce continuous oxide films. The oxide islands appear overlayers which react irreversibly at about 425 K to pro-
to precipitate suddenly once a critical oxygen concentra- duce AgCl with an activation energy of 56 kJ mol−1 . Upon
tion is reached in the near surface region. Nucleation takes heating, AgCl desorbs at about 830 K with a desorption
place most readily at surface irregularities such as atomic activation energy of 192 kJ mol−1 .
steps, dislocations, and stacking faults. At room tempera- MBRS has been used to investigate the dynamics of
ture, noble metals such as Rh, Ir, Pd, and Pt display little several surface corrosion reactions at low reactant pres-
tendency for oxygen incorporation or surface rearrange- sures. Systems studied include the oxidation Si, Ge, Mo,
ment. Initial heats of oxygen chemisorption on these met- and graphite, and the halogenation of Si, Ge, Ta, and Ni.
als are much greater than the heats of formation of the With the exception of silicon and germanium oxidation,
corresponding bulk oxides. Other metals such as Cr, Nb, where dissociative chemisorption of oxygen is apparently
Ta, Mo, W, Re, Ru, Co, and Ni, dissolve surface oxygen rate limiting, the kinetics of these surface reactions gen-
by a place exchange mechanism in which oxygen atoms erally appear to be controlled by surface or bulk diffusion
interchange positions with underlying metal atoms. These of the reacting species.
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
E. Catalytic Surface Reactions the sequence of elementary reactions that include ad-
sorption, surface migration, chemical rearrangements, and
A major goal of basic surface chemistry is in trying to
reactions in the adsorbed state, and desorption of the
understand heterogeneous catalysis on an atomic scale.
products, the rate of each step must be of steady state.
Virtually all chemical technologies and many technolo-
The rate of the overall catalytic reaction per unit area
gies in other fields use catalysis as an essential part of the
catalyst surface can be expressed as (moles of prod-
process. The most important catalytic processes are sum-
uct/catalyst area × time). Another expression for catalytic
marized in Table V. These processes are listed together
rate is the turnover number or turnover frequency. This
with the pertinent chemical reactions, widely used cata-
is the (number of molecules of product/number of cata-
lysts, and typical reaction conditions. There are several
lyst sites × time). For most heterogeneous catalyzed small
definitions of a catalyst, one general definition being that
molecule reactions the turnover number varies between
it is a substance that accelerates a chemical reaction with-
10−2 and 102 sec−1 . The calculation of the turnover num-
out visibly undergoing chemical change. Indeed, a major
ber is limited by the difficulty of determining the true
role of a catalyst is in accelerating the rate of approach to
number of active sites.
chemical equilibrium. However, a catalyst cannot change
The reaction probability reveals the overall efficiency
the ultimate equilibrium determined by thermodynamics.
of a catalyst. It is defined as follows:
Another major function of a catalyst is to provide reac-
tion selectivity. Under the conditions in which the reaction reaction probability
is to be carried out, there may be many reaction channels, rate of formation of product molecules
each thermodynamically feasible, that lead to the forma- =
rate of incidence of reactant molecules.
tion of different products. The selective catalyst will accel-
The determination of the rates of the net catalytic re-
erate the rate of only one of these reactions so that only the
actions and how the rates change with temperature and
desired product molecules form with near-theoretical or
pressure is of great practical importance. Although there
100% efficiency. One example is the dehydrocyclization
are many excellent catalysts that permit the achievement
of n-heptane to toluene:
of chemical equilibria (for example, Pt for oxidation of
CH3 CO and hydrocarbons to CO2 and H2 O), most catalyzed
reactions are still controlled by the kinetics of one of the
CH3 (CH2)5 CH3 4H2 surface processes. From the knowledge of the activation
energy and the pressure dependencies of the overall reac-
tion, the catalytic process can be modeled and the optimum
This is a highly desirable reaction that converts aliphatic reaction conditions can be calculated. Such kinetic anal-
molecules to aromatic compounds. The larger concentra- ysis, based on the macroscopic rate parameters, is vital
tion aromatic component in gasoline, for example, greatly for developing chemical technologies based on catalytic
improves its octane number. However, n-heptane may par- reactions.
ticipate in several competing simpler reactions. These in- The rates of reactions are extremely sensitive to small
clude hydrogenolysis, which involves C C bond scission changes of chemical bonding of the surface species
to form smaller molecular weight fragments (methane, that participate in the surface reaction. Since the energy
ethane, and propane); partial dehydrogenation, which pro- necessary to form or break the surface bonds appears
duces various olefins; and isomerization, which yields in the exponent of the Arrhenius expression for the
branched chains. All of these reactions are thermody- rate constant for the overall reaction, it can increase or
namically feasible, and since they appear to be less com- decrease the rate exponentially. For example, a change
plex than dehydrocyclization, they compete effectively. of 3 kcal in the activation energy alters the reaction rate
A properly prepared platinum catalyst surface catalyzes by over an order of magnitude at 500 K. Small variations
the selective conversion of n-heptane to toluene without of chemical bonding at different surface irregularities,
permitting the formation of other products. The catalyst steps, and kinks, as compared to atomic terraces, can give
selectivity is equally important for the reactions of small rise to a very strong structure sensitivity of the reaction
molecules (such as the hydrogenation of CO to produce rates and the product distribution. Rate measurements
a desired hydrocarbon) or very large molecules of bio- exponentially magnify the energetic alterations that
logical importance, where enzyme catalysts provide the occur on the surface and could provide a very sensitive
desired selectivity. probe of structural and electronic changes at the sur-
Catalysis is a kinetic phenomenon; we would like to face and changes of surface bonding on the molecular
carry out the same reaction at an optimum rate over and scale.
over again using the same catalyst. In most cases such One of the most important considerations in catalysis
a steady-state operation is desirable and aimed for. In is the need to provide a large contact area between the
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
3O2 O 3H2O
O
Olefins C2 H4 + 12 O2 → CH3 CHO V2 O5
1
O2 O SnO2 ·MoO3
2
O2 Bi2 O3 ·MoO3
O
H2O
Reforming
Dehydrogenation R R Pt, Pt–Re, Pt–Ge 700–800 K
3H2
continues
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
TABLE V (Continued )
Processes Typical reactions Catalyst Reaction conditions
Dehydroisomerization Ir Au/Al2 O3
3H2
Isomerization
Hydrogenolysis H2 2C3H8
Hydrogenation
H2
Selective
Hydrogenation H2 NiS 420–500 K
Olefins 1–10 atm
Alkynes R C CH + H2 → RHC CH2 Pt/Al2 O3 220–250 K
1–10 atm
Steam reforming CH4 + H2 O → CO + 3H2 Ni K2 O/Al2 O3 850–1100 K
30–100 atm
Water gas CO + H2 O → CO2 + H2 Fe2 O3 ·Cr2 O3 ZnO Cu2 O 650–800 J
20–50 atm
reactants and the surface. The total rate (moles of product square meters per gram of catalyst, is available in these
per time) is proportional to the surface area. As a conse- catalyst systems. Transition-metal catalysts are generally
quence, a lot of effort is expended to prepare large surfaces employed in a small, 10- to 100-Å-diameter particle form
area catalysts and to measure the surface area accurately. dispersed on large-surface-area supports. The support can
One example of high-surface-area catalysts is the group be a specially prepared alumina or silica framework (or
of catalysts known as zeolites, which are aluminosilicates a zeolite) that can be produced with surface areas in the
used for the cracking of hydrocarbons. They have crystal 102 -m2 /g range. These supported metal catalysts are of-
structures full of pores 8–20 Å in size. The structure of one ten available with near-unity dispersion (dispersion is de-
of the many zeolites used for catalysis, faujasite, is shown fined as the number of surface atoms per total number
in Fig. 44. Since the catalytic reactions occur inside the of atoms in the particle) of the metal particles and are
pores, an enormous inner surface area, of several hundred usually very stable in this configuration during the cat-
alytic reaction. The metal is frequently deposited from
solution as a salt and then reduced under controlled con-
ditions. Alloy catalysts and other multicomponent catalyst
systems can also be prepared in such a way that small al-
loy clusters are formed on the large-surface-area oxide
supports.
Most catalytic reactions take place via the formation of
intermediate compounds between the reactants or prod-
ucts and the surface. The surface atoms of the catalyst
form strong chemical bonds with the incident molecules,
and it is this strong chemical surface–adsorbate interac-
tion which provides the driving force for breaking high-
binding-energy chemical bonds (C C, C H, H H, N N,
and C O bonds), which are often an important part of the
catalytic reaction.
A good catalyst will also permit rapid bond breaking
FIGURE 44 Line drawing of the structure of the zeolite faujasite. between the adsorbed intermediates and the surface and
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
the speedy release or desorption of the products. If the TABLE VI Structure-Sensitive and Structure-
surface bonds are too strong, the reaction intermediates Insensitive Catalytic Reactions
block the adsorption of new reactant molecules, and the Structure sensitive Structure insensitive
reaction stops. For too-weak adsorbate-catalyst bonds, the
necessary bond-scission processes may be absent. Hence, Hydrogenolysis Ring opening
the catalytic reaction will not occur. A good catalyst is Ethane: Ni Cyclopropane: Pt
thought to be able to form chemical bonds of intermediate Methylcyclopentane: Pt
strength. These bonds should be strong enough to induce Hydrogenation Hydrogenation
bond scission in the reactant molecules. However, the bond Benzene: Ni Benzene: Pt
should not be too strong to ensure only short residence Isomerization Dehydrogenation
times for the surface intermediates and rapid desorption Isobutane: Pt Cyclohexane: Pt
of the product molecules, so that the reaction can proceed Hexane: Pt
with a large turnover number. Cyclization
Of course, activity is only one of many parameters that Hexane: Pt
are important in catalysis. The selectivity of the catalyst, its Heptane: Pt
thermal and chemical stability, and dispersion, are among
the other factors that govern our choices. While macro-
scopic chemical-bonding arguments can explain catalytic viewed as modified acid–base reactions. It is common to
activity in some cases, atomic-scale scrutiny of the surface refer to an oxide catalyst as acidic or basic according to
intermediates, catalyst structure, and composition, and an its ability to donate or accept electrons or protons.
understanding of the elementary rate processes are nec- The electron transfer capability of a catalyst is ex-
essary to develop the optimum selective catalyst for any pressed according to the Lewis definition. A Lewis acid
chemical reaction. is a surface site capable of receiving a pair of electrons
One of the important directions of research in catalysis from the adsorbate. A Lewis base is a site having a free
is the identification of the reaction intermediates. The sur- pair of electrons that can be transferred to the adsorbate.
face residence times of many of these species are longer The proton-transfer capability of a catalyst is expressed
than 10−5 sec under most catalytic reaction conditions (as according to the Brønsted definition. A Brønsted acid is
inferred from the turnover frequency). They may be de- a surface site capable of losing a proton to the adsorbate
tected by suitable spectroscopic techniques either during while a Brønsted base is a site that can accept a proton
the steady-state reaction or when isolated by interrupting from the adsorbed species.
the catalytic process. Perhaps the most widely used catalysts, the zeolites,
The concept of active sites is an important one in catal- best represent the group of oxides that exhibit acid–base
ysis. A surface generally possesses active sites in numbers catalysis. Zeolites are alumina silicates, some of which
that are smaller than the total number of surface atoms. The are among the more common minerals in nature. Modern
presence of unique atomic sites of low coordination and synthesis techniques permit the preparation of families of
different valency that are very active in chemical reactions zeolite compounds with different Si/Al ratios. Since the
has been clearly demonstrated by atomic-scale studies of Al3+ ions lack one positive charge in the tetrahedrally
metal and oxide surfaces. A catalytic reaction is defined to coordinated silica, Si4+ , framework, they are sites of pro-
be structure sensitive if the rate changes markedly as the ton or alkali–metal affinity. Variation of the Si/Al ratio
particle size of the catalyst is changed. Conversely, the re- gives rise to a series of substances of controlled but dif-
action is structure insensitive on a given catalyst if its rate ferent acidity. By using various organic molecules dur-
is not influenced appreciably by changing the dispersion ing the preparation of these compunds that build into the
of the particles under the usual experimental conditions. structure, subsequent decomposition leaves an open pore
In Table VI we list several reactions that belong to these structure, where the pore size is controlled by the skeletal
two classes. Clearly, variations of particle size give rise to structure of the organic deposit. Very high internal sur-
changes of atomic surface structure. The relative concen- face area catalysts (102 m2 /g) can be obtained this way
trations of atoms in steps, kinks, and terraces are altered. with controlled pore sizes of 8–20 Å and controlled acidity
Nevertheless, no clear correlation has been made to date [(Si/Al) ratio]. These catalysts are utilized in the cracking
between variations of macroscopic particle size and the and isomerization of hydrocarbons that occur in a shape
atomic surface structure. selective manner as a result of the uniform pore structure
Most surface reactions and the formation of surface and are the largest volume catalysts in petroleum refining.
intermediates involve charge transfer, either an electron They are also the first of the high-technology catalysts in
transfer or a proton transfer. These processes are often which the chemical activity is tailored by atomic-scale
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
study and control of the internal surface structure and prevent side reactions, and alter the reaction paths in a
composition. variety of ways.
A catalyst used in industry is very rarely a pure ele- While industrial catalytic systems are complex and are
ment or compound. Most catalysts contain a complex mix- not readily suited to basic science studies to understand
ture of chemical additives or modifiers that are essential how they work on an atomic scale, one approach to their
ingredients for high activity and selectivity. Promoters are understanding is the synthetic approach. In this approach
beneficial additives that increase activity, selectivity, or we begin with a very simple system then synthesize com-
useful catalyst lifetime (stability). Structural promoters plexity from this. The catalyst particle is viewed as com-
inhibit sintering of the active catalyst phase or present posed of single crystal surfaces, as shown in Fig. 45. Each
compound formation between the active component and surface has different reactivity and the product distribu-
the support. The most frequently used chemical promoters tion reflects the chemistry of the different surface sites.
are electron donors such as the alkali metals or electron We may start with the simplest single crystal surface [for
acceptors such as oxygen and chlorine. For example, in the example, the (111) crystal face of platinum] and examine
petroleum industry, chlorine and oxygen are often added its reactivity. It is expected that much of the chemistry
to commercial platinum catalysts used for reforming reac- of the dispersed catalyst system would be absent on such
tions by which aliphatic straight-chain hydrocarbons are a homogeneous crystal surface. Then high-Miller-index
converted to aromatic molecules (dehydrocyclization) and crystal faces are prepared to expose surface irregularities,
branched isomers (isomerization). steps, and kinks of known structure and concentration,
These additives accomplish several tasks during the re- and their catalytic behavior is tested and compared with
action. By changing the chemical bonding of some of the the activity of the dispersed supported catalyst under iden-
surface intermediates, the steady-state concentration of tical experimental conditions. If there are still differences,
these intermediates may be altered, and thus a somewhat the surface composition is changed systematically or other
higher concentration of the catalytically active species is variables are introduced until the chemistries of the model
obtained. In this way the rate of the reaction is increased system and the working catalyst become identical. This
and the selectivity may be improved. approach is described by the following sequence:
Often multicomponent catalyst systems are utilized to
carry out reactions consisting of two or more active metal
components or both oxide and metal constituents. For ex-
ample, a Pt–Rh catalyst facilitates the removal of pol-
lutants from car exhausts. Platinum is very effective for
oxidizing unburned hydrocarbons and CO to H2 O and
CO2 , and rhodium is very efficient in reducing NO to
N2 , even in the same oxidizing environment. Dual func-
tional or multifunctional catalysts are frequently used to
carry out complex chemical reactions. In this circumstance
the various catalyst components should not be thought
of as additives, since they are independently responsible
for different catalytic activity. Often there are synergistic
effects, however, whereby the various components ben-
eficially influence each other’s catalytic activity to pro-
vide a combined additive and multifunctional catalytic
effects.
It should be clear from this discussion that the working,
active, and selective catalyst is a complex, multicompo-
nent chemical system. This system is finely tuned and
buffered to carry out desirable chemical reactions with
high turnover frequency and to block the reaction paths for
other thermodynamically equally feasible but unwanted
reactions. Thus, an iron catalyst or a platinum catalyst
is composed not only of iron or platinum but of several
other constituents as well to ensure the necessary surface FIGURE 45 Catalyst particle viewed as a crystallite, composed
structure and oxidation state of surface atoms for optimum of well-defined atomic planes. (Courtesy of Lawrence Berkeley
catalytic behavior. Additives are often used to block sites, Laboratory.)
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
structure of crystal surfaces and adsorbed gases lower part of the high-pressure isolation cell is then lifted
↓↑ to enclose the sample in a 30-cm3 volume. The isolation
surface reactions on crystals at low pressures chamber can be pressurized to 100 atm if desired and is
(≤10−4 torr) connected to a gas chromatograph that detects the product
↓↑ distribution as a function of time and surface temperature.
surface reactions on crystals at high pressures The sample may be heated resistively both at high pres-
(10+3 −10+5 torr) sure or in ultrahigh vacuum. After the reaction study the
↓↑ isolation chamber is evacuated, opened, and the catalytic
reactions on dispersed catalysts surface is again analyzed by the various surface-diagnostic
techniques. Ion bombardment cleaning of the surface or
Investigations in the first step define the surface struc- means to introduce controlled amounts of surface addi-
ture and composition on the atomic scale and the chemical tives by vaporization are also available. The reaction at
bonding of adsorbates. Studies in the second step, which high pressures may be studied in the batch or the flow
are carried out at low pressures, reveal many of the ele- mode.
mentary surface reaction steps and the dynamics of surface Typical catalytic reactions that have been investigated,
reactions. Studies in the third and fourth steps establish the in some detail, using this approach include hydrocarbon
similarities and differences between the model system and conversion on platinum and modified platinum surfaces
the dispersed catalyst under practical reaction conditions. (isomerization, hydrogenolysis, hydrogenation, dehydro-
The advantage of using small-area catalyst samples is genation and cyclization), dehydrosulfurization on molyb-
that their surface structure and composition can be pre- denum, ammonia synthesis on iron, and carbon monoxide
pared with uniformity and can be characterized by the hydrogenation on iron.
many available surface diagnostic techniques.
In this approach to catalytic reaction studies the sur-
face composition and structure are determined in the same F. Photochemical Surface Reactions
chamber where the reactions are performed, without ex- Photochemical surface reactions form their own class
posing the crystal surface to the ambient atmosphere. This due to the fact that a thermodynamically uphill reaction
necessitates the combined use of an ultrahigh vacuum en- ( G > 0) may be carried out with the aid of an external
closure, where the surface characterization is carried out, source of energy, light. In fact, one of the most important
and a high-pressure isolation cell, where the catalytic stud- chemical reactions of our planet, photosynthesis, requires
ies are performed. Such an apparatus is shown in Fig. 46. the input of 720 kcal/mol of energy to convert carbon
The small-surface-area (approximately 1-cm2 ) catalyst is dioxide and water to one mole of sugar:
placed in the middle of the chamber, which can be evac-
uated to 10−9 torr. The surface is characterized by LEED light
6CO2 → 6H2 O−−−−−−→6H12 O6 + 6O2 (65)
and AES and by other surface diagnostic techniques. The chlorophyll
FIGURE 46 Schematic representation of the experimental apparatus to carry out catalytic reaction-rate studies on
single-crystal surfaces of low surface area at low and high pressures in the range 10−7 to 104 torr.
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
It is useful to consider light as one of the reactants in ically allowed photon-assisted reactions. The latter reac-
photosynthesis. By adding the light energy to Eq. (65), the tions are thermodynamically feasible without any external
reaction becomes athermic or even exothermic if excess energy input, but light is used to obtain certain product
light energy is utilized, selectively. Excitation of selected vibrations, rotations, or
electronic states of the incident or adsorbed molecules by
hv + H2 O + CO2 = CH2 + 32 O2
light permits us to change the reaction path or increase the
We may consider photon-assisted or photochemical reac- reaction rate. For example, the hydrogenation of acetylene
tions of many types that lead to the formation of lower or the oxidation of ammonia can be photon-assisted,
molecular weight hydrocarbons and of other products. leading to different reaction rates than in the absence of
One of the simplest of these important new classes of light.
reactions leads to the dissociation of water: As an example of a photocatalyzed surface reaction
we discuss the photoelectrochemical dissociation of wa-
hv + H2 O = H2 + 12 O2 (66)
ter. It was shown in 1972 that upon illumination of re-
Another leads to the formation of methane: duced titanium oxide (TiO2 ), which served as the anode
in basic electrolyte solution, oxygen evolution was de-
hv + CO2 + 2H2 O = CH4 + 32 O2 (67)
tectable at the anode, and hydrogen evolved at a metal
or to the fixation of nitrogen: (platinum) cathode. This reaction requires an energy of
1.23 V/electron (a two-electron process per dissociated
hv + 3H2 O + N2 = 2NH3 + 32 O2 (68)
water molecule). In the presence of light of energy equal
Light as a reactant may be employed in two ways. The to or greater than the band-gap energy of titanium oxide
adsorbed molecules can be excited directly by photons (3.1 eV), an external voltage as low as 0.2 V was suf-
of suitable energy to a higher vibrational or electronic ficient to dissociate water. The process stopped as soon
states. The excited species then may undergo chemical re- as the light was turned off, and started again upon reil-
arrangements or interactions that are different from those lumination. Shortly after, several other systems showed
in the ground vibrational or electronic states. Alterna- the ability to carry out photon-assisted dissociation of
tively, the solid can be excited by light in the near-surface water. When p-type gallium phosphide, GaP, was used
region. Photons of band-gap or greater energy may ex- as a cathode instead of platinum upon illumination of
cite electron–hole pairs at the surface. As long as these the TiO2 anode, O2 and H2 could be generated at the
charge carriers have a relatively long lifetime (i.e., they are semiconductor anode and cathode, respectively, without
trapped at the surface, so that their recombination is not an the need of applying any external potential. When stron-
efficient process), there is a high probability of their cap- tium titanate, SrTiO3 , was substituted for TiO2 as the an-
ture by the adsorbed reactants. These, in turn, can undergo ode, H2 O photodissociation was found to take place with-
reduction or oxidation processes using the photogenerated out external potential even when a platinum cathode was
electrons and holes, respectively. The photographic pro- employed.
cess is one example of this type of surface photochemical
reaction. However, we would like the photogenerated elec-
trons and holes to be captured by the adsorbed molecules
in order to carry out photochemical surface reactions of
the adsorbates instead of the photodecomposition of the
solid at the surface. The cross sections for adsorption of
band-gap or higher-than-band-gap energy photons are so
large that the photogeneration of electron–hole pairs is
a most efficient process. At present, this cannot be read-
ily matched by the efficiency of direct photoexcitation
of vibrational or electronic energy states of the adsorbed
molecules.
Many solid surfaces efficiently convert light to long-
lived electron–hole pairs that can induce the chemical
changes leading to the reactions in Eqs. (66)–(68). In fact,
FIGURE 47 Energy conditions needed to reduce B+ to B and
inorganic photoreaction is one of the exciting new fields
oxidize A− to A at a semiconductor surface. Electrons that are
of surface science and heterogeneous catalysis. excited by photons into the conduction band ECB must be able to
It is important to distinguish between thermodynami- reduce B+ , and electron vacancies (holes) in the valence band
cally uphill photochemical reactions and thermodynam- E VB must be able to oxidize A− .
P1: GRB/GJK P2: GSS Final Pages
Encyclopedia of Physical Science and Technology EN016H-757 July 31, 2001 16:23
Figure 47 shows a schematic energy diagram to indicate flat-band potential, may become photochemically active
the conditions necessary to carry out photoelectrochemi- as a result of strong band bending at the surface.
cal reactions efficiently. If the band-gap energy is greater Often the oxidation or reduction photoreactions lead
than the free energies for the reduction and oxidation reac- to the decomposition of the semiconductor electrode ma-
tions, the photoelectron that is excited into the conduction terial. Instead of the photoreactions of adsorbate ions or
band by light could reduce B+ to B by electron transfer molecules, a solid-state photoreaction occurs. This is par-
from the surface to the molecule. The photogenerated elec- ticularly noticeable at the surfaces of illuminated CdS, Si,
tron vacancies (holes) could also oxidize the A− anions and GaP. Much of the research is therefore directed toward
to A by capturing the electron. For the photodissociation stabilizing these photoelectrode materials by suitable ad-
of water, the conduction band must be above the H+ /H2 sorbates that could prevent the occurrence of photodecom-
potential and the valence band below the O2 /OH− po- position by providing an alternative chemical route for the
tential to be able to carry out the photoreaction without photoreduction or photooxidation.
an external potential. The band gap must be greater than
1.23 V and the flat-band potential of the conduction and
valence bands energetically well placed with respect to the ACKNOWLEDGMENT
(H+ /H2 ) and O2 /OH− couples. The flat-band potentials
can be obtained by capacitance measurements as a func- This work was supported by the Assistant Secretary for Energy Research,
Office of Basic Energy Sciences, Materials Sciences Division of the U.S.
tion of external potential.
Department of Energy under Contract No. DE-AC03-76SF00098.
There is, of course, considerable band bending of the
conduction and valence bands of any semiconductor at the
surface. This is due to the presence of localized electronic
surface states and to charge transfer between the adsor-
SEE ALSO THE FOLLOWING ARTICLES
bates and semiconductor. Potential-energy diagrams that
ADHESION AND ADHESIVES • ADSORPTION • AUGER
show the band positions schematically at an n-type or p-
ELECTRON SPECTROSCOPY • BONDING AND STRUCTURE
type semiconductor liquid interface are shown in Fig. 48.
IN SOLIDS • CATALYSIS, INDUSTRIAL • CATALYST
The band bending provides an efficient means of sepa-
CHARACTERIZATION • CHEMICAL THERMODYNAMICS •
rating electron–hole pairs, since the potential gradient as
CRYSTALLOGRAPHY • PHOTOCHEMISTRY, MOLECULAR •
shown for the n-type semiconductor drives the electrons
PHOTOELECTRON SPECTROSCOPY • SOLID-STATE ELEC-
away from the semiconductor surface while it attracts the
TROCHEMISTRY • TRIBOLOGY
holes in the valence band toward the semiconductor elec-
trolyte interface. As a result, the oxidation reaction takes
place at the oxide anode while the reduction reaction takes
place at the cathode to which the photoelectron migrates BIBLIOGRAPHY
along the external circuit. The magnitude of the band bend-
ing at the surface depends primarily on the carrier concen- Adamson, A. W. (1982). “Physical Chemistry of Surfaces,” 4th ed.,
Wiley, New York.
tration in the semiconductor and on the electron-donating Anderson, J. R., and Boudart, M. (1981). “Catalysis Science and Tech-
or -accepting abilities of the adsorbates at the surface. nology,” Vols. 1–7, Springer-Verlag, Berlin/New York.
Semiconductors that are not likely to carry out the pho- Ertl, G., and Gomer, R., eds. (1983). “Springer Series in Surface Sci-
todissociation of water, according to the location of their ences,” Vols. 1–4, Springer-Verlag, Berlin/New York.
Ertl, G., and Kuppers, J. (1979). “Low Energy Electrons and Surface
Chemistry,” Verlag Chemie, Weinheim.
Feuerbacher, B., Fitton, B., and Willis, R. F. (1979). “Photoemission and
the Electronic Properties of Surfaces,” Wiley, New York.
King, D. A., and Woodruff, W. P., eds. (1983). “The Chemical Physics
of Solid Surfaces and Heterogeneous Catalysis,” Vols. 1–4, Elsevier,
New York.
Morrison, S. R. (1977). “The Chemical Physics of Surfaces,” Plenum,
New York.
Roberts, M. W., and McKee, C. S. (1978). “Chemistry of the Metal-Gas
Interface,” Oxford Univ. Press, London.
Somorjai, G. A. (1981). “Chemistry in Two Dimensions: Surfaces,”
Cornell Univ. Press, Ithaca, NY.
Tompkins, F. C. (1978). “Chemisorption of Gases on Metals,” Academic
Press, NY.
FIGURE 48 Band bending at the n-type and p-type semiconduc- Vanselow, R., and Howe, R., eds. (1979). “Chemistry and Physics of
tor interfaces. Solid Surfaces,” Vols. 1–6, Springer-Verlag, Berlin/New York.
P1: GLM/MBQ P2: GLM Final Pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN008A-376 June 29, 2001 17:8
GLOSSARY tude depends on the orbit diameter and the mass and
velocity of the particle.
Charge transfer transition Change of the electronic Spin crossover complex Complex undergoes a spin tran-
state of a complex ion by the transfer of an electron sition induced by certain external factor such as tem-
from mainly the central ion state to mainly the ligand perature, pressure, light, etc.
system state or vice versa. Tanabe-Sugano diagram Term splitting as a function of
Coordination number Number of ligands bound at the ligand field strength for 3d-transition metal ions in oc-
central ion. tahedral field, originated by Tanabe and Sugano.
Electron spin Intrinsic angular momentum vector of the Term Entity of states of equal energy.
electron. It is a quantum phenomenon, which has no Transition metals Elements with incompletely filled d
analog in classical mechanics. orbitals: scandium, titanium, through copper (3d se-
Jahn-Teller effect For a nonlinear molecule in an elec- ries); yttrium, zirconium through silver (4d series); lan-
tronically degenerate ground state, distortion must oc- thanum, hafnium through gold (5d series).
cur to lower the symmetry and to lower the energy to a
more stable nondegenerate ground state.
Ligand Atomic ion, molecular ion, or molecule coordi- THE LIGAND FIELD CONCEPT is the basis of a quan-
nated at the central ion of a complex. tum theoretical model developed in the 1950s for describ-
Magnetic moment A property associated with a mag- ing the electron systems of transition metal complexes. A
netic domain. It is an experimental measure of the mag- transition metal complex is composed of a transition metal
netism of a compound, generally measured in units of ion (central ion) surrounded by a system of ligands (atomic
magnetons. ions, molecular ions, or molecules). The ligands produce
Orbital angular momentum Mechanical vector quan- the electrical field (the ligand field) acting on the electron
tity perpendicular to the orbit of a particle. Its magni- system of the central ion. As the ligand field theory shows,
523
P1: GLM/MBQ P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008A-376 June 29, 2001 17:8
tron pairs. Neither the ionic model nor the covalent model
provided a satisfactory interpretation of several optical
properties, since these models were concerned primarily
with the electronic ground state of the complex ions.
For a large number of complexes certain spectral re-
gions of the absorption spectra can be assigned alterna-
tively to the different components of the complex (central
ion, ligand system). On this account a model that starts
with the assumption of separated electron systems for the
central ion and for the ligands will be appropriate for a
theoretical treatment.
If one is interested primarily in the electronic states
of the central ion bound in the complex, one has to con-
sider the electric field generated by the ligand system, so-
FIGURE 3 L-edge absorption spectra of Fe2+ in Fe(phen)2
called the ligand field. The effect of this ligand field on the
(NCS)2 at 298 K (solid line) and at 15 K (dotted line). [From Lee,
J. J., Sheu, H. S., Lee, C. R., Chen, J. M., Liu, R. G., Lee, J. F., electronic states of the central ion is then taken into con-
Wang, C. C., Huang, C. H., and Wang, Y. (2000). J. Am. Chem. sideration. The theory treating this concept of the ligand
Soc. 122, 5742.] field is the ligand field theory, where the electronic struc-
ture of the ligand system is taken explicitly into account.
In other words, the complete molecular orbital treatment
In addition to the absorption spectra at UV-VIS range,
of the complex is undertaken. To a first approximation
the absorption in the much higher energy range has also
the charge distribution of the ligands is represented by
been observed in recent years. Here the electron transition
point charges and/or point dipoles in their centers, the lig-
is between the inner core orbitals and the valence orbitals
and field treatment can be reduced to an atomic orbital
of the central ions. An example of Fe L-edge absorption is
treatment of the central ion, this extended ionic model is
given in Fig. 3 to display the electron transition between
designated as crystal field theory.
2 p and 3d orbitals. The magnetic properties of the transi-
The starting point of the crystal field theory is the
tion metal complexes are known to exhibit quite a variety
description of the electronic states of the free, isolated
even with the same metal ion, for example, diamagnetic,
transition metal ion (the central ion) in a complex. Infor-
and paramagnetic are found in Fe2+ , Co3+ complexes of
mation on the electronic ground state and the excited states
various ligands (low-spin, high-spin complexes). The un-
even distribution of electron density around the metal ion
in a complex is demonstrated by the deformation density
at the metal center shown in Fig. 4, where the spherical
electron density is subtracted from the observed molecu-
lar electron density. Since the spherical density means that
even populations are among five degenerated d-orbitals,
the deformation density may give the direct observation
on the difference among d-orbital population. All these
recent experimental findings can be rationalized by the
ligand field concept.
where lj and sj are the orbital and the spin angular mo-
mentum of the jth electron, respectively. As shown by
quantum mechanics the absolute values of the vectors L
and S are restricted to certain discrete amounts FIGURE 5 Spatial orientations of a spin vector S with S = 32 .
|L| = h L(L + 1), L = 0, 1, 2, 3, . . . There are 2S + 12 = 4 different allowed values of projection on the
z axis.
|S| = h S(S + 1),
ration with the two electron spins antiparallel, ↑↓) and
0, 1, 2, 3, . . . for even number of electrons 3 (triplet, ↑↑). There are, in total, two triplet terms and
S=
1/2, 3/2, 5/2, 7/2, . . . for odd number of electrons three singlet terms: 3 P, 3 F, 1 S, 1 D, and 1 G. The 3 F [com-
where h is h/2π (h is Planck’s constant); L and S is the posed of (2 × 3 + 1)(2 × 1 + 1) = 21 states] is the ground-
quantum number of the total orbital angular momentum state term. The five Russell-Saunders term of a d 2 ion are
and the total spin of the system of electrons, respectively. shown in the energy-level diagram of Fig. 6. The com-
For the numerical values L = 0, 1, 2, 3, . . . it is conven- plete sets of Russell-Saunders terms for the d N ions with
tional to use the letters S, P, D, F, . . . By quantum mechani- N = 1, 2, . . . , 9 are given in Table II.
cal rules spin S with quantum number S can take 2S + 1 to
different special orientations with respect to a given direc- TABLE II Russell-Saunders Terms for Free dN Ionsa
tion shown in Fig. 5. The 2S + 1 is denoted as multiplicity
Occupation
M. Correspondingly, an orbital angular momentum L with of the d shell Russell-saunders term
quantum number L can assume 2L + 1 different spatial
orientations. Therefore, a given set of quantum numbers d1, d9 2D
FIGURE 6 Russell-Saunders terms of a d 2 free ion with and without spin-orbital coupling.
If the coupling between the total orbital angular mo- B. Crystal-Field Theory
mentum, L, and the total spin angular momentum, S, has
In the course of forming a complex, the central ion is bound
to be taken into account, the total angular momentum J
to be affected by the ligand field (electrical field of the lig-
should be used. Where J is the vector sum of L and S.
ands), and thereby the motion modes of the electrons of
The quantum number of the vector J is again restricted to
the central ion will be perturbed. Accordingly, the term
certain discrete amounts.
system of the central ion will be changed. Some terms of
the free ion are energetically merely shifted, while others
J = L + S, J = |L + S|, |L + S − 1|, . . . . . . . |L − S|
are split into progeny terms 2S+1 i , i = 1, 2, . . . with dif-
|J| = h J (J + 1) ferent energies (intracomplex Stark effect). The symbol
i describes the orbital state of the ith progeny term.
The number of the progeny terms 2S+1 i can be exactly
The energy term is symbolized as 2S+1 L J , for example, the determined by the methods of group theory. It depends on
ground state of d 2 (3 F) is split into 3 F4 , 3 F3 , and 3 F2 , where the L value of the (parent) term 2S+1 L and on the symmetry
the order of energy is such that 3 F4 > 3 F3 > 3 F2 as shown of the ligand system: Number of progeny terms 2S+1 i = f
in Fig. 6. Each term is in 2J + 1 degeneracy and can be (L, symmetry)
separated by applying magnetic field, i.e., the Zeemann
effect. However, the splitting due to the L − S coupling is 1. Term Splitting Under Oh Symmetry
much smaller than the splitting due to the electron-electron
repulsion designated the energy difference between the The general group theoretical result can be illustrated by
Russell-Saunders terms. a simple model system. First we consider a system of six
P1: GLM/MBQ P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008A-376 June 29, 2001 17:8
TABLE III Splitting of Orbital States with L = 0, 1, 2, 3, 4 in an Octahedral (Oh ) and a Square-Planar (D4h ) Ligand
Field
Orbital state of the free ion
Quantum State Orbital states in an
number L symbola Oh ligand field Orbital states in a D4h ligand field
(dx y ) terms of Ni2+ in roughly D4h symmetry as shown as parameters in the calculation and are fit to experimental
in Fig. 4. As for the splitting of the other terms, they are data. Normally a value in Dq is represented.
listed in Table III. It is worth noticing that the lower the Since both the electron-electron interaction and the in-
symmetry is, the more splitting of the terms occurs. In fluence of the ligand field have to be treated, there are two
D4h , the highest degeneracy is E, whereas in Oh , it is T. methods for finding the term system of a transition metal
complex when the central ion contains two or more d elec-
3. Weak- and Strong-Field Methods, trons. Both methods start with the free d N ion of which
Term Diagrams the electron-electron repulsion is not yet taken into ac-
The amount of the energetic splitting or shifting can be (ap- count. They differ only in the order of which one is treated
proximately) determined by the methods of quantum me- first. According to the expected relative amounts of these
chanical perturbation theory. Since the perturbation comes energetic quantities, the weak-field method is employed
from the electrical interaction between the electrons of when the effect of electron-electron interaction dominates,
the central ion and the charge distribution within the lig- whereas the strong-field method is appropriate when the
and system, the magnitude of energetic splitting or shift- influence of the ligand field is dominant. Complete treat-
ing will depend on the central ion-ligand distance R and ments of a complex ion by both methods will ultimately
on the charges q and electrical dipole moments µ of the yield the same results. The weak-field method is described
ligands: by the following scheme:
Magnitude of energetic splitting and/or shifting d N −−−−−−→ 2S+1 L −−−−−→ 2S+1
electr on−electr on ligand
= F(R, q, µ) interaction f ield
f g (cm−1 )a (cm−1 )b
10 Dq ≈ f (ligand) × g(central ion) Field Theory,” Chap. 26, Elsevier, New York.
b Experimentaln data from Lever, A. B. P. (1986).
Where the factor f describes the field strength of ligand “Inorganic Electronic Spectroscopy,” 2nd ed., Chaps. 6
relative to water, which is assigned to 1.0. The g fac- and 9, Elservier, New York.
P1: GLM/MBQ P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008A-376 June 29, 2001 17:8
FIGURE 13 Combinations of a central ion d : (a) dx2 −y2 (b) dxy or-
C. Ligand-Field theory bital, and (c) the σ group orbitals of the ligand σx + σ−x − σ y − σ−y .
FIGURE 14 Two atomic orbitals form a bonding and an antibond- FIGURE 16 Comparison of the ligand field strength, value of
ing state: (a) with small difference in energy of A and B and (b) ligands with σ -donor, π -donor, and π-acceptor character.
with large difference in energy of A and B.
µef f values based on the spin-only model are in adequate V. STABILIZATION OF COMPLEXES
agreement with the corresponding experimental ones. As
mentioned above, the magnetic properties are directly In the ionic model, the formation of a complex ion from
related to the spin multiplicity of the ground state, or the the free central ion and the ligands is combined with a
number of unpaired electrons at the ground state. In the stabilization by the electrostatic binding energy Eclass . The
case of d 1 ∼ d 3 and d 8 ∼ d 10 of octahedral complexes, resulting state corresponds to the electronic ground state
the spin multiplicity of the ground state keeps the same of the complex. From the crystal-field theory, however,
(high-spin state) even at very strong ligand field. But in
the case of d 4 ∼ d 7 system in Oh , the spin multiplicity
of the ground state drops beyond certain Dq value (VT )
called a low-spin state, therefore an apparent change TABLE VI Ligand Field Stabilization Energy for Octahedral
of magnetic moment is observed. For example, the Transition Metal Complexesa
µef f of high-spin state of Fe2+ (d 6 ) complex is ∼ 5.0 High-spin Low-spin
BM, whereas that of low spin is ∼ 0 BM. For the spin configuration configuration
crossover system, two different spin multiplicities can be Number of Ground LFSE Ground LFSE
detected on the same complex at different temperatures. d electrons term (Dq) term (Dq)
For example, the magnetic moment of Fe(phen)2 (NCS)2 1 t12 [2 T2 ] −4
measured at various temperatures is displayed in Fig. 20. 2 t22 [3 T1 ] −8
The spin multiplicity below 176 K is a singlet, but the spin 3 t32 [4 A2 ] −12
multiplicity above 176 K becomes a quintet. The corre- 4 t32 e1 [5 E] −6 t42 [3 T1 ] −16
sponding effective magnetic moment is 0.5 and 4.9 BM, 5 t32 e2 [6 T1 ] 0 t52 [2 T2 ] −20
respectively. 6 t42 e2 [5 T2 ] −4 t62 [1 A1 ] −24
The deviations between the theoretical (spin only) and 7 t52 e2 [4 T1 ] −8 t62 e1 [2 E] −18
experimental values of the magnetic moment in Table V 8 t62 e2 [3 A2 ] −12
are mainly due to the fact that besides the spins the orbital
9 t62 e3 [2 E] −6
angular momentum can also contribute to the effective
magnetic moments. a Calculated in the strong-field approximation.
P1: GLM/MBQ P2: GLM Final Pages
Encyclopedia of Physical Science and Technology EN008A-376 June 29, 2001 17:8
B. Heats of Hydration
m+
By the dissolving of gaseous metal ions Mgas in water, the
m+
corresponding hexa-aquo ions [M(H2 O)6 ]aq are formed
and the heat of hydration, H H , is generated:
2O
m
Mgas −−−→[M(H2 O)6 ]aq
m+
+ HH
The variation of the experimental values of − H H with
the atomic number of the bivalent and trivalent metal ions
FIGURE 20 Effective magnetic moment, µeff , of Fe(phen)2 of the first transition series is plotted in Fig. 21. The solid
(NCS)2 as a function of temperature. [From Lee, J. J., Sheu, H. S.,
Lee, C. R., Chen, J. M., Liu, R. G., Lee, J. F., Wang, C. C., Huang,
C. H., and Wang, Y. (2000). J. Am. Chem. Soc. 122, 5742.]
The Hclass values can be obtained by subtraction of the SEE ALSO THE FOLLOWING ARTICLES
CFSEs from the experimental H H values. If the CFSEs
are calculated with Dq values obtained from absorption ELECTROMAGNETICS • ELECTRON TRANSFER REAC-
spectra, the resulting Hclass values are very close to the TIONS • GROUP THEORY • MICROWAVE MOLECULAR
dashed lines as shown in Fig. 21. SPECTROSCOPY
By corresponding considerations, apparent irregular-
ities in the lattice energies of metallic halogenides in
the heats of formation and heats of reactions and in the
kinetic stability of transition metal complexes can be BIBLIOGRAPHY
rationalized.
The structural stability of transition metal complexes is Burger, K. (1973). “Coordination Chemistry: Experimental Methods,”
Butterworth, London.
largely influenced by the Jahn-Teller effect. As Jahn and Cotton, F. A. (1990). “Chemical Applications of Group Theory,” 3rd ed.,
Teller have proved, the nuclear configuration of a nonlin- Wiley, New York.
ear polyatomic molecule is unstable when its electronic Douglas, B. E., McDaniel, D. H., and Alexander, J. J. (1994). “Concepts
ground state is orbitally degenerate. A molecule in such and Models of Inorganic Chemistry,” 3rd ed., Wiley, New York.
a situation will undergo a structural distortion such that Drago, R. S. (1992). “Physical Methods for Chemistry,” 2nd ed.,
Saunders College, Orlando.
the resulting ground state is orbitally nondegenerate. The Huheey, J. E., Keiter, E. A., and Keiter, R. L. (1993). “Inorganic Chem-
character of the distortion can be static, yielding a per- istry: Principles of Structure and Reactivity,” 4th ed., Harper Collins
manent deformation in geometry, or dynamic by oscillat- College, New York.
ing between several structures with nondegenerate ground Jørgensen, C. K. (1971). “Modern Aspects of Ligand Field Theory,”
states. North-Holland, Amsterdam.
Lever, A. B. P. (1984). “Inorganic Electronic Spectroscopy,” Elsevier,
Only the following octahedral transition metal com- Amsterdam.
plexes have an orbitally nondegenerate ground state (see Schläfer, H. L., and Gliemann, G. (1969). “Basic Principles of Ligand
Tables V and VI): Field Theory,” Wiley, New York.
P1: LEF/GLT P2: GTY Final pages Qu: 00, 00, 00, 00
Encyclopedia of Physical Science and Technology EN017J-807 August 2, 2001 17:36
I. Introduction
II. Bonding in H+2
III. Bonding in H2
IV. Bonding of Hydrogen to Carbon
. 411
P1: LEF/GLT P2: GTY Final pages
Encyclopedia of Physical Science and Technology EN017J-807 August 2, 2001 17:36
for predicting excited states and is quite popular in spec- χl = N exp(−Ra ) (1)
troscopy. Also, it is the framework used in rigorous QM
and
calculations identified by names such as Hartree–Fock
(HF), self-consistent field (SCF) and density functional χr = N exp(−Rb ), (2)
theory (DFT). Thus, to describe N2 , we would take the
respectively, where χl and χr denote hydrogen 1s or-
five occupied atomic orbitals (denoted 1s, 2s, 2 px , 2 p y ,
bitals centered on the left and right protons, respectively,
and 2 pz ) of each N atom and form 10 molecular orbitals
and N is a normalization factor—a scalar factor included
(denoted σg 1s, σu 1s, etc.) and occupy the lowest 7 with
to guarantee that the wave function contains only one
two electrons each (one with up spin and one with down
electron.
spin).
For finite R, the exact wave functions no longer have
2. Valence bond (VB) theory. In VB theory one starts
the atomic form, but useful approximate wave functions
with the occupied atomic orbitals of the atoms and con-
are obtained by combining the atomic orbitals to form the
structs a many-electron wave function to describe bonding
wave function
directly in terms of these atomic orbitals. Although sim-
ilar to MO, the differences will become transparent (be- ϕ = Cl χl + Cr χr . (3)
low). VB theory is most useful for describing reactions
and bond dissociations because the many-electron states This simple wave function is often referred to as linear
of the atoms are built into VB. However, VB is computa- combination of atomic orbitals (LCAO). The optimum
tionally much more complicated than MO, and it is much linear combination of atomic orbital wave functions is the
less obvious how to describe excited states in terms of VB. symmetric combination
Important chemical concepts such as resonance are based ϕ g = χl + χ r , (4)
on VB concepts.
where for simplicity we now ignore the normalization fac-
In this article, we will illustrate the use of VB and MO tor. The other combination of the orbitals is the antisym-
concepts for the simplest molecules and then apply these metric combination,
ideas to two simple problems that detail how to use them. ϕu = χl − χr , (5)
where again we ignore normalization.
II. BONDING IN H+ The g and u labels denote the symmetry of the wave
2 functions with respect to inversion, and stand respectively
Consider first the smallest possible molecule, H+ for the German words gerade (even) and ungerade (un-
2 , consist-
ing of one electron and two protons that are separated by even). Although symmetry arguments can be very power-
a distance R. This system is sketched in Fig. 1, where the ful, we eschew them here, and the reader may assume that
two protons are denoted as a and b. The most important these labels represent good and ungood, respectively.
question is whether this system forms a bond (i.e., is the The energies for the wave functions ϕg and ϕu in Eqs. (4)
lowest energy for a finite value of R). and (5) are shown as a function of R in Fig. 2. Here
we see that the g state is strongly bonding (the energy
left and electron 2 on the right. The wave function for this
configuration is
a (r1 , r2 ) = χl (r1 )χr (r2 ). (9)
This wave function a says that the probability of elec-
tron 1 being at a particular position is independent of where
electron 2 is, and vice versa. Since the atoms are infinitely
far apart, the electrons are not influenced by each other.
A second wave function that is just as good, or as bad,
as Eq. (9), is
b (r1 , r2 ) = χr (r1 )χl (r2 ), (10)
where the electrons have been interchanged. This wave
function b is different from a since electron 1 is now
FIGURE 6 Overlap of 1s orbitals at (a) bonding and (b) too-close on the opposite side of the universe. However, the energies
distances, and overlap of 2 p orbitals at (c) bonding and (d) too- of b and a must be the same since electrons 1 and 2 have
close distances. Note that in both of the too-close examples, the
contragradient region extends over a smaller region of space than
the same properties (they are indistinguishable particles).
in the bonding examples, leading to less stabilization. It will be useful to combine a and b into two new
wave functions,
We saw in Section II.B that kinetic energy dominates = χl (1)χr (2) + χr (1)χl (2), (11)
bonding in H+2 . An analysis of kinetic energy allows us to VB
u (1, 2) = a (1, 2) − b (1, 2)
make some general predictions about the shape of wave
functions. = χl (1)χr (2) − χr (1)χl (2), (12)
Consider a one-dimensional potential V (x) and con-
without normalization, because at finite R these are the
sider the set of all eigenfunctions (all solutions of the
optimum wave functions. Before examining the energies,
Schrödinger Equation)
we need to understand how to think about the relative
Hφj = Eφj . (8) locations of the electrons in these wave functions.
In Fig. 7, we plot the four wave functions a , b , VBg ,
We can show quite generally that, except for very diaboli-
and VB u . We see that VB
u has a nodal plane corresponding
cal V (x), the ground state is nodeless. It is always positive
to z 1 = z 2 , whereas g does not. Indeed, along the line be-
except at the boundaries, where it goes to zero. A simple
tween the two peaks in Fig. 7c, we notice that the gradient
example is seen with H+ 2 , where we found two orbitals g of the VBg wave function is smaller than that of a or b ,
and u. The nodal theorem indicates that the lowest state is
g-like, i.e., nodeless. We will use this below to work out whereas the gradient of the VB u wave function is larger.
the bonding in H2 . This decrease in the gradient of VB g and increase for u
VB
ing, and indeed this is the case: Figure 8 shows the energies
III. BONDING IN H2
E g and E u for the valence bond wave functions of H2 .
With this understanding of the fundamental of chemical
bonding in H+
2 , we move our attention to H2 . B. The Molecular Orbital Description of H2
The simplest wave function for H2 is found by starting
A. The Valence Bond Description of H2 with an electron in the best molecular orbital of H+ 2 and
In the VB description of a molecule, we start with the full to place a second electron in this ϕg orbital. This leads to
wave function for each atom at R = ∞ and combine the the molecular orbital wave function for H2 ,
wave functions to form the wave function of the molecule, MO
gg (r1 , r2 ) = ϕg (r1 )ϕg (r2 ). (13)
and then bring the atoms together to obtain optimal bond-
ing. For H2 at R = ∞, this configuration consists of two The first step is to examine the meaning of the wave
hydrogen atoms infinitely far apart, say electron 1 on the function (13). The total probability for electron 1 to be
P1: LEF/GLT P2: GTY Final pages
Encyclopedia of Physical Science and Technology EN017J-807 August 2, 2001 17:36
FIGURE 8 Energies of the g and u states of the VB wave FIGURE 9 Simple energy diagram for molecular orbital wave
function. functions of H2 .
P1: LEF/GLT P2: GTY Final pages
Encyclopedia of Physical Science and Technology EN017J-807 August 2, 2001 17:36
BIBLIOGRAPHY
FIGURE 20 Energy profile for the singlet N state and the triplet T Cooper, D. L., Gerratt, J., and Raimondi, M. (1987). Modern valence
state of ethylene as the molecule is twisted about the C C bond. bond theory, Adv. Chem. Phys. 67, 319.
Coulson, C. A. (1961). “Valence,” Oxford Univ. Press, London and
New York.
As the molecules are rotated, the strength of the pi bond Klein, D. J., and Trinajstić, N., eds. (1990). “Valence Bond Theory and
is weakened, and the CC bond length increases. Thus, for Chemical Structure,” Elsevier, Amsterdam.
the N state, RCC = 1.34 Å at θ = 0◦ and RCC = 1.50 Å at Matsen, F. A., and Pauncz, R. (1986). “The Unitary Group in Quantum
Chemistry,” Elsevier, Amsterdam.
θ = 90◦ . McWeeny, R., and Sutcliffe, B. T. (1969). “Methods of Molecular Quan-
In the T state, the optimum geometry is twisted, tum Mechanics,” Academic Press, New York.
θ = 90◦ , since triplet pairing of the orbitals prefers that Pauling, L. (1969). “The Nature of the Chemical Bond,” Cornell Univ.
the pi orbitals not overlap. As the molecule is twisted to- Press, Ithaca, New York.
ward the planar geometry, the CC bond length increases Pauncz, R. (1979). “Spin Eigenfunctions,” Plenum, New York.
Schaefer, H. F., III, ed. (1977). “Methods of Electronic Structure Theory,”
from 1.50 Å at θ = 90◦ to 1.57 Å at θ = 0◦ , and the Plenum, New York.
energy barrier is 25 kcal. Note that Fig. 20 shows the Slater, J. C. (1963). “Quantum Theory of Molecules and Solids,” Vol. 1.
T -to-N energy separation at θ = 0, using the ground-state McGraw-Hill, New York.