Formation of Galaxy Clusters - Annurev-Astro-081811-125502
Formation of Galaxy Clusters - Annurev-Astro-081811-125502
ANNUAL
REVIEWS Further Formation of Galaxy Clusters
Click here for quick links to
Annual Reviews content online,
including: Andrey V. Kravtsov1 and Stefano Borgani2
• Other articles in this volume 1
• Top cited articles Department of Astronomy & Astrophysics, Kavli Institute for Cosmological Physics,
• Top downloaded articles The University of Chicago, Chicago, Illinois 60637; email: [email protected]
• Our comprehensive search 2
Dipartimento di Fisica dell’Università di Trieste, Sezione di Astronomia, I-34131 Trieste,
Italy; INAF Osservatorio Astronomico di Trieste, Italy; and INFN Istituto Nazionale di Fisica
Nucleare, Trieste, Italy; email: [email protected]
353
AA50CH09-Kravtsov ARI 16 July 2012 12:19
1. INTRODUCTION
The tendency of nebulae to cluster was discovered by Charles Messier and William Herschel, who
constructed the first systematic catalogs of these objects. This tendency has become more apparent
as larger and larger samples of galaxies were compiled in the nineteenth and early twentieth
centuries. Studies of the most prominent concentrations of nebulae, the clusters of galaxies, were
revolutionized in the 1920s by Edwin Hubble’s proof that spiral and elliptical nebulae were bona
fide galaxies like the Milky Way located at large distances from us (Hubble 1925, 1926), which
implied that clusters of galaxies are systems of enormous size. Just a few years later, measurements
of galaxy velocities in regions of clusters made by Hubble & Humason (1931) and assumption
of the virial equilibrium of galaxy motions were used to show that the total gravitating cluster
masses for the Coma (Zwicky 1933, and see also Zwicky 1937) and Virgo clusters (Smith 1936)
were enormous as well.
The masses implied by the measured velocity dispersions were found to exceed the combined
mass of all the stars in clusters galaxies by factors of ∼200–400, which prompted Zwicky to pos-
tulate the existence of large amounts of dark matter (DM), inventing this widely used term in the
process. Although the evidence for dark matter in clusters was disputed in the subsequent decades,
as it was realized that stellar masses of galaxies were underestimated in the early studies, dark
matter was ultimately confirmed by the discovery of extended hot intracluster medium (ICM)
emitting at X-ray energies by thermal bremsstrahlung that was found to be smoothly filling in-
tergalactic space within the Coma cluster (Cavaliere, Gursky & Tucker 1971; Gursky et al. 1971;
Meekins et al. 1971; Forman et al. 1972; Kellogg et al. 1972). The X-ray emission of the ICM
has not only provided a part of the missing mass (as was conjectured on theoretical grounds by
Limber 1959 and van Albada 1960), but also allows the detection of clusters out to z > 1 (Rosati,
Borgani & Norman 2002). Furthermore, measurement of the ICM temperature has provided an
independent confirmation that the depth of gravitational potential of clusters requires additional
dark component. It was also quickly realized that inverse Compton scattering of the cosmic mi-
crowave background (CMB) photons off thermal electrons of the hot intergalactic plasma should
lead to distortions in the CMB spectrum, equivalent to black body temperature variations of about
10−4 –10−5 [the Sunyaev-Zel’dovich (SZ) effect; Sunyaev & Zel’dovich 1970, 1972b, 1980]. This
effect has now been measured in hundreds of clusters (e.g., Carlstrom, Holder & Reese 2002).
Given such remarkable properties, it is no surprise that the quest to understand the formation
and evolution of galaxy clusters has become one of the central efforts in modern astrophysics
over the past several decades. Early pioneering models of collapse of initial density fluctuations
in the expanding Universe have shown that systems resembling the Coma cluster can indeed
form (van Albada 1960, 1961; Peebles 1970; White 1976). Gott & Gunn (1971; see also Sunyaev
& Zel’dovich 1972a) showed that hot gas observed in the Coma via X-ray observations can be
explained within such a collapse scenario by heating of the infalling gas by the strong accretion
shocks. Subsequently, emergence of the hierarchical model of structure formation (Press &
Schechter 1974, Gott & Rees 1975, White & Rees 1978), combined with the cold dark matter
(CDM) cosmological scenario (Bond, Szalay & Turner 1982; Blumenthal et al. 1984), provided
a powerful framework for interpretation of the multiwavelength cluster observations. At the
same time, rapid advances in computing power and new, efficient numerical algorithms have
allowed fully three-dimensional ab initio numerical calculations of cluster formation within a
self-consistent cosmological context in both dissipationless regime (Klypin & Shandarin 1983,
Efstathiou et al. 1985) and including dissipational baryonic component (Evrard 1988, 1990).
In the past two decades, theoretical studies of cluster formation have blossomed into a vibrant
and mature scientific field. As we detail in the subsequent sections, the standard scenario of cluster
formation has emerged and theoretical studies have identified the most important processes that
shape the observed properties of clusters and their evolution, which has enabled usage of clusters
as powerful cosmological probes (see, e.g., Allen, Evrard & Mantz 2011 for a recent review). At the
same time, observations of clusters at different redshifts have highlighted several key discrepancies
between models and observations, which are particularly salient in the central regions (cores) of
clusters.
In the current paradigm of structure formation, clusters are thought to form via a hierarchical
sequence of mergers and accretion of smaller systems driven by gravity and DM that dominates
the gravitational field. Theoretical models of clusters employ a variety of techniques determined
by a particular aspect of cluster formation they aim to understand. Many of the bulk properties
of clusters are thought to be determined solely by the initial conditions, dissipationless DM that
dominates cluster mass budget, and gravity. Thus, cluster formation is often approximated in
models as DM-driven dissipationless collapse from cosmological initial conditions in an expanding
Universe. Such models are quite successful in predicting the existence and functional form of
correlations between cluster properties, as well as their abundance and clustering, as we discuss
in detail in Section 3. One of the most remarkable models of this kind is a simple self-similar
model of clusters (Kaiser 1986; see Section 3.9 below). Despite its simplicity, the predictions of
this model are quite close to results of observations and have, in fact, been quite useful in providing
baseline expectations for evolution of cluster scaling relations. Studies of abundance and spatial
distribution of clusters using dissipationless cosmological simulations show that these statistics
retain remarkable memory of the initial conditions.
The full description of cluster formation requires detailed modeling of the nonlinear processes
of collapse and the dissipative physics of baryons. The gas is heated to high, X-ray emitting temper-
atures by adiabatic compression and shocks during collapse and settles in hydrostatic equilibrium
(HE) within the cluster potential well. Once the gas is sufficiently dense, it cools, the process that
can feed both star formation and accretion onto supermassive black holes (SMBHs) harbored by
the massive cluster galaxies. The process of cooling and formation of stars and SMBHs can then
result in energetic feedback due to supernovae (SNe) or active galactic nuclei (AGN), which can
inject substantial amounts of heat into the ICM and spread heavy elements throughout the cluster
volume.
Galaxy clusters are therefore veritable crossroads of astrophysics and cosmology: While abun-
dance and spatial distribution of clusters bear indelible imprints of the background cosmology,
gravity law, and initial conditions, the nearly closed-box nature of deep cluster potentials makes
them ideal laboratories to study processes operating during galaxy formation and their effects on
the surrounding intergalactic medium.
In this review we discuss the main developments and results in the quest to understand the
formation and evolution of galaxy clusters. Given the limited space available for this review and
the vast amount of literature and research directions related to galaxy clusters, we have no choice
but to limit the focus of our review, as well as the number of cited studies. Specifically, we
focus on the most basic and well-established elements of the standard paradigm of DM-driven
hierarchical structure formation within the framework of CDM cosmology as it pertains to galaxy
clusters. We focus mainly on the theoretical predictions of the properties of the total cluster mass
distribution and properties of the hot intracluster gas, and only briefly discuss results pertaining
to the evolution of stellar components of clusters, understanding of which is still very much a
work in progress. Comparing model predictions to real clusters, we mostly focus on comparisons
with X-ray observations, which have provided the bulk of our knowledge of ICM properties so
far. In Section 5, we briefly discuss the differences in formation of clusters in models with the
non-Gaussian initial conditions and modified gravity. Specifically, we focus on the information
that statistics sensitive to the cluster formation process, such as cluster abundance and clustering,
can provide about the primordial non-Gaussianity and possible deviations of gravity from General
Relativity. We refer readers to recent extensive reviews on cosmological uses of galaxy clusters by
Allen, Evrard & Mantz (2011) and Weinberg et al. (2012) for a more extensive discussion of this
topic.
a b –2 –2
2
–58.72
8
2
Declination
–58.74
8
–2
–2
2
–58.76
8
2
30 arcsec
–58.78 –2
–2
316.58 316.56 316.54 316.52 316.50 316.48 316.46 316.44
Right ascension
Figure 1
(a) The composite X-ray/optical image (556 kpc on a side) of the galaxy cluster Abell 1689 at redshift z = 0.18. The purple haze shows
X-ray emission of the T ∼ 108 K gas, obtained by the Chandra X-ray Observatory. Images of galaxies in the optical band, colored in
yellow, are from observations performed with the Hubble Space Telescope. The long arcs in the optical image are caused by the
gravitational lensing of background galaxies by matter in the galaxy cluster, the largest system of such arcs ever found (Credit: X-ray:
NASA/CXC/MIT; Optical: NASA/STScI). (b) The galaxy cluster SPT-CL J2106-5844 at z = 1.133, the most massive cluster known
at z > 1 discovered via its Sunyaev-Zel’dovich (SZ) signal (M 200 ≈ 1.3 × 1015 M ). The color image shows the Magellan/LDSS3
optical and Spitzer/IRAC mid-IR measurements (corresponding to the blue-green-red color channels). The frame subtends 4.8 ×
4.8 arcmin, which corresponds to 2.4 × 2.4 Mpc at the redshift of the cluster. The solid yellow contours correspond to the South Pole
Telescope SZ significance values, as labeled, where dashed yellow contours are used for the negative significance values. Adapted from
Foley et al. 2011.
.2
0.1
5
z < 0.1
0.0
=0
0.1 < z < 0.6
00
g ,5
/M
00
*,5
M
10
M ,500 (1012 M☉)
*
1 10
Mg,500 (1013 M☉)
Figure 2
The mass in stars versus the mass of hot, X-ray emitting gas. Both masses are measured within the radius
R500 estimated from the observationally calibrated Y X − M 500 relation, assuming flat CDM cosmology
with m = 1 − = 0.26 and h = 0.71. Red circles show local clusters located at z < 0.1, whereas blue
squares show higher redshift clusters: 0.1 < z < 0.6 (see Lin et al. 2012 for details). The dotted line
corresponds to the constant stellar-to-gas mass ratio M ∗,500 /M g,500 = 0.1, whereas the dashed lines
correspond to the values of 0.05 and 0.2 for this ratio.
cluster-centric arcs (Bartelmann 2010, and references therein). At larger radii, the lensing effect is
weaker. Although not easily visible by eye, it can still be reliably measured by averaging the shapes
of many background galaxies and comparing the average with the expected value for an isotropic
distribution of shapes. The gravitational lensing is a direct probe of the total mass distribution in
clusters, which makes it both extremely powerful in its own right and a very useful check of other
methods of measuring cluster masses. The figure shows several bright elliptical galaxies that are
typically located near the cluster center. A salient feature of such central galaxies is that they show
little evidence of ongoing star formation, despite their extremely large masses.
The diffuse plasma is not associated with individual galaxies, and constitutes the intracluster
medium, which contains the bulk of the normal baryonic matter in massive clusters. Although
the hot ICM is not directly associated with galaxies, their properties are correlated. For example,
Figure 2 shows the mass of the ICM gas within the radius R500 , defined as the radius enclosing
mean overdensity of c = 500ρcr , versus stellar mass in galaxies within the same radius for a
number of local (z 0.1) and distant (0.1 < z < 0.6) clusters (Lin et al. 2012). Here, ρcr (z) =
3H (z)2 /(8π G) is the critical mean density of the Universe, defined in terms of the Hubble function
H(z). The figure shows a remarkably tight, albeit nonlinear, correlation between these two baryonic
components. It also shows that the gas mass in clusters is on average about ten times larger than
the mass in stars, although this ratio is systematically larger for smaller mass clusters, ranging from
M ∗ /M g ≈ 0.2 to ≈0.05, as mass increases from group scale (M 500 ∼ few × 1013 M ) to massive
clusters (M 500 ∼ 1015 M ).
The temperature of the ICM is consistent with velocities of galaxies and indicates that both
galaxies and gas are nearly in equilibrium within a common gravitational potential well. The mass
of galaxies and hot gas is not sufficient to explain the depth of the potential well, which implies that
most of the mass in clusters is in a form of DM. Given that hydrogen is by far the most abundant
element in the Universe, most of the plasma particles are electrons and protons, with a smaller
number of helium nuclei. There are also trace amounts of heavier nuclei, some of which are only
partially ionized. The typical average abundance of the heavier elements is about one-third of that
found in the Sun or a fraction of one percent by mass; it decreases with increasing radius and can
be quite inhomogeneous, especially in merging systems (see Werner et al. 2008 for a review).
Thermodynamic properties of the ICM are of utmost importance because comparing such
properties to predictions of baseline models without cooling and heating can help to isolate
the impact of these physical processes in cluster formation. The most popular baseline model
is the self-similar model of clusters developed by Kaiser (1986), which we consider in detail in
Section 3.9 below. In its simplest version, this model assumes that clusters are scaled versions of
each other, so that gas density at a given fraction of the characteristic radius of clusters, defined
by their mass, is independent of cluster mass. Figure 3 shows the electron density in clusters
as a function of ICM temperature (and hence mass) at different radii. It is clear that density
Chandra
XMM-Newton
0.15R 500
E –2 (z) ne (10 –4 cm –3)
10
R 2,500
1
R 500
1 10
kT500 (keV)
Figure 3
The observed electron number density, ne , in galaxy clusters and groups, measured at different radii (from
top to bottom: 0.15R500 , R2,500 , R500 ; see labels) as a function of the intracluster medium temperature at
R500 . The values of ne are rescaled by E −2 (z), the scaling expected from the definition of the radii at which
densities are measured. Squares and circles show systems observed with the Chandra X-Ray Observatory from
the studies by Vikhlinin et al. (2009a) and Sun et al. (2009); triangles show systems observed with the
XMM-Newton telescope by Pratt et al. (2010). Note that electron densities at large radii are independent of
temperature, as expected from the self-similar model, whereas at small radii the rescaled densities increase
with temperature. Note also that the scatter from cluster to cluster increases with decreasing radius,
especially for low-temperature groups. After Sun (2012).
1.2 XMM-Newton
Cosmological simulations
1.1
1.0
kT/kTM
0.9
0.8
0.7
0.6
Figure 4
Comparison between temperature profiles, normalized to the global temperature measured within R180 by
Leccardi & Molendi (2008), for a set of about 50 nearby clusters with z 0.3 and with temperature
k B T X > 3 keV, observed with the XMM-Newton (dots with error bars) and results from cosmological
hydrodynamical simulations including the effect of radiative cooling, star formation, and supernova feedback
in the form of galactic winds (solid curve; Borgani et al. 2004). From Leccardi & Molendi (2008).
is independent of temperature only outside the cluster core at r ∼ R500 , although there is an
indication that density is independent of temperature at r = R2,500 for k B T 3 keV clusters.
This indicates that processes associated with galaxy formation and feedback affect the properties
of clusters at r R2,500 , but their effects are mild at larger radii.
During the past two decades, it has been established that the core regions of the relaxed clusters
are generally characterized by a strongly peaked X-ray emissivity, indicating efficient cooling of
the gas (e.g., Fabian 1994). Quite interestingly, spectroscopic observations with the Chandra and
XMM-Newton satellites have demonstrated that, despite strong X-ray emission of the hot gas,
only a relatively modest amount of this gas cools down to low temperatures (e.g., Böhringer et al.
2001, Peterson et al. 2001). This result is generally consistent with the low levels of star formation
observed in the brightest cluster galaxies (BCGs; e.g., McDonald et al. 2011). It implies that a
heating mechanism should compensate for radiative losses, thereby preventing the gas in cluster
cores from cooling down to low temperature. The presence of cool cores is also reflected in the
observed temperature profiles (e.g., Vikhlinin et al. 2006, Pratt et al. 2007, Leccardi & Molendi
2008; see also Figure 4), which exhibit decline of temperature with decreasing radius in the
innermost regions of relaxed cool-core clusters.
One of the most important and most widely studied aspects of ICM properties are correla-
tions between its different observable integrated quantities and between observable quantities and
total mass. Such scaling relations are the key ingredient in cosmological uses of clusters, where
it is particularly desirable that the relations are characterized by small scatter and are indepen-
dent of the relaxation state and other properties of clusters. Although clusters are fascinatingly
complex systems overall, they do exhibit some remarkable regularities. As an example, Figure 5
10 2
a b
r < R500 0.15 < r/R500 < 1.0
σlnL = 0.38 σlnL = 0.16
Relaxed
Disturbed
10 –1
10 1 10 2 10 1 10 2
YX (10 13 M☉ keV)
Figure 5
Correlation of bolometric luminosity of intracluster gas and Y X ≡ M gas T X , where M gas is the mass of the
gas within R500 and T X is temperature derived from the fit to gas spectrum accounting only for emission
from radial range (0.15–1)R500 . Results are shown for the local clusters from the Representative XMM-
Newton Cluster Structure Survey sample of Pratt et al. (2009). (a) Total luminosity integrated within radius
R500 ; (b) bolometric luminosity calculated with the central 0.15R500 of the cluster excised. Labels in the top
left corner indicate the radial range used in computing the luminosity and logarithmic scatter of luminosity
at fixed Y X . The blue points show cool core clusters, whereas magenta points are non–cool core clusters.
Clusters classified as relaxed and disturbed are shown by circles and squares, respectively. Note that
exclusion of the cluster cores reduces the scatter between luminosity and Y X by more than a factor of two.
shows the correlation between the bolometric luminosity emitted from within R500 and the Y X
parameter defined as a product of gas mass within R500 and ICM temperature derived from the
X-ray spectrum within the radial range (0.15 − 1)R500 (Kravtsov, Vikhlinin & Nagai 2006) for the
Representative XMM-Newton Cluster Structure Survey (REXCESS) sample of clusters studied
by Pratt et al. (2009). Different symbols indicate clusters in different states of relaxation, whereas
clusters with strongly peaked central gas distribution (the cool core clusters) and clusters with
less centrally concentrated gas distribution are shown with different colors. Figure 5a shows total
luminosity integrated within radius R500 , whereas Figure 5b shows luminosity calculated with the
central region within 0.15R500 excised. Quite clearly, the core-excised X-ray luminosity exhibits
remarkably tight correlation with Y X , which, in turn, is expected to correlate tightly with total
cluster mass (Kravtsov, Vikhlinin & Nagai 2006; Stanek et al. 2010; Fabjan et al. 2011). This figure
illustrates the general findings in the past decade that clusters exhibit strong regularity and tight
correlations among X-ray observable quantities and total mass, provided that relevant quantities
are measured after excluding the emission from cluster cores.
contrast (or overdensity) field: δ(x) = (ρ(x) − ρ̄m )/ρ̄m , where ρ̄m is the mean mass density of the
Universe. Properties of the field δ(x) depend on specific details of the processes occurring during
the earliest inflationary stage of evolution of the Universe (Guth & Pi 1982; Starobinsky 1982;
Bardeen, Steinhardt & Turner 1983) and the subsequent stages prior to recombination (Peebles
1982, Bond & Efstathiou 1984, Bardeen et al. 1986, Eisenstein & Hu 1999). A fiducial assumption
of most models that we discuss is that δ(x) is a homogeneous and isotropic Gaussian random field.
We briefly discuss non-Gaussian models in Section 5.1.
Statistical properties of a uniform and isotropic Gaussian field can be fully characterized by
its power spectrum, P(k), which depends only on the modulus k of the wavevector, but not on its
direction. A related quantity is the variance of the density contrast field smoothed on some scale
R, where δ R (x) ≡ δ(x − r)W (r, R)d 3 r:
1
δ 2R ≡ σ 2 (R) = P(k)|W̃ (k, R)|2 d 3 k, (1)
(2π )3
where W̃ (k, R) is the Fourier transform of the window (filter) function W (r, R), such that δ R (k) =
δ(k)W̃ (k, R) [see, e.g., Zentner (2007) or Mo, van den Bosch & White (2010) for details on the
definition of P(k) and choices of window function]. For the case when one is interested in only a
narrow range of k, the power spectrum can be approximated by the power-law form, P(k) ∝ kn ,
and the variance is σ 2 (R) ∝ R−(n+3) .
At a sufficiently high redshift z, for the spherical top-hat window function, mass and radius are
interchangeable according to the relation M = 4π/3ρm (z)R3 . We can think about the density
field smoothed on the scale R or the corresponding mass scale M. The characteristic amplitude
of peaks in the δ R (or δ M ) field smoothed on scale R (or mass scale M) is given by σ (R) ≡ σ (M ).
The smoothed Gaussian density field is, of course, also Gaussian with the probability distribution
function (PDF) given by
1 δ2
p(δ M ) = √ exp − 2M . (2)
2πσ (M ) 2σ (M )
During the earliest linear stages of evolution in the standard structure formation scenario, the initial
Gaussianity of the δ(x) field is preserved, as different Fourier modes δ(k) evolve independently
and grow at the same rate, described by the linear growth factor, D+ (a), as a function of expansion
factor a = (1 + z)−1 , which for CDM cosmology is given by (Heath 1977)
a
5m da
δ(a) ∝ D+ (a) = E(a) 3
, (3)
2 0 [a E(a )]
Once the amplitude of typical fluctuations approaches unity, σ (M , a) ∼ 1, the linear approxi-
mation breaks down. Further evolution must be studied by means of nonlinear models or direct
numerical simulations. We discuss results of numerical simulations extensively below. However,
we consider first the simplified, but instructive, spherical collapse model and associated concepts
and terminology. Such a model can be used to gain physical insight into the key features of the
evolution and is used as a basis for both definitions of collapsed objects (see Section 3.6) and
quantitative models for halo abundance and clustering (Sections 3.7 and 3.8).
3.2.1. Spherical collapse model. The simplest spherical collapse model considers a spherically
symmetric density fluctuation of initial radius Ri , amplitude δi > 0, and mass M = (4π/3)(1 +
δi )ρ̄ Ri3 , where Ri is physical radius of the perturbation and ρ̄ is the mean density of the Universe
at the initial time. Given the symmetry, the collapse of such perturbation is a one-dimensional
problem and is fully specified by evolution of the top-hat radius R(t) (Gunn & Gott 1972, Lahav
et al. 1991). It consists of an initially decelerating increase of the perturbation radius—until it
reaches the maximum value, Rta , at the turnaround epoch, tta —and subsequent decrease of R(t)
at t > tta until the perturbation collapses, virializes, and settles at the final radius Rf at t = tcoll .
Physically, Rf is set by the virial relation between potential and kinetic energy and is Rf = Rta /2
in cosmologies with = 0. The turnaround epoch and the epoch of collapse and virialization
are defined by initial conditions.
The final mean internal density of a collapsed object can be estimated by noting that in an =
0 Universe,
the time interval tcoll − tta = tta should be equal to the free-fall time of a uniform sphere
tff = 3π/(32Gρta ), which means that the mean density of perturbation at turnaround is ρta =
3π/(32Gtta2 ) and ρcoll = 8ρta = 3π/(Gtcoll 2
). These densities can be compared with background
mean matter densities at the corresponding times to get mean internal density contrasts: = ρ/ρ̄m .
In the Einstein–de Sitter model (m = 1, = 0), background density evolves as ρ̄m = 1/(6π Gt 2 ),
which means that density contrast after virialization is
ρcoll
vir ≡ = 18π 2 = 177.653. (6)
ρ̄m
For general cosmologies, density contrast can be computed by estimating ρcoll and ρ̄m (tcoll ) in a
similar fashion. For lower m models, fluctuation of the same mass M and δ has a larger initial
radius and smaller physical density and, thus, takes longer to collapse. The density contrasts of
collapsed objects therefore are larger in lower density models because the mean density of matter
at the time of collapse is smaller. Accurate (to 1% for m = 0.1–1) approximations for vir in
open ( = 0) and flat CDM (1 − − m = 0) cosmologies are given by Bryan & Norman
(1998, their equation 6). For example, for the concordance CDM cosmology with m = 0.27
and = 0.73 (Komatsu et al. 2011), density contrast at z = 0 is vir ≈ 358.
Note that if the initial density contrast δi would grow only at the linear rate, D+ (z), then the
density contrast at the time of collapse would be more than a hundred times smaller. Its value can be
derived starting from the density contrast linearly extrapolated to the turnaround epoch, δta . This
epoch corresponds to the time at which perturbation enters in the nonlinear regime and detaches
from the Hubble expansion, so that δta ∼ 1 is expected. In fact, the exact calculation in the case of
m (z) = 1 at the redshift of turn-around gives δta = 1.062 (Gunn & Gott 1972). Because tcoll = 2tta ,
further linear evolution for m (z) = 1 until the collapse time gives δc = δta D+ (tc )/D+ (tta ) ≈ 1.686.
In the case of m = 1, we expect that δta should have different values. For instance, for m < 1,
density contrast at turn-around should be higher to account for the higher rate of the Hubble
expansion. However, linear growth from tta to tcoll is smaller due to the slower redshift dependence
of D+ (z). As a matter of fact, these two factors nearly cancel, so that δc has a weak dependence on
m and (e.g., Percival 2005). For the concordance CDM cosmology at z = 0, for example,
δc ≈ 1.675.
Additional interesting effects may arise in models with DE characterized by small or zero speed
of sound, in which structure growth is affected not only because DE influences linear growth, but
also because it participates nontrivially in the collapse of matter and may slow down or accelerate
the formation of clusters of a given mass depending on the DE equation of state (Abramo et al.
2007, Creminelli et al. 2010). DE in such models can also contribute nontrivially to the gravitating
mass of clusters.
3.2.2. The nonlinear mass scale MNL . The linear value of the collapse overdensity δc is useful
in predicting whether a given initial perturbation δi 1 at initial zi collapses by some later
redshift z. The collapse condition is simply δi D+0 (z) ≥ δc (z) and is used extensively to model the
abundance and clustering of collapsed objects, as we discuss below in Section 3.7. The distribution
of peak amplitudes in the initial Gaussian overdensity field smoothed over mass scale M is given
by a Gaussian PDF with an rms value of σ (M ) (Equation 2). The peaks in the initial Gaussian
overdensity field smoothed at redshift zi over mass scale M can be characterized by the ratio
ν = δi /σ (M , zi ) called the peak height. For a given mass scale M, the peaks collapsing at a given
redshift z according to the spherical collapse model have the peak height given by
δc (z)
ν≡ . (7)
σ (M , z)
Given that δc (z) is a very weak function of z (changing by 1−2% typically), whereas
σ (M , z) = σ (M , z = 0)D+0 (z) decreases strongly with increasing z, the peak height of collapsing
objects of a given mass M increases rapidly with increasing redshift.
Using Equation 7, we can define the characteristic mass scale for which a typical peak (ν = 1)
collapses at redshift z:
This nonlinear mass, M NL (z), is a key quantity in the self-similar models of structure formation,
which we consider in Section 3.9.
a z=3 b z=1
c z = 0.5 d z=0
Figure 6
Evolution of a dark matter density field in a comoving region of 15h−1 Mpc on a side around cluster mass
density peak in the initial perturbation field. The panels show redshifts (a) z = 3, (b) z = 1, (c) z = 0.5, and
(d ) z = 0. The forming cluster has a mass M 200 1.2 × 1015 h −1 M at z = 0. The figure illustrates the
complexities of the actual collapse of real density peaks: strong deviations from spherical symmetry,
accretion of matter along filaments, and the presence of smaller-scale structure within the collapsing
cluster-scale mass peak.
scope of this review, and we refer readers to recent reviews on this subject (Bertschinger 1998,
Dolag et al. 2008, Norman 2010, Borgani & Kravtsov 2011). Here, we simply discuss the main
features of gravitational collapse learned from analyses of such simulations.
Figure 6 shows evolution of the DM density field in a cosmological simulation of a comoving
region of 15h−1 Mpc on a side around cluster mass–scale density peak in the initial perturbation
field from z = 3 to the present epoch. The overall picture is quite different from the top-hat
collapse. First of all, real peaks in the primordial field do not have the constant density or sharp
boundary of the top-hat, but have a certain radial profile and curvature (Bardeen et al. 1986, Dalal
et al. 2008). As a result, different regions of a peak collapse at different times so that the overall
collapse is extended in time and the peak does not have a single collapse epoch (e.g., Diemand,
Kuhlen & Madau 2007). Consequently, the distribution of matter around the collapsed peak can
smoothly extend to several virial radii for late epochs and small masses (Prada et al. 2006, Cuesta
et al. 2008). This creates ambiguity about the definition of halo mass and results in a variety of
mass definitions adopted in practice, as we discuss in Section 3.6.
Second, the peaks in the smoothed density field, δR (x), are not isolated but are surrounded by
other peaks and density inhomogeneities. The tidal forces from the most massive and rarest peaks
in the initial density field shepherd the surrounding matter into massive filamentary structures
that connect them (Bond, Kofman & Pogosyan 1996). Accretion of matter onto clusters at late
epochs occurs preferentially along such filaments, as can be clearly seen in Figure 6.
Finally, the density distribution within the peaks in the actual density field is not smooth, as
in the smoothed field δR (x), but contains fluctuations on all scales. Collapse of density peaks on
different scales can proceed almost simultaneously, especially during early stages of evolution in the
CDM models when peaks undergoing collapse involve small scales, over which the power spectrum
has an effective slope n ≈ −3. Figure 6 shows that at high redshifts the protocluster region contains
mostly small-mass collapsed objects, which merge to form a larger and larger virialized system
near the center of the shown region at later epochs. Nonlinear interactions between smaller-
scale peaks within a cluster-scale peak during mergers result in relaxation processes and energy
exchange on different scales, and mass redistribution. Although the processes accompanying major
mergers are not as violent as envisioned in the violent relaxation scenario (Valluri et al. 2007), such
interactions lead to significant redistribution of mass (Kazantzidis, Zentner & Kravtsov 2006) and
angular momentum (Vitvitska et al. 2002), both within and outside of the virial radius.
3.4. Equilibrium
Following the collapse, matter settles into an equilibrium configuration. For collisional baryonic
component this configuration is approximately described by the HE equation, in which the pres-
sure gradient ∇ p(x) at point x is balanced by the gradient of local gravitational potential ∇φ(x):
∇φ(x) = −∇ p(x)/ρg (x), where ρg (x) is the gas density. Under the further assumption of spherical
symmetry, the HE equation can be written as ρg−1 d p/dr = −GM (< r)/r 2 , where M (< r) is the
mass contained within the radius r. Assuming the equation of state of ideal gas, p = ρg k B T /μm p ,
where μ is the mean molecular weight and m p is the proton mass, cluster mass within r can be
expressed in terms of the density and temperature profiles, ρg (r) and T(r), as
rk B T (r) d ln ρg (r) d ln T (r)
M HE (< r) = − + . (9)
Gμm p d ln r d ln r
Interestingly, the slopes of the gas density and temperature profiles that enter the above equation
exhibit correlation that appears to be a dynamical attractor during cluster formation ( Juncher,
Hansen & Macciò 2012).
For a collisionless system of particles, such as CDM, the condition of equilibrium is given
by the Jeans equation (e.g., Binney & Tremaine 2008). For a nonrotating spherically symmetric
system, this equation can be written as
rσ 2 d ln ν(r) d ln σr (r)2
MJ (< r) = − r + + 2β(r) , (10)
G d ln r d ln r
σ2
where β = 1 − 2σt 2 is the orbit anisotropy parameter defined in terms of the radial (σr ) and
r
tangential (σt ) velocity dispersion components (β = 0 for isotropic velocity field). We con-
sider equilibrium density and velocity dispersion profiles, as well as anisotropy profile β(r) in
Section 3.5.2. Equation 10 is also commonly used to describe the equilibrium of cluster galaxies.
Although, in principle, galaxies in groups and clusters are not strictly collisionless, interactions
between galaxies are relatively rare, and the Jeans equation should be quite accurate.
Note that the difference between equilibrium configuration of collisional ICM and collisionless
DM and galaxy systems is significant. In HE, the isodensity surfaces of the ICM should trace
the iso-potential surfaces. The shape of the isopotential surfaces in equilibrium is always more
spherical than the shape of the underlying mass distribution that gives rise to the potential. Given
that the potential is dominated by DM at most of the cluster-centric radii, the ICM distribution
(and consequently the X-ray isophotes and SZ maps) will be more spherical than the underlying
DM distribution.
As we noted in the previous section, the gravitational collapse of a halo is a process extended in
time. Consequently, a cluster may not reach complete equilibrium over the Hubble time due to
ongoing accretion of matter and the occurrence of minor and major mergers. The ICM reaches
equilibrium state following a major merger only after ≈3–4 Gyr (e.g., Nelson et al. 2012). Devi-
ations from equilibrium affect observable properties of clusters and cause systematic errors when
Equations 9 and 10 are used to estimate cluster masses (e.g., Rasia, Tormen & Moscardini 2004;
Nagai, Kravtsov & Vikhlinin 2007; Lau et al. 2009; Piffaretti & Valdarnini 2008; Ameglio et al.
2009).
3.5.1. Density profile. Internal structure of collapsed halos may be expected to depend both on
the properties of the initial density distribution around collapsing peaks (Hoffman & Shaham 1985)
and on the processes accompanying hierarchical collapse (e.g., Syer & White 1998, Valluri et al.
2007). The fact that simulations have demonstrated that the characteristic form of the spherically
averaged density profile arising in CDM models, characterized by the logarithmic slope steepening
with increasing radius (Dubinski & Carlberg 1991; Katz 1991; Navarro, Frenk & White 1995,
1996), is virtually independent of the shape of the power spectrum and background cosmology
(Katz 1991; Cole & Lacey 1996; Navarro, Frenk & White 1997; Huss, Jain & Steinmetz 1999b) is
nontrivial. Such a generic form of the profile also arises when small-scale structure is suppressed
and the collapse is smooth, as is the case for halos forming at the cut-off scale of the power
spectrum (Moore et al. 1999; Diemand, Moore & Stadel 2005; Wang & White 2009) or even
from noncosmological initial conditions (Huss, Jain & Steinmetz 1999a).
The density profiles measured in dissipationless simulations are most commonly approximated
by the “NFW” form proposed by Navarro, Frenk & White (1995) based on their simulation of
cluster formation:
4ρs
ρNFW (r) = , x ≡ r/rs , (11)
x(1 + x)2
where rs is the scale radius, at which the logarithmic slope of the profile is equal to −2, and
ρ s is the characteristic density at r = rs . Overall, the slope of this profile varies with radius as
d ln ρ/d ln r = −[1 + 2x/(1 + x)], i.e., from the asymptotic slope of −1 at x 1 to −3 at
x 1, where the enclosed mass diverges logarithmically: M (< r) = M f (x)/ f (c ), where M
is the mass enclosing a given overdensity , f (x) ≡ ln(1 + x) − x/(1 + x), and c ≡ R /rs is
the concentration parameter. Accurate formulae for the conversion of mass of the NFW halos
defined for different values of are given in the appendix of Hu & Kravtsov (2003).
Subsequent simulations (Navarro et al. 2004, Graham et al. 2006, Merritt et al. 2006) showed
that the Einasto (1965) profile and other similar models designed to describe deprojection of
the Sérsic profile (Merritt et al. 2006) provide a more accurate description of the DM density
profiles arising during cosmological halo collapse, as well as profiles of bulges and elliptical galaxies
(Cardone, Piedipalumbo & Tortora 2005). The Einasto profile is characterized by the logarithmic
slope that varies as a power law with radius:
2
ρE (r) = ρs exp (1 − x α ) , x ≡ r/rs , (12)
α
where rs is again the scale radius at which the logarithmic slope is −2; but now for the Einasto
profile, ρs ≡ ρE (rs ), and α is an additional parameter that describes the power-law dependence of
the logarithmic slope on radius: d ln ρE /d ln r = −2x α .
Note that unlike the NFW profile and several other profiles discussed in the literature, the
Einasto profile does not have an asymptotic slope at small radii. The slope of the density profile
becomes increasingly shallower at small radii at the rate controlled by α. The parameter α varies
with halo mass and redshift: at z = 0 galaxy-sized halos are described by α ≈ 0.16, whereas massive
cluster halos are described by α ≈ 0.2–0.3; these values increase by ∼0.1 by z ≈ 3 (Gao et al.
2008). Although α depends on mass and redshift (and thus also on the cosmology) in a nontrivial
way, Gao et al. (2008, and see also Duffy et al. 2008) showed that these dependencies can be
captured as a universal dependence on the peak height ν = δc /σ (M , z) (see Section 3.2.2 above):
α = 0.0095ν 2 + 0.155. Finally, unlike the NFW profile, the total mass for the Einasto profile is
finite due to the exponentially decreasing density at large radii. A number of useful expressions
for the Einasto profile, such as mass within a radius, are provided by Cardone, Piedipalumbo &
Tortora (2005), Mamon & Łokas (2005), and Graham et al. (2006).
The origin of the generic form of the density profile has recently been explored in detail by
Lithwick & Dalal (2011), who show that it arises due to two main factors: (a) the density and
triaxiality profile of the original peak and (b) approximately adiabatic contraction of the previously
collapsed matter due to deepening of the potential well during continuing collapse. Without
adiabatic contraction the profile resulting from the collapse would reflect the shape of the initial
profile of the peak. For example, if the initial profile of mean linear overdensity within radius r
−γ
around the peak can be described as δ̄L ∝ rL , it can be shown that the resulting differential density
profile after collapse without adiabatic contraction behaves as ρ(r) ∝ r −g , where g = 3γ /(1 + γ )
(Fillmore & Goldreich 1984). Typical profiles of initial density peaks are characterized by shallow
slopes, γ ∼ 0–0.3 at small radii, and very steep slopes at large radii (e.g., Dalal et al. 2008; Dalal,
Lithwick & Kuhlen 2010), which means that resulting profiles after collapse should have slopes
varying from g ≈ 0−0.7 at small radii to g ≈ 3 at large radii.
However, Lithwick & Dalal (2011) showed that contraction of particle orbits during subsequent
accretion of mass interior to a given radius r leads to a much more gradual change of logarithmic
slope with radius, such that the regime within which g ≈ 0−0.7 is shifted to very small radii
(r/rvir 10−5 ), whereas at the radii typically resolved in cosmological simulations the logarithmic
slope is in the range of g ≈ 1−3, so that the radial dependence of the logarithmic slope g(r) =
d ln ρ/d ln r is in good qualitative agreement with simulation results. This contraction occurs
because matter that is accreted by a halo at a given stage of its evolution can deposit matter
over a wide range of radii, including small radii. The orbits of particles that accreted previously
have to respond to the additional mass, and they do so by contracting. For example, for a purely
spherical system in which mass is added slowly so that the adiabatic invariant is conserved, radii r
of spherical shells must decrease to compensate an increase of M (< r). This model thus elegantly
explains both the qualitative shape of density profiles observed in cosmological simulations and
their universality. The latter can be expected because the contraction process crucial to shaping
the form of the profile should operate under general collapse conditions, in which different shells
of matter collapse at different times.
Although the model of Lithwick & Dalal (2011) provides a solid physical picture of halo profile
formation, it also neglects some of the processes that may affect details of the resulting density
profile, most notably the effects of mergers. Indeed, major mergers lead to resonant dynamical
heating of a certain fraction of collapsed matter due to the potential fluctuations and tidal forces
that they induce. The amount of mass that is affected by such heating is significant (e.g., Valluri
et al. 2007). In fact, up to ∼40% of mass within the virial radii of merging halos may end up
outside of the virial radius of the merger. This implies, for example, that virial mass is not additive
in major mergers. Nevertheless, in practice the merger remnant retains the functional form of the
density profiles of the merger progenitors (Kazantzidis, Zentner & Kravtsov 2006), which means
that major mergers do not lead to efficient violent relaxation.
Although the functional form of the density profile arising during halo collapse is generic for a
wide variety of collapse conditions and models, initial conditions and cosmology do significantly
affect the physical properties of halo profiles such as its characteristic density and scale radius
(Navarro, Frenk & White 1997). These dependencies are often discussed in terms of halo con-
centrations, c ≡ R /rs . Simulations show that the scale radius is approximately constant during
late stages of halo evolution (Bullock et al. 2001, Wechsler et al. 2002), but evolves as rs = c min R
during early stages when a halo quickly increases its mass through accretion and mergers (Zhao
et al. 2003, 2009). The minimum value of concentration is c min = const ≈ 3−4 for = 200. For
massive cluster halos, which are in the fast growth regime at any redshift, the concentrations are
thus expected to stay approximately constant with redshift or may even increase after reaching a
minimum (Klypin, Trujillo-Gomez & Primack 2011; Prada et al. 2012).
The characteristic time separating the two regimes can be identified as the formation epoch
of halos. This time approximately determines the value of the scale radius and the subsequent
evolution of halo concentration. The initial conditions and cosmology determine the formation
epoch and the typical mass accretion histories for halos of a given mass (Navarro, Frenk & White
1997; Bullock et al. 2001; Zhao et al. 2009), and therefore determine the halo concentrations.
Although these dependences are nontrivial functions of halo mass and redshift, they can also be
encapsulated by a universal function of the peak height ν (Zhao et al. 2009, Prada et al. 2012).
Baryon dissipation and feedback are expected to affect the density profiles of halos appreciably,
although predictions for these effects are far less certain than predictions of the DM distribution
in the purely dissipationless regime. The main effect is contraction of DM in response to the
increasing depth of the central potential during baryon cooling and condensation, which is often
modeled under the assumption of slow contraction conserving adiabatic invariants of particle orbits
(e.g., Zel’dovich et al. 1980, Barnes & White 1984, Blumenthal et al. 1986, Ryden & Gunn 1987).
The standard model of such adiabatic contraction assumes that DM particles are predominantly
on circular orbits, and for each shell of DM at radius r the product of the radius and the enclosed
mass rM(<r) is conserved (Blumenthal et al. 1986). The model makes a number of simplifying
assumptions and does not take into account effects of mergers. Nevertheless, it was shown to
provide a reasonably accurate description of the results of cosmological simulations (Gnedin et al.
2004). Its accuracy can be further improved by relaxing the assumption of circular orbits and
adopting an empirical ansätz, in which the conserved quantity is r M (<r̄), where r̄ is the average
radius along the particle orbit, instead of rM(<r) (Gnedin et al. 2004). At the same time, several
recent studies showed that no single set of parameters of such simple models describes all objects
that form in cosmological simulations equally well (Gustafsson, Fairbairn & Sommer-Larsen 2006;
Abadi et al. 2010; Tissera et al. 2010; Gnedin et al. 2012).
A more subtle but related effect is the increase of the overall concentration of DM within the
virial radius of halos due to redistribution of binding energy between DM and baryons during the
process of cluster assembly (Rudd, Zentner & Kravtsov 2008). The larger range of radii over which
this effect operates makes it a potential worry for the precision constraints from the cosmic shear
power spectrum ( Jing et al. 2006; Rudd, Zentner & Kravtsov 2008). This effect depends primarily
on the fraction of baryons that condense into the central halo galaxies and may be mitigated by
the blow-out of gas by efficient AGN or SN feedback (van Daalen et al. 2011). The effects of
baryons on the overall concentration of mass distribution in clusters are, thus, uncertain, but can
potentially increase halo concentration and thereby significantly enhance the cross section for
strong lensing (Puchwein et al. 2005, Rozo et al. 2008, Mead et al. 2010) and affect statistics of
strong lens distribution in groups and clusters (e.g., More et al. 2011).
A number of studies have derived observational constraints on density profiles of clusters and
their concentrations (Pointecouteau, Arnaud & Pratt 2005; Vikhlinin et al. 2006; Buote et al. 2007;
Schmidt & Allen 2007; Mandelbaum, Seljak & Hirata 2008; Ettori et al. 2010; Okabe et al. 2010;
Wojtak & Łokas 2010; Umetsu et al. 2011a,b; Sereno & Zitrin 2012). Although most of these
studies find that the concentrations of galaxy clusters predicted by CDM simulations are in the
ballpark of values derived from observations, the agreement is not perfect and there is tension
between model predictions and observations, which may be due to effects of baryon dissipation
(e.g., Rudd, Zentner & Kravtsov 2008; Fedeli 2012).
Some studies do find that the concordance cosmology predictions of the average cluster con-
centrations are somewhat lower than the average values derived from X-ray observations (Buote
et al. 2007, Schmidt & Allen 2007, Duffy et al. 2008). Moreover, lensing analyses indicate that the
slope of the density profile in central regions of some clusters may be shallower than predicted
(Tyson, Kochanski & dell’Antonio 1998; Sand et al. 2004, 2008; Newman et al. 2009, 2011),
whereas concentrations are considerably higher than both theoretical predictions and most other
observational determinations from X-ray and WL analyses (Comerford & Natarajan 2007; Oguri
et al. 2009, 2012; Zitrin et al. 2011).
At this point, it is not clear whether these discrepancies imply serious challenges to the CDM
structure formation paradigm, unknown baryonic effects flattening the profiles in the centers, or
unaccounted systematics in the observational analyses [e.g., N. Dalal & C. R. Keeton, unpublished
(astro-ph/0312072); Hennawi et al. 2007]. When considering such comparisons, it is important
to remember that density profiles in cosmological simulations are always defined with respect to
the center defined as the global density peak or potential minimum, whereas in observations the
corresponding location is not as unambiguous as in simulations and the choice of center may affect
the derived slope.
It should be noted that improved theoretical predictions for cluster-sized systems generally pre-
dict larger concentrations for the most massive objects than do extrapolations of the concentration-
mass relations from smaller mass objects (Zhao et al. 2009; Bhattacharya, Habib & Heitmann 2011;
Prada et al. 2012). In addition, as we noted above, the evolution predicted for the concentrations
of these rarest objects is much weaker than c ∝ (1 + z) found for smaller mass halos, so rescaling
the concentrations of high-redshift clusters by (1 + z) factor, as is often done, could lead to an
overestimate of their concentrations.
3.5.2. Velocity dispersion profile and velocity anisotropy. Velocity dispersion profile is a
halo property related to its density profile. Simulations show that this profile generally increases
from the central value to a maximum at r ≈ rs and slowly decreases outward (e.g., Cole & Lacey
1996; Rasia, Tormen & Moscardini 2004). One remarkable result illustrating the close connection
between density and velocity dispersion is that for collapsed halos in dissipationless simulations
the ratio of density to the cube of the rms velocity dispersion can be accurately described by a
power law over at least three decades in radius (Taylor & Navarro 2001): Q(r) ≡ ρ/σ 3 ∝ r −α with
α ≈ 1.9.
An important quantity underlying the measured velocity dispersion profile is the profile of the
mean velocity, and the mean radial velocity, v̄r , in particular. For a spherically symmetric matter
distribution in HE, we expect v̄r = 0. Therefore, the profile of v̄r is a useful diagnostic of deviations
from equilibrium at different radii. Simulations show that clusters at z = 0 generally have zero
mean radial velocities within r ≈ Rvir and turn sharply negative between 1 and ≈ 3Rvir , where
density is dominated by matter infalling onto clusters (Cole & Lacey 1996; Eke, Navarro & Frenk
1998; Cuesta et al. 2008).
The distinguishing characteristic between gas and DM is the fact that gas has an isotropic
velocity dispersion tensor on small scales, whereas DM in general does not. On large scales,
however, both gas and DM may have velocity fields that are anisotropic. The degree of velocity
anisotropy is commonly quantified by the anisotropy profile, β(r) (see Section 3.4). DM anisotropy
is mild: β ≈ 0−0.1 near the center and increases to β ≈ 0.2−0.4 near the virial radius (Cole
& Lacey 1996; Eke, Navarro & Frenk 1998; Colı́n, Klypin & Kravtsov 2000; Rasia, Tormen
& Moscardini 2004; Lemze et al. 2012). Interestingly, velocities exhibit substantial tangential
anisotropy outside the virial radius in the infall region of clusters (Cuesta et al. 2008, Lemze et al.
2012). Another interesting finding is that the velocity anisotropy correlates with the slope of the
density profile (Hansen & Moore 2006), albeit with significant scatter (Lemze et al. 2012).
The gas component also has some residual motions driven by mergers and gas accretion
along filaments. Gas velocities tend to have tangential anisotropy (Rasia, Tormen & Moscardini
2004) because radial motions are inhibited by the entropy profile, which is convectively stable in
general.
3.5.3. Shape. Although the density structure of mass distribution in clusters is most often de-
scribed by spherically averaged profiles, clusters are thought to collapse from generally triaxial
density peaks (Bardeen et al. 1986, Doroshkevich 1970). The distribution of matter within halos
resulting from hierarchical collapse is triaxial as well (Frenk et al. 1988, Dubinski & Carlberg
1991, Warren et al. 1992, Cole & Lacey 1996, Jing & Suto 2002, Kasun & Evrard 2005, Allgood
et al. 2006), with triaxiality predicted by dissipationless simulations increasing with decreasing
distance from halo center (Allgood et al. 2006). Triaxiality of halos decreases with decreasing mass
and redshift (Kasun & Evrard 2005, Allgood et al. 2006) in a way that again can be parameterized
in a universal form as a function of peak height (Allgood et al. 2006). The major axis of the triaxial
distribution of clusters is generally aligned with the filament connecting a cluster with its nearest
neighbor of comparable mass (e.g., West & Blakeslee 2000, Lee et al. 2008), which reflects the
fact that a significant fraction of mass and mergers is occurring along such filaments (e.g., Onuora
& Thomas 2000, Lee & Evrard 2007).
Jing & Suto (2002) showed how the formalism of density distribution as a function of distance
from cluster center can be extended to the density distribution in triaxial shells. Accounting for
such triaxiality is particularly important in theoretical predictions and observational analyses of
weak and strong lensing (N. Dalal & C. R. Keeton, unpublished data (astro-ph/0312072); Clowe,
De Lucia & King 2004; Oguri et al. 2005; Corless & King 2007; Hennawi et al. 2007; Becker &
Kravtsov 2011). At the same time, it is important to keep in mind that, as with many other results
derived mainly from dissipationless simulations, the physics of baryons may modify predictions
substantially.
The shape of the DM distribution in particular is quite sensitive to the degree of central
concentration of mass. As baryons condense toward the center to form a central galaxy within a
halo, the DM distribution becomes more spherical (Dubinski 1994; Evrard, Summers & Davis
1994; Tissera & Dominguez-Tenreiro 1998; Kazantzidis et al. 2004). The effect increases with
decreasing radius, but is substantial even at half of the virial radius (Kazantzidis et al. 2004).
The main mechanism behind this effect lies in adiabatic changes of the shapes of particle orbits
in response to more centrally concentrated mass distribution after baryon dissipation (Dubinski
1994, Debattista et al. 2008).
In considering effects of triaxiality, it is important to remember that triaxiality of the hot
intracluster gas and DM distribution are different (gas is rounder; see, e.g., discussion in Lau
et al. 2011, and references therein). This is one of the reasons why mass proxies defined within
spherical aperture using observable properties of gas (see Section 4 below) exhibit small scatter
and are much less sensitive to cluster orientation.
The observed triaxiality of the ICM can be used as a probe of the shape of the underlying
potential (Lau et al. 2011) and as a powerful diagnostic of the amount of dissipation that is
occurring in cluster cores (Fang, Humphrey & Buote 2009) and of the mass of the central cluster
galaxy (Lau et al. 2012).
to define the entire mass that will ever collapse onto a halo in simulations (Cuesta et al. 2008,
Anderhalden & Diemand 2011), it is impossible to measure this mass in observations, which makes
it of interest only from the standpoint of the theoretical models of halo collapse.
3.6.1. The friends-of-friends mass. Historically, the FoF algorithm was used to define groups
and clusters of galaxies in observations (Huchra & Geller 1982, Press & Davis 1982, Einasto et al.
1984) and was adopted to define collapsed objects in simulations of structure formation (Einasto
et al. 1984, Davis et al. 1985). The FoF algorithm considers two particles to be members of the
same group (i.e., friends) if they are separated by a distance that is less than a given linking length.
Friends of friends are considered to be members of a single group—the condition that gives the
algorithm its name. The linking length, the only free parameter of the method, is usually defined
in units of the mean interparticle separation: b = l/l,¯ where l is the linking length in physical units
and l¯ = n̄−1/3 is the mean interparticle separation of particles with mean number density of n̄.
Attractive features of the FoF algorithm are its simplicity (it has only one free parameter), a
lack of any assumptions about the halo center, and the fact that it does not assume any particular
halo shape. Therefore, it can better match the generally triaxial, complex mass distribution of
halos forming in the hierarchical structure formation models.
The main disadvantages of the FoF algorithm are the difficulty in theoretical interpretation
of the FoF mass, and sensitivity of the FoF mass to numerical resolution and the presence of
substructure. For the smooth halos resolved with many particles, the FoF algorithm with b = 0.2
defines the boundary corresponding to the fixed local density contrast of δFoF ≈ 81.62 (More et al.
2011). Given that halos forming in hierarchical cosmologies have concentrations that depend on
mass, redshift, and cosmology, the enclosed overdensity of the FoF halos also varies with mass,
redshift, and cosmology. Thus, for example, for the current concordance cosmology, the FoF
halos (defined with b = 0.2) of mass 1011 −1015 M have enclosed overdensities of ∼450−350
at z = 0 and converge to overdensity of ∼200 at high redshifts where concentration reaches
its minimum value of c ≈ 3−4 (More et al. 2011). For small particle numbers, the boundary
of the FoF halos becomes “fuzzier” and depends on the resolution (and so does the FoF mass).
Simulations most often have fixed particle mass, and the number of particles therefore changes
with halo mass, which means that properties of the boundary and mass identified by the FoF
are mass dependent. The presence of substructure in well-resolved halos further complicates the
resolution and mass dependence of the FoF-identified halos (More et al. 2011). Furthermore, it
is well known that the FoF may spuriously join two neighboring distinct halos with overlapping
volumes into a single group. The fraction of such neighbor halos that are “bridged” increases
significantly with increasing redshift.
3.6.2. The spherical overdensity mass. The spherical overdensity algorithm defines the bound-
ary of a halo as a sphere of radius enclosing a given density contrast with respect to the reference
density ρ. Unlike the FoF algorithm, the definition of an SO halo also requires a definition of the
halo center. The common choices for the center in theoretical analyses are the peak of density,
the minimum of the potential, the position of the most bound particle, or, more rarely, the center
of mass. Given that the center and the boundary need to be found simultaneously, an iterative
scheme is used to identify the SO boundary around a given peak. The radius of the halo boundary,
R , is defined by solving the implicit equation
4π
M (< r) = ρ(z)r 3 , (13)
3
where M (< r) is the total mass profile, ρ(z) is the reference physical density at redshift z, and r is
in physical (not comoving) radius.
The choice of and the reference ρ may be motivated by theoretical considerations or by
observational limitations. For example, one can choose to define the enclosed overdensity to be
equal to the virial overdensity at collapse predicted by the spherical collapse model, ρ = vir,c ρcr
(see Section 3.2). Note that in m = 1 cosmologies, there is a choice for reference density to be
either the critical density ρcr (z) or the mean matter density ρm (z), and both are in common use.
The overdensities defined with respect to these reference densities, which we denote here as c
and m , are related as m = c / m (z). Note that m (z) = m0 (1 + z)3 /E 2 (z), where E(z) is
given by Equation 4. For concordance cosmology, 1 − m (z) < 0.1 at z ≥ 2 and the difference
between the two definitions decreases at these high redshifts. In observations, the choice may
simply be determined by the extent of the measured mass profile. Thus, masses derived from
X-ray data under the assumption of HE are limited by the extent of the measured gas density and
temperature profiles and are therefore often defined for the high values of overdensity: c = 2,500
or c = 500.
The crucial difference from the FoF algorithm is the fact that the SO definition forces a spherical
boundary on the generally nonspherical mass definition. In addition, spheres corresponding to
different halos may overlap, which means that a certain fraction of mass may be double counted
(although in practice this fraction is very small; see, e.g., discussion in section 2.2 of Tinker et al.
2008).
The advantage of the SO algorithm is the fact that the SO-defined mass can be measured both
in simulations and in observational analyses of clusters. In the latter, the SO mass can be estimated
from the total mass profile derived from the hydrostatic and Jeans equilibrium analysis for the
ICM gas and galaxies, respectively (see Section 3.4 above) or gravitational lensing analyses (e.g.,
Vikhlinin et al. 2006, Hoekstra 2007). Furthermore, suitable observables that correlate with the SO
mass with scatter of 10% can be defined (see Section 4 below), thus making this mass definition
preferable in the cosmological interpretation of observed cluster populations. The small scatter
shows that the effects of triaxiality are quite small in practice. Note, however, that the definition
of the halo center in simulations and observations may not necessarily be identical, because in
observations the cluster center is usually defined at the position of the peak or the centroid of
X-ray emission or SZ signal, or at the position of the BCG.
3.7.1. The mass function and its universality. Although real clusters are usually characterized
by some quantity derived from observations (an observable), such as the X-ray luminosity, such
quantities are generally harder to predict ab initio in theoretical models because they are sensitive
to uncertain physical processes affecting the properties of cluster galaxies and intracluster gas.
Therefore, the predictions for the abundance of collapsed objects are usually quantified as a
function of their mass, i.e., in terms of the mass function d n(M , z) defined as the comoving volume
number density of halos in the mass interval [M , M + dM] at a given redshift z. The predicted
mass function is then connected to the abundance of clusters as a function of an observable using
a calibrated mass-observable relation (discussed in Section 4, below).
The first statistical model for the abundance of collapsed objects as a function of their mass was
developed by Press & Schechter (1974). The main powerful principle underlying this model is that
the mass function of objects resulting from nonlinear collapse can be tied directly and uniquely to
the statistical properties of the initial linear density contrast field δ(x).
Statistically, one can define the probability F(M) that a given region within the initial overden-
sity field, smoothed on a mass scale M, δM (x), will collapse into a halo of mass M or larger:
∞
F (M ) = p(δ)Ccoll (δ)d δ, (14)
−1
where p(δ)d δ is the PDF of δM (x), which is given by Equation 2 for the Gaussian initial density
field, and Ccoll is the probability that any given point x with local overdensity δM (x) will actually
collapse. The mass function can then be derived as a fraction of the total volume collapsing into
halos of mass (M , M + dM), i.e., dF/dM, divided by the comoving volume within the initial density
field occupied by each such halo, i.e., M /ρ̄:
d n(M ) ρ̄m d F
= . (15)
dM M dM
In their pioneering model, Press & Schechter (1974) have adopted the ansätz motivated by the
spherical collapse model (see Section 3.1) that any point in space with δM (x)D+0 (z) ≥ δc will
collapse into a halo of mass ≥ M by redshift z; i.e., Ccoll (δ) = (δ − δc ), where is the Heaviside
step function. Note that δM (x) used above is not the actual initial overdensity, but the initial
overdensity evolved to z = 0 with the linear growth rate. One can √ easily check that, for a Gaussian
initial density field, this assumption gives F (M ) = 12 erfc[δc /( 2σ (M , z))] = F (ν). This line of
arguments and assumptions thus leads to an important conclusion that the abundance of halos
of mass M at redshift z is a universal function of only their peak height ν(M , z) ≡ δc /σ (M , z).
In particular, the fraction of mass in halos per logarithmic interval of mass in such a model
is
d n(M ) ρ̄m d F ρ̄m d ln ν ∂ F ρ̄m d ln ν ρ̄m
= = ≡ g(ν) ≡ ψ(ν). (16)
d ln M M d ln M M d ln M ∂ ln ν M d ln M M
Clearly, the shape ψ(ν) in such models is set by the assumptions of the collapse model.
Numerical studies based on cosmological simulations have eventually revealed that the shape
ψPS (ν) predicted by the Press & Schechter (1974) model deviates by 50% from the actual shape
measured in cosmological simulations (e.g., Klypin et al. 1995, Gross et al. 1998, Tormen 1998,
Lee & Shandarin 1999, Sheth & Tormen 1999, Jenkins et al. 2001).
A number of modifications to the original ansätz have been proposed that result in ψ(ν),
which more accurately describes simulation results. Such modifications are based on the collapse
conditions that take into account asphericity of the peaks in the initial density field (Monaco 1995;
Audit, Teyssier & Alimi 1997; Lee & Shandarin 1998; Sheth & Tormen 2002; Desjacques 2008)
and stochasticity due to the dependence of the collapse condition on peak properties other than
ν or shape (e.g., Desjacques 2008; Maggiore & Riotto 2010; Corasaniti & Achitouv 2011; de
Simone, Maggiore & Riotto 2011; Ma et al. 2011). The more sophisticated excursion set models
match the simulations more closely, albeit at the expense of more assumptions and parameters.
There may be also inherent limitations in the accuracy of such models given that they rely on
the strong assumption that one can parameterize all the factors influencing collapse of any given
point in the initial overdensity field in a relatively compact form. In the face of complications
to a simple picture of peak collapse, as discussed in Section 3.3, one can indeed expect that the
excursion set ansätze are limited in how accurately they can ultimately describe the halo mass
function.
0.5
a
Δ lnψ 0
– 0.5
10 –1
b
10 –2
ψ (ν)
10 –3
10 –5
1
ν
Figure 7
The function ψ(ν) defining the comoving abundance of collapsed halos via d n/d ln M = (ρ̄m /M )ψ(ν) as a
function of ν from different models and simulation-based calibrations. Panel a shows deviations of specific
models and calibrations for z = 0 from Tinker et al. (2010) based on a suite of CDM cosmological
simulations.
0.5
a
Δ ln b 0
–0.5
b
10 2 Tinker et al. 2010
Sheth, Mo & Tormen 2001
Mo & White 1996
10 1
b 2(ν)
10 0
10 –1
1
ν
Figure 8
The square of the bias b 2 (ν) as a function of peak height ν corresponding to halos of mass M 200m in the bias
model–based excursion set and spherical collapse barrier ( gray dashed line) (Mo & White 1996), in the
excursion set model based on ellipsoidal collapse ( purple dot-dashed line) (Sheth, Mo & Tormen 2001), and
the bias function calibrated using CDM cosmological simulations (solid blue line) (Tinker et al. 2010) for
spherical overdensity halos defined using an overdensity of = 200 with respect to the mean density. Panel
a shows deviations of excursion set models from the calibration by Tinker et al. (2010).
functional form of the bias dependence on halo mass, b h (M ), depends on the functional form
of the mass function explicitly in this framework. Simulations show that the peak–background
split model provides a fairly accurate (to ∼20%) prediction for the linear halo bias (Tinker et al.
2010).
Another line of argument illustrating the close connection between the halo abundance and
bias is the fact that if one assumes that all of the mass is in the collapsed halos, as is done,
for example, in the halo models (Cooray & Sheth 2002), the requirement that matter in the
Universe is not biased against itself implies that b(ν)g(ν)d ln ν = 1 (e.g., Seljak 2000), where
g(ν) ≡ |d ln ν/d ln M |−1 ψ(ν) (see Equation 16). This integral constraint requires that the form
of the bias function b(ν) is changed whenever ψ(ν) changes. Incidentally, the close connection
between b(ν) and ψ(ν) implies that if ψ(ν) is a universal function, then the bias b(ν) should be a
universal function as well.
The function b(ν) recently calibrated for the SO-defined halos of different overdensities using
a suite of large cosmological simulations with accuracy 5% and satisfying the integral constraint
(Tinker et al. 2010) is shown in Figure 8 for halos defined using an overdensity of = 200 with
respect to the mean. This calibration of the bias is compared to the corresponding prediction of
the Press & Schechter (1974) and the Sheth, Mo & Tormen (2001) ansätze. The figure shows that
b(ν) is a rather weak function of ν at ν < 1, but steepens substantially for rare peaks of ν > 1. It
also shows that the rarest clusters (ν ∼ 5) in the Universe can have the amplitude of the correlation
function or power spectrum that is two orders of magnitude larger than the clustering amplitude
of the galaxy-sized halos (ν 1).
3.9.1. Self-similar model: assumptions and basic expectations. The self-similar model de-
veloped by Kaiser (1986) makes three key assumptions. The first assumption is that clusters form
via gravitational collapse from peaks in the initial density field in an Einstein-de Sitter Universe,
m = 1. Gravitational collapse in such a Universe is scale-free or self-similar. The second assump-
tion is that the amplitude of density fluctuations is a power-law function of their size, (k) ∝ k3+n .
This means that initial perturbations also do not have a preferred scale (i.e., they are scale-free
or self-similar). The third assumption is that the physical processes that shape the properties of
forming clusters do not introduce new scales in the problem. With these assumptions, the problem
has only two control parameters: the normalization of the power spectrum of the initial density
perturbations at an initial time, ti , and its slope, n. Properties of the density field and halo pop-
ulation at t > ti (or corresponding redshift z < zi ), such as typical halo masses that collapse or
halo abundance as a function of mass, depend only on these two parameters. One can choose
any suitable variable that depends on these two parameters as a characteristic variable for a given
problem. For evolution of halos and their abundance, the commonly used choice is to define the
characteristic nonlinear mass, MNL (see Section 3.1), which encapsulates such dependence. The
halo properties and halo abundance then become universal functions of μ ≡ M /M NL in such a
model. Thus, for example, clusters with masses M 1 (z1 ) and M 2 (z2 ) that correspond to the same
ratio M 1 (z1 )/M NL (z1 ) = M 2 (z2 )/M NL (z2 ) have the same dimensionless properties, such as gas
fraction or concentration of their mass distribution.
As we have discussed above, in more general cosmologies the halo properties and mass function
should be universal functions of the peak height ν, which encapsulates the dependence on the
shape and normalization of the power spectra for general, nonpower-law shapes of the fluctuation
spectrum.
3.9.2. The Kaiser model for cluster scaling relations. In this section, we define cluster mass
to be the mass within the sphere of radius R, encompassing characteristic density contrast, ,
with respect to some reference density ρr (usually either ρcr or ρm ): M = (4π/3)ρr R3 . In this
definition, radius and mass are directly related and interchangeable. The model assumes spherical
symmetry and that the ICM is in equilibrium within the cluster gravitational potential, so that the
HE equation (Equation 9) holds. The mass M (<R) derived from the HE equation is proportional
to T(R)R, and the sum of the logarithmic slopes of the gas density and temperature profiles at R.
In addition to the assumptions about self-similarity discussed above, a key assumption made in
the model by Kaiser (1986) is that these slopes are independent of M, so that
M 1 2
T ∝ ∝ (ρr ) 3 M 3 . (17)
R
Note that, formally, the quantity T appearing in the above equation is the temperature measured
at R, whereas some average temperature at smaller radii is usually measured in observations. How-
ever, if we parameterize the temperature profile as T (r) = T ∗ T̃ (x), where T ∗ is the characteristic
temperature and T̃ is the dimensionless profile as a function of dimensionless radius x ≡ r/R,
and we assume that T̃ (x) is independent of M, any temperature averaged over the same fraction
of radial range [x1 , x2 ] will scale as ∝T ∗ ∝ T (R). The latter is not strictly true for the “spectro-
scopic” temperature, TX , derived by fitting an observed X-ray spectrum to a single-temperature
bremsstrahlung model (Mazzotta et al. 2004, Vikhlinin 2006), although in practice deviations of
TX from the expected behavior for T ∗ are small.
The gas mass within R can be computed by integrating over the gas density profile, which,
by analogy with temperature, we parameterize as ρg (r) = ρg∗ ρ̃g (x), where ρg∗ is the characteristic
density and ρ̃g is the dimensionless profile. The gas mass within R can then be expressed as
1
ρg∗
M g (< R) = 4πρg∗ R3 x 2 ρ̃g (x)d x = 3M Iρg ∝ M (< R). (18)
0 ρ r
The latter proportionality is assumed in the Kaiser (1986) model, which means that ρg∗ and Iρg
are assumed to be independent of M. Note that ρg∗ ∝ ρr , so ρr does not enter the M g − M
relation.
Using the scalings of M g and T with mass, we can construct other cluster properties of interest,
such as the luminosity of ICM emitted due to its radiative cooling. Assuming that the ICM emission
is due to the free-free radiation and neglecting the weak logarithmic dependence of the Gaunt
factor on temperature, the bolometric luminosity can be written as (e.g., Sarazin 1986)
1 M g2 1 7 4
Lbol ∝ ρg2 T 2 V ∝ T 2 ∝ 6 M 3 . (19)
V
We omit ρ r in these equations for clarity; it suffices to remember that ρ r enters into the scaling
relations exactly as . Note that the bolometric luminosity of a cluster is not observable directly,
and the X-ray luminosity in soft band (e.g., 0.5–2 keV), LXs , is frequently used. Such soft band
X-ray luminosity is almost insensitive to temperature at T > 2 keV (as can be easily verified with
a plasma emission code; Fabricant & Gorenstein 1983), so that its temperature dependence can
be neglected. LXs then scales as
M g2
LXs ∝ ρg2 V ∝ ∝ M . (20)
V
At temperatures T < 2 keV, temperature dependence is more complicated both for the bolometric
and soft X-ray emissivity due to significant flux in emission lines. Therefore, strictly speaking, for
lower mass systems the above L−M scaling relations are not applicable, and scaling of the emissivity
with temperature needs to be calibrated separately, taking also into account the ICM metallicity.
The same is true for luminosity defined in some other energy band or for the bolometric luminosity.
Another quantity of interest is the ICM “entropy” defined in X-ray analyses as
kB T X
KX ≡ 2/3
∝ ρg−2/3 T ∝ −1/3 M 2/3 , (21)
ne
where ne is the electron number density. Finally, the quantity Y = M g T , where gas mass and
temperature are measured within a certain range of radii scaled to R , is used to characterize the
ICM in the analyses of SZ and X-ray observations. This quantity is expected to be a particularly
robust proxy of the cluster mass (e.g., da Silva et al. 2004; Motl et al. 2005; Kravtsov, Vikhlinin
& Nagai 2006; Nagai 2006; Fabjan et al. 2011; see also discussion in Section 4) because it is
proportional to the global thermal energy of the ICM. Using Equations 18 and 17, the scaling of
By analogy with the M g –M relation, we can assume that the T−M relation can be well described
by a power law of the form
1 2
+αT
T = CT 3 M 3 , (25)
where α T describes mild deviation from the scaling due, e.g., to mild dependence of gas and tem-
perature profile slopes in the HE equation. The dimensionless quantities involving temperature
2/3
T can be constructed using ratios of T with T NL = (μm p /k B )GM NL /RNL ∝ 1/3 M NL . As be-
fore for the gas fraction, requirement that such dimensionless ratio depends only on μ requires
−α
CT ∝ M NLT so that
1 2 1 2
−α +αT
T = CTo M NLT 3 M 3 = CTo 3 M 3 μαT . (26)
Other observable-mass scaling relations can be constructed in the manner similar to the derivation
of the original relations above. These are summarized below for the specific choice of ρr ≡
c ρcr (z) ∝ h 2 E 2 (z):
−αg 1+αg
M g ∝ M NL (z)M c , (27)
−α 2/3+α
T ∝ E(z)2/3 M NLT M c T , (28)
α
−2αg −αT /2 4/3+2αg + T
Lbol ∝ E(z)7/3 M NL (z)M c 2
, (29)
2 2
α −α (1−αg )+αT
3 g T
K ∝ E −2/3 (z)M NL (z)M 3c , (30)
−αg −αT 5/3+αg +αT
Y ∝ E(z)2/3 (z)M NL (z)M c . (31)
In all of the relations one can, of course, recover the original relations by setting αg = αT = 0. The
observable-mass relations can be used to predict observable-observable relations by eliminating
mass from the corresponding relations above.
Note that the evolution of scaling relations in this extended model arises both from the redshift
dependence of (z)ρr (z) and from the extra redshift dependence due to factors involving M NL .
The practical implication is that if measurements show that αg = 0 and/or αT = 0 at some
redshift, the original Kaiser scaling relations are not expected to describe the evolution, even if the
evolution is self-similar. Instead, relations given by Equations 27–31 should be used. Note that
at z ≈ 0, observations indicate that within the radius r500 , enclosing overdensity c = 500 and
αg ≈ 0.1−0.2, while αT ≈ 0.0 − 0.1. Therefore, the extra evolution compared to the Kaiser model
predictions due to factors involving αg and, to a lesser degree, factors involving αT is expected.
Such evolution, consistent with predictions of the above equations, is indeed observed both in
simulations (see, e.g., figure 10 in Vikhlinin et al. 2009a) and in observations (Lin et al. 2012;
although see Böhringer, Dolag & Chon 2012).
In practice, evolution of the scaling relations can be quite a bit more complicated than the
evolution predicted by the above equations. The complication is not due to any deviation from
self-similarity but rather due to specific mass definition and the fact that cluster formation is an
extended process that is not characterized by a single collapse epoch. Some clusters evolve only
mildly after their last major merger. However, the mass of such clusters will change with z even if
their potential does not change, simply because mass definition is tied to a reference density that
changes with expansion of the Universe and because density profiles of clusters extend smoothly
well beyond the virial radius. Any observable property of clusters that has radial profile differing
from the mass profile, but which is measured within the same R , will change differently than
mass with redshift. As a simplistic toy model, consider a population of clusters that does not evolve
from z = 1 to z = 0. Their X-ray luminosity is mostly due to the ICM in the central regions
of clusters and it is not sensitive to the outer boundary of integration as long as it is sufficiently
large. Thus, X-ray luminosity of such a nonevolving population will not change with z, but masses
M of clusters will increase with decreasing z as the reference density used to define the cluster
boundary decreases. Normalization of the LX − M relation will thus decrease with decreasing
redshift simply due to the definition of mass. The strength of the evolution will be determined by
the slope of the mass profile around R , which is weakly dependent on mass. Such an effect may,
thus, result in the evolution of both the slope and normalization of the relation. In this respect,
quantities that have radial profiles most similar to the total mass profile (e.g., M g , Y ) will suffer
the least from such spurious evolution.
Finally, we note again that, in principle, for general nonpower-law initial perturbation spectra of
the CDM models, the scaling with M /M NL needs to be replaced with scaling with the peak height
ν. For clusters within a limited mass range, however, the power spectrum can be approximated
by a power law and, thus, a characteristic mass similar to M NL can be constructed, although such
mass should be within the typical mass range of the clusters. The latter is not true for M NL , which
is considerably smaller than typical cluster mass at all z.
equation, we have M HE ∝ TR ∝ d A ∝ h −1 . The same scaling is expected for the mass derived
from the weak lensing shear profile measurements.
If the gas mass is measured from the X-ray flux from a volume V ∝ R3 ∝ θ 3 d A3 , which scales
as f = LX /(4π d L2 ) ∝ ρg2 V /d L2 ∝ M g2 /(V d L2 ) ∝ M g2 /(θd L2 d A3 ) and where f and θ are observables,
3/2
gas mass then scales with distance as M g ∝ d L d A ∝ h 5/2 . This dependence can be exploited to
constrain cosmological parameters, as in the case of X-ray measurements of gas fractions in clusters
(Ettori, Tozzi & Rosati 2003; Allen et al. 2004, 2008; LaRoque et al. 2006; Ettori et al. 2009) or
abundance evolution of clusters as a function of their observable (e.g., Vikhlinin et al. 2009b). In
this respect, the M g –M relation has the strongest scaling with distance and cosmology, whereas
the scaling of the T−M relation is the weakest (e.g., see discussion by Vikhlinin et al. 2009a).
3.10.1. Gravitational collapse of the intracluster gas. The diffuse gas infalling onto the
DM-dominated potential wells of clusters converts the kinetic energy acquired during the
collapse into thermal energy via adiabatic compression and shocks. As gas settles into HE, its
temperature approaches values close to the virial temperature corresponding to the cluster mass.
In the spherically symmetric collapse model of Bertschinger (1985), supersonic accretion gives
rise to the expanding shock at the interface of the inner hydrostatic gas with a cooler, adiabatically
compressed, external medium. Real three-dimensional collapse of clusters is more complicated and
exhibits large deviations from spherical symmetry, as accretion proceeds both in a quasi-spherical
fashion from low-density regions and along relatively narrow filaments. The gas accreting along
the latter penetrates much deeper into the cluster virial region and does not undergo a shock
at the virial radius (see Figure 9). The strong shocks are driven not just by the accretion of gas
from the outside but also “inside-out” during major mergers (e.g., Poole et al. 2007).
The shocks arising during cluster formation can be classified into two broad categories: strong
external shocks surrounding filaments and the virialized regions of DM halos and weaker internal
shocks, located within the cluster virial radius (e.g., Pfrommer et al. 2006, Skillman et al. 2008,
Vazza et al. 2009). The strong shocks arise in the high–Mach number flows of the intergalactic
gas, whereas weak shocks arise in the relatively low–Mach number flows of gas in filaments and
accreting groups, which was preheated at earlier epochs by the strong shocks surrounding filaments
or external groups. Figure 9a shows these two types of shocks in a map of the shocked cells
identified in a cosmological adaptive mesh refinement simulation of a region surrounding a galaxy
cluster (from Vazza et al. 2009), along with the gas velocity field. This map highlights the strong
external shocks, characterized by high Mach numbers M > 30, surrounding the cluster at several
virial radii from the cluster center and weaker internal shocks, with M 2–3. The cluster is
shown at the epoch immediately following a major merger, which generated substantial velocities
of gas within virial radius. As we discuss in Section 4 below, incomplete thermalization of these
gas motions is one of the main sources of nonthermal pressure support in the ICM.
Figure 9b shows the distribution of the kinetic energy processed by shocks, as a function of
the local shock Mach number, for different redshifts (Skillman et al. 2008). The figure shows
a b
10 42 z
0
10 40
10 39
10 38
Figure 9
(a) Map of the shocked cells identified by the divergence of velocity colored by the local shock Mach number and turbulent gas velocity
field (streamlines) in a slice of simulation box 7.5 Mpc on a side and depth of 18 kpc at z = 0.6 for a simulated cluster that reaches a mass
of ∼ 2 × 1014 M by z = 0 (from Vazza et al. 2009). (b) Redshift evolution of the distribution of the kinetic energy processed by shocks,
as a function of the Mach number M in a cosmological simulation (from Skillman et al. 2008). Results shown in both panels are based
on the adaptive mesh refinement ENZO code (O’Shea et al. 2004).
that a large fraction of the kinetic energy is processed by weak internal shocks, and this fraction
increases with decreasing redshift as more and more of the accreting gas is preheated in filaments.
Yet, Figure 9a highlights that large M shocks surround virialized halos in such a way that most
gas particles accreted in a galaxy cluster must have experienced at least one strong shock in their
past.
Because gravity does not have a characteristic length scale, we expect the predictions of the
self-similar model, presented in Section 3.9, to apply when gravitational gas accretion determines
the thermal properties of the ICM. The scaling relations and their evolution predicted by the
self-similar model are indeed broadly confirmed by the nonradiative hydrodynamical simulations
that include only gravitational heating (e.g., Navarro, Frenk & White 1995; Eke, Navarro & Frenk
1998; Nagai, Kravtsov & Vikhlinin 2007), although some small deviations arising due to small
differences in the dynamics of baryons and DM were also found (Ascasibar et al. 2006, Nagai 2006,
Stanek et al. 2010).
As discussed in Section 2, observations carried out with the Chandra and XMM-Newton
telescopes during the past decade showed that the outer regions of clusters (r r2,500 ) exhibit
self-similar scaling, whereas the core regions exhibit strong deviations from self-similarity. In
particular, gas density in the core regions of small-mass clusters is lower than expected from self-
similar scaling of large-mass systems. These results indicate that some additional nongravitational
processes are shaping properties of the ICM. We review some of these processes studied in cluster
formation models below.
3.10.2. Phenomenological preheating models. The first proposed mechanism to break self-
similarity was high-redshift (zh 3) pre-heating by nongravitational sources of energy, presum-
ably by a combined action of the AGN and stellar feedback (Evrard & Henry 1991, Kaiser 1991).
The specific extra heating energy per unit mass, Eh , defines the temperature scale T ∗ ∝ Eh /k B ,
such that clusters with virial temperature T vir > T ∗ should be left almost unaffected by the extra
heating, whereas in smaller clusters with T vir < T ∗ gas accretion is suppressed. As a result, gas
density is relatively lower in lower mass systems, especially at smaller radii, while their entropy is
higher.
Both analytical models (e.g., Tozzi & Norman 2001, Babul et al. 2002, Voit et al. 2003) and
hydrodynamical simulations (e.g., Bialek, Evrard & Mohr 2001; Borgani et al. 2002; Muanwong
et al. 2002) have demonstrated that, with a suitable preheating prescription and typical heating
injection of Eh ∼ 0.5 –1 keV per gas particle, self-similarity can be broken to the degree required
to reproduce observed scaling relations. Studies of the possible feedback mechanisms show that
such amounts of energy cannot be provided by SNe (e.g., Kravtsov & Yepes 2000, Renzini 2000,
Borgani et al. 2004, Kay et al. 2007, Henning et al. 2009) and must be injected by the AGN
population (e.g., Wu, Fabian & Nulsen 2000; Lapi, Cavaliere & Menci 2005; Bower, McCarthy
& Benson 2008) or by some other unknown source.
However, regardless of the actual sources of heating, strong widespread heating at high redshifts
would conflict with the observed statistical properties of the Lyman-α forest (Shang, Crotts &
Haiman 2007, Borgani & Viel 2009). Moreover, hydrodynamical simulations have demonstrated
that simple preheating models predict large isentropic cores (e.g., Borgani et al. 2005, Younger &
Bryan 2007) and shallow pressure profiles (Kay et al. 2012). This is at odds with the entropy and
pressure profiles of real clusters, which exhibit smoothly declining entropy down to r ∼ 10−20 kpc
(e.g., Cavagnolo et al. 2009, Arnaud et al. 2010).
3.10.3. The role of radiative cooling. The presence of galaxies in clusters and low levels of the
ICM entropy in cluster cores are a testament that radiative cooling has operated during cluster
formation in the past and is an important process shaping thermodynamics of the core gas at
present. Therefore, in general, radiative cooling cannot be neglected in realistic models of cluster
formation. Given that cooling generally introduces new scales, it can break self-similarity of the
ICM even in the absence of heating (Voit & Bryan 2001). In particular, cooling removes low-
entropy gas from the hot ICM phase in the cluster cores, which is replaced by higher entropy gas
from larger radii. Somewhat paradoxically, the cooling thus leads to an entropy increase of the
hot, X-ray emitting ICM phase. This effect is illustrated in Figure 10, which shows the entropy
maps in the simulations of the same cluster with and without cooling. In the absence of cooling
(Figure 10a), the innermost region of the cluster is filled by low-entropy gas. Merging substruc-
tures also carry low-entropy gas, which generates comet-like features by ram-pressure stripping,
and is hardly mixed in the hotter ambient of the main halo. In the simulation with radiative cooling
(Figure 10b), most of the low-entropy gas associated with substructures and the central cluster
region is absent, and most of the ICM has a relatively high entropy.
A more quantitative analysis of the entropy distribution for these simulated clusters is shown in
Figure 11, in which the entropy profiles of clusters simulated with inclusion of different physical
processes are compared with the baseline analytic spherical accretion model; this model predicts
the power-law entropy profile K (r) ∝ r 1.1 (e.g., Tozzi & Norman 2011; see also Voit 2005).
The figure shows that the entropy profile in the simulation with radiative cooling is signifi-
cantly higher than that of the nonradiative simulation. The difference in entropy is as large as
an order of magnitude in the inner regions of the cluster and greater by a factor of two even at
r500 .
Figure 10
Maps of entropy in cosmological hydrodynamical simulations of a galaxy cluster of mass M500 1015 h −1 M
at z = 0, carried out (a) without and (b) with radiative cooling. Brighter colors correspond to lower gas
entropy. Each panel encompasses a physical scale of 6.5 h−1 Mpc, which corresponds to ≈2.5 virial radii for
this cluster. The simulations have been carried out using the GADGET-3 smoothed particle hydrodynamics
code (Springel 2005).
Interestingly, the predicted level of entropy at r ∼ r2,500 –r500 in the simulations with cooling
(but no significant heating) is consistent with the ICM entropy inferred from X-ray observations.
However, this agreement is likely to be spurious because it is achieved with the amount of cooling
that results in conversion of ≈40% of the baryon mass in clusters into stars and cold gas, which
is inconsistent with observational measurements of cold fraction varying from 20−30% for
small-mass, X-ray-emitting clusters to 10% for massive clusters (see Section 2).
Finally, note that inclusion of cooling in simulations with preheating discussed above usually
results in problematic star-formation histories. In fact, if preheating takes place at a sufficiently
high redshift, clusters exhibit excessive cooling at lower redshifts, as preheated gas collapses and
cools at later epochs compared to the simulations without preheating (e.g., Tornatore et al. 2003).
These results highlight the necessity to treat cooling and heating processes simultaneously using
heating prescriptions that can realistically reproduce the heating rate of the ICM gas as a function
of cosmic time. We discuss efforts in this direction next.
3.10.4. Thermodynamics of the intracluster medium with stellar and active galactic nuclei
feedback. The results discussed above strongly indicate that, in order to reproduce the overall
properties of clusters, cooling should be modeled together with a realistic prescription for non-
gravitational heating. This is particularly apparent in the cluster cores, where a steady heating is
required to offset the ongoing radiative cooling observed in the form of strong X-ray emission
(see, e.g., Peterson & Fabian 2006). Studies of the feedback processes in clusters are one of the
frontiers in cluster formation modeling. Although we do not yet have a complete picture of the
ICM heating, a number of interesting and promising results have been obtained.
In Figure 11, the solid line shows the effect of the SN feedback on the entropy profile. In
these simulations, the kinetic feedback of SNe is included in the form of galactic winds carrying
the kinetic energy comparable to all of the energy released by Type II SNe expected to occur
10 4
10 3 SF
K(r)
+ AGN
SF + SN
SF + SN
.1
r1
∝
r)
K(
10 2
e
ativ
radi
Non
10 –2 10 –1 r500 10 0
r/rvir
Figure 11
Radial profiles of entropy (in units of kiloelectronvolt-centimeters squared) for the same simulations whose
entropy maps are shown in Figure 10. Magenta dotted, long-dashed blue, continuous red, and short-dashed
dark yellow curves refer to the nonradiative simulation and to the three radiative simulations including only
cooling and star formation (SF), the effect of galactic ejecta from supernova (SN), and the effect of active
galactic nuclei (AGN) feedback, respectively. The gray dot-dashed line shows the power-law entropy profile
with slope K (r) ∝ r 1.1 , whereas the thin gray vertical dotted line marks the position of r500 .
according to star formation in the simulation. This energy partially compensates for the radiative
losses in the central regions, which leads to a lower level of entropy in the core. However, the
core ICM entropy in these simulations is still considerably higher than observed (e.g., Sun et al.
2009). The inefficiency of the SN feedback in offsetting the cooling sufficiently is also evidenced
by temperature profiles.
Figure 4 (from Leccardi & Molendi 2008) compares the observed temperature profiles of a
sample of local clusters with results from simulations that include the SN feedback. The figure
shows that simulations reproduce the observed temperature profile at r 0.2r180 . The overall
shape of the profile at these large radii is reproduced by simulations including a wide range of
physical processes, including nonradiative simulations (e.g., Loken et al. 2002; Borgani et al. 2004;
Nagai, Kravtsov & Vikhlinin 2007). At small radii, however, the observed and predicted profiles
do not match: The profiles in simulated clusters continue to increase to the smallest resolved radii,
whereas the observed profiles reach a maximum temperature T max ≈ 2T 180 and then decrease with
decreasing radius to temperatures of ∼0.1−0.3 T max . The high temperatures of the central gas
reflect its high entropy and are due to the processes affecting the entropy, as discussed above.
Another indication that the SN feedback alone is insufficient is the fact that the stellar mass of
the BCGs in simulations that include only the feedback from SNe is a factor of two to three larger
than the observed stellar masses. For example, in the simulated clusters shown in Figure 10, the
baryon fraction in stars within r500 decreases from 40% in simulations without SN feedback to
30%, which is still a factor of two larger than observational measurements. The overestimate
of the stellar mass is reflected in the overestimate of the ICM metallicity in cluster cores (e.g.,
Borgani et al. 2008, and references therein).
Different lines of evidence indicate that energy input from the AGN in the central cluster
galaxies can provide most of the energy required to offset cooling (see McNamara & Nulsen 2007
for a comprehensive review). Because the spatial and temporal scales resolved in cosmological
simulations are larger than those relevant for gas accretion and energy input, the AGN energy
feedback can only be included via a phenomenological prescription. Such prescriptions generally
model the feedback energy input rate by assuming the Bondi gas accretion rate onto the SMBHs,
included as the sink particles, and incorporate a number of phenomenological parameters, such
as the radiative efficiency and the feedback efficiency, which quantify the fraction of the radiated
energy that thermally couples to the surrounding gas (e.g., Springel, Di Matteo & Hernquist
2005). The values of these parameters are adjusted so that simulations reproduce the observed
relation between black hole mass and the velocity dispersion of the host stellar bulge (e.g., Marconi
& Hunt 2003). An alternative way of implementing the AGN energy injection is the AGN-driven
winds, which shock and heat the surrounding gas (e.g., Omma et al. 2004, Dubois et al. 2011,
Gaspari et al. 2011).
In general, simulations of galaxy clusters based on different variants of these models have shown
that the AGN feedback can reduce star formation in massive cluster galaxies and reduce the hot
gas content in the poor clusters and groups, thereby improving agreement with the observed
relation between X-ray luminosity and temperature (e.g., Sijacki et al. 2007; Puchwein, Sijacki &
Springel 2008). Figure 12 (from Martizzi, Teyssier & Moore 2012) shows that simulations with
the AGN feedback result in stellar masses of the BCGs that agree with the masses required to
match observed stellar masses of galaxies and masses of their DM halos predicted by the models.
The figure also shows that stellar masses are overpredicted in the simulations without the AGN
feedback. Incidentally, the large-scale winds at high redshifts and stirring of the ICM in cluster
cores by the AGN feedback also help to bring the metallicity profiles in cluster simulations into
better agreement with observations (Fabjan et al. 2010, McCarthy et al. 2010).
Although results of simulations with the AGN feedback are promising, simulations so far
have not been able to convincingly reproduce the observed thermal structure of cool cores. As an
example, Figure 11 shows that the entropy profiles in such simulations still develop large constant
entropy cores inconsistent with observed profiles. Interestingly, the adaptive mesh refinement
simulations with jet-driven AGN feedback by Dubois et al. (2011) reproduce the monotonically
decreasing entropy profiles inferred from observations. However, such agreement only exists if
radiative cooling does not account for the metallicity of the ICM; in simulations that take into
account the metallicity dependence of the cooling rates, the entropy profile still develops a large
constant entropy core.
The presence of a population of relativistic particles in AGN-driven high-entropy bubbles
has been suggested as a possible solution to this problem (Guo & Oh 2008, Sijacki et al. 2008). A
relativistic plasma increases the pressure support available at a fixed temperature and can, therefore,
help to reproduce the observed temperature and entropy profiles in core regions. However, it
remains to be seen whether the required population of the cosmic rays is consistent with available
constraints inferred from γ and radio observations of clusters (e.g., Brunetti 2011, and references
therein). A number of additional processes, such as thermal conduction (e.g., Narayan & Medvedev
2001) or dynamical friction heating by galaxies (El-Zant, Kim & Kamionkowski 2004), have been
proposed. Generally, these processes cannot effectively regulate cooling in clusters by themselves
(e.g., Dolag et al. 2004, Conroy & Ostriker 2008), but they may play an important role when
operating in concert with the AGN feedback (Voit 2011) or instabilities in plasma (e.g., Sharma
et al. 2012).
1013
1011
z=0
1010
1011 1012 1013 1014 1015
Mhalo (M☉)
Figure 12
Comparison of the relation between stellar mass and total halo mass as predicted by cosmological
hydrodynamical simulations of four early-type galaxies (symbols) (from Martizzi, Teyssier & Moore 2012).
The open triangle and square refer to the simulations presented by Naab, Johansson & Ostriker (2009) and
by Feldmann et al. (2010), both based on the smoothed particle hydrodynamics codes and not including
active galactic nuclei (AGN) feedback. The filled symbols refer to the simulations by Martizzi, Teyssier &
Moore (2012) with the brightest cluster galaxies forming at the center of a relatively poor cluster carried out
with an AMR code, both including (triangle) and excluding ( pentagon) AGN feedback. The red dotted line
represents the relation expected for 20% efficiency in the conversion of baryons into stars. The solid blue
line is the prediction from Moster et al. (2010) of a model in which dark matter halos are populated with
stars in such a way as to reproduce the observed stellar mass function. The gray shaded areas represent the
1-, 2-, and 3σ scatter around the average relation.
In summary, results of the theoretical studies discussed above indicate that the AGN energy
feedback is the most likely energy source regulating the stellar masses of cluster galaxies throughout
their evolution and suppressing cooling in cluster cores at low redshifts. The latter likely requires
an interplay between the AGN feedback and a number of other physical processes: e.g., injection
of the cosmic rays in the high-entropy bubbles, buoyancy of these bubbles stabilized by magnetic
fields, dissipation of their mechanical energy through turbulence, thermal conduction, and thermal
instabilities. Although details of the interplay are not yet understood, it is clear that it must result
in a robust self-regulating feedback cycle in which cooling immediately leads to the AGN activity
that suppresses further cooling for a certain period of time.
to cluster cores and affects a small fraction of volume and mass of the clusters. In this regime,
clusters’ observational properties exhibit strong deviations from the self-similar scalings described
in Section 3.9 (see also Voit 2005). At larger radii, the ICM is remarkably regular. In this section, we
discuss the origins of such highly regular behavior and the range of radii where it can be expected.
We argue that the existence of this radial range allows us to define integral observational quantities,
which have low scatter for clusters of a given mass that are not sensitive to the astrophysical
processes operating during cluster formation and evolution. This fact is especially important for
the current and future uses of clusters as cosmological probes (Allen, Evrard & Mantz 2011;
Weinberg et al. 2012).
10 2
101
P/P500
100
10 –1 REXCESS
Average simulations HE
Average simulations
Ratio
10 –2 10 –1 100
Radius (R 500 )
Figure 13
Comparison between observed (blue lines) and simulated (red lines with orange shaded area) pressure profiles
(from Arnaud et al. 2010). Observational data refer to the Representative XMM-Newton Cluster Structure
Survey (REXCESS) sample of nearby clusters (Böhringer et al. 2007) observed with XMM-Newton.
Simulation results are obtained by combining different sets of clusters simulated with both smoothed particle
hydrodynamics and adaptive mesh refinement codes (see Arnaud et al. 2010 for details). The continuous red
line corresponds to the average profile from simulations, after rescaling profiles according to the values of
R500 and M500 predicted by hydrostatic equilibrium, with the orange area showing the corresponding rms
scatter. The red dashed line shows the simulation results when using instead the true M500 value. The lower
panel shows the ratio between average simulation profiles and average observed profiles.
3. Cluster outskirts, r > r500 , where scatter is increasing with radius and scaling with mass
can be expected to be close to self-similar on theoretical grounds, but have not yet been
constrained observationally. In this regime, clusters are dynamically younger, characterized
by recent mergers, departures from equilibrium, and a significant degree of gas clumping.
Significant progress is expected in the near future due to a combination of high-sensitivity
SZ and X-ray observations using the next generation of instruments.
The physical origin of the regular scaling with mass is the fact that cluster mass is the key
control variable of cluster formation, which sets the amount of gas mass, the average temperature
of the ICM, etc. It is important to note that the close to self-similar scaling with mass outside
the cluster core does not imply that the nongravitational physical processes are negligible in this
regime. For instance, Sun et al. (2009) showed that entropy measured at r500 has a scaling with
temperature quite close to the self-similar prediction, yet its level is higher than expected from
a simple model in which only gravity determines the evolution of the intracluster baryons. This
implies that, whatever mechanism one invokes to account for such an entropy excess, it must
operate in such a way as to not violate the self-similar scaling. The scatter around the mean profile
exhibited by clusters at different radii can be due to a number of factors. In particular, the small
scatter at intermediate radii is a nontrivial fact, given that different clusters of the same mass are
in different stages of their dynamical evolution and physical processes affecting their profiles may
have operated differently due to different formation histories.
One of the interesting implications of the small scatter in the pressure profiles is that it provides
an upper limit on the contribution of nonthermal pressure support or, at least, on its cluster-by-
cluster variation. A well-known source of nonthermal pressure is represented by residual gas
motions induced by mergers, galaxy motions, and gas inflow along large-scale filaments. Cosmo-
logical hydrodynamical simulations of cluster formation have been extensively used to quantify the
pressure support contributed by gas motions and the corresponding level of violation of HE (e.g.,
Rasia, Tormen & Moscardini 2004; Nagai, Kravtsov & Vikhlinin 2007; Piffaretti & Valdarnini
2008; Ameglio et al. 2009; Biffi, Dolag & Böhringer 2011; Lau et al. 2009). All these analyses
consistently found that ICM velocity fields contribute a pressure support of about 5–15% of the
thermal one, the exact amount depending on the radial range considered (being larger at larger
radii) and on the dynamical state of the clusters. Currently, there are only indirect indications for
turbulent motions in the ICM of the real clusters from fluctuations of gas density measured in
X-ray observations (e.g., Schuecker et al. 2004, Churazov et al. 2012). Direct measurements or
upper limits on gas velocities and characterization of their statistical properties should be feasible
with future high-resolution spectroscopic and polarimetric instruments on the next-generation
X-ray telescopes (e.g., Inogamov & Sunyaev 2003, Zhuravleva et al. 2010).
The galaxies and groups orbiting or infalling onto clusters not only stir the gas, but also make
the ICM clumpier. The dense inner regions of clusters ram-pressure strip the gas on a fairly short
timescale, so that the clumping is fairly small near cluster cores. However, it is substantial in the
outskirts in cluster simulations where orbital times are longer and accretion of new galaxies and
groups is ongoing. Given that the X-ray emissivity scales as the square of the local gas density,
the clumpiness can bias the measurement of gas density from X-ray surface brightness profiles
toward higher values if it is not accounted for. Because clumping is expected to increase with
increasing cluster-centric radius, the inferred slope of gas density profiles can be underestimated,
thus affecting the resulting pressure profile and hydrostatic mass estimates. Furthermore, gas
clumping also affects X-ray temperature, which is measured by fitting the X-ray spectrum to a
single-temperature plasma model (Mazzotta et al. 2004, Vikhlinin 2006). Clumping can, therefore,
contribute to the scatter of pressure profiles at large radii, especially at r > r500 (e.g., Nagai & Lau
2011).
Indirect detections of gas clumping through X-ray observations out to r200 have been recently
claimed, based on Suzaku observations of a flattening in the X-ray surface brightness profiles at
such large radii (Simionescu et al. 2011). However, these results are prone to significant systematic
uncertainties (Ettori & Molendi 2011). Independent analyses based on the ROSAT data (e.g.,
Eckert et al. 2011) show the surface brightness profiles steepen beyond r500 (see also Vikhlinin,
Forman & Jones 1999, Neumann 2005), which is inconsistent with the degree of gas clumping
inferred from the Suzaku data, but consistent with predictions of hydrodynamical simulations.
Clearly, the clumpiness of the ICM depends on a number of uncertain physical processes, such
as efficient feedback, which removes gas from merging structures, or thermal conduction, which
homogenizes the ICM temperatures (e.g., Dolag et al. 2004). The degree of gas clumping in density
and temperature is, therefore, currently uncertain in both theoretical models and observations.
Future high-sensitivity SZ observations of galaxy clusters with improved angular resolution will
allow a direct measurement of projected pressure profiles. Their comparison with X-ray-derived
profiles will help in understanding the impact of gas clumping on the thermal complexity of the
ICM.
Additional nonthermal pressure support can be provided by the magnetic fields and relativistic
cosmic rays, the presence of which in the ICM is demonstrated by radio observations of the radio
halos: diffuse and faint radio sources filling the central ∼1-Mpc3 region of many galaxy clusters
(e.g., Venturi et al. 2008, Giovannini et al. 2009) arising due to the synchrotron emission of highly
relativistic electrons moving in the ICM magnetic fields. The origin of these relativistic particles
still needs to be understood, although several models have been proposed. Shocks and turbulence
associated with merger events are expected to compress and amplify magnetic fields and accelerate
relativistic electrons (see, e.g., Dolag, Bykov & Diaferio 2008 and Ferrari et al. 2008 for reviews).
Numerical simulations including injection of cosmic rays from accretion shocks and SN explosions
(e.g., Pfrommer et al. 2007, Vazza et al. 2012) indicate that cosmic rays contribute a pressure
support, which can be as high as ∼10% for relaxed clusters and ∼20% for unrelaxed clusters at
the outskirts. At smaller radii, the pressure contribution of cosmic rays in these models becomes
small (3% at r 0.1rvir ), which is consistent with the upper limits from γ -ray observations by
the Fermi Gamma-ray Space Telescope (e.g., Ackermann et al. 2010).
The role of intracluster magnetic fields has been investigated in a number of studies using
cosmological simulations (see Dolag, Bykov & Diaferio 2008 for a review). The general result is
that pressure support from magnetic fields should be limited to 5%, which is consistent with
observational constraints on the magnetic field strength (∼μG) (see, e.g., Vogt & Enßlin 2005,
Govoni et al. 2010) and upper limits on the contribution of magnetic fields to nonthermal pressure
support (e.g., Laganá, de Souza & Keller 2010).
As a summary, the scatter in cluster profiles in the cluster cores is mainly driven by differences in
the physical processes such as cooling and heating by AGN feedback and different merger activity
that different clusters experienced during their evolution. At intermediate radii, the scatter is
small because the ICM is generally in good HE within cluster gravitational potential and because
processes that shaped its thermodynamic properties have not introduced new mass scale so that
self-similar scaling is not broken. In the cluster outskirts, the scatter is expected to be driven by
deviations from HE and other sources of nonthermal pressure support, such as the cosmic rays,
as well as by a rapid increase of ICM clumpiness with increasing radius.
1015
1014
Simulations; z = 0:
Relaxed
Unrelaxed
Observed clusters; z = 0
Figure 14
The Y X − M 500 relation for a set of simulated clusters at z = 0 (circles) and for a sample of relaxed Chandra
clusters from Vikhlinin et al. (2006) (stars with error bars). Filled and open circles refer to simulated clusters,
which are classified as relaxed and unrelaxed, respectively. Core regions inside 0.15r500 are excised in the
measurement of the X-ray temperature entering into the computation of YX for both simulated and real
clusters. True and hydrostatic masses are shown for simulated and observed clusters, respectively. The
dot-dashed line shows the best-fit power-law relation for the simulated clusters with the slope fixed to the
self-similar value of α = 3/5. The dashed line shows the same best-fit power-law relation to simulations, but
with the normalization scaled down by 15%, which takes into account the putative effect of hydrostatic mass
bias due to residual gas motions. From Kravtsov, Vikhlinin & Nagai (2006).
a set of clusters with detailed Chandra observations from a study by Kravtsov, Vikhlinin & Nagai
(2006), where YX was introduced and defined specifically to use the temperature estimated only at
0.15r500 < r < r500 in order to minimize the scatter. The relation of YX with M500 in simulations
has scatter of only ≈8% when both relaxed and unrelaxed clusters are included and evolution
of its normalization with redshift is consistent with expectations of the self-similar model. The
insensitivity of the relation to the dynamical state of clusters is not trivial and is due to the fact that
during mergers clusters move almost exactly along the relation (e.g., Poole et al. 2007, Rasia et al.
2011). In addition, the slope and normalization of the Y X − M 500 relation are also not sensitive to
specific assumptions in modeling cooling and feedback heating processes in simulations (Stanek
et al. 2010, Fabjan et al. 2011), which makes them more robust theoretically.
The Y SZ − M relation also exhibits a comparably low scatter, and the slope and evolution of
normalization are close to the predictions of the self-similar model (da Silva et al. 2004, Motl et al.
2005), which is not surprising given the similarity between the YX and the integrated YSZ measured
from SZ observations. Its normalization changes by up to 30–40% depending on the interplay
between radiative cooling and feedback processes included in the simulations (e.g., Nagai 2006,
Bonaldi et al. 2007, Battaglia et al. 2012, and references therein). At the same time, simulation
analysis is also shedding light on the effect of projection (e.g., Kay et al. 2012) and mergers (e.g.,
Krause et al. 2012) on the scatter in the Y SZ − M scaling.
Mgas – M YX – M
14.6
14.4
log C
14.2
14.0
1.0
α
0.5
NR-SV CSF CSF-M-W CSF-M-AGN NR-RV CSF-C CSF-M-NW
NR-RV CSF-C CSF-M-NW NR-SV CSF CSF-M-W CSF-M-AGN
Figure 15
Sensitivity of different mass proxies on the physical description of the intracluster medium included in
cosmological hydrodynamical simulations for a set of galaxy clusters (from Fabjan et al. 2011). Results for
the scaling relation of M500 with gas mass Mgas and YX are shown in the left and right panels, respectively.
Best fitting normalization C and slope α of the scaling relations of these two mass proxies are shown in the
upper and lower panels, respectively. Here, Tmw is the mass-weighted temperature computed excluding the
central cluster regions within 0.15r500 . In the lower panels, the horizontal dashed lines mark the values of the
slope of the scaling relations predicted by the self-similar model. Results are shown for simulations only
including nonradiative hydrodynamics with standard (NR-SV) and reduced artical viscosity (NR-RV);
cooling and star formation without (CSF) and with thermal conduction (CSF-C); cooling and star formation
with metal enrichment, with (CSF-M-W) and without (CSF-M-NW) galactic winds from SN explosions;
and cooling and star formation with the effect of AGN feedback (CSF-M-AGN) (see Fabjan et al. 2011 for
further details).
clusters that take into account instrumental effects of detectors and projection effects (e.g., Rasia
et al. 2006; Nagai, Vikhlinin & Kravtsov 2007; Biffi et al. 2012).
Ultimately, calibration of mass proxies for precision use should be obtained via independent
observational mass measurements using the weak lensing analysis, HE, or velocity dispersions
of member galaxies. The combination of future large, wide-area X-ray, SZ, and optical/near-IR
surveys should provide significant progress in this direction.
degree of non-Gaussianity exist as well (Bartolo et al. 2004). The current CMB constraint on scale-
independent non-Gaussianity is fNL = 30 ± 20 (at the 68% confidence level; see, e.g., Komatsu
2010), and there is thus still room for the existence of sizable deviations from Gaussianity.
The non-Gaussian fields with fNL < 0 have a PDF of the potential field that is skewed toward
positive values and the abundance of peaks that seed the collapse of halos is reduced compared to
Gaussian initial conditions. Conversely, the PDF of the potential field in models with fNL > 0
has negative skewness, and, hence, an increased number of potential minima (density peaks). This
would result in an enhanced abundance of rare objects, such as massive distant clusters, relative
to the Gaussian case [see, e.g., figure 1 in Dalal et al. (2008) for an illustration of the effect of fNL
on the large-scale structure that forms]. The suppression or enhancement of abundance of halos
increases with increasing peak height.
The mass functions resulting from non-Gaussian initial conditions have been studied both
analytically (e.g., Chiu, Ostriker & Strauss 1998; Matarrese, Verde & Jimenez 2000; Afshordi &
Tolley 2008; Lo Verde et al. 2008) and using cosmological simulations (Grossi et al. 2007, Dalal
et al. 2008, Lo Verde et al. 2008, Lo Verde & Smith 2011, Wagner & Verde 2011). These studies
showed that accurate formulae for the halo abundance from the initial linear density field exist for
the non-Gaussian models as well. The general result is that the fractional change in the abundance
of the rarest peaks is of order unity for the initial fields with | fNL | ∼ 100. The abundance of clusters
is, thus, only mildly sensitive to deviations of Gaussianity within the currently constrained limits
(Scoccimarro, Sefusatti & Zaldarriaga 2004; Sefusatti et al. 2007; Cunha, Huterer & Doré 2010;
Sartoris et al. 2010). In contrast, primordial non-Gaussianity may also leave an imprint in the
spatial distribution of clusters in the form of a scale dependence of large-scale linear bias.
As was discovered by Dalal et al. (2008) and confirmed in subsequent analytical (Afshordi &
Tolley 2008; Matarrese & Verde 2008; McDonald 2008; Slosar et al. 2008; Taruya, Koyama
& Matsubara 2008) and numerical studies (Desjacques, Seljak & Iliev 2009; Grossi et al. 2009;
Pillepich, Porciani & Hahn 2010; Shandera, Dalal & Huterer 2011), the linear bias of collapsed
objects in the models with local non-Gaussianity can be described as a function of wavenumber
k by b NG = b G + fNL × const/k2 , where bG is the linear bias in the corresponding cosmological
model with the Gaussian initial conditions discussed in Section 3.8. This scale dependence arises
because in the non-Gaussian models the large-scale modes that boost the abundance of peaks are
correlated with the peaks themselves, which enhances (or suppresses) the peak amplitudes by a
factor proportional to fNL φ ∝ fNL δ/k2 . Because this effect of modulation increases with increasing
peak height, ν = δc /σ (M , z), the scale dependence of bias increases with increasing halo mass.
This unique signature can be used as a powerful constraint on deviations from Gaussianity (at least
for models with local non-Gaussianity) in large samples of clusters in which the power spectrum
or correlation function can be measured on large scales.
scale dependent; whereas on very large scales gravity behaves similarly to GR gravity, on smaller
scales it is enhanced compared to GR and the rate of structure formation is thereby also enhanced.
The nonlinear halo collapse and growth are also faster in f (R) models, which leads to enhanced
abundance of massive clusters (Schmidt et al. 2009, Ferraro, Schmidt & Hu 2011, Zhao, Li &
Koyama 2011) compared to the predictions of the models with GR gravity and identical cosmo-
logical parameters. Likewise, the peaks collapsing by a given z have lower peak height ν in the
modified gravity models compared to the peak height in the standard gravity model. This results
in the reduced bias of clusters of a given mass compared to the standard model. Furthermore,
the scale dependence of the linear growth also induces a scale dependence of bias, thus offering
another route to detect modifications of gravity (Parfrey, Hui & Sheth 2011). Qualitatively simi-
lar effects on cluster abundance and bias are expected in the braneworld-modified gravity models
based on higher dimensions, such as the Dvali-Gabadadze-Porrati (DGP; Dvali, Gabadadze &
Porrati 2000) gravity model (Schäfer & Koyama 2008; Khoury & Wyman 2009; Schmidt 2009;
Schmidt, Hu & Lima 2010) and its successors with similar large-scale structure phenomenology
consistent with current observational constraints, such as models of ghost-free massive gravity
(D’Amico et al. 2011; de Rham, Gabadadze & Tolley 2011).
A general consequence of modifying gravity is that the Birkhoff theorem no longer holds, which
does not allow a straightforward extension of the spherical collapse model described in Section 3.2
to a generic model of modified gravity. Nevertheless, numerical calculations of spherical collapse
have been presented for a number of specific models (e.g., Schäfer & Koyama 2008; Martino,
Stabenau & Sheth 2009; Schmidt et al. 2009; Schmidt, Hu & Lima 2010). For both the f (R)
and the DGP classes of models, the results of simulations obtained so far suggest that halo mass
function and bias can still be described by the universal functions of peak height, in which the
threshold for collapse and the linear growth rate are modified appropriately from their standard
model values (Schmidt et al. 2009; Schmidt, Hu & Lima 2010). This implies that it should be
possible to calibrate mass function and bias of halos in the modified gravity models with the
accuracy comparable to that in the standard structure formation models.
the diffuse hot baryons. At the same time, processes accompanying galaxy formation also leave
a mark on the ICM properties at larger radii. In these regions, the gas entropy measured from
observations is considerably higher than predicted by simple models that do not include such
processes, and the ICM is also significantly enriched by heavy elements. This highlights that the
ICM properties are the end product of the past interaction between the galaxy evolution processes
and the intergalactic medium. Nevertheless, at intermediate radii, r2,500 r r500 , the scaling of
the radial profiles of gas density, temperature, and pressure with the total mass is close to simple,
self-similar expectations for clusters of sufficiently large mass (corresponding to kT 2–3 keV).
This implies that the baryon processes affecting the ICM during cluster formation do not introduce
a new mass scale. Such regular behavior of the ICM profiles provides a basis for the definition
of integrated quantities, such as the core-excised X-ray luminosity and temperature, gas mass, or
integrated pressure, which are tightly correlated with each other and with the total cluster mass.
The low-scatter scaling relations are used to interpret abundance and spatial distribution of
clusters and derive cosmological constraints (see Allen, Evrard & Mantz 2011 and Weinberg et al.
2012 for recent reviews). Currently, cluster counts measured at high redshifts provide interesting
constraints on cosmological parameters complementary to other methods (e.g., Vikhlinin et al.
2009b, Mantz et al. 2010b, Rozo et al. 2010) and a crucial test of the entire class of CDM and
quintessence models (e.g., Benson et al. 2011; Jee et al. 2011; Mortonson, Hu & Huterer 2011).
100
PS
NC
NC + PS
50
fNL 0
– 50
–100
0.6 0.7 0.8 0.9 1.0
σ8
Figure 16
The potential of future cluster X-ray surveys to constrain deviations from Gaussian density perturbations
(adapted from Sartoris et al. 2010). The figure shows constraints on the power-spectrum normalization, σ 8 ,
and non-Gaussianity parameter, fNL , expected from surveys of galaxy clusters to be carried out with the
next-generation Wide Field X-ray Telescope. The dot-dashed blue curve and dashed green curve show the
68% confidence regions provided by the evolution of power spectrum (PS) of the cluster distribution and
cluster number counts (NC), respectively. The solid red ellipse shows the constraints obtained by combining
number counts and power spectrum information. Cosmic microwave background Planck priors for Gaussian
perturbations have been included in the analysis.
Although the statistical power of large future cluster surveys will put increasingly more stringent
requirements on the theoretical uncertainties associated with cluster scaling relations and mass
function (Cunha & Evrard 2010; Wu, Zentner & Wechsler 2010), future cluster samples can
provide competitive constraints on the non-Gaussianity in the initial density field and deviations
from GR gravity.
A combination of cluster abundance and large-scale clustering measurements can be used to
derive stringent constraints on cosmological parameters and possible deviations from the standard
CDM paradigm. As an example, Figure 16 shows the constraints on the normalization of the
power spectrum and the fNL parameter (from Sartoris et al. 2010) expected for a future high-
sensitivity X-ray cluster survey. It shows that future cluster surveys can achieve a precision of
σ fNL ≈ 5−10 (see also Cunha, Huterer & Doré 2010; Pillepich, Porciani & Reiprich 2012), thus
complementing at smaller scales constraints on non-Gaussianity, which are to be provided on
larger scales by observations of CMB anisotropies from the Planck satellite.
Although a variety of methods will provide constraints on the equation of state of DE and other
cosmological parameters (e.g., Weinberg et al. 2012), clusters will remain one of the most powerful
ways to probe deviations from GR gravity (e.g., Lombriser et al. 2009). Even now, the strongest
constraints on deviations from GR on the Hubble horizon scales are derived from the combination
of the measured redshift evolution of cluster number counts and geometrical probes of cosmic
expansion (Schmidt, Vikhlinin & Hu 2009). Figure 17 illustrates the potential constraints on the
1.1
model
1.0 ed gravity
Modifi
el
ΛCDM mod
0.9
G(z) × (1 + z)
0.8
0.7
0.6
0 1 2
z
Figure 17
The potential of future cluster surveys to constrain deviations from General Relativity (from Vikhlinin et al.
2009c). The linear growth factor of density perturbations, G(z) = D(z) (not normalized to unity at z = 0),
recovered from 2,000 clusters, distributed in 20 redshift bins, each containing 100 massive clusters, identified
in a high-sensitivity X-ray cluster survey. The solid gray line indicates the evolution of the linear growth
factor for a CDM model, whereas the dashed blue curve is the prediction of a modified gravity model (the
braneworld model by Dvali, Gabadadze & Porrati 2000), having the same expansion history of the CDM
model.
linear rate of perturbation growth that can be derived from a future high-sensitivity X-ray cluster
survey using similar analysis. The figure shows that a sample of about 2,000 clusters at z < 2
with well-calibrated mass measurements would allow one to distinguish the standard CDM
model from a braneworld-modified gravity model with the identical expansion history at a high
confidence level.
The construction of such large, homogeneous samples of clusters will be aided in the next
decade by a number of cluster surveys both in the optical/near-IR (e.g., DES, PanSTARRS,
EUCLID) and in X-ray (e.g., eROSITA, WFXT) bands. At the same time, the combination of
higher resolution numerical simulations including more sophisticated treatment of galaxy forma-
tion processes and high-sensitivity multiwavelength observations of clusters should help to unveil
the nature of the physical processes driving the evolution of clusters and provide accurate calibra-
tions of their masses. The cluster studies thus will remain a vibrant and fascinating area of modern
cosmology for years to come.
DISCLOSURE STATEMENT
The authors are not aware of any affiliations, memberships, funding, or financial holdings that
might be perceived as affecting the objectivity of this review.
ACKNOWLEDGMENTS
We are grateful to Brad Benson, Klaus Dolag, Surhud More, Piero Rosati, Elena Rasia, Paolo
Tozzi, Alexey Vikhlinin, and Mark Wyman for useful discussions and comments, and to John
Carlstrom for a careful reading of the manuscript. We thank Dunja Fabjan and Barbara Sartoris
for their help in producing Figures 15 and 16, respectively. The authors wish to thank the Kavli
Institute for Theoretical Physics (KITP) in Santa Barbara for hospitality during the early phase
of preparation of this review and participants of the KITP workshop “Galaxy clusters: crossroads
of astrophysics and cosmology” for many stimulating discussions. A.K. was supported by NSF
grants AST-0507596 and AST-0807444, NASA grant NAG5-13274, and by the Kavli Institute
for Cosmological Physics at the University of Chicago through grants NSF PHY-0551142 and
PHY-1125897. S.B. acknowledges partial support by the European Commissions FP7 Marie Curie
Initial Training Network CosmoComp (PITN-GA-2009-238356), by the PRIN-INAF09 project
“Towards an Italian Network for Computational Cosmology,” by the PRIN-MIUR09 “Tracing
the growth of structures in the Universe,” and by the PD51 INFN grant.
LITERATURE CITED
Abadi MG, Navarro JF, Fardal M, Babul A, Steinmetz M. 2010. MNRAS 407:435–46
Abramo LR, Batista RC, Liberato L, Rosenfeld R. 2007. J. Cosmol. Astropart. Phys. 11:12
Ackermann M, Ajello M, Allafort A, Baldini L, Ballet J, et al. 2010. Ap. J. Lett. 717:L71–78
Afshordi N, Tolley AJ. 2008. Phys. Rev. D 78:123507
Alimi JM, Füzfa A, Boucher V, Rasera Y, Courtin J, Corasaniti PS. 2010. MNRAS 401:775–90
Allen SW, Evrard AE, Mantz AB. 2011. Annu. Rev. Astron. Astrophys. 49:409–70
Allen SW, Rapetti DA, Schmidt RW, Ebeling H, Morris RG, Fabian AC. 2008. MNRAS 383:879–96
Allen SW, Schmidt RW, Ebeling H, Fabian AC, van Speybroeck L. 2004. MNRAS 353:457–67
Allgood B, Flores RA, Primack JR, Kravtsov AV, Wechsler RH, et al. 2006. MNRAS 367:1781–96
Ameglio S, Borgani S, Pierpaoli E, Dolag K, Ettori S, Morandi A. 2009. MNRAS 394:479–90
Anderhalden D, Diemand J. 2011. MNRAS 414:3166–72
Arnaud M, Pratt GW, Piffaretti R, Böhringer H, Croston JH, Pointecouteau E. 2010. Astron. Astrophys.
517:A92
Ascasibar Y, Sevilla R, Yepes G, Müller V, Gottlöber S. 2006. MNRAS 371:193–203
Audit E, Teyssier R, Alimi JM. 1997. Astron. Astrophys. 325:439–49
Babul A, Balogh ML, Lewis GF, Poole GB. 2002. MNRAS 330:329–43
Bahcall NA, Soneira RM. 1983. Ap. J. 270:20–38
Bardeen JM, Bond JR, Kaiser N, Szalay AS. 1986. Ap. J. 304:15–61
Bardeen JM, Steinhardt PJ, Turner MS. 1983. Phys. Rev. D 28:679–93
Barnes J, White SDM. 1984. MNRAS 211:753–65
Bartelmann M. 2010. Class. Quantum Gravity 27:233001
Bartolo N, Komatsu E, Matarrese S, Riotto A. 2004. Phys. Rep. 402:103–266
Battaglia N, Bond JR, Pfrommer C, Sievers JL. 2012. Ap. J. Submitted (arXiv:1109.3709)
Becker MR, Kravtsov AV. 2011. Ap. J. 740:25
Benson BA, de Haan T, Dudley JP, et al. 2011. Ap. J. Submitted (arXiv:1112.5435)
Bertschinger E. 1985. Ap. J. Suppl. 58:39–65
Bertschinger E. 1998. Annu. Rev. Astron. Astrophys. 36:599–654
Bhattacharya S, Habib S, Heitmann K. 2011. Ap. J. Submitted (arXiv:1112.5479)
Bialek JJ, Evrard AE, Mohr JJ. 2001. Ap. J. 555:597–612
Biffi V, Dolag K, Böhringer H. 2011. MNRAS 413:573–84
Biffi V, Dolag K, Böhringer H, Lemson G. 2012. MNRAS. Submitted (arXiv:1112.0314)
Binney J, Tremaine S. 2008. Galactic Dynamics. Princeton, NJ: Princeton Univ. Press. 2nd ed.
Blumenthal GR, Faber SM, Flores R, Primack JR. 1986. Ap. J. 301:27–34
Blumenthal GR, Faber SM, Primack JR, Rees MJ. 1984. Nature 311:517–25
Böhringer H, Belsole E, Kennea J, Matsushita K, Molendi S, et al. 2001. Astron. Astrophys. 365:L181–87
Böhringer H, Dolag K, Chon G. 2012. Submitted (arXiv:1112.5035)
Böhringer H, Schuecker P, Pratt GW, Arnaud M, Ponman TJ, et al. 2007. Astron. Astrophys. 469:363–77
Bonaldi A, Tormen G, Dolag K, Moscardini L. 2007. MNRAS 378:1248–58
Bonamente M, Hasler N, Bulbul E, Carlstrom JE, Culverhouse TL, et al. 2012. Ap. J. Submitted
(arXiv:1112.1599)
Bond JR, Efstathiou G. 1984. Ap. J. Lett. 285:L45–48
Bond JR, Kofman L, Pogosyan D. 1996. Nature 380:603–5
Bond JR, Szalay AS, Turner MS. 1982. Phys. Rev. Lett. 48:1636–39
Borgani S, Fabjan D, Tornatore L, Schindler S, Dolag K, Diaferio A. 2008. Space Sci. Rev. 134:379–403
Borgani S, Finoguenov A, Kay ST, Ponman TJ, Springel V, et al. 2005. MNRAS 361:233–43
Borgani S, Governato F, Wadsley J, Menci N, Tozzi P, et al. 2002. MNRAS 336:409–24
Borgani S, Kravtsov A. 2011. Adv. Sci. Lett. 4:204–27
Borgani S, Murante G, Springel V, Diaferio A, Dolag K, et al. 2004. MNRAS 348:1078–96
Borgani S, Viel M. 2009. MNRAS 392:L26–30
Bower RG, McCarthy IG, Benson AJ. 2008. MNRAS 390:1399–410
Brunetti G. 2011. Mem. Soc. Astron. Ital. 82:515–17
Bryan GL, Norman ML. 1998. Ap. J. 495:80
Bullock JS, Kolatt TS, Sigad Y, Somerville RS, Kravtsov AV, et al. 2001. MNRAS 321:559–75
Buote DA, Gastaldello F, Humphrey PJ, Zappacosta L, Bullock JS, et al. 2007. Ap. J. 664:123–34
Capozziello S, de Laurentis M. 2011. Phys. Rep. 509:167–321
Cardone VF, Piedipalumbo E, Tortora C. 2005. MNRAS 358:1325–36
Carlstrom JE, Holder GP, Reese ED. 2002. Annu. Rev. Astron. Astrophys. 40:643–80
Cavagnolo KW, Donahue M, Voit GM, Sun M. 2009. Ap. J. Suppl. 182:12–32
Cavaliere AG, Gursky H, Tucker WH. 1971. Nature 231:437–38
Chiu WA, Ostriker JP, Strauss MA. 1998. Ap. J. 494:479–90
Churazov E, Vikhlinin A, Zhuravleva I, Schekochihin A, Parrish I, et al. 2012. MNRAS 421(1):1123–35
Clowe D, De Lucia G, King L. 2004. MNRAS 350:1038–48
Cohn JD, White M. 2008. MNRAS 385:2025–33
Cole S, Kaiser N. 1989. MNRAS 237:1127–46
McCarthy IG, Schaye J, Ponman TJ, Bower RG, Booth CM, et al. 2010. MNRAS 406:822–39
McDonald M, Veilleux S, Rupke DSN, Mushotzky R, Reynolds C. 2011. Ap. J. 734:95–106
McDonald P. 2008. Phys. Rev. D 78:123519
McNamara BR, Nulsen PEJ. 2007. Annu. Rev. Astron. Astrophys. 45:117–75
Mead JMG, King LJ, Sijacki D, Leonard A, Puchwein E, McCarthy IG. 2010. MNRAS 406:434–44
Meekins JF, Fritz G, Chubb TA, Friedman H. 1971. Nature 231:107–8
Merritt D, Graham AW, Moore B, Diemand J, Terzić B. 2006. Astron. J. 132:2685–700
Mo H, van den Bosch F, White S. 2010. Galaxy Formation Evolution. Cambridge, UK: Cambridge Univ. Press
Mo HJ, White SDM. 1996. MNRAS 282:347–61
Monaco P. 1995. Ap. J. 447:23–34
Moore B, Quinn T, Governato F, Stadel J, Lake G. 1999. MNRAS 310:1147–52
More S, Kravtsov AV, Dalal N, Gottlöber S. 2011. Ap. J. Suppl. 195:4
Mortonson MJ, Hu W, Huterer D. 2011. Phys. Rev. D 83:023015
Moster BP, Somerville RS, Maulbetsch C, van den Bosch FC, Macciò AV, et al. 2010. Ap. J. 710:903–23
Motl PM, Hallman EJ, Burns JO, Norman ML. 2005. Ap. J. Lett. 623:L63–66
Muanwong O, Thomas PA, Kay ST, Pearce FR. 2002. MNRAS 336:527–40
Naab T, Johansson PH, Ostriker JP. 2009. Ap. J. Lett. 699:L178–82
Nagai D. 2006. Ap. J. 650:538–49
Nagai D, Kravtsov AV, Vikhlinin A. 2007. Ap. J. 668:1–14
Nagai D, Lau ET. 2011. Ap. J. Lett. 731:L10–15
Nagai D, Vikhlinin A, Kravtsov AV. 2007. Ap. J. 655:98–108
Narayan R, Medvedev MV. 2001. Ap. J. Lett. 562:L129–32
Navarro JF, Frenk CS, White SDM. 1995. MNRAS 275:720–40
Navarro JF, Frenk CS, White SDM. 1996. Ap. J. 462:563–75
Navarro JF, Frenk CS, White SDM. 1997. Ap. J. 490:493–508
Navarro JF, Hayashi E, Power C, Jenkins AR, Frenk CS, et al. 2004. MNRAS 349:1039–51
Nelson K, Rudd DH, Shaw L, Nagai D. 2012. Ap. J. 751:121
Neumann DM. 2005. Astron. Astrophys. 439:465–77
Newman AB, Treu T, Ellis RS, Sand DJ. 2011. Ap. J. Lett. 728:L39–43
Newman AB, Treu T, Ellis RS, Sand DJ, Richard J, et al. 2009. Ap. J. 706:1078–94
Norman ML. 2010. Simulating galaxy clusters. Presented at Enrico Fermi Summer School “Astrophysics of
Galaxy Clusters,” Varenna, Italy, July 15–25, 2008 (arXiv:1005.1100)
Oguri M, Bayliss MB, Dahle H, Sharon K, Gladders MD, et al. 2012. MNRAS 420:3213–39
Oguri M, Hennawi JF, Gladders MD, Dahle H, Natarajan P, et al. 2009. Ap. J. 699:1038–52
Oguri M, Takada M, Umetsu K, Broadhurst T. 2005. Ap. J. 632:841–46
Okabe N, Zhang YY, Finoguenov A, Takada M, Smith GP, et al. 2010. Ap. J. 721:875–85
Omma H, Binney J, Bryan G, Slyz A. 2004. MNRAS 348:1105–19
Onuora LI, Thomas PA. 2000. MNRAS 319:614–18
O’Shea BW, Bryan G, Bordner J, Norman ML, Abel T, et al. 2005. In Adaptive Mesh Refinement—Theory
and Applications, ed. T Plewa, T Linde, VG Weirs, pp. 341–49. Lect. Notes Comput. Sci. Eng. 41. Berlin:
Springer
Parfrey K, Hui L, Sheth RK. 2011. Phys. Rev. D 83:063511
Peebles PJE. 1970. Astron. J. 75:13–20
Peebles PJE. 1982. Ap. J. Lett. 263:L1–5
Percival WJ. 2005. Astron. Astrophys. 443:819–30
Peterson JR, Fabian AC. 2006. Phys. Rep. 427:1–39
Peterson JR, Paerels FBS, Kaastra JS, Arnaud M, Reiprich TH, et al. 2001. Astron. Astrophys. 365:L104–9
Pfrommer C, Enßlin TA, Springel V, Jubelgas M, Dolag K. 2007. MNRAS 378:385–408
Pfrommer C, Springel V, Enßlin TA, Jubelgas M. 2006. MNRAS 367:113–31
Piffaretti R, Valdarnini R. 2008. Astron. Astrophys. 491:71–87
Pillepich A, Porciani C, Hahn O. 2010. MNRAS 402:191–206
Pillepich A, Porciani C, Reiprich TH. 2012. MNRAS 422:44–69
Pointecouteau E, Arnaud M, Pratt GW. 2005. Astron. Astrophys. 435:1–7
Poole GB, Babul A, McCarthy IG, Fardal MA, Bildfell CJ, et al. 2007. MNRAS 380:437–54
Prada F, Klypin AA, Cuesta AJ, Betancort-Rijo JE, Primack J. 2012. MNRAS In press (arXiv:1104.5130)
Prada F, Klypin AA, Simonneau E, Betancort-Rijo J, Patiri S, et al. 2006. Ap. J. 645:1001–11
Pratt GW, Arnaud M, Piffaretti R, Böhringer H, Ponman TJ, et al. 2010. Astron. Astrophys. 511:A85
Pratt GW, Böhringer H, Croston JH, Arnaud M, Borgani S, et al. 2007. Astron. Astrophys. 461:71–80
Pratt GW, Croston JH, Arnaud M, Böhringer H. 2009. Astron. Astrophys. 498:361–78
Press WH, Davis M. 1982. Ap. J. 259:449–73
Press WH, Schechter P. 1974. Ap. J. 187:425–38
Puchwein E, Bartelmann M, Dolag K, Meneghetti M. 2005. Astron. Astrophys. 442:405–12
Puchwein E, Sijacki D, Springel V. 2008. Ap. J. Lett. 687:L53–56
Rasia E, Ettori S, Moscardini L, Mazzotta P, Borgani S, et al. 2006. MNRAS 369:2013–24
Rasia E, Mazzotta P, Evrard A, Markevitch M, Dolag K, Meneghetti M. 2011. Ap. J. 729:45
Rasia E, Tormen G, Moscardini L. 2004. MNRAS 351:237–52
Reed DS, Bower R, Frenk CS, Jenkins A, Theuns T. 2007. MNRAS 374:2–15
Renzini A. 2000. In Large Scale Structure in the X-Ray Universe, ed. M Plionis, I Georgantopoulos, p. 103.
Paris: Atlantisciences
Rosati P, Borgani S, Norman C. 2002. Annu. Rev. Astron. Astrophys. 40:539–77
Rozo E, Nagai D, Keeton C, Kravtsov A. 2008. Ap. J. 687:22–38
Rozo E, Wechsler RH, Rykoff ES, Annis JT, Becker MR, et al. 2010. Ap. J. 708:645–60
Rudd DH, Zentner AR, Kravtsov AV. 2008. Ap. J. 672:19–32
Ryden BS, Gunn JE. 1987. Ap. J. 318:15–31
Salopek DS, Bond JR. 1990. Phys. Rev. D 42:3936–62
Sand DJ, Treu T, Ellis RS, Smith GP, Kneib JP. 2008. Ap. J. 674:711–27
Sand DJ, Treu T, Smith GP, Ellis RS. 2004. Ap. J. 604:88–107
Sarazin CL. 1986. Rev. Mod. Phys. 58:1–115
Sartoris B, Borgani S, Fedeli C, Matarrese S, Moscardini L, et al. 2010. MNRAS 407:2339–54
Schäfer BM, Koyama K. 2008. MNRAS 385:411–22
Scherrer RJ, Weinberg DH. 1998. Ap. J. 504:607–11
Schmidt F. 2009. Phys. Rev. D 80:123003
Schmidt F, Hu W, Lima M. 2010. Phys. Rev. D 81:063005
Schmidt F, Lima M, Oyaizu H, Hu W. 2009. Phys. Rev. D 79:083518
Schmidt F, Vikhlinin A, Hu W. 2009. Phys. Rev. D 80:083505
Schmidt RW, Allen SW. 2007. MNRAS 379:209–21
Schuecker P, Finoguenov A, Miniati F, Böhringer H, Briel UG. 2004. Astron. Astrophys. 426:387–97
Scoccimarro R, Sefusatti E, Zaldarriaga M. 2004. Phys. Rev. D 69:103513
Sefusatti E, Vale C, Kadota K, Frieman J. 2007. Ap. J. 658:669–79
Seljak U. 2000. MNRAS 318:203–13
Sereno M, Zitrin A. 2012. MNRAS 419:3280–91
Shandera S, Dalal N, Huterer D. 2011. J. Cosmol. Astropart. Phys. 3:17
Shang C, Crotts A, Haiman Z. 2007. Ap. J. 671:136–45
Sharma P, McCourt M, Quataert E, Parrish IJ. 2012. MNRAS 420(4):3174–94
Sheth RK, Mo HJ, Tormen G. 2001. MNRAS 323:1–12
Sheth RK, Tormen G. 1999. MNRAS 308:119–26
Sheth RK, Tormen G. 2002. MNRAS 329:61–75
Sijacki D, Pfrommer C, Springel V, Enßlin TA. 2008. MNRAS 387:1403–15
Sijacki D, Springel V, De Matteo T, Hernquist L. 2007. MNRAS 380:877–900
Silvestri A, Trodden M. 2009. Rep. Prog. Phys. 72:096901
Simionescu A, Allen SW, Mantz A, Werner N, Takei Y, et al. 2011. Science 331:1576–79
Skillman SW, O’Shea BW, Hallman EJ, Burns JO, Norman ML. 2008. Ap. J. 689:1063–77
Slosar A, Hirata C, Seljak U, Ho S, Padmanabhan N. 2008. J. Cosmol. Astropart. Phys. 8:31
Smith S. 1936. Ap. J. 83:23–30
Sotiriou TP, Faraoni V. 2010. Rev. Mod. Phys. 82:451–97
Springel V. 2005. MNRAS 364:1105–34