1564 Axioms For Centrality
1564 Axioms For Centrality
10: 222–262
Abstract. Given a social network, which of its nodes are more central? This question
has been asked many times in sociology, psychology, and computer science, and a whole
plethora of centrality measures (a.k.a. centrality indices, or rankings) were proposed to
account for the importance of the nodes of a network. In this study, we try to provide a
mathematically sound survey of the most important classic centrality measures known
from the literature and propose an axiomatic approach to establish whether they are
actually doing what they have been designed to do. Our axioms suggest some simple,
basic properties that a centrality measure should exhibit.
Surprisingly, only a new simple measure based on distances, harmonic centrality,
turns out to satisfy all axioms; essentially, harmonic centrality is a correction to Bave-
las’s classic closeness centrality [Bavelas 50] designed to take unreachable nodes into
account in a natural way.
As a sanity check, we examine in turn each measure under the lens of informa-
tion retrieval, leveraging state-of-the-art knowledge in the discipline to measure the
effectiveness of the various indices in locating webpages that are relevant to a query.
Although there are some examples of such comparisons in the literature, here, for the
first time, we also take into consideration centrality measures based on distances, such
as closeness, in an information-retrieval setting. The results closely match the data we
gathered using our axiomatic approach.
Our results suggest that centrality measures based on distances, which in recent years
have been neglected in information retrieval in favor of spectral centrality measures,
do provide high-quality signals; moreover, harmonic centrality pops up as an excellent
general-purpose centrality index for arbitrary directed graphs.
C Taylor & Francis Group, LLC
222 ISSN: 1542-7951 print
Boldi and Vigna: Axioms for Centrality 223
1. Introduction
In recent years, there has been an ever-increasing research activity in the study
of real-world complex networks [Wasserman and Faust 94] (the world-wide web,
the autonomous-systems graph within the Internet, coauthorship graphs, phone-
call graphs, e-mail graphs and biological networks, to cite but a few). These
networks, typically generated directly or indirectly by human activity and inter-
action (and, therefore, hereafter dubbed “social”), appear in a large variety of
contexts and often exhibit a surprisingly similar structure. One of the most im-
portant notions that researchers have been trying to capture in such networks is
“node centrality”: ideally, every node (often representing an individual) has some
degree of influence or importance within the social domain under consideration,
and one expects such importance to surface in the structure of the social net-
work; centrality is a quantitative measure that aims at revealing the importance
of a node.
Among the types of centrality that have been considered in the literature
(see [Borgatti 05] for a good survey), many have to do with distances between
nodes.1 Take, for example, a node in an undirected connected network: if the sum
of distances to all other nodes is large, the node under consideration is peripheral ;
this is the starting point to define Bavelas’s closeness centrality [Bavelas 50],
which is the reciprocal of peripherality (i.e., the reciprocal of the sum of distances
to all other nodes).
The role played by shortest paths is justified by one of the most well-known
features of complex networks, the so-called small-world phenomenon. A small-
world network [Cohen and Havlin 10] is a graph where the average distance
between nodes is logarithmic in the size of the network, whereas the clustering
coefficient is larger (that is, neighborhoods tend to be denser) than in a random
Erdős-Rényi graph with the same size and average distance.2 The fact that social
networks (whether electronically mediated or not) exhibit the small-world prop-
erty has been known at least since Milgram’s famous experiment [Milgram 67]
and is arguably the most popular of all features of complex networks. For in-
stance, the average distance of the Facebook graph was recently established to
be just 4.74 [Backstrom et al. 12].
1 Here and in the following, by “distance” we mean the length of a shortest path between
two nodes.
2 The reader might find this definition a bit vague, and some variants are often spotted in the
literature: this is a general well-known problem, also highlighted recently, for example, in [Li
et al. 05].
224 Internet Mathematics
The purpose of this article is to pave the way for a formal, well-grounded as-
sessment of centrality measures based on some simple guiding principles; we seek
notions of centrality that are at the same time robust (they should be applicable
to arbitrary directed graphs, possibly nonconnected, without modifications) and
understandable (they should have a clear combinatorial interpretation).
With these principles in mind, we shall present and compare the most popular
and well-known centrality measures proposed in the last decades. The comparison
will be based on a set of axioms, each trying to capture a specific trait.
In the last part of the article, as a sanity check, we compare the measures we
discuss in an information-retrieval setting, extracting from the classic GOV2 web
collection, documents satisfying a query and ranking by centrality the subgraph
of retrieved documents.
The results are somehow surprising, and suggest that simple measures based
on distances, and in particular harmonic centrality (which we introduce formally
in this study) can give better results than some of the most sophisticated indices
used in the literature. These unexpected outcomes are the main contribution of
this work, together with the set of axioms we propose, which provide a conceptual
framework for understanding centrality measures in a formal way. We also try
to give an orderly account of centrality in social and network sciences, gathering
scattered results and folklore knowledge in a systematic way.
2. An Historical Account
In this section, we sketch the historical development of centrality, focusing on
ten classical centrality measures that we decided to include in this study: the
overall growth of the field is, of course, much more complex, and the literature
contains a myriad of alternative proposals that will not be discussed here.
Centrality is a fundamental tool in the study of social networks: the first ef-
forts to define formally centrality indices were put forth in the late 1940s by the
Group Networks Laboratory at MIT directed by Alex Bavelas [Bavelas 50], in the
framework of communication patterns and group collaboration [Leavitt 51, Bave-
las and Barrett 51]; those pioneering experiments concluded that centrality was
related to group efficiency in problem-solving, and agreed with the subjects’ per-
ceptions of leadership. In the following decades, various measures of centrality
were employed in a multitude of contexts (to understand political integration in
Indian social life [Cohn and Marriott 58], to examine the consequences of central-
ity in communication paths for urban development [Pitts 65], to analyze their
implications to the efficient design of organizations [Beauchamp 65, Macken-
zie 66], or even to explain the wealth of the Medici family based on their central
Boldi and Vigna: Axioms for Centrality 225
3 The notion can also be generalized to a weighted summation of node contributions mul-
tiplied by some discount functions applied to their distance to a given node [Cohen and Ka-
plan 07].
4 Dominant eigenvectors were rediscovered as a generic way of computing centralities on
graphs by [Bonacich 72].
Boldi and Vigna: Axioms for Centrality 227
In the same span of years, Jon Kleinberg defined another centrality measure
called hyperlink-induced topic search (HITS) [Kleinberg 99]. The idea5 is that
every node of a graph is associated with two importance indices: one (called
“authority score”) measures how reliable (important, authoritative. . . ) a node
is, and another (called “hub score”) measures how good the node is in pointing
to authoritative nodes, with the two scores mutually reinforcing each other. The
result is again the dominant eigenvector of a suitable matrix. Stochastic approach
for link-structure analysis (SALSA) [Lempel and Moran 01] is a more recent and
strictly related score based on the same idea, with the difference that it applies
some normalization to the matrix.
We conclude this brief historical account mentioning that there were, in the
past, some attempts to axiomatize the notion of centrality: we postpone a dis-
cussion on these attempts to Section 4.4.
5 To be precise, Kleinberg’s algorithm works in two phases; in the first phase, one selects
a subgraph of the starting web graph based on the pages that match the given query; in the
second phase, the centrality score is computed on the subgraph. Because in this paper we
are looking at HITS simply as a centrality index, we will simply apply it to the graph under
examination.
228 Internet Mathematics
3.1.1. Indegree. Indegree, the number of incoming arcs d− (x), can be considered
a geometric measure: it is simply the number of nodes at distance one.6 It is
probably the oldest measure of importance ever used, because it is equivalent
to majority voting in elections (where x → y if x voted for y). Indegree has
a number of obvious shortcomings (e.g., it is easy to spam), but it is a good
baseline, and in some cases it turned out to provide better results than more
sophisticated methods (see, e.g., [Craswell et al. 03]).
6 Most centrality measures proposed in the literature were actually described only for undi-
rected, connected graphs. Because the study of web graphs and online social networks has
posed the problem of extending centrality concepts to networks that are directed, and possi-
bly not strongly connected, in the rest of this study we consider measures depending on the
incoming arcs of a node (e.g., incoming paths, left dominant eigenvectors, distances from all
nodes to a fixed node). If necessary, these measures can be called “negative,” opposed to the
“positive” versions obtained by considering outgoing paths, or (equivalently) by transposing
the graph.
Boldi and Vigna: Axioms for Centrality 229
3.1.2. Closeness. Bavelas introduced closeness in the late forties [Bavelas 50]; the
closeness of x is defined by
1
. (3.1)
y d(y, x)
The intuition behind closeness is that nodes that are more central have smaller
distances, and thus a smaller denominator, resulting in a larger centrality. We
remark that for this definition to make sense, the graph must be strongly con-
nected. Lacking that condition, some of the denominators will be ∞, resulting
in a null score for all nodes that cannot coreach the whole graph.
It was probably not Bavelas’s intent to apply the measure to directed graphs,
and even less so to graphs with infinite distances; nonetheless, closeness is some-
times “patched” by simply not including unreachable nodes, that is,
1
,
d(y ,x)< ∞ d(y, x)
and assuming that nodes with an empty coreachable set have centrality 0 by
definition: this is actually the definition we shall use in the rest of the article.
These apparently innocuous adjustments, however, introduce a strong bias to-
ward nodes with a small coreachable set.
3.1.3. Lin’s Index. [Lin 76] tried to repair the definition of closeness for graphs
with infinite distances by weighting closeness using the square of the number of
coreachable nodes; his definition for the centrality of a node x with a nonempty
coreachable set is
{y | d(y, x) < ∞}2
.
d(y ,x)< ∞ d(y, x)
The rationale behind this definition is as follows: first, we consider closeness not
the inverse of a sum of distances, but rather the inverse of the average distance,
which entails a first multiplication by the number of coreachable nodes. This
change normalizes closeness across the graph. Now, however, we want nodes with
a larger coreachable set to be more important, given that the average distance is
the same, so we multiply again by the number of coreachable nodes. Nodes with
an empty coreachable set have centrality 1 by definition.
Lin’s index was (somewhat surprisingly) ignored in the following literature.
Nonetheless, it seems to provide a reasonable solution for the problems caused
by the original definition of closeness.
3.1.4. Harmonic Centrality. As we noticed, the main problem of closeness lies in the
presence of pairs of unreachable nodes. We thus get inspiration from [Marchiori
230 Internet Mathematics
and Latora 00]: faced with the problem of providing a sensible notion of “average
shortest path” for a generic directed network, they propose to replace the average
distance with the harmonic mean of all distances (i.e., the n(n − 1) distances
between every pair of distinct nodes). Indeed, in case a large number of pairs
of nodes are not reachable, the average of finite distances can be misleading:
a graph might have a very low average distance while it is almost completely
disconnected (e.g., a perfect matching has average distance 1/2). The harmonic
mean has the useful property of handling ∞ cleanly (assuming, of course, that
∞−1 = 0). For example, the harmonic mean of distances of a perfect matching
is n − 1: in fact, for every node there is exactly another node at a noninfinite
distance, and its distance is 1; so the sum of the inverse of all distances is n,
making the harmonic average equal to n(n − 1)/n = n − 1.
In general, for each graph-theoretical notion based on arithmetic averaging
or maximization, there is an equivalent notion based on the harmonic mean. If
we consider closeness the reciprocal of a denormalized average of distances, it
is natural to consider also the reciprocal of a denormalized harmonic mean of
distances. We thus define the harmonic centrality of x as7
1 1
= . (3.2)
d(y, x) d(y, x)
y =x d(y ,x)< ∞,y =x
The difference with (3.1) might seem minor, but actually it is a radical change.
Harmonic centrality is strongly correlated to closeness centrality in simple net-
works, but naturally also accounts for nodes y that cannot reach x. Thus, it can
be fruitfully applied to graphs that are not strongly connected.
discuss such issues, because there is a large body of established literature about
the topic. All observations in this section are true for strongly connected graphs;
the modifications for graphs that are not strongly connected can be found in
[Berman and Plemmons 94, Vigna 09].
3.2.1. The Left Dominant Eigenvector. The first and most obvious spectral measure is the
left dominant eigenvector of the plain adjacency matrix. Indeed, the dominant
eigenvector can be thought of as the fixed point of an iterated computation in
which every node starts with the same score and then replaces its score with
the sum of the scores of its predecessors. The vector is then normalized, and the
process is repeated until convergence.
Dominant eigenvectors fail to behave as expected on graphs that are not
strongly connected. Depending on the dominant eigenvalue of the strongly con-
nected components, the dominant eigenvector might or might not be nonzero on
nonterminal components (a detailed characterization can be found in [Berman
and Plemmons 94]).
3.2.2. Seeley’s Index. The dominant eigenvector rationale can be slightly amended
with the observation that the update rule we described can be thought of as if
each node gives away its score to its successors; or even that each node has a
reputation and is giving its reputation to its successors so that they can build
their own.
Once we take this viewpoint, it is clear that it is not very sensible to give away
our own amount of reputation to everybody; it is more reasonable to divide
equally our reputation among our successors. From a linear-algebra viewpoint,
this corresponds to normalizing each row of the adjacency matrix using the 1
norm.
Seeley advocated this approach [Seeley 49] for computing the popularity among
groups of children, given a graph representing whether each child liked another
one. The matrix resulting from the 1 -normalization process is stochastic, so
the score can be interpreted as the stationary state of a Markov chain. In par-
ticular, if the underlying graph is symmetric, then Seeley’s index collapses to
a degree (modulo normalization) because of the well-known characterization of
the stationary distribution of the natural random walk on a symmetric graph.
Also, Seeley’s index does not react very well to the lack of strong connectivity:
the only nodes with a nonzero score are those belonging to terminal components
that are not formed by a single node of outdegree zero.
3.2.3. Katz’s Index. Katz introduced his celebrated index [Katz 53] using a sum-
mation over all paths coming into a node, but weighting each path so that the
232 Internet Mathematics
because of the interplay between the powers of the adjacency matrix and the
number of paths connecting two nodes. For the summation above to be finite,
the attenuation factor β must be smaller than 1/λ, where λ is the dominant
eigenvalue of A.
Katz immediately noted that the index was expressible using linear algebra
operations:
k = 1(1 − βA)−1 .
It took some more time to realize that, due to Brauer’s theorem on the displace-
ment of eigenvalues [Brauer 52], Katz’s index is the left dominant eigenvector of
a perturbed matrix
βλA + (1 − βλ)eT 1, (3.3)
where e is a right dominant eigenvector of A such that 1eT = λ [Vigna 09].
An easy generalization (actually suggested by [Hubbell 65]) replaces the vec-
tor 1 with some preference vector v so that paths are also weighted differently
depending on their starting node.8
The normalized limit of Katz’s index when β → 1/λ is a dominant eigenvector;
if the dominant eigenvector is not unique, the limit depends on the preference
vector v [Vigna 09].
3.2.4. PageRank. PageRank is one of the most discussed and quoted spectral indices
in use today, mainly because of its alleged use in Google’s ranking algorithm.9
By definition, PageRank is the unique vector p satisfying
p = αpĀ + (1 − α)v, (3.4)
where Ā is again the 1 -normalized adjacency matrix of the graph, α ∈ [0 . . 1) is
a damping factor, and v is a preference vector (which must be a distribution).
8 We
∞
β i Ai =
∞ must note that the original definition of Katz’s index is 1A
∞ i i i= 0
1/β i+ 1 i+ 1
i= 0
β A = (1/β) i= 0
β A − 1/β. This additional multiplication by A is some-
what common in the literature, even for PageRank; clearly, it alters the order induced by the
scores only when there is a nonuniform preference vector. Our discussion can be easily adapted
for this version.
9 The reader should be aware, however, that the literature about the actual effectiveness of
PageRank in information retrieval is rather scarce, and comprises mainly negative results such
as those found in [Najork et al. 07a] and [Craswell et al. 03].
Boldi and Vigna: Axioms for Centrality 233
This is the definition appearing in Brin and Page’s paper about Google [Brin and
Page 98]; the authors claim that the PageRank score is a probability distribution
over webpages, that is, it has unit 1 norm; but this is not necessarily true if A has
null rows. The following scientific literature thus proposed several ways to patch
the matrix Ā to make it stochastic, which would guarantee p 1 = 1. A common
solution is to replace every null row with the preference vector v itself, but
other solutions have been proposed (e.g., adding a loop to all nodes of outdegree
zero), which have led to different scores. This issue is definitely not academic;
in typical web snapshots a significant fraction of the nodes have outdegree zero
(the so-called dangling nodes).
It is interesting to note, however, that in the preprint written in collaboration
with Motwani and Winograd and commonly quoted as defining PageRank [Page
et al. 98], Brin and Page themselves propose a different but essentially equivalent
linear recurrence in the style of Hubbell’s index [Hubbell 65] and acknowledge
that Ā can have null rows, in which case the dominant eigenvalue of Ā could be
smaller than one, and the solution might need to be normalized to have unit 1
norm.
(3.4) is of course solvable even without any patching, giving
−1
p = (1 − α)v 1 − αĀ , (3.5)
and finally
∞
p = (1 − α)v αi Āi , (3.6)
i=0
which shows immediately that Katz’s index and PageRank differ only by a con-
stant factor and by the 1 normalization applied to A, similarly to the difference
between the dominant eigenvector and Seeley’s index.
If A has no null rows, or Ā has been patched to be stochastic, PageRank
can be equivalently defined as the stationary distribution (i.e., the left dominant
eigenvector) of the Markov chain with transition matrix
which is of course analogous to (3.3). [Del Corso et al. 06] have shown that
the resulting scores (which have always unit 1 norm) differ from the PageRank
vector defined by (3.4) only by a normalization factor, provided that in (3.7)
null rows have been replaced with v. If A had no null rows, the scores of course
would be identical as (3.5) can be easily derived from (3.7).
Both definitions have been used in the literature: the linear-recurrence def-
inition (3.4) is particularly useful when one needs linear dependence on v
234 Internet Mathematics
[Boldi et al. 05]. The Markov-chain definition (3.7) is nonetheless more com-
mon, albeit it raises the issue of patching null rows.
Analogously to what happens with Katz’s index, if the dominant eigenvalue of
Ā is one, then the limit of PageRank when α goes to 1 is a dominant eigenvector
of Ā, that is, Seeley’s index, and if the dominant eigenvector is not unique, the
limit depends on the preference vector v [Boldi et al. 09].
In the rest of this article, except when explicitly stated, we shall define PageR-
ank as in (3.5), without any patching or normalization, and use a uniform pref-
erence vector.
3.2.5. HITS. Kleinberg introduced his celebrated HITS algorithm [Kleinberg 99]10
using the web metaphor of “mutual reinforcement”: a page is authoritative if it
is pointed to by many good hubs—pages that contain good lists of authoritative
pages— and a hub is good if it points to authoritative pages. This suggests an
iterative process that computes at the same time an authoritativeness score ai
and a “hubbiness” score hi starting with a0 = 1, and then applying the update
rule
hi+1 = ai AT
ai+1 = hi+1 A.
This process converges to the left dominant eigenvector of the matrix AT A, which
gives the final authoritativeness score, and which we label with HITS throughout
the paper.11
Inverting the process, and considering the left dominant eigenvector of the
matrix AAT , gives the final hubbiness score. The two vectors are the left and
right singular vectors associated with the largest singular value in the singular-
value decomposition of A. Note also that hubbiness is the positive version of
authoritativeness.
1 0 A few years before, [Bonacich 91] had introduced the same method for computing the
centrality of individuals and groups related by a rectangular incidence matrix. Kleinberg’s
approach is slightly more general, as it does not assume that the two scores are assigned to
different types of entities.
1 1 As discussed in [Farahat et al. 06], the dominant eigenvector may not be unique; equiva-
lently, the limit of the recursive definition given above may depend on the way the authority
and hub scores are initialized. Here, we consider the result of the iterative process starting with
a0 = 1.
Boldi and Vigna: Axioms for Centrality 235
hi+1 = ai AT
ai+1 = hi+1 Ā.
We remark that this normalization process is analogous to the one that brings
us from the dominant eigenvector to Seeley’s index or from Katz’s index to
PageRank.
Similarly to what happens with Seeley’s index on symmetric graphs, SALSA
does not need such an iterative process to be computed.12 First, one computes
the connected components of the symmetric graph induced by the matrix AT A;
in this graph, x and y are adjacent if x and y have some common predecessor
in the original graph. Then, the SALSA score of a node is the ratio between its
indegree and the sum of the indegrees of nodes in the same component, multiplied
by the ratio between the component size and n. Thus, contrarily to HITS, a single
linear scan of the graph is sufficient to compute SALSA, albeit the computation
of the intersection graph requires time proportional to x d+ (x)2 .
1 2 This property, which appears to be little known, is proved in Proposition 2 of the original
paper [Lempel and Moran 01].
236 Internet Mathematics
3.3.2. Spectral Measures as Path-Based Measures. It is a general observation that all spectral
measures can be interpreted as path-based measures because they depend on
taking the limit of some summations of powers of A, or on the limit of powers
of A, and in both cases we can express these algebraic operations in terms of
suitable paths.
For instance, the left dominant eigenvector of a nonnegative matrix can be
computed with the power method by taking the limit of 1Ak / 1Ak for k → ∞.
Since, however, 1Ak is a vector associating with each node the number of paths
of length k coming into the node, we can see that the dominant eigenvector
expresses the relative growth of the number of paths coming into each node as
their length increases.
Analogously, Seeley’s index can be computed (modulo a normalization factor)
by taking the limit of 1Āk (in this case, the 1 norm cannot grow, so we do
not need to renormalize at each iteration). The vector 1Āk has the following
combinatorial interpretation: it assigns to each x the sums of the weights of the
paths coming into x, where the weight of a path x0 , x1 , . . . , xt is
t−1
1
. (3.8)
i=0
d+ (xi )
When we switch to attenuated versions of the previous indices (that is, Katz’s
index and PageRank), we switch from limits to infinite summations and at the
same time multiply the weight of paths of length t by β t or αt . Indeed, the Katz
index of x was originally defined as the summation over all t of the number of
paths of length t coming into x multiplied by β t , and PageRank is the summation
over all paths coming into x of the weight (3.8) multiplied by αt .
The reader can easily work out similar definitions for HITS and SALSA,
which depend on a suitable definition of alternate “back-and-forth path” (see,
e.g., [Borodin et al. 05]).
1 3 The authors claim to formalize PageRank [Page et al. 98], but they do not consider the
damping factor (equivalently, they are setting α = 1), so they are actually formalizing Seeley’s
venerable index [Seeley 49].
238 Internet Mathematics
by isomorphism, that is, they depend just on the structure of the graph, and not
on particular labeling chosen for each node.
To meet these constraints, we propose to study the reaction of centrality mea-
sures to change of size, to (local) change of density and to arc additions. We
expect that nodes belonging to larger groups, when every other parameter is
fixed, should be more important, and that nodes with a denser neighborhood
(i.e., having more friends), when every other parameter is fixed, should also be
more important. We also expect that adding an arc should increase the impor-
tance of the target.
How can we actually determine if this happens in an exact way, and possibly
in an asymptotic setting? To do so, we need to try something entirely new—
evaluating exactly (i.e., in algebraic closed form) all measures of interest on all
nodes of some representative classes of networks.
A good approach to reduce the amount of computation is to use strongly con-
nected vertex-transitive 14 graphs as basic building blocks: these graphs exhibit
a high degree of symmetry, which should entail a simplification of our computa-
tions. Finally, since we want to compare density, a natural choice is to pick the
densest strongly connected vertex-transitive graph, the clique, and the sparsest
strongly connected, the directed cycle. Choosing two graphs at the extreme of
the density spectrum should better highlight the reaction of centrality measures
to density. Moreover, k-cliques and directed p-cycles exist for every k and p (this
might not happen for more complicated structures, e.g., a cubic graph).
Definition 4.1. (Size Axiom). Consider the graph Sk ,p made by a k-clique and a directed
p-cycle (Table 1). A centrality measure satisfies the size axiom if for every k there
is a Pk such that for all p ≥ Pk in Sk ,p the centrality of a node of the p-cycle
is strictly larger than the centrality of a node of the k-clique, and if for every
p there is a Kp such that for all k ≥ Kp in Sk ,p the centrality of a node of the
k-clique is strictly larger than the centrality of a node of the p-cycle.
Note that our axiom does not specify any constraint when k = p. While study-
ing the behavior of the graph Dk ,p of the previous definition when k = p sheds
some light on the inner behavior of centrality measures, it is essential, in an
axiom asserting the sensitivity to density, that size is not involved.
In our proofs for the density axiom, we actually let k and p be independent
parameters (even if the axiom is stated for k = p) to compute the watershed,
that is, the value of k (expressed as a function of p) at which the centrality of
x becomes larger than the centrality of y (if any). The watershed can give some
insight as to how badly a measure can miss satisfying the density axiom.
1 8 There are actually two notions of radiality, which correspond to our notions of “positive”
and “negative” centralities.
242 Internet Mathematics
density axioms, we compute in closed form the values of all measures, from which
we can derive the desired results, whereas, for the score-monotonicity axiom we
provide direct proofs or counterexamples.
We remark that in all our tables we use the proportionality symbol ∝ to mark
values that have been rescaled by a common factor to make them more readable.
5.1. Size
Table 1 provides scores for the graph Sp,k , from which we can check whether the
size axiom is satisfied. The scores are immediately computable from the basic
Degree k−1 1
Harmonic k−1 Hp −1
1 2
Closeness
k−1 p(p − 1)
k2 2p
Lin
k−1 p−1
(p − 1)(p − 2)
Betweenness 0
2
Dominant ∝ 1 0
Seeley ∝ 1 1
1 1
Katz
1 − (k − 1)β 1−β
PageRank ∝ 1 1
HITS ∝ 1 0
SALSA ∝ 1 1
Table 1. Centrality scores for the graph Sk , p . Hi denotes the ith harmonic num-
ber. The parameter β is Katz’s attenuation factor.
Boldi and Vigna: Axioms for Centrality 243
5.2. Density
Table 2 provides scores for the graph Dp,k . Because the graph is strongly con-
nected, there is no uniqueness issue. Although the computation of geometric and
path-based centrality measures is a tedious but rather straightforward exercise
(it is just a matter of finite summations), spectral indices require some more care.
In the rest of this section, we shall first detail the steps leading to the explicit
formulae of Table 2. Then we will prove that the density axiom holds.
5.2.1. Explicit Formulae for Spectral Indices. We will write the parametric equations ex-
pressing the matrix computation that defines each spectral index and solve them.
As noted before, even if the axiom requires k = p, we prefer to perform the com-
putation with two independent parameters k and p to compute the watershed.
In all cases, we can always use the bounds imposed by symmetry to write
down just a small number of variables: c for the centrality of an element of the
clique, for the clique bridge (“left”), r for the cycle bridge (“right”), and some
function t(d) of the distance from the cycle bridge for the nodes of the cycle
(with 0 ≤ d < p), with the condition t(0) = r.
• The left dominant eigenvector. In this case, the equations are given by
the standard eigenvalue problem of the adjacency matrix:
λ = r + (k − 1)c
λc = + (k − 2)c
r
λr = + p−1 ,
λ
subject to the condition that we choose λ real and positive with maximum abso-
lute value. Note that in the case of the last equation we “unrolled” the equations
about the elements of the cycle, λt(d + 1) = t(d). Solving the system and choos-
ing c = 1/(λ − k + 1) gives the solutions found in Table 2.
Since for nonnegative matrices the dominant eigenvalue is monotone in the
matrix entries, λ ≥ k − 1, because the k-clique has dominant eigenvalue equal
to k − 1. Otherwise, λ ≤ k by the row-sum bound. As the eigenvalue equations
have no solution for λ = k − 1, we conclude that k − 1 < λ ≤ k.
244
Centrality Clique Clique Bridge Cycle Bridge Cycle (d > 0 from the Bridge) Watershed
Degree k −1 k 2 1 —
k −1 1 k −1
Harm onic k − 2 + Hp + 1 k − 1 + Hp 1+ + H p −1 + + H p −1 —
2 d+1 d+2
1 1 1 1
Closeness k ≤p
k − 1 + 2p + p(p − 1)/2 k − 1 + p + p(p − 1)/2 2k − 1 + p(p − 1)/2 k (d + 2) − 1 + p(p − 1)/2
(p − 1)(p − 2) (p − 1)(p − 2) p2 + p + 2
Betweenness 0 2p(k − 1) 2k (p − 1) + 2k (p − 2) + k≤
2 2 4
1 1 1+λ
Dom inant 1+ 1+λ —
λ −k + 1 λ −k + 1 λd
Seeley ∝ k −1 k 2 1 —
1 + β 1 β 1 βd+ 1
Katz ∝ + + —
1 − β (k − 2) 1 −β 1 − βp 1 −β 1 − βp
(k − 1)(k − α k + α ) α − k α − k
PageRank ∝ 2+2 1 + αd —
k (k − 1 − α (k − 2)) k (2 − α p ) k (2 − α p )
Table 2. Centrality scores for the graph Dk , p . The parameter β is Katz’s attenuation factor, α is PageRank’s damping
Internet Mathematics
factor, λ is the dominant eigenvalue of the adjacency matrix A and μ is the dominant eigenvalue of the matrix AT A. Lin’s
centrality is omitted because it is proportional to closeness (the graph being strongly connected).
Boldi and Vigna: Axioms for Centrality 245
• Katz’s Index. In this case, the equations can be obtained by the standard
technique of “taking one summand out,” that is, writing
∞
∞
∞
k=1 i i
β A =1+1 i i
β A =1+ 1 β i Ai βA = 1 + kβA.
i=0 i=1 i=0
= 1 + βr + β(k − 1)c
c = 1 + β + β(k − 2)c
1 − β p−1
r = 1 + β + β + β p−1 r ,
1−β
where again we “unrolled” the equations about the elements of the cycle, as we
would have just t(d + 1) = 1 + βt(d), so
1 − βd
t(d) = + β d r.
1−β
The explicit values of the solutions are quite ugly, so we present them in Table 2
as a function of the centrality of the clique bridge .
• PageRank. To simplify the computation, we use 1, rather than 1/(k + p), as
preference vector (by linearity, the result obtained just needs to be rescaled). We
use the same technique employed in the computation of Katz’s index, leading to
1
= 1 − α + αr + αc
2
α k−2
c=1−α+ +α c
k k−1
α 1
r = 1 − α + + α 1 − αp−1 + αp−1 r ,
k 2
noting once again that unrolling the equation of the cycle t(1) = 1 − α + αr/2
and t(d + 1) = 1 − α + αt(d) for d > 1 we get
1
t(d) = 1 − αd + αd r.
2
The explicit values for PageRank are even uglier than those of Katz’s index, so
again we present them in Table 2 as a function of the centrality of the clique
bridge .
• Seeley’s Index. This is a freebie; we can just compute PageRank’s limit when
α → 1.
246 Internet Mathematics
• HITS. In this case, we write down the eigenvalue problem for AT A. Writing
t for t(1), we have
μc = (k − 1)c + (k − 2)2 c + (k − 2) + r
μ = k + (k − 1)(k − 2)c + t
μr = 2r + (k − 1)c
μt = t + .
By normalizing the result so that c = μ2 − μ(k + 1) + k − 1, we obtain the com-
plex but somewhat readable values shown in Table 2. Note that p has no role in
the solution, because AT A can be decomposed into two independent blocks, one
of which is an identity matrix corresponding to all elements of the cycle except
for the first two.
• SALSA. It is easy to check that the components of the intersection graph
of predecessors are given by the clique together with the cycle bridge and its
successor and then by one component for each node of the cycle. The computation
of the scores is then trivial using the noniterative method.
5.2.2. Proofs. Armed with our explicit formulation of spectral scores, we have now
to prove whether the density axiom holds, that is, whether > r when k = p.
Note that in Table 2 we report no watershed for all spectral centrality measures,
which means even more: > r even when k = p, provided that k, p ≥ 3. The
proofs in this section cover this stronger statement.
Proof. As we have seen, we can normalize the solution to the HITS equations so
that
= (k − 1)(k − 2)(μ − 1)
r = μ3 − (k 2 − 2k + 4)μ2 + (3k 2 − 7k + 6)μ − (k − 1)2 .
Moreover, the characteristic polynomial can be computed explicitly from the set
of equations and by observing that the vectors χk +i and χ0 − χi , 0 < i < p, are
linearly independent eigenvectors for the eigenvalue 1:
p(μ) = μ4 − (k 2 − 2k + 6)μ3 + (5k 2 − 12k + 15)μ2
− (6k 2 − 16k + 14)μ + k 2 − 2k + 1 (μ − 1)k +p−4 .
The largest eigenvalue μ0 satisfies the inequality (k − 1)2 ≤ μ0 ≤ k 2 − 2k + 5/4
for every k ≥ 9 as shown following (the statement of the theorem can be verified
in the remaining cases by explicit computation; it does not depend on p). Using
Boldi and Vigna: Axioms for Centrality 247
Proof. From the proof of Theorem 5.1. we know that λ2 ≤ k2 − 2k + 5/4 for every
√
k ≥ 9, because μ is the spectral norm of A and thus dominates its spectral
radius λ, that is, λ2 ≤ μ. We conclude that
1 −λ2 + (k − 1)λ + 1
−r =1+ − (1 + λ) =
λ−k+1 λ−k+1
−(k 2 − 2k + 5/4) + (k − 1)2 + 1 3
> = > 0.
λ−k+1 4(λ − k + 1)
Theorem 5.3. Katz’s index satisfies the density axiom when β ∈ (0 . . 1/λ).
Theorem 5.4. PageRank with constant preference vector satisfies the density axiom
when α ∈ (0 . . 1).
Proof. The proof is similar to that of Theorem 5.3. Recall that the equations for
PageRank are
1
= 1 − α + αr + αc
2
α α(k − 2)
c=1−α+ + c
k k−1
α 1
r = 1 − α + + α 1 − αp−1 + αp−1 r .
k 2
First, we remark that as α → 1 PageRank tends to Seeley’s index, so > r for α
close enough to 1. By continuity, we just need to show that = r never happens
in our range of parameters. If we solve the equations above for c, , and r and
Boldi and Vigna: Axioms for Centrality 249
impose = r, we obtain
2α2 − (k 2 − 4k + 6)α + k 2 − 3k + 2
ln − 2
(k − 3k + 2)α2 − (2k 2 − 3k)α + k 2 − k
p=1+ .
ln α
Now observe that 2α2 − (k 2 − 4k + 6)α + k 2 − 3k + 2 ≥ 0 for k ≥ 3. Thus, a so-
lution for p exists only when the denominator is negative. However, in that region
2α2 − (k 2 − 4k + 6)α + k 2 − 3k + 2
− ≥ 1.
(k 2 − 3k + 2)α2 − (2k 2 − 3k)α + k 2 − k
This implies that under the same conditions p ≤ 1, which concludes the proof.
2
y x
k
Figure 1. A counterexample showing that Lin’s index fails to satisfy the score-
monotonicity axiom.
250 Internet Mathematics
x z
Katz. The score of y after adding x → y can only increase because the set of
paths coming into y now contains new elements.19 If the constant vector 1 is
replaced by a preference vector v in the definition, it is necessary that x have a
nonzero score before the addition in order for score monotonicity to hold.
Dominant eigenvector, Seeley’s index, HITS. If we consider a clique and
two isolated nodes x, y, the score given by the dominant eigenvector, Seeley’s
index and HITS to x and y is zero, and it remains unchanged when we add the
arc x → y.
SALSA. Consider the graph in Figure 2: the indegree of y is 1, and its component
in the intersection graph of predecessors is trivial, so its SALSA centrality is
(1/1) · (1/6) = 1/6. After adding an arc x → y, the indegree of y becomes 2, but
now its component is { y, z }; so the sum of indegrees within the component is
2 + 3 = 5, hence the centrality of y becomes (2/5) · (2/6) = 2/15 < 1/6.
PageRank. Score monotonicity of PageRank was proved by Chien, and col-
leagues [Chien et al. 04]. Their proof works for a generic regular Markov chain:
in the case of PageRank this condition is true, for instance, if the preference
vector is strictly positive or if the graph is strongly connected. Score monotonic-
ity under the same hypotheses is also a consequence of the analysis made by
Avrachenkov and Litvak [Avrachenkov and Litvak 06] of the behavior of PageR-
ank when multiple new links are added.
Their result can be extended to a much more general setting. Suppose that
we are adding the arc x → y, with the proviso that the PageRank of x before
adding the arc was strictly positive. We will show that, under this condition,
the score of y will increase for arbitrary graphs and preference vectors. The same
argument shows also that if the score of x is zero, the
−1score of y does not change.
For this proof, we define PageRank as v 1 − αĀ (i.e., without the normal-
izing factor 1 − α), so to simplify our calculations. By linearity, the result for the
standard definition follows immediately.
1 9 It should be noted, however, that this is true only for the values of the parameter β that
still make sense after the addition.
Boldi and Vigna: Axioms for Centrality 251
Consider two nodes x and y of a graph G such that there is no arc from x to
y, and let d be the outdegree of x. Given the normalized matrix Ā of G, and the
normalized matrix Ā of the graph G obtained by adding to G the arc x → y,
we have
Ā − Ā = χTx δ,
where δ is the difference between the rows corresponding to x in Ā and Ā , which
contains 1/d(d + 1) in the positions corresponding to the successors of x in G,
and −1/(d + 1) in the position corresponding to y (note that if d = 0, we have
just the latter entry).
We now use the Sherman–Morrison formula to write down the inverse of 1 −
αĀ as a function of 1 − αĀ. More precisely,
−1 −1 −1
1 − αĀ = 1 − α Ā − χTx δ = 1 − αĀ + αχTx δ
−1 −1
−1 1 − αĀ αχTx δ 1 − αĀ
= 1 − αĀ − −1 .
1 + αδ 1 − αĀ χTx
We now multiply by the preference vector v, obtaining the explicit PageRank
correction:
−1 −1
−1 −1 1 − αĀ αχTx δ 1 − αĀ
v 1 − αĀ = v 1 − αĀ −v −1
1 + αδ 1 − αĀ χTx
−1 −1
αrχTx δ 1 − αĀ αrx δ 1 − αĀ
=r− −1 =r− −1 .
1 + αδ 1 − αĀ χTx 1 + αδ 1 − αĀ χTx
−1
Now remember that rx > 0, and note that 1 − αĀ χTx is the vector of positive
contributions to the PageRank of x, modulo the normalization factor 1 − α. As
such, it is made of positive values adding up to, at most, 1/(1 − α). When the
vector is multiplied by δ, in the worst case (d = 0), we obtain 1/(1 − α), so
given the conditions on α, it is easy to see that the denominator is positive. This
implies that we can gather all constants in a single positive constant c and just
write
−1 −1
v 1 − αĀ = v − cδ 1 − αĀ .
The above equation rewrites the rank one correction due to the addition of the
arc x → y as a formal correction of the preference vector. We are interested in
the difference
−1 −1 −1
v − cδ 1 − αĀ − v 1 − αĀ = −cδ 1 − αĀ ,
because we can conclude our proof by just showing that its yth coordinate is
strictly positive.
252 Internet Mathematics
We now note that being 1 − αĀ strictly diagonally dominant, the (nonnega-
−1
tive) inverse B = 1 − αĀ has the property that the entries bii on the diagonal
are strictly larger than off-diagonal entries bk i on the same column [McDonald
et al. 95, Remark 3.3], and in particular they are nonzero. Thus, if d = 0
−1 c
−cδ 1 − αĀ y
= by y > 0,
d+1
and if d = 0
−1 c c c
−cδ 1 − αĀ y
= by y − bz y > by y
d+1 x→z
d(d + 1) d + 1
c
− by y = 0.
x→z
d(d + 1)
6. Roundup
All our results are summarized in Table 3, where we distilled them into sim-
ple yes/no answers to the question: does a given centrality measure satisfy the
axioms?
It was surprising for us to discover that only harmonic centrality satisfies all ax-
ioms.20 All spectral centrality measures are sensitive to density. Row-normalized
spectral centrality measures (Seeley’s index, PageRank and SALSA) are insen-
sitive to size, whereas the remaining ones are sensitive only to the increase of k
(or p in the case of betweenness). All nonattenuated spectral measures are also
2 0 It is interesting to note that it is actually the only centrality satisfying the size axiom—in
fact, one needs a cycle of ≈ ek nodes to beat a k-clique.
Boldi and Vigna: Axioms for Centrality 253
Table 3. For each centrality and each axiom, we report whether it is satisfied.
non-monotone. Both Lin’s index and closeness centrality fail density tests.21
Closeness has, indeed, the worst possible behavior, failing to satisfy all our ax-
ioms. Although this result might seem counterintuitive, it is actually a conse-
quence of the known tendency of very far nodes to dominate the score, hid-
ing the contribution of closer nodes whose presence is more correlated to local
density.
All centralities satisfying the density axiom have no watershed: the axiom is
satisfied for all p, k ≥ 3. The watershed for closeness (and Lin’s index) is k ≤ p,
meaning that they just miss it, whereas the watershed for betweenness is a
quite pathological condition (k ≤ (p2 + p + 2)/4): one needs a clique whose size
is quadratic in the size of the cycle before the node of the clique on the bridge
becomes more important than the one on the cycle (compare this with closeness,
where k = p + 1 is sufficient).
We remark that our results on geometric indices do not change if we replace
the directed cycle with a symmetric (i.e., undirected) cycle, with the additional
condition that k > 3. It is possible that the same is true also of spectral central-
ities, but the geometry of the paths of the undirected cycle makes it extremely
difficult to carry on the analogous computations in that case.
2 2 It is actually now possible to approximate them efficiently [Boldi and Vigna 13].
Boldi and Vigna: Axioms for Centrality 255
Table 4. The names and definition of the four naive centrality measures used in
Table 5. Each centrality is obtained by multiplying the values described by its
row and column labels.
a null score. How is it possible that PageRank and SALSA work so well (albeit
less than harmonic centrality on the whole graph) with so little information?
Our suspect was that these measures were actually picking up some much
more elementary signal than their definition could make one think. In a highly
disconnected graph, the values assigned by such measures depend mainly on the
indegree and on some additional ranking provided by coreachable (or weakly
reachable) nodes.
We thus devised four “naive” centrality measures around our axioms. These
measures depend on density, and on size. We use two kinds of scores based on
density: the indegree and the negative β-measure [van den Brink and Gilles 94],23
that is,
1
.
y →x
d+ (y)
2 3 Note that the β-measure originally defined by van den Brink and Gilles in [van den Brink
and Gilles 94] is the positive version, that is, the negative β-measure can be obtained by
applying the β-measure defined in [van den Brink and Gilles 94] to the transposed graph.
Boldi and Vigna: Axioms for Centrality 257
actually pays off. We consider this fact as a further confirmation that the traits
of centrality represented by our axioms are important.
Acknowledgments. We thank David Gleich for useful pointers leading to the proof of the
score-monotonicity of PageRank in the general case, and Edith Cohen for useful discus-
sions on the behavior of centrality indices. Marco Rosa participated in the first phases
of the development of this paper.
Funding. The authors have been supported by the EU-FET grant NADINE (GA 288956).
References
[Anthonisse 71] Jac M. Anthonisse. “The Rush in a Graph.” Technical Report, Ams-
terdam: University of Amsterdam Mathematical Centre, 1971.
[Altman and Tennenholtz 05] Alon Altman and Moshe Tennenholtz. “Ranking Sys-
tems: The PageRank Axioms.” In Proceedings of the 6th ACM Conference on Elec-
tronic Commerce, pp. 1–8. New York, NY: ACM, 2005.
[Avrachenkov and Litvak 06] Konstantin Avrachenkov and Nelly Litvak. “The Effect
of New Links on Google PageRank.” Stochastic Models 22:2 (2006), 319–331.
[Backstrom et al. 12] Lars Backstrom, Paolo Boldi, Marco Rosa, Johan Ugander, and
Sebastiano Vigna. “Four Degrees of separation.” In ACM Web Science 2012: Con-
ference Proceedings, pp. 45–54. ACM Press, 2012.
[Bavelas 50] Alex Bavelas. “Communication Patterns in Task-Oriented Groups.” Jour-
nal of Acoustical Socciety of America 22:6 (1950), 725–730.
[Bavelas and Barrett 51] A. Bavelas, D. Barrett, and American Management Associ-
ation. An Experimental Approach to Organizational Communication. Publications
Boldi and Vigna: Axioms for Centrality 259
[Chien et al. 04] Steve Chien, Cynthia Dwork, Ravi Kumar, Daniel R. Simon, and
D. Sivakumar. “Link Evolution: Analysis and Algorithms.” Internet Math. 1:3 (2004),
277–304.
[Cohen and Kaplan 07] Edith Cohen and Haim Kaplan. “Spatially-Decaying Aggre-
gation over a Network.” Journal of Computer and System Sciences 73:3 (2007),
265–288.
[Cohen and Havlin 10] Reuven Cohen and Shlomo Havlin. Complex Networks: Struc-
ture, Robustness and Function. New York, NY: Cambridge University Press, 2010.
[Cohn and Marriott 58] B.S. Cohn and M. Marriott. “Networks and Centres of Inte-
gration in Indian Civilization.” Journal of Social Research 1:1 (1958), 1–9.
[Craswell et al. 03] Nick Craswell, David Hawking, and Trystan Upstill. “Predicting
Fame and Fortune: PageRank or indegree?” In Proceedings of the Australasian Doc-
ument Computing Symposium, ADCS2003, pp. 31–40, 2003.
[Del Corso et al. 06] Gianna Del Corso, Antonio Gullı̀, and Francesco Romani. “Fast
PageRank Computation via a Sparse Linear System.” Internet Math. 2:3 (2006),
251–273.
[Farahat et al. 06] Ayman Farahat, Thomas Lofaro, Joel C. Miller, Gregory Rae, and
Lesley A. Ward. “Authority Rankings from HITS, PageRank, and SALSA: Existence,
Uniqueness, and Effect of Initialization.” SIAM Journal on Scientific Computing 27:4
(2006), 1181–1201.
[Freeman 77] Linton C. Freeman. “A Set of Measures of Centrality Based on Between-
ness.” Sociometry 40:1 (1977), 35–41.
[Freeman 79] L. Freeman. “Centrality in Social Networks: Conceptual Clarification.”
Social Networks 1:3 (1979), 215–239.
[Friedkin 91] N.E. Friedkin. “Theoretical Foundations for Centrality Measures.” The
American Journal of Sociology 96:6 (1991), 1478–1504.
[Hubbell 65] Charles H. Hubbell. “An Input-Output Approach to Clique Identifica-
tion.” Sociometry 28:4 (1965), 377–399.
[Järvelin and Kekäläinen 02] Kalervo Järvelin and Jaana Kekäläinen. “Cumulated
Gain-Based Evaluation of IR Techniques.” ACM Trans. Inf. Syst. 20:4 (2002),
422–446.
[Katz 53] Leo Katz. “A New Status Index Derived from Sociometric Analysis.” Psy-
chometrika 18:1 (1953), 39–43.
[Kendall 55] Maurice G. Kendall. “Further Contributions to the Theory of Paired Com-
parisons.” Biometrics 11:1 (1955), 43–62.
[Kleinberg 99] Jon M. Kleinberg. “Authoritative Sources in a Hyperlinked Environ-
ment.” Journal of the ACM 46: (1999), 604–632.
[Knuth 92] Donald E. Knuth. “Two Notes on Notation.” American Mathematical
Monthly 99:5 (1992), 403–422.
[Leavitt 51] H. J. Leavitt. “Some Effects of Certain Communication Patterns on Group
Performance.” J. Abnorm. Psychol. 46:1 (1951), 38–50.
Boldi and Vigna: Axioms for Centrality 261
[Lempel and Moran 01] Ronny Lempel and Shlomo Moran. “SALSA: The Stochastic
Approach for Link-Structure Analysis.” ACM Trans. Inf. Syst. 19:2 (2001), 131–160.
[Lempel and Moran 05] Ronny Lempel and Shlomo Moran. “Rank-Stability and Rank-
Similarity of Link-Based Web Ranking Algorithms in Authority-Connected Graphs.”
Information Retrieval 8:2 (2005), 245–264.
[Li et al. 05] Lun Li, David L. Alderson, John Doyle, and Walter Willinger. “Towards
a Theory of Scale-Free Graphs: Definition, Properties, and Implications.” Internet
Math. 2:4 (2005), 431–523.
[Lin 76] Nan Lin. Foundations of Social Research. McGraw-Hill, New York, 1976.
[Mackenzie 66] Kenneth Mackenzie. “Structural Centrality in Communications Net-
works.” Psychometrika 31:1 (1966), 17–25.
[Marchiori and Latora 00] Massimo Marchiori and Vito Latora. “Harmony in the
Small-World.” Physica A: Statistical Mechanics and its Applications 285:3–4 (2000),
539–546.
[McDonald et al. 95] J.J. McDonald, M. Neumann, H. Schneider, and M.J. Tsat-
someros. “Inverse m-Matrix Inequalities and Generalized Ultrametric Matrices.”
Linear Algebra and its Applications 220 (1995), 321–341.
[Milgram 67] Stanley Milgram. “The Small World Problem.” Psychology Today 2:1
(1967), 60–67.
[Najork et al. 09] Marc Najork, Sreenivas Gollapudi, and Rina Panigrahy. “Less is
More: Sampling the Neighborhood Graph Makes SALSA Better and Faster.” In
Proceedings of the Second ACM International Conference on Web Search and Data
Mining, pp. 242–251. New York, NY: ACM, 2009.
[Najork et al. 07a] Marc Najork, Hugo Zaragoza, and Michael J. Taylor. “HITS on the
Web: How Does it Compare?” In Wessel Kraaij, Arjen P. de Vries, Charles L. A.
Clarke, Norbert Fuhr, and Noriko Kando, pp. 471–478. SIGIR 2007: Proceedings of
the 30th Annual International ACM SIGIR Conference on Research and Develop-
ment in Information Retrieval. Amsterdam, The Netherlands: New York, NY: ACM,
2007.
[Najork et al. 07b] Marc A. Najork, Hugo Zaragoza, and Michael J. Taylor. HITS on
the web: How does it compare? In Proceedings of the 30th annual international ACM
SIGIR conference on Research and development in information retrieval, SIGIR ’07,
pages 471–478. ACM, 2007.
[Nieminen 73] U.J. Nieminen. “On the Centrality in a Directed Graph.” Social Science
Research 2:4 (1973), 371–378.
[Padgett and Ansell 93] John F. Padgett and Christopher K. Ansell. “Robust Action
and the Rise of the Medici, 1400–1434”. The American Journal of Sociology 98:6
(1993), 1259–1319.
[Page et al. 98] Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd.
“The PageRank Citation Ranking: Bringing Order to the Web.” Technical Report
SIDL-WP-1999-0120, Stanford Digital Library Technologies Project, Stanford Uni-
versity, 1998.
262 Internet Mathematics
[Pan and Saramäki 11] Raj Kumar Pan and Jari Saramäki. “Path Lengths, Correla-
tions, and Centrality in Temporal Networks.” Phys. Rev. E 84:1(2011),016105.
[Pitts 65] Forrest R. Pitts. “A Graph Theoretic Approach to Historical Geography.”
The Professional Geographer 17:5 (1965), 15–20.
[Sabidussi 66] G. Sabidussi. “The Centrality Index of a Graph.” Psychometrika 31:4
(1966), 581–603.
[Seeley 49] John R. Seeley. “The Net of Reciprocal Influence: A Problem in Treating
Sociometric Data.” Canadian Journal of Psychology 3:4 (1949), 234–240.
[Shannon 148] Claude E. Shannon. “A Mathematical Theory of Communication.” Bell
Syst. Tech. J. 27:3 (1948), 379–423, 623–656.
[Stephenson and Zelen 89] Karen Stephenson and Marvin Zelen. “Rethinking Central-
ity: Methods and Examples.” Social Networks 11:1 (1989), 1–37.
[Upstill et al. 03] Trystan Upstill, Nick Craswell, and David Hawking. “Query-
Independent Evidence in Home Page Finding.” ACM Trans. Inf. Syst. 21:3 (2003),
286–313.
[van den Brink and Gilles 94] René van den Brink and Robert P. Gilles. “A Social
Power Index for Hierarchically Structured Populations of Economic Agents.” In Im-
perfections and Behavior in Economic Organizations, vol. 11 edited by Robert P.
Gilles and Pieter H. M. Ruys, pp. 279–318. Netherlands: Springer, 1994.
[Vigna 09] Sebastiano Vigna. “Spectral Ranking.” Preprint, 2009. (http://arxiv.org/
abs/0912.0238).
[Wasserman and Faust 94] Stanley Wasserman and Katherine Faust. Social Network
Analysis: Methods and Applications. New York, Cambridge: Cambridge Univ Press,
1994.
[Wei 52] T.-H. Wei. The Algebraic Foundations of Ranking Theory. PhD thesis, Uni-
versity of Cambridge, 1952.