(Norman - Ehrentreich) Agent-Based Modeling The Santa Fe Institute Artificial Stock Market Model Revisited
(Norman - Ehrentreich) Agent-Based Modeling The Santa Fe Institute Artificial Stock Market Model Revisited
Managing Editors:
Prof. Dr. G. Fandel
Fachbereich Wirtschaftswissenschaften
Fernuniversität Hagen
Feithstr. 140/AVZ II, 58084 Hagen, Germany
Prof. Dr. W. Trockel
Institut für Mathematische Wirtschaftsforschung (IMW)
Universität Bielefeld
Universitätsstr. 25, 33615 Bielefeld, Germany
Editorial Board:
A. Basile, A. Drexl, H. Dawid, K. Inderfurth, W. Kürsten
Norman Ehrentreich
Agent-Based
Modeling
The Santa Fe Institute
Artificial Stock Market
Model Revisited
123
Dr. Norman Ehrentreich
RiverSource Investments, LLC
262 Ameriprise Financial Center
Minneapolis, MN 55474
USA
[email protected]
DOI 10.1007/978-3-540-73878-7
This work is subject to copyright. All rights are reserved, whether the whole or part of the material
is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation,
broadcasting, reproduction on microfilm or in any other way, and storage in data banks. Duplication
of this publication or parts thereof is permitted only under the provisions of the German Copyright
Law of September 9, 1965, in its current version, and permission for use must always be obtained
from Springer. Violations are liable to prosecution under the German Copyright Law.
The use of general descriptive names, registered names, trademarks, etc. in this publication does
not imply, even in the absence of a specific statement, that such names are exempt from the relevant
protective laws and regulations and therefore free for general use.
987654321
springer.com
Meinen Eltern gewidmet.
Foreword
When the original Santa Fe Institute (SFI) artificial stock market was
created in the early 1990’s, the creators realized that it contained many
interesting new technologies that had never been tested in economic
modeling. The authors kept to a very specific finance message in their
papers, but the hope was that others would pick up where these papers
left off and put these important issues to the test. Tackling the com-
plexities involved in implementation has held many people back from
this, and many parts of the SFI market remain unexplored. Ehren-
treich’s book is an important and careful study of some of the issues
involved in the workings of the SFI stock market.
As Ehrentreich’s book points out in its historical perspective, the
SFI market was intended as a computational test bed for a market
with boundedly rational learning agents replacing the standard setup
of perfectly rational equilibrium modeling common in economics and fi-
nance. These agents exhibit reasonable, purposeful behavior, but they
are not able to completely process every aspect of the world around
them. This can be viewed much more as a function of the complex-
ity of the world, rather than the computational limitations of agents.
In a financial world out of equilibrium, optimal behavior would re-
quire knowledge of strategies being used by all the other agents, an
information and computational task which seems well out of reach of
any trader. The SFI market’s main conclusion was that markets where
agents were learning might not converge to traditional simple rational
expectations equilibria. They go to some other steady state in which
a rich set of trading strategies survives in the trader population. In
this steady state the market demonstrates empirical signatures that
are present in most financial time series.
VIII Foreword
Minneapolis, MN
September 2007 Norman Ehrentreich
Contents
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
4 Learning in Economics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
4.2 Definitions of Learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
4.3 Rationality-Based Learning Models . . . . . . . . . . . . . . . . . . . 31
4.4 Biologically Inspired Learning Models . . . . . . . . . . . . . . . . . 32
XIV Contents
Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 187
11.1 Timing in the Stock Market . . . . . . . . . . . . . . . . . . . . . . . . . 187
11.2 Fundamental and Technical Trading Bits . . . . . . . . . . . . . . 189
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 195
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 227
Part I
2.1 Introduction
4
A comparison of the different versions by Schelling is given by Pancs and Vriend
[332]. The most important distinction is the dimensionality of neighborhoods. In
his 1969 model, a neighborhood is defined only in one dimension, i.e., agents popu-
late a line, while his later models use two dimensional lattices. A two-dimensional
web-based simulation example in NetLogo by Wilensky [436] can be found at
http://ccl.northwestern.edu/netlogo/models/Segregation.
5
Pancs and Vriend [332] extend the framework by asking how individual prefer-
ences may alter the outcome. Surprisingly, the results are very robust to changes
in preferences. Even if all agents strictly prefer perfect integration, neighborhood
segregation will still occur.
2.2 The Representative Agent Modeling Approach 7
Lucas, however, criticized that it is likely that some of the the parame-
ters contained in θ change due to a shift of policy regime λ. Aggregate
quantities and prices might react differently than predicted since agents
may change their behavior in a way which is not captured in the ag-
gregate equations. For instance, agents could adapt their expectations
about future inflation rates or, in the case of rational expectations,
change them even before an anticipated policy shift is implemented.
An attempt to exploit a potential trade-off between unemployment and
the inflation rate through an expansionary monetary policy may thus
be foiled. Taking the Lucas-critique into account, equation (2.1) should
be rewritten as
yt+1 = F (yt , xt , θ(λ), µ, ǫt ) , (2.2)
where θ(λ) contains regime dependent parameters, while µ is thought
to consist of truly invariable taste and technology parameters.
While Lucas offered no solution to this fundamental problem, rep-
resentative agent models were soon to be thought of offering an easy
escape from it. Going beyond simple aggregate relationships and an-
alyzing the economy at a deeper level than before, macroeconomists
pretended to know the structural equations from which the aggregate
2.2 The Representative Agent Modeling Approach 9
supply and demand curves are derived [371]. If a policy change is an-
nounced or implemented, the representative agent simply recalculates
his optimization problem, given his objective function and budget con-
straints. This approach also satisfied the desire for a microfoundation
in the new classical sense since behavior is derived from a utility max-
imization problem.8
Hartley argues though that it is impossible to identify truly in-
variable taste and technology parameters [172]. Acknowledging this,
economists should realize that the Lucas critique imposes a standard
that no macroeconomic model can probably ever fulfil. Since represen-
tative agent models suffer from the same deficiencies as old style Key-
nesian macroeconomic models, the justification for their use is greatly
undermined.
8
According to [172], there is another view of what constitutes an appropriate mi-
crofoundation. Keynesian models, for instance, backed up their macroeconomic
relationships with some explanatory story which is thought to be entirely suffi-
cient.
9
Hartley contrasts the requirement of true structural assumptions in Walrasian
equilibrium models with Friedman’s famous dictum that the realism of assump-
tions is irrelevant [172]. For Friedman, the validity of a theory is based on how
well its prediction match reality, no matter how realistic its assumptions.
10 2 The Rationale for Agent-Based Modeling
that the actual world would not look very different from the model if
populated by identical clones [172, p. 66].
There is mounting evidence, however, that this is not the case. First
of all, representative agent models are usually characterized by a com-
plete absence of trade and exchange in equilibrium [225], one of the
most basic activities in a market economy.10 For instance, in the classi-
cal CAPM [389, 252, 313], there is no trade after agents have completed
their initial portfolio-diversification.
It has long been known in economics that what is true for individual
agents may not hold for the aggregate economy. This phenomenon is
called the fallacy of composition [61, 172]. Together with its logical
counterpart, the fallacy of division, it highlights the tension between
micro- and macroeconomics. The economy as a whole is formed by
many consumers and firms whose interactions may cause emergent be-
havior at the macroeconomic level. Correct policy recommendations for
individual economic units may not work for the aggregate economy or
vice versa. For instance, in times of recession, a profit maximizing firm
is likely to lay off workers in order to survive, while a similar action
by the government as an aggregate player will aggravate the economic
downturn.
Representative agent models usually commit the fallacy of com-
position by ignoring valid aggregation concerns. Kirman, for instance,
provides a graphical example based on [203] in which the representative
agent disagrees with all individuals in the economy [225]. Policy rec-
ommendations based on the representative agent, a common practice
in today’s macroeconomics, are illegitimate in this case.
A rigorous treatment of this logical fallacy can be found in the liter-
ature on exact aggregation.11 Gorman [160] was the first who derived
general conditions under which the aggregation of individual prefer-
ences is possible. He showed that aggregate demand is dependent on
income distribution unless all agents have identical homothetic utility
functions. Only when their Engel curves are parallel and linear, a redis-
tribution of income will leave the aggregate demand unaffected. Other
authors [412, 203, 249] derived similar conditions for exact aggregation,
10
In the literature, there are as many no-trade theorems [306, 12, 422] as attempts
to solve this apparent contradiction with economic reality. These attempts are
usually characterized by a relaxation of the assumption of strict homogeneity of
market participants [83, 230, 428].
11
An introduction to the problem of exact aggregation can be found in [161].
2.2 The Representative Agent Modeling Approach 11
the least restrictive ones given in [249]. In any case, those conditions
are still so special that no economist would ever consider them to be
plausible [225]. In the unlikely event of them being satisfied, no-trade
situations would be the result.
For apparent reasons, the literature on exact aggregation has largely
been ignored by representative agent modelers. In summarizing this
body of research, Kirman concludes that the reduction of a group of
heterogeneous agents to a representative agent is not just an analytical
convenience, but is “both unjustified and leads to conclusions which are
usually misleading and often wrong”. Hence, “the ‘representative’ agent
deserves a decent burial, as an approach to economic analysis that is
not only primitive, but fundamentally erroneous” [225, p. 119].
age opinion to be” with some participants forming even higher order
beliefs about other participants’ beliefs.12
Asset pricing models with higher order beliefs, however, are prone to
an excess reliance on public signals [6]. Instead of always tracking the
fundamental value, those models may exhibit bubbles and sunspots,
i.e., equilibria that are different from the fundamental equilibrium sim-
ply because agents believe that a purely extrinsic random variable, a
so-called sunspot13 variable, has an effect on the equilibrium alloca-
tion. Bubbles and self-fulfilling sunspot equilibria constitute a kind of
market created uncertainty [418] or behavioral uncertainty [334], which
is caused by the direct or indirect interaction between heterogeneous
market participants.
The prevailing hypothesis in expectation formation since the early
seventies is that of rational expectations [315]. It assumes that agents
have perfect knowledge of the underlying market equilibrium equations
and that they use these equations to determine their rational expec-
tations forecast. As a result, the agents’ subjective expectations are
identical to the objective mathematical expectations given the avail-
able information set. Any errors in expectations must be random with
a mean of zero. In other words, a consistent bias in expectations is
inconsistent with the presumed rationality of the agents.
When forming their expectations of future payoffs, investors with
rational expectations do not stop with third or fourth order beliefs
about other beliefs. Their perfect deductive logic requires them to do
an infinite regress about the expectations of every other investor. Only
for homogeneous expectations among investors are these expectations
well defined. The expectations of heterogeneous investors, on the other
hand, that their expectations remain indeterminate under the Ratio-
nal Expectations Hypothesis [334, 11]. Thus, heterogeneous investors
cannot use deductive logic in forming their expectations about future
payoffs. Perfect rationality and rational expectations in particular can-
not be well defined in analytical asset pricing models with many het-
erogeneous investors.
12
A formalization of Keynes’ beauty contest can be found in [334, p. 276–281].
Further discussions of expectation formation among heterogeneous investors are
[425, 33].
13
Instead of “sunspots”, the terms “animal spirits” [218] or “self-fulfilling prophe-
cies” are sometimes used in the literature. The term “sunspot” is used to point out
the arbitrariness of the variable. A model solution depends on a sunspot variable
only because everyone believes in its importance. Reviews on sunspot equilibria
and endogenous fluctuations can be found in [29, 14].
2.3 Rational Expectations and Disequilibrium Dynamics 13
20
A commonly accepted formal definition of emergence in CAS does not yet exist
to my knowledge. Bankes [22] observed that the declaration of emergent behavior
seems to be at the discretion of the researcher who intuitively relies on visual ob-
servations of differences between the micro- and macrolevel. Thus, he calls for an
operational definition of emergence such as the specification of some standardized
threshold level of micro- and macrolevel differences above which emergence can
be declared.
2.5 Some Methodological Aspects of Agent-Based Simulations 17
21
Axelrod [13] views simulation as a third research method in science. Like deduc-
tion, it starts with explicit assumptions, but it does not prove theorems. The
simulated data are then analyzed by means of indexinduction induction. But un-
like induction, these data come from a specified rule set and not from real world
observations.
22
According to Popper [341], the long-standing debate about the problem of induc-
tion in science was initiated by David Hume in the eighteenth century. Because
past evidence tells us only about past events, to base expectations about future
events on them is simply irrational. Popper then claimed to have solved Hume’s
induction problem by positing his principle of falsification as a direct antithesis
to induction. In economics, the tension between inductive and deductive methods
culminated in the famous “Methodenstreit” (conflict of methods) between Gus-
tav Schmoller, a representative of the Younger Historical School in Germany, and
Carl Menger as a member of the Austrian School of Economics.
23
A similar trade-off between rigor and relevance in economics is presented in [291].
18 2 The Rationale for Agent-Based Modeling
making source codes available.24 But as with the lack of sufficient sen-
sitivity analyses and standardized testing methods for computational
models, it should not be a severe obstacle for the usefulness of agent-
based simulation in economics.
24
The Santa Fe Artificial Stock Market, for instance, was first programmed in
Objective-C. Later, it was reprogrammed to utilize the SWARM-simulation li-
brary. Wilpert [437] ported the model to Borland-C++, while Ehrentreich [106]
reprogrammed the market under Java and the RePast-library. Other artificial
stock markets use mathematical program packages such as GAUSS [268, 269] or
Matlab [240].
3
The Concept of Minimal Rationality
I have had my results for a long time: but I do not yet know
how I am to arrive at them.
Karl Friedrich Gauss
3.1 Introduction
For a long time economists have felt that the idealization of rationality
for modeling purposes needed theoretical justification. The first evo-
lutionary arguments for rationality appeared during the 1930’s in the
works of Schumpeter [382] or in the famous marginalism controversy
by Harrod [171] who compared new business procedures to “mutations
in nature”. Alchian [3] later claimed that competition will lead to a
survival of firms with positive profits, whether or not they consciously
maximize them or not.1
The debate whether forces of competition will assure that only the
“fittest”, i.e., most rational actors survive in the market was further
spurred on by Friedman’s famous “as-if” argument [140]. He claimed
that individuals who do not behave completely rationally, for instance,
by not consciously maximizing expected returns, could still be consid-
ered as if they do. Business behavior that is consistent with the maxi-
mization of returns hypothesis, even though it may only be by chance,
will lead to prospering firms, while “natural selection” will weed out
those firms whose behavior contradicts the maximization hypothesis.
1
A discussion of the marginalist controversy can be found in [310]. Evolutionary
ideas in economics, however, can be traced back even further than this. Hodgson
[180] gives a comprehensive account of the history of evolutionary thought in
economics.
20 3 The Concept of Minimal Rationality
2
An extensive discussion and critique of Friedman’s evolutionary argument can be
found in the chapter “Optimization and Evolution”, in [180, ch. 13]. Celebrating
the fiftieth anniversary of Milton Friedman’s ‘The methodology of positive eco-
nomics’, the 2003 annual meeting of the Allied Social Science Associations devoted
a complete symposium to it. The papers delivered at that session, e.g., [353, 273],
are contained in a special issue of the Journal of Economic Methodology.
22 3 The Concept of Minimal Rationality
3
The term “economic rationality” might not be optimal since it is by no means
“economical.” It always requires the use of all our cognitive and computational
capabilities—sometimes even beyond that—to perform simple tasks for which
simple heuristics would suffice. In this respect, procedural or situational rational-
ity, to be defined below, are more economical concepts of rationality.
4
A survey about violations of the basic postulates of economic rationality can be
found in [64].
5
Herbert A. Simon revived the Scottish word ”satisficing” (=satisfying) to denote
problem solving and decision making that sets an aspiration level, searches until
an alternative is found that is satisfactory by the aspiration level criterion, and
elects that alternative [394, Part IV ].
6
For a discussion of Weber’s and Schütz’s contributions to rational choice theory,
see [379].
7
A typical definition of economic rationality by Hirshleifer [179] illustrates the
lack of any situational components. “In the light of one’s goals (preferences),
if the means chosen (actions) are appropriate, the individual is rational; if not,
3.2 Economic, Bounded, and Situational Rationality 23
irrational. ‘Appropriate’ here refers to method rather than result.” Note that
rationality is solely defined in terms of one’s preferences, not in terms of the
specific environment the decision maker faces.
8
Langlois [232] points out that models in the field of New Institutional Economics
are able to explain the evolution of rules. Norms and conventions emerge through
a process of repeated games. Eventually, agents will discover “evolutionary stable
strategies.” These strategies are simple rules which then become institutionalized.
9
A dictionary definition simply asserts that being rational means “agreeable to
reason, reasonable, sensible”. In psychology, an appropriate deliberation process
is viewed as a key ingredient of rationality. It is in this tradition that Herbert
Simon used bounded rationality and procedural rationality as synonyms since
the focus is on the reasoning process itself rather than on identifying a goal-
maximizing action in a given situation.
10
However, the term consistent expectations equilibria has now been claimed by
Hommes and Sorger [191] and Sögner and Mitlöhner [406]. It is an informationally
less demanding concept than the Rational Expectations Hypothesis. Agents do
24 3 The Concept of Minimal Rationality
not need to know the underlying market equilibrium equations. They only need
to have beliefs which turn out to be consistent with actual observations.
3.3 Situational Analysis, Minimal Rationality, and the Prime Directive 25
11
Rubinstein, however, allows for investor heterogeneity by allowing differences of
opinion and uncertainties about other investors’ characteristics.
12
Friedman also claims that if there is one hypothesis that is consistent with the
available evidence, then there is an infinite number that are consistent (p. 9).
13
In Occam’s own words it reads as “Pluralitas non est ponenda sine neccesitate”,
which translates into English as “Causes are not to be multiplied beyond neces-
sity.”
14
Karl Popper points out that the application of Occam’s razor may be used inap-
propriately, which may lead to bad problem reductions [341, p. 308].
26 3 The Concept of Minimal Rationality
17
For static inflation expectation, i.e., for c0 = 1 and all other cn = 0 in equation
3.1, agents do not adapt their inflation expectations at all.
28 3 The Concept of Minimal Rationality
4.1 Introduction
The
normalization through the denominator ensures that the condition
h∈H p(h, t) = 1 is fulfilled for each period. As time proceeds, the
probability p(h, t) for the correct hypothesis should converge to 1, while
the probabilities for the other hypotheses should approach zero. Given
these updated probabilities for each hypothesis, agents chose an action
that maximizes their expected utility.
While Brenner points out that the basic mechanism of Bayesian
learning roughly corresponds to the psychological notion of cognitive
learning, he admits that it lacks empirical or experimental justification.
It is very doubtful whether people are indeed able to do the necessary
calculations that Bayesian updating requires. He also refers to the psy-
chology literature which finds that people tend not to simultaneously
consider many competing hypotheses [52]. Experimental evidence such
as Monty Hall’s three-door anomaly, too, suggests that agents consis-
tently deviate from the rational Bayesian solution [139].
4
In population genetics , fitness is often defined as reproductive success which is
only an ex post measure. Since species or members of a specific species do not come
with a fitness function attached to them, evolutionary biology has tremendous
difficulties in defining fitness. Lewontin admits that “although there is no difficulty
in theory in estimating fitnesses, in practice the difficulties are insuperable. To
the present moment no one has succeeded in measuring with any accuracy the net
fitness of genotypes for any locus in any environment in nature” [250, p. 236].
5
In evolutionary algorithms, this depends on the specific implementation of the
selection operator. Elitism, for instance, retains some of the fittest individuals in
each generation [308].
6
Even though most evolutionary algorithms have the Markov property, past ex-
periences could be implicitly captured in an appropriate definition of the fitness
function.
7
The application of “Darwinism” to economics and the use of biological analogies,
however, is vehemently defended by Hodgson [181]. He finds that most criticisms
of biological analogies are unfounded and that Darwinism involves a basic philo-
34 4 Learning in Economics
fi (x, t)
xi (t + 1) = xi (t) , i = 1, . . . , n. (4.3)
f (x, t)
In this case, multiple equilibria are possible, and the mean fitness of the
population may or may not increase [234]. That is, under the conditions
likely to exist in competitive markets, the formalization of Friedman’s
argument with replicator dynamics shows that evolutionary market
forces do not ensure emerging rationality as an inevitable result.11
Replicator equations are often used to model social learning in which
agents learn from others by imitating other, more successful agents
[377, 378]. However, replicator equations predetermine the benchmark
level of efficiency to that of the most efficient firm contained in the
initial population [102]. Since replicator dynamics account for selection
only and contain no element of discovery or invention, a superior be-
havior missing in the initial population of behavioral rules cannot be
found. Novelty can be included with the so-called selection-mutation
equation, sometimes also referred to as the Fisher-Eigen equation.12
This is an extension of simple replicator dynamics by which novelty
is accounted for by adding a mutation term to equation 4.2.13 Since
replicator dynamics and the selection-mutation equation allow for a
10
The continuous time
version
of the replicator equation is more widely known and
reads as ẋi = xi fi (t) − f¯ . However, since computer simulations can only evolve
in discrete time steps, the discrete versions of any formula will usually be given.
11
In a more detailed analysis, Blume and Easley show on the one hand that market
selection favors profit maximizing firms. The long-run behavior of evolutionary
market models, however, is not consistent with equilibrium models based on the
profit-maximization hypothesis [43].
12
This equation is named after two evolutionary biologists, Ronald A. Fisher from
Great Britain, and Manfred Eigen from Germany.
13
More details are given in [52].
36 4 Learning in Economics
BEGIN
randomly create initial population P0
WHILE NOT stopping condition DO
t := t + 1
Evaluation of Pt−1
Selection from Pt−1 and Reproduction into Pt
Recombination (Crossover) on Pt
Mutation on Pt
END
END
Fig. 4.1. Basic structure of the canonical GA.
Si = { 0 1 1 0 1 0 1 0 1 1 0 0 1 0 1 1}.
16
A combination of social and individual learning would correspond to migration
between different populations. In population genetics this is analyzed with so-
called island or stepping-stone models [126].
17
Often, the adherents of the competing school of evolutionary strategies criticized
the necessity of constantly coding and decoding the choice variables [351].
40 4 Learning in Economics
Evaluation
constant C to all raw fitness values Φ(i). This, however, causes a prob-
lem since simple fitness proportionate selection is not scaling invariant,
i.e., adding an identical offset C tends to equalize the selection proba-
bilities.
Table 4.1. The property of scaling invariance for fitness proportionate selec-
tion means that adding an identical offset—in this case 100 in column four—to
the raw fitness values Φ(i) of three genetic individuals almost equalizes their
selection probabilities ΠRW (i).
Recombination (Crossover)
Since crossover involves the exchange of genetic material between two
parents, it is often called a sexual genetic operator. By selecting two
chromosomes from the mating pool with high fitness values as parents,
genetic material from both chromosomes is exchanged to create two
new offspring. In the canonical GA, the offspring replace their parents,
and simple one point crossover is used.
parent 1 parent 2
0 1 0 0 0 1 1 0 0 1 1 0 1 1 1 0 0 1 1 0 1 1 0 1 1 1 0 0 0 1 0 0
0 1 0 0 0 1 0 1 1 1 0 0 0 1 0 0 0 1 1 0 1 1 1 0 0 1 1 0 1 1 1 0
offspring 1 offspring 2
Fig. 4.2. Example of one-point crossover with a random break point after
the fifth bit.
parent 1 parent 2
0 1 0 0 0 1 1 0 0 1 1 0 1 1 1 0 0 1 1 0 1 1 0 1 1 1 0 0 0 1 0 0
0 1 1 0 0 1 1 1 1 1 1 0 0 1 0 0 0 1 0 0 1 1 0 0 0 1 0 0 1 1 1 0
offspring 1 offspring 2
Mutation
Mutation refers to the creation of a new solution from only one par-
ent through spontaneous allele changes. It is an important element of
any evolutionary algorithm to prevent premature convergence. Unlike
crossover, mutation is able to introduce non-existing alleles at previ-
ously homogeneous bit positions. If, for instance, the most significant
bit is set in none of the trading rules, yet the optimal solution would
require it to be set, a GA without mutation could never reach the op-
timum. Mutation aims at maintaining a sufficiently diverse population
since without it, the final outcome of a GA is a uniform population
with no diversity at all.
In mutation, each parent bit is flipped with mutation probability
ΠM . Mutation probabilities are usually chosen to be very low, usu-
ally in the magnitude of 10−2 − 10−3 . Originally intended only as a
background operator [154], recent empirical and theoretical research
demonstrates improvements in the GA when strengthening the role of
mutation as a search operator [17]. Fogarty [130] empirically finds that
GAs with initially bigger mutation rates which exponentially decreases
over time do better than fixed mutation probabilities, a finding that is
theoretically confirmed by Bäck [16].
As with crossover, a mutated offspring replaces its parent in the
canonical GA. In both social and individual learning contexts, muta-
tion is usually interpreted as a metaphor for either experimentation or
discovery through unintended mistakes.
4.4 Biologically Inspired Learning Models 45
fξ (t)
Nξ (t + 1)|Nξ (t) ≥ Nξ (t) [1 − Dc (ξ)][1 − Dm (ξ)]. (4.8)
f (t)
The observed fitness fξ (t) is given by the mean fitness of all instances
belonging to schema ξ in generation t, while f is the mean fitness of the
whole population at t. Dc (ξ) and Dm (ξ) denote upper bounds for the
disruptive effects on schema membership by crossover and mutation.25
These disruptive effects have to be analyzed specifically for each opera-
tor. For instance, one-point crossover with crossover probability Πc can
disrupt the schema membership of an offspring only if its cross point
falls within the defining region of a schema. For chromosomes of length
L, the upper bound for the disruptive effect of one-point crossover on
schema membership is then given by Πc δ(ξ)/(L − 1). Similarly, for mu-
tation in which each gene is altered with mutation probability Πm ,
schema membership is only disrupted if the mutation operator alters
at least one of the defining positions of the chromosome. Since, by def-
inition, the probability that none of the defining positions of a schema
is altered is (1 − Πm )o(ξ) , the membership disruption Dm (ξ) caused by
24
A more general analysis for alphabets with higher cardinality can be found in
[347].
25
Note that if we neglect these disruptive terms, i.e., if we have a GA without
crossover and mutation, the evolution of the GA population is completely driven
by the dynamics of the selection operator. The inequality equation 4.8 then re-
duces to the already known replicator dynamics equation 4.3 on page 35.
46 4 Learning in Economics
fξ (t) δ(ξ)
Nξ (t + 1)|Nξ (t) ≥ Nξ (t) 1 − Πc [1 − Πm o(ξ)] . (4.9)
f (t) L−1
The Schema Theorem thus shows how low order schemata with
small defining length and above average fitness will quickly propa-
gate through a population. Holland and Goldberg [182, 154] call these
schemata building blocks, which, through recombination, are likely to
form more complex solutions with potentially higher fitness. They refer
to this conjecture as the Building Block Hypothesis. However, while the
Schema Theorem is easy to prove, Radcliffe doubts that the Building
Block Hypothesis can ever be proved [347].
Overall, the significance of the Schema Theorem and of the Building
Block Hypothesis remains highly debated [352, 92]. One of the short-
comings of the schema theorem is, for instance, that it only considers
the disruptive effects of crossover and mutation [89]. In addition, the
notion of implicit parallelism [182], i.e., the idea that GAs can pro-
cess many different schemata at once, and its relation to the cardinal-
ity of the representation remains highly disputed. Implicit parallelism
and the supposed optimality of binary representation is also discussed
in [154, 347]. An alternative set of principles for understanding the
working of a GA—the evolutionary progress principle, the genetic re-
pair hypothesis, and the mutation-induced speciation by recombination
principle—is suggested by Beyer [31].
Classifier systems are machine learning systems that derive their name
from their ability to actively classify their environment into recurrent
patterns. Classifier systems in the tradition of the Michigan-approach
were developed by John Holland and Judy Reitman at the University of
Michigan [187], while those in the tradition of the Pitt-approach were
developed by Smith at the University of Pittsburgh [403].26 Both are
a synthesis of expert systems as described in [301, 434] and Holland’s
genetic algorithms. In contrast to expert systems, though, classifier
26
For a comparison of both styles, see [402]. The following description is based on
Michigan-style classifier systems since they are more popular and have undergone
more development.
4.4 Biologically Inspired Learning Models 47
systems do not need to be initialized with a fixed rule for every possible
situation. Instead, classifier systems are flexible in that a GA adapts
the rule sets to a changing environment. Today, it is common to speak
of learning classifier systems (LCS) instead of classifier systems. The
general structure of LCS and their interactions with the environment
are shown in figure 4.4.
Environment
5 1 6
action
Message Board
1.65 0101011100
matching
4
Conflict Resolution
2 Classifier List
(rule selection)
condition part action part strength active
{01##01###0,0.95,1.25} 01##01###0 0.95 1.25 ¥
or 3 ##1100##10 2.25 0.78 –
{0##101##00,1.65,1.33} 01##00#100 3.50 0.33 –
?? : : : :
0##101##00 1.65 1.33 ¥
The state of the environment is sent to the LCS with a set of detec-
tors. These detectors post standardized messages about the environ-
ment to a message board with which a list of classifiers are matched.
Classifiers are condition/action rules of the form
48 4 Learning in Economics
5.1.1 Definitions
1
In his 1991 article [116], however, Fama relabels the taxonomy of information sets
which he developed in 1970. The first category is now called “tests for return pre-
dictability”. Semi-strong tests of market efficiency are relabeled as event studies,
while for strong-form tests, he suggests the term “tests for private information”.
2
A critique and solution to Grossman and Stiglitz’s information paradox is given by
Helliwg [176]. In an informationally efficient market, investors who do not engage
in costly information acquisition derive information from the market price even
before any transaction has taken place. In a dynamic setting, however, investors
who actively acquire information are able to use it before it is revealed through
prices.
5.1 Efficient Markets and the Efficient Market Hypothesis 53
pt+1 = pt + ǫt (5.1)
Pearson and Lord Rayleigh in 1905 in the journal Nature [333, 349]. These three
letters can be downloaded from http://www.e-m-h.org/Pear05.pdf.
6
Fama [115] notes that the terminology in finance is loose at this point. First
of all, expected price changes can be non-zero such that we have a “random
walk with drift”. Secondly, if one period returns are iid, prices will not follow a
random walk since the distribution of price changes is dependent on the price
level. Third, instead of simple white noise ǫt ∼ iid(0, σǫ2 ) disturbances, Gaussian
white noise ǫt ∼ iidN (0, σǫ2 ) is often assumed [59]. Besides these variations, three
versions of the random walk hypothesis are identified in the finance literature: The
“independently and identically distributed returns” version, the “independent
returns” version, and the “uncorrelated returns” version [65].
7
For a more detailed account of the early history of efficient markets, see [115, 245,
100].
8
It is common in the EMH literature to speak of prices as following martingales.
We will adopt this somewhat imprecise but convenient usage by specifying that
prices should be understood to include reinvested dividends. Since prices can
easily be converted to returns and vice versa, equation 5.2 could also be written as
Et [p̃t+1 |Φt ] = pt [1 + E(r̃t+1 |Φt )], with r̃t+1 = (p̃t+1 − pt )/pt being next period’s
return of the asset.
5.1 Efficient Markets and the Efficient Market Hypothesis 55
At the time when the tenets of the EMH were formulated, logarithmic
asset returns r△t (t) = ln p(t + △t) − ln p(t) were usually thought to
be normally distributed. At the same time, however, Mandelbrot [276]
presented convincing evidence that this assumption cannot be upheld.
In his long-term analysis of cotton prices, he showed that for various
choices of △t, ranging from one day up to one month, extreme events
occur much more often than implied under the Gaussian hypothesis.
Subsequent empirical studies unanimously confirmed this finding of
leptokurtosis for virtually all financial asset return distributions, i.e.,
5.2 Stylized Facts of Financial Markets 57
0,030
0,025
relative frequency
0,020
0,015
0,010
0,005
0,000
-7 -6 -5 -4 -3 -2 -1 0 1 2 3 4 5
sigma
Fig. 5.1. Comparison of daily DAX returns for the period 01/03/1986 −
12/30/2005 with normally distributed returns. The negative 9.47σ-event on
October 16, 1989 was cut off. The daily mean return at σ = 0 equals 0.027%,
the standard deviation is 0.014 (implying an annual variance of 23.0%), the
skewness is negative with −0.463, and the excess kurtosis is determined as
5.84.
11
Kurtosis derives from the Greek word kyrtos for “curved” and is a measure of
how tall or squat a curve is. A normal distribution is mesokurtotic with a kur-
tosis of three. Thin-tailed distributions with a kurtosis less than three are called
platykurtotic. Excess kurtosis is defined as κ − 3.
58 5 Replicating the Stylized Facts of Financial Markets
three −6σ, and one −9σ events occurred. Under the Gaussian assump-
tion of asset returns, this would have been extremely unlikely. The oc-
currence of extreme events in figure 5.1 is also in line with Mandelbrot’s
observation that the positive tails contain systematically fewer events
than the negative tails. This particular feature of empirical return dis-
tributions is captured by a distribution’s third moment skewness
N
1 (rj − r̄)3
S= . (5.5)
N σ3
j=1
A skewness smaller than zero indicates that the negative tail of the
distribution has more probability mass than the positive tail.
As a joint test of sample skewness S and sample kurtosis κ, the
Jarque-Bera test statistic [197, 30]
2
S κ−3
JB = N + ∼ χ2 (2) (5.6)
6 24
of 7313.4 with a p-value of 0.000 clearly rejects the null hypothesis of
normality.12
The leptokurtosis of return data is explained by the mixture of dis-
tributions hypothesis [74] which asserts that the returns are sampled
from a mixture of distributions with different conditional variances. The
stable Paretian hypothesis advanced by Mandelbrot, on the other hand,
claims that returns distributions are best characterized by a different
class of distributions. In addition to being non-Gaussian, Mandelbrot
found that the shape of daily, weekly, and monthly return distributions
of cotton prices appeared to be similar under different time scales [276].
This invariance is called scaling and it is characterized by power law be-
havior in the tails of a distribution, i.e., extreme events are distributed
according to f (r) ∼ |r|−(1+α) .13 The stable Paretian hypothesis states
12
Other popular tests for normality are, for instance, the Kolmogorov-Smirnov test,
the Shapiro-Wilk W -test, and the Anderson-Darling test. For a review of normal-
ity tests, see [86].
13
Power-law scaling was first discovered by Vilfredo Pareto (1848 − 1923) in the
distribution of wealth. Pareto found that in a population, the proportion of in-
dividuals with wealth W above a certain threshold is determined according to
a power law f (W ) ∼ CW −(1+α) . The power law exponent α was estimated by
Pareto to be around −3/2, f (W ) being the probability density function, and C
a positive constant. Scaling and power law distributions are also very familiar
to physicists because they appear near critical phase transitions from ordered to
disordered states. In fact, much of the work on scaling behavior in finance has
been done by physicists in the econophysics-movement. For an introduction into
econophysics, see [283].
5.2 Stylized Facts of Financial Markets 59
iδq − γ|q|α 1 + iβ q tan π α if α = 0
ln f (q) =
|q| 2 (5.7)
iδq − γ|q| 1 + iβ 2 q ln |q| if α = 1
π |q|
The first stylized fact of empirical return distributions stated that the
frequency of extreme events is larger than under the Gaussian hypoth-
esis. Volatility clustering as the second stylized fact notes that these
large returns often tend to emerge in clusters, i.e., large returns tend
to be followed by large returns of either sign.17 This is also known as
volatility persistence. The volatility clusters for the daily DAX returns
for the period 01/1986 − 12/2005 are displayed in figure 5.2.
Volatility persistence means that volatility is serially correlated and,
therefore, partially predictable. Another way to visualize this is to plot
17
Fat tails and clustered volatility are closely connected and not unique to financial
time series. In a study of hydrology, Mandelbrot coined the term “Noah-effect” for
the first phenomenon, referring to extreme precipitation in the biblical account
of Noah [281]. The phenomenon of clustered volatility, i.e., persisting high or
low levels in rivers, was labeled by them as the “Joseph-effect”, drawn from the
familiar biblical story of Joseph interpreting the Pharaoh’s dream to mean seven
years of feast followed by seven years of famine.
5.2 Stylized Facts of Financial Markets 61
0,10
0,05
log. returns
0,00
-0,05
-0,10
-0,15
1 / 86
1 / 87
1 / 88
1 / 89
1 / 90
1 / 91
1 / 92
1 / 93
1 / 94
1 / 95
1 / 96
1 / 97
1 / 98
1 / 99
1 / 00
1 / 01
1 / 02
1 / 03
1 / 04
1 / 05
Fig. 5.2. Daily DAX-returns for the period 01/03/1986 − 12/30/2005.
0,2
raw returns
0,15
0,1
0,05
0
-0,05
1 51 101 151 Lag 201
Fig. 5.3. The first 200 autocorrelations for the raw, squared, and absolute
daily DAX returns (01/03/1986 − 12/30/2005).
rt = µt + σt ǫt (5.8)
q
2
σt = a0 + ai ǫ2t−i , (5.9)
i=1
for the conditional variance.19 Pagan, however, points out that the
residuals from estimated GARCH models are still non-Gaussian and
leptokurtotic [328]. In addition, Mikosch and Stărică [302] discuss some
theoretical properties of GARCH(p,q) processes that are not in line
with real financial data. For instance, stochastic long-term memory
processes exhibit hyperbolic decays in their autocorrelation functions
(ACF), yet the decline in the theoretical ACF for all GARCH(p,q) pro-
cesses is exponential, thus, indicating only short-memory processes.20
Mikosch and Stărică have also recently questioned the validity of
GARCH models to describe stochastic volatility and argued that long-
range dependence effects might be only spurious. They detected that
structural changes, for instance, due to business cycles, cause a viola-
tion of the stationarity assumption by shifting unconditional variances
[304, 305].
Various other refinements for the conditional variance have been
suggested. In asymmetric GARCH models such as the Exponential
19
All parameter estimates are highly significant and the stationarity condition is
fulfilled.
20
Note, however, that Fractionally Integrated ARCH (FIGARCH) models [18] are
able to model long-run dependence in stochastic volatility. For a technical dis-
tinction between short- and long-term dependence, see [303].
64 5 Replicating the Stylized Facts of Financial Markets
GARCH [318], volatility depends not only on the size, but also on the
sign of past shocks. Asymmetric GARCH models thus allow incorpo-
rating the leverage effect, i.e., the empirically observed asymmetric re-
actions of market volatility to positive and negative shocks. For reviews
on GARCH time series models, see [46, 251]. An empirical comparison
of different volatility forecast models can be found in [169, 168].
The continuing dependence on Gaussian innovations with time-
varying variances by GARCH type models is challenged by Mandel-
brot and Hudson [280]. As an alternative, they suggest the Multifractal
Model of Asset Returns (MMAR) [279]. The MMAR combines frac-
tional Brownian motion with the concept of multifractal trading time,
i.e., a random distortion of clock time.21 Rather than being an au-
toregressive process, the MMAR is a cascade model which generates
price series by randomly splitting up a complete interval in smaller
sub-intervals, thereby increasingly roughening the price series by suc-
cessive interpolations. In the limit, the generated price series reproduce
the main features of financial prices: fat tails in the return distribu-
tions, scale consistency, and long memory in volatility. In addition,
the MMAR can generate either martingales or long memory behavior
in log prices. First empirical investigations of the MMAR hypothesis
show promising results [63, 314, 44], yet the assumption of multifractal
time in an economic context is still being questioned [307].
The idea that asset prices and trading volumes tend to form certain
geometric patterns that can be profitably exploited by forecasting fu-
ture price movements is known as technical trading. Charting, as it is
sometimes called, had been popular even before the disclosure of finan-
cial balance sheet information sparked the development of fundamental
analysis as its main competitor. Its origins can be found in the works
of Charles H. Dow (1851–1902) and William P. Hamilton (1867–1929),
who published a series of 252 editorials in the Wall Street Journal at
23
Coordination on similar trading strategies because of media-induced herding be-
havior would constitute a strategic complementarity [125] which can lead to large
deviations from the outcome predicted by rational models.
66 5 Replicating the Stylized Facts of Financial Markets
the turn of the century. These editorials contained the basic tenets of
what became later known as the “Dow Theory” [320]. Another school of
technical analysis, Elliott-wave theory, claims that market prices follow
several superimposed market trends of different degrees, all of which
are rooted in cyclical investor psychology [107, 108, 342].
Since technical analysis is based on the assumption that markets
exhibit not even the lowest degree of market efficiency, it is anathema
to many academics. Malkiel [274], for instance, compares the scien-
tific merits of chart reading to those of alchemy.24 Jegadeesh [199], too,
points out that academic concerns against technical analysis run deeper
than simple “linguistic barriers” [256]. He identifies its weak founda-
tions, i.e., the missing plausible explanations why one should expect
market patterns to repeat, as the main culprit.
The academic studies that deal with the extent to which technical trad-
ing is used in real markets mainly investigate the foreign exchange mar-
ket. Frankel and Froot, for instance, attribute the speculative bubble
in the period from 1981–85 to a shift from fundamental trading strate-
gies towards technical ones [138]. Questionnaire evidence from forex
traders about their use of technical analysis was gathered by Allen
and Taylor [7, 419]. Both studies found that the relative importance
of charting strategies diminishes as the investment horizons increase.
For horizons ranging from intra-day to one week, up to 90% of the
respondents relied more on technical than fundamental analysis when
forming exchange rate expectations, a finding that is reversed for time
horizons exceeding one year. It is also reported that around 2% of
traders appeared to never use fundamental analysis, independent of
their investment horizons. Menkhoff confirms these results for a differ-
ent place (Germany) and a different time and concludes “that technical
24
According to Popper’s demarcation criterion citePopper1959, however, technical
analysis principally follows a scientific methodology. It produces falsifiable propo-
sitions that can be refuted by empirical observations. Given the questionable track
record of many technical trading systems, their survival surprises many EMH ad-
herents. One possible answer will be given at the end of chapter 9. Another
plausible answer to this mystery is immunization strategies. A website dedicated
to the Dow Theory, for instance, states the following: “To define Dow Theory one
must be open to the concept that there are specific rules that must be accepted with-
out question, while at the same time being ever aware that the theory is dynamic
in its evolution” (http://www.dowtheoryproject.com/define.php). It is hard too
conceive of a more blatantly unscientific statement than this.
5.2 Stylized Facts of Financial Markets 67
just in order to maintain its fitness relative to other species with which
it is co-evolving [427]. It is often referred to as an evolutionary arms
race between competing species. One of the first to study Red Queen
effects in economics was Robson [357], and Markose further discusses
Red Queen examples in an economic context [286].
Fig. 5.6. Typical results from one of Gode and Sunder’s experiments. Top: ZI-
U traders. Middle: ZI-C traders. Bottom: Human traders. Supply and demand
schedules for this experiment are shown on the left. Source: [76, p. 6], redrawn
from [150, p 127].
A series of models with little rationality on the agents’ part was devel-
oped by researchers from the field of econophysics [81, 194, 195, 408,
407]. These models have in common that the decision processes leading
to the individual demands are not explicitly modeled, and individual
demands are assumed to be a random variable. Cont and Bouchaud
[81] argue that it is highly unlikely that the individual demands in
real markets are independent. Therefore, they add a random communi-
cation structure, allowing agents to share information and coordinate
their actions. Random communication structures, even though differ-
80 5 Replicating the Stylized Facts of Financial Markets
and supply
Z(t) = Si (t),
i:Si (t)<0
38
Some early computational models belonging to this group are discussed in [243].
39
The fundamental value in Lux’ model is simply a constant. In [268, 67], an exoge-
nous news arrival process is introduced and the log changes of the fundamental
value are assumed to be Gaussian random variables.
84 5 Replicating the Stylized Facts of Financial Markets
40
That is, arbitrage profits for fundamentalist tends to be realized over the course
of several trading periods. Capital gains and losses, however, accrue immediately.
This asymmetry might explain why myopic fundamentalists might choose not to
use their information about the asset’s fundamental value.
5.4 Agent-Based Computational Models of Financial Markets 85
41
In the physics literature, this dynamic behavior is known as on-off intermittency,
i.e., an attracting state becomes temporarily unstable due to a local bifurcation.
Endogenous forces, though, drive the physical system back to its equilibrium
state.
86 5 Replicating the Stylized Facts of Financial Markets
42
In order to focus on learning behavior, Lettau assumes that the stock price is
exogenously determined and not influenced by the adaptive agents.
43
Note that this specification is a single-population GA where each agent repre-
sents a solution. In multi-population GAs, each agent holds a variety of different
candidates and the GA is actually run inside each agent. Single-population GAs
are sometimes equated with social learning since the genetic operators are ap-
plied across different individuals. Multi-population GAs then refer to individual
learning since no exchange of genetic material between individuals occurs. Vriend
showed that there are different aggregate outcomes depending on whether learn-
ing is modeled as social or individual learning [431].
5.4 Agent-Based Computational Models of Financial Markets 87
S
Ucum = Ui (ωi,j ). (5.21)
j=1
44
The SFI-ASM [244] has the same basic structure and will be introduced in more
detail in the next chapter.
88 5 Replicating the Stylized Facts of Financial Markets
45
This learning structure addresses some methodological criticism concerning the
spreading of trading strategies within populations. It is argued that in single-
population GAs/GPs, only a sequence of actions may be observable, but not the
strategies that lead to these actions. Thus, imitation cannot explain the spread-
ing of strategies within the population. Since Chen and Yeh consider the multi-
population GA/GP approach to be an unsatisfactory response to this criticism,
they resolve this methodological issue through a “business school” in which fac-
ulty members are forced to publish their results.
46
Contrary to their older paper [68], the share of martingale believers is not zero,
but with a share of approximately 1%, very small.
Part II
6.1 Introduction
trading rules. Faster learning speeds with more mutations per time
period imply increased levels of technical (and fundamental) trading.
Thus, postulating the emergence of technical trading by only looking
at the aggregate level of technical trading bits may be too premature.
Since the debate could not be settled without having an actual version
of an artificial stock market, Arthur and Holland decided to program
their own stock market simulation which became known as the Santa
Fe Institute Artificial Stock Market (SFI-ASM). The first version was
operable by the end of 1989 and was described in [331]. The studies
by Arthur et al. [11] and LeBaron et al. [244] were based on a modi-
fied Objective-C version which used a market clearing mechanism in-
stead of an excess demand price adjustment mechanism. It was later
revised by Brandon Weber and Paul Johnson to run with the Swarm li-
braries, a well-known toolkit for agent-based simulations for Objective-
C and Java. This ongoing effort is currently hosted by Paul Johnson
at http://ArtStkMkt.sourceforge.net [204]. The SFI-ASM has inspired
various modelers to do their own research. Joshi et al. [206, 207] slightly
adapted the original Objective-C version to analyze wealth levels. Tay
and Linn [418] extended the original model by using fuzzy logic for
expectation formation. Wilpert [437], who used his own Borland C++
implementation, tested the model with different modifications. He used,
for instance, the generated profits of trading rules instead of forecast
accuracies as a fitness criterion. Gulyás [166] programmed a participa-
tory market model in which real humans were placing market orders
alongside the artificial adaptive agents. They were thus able to study
the effects on actual human decision making on the market behavior
and vice versa.
For this book, the Objective-C version 7.1.2. was ported to Java
(Java 2 SDK, standard edition, version 1.4.0).4 In order to distinguish
between the original SFI-ASM and the current Java-version with which
the simulation results were derived, I will refer to the latter as either
the Java-version of the SFI-ASM, or, when referring to the suggested
modification of the mutation operator, as the modified SFI-ASM. The
Java-version of the SFI-ASM was programmed by using the Repast
(Recursive Porous Agent Simulation Toolkit) library [78].5 Agent-based
4
The source code is available upon request.
5
Repast is being developed at the University of Chicago and is freely avail-
able under the BSD license agreement. The Repast website is located at
94 6 The Original Santa Fe Institute Artificial Stock Market
demand for the risky stock, agents are perfectly myopic in that they
only consider next period’s expected returns. Agents maximize their
expected utility subject to the budget constraint
where xi,t is the amount of stock agent i holds in period t. Under the
assumption of normally distributed stock returns, the optimal amount
of stock x
i,t that agents desire to hold is then determined as
where Ei,t [pt+1 + dt+1 ] is i’s expectation in t about next period’s real-
ization of the stock price and dividend, and σt,p+d2 the empirically ob-
served variance of the stock’s combined price plus dividend time series.
LeBaron et al. point out that the normality assumption of stock returns
holds in the homogeneous rational expectations equilibrium (hree), but
outside the hree-regime, it is not clear whether stock returns will be
normally distributed [244].
The effective demand of an agent is the difference of his actual
and desired stock holdings. Once agents have determined their effective
demands, they submit them as well as their partial derivatives with
respect to the price to a specialist, who tries to balance the effective
demands by setting a market clearing price. If the specialist is not able
to find a market clearing price in the first place, an iterative process
is started in which new trial prices are announced and agents update
their effective demands and partial derivatives accordingly. If complete
market clearing is not reached within a specified number of trials, one
side of the market will be rationed.
where at,i,j and bt,i,j are real-valued parameters constituting the pre-
dictor part of the chosen trading rule j. Only when no rules match
the market descriptor, parameters a and b are determined as a fitness-
weighted average of all trading rules in his rule set.
One period later, the accuracy of all activated rules is checked by
comparing their predictions E[pt+1 + dt+1 ] with the actual realization
of (pt+1 + dt+1 ). A rule’s forecast accuracy is determined as
2
2 1 2 1
νt,i,j = 1 − νt−1,i,j + (pt +dt )−[at,i,j (pt−1 +dt−1 )+bt,i,j ] . (6.7)
θ θ
This forecast accuracy is measured as a weighted average of previous
and current squared forecasting errors. The parameter θ determines the
size of the time window that agents take into account when estimating
a rule’s accuracy. As LeBaron et al. have pointed out, the value of θ is
a crucial design question since it strongly affects the speed of accuracy
adjustment and learning in the artificial stock market. If θ = 1, trading
rules would be judged only on last period’s performance, and forecast
accuracy would be strongly prone to noise. As in LeBaron et al., a value
of 75 is chosen for θ.
2
The forecast accuracy νt,i,j is used as a rule’s variance estimate
2
σt,(p+d) , which is used in equation 6.4 to determine the optimal stock
holdings of agents. Furthermore, it is the main determinant of a rule’s
fitness 2
Φt,i,j = C − νt,i,j + bit cost × specificity , (6.8)
with specificity being the number of conditions in a rule that are not
ignored, with bit cost as an associated cost for each non-ignored con-
dition, and C as a positive constant to ensure positive fitness.10 Non-
zero-bit costs per set trading bit (0 or 1) could be interpreted as the
cost of acquiring and evaluating new information. It could also be seen
as a complexity aversion, since simple rules are favored over more spe-
cific ones. Most importantly, however, LeBaron et al. emphasize that
positive bit costs would ensures that non-# bits in a trading rule serve
low fitness. The specific problems that arise from roulette wheel selection were
already discussed in section 4.4.2 on page 41.
10
The maximum squared forecast error of a trading rule has an arbitrary ceiling of
100, hence, the numerical value for C is set at this value.
6.4 The Basic Structure of the SFI-ASM 99
So far, agents have been equipped with a static rule set. Feedback learn-
ing in the stock market has taken place by identifying and using the
rules that produced better forecasts than others. The quality and the
speed of this type of learning was strongly dependent on the parameter
θ which determined over how many past periods agents averaged the
forecast accuracy of their trading rules. However, if agents started with
a rule set that contained only bad performing rules, in the absence of
any other learning mechanism, they would not be able to find better
ones.
Agents therefore use an additional genetic algorithm (GA) learn-
ing procedure that allows them to alter their rule set by replacing
poorly performing rules with new, possibly better ones.12 Exploratory
GA learning usually happens on a slower evolutionary time scale than
the feedback learning and examines the search space in a random, yet
not directionless fashion. For each agent, the GA is, on average, asyn-
chronously invoked every K periods and replaces the 20 worst rules of
the rule set. The GA-invocation interval K alters the learning speed of
agents and turns out to be the most crucial model parameter.
New trading rules are created either through mutation (with pre-
dictor mutation probability Π = 0.9) or through crossover (with prob-
ability 1 − Π). Crossover is a sexual genetic operator that needs two
parents to work, both of whom are chosen by tournament selection.13
Even though there are a variety of different crossover operators avail-
able, the SFI-ASM uses exclusively uniform crossover for the condition
11
“The purpose of this bit cost is to make sure that each bit is actually serving
a useful purpose in terms of a forecasting rule” (p. 1497). Emphasis added by
author.
12
See, also, section 4.4.2 on GA learning.
13
For tournament selection in the SFI-ASM, two genetic individuals, i.e., trading
rules, are randomly selected from the gene pool, and the fitter of both is chosen.
Goldberg [154] points to another popular tournament selection algorithm by Wet-
zel. Instead of randomly selecting two candidates, Wetzel-ranking picks the two
candidate solutions by using roulette wheel selection. It is not clear whether sim-
ple tournament or Wetzel-ranking is superior or whether it greatly affects results,
yet Wetzel-ranking should return, on average, fitter parents.
100 6 The Original Santa Fe Institute Artificial Stock Market
parts. Here, an offspring’s bit is chosen with equal probability from the
corresponding bit positions of either parent.
0 1 #
0 0 1/3 2/3
P= 1 1/3 0 2/3 . (6.9)
# 1/3 1/3 1/3
This matrix of transition probabilities specifies that a 0-bit is
changed with a probability of one third to a 1-bit, and with probability
of two thirds to a #-bit. Similarly, a 1-bit changes with one third prob-
ability to 0, and with two thirds to #. Don’t care signs # change with
equal probability of one third to either 1 or 0, or remain unchanged.
LeBaron et al. assert that these transition probabilities would, on av-
erage, maintain the specificity, i.e., the fraction of #’s in a rule.
For crossover, the offspring inherits the average forecast accuracy of
its two parents. For mutation, the offspring’s forecast accuracy is set
at the median forecast error over all rules in the agent’s rule set.
Trading rules in the SFI-ASM contain bit sequences that belong to-
gether. Within such a sequence, certain bit combinations are invalid
and constitute an illogical trading rule.
The SFI-ASM deals with this problem in two ways: through a large
rule set, and with a generalization procedure. Once a trading rule has
not been matched for more than 4,000 periods, it is generalized by
converting one fourth of its 0 and 1 trading bits to #. The fitness of a
generalized rule is set at the median value.
Agents also face some trading restrictions in that their orders are
constrained to lie within their budget constraint and that they cannot
go short more than five shares. In addition, the stock price is capped
by the specialist at 200 and is bounded from below at 0.01. These
constraints, however, seem to be binding only in the beginning of the
simulation when the randomly initialized trading rules of untrained
agents result in causing the stock price to fluctuate wildly. The same is
true for a constraint that limits the squared forecast error to 100. This
allows us to choose the constant C in equation 6.8, on page 98, such
that fitness values are always positive.
A graphical depiction of the timing sequence of major activities in
the SFI-ASM can be found in appendix 11.1
ahree = ρ (6.15)
and
bhree = (1 − ρ) (1 + f )d¯ + g . (6.16)
The hree-specialist thus sets the hree-price according to the given div-
idend. This price then ensures that all agents want to hold exactly one
unit of stock, given their hree-forecast of next period’s price plus div-
idend. The normal model behavior for heterogeneous agents can now
be assessed by comparing it with the properties of the homogeneous
rational expectations equilibrium.
Finally, equation 6.10 reveals that agents only need to know last pe-
riod’s price and dividend to derive their forecasts about next period’s
price and dividend, given the mean-reverting dividend process. There-
fore, all the information provided by the condition parts of the classifier
system is unnecessary. Given that the use of technical and fundamen-
tal trading bits is punished with associated bit costs, the hree-solution
should be characterized by complete negligence of technical and funda-
mental trading bits.
and the estimated residual time series ǫ̂t is analyzed to see whether it
satisfies being iid and N(0,4) distributed. The results are summarized
in table 6.4.
First of all, one notices that for both learning speeds, the new Java-
version of the SFI-ASM produces time series that are close to those of
the original SFI-ASM, but generally a little bit further away from the
hree-benchmark. The standard deviations in the residuals are slightly
6.7 Simulation Results of the SFI-ASM 105
.......
0.12 ....... ............
..
Total fraction of bits set ...
..
.. ....
................ ...
...
........ .. .. ..
..... ....... .....
0.11
...
.......... .... ...
...
....... .... .... ...
.......... ... ... ...
...
........... ...... .... ...
0.1 .......... .......
...... ......
...... ...... ....
..... ........ ...
...
..
...
...
...
...
...
......... ... .. ...
.. ......... ... ........................ ....
0.09
... ........... .... ..................
... ................. ... .................................................................
.... ........... ..... . ..................
. .
................................................................... ... .........................
......................... ................. . .................................................................
... ....................... .............. ...... . .................. .
... ......................................................... ................................................................................ ....... . .
... ........... ................................................ .............. .. .. . ...... .............................................................................................................
0.08 .... ........ ........ ............................................................................................. . . ...................
..... ................ ...... ................................. .................................................................................... ..... .........
.... . . ...... .. . . . . ... ............................ ....... ....... ..................................................................................................................................... .... ...
..... . ............................. ................ ...................
...... .... ................................. .................................
........ ........ ............... .........................
............ ........ ............................ .............................. ...
...................... ... ......... ............................................. .....
0.07
.............. ........ . ... . .... . .
......................................................... . .... ..............
...... .. ... . .. .
...................... .. .. ...................................................
.................................................................. ............................. .
.......................
..................................................... .................................................. . ..
.......................................................................... .... .........................................................
............................................................................. ...........
................................................ ..............................................................................................................................................................
.... .................................................................................. . .
0.06
............................................................................ ..... ..
. .... ....
Fig. 6.1. Total fraction of bits set as a function of learning speed in the
replicated Java version of the SFI-ASM. Data were averaged over 5 separate
runs at different random seeds for a GA-invocation intervals of 1,000 (bottom
at period 750,000), 250, 100, 75, 50, and 25 (top at period 750,000).
levels had not been reached at period 250,000. Even after 750,000 peri-
ods, some of the curves in figure 6.1 did not settle down to their equi-
librium levels. However, the basic statement of the original Objective-C
SFI-ASM can be confirmed. There is an increase in the aggregate bit
level for faster learning speeds.
When trying to identify the reasons why the adjustment processes
towards bit equilibrium obviously take longer than in the original ver-
sion, one immediately thinks of the increase in checked trading con-
ditions. The original version had a meager amount of 12 conditions
(including two dummy bits), while the current Java-version checked a
total of 64 trading conditions.16 It seems logical that when agents have
many more opportunities to combine trading conditions to create elab-
orate trading rules, it will take much longer to test them all. Chances
16
The hypothesis that an increase in trading conditions leads to an increase in
adjustment time was confirmed on two occasions. At first, there were only 57
conditions encoded in the Java-version. The two long integers that represent the
condition parts, however, can hold up to 64 conditions. When making full use of
this available space, a slight increase in adjustment time was observable. When
realizing that the break points in the Objective-C version of Joshi et al. [206]
were not optimally chosen, a reassignment of the break points much closer to the
one’s used by LeBaron et al. resulted in an even higher increase in adjustment
time and a higher equilibrium bit level.
108 6 The Original Santa Fe Institute Artificial Stock Market
are also higher that they end up with a higher fraction of useful trading
rules. This could explain the higher equilibrium levels that are attained
by the new Java-version. There is, however, one caveat at this point.
When starting with different initial bit probabilities, the final equilib-
rium bit levels are different, too. There seems to be a path dependency
hidden in the model structure. The reasons for this path dependency
will become clear in the final chapter.
In summing up the behavior under the two learning speeds, Arthur
et al. [11] label the slow learning case as the rational expectations
regime. The price closely tracks the value predicted by the homogeneous
rational expectations equilibrium, and the trading volume is small. The
low number of set trading bits implies that the information contained in
the condition parts is largely neglected by agents. The fast learning case
is characterized by Arthur et al. as the complex or rich psychological
regime. The price series exhibit larger deviations from the hree-case
such as a larger kurtosis and volatility persistence. Equally important,
though, is the increase in the level of set trading bits. Agents appear
to create more trading rules that can exploit useful information in the
price series.
The difference in bit levels between the two GA-invocation intervals
of 1,000 and 250 shown in figure 6.1 does not seem that impressive in
the new Java implementation. The increases in bit levels for learning
speeds faster than 250 are more apparent. Even though the boundary
between the slow and fast learning regime is not a strict one, I prefer
to speak of a complex regime for learning speeds equal to or smaller
than a GA-invocation interval of 100.
For the creators of the SFI-ASM, the increase in technical trading bits
for faster learning speeds pointed to emergent technical trading. Be-
cause of the cost they had attached to every non-# bit, they conjec-
tured that emerging trading bits must have, on average, some fitness-
based advantages by producing more accurate forecasts. Their intuition
guided them to conclude that the classifier systems enabled agents to
detect short-term trends in the price series upon which they started to
act. At the same time, the price dynamics became more complicated,
yet more realistic, and fast learning regimes were labeled as complex.
The dependence of equilibrium bit levels on learning speed became one
of the main results of the original SFI-ASM and has been replicated by
subsequent studies, e.g., by Joshi et al. [206, 207]), and Wilpert [437].
6.8 A Potential Problem: A Biased Mutation Operator 109
0.125
..
.....
... ...
... ....
...... ...
Total fraction of bits set
. .
..... ....
0.1 .............
........
...
....
.....
Π = 0.90
....... ...........
..... ...... ............
...... .. .......
...... ... .............
...... .... ........................
........................
...... ... ..................
..... . .... .................................
............................
Π = 0.75
...... .... .................................
.....
0.075
...... .......... .......................................
..................................... .
...... ................... .............................................................
...... ......................... .
... .... .
...............................................................................
......... .. .......................... ..................................................
... .... ..............
................................................
..... ... ................................................................ .
....................................................................................
...... .. . . . . ..... . .
...... .. . ........ ........ .. . ..... . .
. . ................................................................................................................. .......
...... ... . ... . .
...... ...
...... ...
0.05
...... ...
...... ...
..... ...
......
... ...
.. ...
Π = 0.50
.....
.........
.......................................... .....
.... ...................................................................................................................................................................................................................................................................... ............................... ...............
.. .. .. . . . . .... . ..... .....................................................................................
.. ..
.. ..
Π = 0.25
... ...
.. ...
.. .... .......... ..............................................................................................................................................................................................................................................................................................................................................................................................
..
... ................................................................................ ...... .
0.025 ..
...........................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................
Π = 0.00
0.0
0 200,000 400,000 600,000
Periods
It is apparent from figure 6.2 that the level of bit usage in the
model does not only depend on learning speed, but also on mutation
probability.17 It should thus be crucial to investigate why increasing
17
In figure 6.2, the predictor mutation probability Π was adjusted. A similar effect
could have been achieved by changing the bit mutation probability π and holding
Π constant. Figure 6.2 nicely demonstrates that the GA converges quickly for no
mutation at all. The more agents experiment, the longer it takes them to arrive
their equilibrium bit levels.
110 6 The Original Santa Fe Institute Artificial Stock Market
18
An alternative approach to derive the fixed point of one half is to start by denoting
the initial fraction of bits set before mutation with P ∈ [0, 1]. The non-# bits are
′ 1
mutated to non-# bits with a probability of P|0,1 →1,0 = 3 P , while the probability
′
that a #-bit is mutated to either 1 or 0 equals P|#→0,1 = 32 (1 − P ). Thus, for
any given P , the fraction of bits set after mutation is determined by adding the
′ 1 ′
two probabilities above, i.e., P|0,1,# →1,0 = 3 (2 − P ). Since P|0,1,#→1,0 ∈ [0, 1] is
a continuous function for all P ∈ [0, 1], we know by a fixed point theorem that
6.8 A Potential Problem: A Biased Mutation Operator 111
0.5 p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p pppppppppppp pppppp ppppppppppppppppppppppppppppppp pp ppppp ppp p ppp ppppppppppppppppppppppppppp pppppppppppppppppppppppppppppppppppppppppppppppppppp pppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppp ppppppppppppppppppppppppppppppp ppppp pppppppppppppppppp ppppp ppppp pppppppppppppppppppp ppppppppppppp
Fractions of bits set
p p p ppp ppp
ppp p p p p p
pppp pp p pp p pp pp pp p ppp
p
pp
pp pp
0.25 p p pp p
p pp p p
pp p
pppp
ppppppppppppppppppppp technical bits
p p pp p p p
ppp pp pp pp pp pp fundamental bits
0.1 ppp
p
p ppp
ppp
0
0 50 100 150 200 250
Periods
Fig. 6.3. Fixed point convergence of trading bits in the SFI-ASM for Π = 1.0
and π = 1.0. Further parameter settings were 25 agents, a GA-invocation
interval of 10, no bit costs, and a complete generational replacement of all
trading rules.
′
an equilibrium exists. By repeatedly invoking the mutation operator, P|0,1,# →1,0
converges to its equilibrium value of one half. This result also holds if every bit
in the bitstring is mutated with a probability of less than one. In the SFI-ASM,
this mutation probability π is set at 0.03. Deriving the fraction of non-# bits
′ 2 2
in the same manner as above yields P|0,1,# →1,0 = 3 π(1 − P ) + P 1 − 3 π . It is
easy to check this formula by setting π = 1, which will yield equation (18), or
′
by setting π = 0, which will yield P|0,1,# →1,0 = P since no mutation will ever be
performed. The equilibrium level equals 1/2, even though convergence occurs a
little bit slower than before.
112 6 The Original Santa Fe Institute Artificial Stock Market
19
Upon closer inspection, however, the mean bit levels seem to hover slightly below
the value of half. This either points to an unidentified parameter that also affects
the equilibrium level, or a minor plus/minus one programming problem when
determining the bit fractions in the model. A careful analysis of the source code,
however, leads me to believe that the latter is highly unlikely.
7
A Suggested Modification to the SFI-ASM
An expert is a man who has made all the mistakes which can
be made in a very narrow field.
Niels Bohr
7.1 Introduction
In order to derive valid conclusions about the bit usage in the model,
one should take care in designing bit-neutral operators and procedures.
Bit-neutral refers to the feature of leaving the fractions of set bits unal-
tered, unless an impact is explicitly desired. The bit decreasing effect of
the bit cost parameter, for instance, is desirable as it is a fitness-based
influence. The bit-increasing effect of the SFI mutation operator, on
the other hand, seems problematic since it is completely technical and
economically uninterpretable.
The suggested alternative bit-neutral mutation operator works with
dynamically adjusting bit transition probabilities. In order to infer
whether technical and fundamental bit usage differs in the stock mar-
ket, this mutation operator works separately for fundamental and tech-
nical trading bits.1 Therefore, it is necessary to distinguish between the
initial fraction of fundamental bits set Ff und. , and the initial fraction
of technical bits set Ftechn. . The transition matrix for the fundamental
bits is then given by
0 Ff und. 1 − Ff und.
Pf und. = Ff und. 0 1 − Ff und. , (7.1)
1 1
2 F f und. 2 Ff und. 1 − Ff und.
1.0 p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p
p p p p p pp p p
Fraction of Zero-Bit Agents
p p p p p p p p p p pp p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p p
pppp pppp p pp pp ppp p p p p pp p pp ppp pp ppp
0.8 pp p ppppp pp ppppppppp p ppp pp ppppp ppppp p ppp ppppp p p p
pp pp pp pp p p p p p p p p pp ppp p pp pppppp
pp p pp pp pp pp ppp p pppppp p ppp pppppppppppp ppppppppppp p p p p p pppppppp pppppppp pppppppp pppppppp pppppppp
ppppppppppppppppppppppppppppppppppppp p p p pppppp pppppppp
pppppppppppppppp p ppp p ppp p
0.6 p p p p p p p
p p pppppppppp p p p p pp p pp pp p pp p p p p
p
pp p p p p p p p p p p pp ppp
p pp p ppp p
pppp p p p ppp p p p p pp p p pp
p p pp
pp p pp pp ppp p pppppppppppppppppppppppppppppppppppppppppppppp
0.4 ppp pp ppp p p GA= 25
pp pp pp
ppp pp pp p pp p pp pp pp pp pp pp pp pp pp pp pp pp p GA= 100
pppp ppp pp p
pp p ppp
p p
0.2 ppp p
ppp pp p p
pp
ppp p
pp pp ppppp p ppp ppppp ppppp ppppp ppp GA= 250
p p p
pp p pp p p p pp p pp
p pppppppp pppppppp pppppppp ppp GA= 1, 000
pp p pp p p ppp pp
pp p pp p p pp
0 ppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppppp pppppppp p pppppppp ppppp ppppppppp ppppp p pppppp pppppppppp pppppppp pppppppp p ppppp p
100 250 500 1,000 5,000 25,000 100,000 500,000
Periods
Fig. 7.1. Fraction of bit-neutral agents which discovered the correct hree
non-bit usage solution (Zero-Bit Agents) recorded for different GA-invocation
intervals and averaged over 10 simulation runs. (25 agents with λ = 0.3, bit
cost = 0.01, and initial bit probability of 0.01.)
up the use of their classifier system and neglect any information pro-
vided by it. Since homogeneous agents in a rational expectations equi-
librium would be characterized by a total neglect of trading information
provided by the classifier system, the Marimon-Sargent Hypothesis is
finally supported with respect to bit behavior. When replacing the orig-
inal mutation operator with an updated operator that has the desirable
property of being bit-neutral, most agents seem to realize that, under
the given dividend process, all they need for their forecast production
is the last period’s price and dividend information. Given the biased
mutation operator in the original SFI-ASM, emergent technical trading
bits thus seem indeed to be a design artifact and not a reflection of on-
going technical trading in the market. Changing the mutation operator
appears to be a relatively small change in the design of the artificial
stock market. It leads, however, to radically different results with re-
spect to bit behavior from the original model. I will henceforth refer to
this version as the modified SFI-ASM and to the agents as bit-neutral
agents.
Bit-neutral agents who sooner or later arrive at the zero-bit solu-
tion do so by using smaller and smaller mutation rates. Since Fogarty
[130] has shown that decreasing mutation rates may improve the per-
7.3 Simulation Results with the Modified SFI-ASM 117
150,000 pppp∗pp∗pppp
p∗ppppp ∗pp ∗ ∗ppp
pp∗pp pp∗pppp∗pp ∗p ∗p∗pp∗pp∗pppp pppppp
50,000 ∗∗p∗∗∗∗pppppp
Average GA-invocations
p
∗pppp ∗∗p ppppp p p p
ppppp ∗ppp∗ppppp
25,000 pppp ∗pppppppp pp∗pppppppppppppppppp∗ppppppppppppppppppp∗pppppp ρ = 1.00
p p p p pp
pppp pppp
p p ∗pppppp
10,000
p
p p p p p
pppppppppp ppp pp pppppppp
p p
ppp ppp ∗ppppp
p p p pppppppppppppppppppppppppppppppppppppppppppp ρ = 0.99
∗ppp p pp
pp
p p p pp
pp pppppp ∗pp p p p p p p p p p p p p⋆p p p p p p p p⋆p p ρ = 0.95
5,000 ppp pp p
ppppp p p p ⋆
pppp
p pppp ppppp ∗ppppppppp
p∗pppppp pp pp pp ppppp pp ∗ p pppppppp
2,500 pppppppppp ∗p
ppppppppp pp ppp p pppppppp pp
p
p ⋆
pppp⋆pp⋆p⋆p p ⋆pppp ⋆p ⋆pp
pp p
pppppppppppp ∗pp∗ppppppp
p
p pp p p p p p p
p p p p p p p p p pp p p p p p ⋆ppp⋆p ⋆p pp ⋆p ⋆⋆pp ppppp⋆pppp⋆p⋆p⋆ppp p p⋆p p p p p pppp
p ∗ ∗ppppppp
pppp ∗ppp pppppp p ⋆ p p ppppp
1,000 ⋆p p p
p p ⋆ p pp ⋆ p p
⋆ p p p p ⋆ p p p p p p p
p p p p ppppppppp p p
p ⋆ p pp p p
⋆p p ppp ⋆ ppp pp
p p p p p pp p p p p p p∗ppppppppp
⋆
ppppppp ppp ∗ pppppppp∗ppppppp
p p p ⋆ ⋆p p p⋆p p ⋆ p p pppp
pppp ⋆ p p p ⋆ ⋆p p pp p p p p p p p p
500 pppp p p p pppppp p⋆ ppp pp
⋆
p p ppp
p p p p pp p
ppppppppppppp ∗ ppp∗ppppp∗ppppppppp∗ppp
⋆p p p ⋆ p p p pp p p p ppppppppp p p p p p
⋆p p p p p pp p p⋆
p
⋆ ⋆p p
250
pppp
⋆p p p p p p pp p p pppp
p p p
pppppppppppppp pppppp p
⋆p p p p p p ⋆p p p p p pppppppppp∗ppppppppppppppppppp
⋆pp p⋆p p p pp ∗
p ⋆p p p p⋆p p ⋆ppp p p ⋆p p p p p
100 ⋆p p p⋆p p⋆p p p p⋆p p p p p p⋆p
75
1 5 10 25 50 100 250 500 1,000 2,500 7,500
GA-invocation interval
aver 25 simulation runs which sometimes lasted several million periods—, the
simulations were not repeated when the model was upgraded to utilize all possible
32 bit locations in the two condition parts.
4
There is, however, no monotone relationship between speed of mean reversion and
required time to find the zero-bit solution. For very small ρ, the dividend process
is very close to a white noise process, and agents again experienced difficulties in
detecting this randomness.
7.3 Simulation Results with the Modified SFI-ASM 119
The estimated residual time series ǫ̂t are then analyzed to see whether
they satisfy being i.i.d. and N(0,4) distributed. The results are summa-
rized in table 7.1.
Table 7.1. Time series comparison of the modified SFI (M-SFI) and the
replicated Java SFI-ASM with the original mutation operator (J-SFI). Means
over 25 runs. Numbers in parentheses are standard errors estimated using
the 25 runs. Numbers in brackets are the fraction of tests rejecting the no-
ARCH or iid-hypothesis for the ARCH and BDS tests, respectively, at the
95% confidence level.
Description GA 1,000 GA 250 GA 20 GA 1
M-SFI J-SFI M-SFI J-SFI
Std. Dev. 2.084 2.145 2.141 2.225 2.229 3.397
(.009) (.010) (.013) (.015) (.013) (.034)
Excess kurtosis 0.004 0.085 0.001 0.285 0.050 9.046
(.009) (.015) (.001) (.033) (.011) (1.56)
ρ1 0.011 0.031 0.014 0.025 0.029 0.491
(.002) (.008) (.002) (.012) (.001) (.006)
ARCH(1) 2.610 3.502 2.754 25.34 5.722 1,871.9
[0.20] [0.60] [0.40] [1.00] [0.48] [1.00]
ρ21 0.013 0.020 0.015 0.075 0.020 0.425
(.002) (.002) (.004) (.008) (.003) (.017)
BDS 1.06 1.41 1.10 34.08 1.44 38.63
[0.20] [0.32] [0.24] [0.92] [0.28] [1.00]
Excess return 1.52% 2.92% 1.59% 3.25% 1.51% 25.34%
(.02%) (.03%) (.03%) (.08%) (.03%) (3.41%)
Trading volume 0.244 0.364 0.271 0.854 0.876 1.359
(.008) (.025) (.007) (.065) (.009) (.015)
First of all, one notices that the modified SFI-ASM produces time
series that are usually closer to the hree-benchmark than those of the
original SFI-ASM. The standard deviations in the residuals are gen-
erally smaller, thus indicating less price variability. Excess kurtosis is
almost negligible for both the fast and slow learning cases, which does
not line up very well with empirical fat tailed return distributions.
Yet when further enhancing the learning speed, both the increase in
standard deviation and excess kurtosis suggest that the modified SFI-
model shifts into a more complex regime for faster learning rates than
120 7 A Suggested Modification to the SFI-ASM
5
While LeBaron et al. have reported the results only for the GA-intervals of 1,000
and 250, two additional learning speeds are included in table (7.1). The statistical
tests were performed for even more GA-intervals, in particular, for 100, 50 ,25,
20, 10, 5, 2, and 1.
6
The autocorrelation coefficient for a GA-interval of 2 with a value of 0.06 (stan-
dard deviation 0.0035) is considerably lower than for an invocation interval of
one. This supports the hypothesis that there is a structural break in the model
when using the fastest possible learning speed.
7
Even for an invocation interval of two, the no-ARCH hypothesis cannot be re-
jected for 16% of the test runs.
8
There are two free parameters for this test. The distance r is measured as a
fraction of the standard deviation and has been set to a value of 0.5, while for
the embedding dimension m, a value of two has been chosen.
7.4 Robustness of the Zero-Bit Solution 121
nature of the dividend process without any added noise, agents became
heterogeneous in that most of them decided to use a different technical
trading rule.
In summing up one can state that the simulation results under pe-
riodic and stochastic dividends show that the classifier system works
quite efficiently. When confronted with periodic dividends, it detects
these patterns, yet when working with stochastic data, it also discovers
the “right” solution of non-bit usage. Even though the mean-reverting
dividend process is able to produce short term trends toward its mean,
these are by no means regular. Therefore, in the long run, the stochas-
tic nature of the dividend process dominates any (random) short term
trends and patterns.
Fig. 7.4. Demonstration of the upward pressure on the bit distribution when
switching the consistency check on in mid-simulation.
bits. Using a smaller denominator increases the bit fraction for a given
number of set trading bits. The simple recalculation explains the jump
of the bit fractions the moment the consistency check is turned on.
But remember that these augmented bit fractions for technical and
fundamental bits are then used to determine the dynamically adjusting
bit transition probabilities for the updated mutation operator. Since
the bit-neutral mutation operator applies these elevated bit transition
probabilities on the whole bit string, turning the consistency check on
results in a slight increase in the bit distribution in addition to the one
time effect shown in figure 7.4.
The major result of agents finding the correct hree-solution are usu-
ally replicated with or without the consistency check. It is probably
a useful extension of the SFI-ASM since it allows agents to focus on
creating and testing only logical trading rules. From a computational
perspective, it could even help save computational time since agents do
not need a huge rule set to work with.
8
An Analysis of Wealth Levels
8.1 Introduction
Fig. 8.1. Wealth levels for the situation when one agent includes technical
rules while all others exclude them. Note that the singular agent using techni-
cal rules accumulates significantly more wealth than those agents using only
fundamental rules almost all through the run, and that this difference grows
over time. Source: [206, p. 11]
and some being poor. While the identity of winners and losers changes over time,
the distribution stays rather constant.
130 8 An Analysis of Wealth Levels
learning rates that agents can chose from.3 Given the assumption that
N − 1 traders in the market have adopted a common, but unknown
revision rate K, a single trader has to find his optimal learning speed
K ∗ .4 Since all traders in the market attempt to find their optimal
responses, this situation constitutes a symmetric simultaneous-move
N -person game with a S × S decision matrix in which a single trader’s
dominant strategy would be a dominant strategy for all other agents.
The payoffs in the S ×S decision matrix are determined by fixing the
base revision rate K to one of the S different values, and then recording
the terminal wealth levels that the remaining agent can acquire by
varying his learning rate K. The information in the S × S decision
matrix can be summarized in a reaction function, which assigns to
every K the optimal response K ∗ at which the single trader maximizes
his terminal wealth level.
3
These rates are 5, 10, 50, 100, 250, and 5,000.
4
For simplicity, Joshi et al. assume that all other agents chose the same single
base revision rate K. They claim, however, that their results do not change much
when other agents are allowed to follow mixed strategies.
8.3 Previous Studies Based on Wealth Levels in the SFI-ASM 131
In a similar and related study, Joshi et al. [206] vary their focus and
ask whether agents will choose to include technical trading information
in their trading rules. Throughout this study, a constant learning rate
of K = 100 is used. This is done for two reasons: First, at this rate,
all agents may use technical trading information since they are in the
complex regime. Second, they have shown in their 2002 article that this
would be the learning rate chosen by agents.
The framework is slightly modified from the study above in which
agents had to choose their optimal learning rate. This time, a single
agent faces a classic 2 × 2 decision problem of whether to include the
technical trading information or not, given his assumption that all of
the other traders either include them or not.5 Again, this design resem-
bles a multi-person simultaneous-move game. Thus, the only potential
5
All traders have access to fundamental trading information though. The exclusive
use of technical trading information is ruled out as unrealistic.
132 8 An Analysis of Wealth Levels
Nash equilibria are symmetric in the sense that all agents either include
or exclude the technical trading bits.
Table 8.1. “The decision table for an agent contemplating whether to include
technical trading rules to make her market forecasts, when she is uncertain
whether the other traders in the market are doing so. The agent’s payoff in
each of the four situations A–D is her expected final wealth (divided by 104 ,
to make more readable), derived by averaging the results of 45 simulation
runs of each situation. Error bounds are calculated using standard deviations
of each set of 45 simulations.” Source: Joshi, Parker, and Bedau [206, p. 9]
A: 113 ± 7 B: 154 ± 7
technical rules
excludes
C: 97 ± 7 D: 137 ± 5
technical rules
6
For each cell, Joshi, Parker, and Bedau averaged over 45 simulations, each of
which was run for 300,000 periods.
8.4 Wealth Levels in the SFI-ASM: Alternative Explanations 133
The following sections, however, will provide other answers to the ques-
tion of what determines the wealth accumulated by agents. Section 8.4.1
analyses the model design and finds that the economic explanations for
absolute wealth levels given by Joshi et al. [206] turn out to be conjec-
tures that are not supported by the SFI-ASM design. This theoretical
analysis is supplemented by some simulation evidence in section 8.4.2.
The question of relative wealth, i.e., why certain trader types become
richer than others, is addressed in section 8.4.3.
xi,t being the amount of stock held by agent i. Equation 8.1 denotes the
previous wealth before adjustment, and equation 8.2 takes interest and
dividend earnings into account, while equation 8.3 lowers cash through
tax payments at a tax rate equal to the risk-free interest rate. Equations
8.1 − 8.3 can be summarized as
which is the equation with which the cash positions of traders in the
model are updated. Note that for risk aversion, the term in parentheses
reflects a positive risk premium, and Ci,t > Ci,t−1 holds.
Gulyás et al. [166] point out that in the SFI-ASM, agents usually
increase their wealth more or less independent of their actions. It is,
therefore, possible to determine a benchmark wealth level under the
assumption of inactivity. Agent i would hold onto his one unit of stock,
i.e., xi,t = 1 for all t, and would take the market price as given. In this
case, equation 8.4 would simplify to
Since wealth is defined as an agent’s cash plus the value of his stock
position, which is xi,t = 1 in this case, the first benchmark wealth level
is calculated as
Wi,t = Ci,t−1 + dt + pt (1 − rf ) (8.6)
and will be referred to as base wealth.
To determine an agent’s base wealth, however, it is necessary to
have an actual simulation run to have stock prices with which stock
positions can be valued. One way to determine an absolute benchmark
wealth level without the need for a simulation is to assume that the
market is run in hree-mode, i.e., not only one, but all agents are inac-
tive, holding onto their one unit of stock, collecting its dividend, and
receiving interest on their cash. An approximation for hree-base wealth
can be obtained by substituting the simulated prices and dividends in
period t by their theoretical averages d and phree .7 Since these are con-
stants, the recursive relationship of equation 8.5 can then be written
as a function of initial cash endowment C0 , i.e.,
Wthree ≈ Cthree = C0 + t d − rf phree . (8.7)
Since agents are identical, an agent subscript is not needed. In the long
run, the value of one unit of stock becomes negligible in comparison
to the cash value, hence, the theoretical hree-base wealth level Wthree
in period t is reasonably well approximated by hree-base cash. Again,
risk aversion implies that phree < d/rf , hence, 8.7 states that hree-
base wealth Wthree grows linearly in t and that the only source for
7
The hree-price was derived in section 6.5. It is a linear function of agent’s risk
aversion, the risk-free interest rate, the theoretical dividend mean, speed of mean
reversion of the dividend process, and the variance of the dividend noise process.
8.4 Wealth Levels in the SFI-ASM: Alternative Explanations 135
wealth accumulation in the SFI-ASM are the risk premiums that agents
collect.8
Table 8.2. Some numerical examples for hree-prices and hree-base wealth
levels at different periods. Other parameter values were ρ = 0.95, rf = 0.1,
d¯ = 10, and C0 = 20, 000.
125,000 .........
.........
.........
.........
.........
.........
.........
.........
100,000 .........
.........
.........
.........
.........
.........
.........
.........
75,000
.........
.........
.........
.........
.........
.........
.........
.........
.........
50,000
.........
.........
.........
.........
.........
.........
...........
................
......................
25,000 ..............................
...............................
... .................................
.........................................
.............................................. .
................................................. .......................................
.................................................... ..........................................................................................
................................................ ...................................................................... ...........................................................................................................................................
......................................................................................................................................................................... . ........
....................................
................................................ .................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................
0
.. .. . . .
.................................... .
.........................................................................................................................................................................................................................................................................................................................................
........................................... ................................................... .............................. .....................
................................... .................................................
...................................... .........................................................................................................
Fig. 8.3. Wealth differences between SFI and non-classifier agents, both
trader types using select best for rule selection, recorded for different GA-
invocation intervals and averaged over 25 simulation runs. Positive values in-
dicate that SFI agents outperform the non-classifier agents. The shaded area
indicates mean wealth difference ± one standard deviation.
formation.9 Figure 8.3, however, tells a different story. While for most
GA-invocation intervals the classifier agents acquire more wealth, there
is a significant dip in the curve when non-classifier agents do better
than their SFI-counterparts. Better performing non-classifier agents,
however, are a clear indicator that reasons other than pattern recog-
nition and exploitation are responsible for differences in accumulated
wealth levels.10
Since the risk premium is the only source for wealth accumulation in
the model, one obvious conclusion is that wealthier trader groups hold,
on average, more risky stocks. Figure 8.4 verifies this hypothesis by
plotting the acquired wealth levels of individual SFI- and non-classifier
traders as a function of their average stock holdings over their entire
lifetime.11
Besides the highly linear relationship between average stock holdings
and final wealth, figure 8.4 also shows that the non-classifier agents, as
a group, hold more stock than SFI agents. When regressing individual
wealth Wi on average stock holdings xi ,
Wi = a0 + a1 xi , (8.8)
9
That hypothesis was posited before I actually established that risk premiums are
the only source of possible wealth accumulation in the SFI-ASM.
10
Initially, I doubted the validity of the simulation results in [206]. In [105], I re-
ported that wealth levels of classifier and non-classifier agents rise equally after
some initial divergence during the warm-up phase, which is in contradiction to
figure 8.1 with diverging wealth slopes. I then noticed, however, that while the
absolute wealth differences were tiny, the classifier agents did slightly better in
23 out of 25 simulation runs. A paired two sided t-test then revealed that these
minuscule wealth differences were significant, sometimes even at the 1% percent
level. It was this outperformance of the classifier agents that prompted me to look
for alternative explanations as to how wealth differences in the SFI-ASM come
about.
11
The specific parameter combination was chosen in order to clearly visualize that
the two trader groups hold, on average, different amounts of stock.
138 8 An Analysis of Wealth Levels
• • ו• • ×
× ×• × •
300,000 •×
× ×× ×
××
275,000 × ××
×
×× ××
250,000 × ×
0.7 0.8 0.9 1.0 1.1 1.2
average stock holdings of an individual trader
Fig. 8.4. Final wealth levels of SFI- and non-classifier agents as a function
of average stock holdings (period 250,000; GA-interval = 250; risk aversion
=0.3; 10 trading rules; 25 SFI- and 25 non-classifier agents).
led to different tax payments which, when added up, may explain the
difference in final wealth.
15000
5000
Wealth Difference
-5000
-15000
-25000
-35000
100 Rules
-45000 75 Rules
5 50 Rules
10
25 25 Rules
50
75 10 Rules
100
250
GA-interval 500 5 Rules
750
1000
Fig. 8.5. Wealth differences between SFI and non-classifier agents for select
best as a function of learning speed and number of trading rules they possess.
Data were averaged over 25 simulation runs. Note that non-classifier agents
outperform the SFI agents for most parameter combinations (negative wealth
differences) and that a peak in the upper left corner reaching as high as 300,000
was truncated for better visibility.
5000
-5000
-10000
Wealth Difference
-15000
-20000
-25000
-30000
-35000
-40000
Fig. 8.6. Wealth differences between SFI and non-classifier agents for select
roulette as a function of learning speed and number of trading rules they
possess. Data were averaged over 25 simulation runs. Note that non-classifier
agents outperform the SFI agents for most parameter combinations (negative
wealth differences).
the absolute wealth levels of one agent type alone. They are shown in
the top sections of figure 8.7 for SFI agents and 8.8 for non-classifier
agents. It is interesting to note that subtracting the absolute wealth
levels of non-classifier agents from those of SFI agents does not yield the
same wealth differences as depicted in figures 8.5 and 8.6. Obviously, the
co-evolution of two competing agent types creates a different economic
environment in terms of aggregate price level and risk premium than
just having one agent type in a simulation.
The middle and bottom sections in figures 8.7 and 8.8 point to
another reason why wealth levels may vary for different trader types
and learning speeds. Even when told to use select best or roulette wheel
selection for forecast production, SFI agents may not be able to do so
142 8 An Analysis of Wealth Levels
450000
475000
425000
Wealth
400000
375000
350000
100 Rules 325000
100 Rules 300000 75 Rules
1000
750
75 Rules 50 Rules
1000
500
750
250
50 Rules 25 Rules
500
100
75
250
25 Rules 10 Rules
50
100
25
75
5 Rules
10
10 Rules
50
5
25
10
5 Rules
5
2500
2000
2500
Active Rules
1500
2000
1500
1000
1000
500 500
100 Rules 0
75 Rules 100 Rules 0
1000
75 Rules
750
1000
750
50 Rules 50 Rules
500
500
250
250
25 Rules
100
25 Rules
100
75
10 Rules
50
75
25
10 Rules 5 Rules
50
10
25
5 Rules
10
5
8000000 8000000
Select Average
6000000 6000000
4000000
4000000
2000000
2000000
100 Rules 0
100 Rules 0 75 Rules
1000
75 Rules
750
1000
50 Rules
500
750
50 Rules
250
500
25 Rules
100
250
25 Rules
75
100
10 Rules
50
75
25
10 Rules
50
5 Rules
10
25
GA-interval
5
5 Rules
10
GA-interval
5
Fig. 8.7. Top: Final wealth levels for SFI agents with select best (left) and
roulette wheel selection (right). Middle: Number of activated rules. Bottom:
Number of “select averages” during simulation. Data were averaged over 10
simulation runs.
425000
475000
Wealth
375000 425000
375000
325000 325000
100 Rules 100 Rules 275000
275000
75 Rules
1000
75 Rules
1000
750
50 Rules
750
500
50 Rules
500
250
25 Rules
250
100
25 Rules
100
75
10 Rules
75
50
10 Rules
50
25
5 Rules
25
10
5 Rules
10
5
5
3000 3000
2500 2500
Active Rules
2000 2000
1500 1500
1000 1000
500 500
100 Rules 100 Rules 0
0
75 Rules 75 Rules
1000
1000
750
750
50 Rules
500
50 Rules
500
250
250
25 Rules
100
25 Rules
100
75
75
10 Rules
50
10 Rules
50
25
25
5 Rules
10
5 Rules
10
5
5
8000000 8000000
Select Average
6000000 6000000
4000000 4000000
2000000 2000000
750
50 Rules 50 Rules
750
500
500
250
25 Rules
250
25 Rules
100
100
75
10 Rules
75
10 Rules
50
50
25
5 Rules
25
10
5 Rules GA-interval
10
GA-interval
5
5
Fig. 8.8. Top: Final wealth levels for non-classifier agents with select best
(left) and roulette wheel selection (right). Middle: Number of activated rules.
Bottom: Number of “select averages” during simulation. Data were averaged
over 10 simulation runs. Note that the wealth peak in the upper left corner for
select best, reaching as high as 1.7 million, was truncated for better visibility.
thus influences the model behavior. The relative frequencies with which
those methods are used depend on the the parameterization, especially
on learning speed, and will thus lead to different model behaviors. I
subsume both fall-back mechanism in the SFI-ASM as select average.
The bottom section in figure 8.7 shows how SFI agents increasingly
resort to select average when the number of activated rules becomes
smaller.
Non-classifier agents, on the other hand, are not plagued by the
problem of a diminishing active rule set. Their rules are always acti-
vated (middle section of figure 8.8) and hence, there is no need for them
to resort to any of the fall-back methods (bottom section). Even though
non-classifier agents indeed stick to the intended selection mechanism,
we still see a wealth peak for GA-intervals around 50 and a valley at
about 500 for roulette wheel selection. These remaining wealth differ-
ences most likely reflect the working of the GA which changes the fore-
cast parameters, yet other still unidentified influences are possible, too.
Separating and attributing price and wealth effects to each individual
factor seems impossible, though. Changing one factor affects the price
series, which in turn will affect the GA and the forecast parameters,
which in turn will affect the triggering change. Because of these inter-
dependencies, an exhaustive explanation of price and wealth behavior
cannot be given.
Identifying all causes and interdependencies of wealth dynamics,
however, is not necessary at this point and would not justify the com-
putational expenses. All that is necessary is to show that some reasons
other than pattern recognition and exploitation influence the wealth
distribution to a much greater extent than technical pattern exploita-
tion ever could. Even though the game-theoretic analyses by Joshi et
al. themselves are correct, they are based on wealth levels that should
not be given any economic meaning.
Up to this point, we have not addressed whether one of the two
selection mechanisms, select best or select roulette, is better suited.
However, section 4.4.2 has already discussed the problem of scaling
invariance for simple roulette wheel selection. Remember that scaling
invariance means that adding an offset to all fitness values tends to
equalize the selection probabilities in roulette wheel selection. In order
to avoid negative fitness for trading rules, a constant C with a value
of 100 is added to each raw fitness.13 This effectively leads to almost
uniform selection probabilities. The fitness information is thus widely
neglected, especially in large active rule sets. Hence, we can safely as-
13
See also equation 6.8 on page 98.
8.5 A Verdict on Wealth Analysis in the SFI-ASM 145
This chapter has clearly demonstrated that the only source for wealth
accumulation in the SFI-ASM framework is the aggregate risk pre-
mium. Since simulated prices are usually below the hree-benchmark,
wealth generated in the hree-regime can be said to constitute a lower
bound for wealth in non-hree simulations. In a way, the smartest agents
who are able to coordinate on the correct hree-solution make the least
amount of money. It was furthermore shown that an agent’s wealth is
highly affected by the size of his activated rule set and the selection
procedure that he uses to choose a trading rule to act upon. Last, but
not least, agents in the SFI-ASM are modeled as myopic by having
a constant absolute risk aversion utility function. Since the CARA-
assumption implies that an agent’s wealth level has no impact on his
demand function, wealthier traders do not have a greater impact on
market prices than poorer traders [242]. Because of all these issues, the
SFI-ASM is highly unsuitable to address issues of wealth accumulation.
The game-theoretic analyses by Joshi et al. thus rest on simulation re-
sults that should not be given any economic meaning.
It was also noted before that the trading rules in the SFI-ASM are
not evaluated according to their wealth generating capabilities, but
on their forecast accuracy. Wilpert [437] based the fitness evaluation
of trading rules according to the wealth they would have generated if
used by agents.14 Equation 6.7 on page 98 as the main determinant of
rule fitness (equation 6.8) would have to be adapted to
1 1
fW,t = 1 − fW,t−1 + [pt + dt − (1 + rf )pt−1 ] x
t−1 , (8.9)
θ θ
with fW,t being a raw fitness values based on wealth, and x t−1 the op-
timal demand of last period’s portfolio decision. The term in brackets
multiplied by that optimal demand would denote an excess profit. It
is now a legitimate question to ask whether this type of rule evalua-
tion is still justified since we just have derived that wealth levels in the
SFI-ASM are economically not meaningful. There is, however, noth-
ing wrong with this profit based rule evaluation. Agents derive their
optimal demands by neglecting any subsequent wealth taxation. Since
14
This type of fitness evaluation was first suggested by Brock and Hommes [54].
146 8 An Analysis of Wealth Levels
9.1 Introduction
The first signs that a superficial look at the aggregate bit level is in-
appropriate in judging the emergence of technical trading appeared in
figure 7.3 on page 122. It was obvious that certain bits within the econ-
omy were often set in multiples of 100, which is the number of trading
rules agents possess. This implied that some bit positions in the rule
sets of bit-neutral agents were completely filled with 0- or 1-bits while
other bit positions seemed to be completely neglected. While figure 7.3
was derived for a noiseless periodic dividend process, the bit behavior
should have been similar for stochastic dividend processes if techni-
cal trading had emerged. Under the standard AR(1) dividend process,
the number of instances of bit fixations at the 100% level almost com-
pletely disappeared for bit-neutral agents. The few exceptions were seen
as temporary lock-ins at a suboptimal solution since in the long run,
they also ended up with the zero-bit solution.
The simulation evidence of certain bit positions in an agent’s rule be-
coming completely filled with non-# bits is consistent with the expected
long run asymptotic behavior of genetic algorithms. It was shown by
Holland that once an allele or a schema has been found to be useful,
it will spread through the whole population [182]. In the long run, all
chromosomes in a population will become identical. Once an agent has
found a good trading rule, one would expect the complete rule set to
be characterized by bit fixations at either the 0 or 100% level.1
Therefore, the remark by LeBaron et al. that bit costs were intended
to ensure that each bit is actually useful in improving the forecast
ability of a trading rule takes on a different meaning [244, p. 1497].
For a long time I, had interpreted that remark literally, i.e., I simply
assumed that it meant one single bit in one particular trading rule.
It now becomes clear that “each bit” only makes sense if it refers to
1
GAs that employ niching methods [155] such as fitness sharing and crowding are
capable of locating and maintaining several solutions within a single population.
Mahfoud [270] derives lower bounds for the population size to maintain a desired
number of niches. An overview of niching methods can be found in [272].
9.2 Technical Trading and the Aggregate Bit Level 149
2
Depending on the interpretation, the “each bit” formulation in their papers may
not be wrong, but I would call it at least unfortunate. It may have caused consid-
erable confusion and misinterpretation in the area of genetic algorithms among
novices, such as I was when I started working with the SFI-ASM.
150 9 Selection, Genetic Drift, and Technical Trading
When looking at figure 7.1 on page 116, one notices that there are
some latecomer agents who experience difficulties in finding the correct
hree-solution. They appear to be temporarily locked into a suboptimal
solution, i.e., one or two specific trading conditions are set to either
zero or one in all of their trading rules. Since these bit fixations cannot
be changed by the the crossover operator, they are effectively fixed.
Only mutations and generalizations are able to change several of these
bits such that the #-bits can finally take over. It is obvious that this
may take a long time to happen. Some long run testing with bit costs
greater than or equal to 0.01, however, yielded that the latecomers
are eventually able to arrive at the zero-bit solution, too. The question
arises, though, as to why some of those bit fixations arose to begin with.
Was it a random mistake on the agents’ part, or did they temporarily
discover some useful trading information?
Another peculiarity that figure 7.1 reveals is the apparent speed
with which the majority of agents is able to find the zero-bit solution.
No matter what GA-invocation interval is used, the first agents to dis-
card their classifier system need only between 10 to 20 GA-invocations.
Sixty to seventy percent of the other agents follow shortly after. This,
however, seems quite fast for a GA-learning algorithm to discover the
optimum solution. It is only later on that the discovery process slows
down.3
The simulation results with the modified SFI-ASM have so far un-
equivocally shown that the zero-bit solution is robust. My growing con-
viction of its validity, however, was shaken when I ran the model under
no bit cost. If small efficiency gains from using condition bits had been
overcompensated by the associated bit costs, agents would have been
forced into the zero-bit solution. Thus, the model behavior was inves-
tigated under zero bit cost. To exclude that the bit decreasing effect
3
I should add at this point that figure 7.1 was derived with the consistency check
switched on. Since the rule set then contains only logical trading rules, the num-
ber of rules were reduced to 50. The reason why this might have an impact on
the speed with which agents find the zero-bit solution will become clear in the
following sections.
9.3 The Zero-Bit Solution: Some Disturbing Evidence 151
Fig. 9.1. Aggregate bit level in the modified SFI-ASM for zero-bit costs,
no generalization, 25 bit-neutral agents with 100 trading rules, an initial bit
probability of 0.05, a crossover probability of 0.1, and a GA-invocation interval
of 100. In spite of zero-bit costs, 4 agents arrived at the zero bit level for
fundamental bits and 5 agents for technical bits within less than 17,000 peri-
ods.
Fig. 9.2. Aggregate bit level in the modified SFI-ASM for zero-bit costs,
no generalization, 25 bit-neutral agents with 100 trading rules, an initial bit
probability of 0.15, a crossover probability of 0.1, and a GA-invocation interval
of 100. Under this parameterization, no zero-bit agents appeared within the
first 30,000 periods.
agents appeared within the first 30,000 periods and the aggregate bit
level seemed to have reached its long run equilibrium. Note that the
only difference in the parameterization between figures 9.1 and 9.2 is
the initial bit level with which trading rules were initialized.
4
Genetic drift is also sometimes characterized as random fluctuations in finite
populations, “evolutionary forgetting”, or allele loss. It is occasionally referred to
as the Sewall Wright effect, named after one of the founders of population genetics.
Genetic drift in the context of population genetics is discussed, for instance, in
[414, 170, 174].
154 9 Selection, Genetic Drift, and Technical Trading
with b̄ being the average gene value. The offspring generation is created
through random sampling, i.e., one offspring is obtained by randomly
choosing one of the µ parents, a procedure that is repeated µ times.
Beyer then derives the expected population variance of the offspring
generation as
2 µ−1 2
σO = σP . (9.2)
µ
Equation 9.2 demonstrates that random selection in finite popula-
tions reduces the expected population variances. A similar calcula-
tion of genetic drift based on population variance was done by Rogers
and Prügel-Bennet who then applied their results to various selection
schemes such as steady generational replacement, state selection, and
generation gap methods [358].
1 − e−2N sp
U (p) ≈ . (9.3)
1 − e−2N s
1.0 ....
......
.............................................................................................................................................................................................................................................................................................
..............
....................
.. ..................
...... ...... .....
... ........... ...... ........ .....
... ......... ...... .... .. .. ..
..... ........ ...... ..... ... .. ..
0.9 50 ....
......
.... ...........
. .. ......... ......... ..... .... .....
...... ........ ......
. . .. ..
Fixation Probability U (p)
.. ...... ..... ... ...
... ..... ......
.
..... ....... ... ...
... .... ...... .... . ... ..
..... ..... ..... .......
0.8 ..
...
...
.....
....
....
..........
.....
.....
.
. ........
.....
.
.....
.. ....
.
....
.
....
..
.. .
. ... .....
.. ..
... .....
.... ..... ..... .... ...
... .... .... .... .... ... ..
... .... ..... ..... .... ...
0.7 .....
.... 5 .. ...
....
.....
..
. .....
... .....
.....
.........
........
....
...
.....
.
.....
.
...
.
...
...
.
.
.
.
..
...
..
. ... ..... .... ..... .. ...
... ... ..... ..... ..... ... ...
... .. ..... .... ....
0.6 ...
.....
...
.
.
...
...
1 ....
........
.....
.....
.........
.....
....
........
....
.
.
...
...
... .
.
.
...
...
...
.. ..... .... .... ...
... ... ..... ..... .... ... ...
... .. ..... .... .... ...
0.5 ..
...
.... ..
...
.
...
...
....
.. .
.....
.....
....
.....
........
.....
.....
.....
........
....
...
..
.
...
.. ...
.
.
..
...
... .... ..... ..... ...
... .... .... ...
... ... .... ..... .... .. ...
0.4 .
...
..... ..
...
.
....
.....
.....
......
.
....
....
.
.....
........
.
.....
.....
.........
.....
−1 ...
....
.
... ..
.
.
..
...
... ... ..... ..... ..... .. ...
... ... .... .... ..... ... ...
... .... ..... ..... ...
0.3 .....
..
.
.
...
...
.
....
....
.......
.
....
.....
........
....
.....
. ..
. .....
−5 ....
....
.... .
..
...
...
.
... ... .... ..... ..... .... ...
... ... .... .... ..... .... ...
... ... .... ..... ..... ....
0.2 ..... ....
.. ...
... .. ....
.
...
.....
.
.... ........
.... ..
......
......
.....
.... . ........
.....
....
.....
..
......
..
...
.
....
The idea that the fixation of trading bits at the 100% level in the SFI-
ASM necessarily reflects their advantageousness is obviously equivalent
to the position held by the selectionist faction. The view that vanishing
trading bits are likewise a result of selection would equally be favored
by the selectionists.
The emergence of zero-bit agents under no bit costs, however, clearly
points to genetic drift as a major evolutionary force in the modified SFI-
ASM. Since genetic drift has not been taken into account yet, the sim-
8
The N in equation 9.3 actually refers to effective population size. In order to
determine effective population sizes, evolutionary biologists make adjustments
for unequal sex ratios, strong sexual selection, or fluctuating population sizes
[80, 174]. In genetic algorithms, such adjustments are usually unnecessary. If,
however, the SFI-ASM founders would have opted to prevent illogical trading
rules from reproducing, the effective population size would have been lower than
the number of trading rules.
158 9 Selection, Genetic Drift, and Technical Trading
Table 9.1. Typical rule set of a bit-neutral “latecomer” agent. The rule set
contains 100 trading rules, each with 64 trading bits. The last row shows the
sum of all non-# bits.
1 2 3 4 5 6 7 · · · · · · 58 59 60 61 62 63 64
1 0 # 0 # # # # ······ # 1 # # # # #
2 # # 0 # # # # ······ # 1 # # # # #
3 # # 0 # # # # ······ # 1 # # # # #
.. .. ..
. . .
98 # # 0 # # # # · · · · · · # 1 # # # # #
99 # # 0 # # # # · · · · · · # # # # 1 # #
100 # # 1 # # # # · · · · · · # 1 # # # # #
1 0 100 0 0 1 0 0 99 0 0 1 0 0
Table 9.2. Typical rule set of a SFI agent. The rule set contains 100 trading
rules, each with 64 trading bits. The last row shows the sum of all non-# bits.
1 2 3 4 5 6 7 · · · · · · 58 59 60 61 62 63 64
1 # 0 # 0 # 1 # ······ # # 1 # 0 # #
2 # # # 1 # 1 # ······ # # 1 # 0 # #
3 # # # 0 # 1 # ······ # # 1 1 0 # #
.. .. ..
. . .
98 # # # # # 1 # · · · · · · # # # # # 1 #
99 # 0 # 0 # 1 # · · · · · · # # 1 1 0 # #
100 # # # # # 1 # · · · · · · # # # # 0 1 #
1 4 0 96 0 100 0 0 1 93 5 99 5 0
A typical rule set of SFI agents shown in table 9.2 reveals that there
are indeed bit positions that have fixated. Instead of providing data on
the aggregate bit level in the economy, this could have been a much
stronger argument for emergent technical trading. After all, those bits
close to the 100% level have emerged from low initial bit frequencies.
But again, it is possible that those bit positions became fixed through
random genetic drift and not through selective forces. If we compare the
two rule sets shown in tables 9.1 and 9.2, one notices several things.
First, compared to bit-neutral latecomers, SFI agents have more bit
positions that are close to zero, but not exactly zero. Second, some bit
positions in SFI-rule sets tend to be only “nearly” fixated, i.e., they
are also close to, but not exactly at the 100% level. In the rule sets
of bit-neutral agents, the fixation is usually complete. Both effects are
probably an effect of the SFI-mutation operator which is more likely,
i.e., with probabilities of two thirds, to introduce a 0- or a 1-bit when
it finds a #-bit at a particular position, or to change non-# bits to the
don’t care symbol.
It is also possible to offer an explanation for the apparent path
dependency of equilibrium bit levels. In section 6.7.2, it was pointed
out that the equilibrium bit frequencies differ when the trading rules
are initialized with different bit probabilities. Equation 9.3 showed that
the fixation probability of trading bits increases for higher initial allele
frequencies. Starting at higher initial bit probabilities, thus, will lead to
more bit fixations and a higher aggregate bit level. It does not matter
at this point whether bit fixation is a result of genetic drift or selection,
but once it has occurred, it is hard to reverse.
9.6 Fitness Driven Selection or Genetic Drift? 161
How much selective pressure is necessary to ensure that the two model
versions of the SFI-ASM are working in the regime where selection
dominates genetic drift? Remember that section 9.5 established that
the condition s > 1/(2N ) has to be satisfied for selection to be primarily
responsible for allele frequencies. Certainly, this is more a rule of thumb
than a strict border separating the two regimes, but it will help us get
an idea of how much a trading bit has to improve a rule’s forecast
accuracy in order to be selected.
An individual’s i relative fitness wi was defined as its absolute fit-
ness fi over mean population fitness f¯, and the selection coefficient
in favor of individual i as s = w − 1 with s ∈ [−1, ∞). Inspection of
typical trading rule sets shows the mean fitness values of trading rules
to be mostly in the interval 92–94. Since we are only interested in an
estimate of the magnitude of the fitness differential, the exact value
does not matter that much. If we then assume that mean fitness is
93.0, by how much does a rule’s fitness value have to be improved by
an additional trading bit such that this bit is likely to become fixated
through selection? Substituting s with w − 1 and rearranging equation
9.4 yields
¯ 1
fi > f 1 + (9.5)
2N
1
> 93.000 1 +
200
> 93.465.
1400
1200
number of instances
1000
800
600
400
200
0
0 0,1 0,2 0,3 0,4 0,5 0,6 0,7 0,8 0,9 1
bit frequency
Fig. 9.4. Number of bit positions that have a certain bit frequency within an
agent’s rule set. Data were obtained at period 250,000 from a single GA-only
simulation run with 1,000 SFI agents, each endowed with 100 trading rules
(Π = 0.9, π = 0.03, bit costs = 0, GA-interval = 1,000).
Figure 9.4 shows the bit frequency of bit positions in the rule sets of
SFI agents. A bit frequency of zero means that a certain bit position in
an agent’s rule set has no set trading bits at all. A bit frequency of one
refers to the situation when all 100 trading bits are set to either zero or
one. In a simulation run with 1,000 SFI agents who all possessed 100
trading rules consisting of 64 trading bits, a total of 64,000 bit positions
could be investigated at once. Figure 9.4 shows that a bit frequency of
zero is the most common one, even though it does not fit very well into
the overall shape of the remaining histogram. Apart from this outlier,
the bit frequencies between 0.15 to 0.35 are the most common ones.
It is also worth noticing that a few bit positions with a bit frequency
of one have emerged even though the initial bit probability was set at
0.05.
164 9 Selection, Genetic Drift, and Technical Trading
1200
1000
number of instances
800
600
400
200
0
0,0 0,1 0,2 0,3 0,4 0,5 0,6 0,7 0,8 0,9 1,0
bit frequencies
Fig. 9.5. Number of bit positions that have a certain bit frequency within an
agent’s rule set. Data were obtained at period 250,000 from a single GA-only
simulation run with 1,000 SFI agents, each endowed with 100 trading rules
(Π = 0.9, π = 0.03, bit costs = 0, GA-interval = 50).
45.000
40.000
number of instances
35.000
30.000
25.000
20.000
15.000
10.000
5.000
0
0 0,1 0,2 0,3 0,4 0,5 0,6 0,7 0,8 0,9 1
bit frequencies
Fig. 9.6. Number of bit positions that have a certain bit frequency within
an agent’s rule set when bit costs are positive. Data were obtained at period
250,000 from a single GA-only simulation run with 1,000 SFI agents, each
endowed with 100 trading rules (Π = 0.9, π = 0.03, bit costs = 0.005, GA-
interval = 5).
13
Under positive bit costs, the 20 worst trading rules were replaced. With no other
fitness influence than bit costs, the exact value does not matter, i.e., the bit
distributions for different bit costs would look identical.
166 9 Selection, Genetic Drift, and Technical Trading
60.000
50.000
number of instances
40.000
30.000
20.000
10.000
0
0 0,1 0,2 0,3 0,4 0,5 0,6 0,7 0,8 0,9 1
bit frequencies
Fig. 9.7. Number of bit positions that have a certain bit frequency in the
rule sets of all bit-neutral agents when bit costs are zero. Data were obtained
at period 250,000 from a single GA-only simulation run with 1,000 bit-neutral
SFI agents, each endowed with 100 trading rules (Π = 0.9, π = 0.03, bit costs
= 0, GA-interval = 5).
There are 3,403 out of 64,000 possible bit fixations at the 100%
level, that is, 5.3% of trading bits became fixated. Given an initial bit
probability of 0.05, this is remarkably close to the theoretical prediction
of 5% by equation 9.3. The bit equilibrium distribution in the presence
of bit costs is trivial—there are no remaining bits at all.
The previous sections have shown that replacing the original mutation
operator with an unbiased operator will result in drastically different
9.7 The Effect of Mutation on Genetic Drift 167
pt+1 = (1 − pt )u − pt µ. (9.6)
14
The mean of a Beta distribution with parameters α and β is α/(α + β). The mode
of a Beta distribution (α − 1)/(α + β − 2) is only defined for α > 1 and β > 1.
168 9 Selection, Genetic Drift, and Technical Trading
10.0 ..
......
... ...
... ...
..... ..... 80
.. ..
... .....
... ...
... ...
.... ...
... ...
7.5 ..... ...
Probability Density
...
.... ...
.... ...
... ...
...
.... ...
0.1
..
. ...
... . ... .
... .
.
. ... ...
... ..... ... ...
... . ... ...
. ...
5 ...
... . ... .
.
... . ... ..
... ... ... ...
... ... ... ...
... ... ... ..
... ..
.
. .
. ...
.
.
... ...
... ... . ..
...
... ... .................. .... ...
... ... ...... .......
...
... ...... ......
...
... . .
......... .......
. .. .
2.5
. ... ..... .
... .... .. .
...
10
... .... ... ... .....
.... .. ... ..... ...
... ... .... ... .... ...
... .... ....
...
... .
.....
. ...
.
.
...
. ....
. .
.
. . ... .... .
... .... ... .... ...
... .... ...
... .... ...
1
. .... . .
.....................................................................................................................................................................................................................................................................................
..... .
.....
....... ..
.
.
.... ....
.
...
. ..... . ........
............ .. ... ..... ..
................................................. .. . ............
....................................................................................................................................................................
0.0
...
......................................................................................................... ...............................................................................................................
2.0
......................
....... .....
..... .....
..... .....
..... ....
...
1.75
...
.... ...
.. ...
Probability Density
.. ...
..... ...
... ...
1.5
. ...
.... ...
.... ...
...
... ...
1.25
.
...
.
...
...
...
. ...
... ...
1.0
. ...
...
. ...
.... ...
...
.... ...
0.25
.. ...
.
... ...
.
.... .....
...
. .....
........
. .....
......
0
...... ...
...............................................................................................................................................................................................................................................................................
Fig. 9.9. Equilibrium distribution of allele frequencies under genetic drift and
mutation with the original SFI-mutation operator.
The effective mutation rates u and µ for the bit-neutral agents are
dynamically self-adjusting. We can, however, generate a probability
density plot for equilibrium allele frequencies if we keep the bit tran-
sition probabilities from the bit transition matrix fixed at their values
for the initial bit frequency of 0.05. The effective mutation rates from
A to #-bits and vice versa are not symmetrical anymore and equal
u = 0.95Ππ = 0.95 × 0.9 × 0.03 = 0.02565 (9.10)
µ = 0.05Ππ = 0.05 × 0.9 × 0.03 = 0.00135. (9.11)
170 9 Selection, Genetic Drift, and Technical Trading
1.25 ...
...
...
...
1.00
...
Probability Density
...
...
...
...
...
...
...
0.75 ...
...
...
...
...
...
...
0.50
...
...
...
...
...
...
...
0.25
...
...
....
....
......
.......
..........
.................
.................................
.....................................................................................
0.00
.............................................................................................................................
0.04 ...
.....................
.... .......
......
... .......
.......
..... ......
.......
....
Probability Density
.......
.... ......
.......
0.03
... .......
.......
..... ......
.......
.... .......
... .......
.......
..... .......
.......
.... ......
......
.... .......
0.02 ...
.....
.......
.......
.......
.......
........
.... ........
.... .........
.........
... .........
..........
..... ...........
The previous analysis has shown that the original mutation operator
is more efficient in combating genetic drift than the new bit-neutral
operator. Its effective mutation rates for individual bits are higher and
it is more disruptive. The difference in disruptiveness between the two
operators is highest when the bit level has reached zero. For the bit-
neutral mutation operator, the zero-bit level is truly an absorbing state.
It becomes inactive and the genetic algorithm converges prematurely.
16
The critical population size for the original mutation operator turns out to be
only 28 trading rules.
172 9 Selection, Genetic Drift, and Technical Trading
Even though the built-in upward bias of the original mutation op-
erator may be questionable at first sight, a deeper analysis reveals that
it is superior to the bit-neutral design. With the benefit of hindsight,
the new mutation operator did not effectively combat genetic drift; it
even became subject to genetic drift itself. Since the new operator was
designed to keep the bit fraction before and after mutation, on aver-
age, unaltered, the bit level in an agent’s rule set can successively be
changed in one direction several periods in a row by pure chance.17
Eventually, one of the two absorbing states will be reached. With low
initial bit probabilities, the absorbing state at the zero-bit level is the
more likely one.
As it turns out, the problem with the original SFI-ASM is not in model
design, but in the interpretation of certain simulation results. Given the
original mutation operator’s fixed point of one half, an isolated look at
the aggregate bit level to detect the existence of technical trading is not
sufficient. Without prior knowledge of the degree to which the increase
in learning speed affects the aggregate bit level through the biased
mutation operator, additional tests need to be done to establish the
existence of technical trading in the model.
Note: Means over 25 runs. Numbers in parenthesis are standard errors estimated using the 25 runs.
table 9.4 show the pure forecast accuracy of trading rules, independent
of their specificity.
Table 9.4. Comparison of bit cost adjusted mean fitness values for SFI- and
non-classifier agents. for. Fitness values were averaged over all rules of one
agent type within one simulation run.
Table 9.5. Comparison of bit cost adjusted maximum fitness values for SFI-
and non-classifier agents. For each simulation run, the maximum fitness value
of all rules over all agents within the economy was used.
their forecast production. When using only the best trading rule per
simulation run, one suddenly realizes that SFI agents are able to pro-
duce significantly better trading rules than non-classifier agents. Both
test statistics from a paired t-test are highly significant for both the
one- and the two-sided versions of the test. One also notices that in
the slow learning regime, the best trading rules produce slightly better
forecasts than in the fast learning case.18
The analysis of fitness information finally proves beyond a doubt
that there is indeed technical trading in the model. A superficial look at
the aggregate bit level or a reference to remaining predictability in the
price series does not have the same power of persuasion than the fitness
information stored for each trading rule. If the authors of the original
SFI-ASM would have performed such a fitness analysis to begin with,
no questions would have arisen about whether the continued existence
of trading bits reflects technical trading or not.
18
For the best trading rules of SFI agents, the t-statistics for an unpaired t-test
with unknown and unequal variances is 0.81. The t-value is only 0.48 for the best
trading rules of non-classifier agents, hence, in both cases, the forecasting quality
in the slow learning regime is better, but not significantly better.
176 9 Selection, Genetic Drift, and Technical Trading
19
The increased fitness differential between the best rules of SFI and non-classifier
agents in the fast learning case suggests that there is a potential for higher degrees
of technical trading. But again, the fitness values alone do not allow a statement
about levels of technical trading at both learning speeds.
9.9 An Evolutionary Perspective on Technical Trading 177
32500
30000
27500
25000
number of instances
22500
20000
17500
15000
12500
10000
7500
5000
2500
0
0 0,1 0,2 0,3 0,4 0,5 0,6 0,7 0,8 0,9 1
bit frequency
Fig. 9.12. Number of bit positions that have a certain bit frequency in the
rule sets of slow learning SFI agents. Data were obtained at period 250,000
from 25 simulation runs with 30 SFI agents, each endowed with 100 trading
rules (Π = 0.9, π = 0.03, bit costs = 0.005, GA-interval = 1,000).
learning case.20 This is definitely the case, with 4.46% of all possible
bit positions fixed. When checking the significance of the increase from
3.35% to 4.46% with an unpaired t-test with unknown and unequal
variances, the t-value turns out to be 8.9. Since this is significant at
even the highest levels of confidence, we can finally establish the emer-
gence of technical trading for faster learning speeds beyond a doubt.
25000
22500
20000
17500
15000
12500
10000
7500
5000
2500
0
0 0,1 0,2 0,3 0,4 0,5 0,6 0,7 0,8 0,9 1
bit frequency
Fig. 9.13. Number of bit positions that have a certain bit frequency in the
rule sets of fast learning SFI agents. Data were obtained at period 250,000
from 25 simulation runs with 30 SFI agents, each endowed with 100 trading
rules (Π = 0.9, π = 0.03, bit costs = 0.005, GA-interval = 100).
fitness increase per single trading bit to be around 0.5 in order for
selection to offset the effect of genetic drift (section 9.6.3 on page 161).
With an average specificity of four, the fitness increases of only 0.2 for
mean fitness and of about 0.5 for maximum fitness values in the fast
learning case suggest that selection cannot guarantee that favorable
trading bits will spread through the population. The observed effect of
subpopulation differentiation, i.e., different agents discovering different
trading rules, is most likely a result of insufficient selection pressure
in favor of successful trading bits. With fitness gradients too small to
counter genetic drift, it is hard to say which trading bits will be selected
since bit fixation remains a noise-prone process.
While we have developed this argument within a model that is char-
acterized by successful technical trading rules, the existence of technical
trading strategies in real financial markets remains a puzzle for many
adherents of the EMH. However, they can now use the same argument
to explain that existence even in the presence of efficient markets.
Those who voiced the earliest evolutionary arguments in favor of
emergent rationality or efficient markets seemed to have come from the
selectionist camp. They implicitly assumed that selection is the only
evolutionary force at work and, “if things have had time to hammer
9.9 An Evolutionary Perspective on Technical Trading 179
logic into men” [382, p. 80], a rational and/or efficient outcome will be
inevitable. An evolutionary perspective that extends its scope beyond
selection is not that certain anymore.
In the light of the above argument, the sheer amount of differ-
ent technical trading strategies in real financial markets suggests two
things. First, their possible profits are at best relatively small. Only
highly profitable and easily detectable trading strategies could, in prin-
ciple, spread through the whole population, which would then lead to
their own demise. Second, to the extent that these trading strategies
are unprofitable (or less profitable than the market portfolio), the losses
they produce are equally too small to quickly eradicate them.21 With
only small selective forces in favor of or against most trading strategies,
“financial evolution” cannot guarantee that only the fittest strategy will
survive.
Under small selection pressures in either direction, random evolu-
tionary forces other than selection might prevail and maintain a suffi-
ciently diverse population of trading strategies over extended periods
of time. Keeping in mind that a moderate increase in checked trading
conditions in the SFI-ASM drastically prolonged the adjustment pe-
riod towards equilibrium, the “evolutionary time scale” in real financial
markets may be too long to observe evolutionary change in real time.22
The existence of technical trading strategies does not necessarily prove
their profitability. It does not disprove the validity of the EMH either.
Technical trading rules may simply be a sign of insufficient selection
pressures against them.
21
Positive profits below the returns of the market portfolio do not strike one as a
particulary strong selective force against a particularly trading strategy.
22
A similar concern was voiced in [242].
10
Summary and Future Research
It was argued in the beginning of this book that the current surge in
agent-based simulation models in economics cannot be explained by the
sudden increase in computational resources alone. Thus, the first part
discussed agent-based simulations in the light of the limitations of cur-
rent research methodologies and practices in the economics field. It was
highlighted that agent-based simulation approaches are highly suited
to address heterogeneous agents problems. Giving up the assumption
of agent homogeneity necessarily leads to an abandoning of the rational
expectations paradigm. In addressing what degree of rationality should
be chosen instead, chapter 3 argued in favor of the principle of minimal
rationality [70]. When considering cognitive resources as scarce, agents
should be modeled as economizing on reasoning processes and employ-
ing only as much reasoning power as necessary to perform the task they
are facing. It was shown at the end of chapter 3 that the trigger for
the rational expectations revolution, the famous Phillips-curve debate,
closely followed the principle of minimal rationality.
After discussing some stylized facts of financial markets in the light
of the EMH, chapter 5 introduced several agent-based simulations.
Through their attempt to replicate certain stylized facts, additional
insight can be gained as to which assumptions are essential for the
emergence of these stylized facts. The selection and sequencing of pre-
sentation of agent-based simulations was guided by the principle of
minimal rationality, discussing first the zero-intelligence approach by
182 10 Summary and Future Research
learning speed, all agents discovered the zero-bit solution which tem-
porarily suggested two things. First, the classifier system does not pro-
vide any informational advantage that agents could exploit. Second, the
emergence of technical trading in the original SFI-ASM is an artifact
of design assumption.
The tentative result of a useless classifier system stood in contra-
diction to two studies by Joshi, Parker, and Bedau [206, 207]. Based
on the terminal wealth levels acquired by agents in the SFI-ASM, they
performed two game-theoretic analyses. In the first study, they asked
whether it would be optimal for agents to include technical trading
rules or not. They concluded that technical trading constitutes a typi-
cal prisoner’s dilemma. They argued that it is rational for each agent to
use technical trading, yet at the same time, they found the aggregate
market outcome to be less efficient than when agents had no access
to technical trading rules. In the second study, they endogenized the
learning speed of agents and found the optimal learning rate to be at
a GA-invocation interval of 100. This clearly falls into the less efficient
complex regime, and Joshi, Parker, and Bedau inferred that financial
markets can operate at sub-optimal equilibria [207]. While the game-
theoretic analyses themselves are correct, chapter 8 deduced that the
SFI-ASM is highly unsuitable for addressing wealth issues. Because of
taxation in the model, a programming trick to avoid a long-run ex-
plosion of wealth, a wedge between long-term wealth and maximized
short-term wealth was introduced. It was analytically shown that the
only source of wealth accumulation in the SFI-ASM is the aggregate
risk premium that agents collect. Wealth contributions of the kind “buy
low and sell high” are impossible. A series of simulations then demon-
strated that wealth differences between different trader types generally
arise for various reasons, none of which has any economic meaning.
In the final chapter, some simulation evidence was presented that
showed the zero-bit solution to be too robust, i.e., it even emerged un-
der parameterizations under which it should not have emerged. As a
reason for this behavior, the effect of genetic drift was identified. Sub-
sequently, the two mutation operators were analyzed in light of genetic
drift. By transferring the Wright-Fisher model from population genetics
to the field of genetic algorithms, conditions were derived under which
selection dominates genetic drift. In a second step, a so-called neutral
model from population genetics was used to analyze under which con-
ditions mutation dominates genetic drift. While the original mutation
operator proved to dominate genetic drift, the same could not be said
for the suggested bit-neutral operator. In addition to being unable to
184 10 Summary and Future Research
2
CRRA preferences are used, for instance, in [247, 241].
11
Appendix
188
∀ i: with prob 1 K : invoke GA on ℑi ∀ i: with prob 1 K : invoke GA on ℑi
{ } { }
11 Appendix
∀ i: choose j ∈ ω i ,t = k ∈ ℑi : cond. part k = market descr. ∀ i: choose j ∈ ω i ,t +1 = k ∈ ℑi : cond. part k = market descr.
t t+1
[ ]
∀ i: Ei ,t pt +1 + d t +1 = ai , j ( ptrial + d t ) + bi , j ∀ i: Ei ,t +1[ pt +2 + dt +2 ] = ai , j ( pt +1 + dt +1 ) + bi , j
spec.: ptrial = f (∑ N
i =1
xi ,t − N ) spec.: ptrial = f (∑ N
i =1
xi ,t +1 − N )
Ei ,t [ pt +1 + dt +1 ] − ptrial (1 + r ) Ei ,t +1[ pt +2 + d t +2 ] − ptrial (1 + r )
∀ i: xi ,t = ∀ i: xi ,t +1 =
λσ t2,i , j , p +d λσ t2+1,i , j , p +d
N N
spec.: ∑ x i ,t = N? NO spec.: ∑ x i ,t +1 = N? NO
i =1 i =1
YES YES
. fitness . fitness
Table 11.1. Condition bits documented in [11, p. 27] and [244, p. 1494].
Bit Condition
Arthur et al. [11] and LeBaron et al. [244] report only 12 condi-
tions for the SFI-ASM. Bits 1 to 6 are the fundamental bits. Note that
even though these are 6 individual bits, they can only code one piece
of fundamental information since they logically belong together. The
rule consistency check introduced in section 7.4.3 compresses this bit
sequence to a maximum of 2 bits, and a neural network trading rule
would only use one real valued number. Bits 7 to 10 are the technical
trading bits. LeBaron et al. remark that removing 3 out of the 4 tech-
nical bits can have a big impact, but removing only one will not change
things. They do not, however, mention what would happen if the list
of trading bits were substantially extended.
The last two bits convey no useful market information, but Arthur
et al. explain that they were intended as experimental controls to check
to which degree agents act upon useless information. They also assert
that these two bits would allow them to detect emergence of technical
trading “if bits 7–10 become set significantly more often, statistically,
than the control bits” [11, p. 27]. Emergence of technical trading in
the SFI-ASM has surely been declared without ever having done such
a test. Since the two control bits are always set, i.e., they are never
#, the “benchmark” of control bits is always 100 %. It is, therefore,
190 11 Appendix
impossible for any trading bit to be set significantly more often. Since
these bits are not mentioned anywhere in the discussions of simulation
results, they are not implemented in the Java version of the SFI-ASM.
Table 11.2. First part of the condition bits in the SFI-ASM Objective-C
version 7.1.2 used by Joshi et al. (1998 and 2002).
Bit Condition
1 dummy bit – always on
2 dummy bit – always off
3 dummy bit – random on or off
4 dividend went up this period
5 dividend went up one period ago
6 dividend went up two periods ago
7 dividend went up three periods ago
8 dividend went up four periods ago
9 5-period moving average of dividend went up
10 20-period moving average of dividend went up
11 100-period moving average of dividend went up
12 500-period moving average of dividend went up
13 dividend > 5 period moving average
14 dividend > 20 period moving average
15 dividend > 100 period moving average
16 dividend > 500 period moving average
17 dividend: 5-period moving average > 20-period moving average
18 dividend: 5-period moving average > 100-period moving average
19 dividend: 5-period moving average > 500-period moving average
20 dividend: 20-period moving average > 100-period moving average
21 dividend: 20-period moving average > 500-period moving average
22 dividend: 100-period moving average > 500-period moving average
23 dividend/mean-dividend > 1/4
24 dividend/mean-dividend > 1/2
25 dividend/mean-dividend > 3/4
26 dividend/mean-dividend > 7/8
27 dividend/mean-dividend > 1
28 dividend/mean-dividend > 9/8
29 dividend/mean-dividend > 5/4
30 dividend/mean-dividend > 3/2
31 dividend/mean-dividend > 2
32 dividend/mean-dividend > 4
Table 11.3. Second part of the condition bits in the SFI-ASM Objective-C
version 7.1.2 used by Joshi et al. (1998 and 2002).
Bit Condition
33 price*interest/dividend > 1/4
34 price*interest/dividend > 1/2
35 price*interest/dividend > 3/4
36 price*interest/dividend > 7/8
37 price*interest/dividend > 1
38 price*interest/dividend > 9/8
39 price*interest/dividend > 5/4
40 price*interest/dividend > 3/2
41 price*interest/dividend > 2
42 price*interest/dividend > 4
43 price went up this period
44 price went up one period ago
45 price went up two periods ago
46 price went up three periods ago
47 price went up four periods ago
48 5-period moving average of price went up
49 20-period moving average of price went up
50 100-period moving average of price went up
51 500-period moving average of price went up
52 price > 5-period moving average
53 price > 20-period moving average
54 price > 100-period moving average
55 price > 500-period moving average
56 price: 5-period moving average > 20-period moving average
57 price: 5-period moving average > 100-period moving average
58 price: 5-period moving average > 500-period moving average
59 price: 20-period moving average > 100-period moving average
60 price: 20-period moving average > 500-period moving average
61 price: 100-period moving average > 500-period moving average
The cutoff points for the prf /d-ratios had been chosen by LeBaron
et al. to have good coverage of the range of relevant ratios. Note that the
192 11 Appendix
Table 11.4. Fundamental and technical condition bits in the current Java
version of the SFI-ASM.
Bit Fundamental conditions Technical conditions
0 dividend / dividend-mean > 0.6 price / price-mean > 0.25
1 dividend / dividend-mean > 0.8 price / price-mean > 0.5
2 dividend / dividend-mean > 0.9 price / price-mean > 0.75
3 dividend / dividend-mean > 1.0 price / price-mean > 0.875
4 dividend / dividend-mean > 1.1 price / price-mean > 1.0
5 dividend / dividend-mean > 1.12 price / price-mean > 1.125
6 dividend / dividend-mean > 1.4 price / price-mean > 1.25
7 price * interest / dividend > 0.25 price went up this period
8 price * interest / dividend > 0.5 price went up one period ago
9 price * interest / dividend > 0.75 price went up two periods ago
10 price * interest / dividend > 0.875 price went up three periods ago
11 price * interest / dividend > 0.95 price went up four periods ago
12 price * interest / dividend > 1.0 5-period price-MA went up
13 price * interest / dividend > 1.125 10-period price-MA went up
14 dividend went up this period 20-period price-MA went up
15 dividend went up one period ago 100-period price-MA went up
16 dividend went up two periods ago 500-period price-MA went up
17 dividend went up three period ago price > 5-period price-MA
18 5-period dividend MA went up price > 10-period price-MA
19 10-period dividend MA went up price > 20-period price-MA
20 100-period dividend MA went up price > 100-period price-MA
21 500-period dividend MA went up price > 500-period price-MA
22 dividend > 5-period div.-MA price: 5-period MA > 10-period
23 dividend > 10-period div.-MA price: 5-period MA > 20-period
24 dividend > 100-period div.-MA price: 5-period MA > 100-period
25 dividend > 500-period div.-MA price: 5-period MA > 500-period
26 div.: 5-period MA > 10-period MA price: 10-period MA > 20-period
27 div.: 5-period MA > 100-period MA price: 10-period MA > 100-period
28 div.: 5-period MA > 500-period MA price: 10-period MA > 500-period
29 div.: 10-period MA > 100-period MA price: 20-period MA > 100-period
30 div.: 10-period MA > 100-period MA price: 20-period MA > 500-period
31 div.: 100-period MA > 500-period MA price: 100-period MA > 500-period
References
[1] C. Ahn, E.-B. Lee, and E.-H. Suh. Noise trading in the Korean
foreign exchange market: Some questionnaire evidence. Bank of
Korea Economic Papers, 5(2):133–155, 2002.
[2] V. Akgiray, G. G. Booth, and O. Loistl. Stable laws are in-
appropriate for describing German stock returns. Allgemeines
Statistisches Archiv, 73:115–121, 1989.
[3] A. A. Alchian. Uncertainty, evolution, and economic theory. Jour-
nal of Political Economy, 58(3):211–222, 1950.
[4] M. Allais. Le comportement de l’homme rationnel devant le
risque: Critique des postulats et axiomes de l’ecole americaine.
Econometrica, 21(4):503–546, 1953.
[5] F. Allen and R. Karjalainen. Using genetic algorithms to
find technical trading rules. Journal of Financial Economics,
51(2):245–271, 1999.
[6] F. Allen, S. Morris, and H. S. Shin. Beauty contests, bubbles
and iterated expectations in asset markets. Cowles Foundation
Discussion Paper 1406, 1406, March 2003.
[7] H. L. Allen and M. P. Taylor. Charts, noise, and fundamen-
tals in the London foreign exchange market. Economic Journal,
100(400):49–59, 1990.
[8] H. M. Amman, D. A. Kendrick, and J. Rust, editors. Handbook
of Computational Economics, volume 1. Elsevier, Amsterdam,
New York, 1996.
[9] J. Arifovic. Genetic algorithm learning and the cobweb model.
Journal of Economic Dynamics and Control, 18(1):3–28, 1994.
[10] B. W. Arthur, S. Durlauf, and D. Lane, editors. The Economy
as an Evolving Complex System II. SFI Studies in the Sciences
of Complexity, vol. 27. Addison-Wesley, Reading, MA, 1997.
196 References
[194] G. Iori. A threshold model for stock return volatility and trad-
ing volume. International Journal of Theoretical and Applied
Finance, 3(3):467–472, 2000.
[195] G. Iori. A microsimulation of traders activity in the stock market:
The role of heterogeneity, agents’ interactions and trade frictions.
Journal of Economic Behavior and Organization, 49(2):269–285,
2002.
[196] J. F. Jaffe. Special information and insider trading. Journal of
Business, 47(410):410–428, 1974.
[197] C. M. Jarque and A. K. Bera. Efficient tests for normality, het-
eroscedasticity and serial independence of regression residuals.
Economics Letters, 6(3):255–259, 1980.
[198] T. Jasic and D. Wood. The profitability of daily stock market
indices trades based on neural network predictions: Case study
for the S&P 500, the DAX, the TOPIX and the FTSE in the
period 1965-1999. Applied Financial Economics, 14(4):285–297,
2004.
[199] N. Jegadeesh. Foundations of technical analysis: Computational
algorithms, statistical inference, and empirical implementation:
Discussion. Journal of Finance, 55(4):1765–1770, 2000.
[200] N. Jegadeesh and S. Titman. Returns to buying winners and
selling losers: Implications for stock market efficiency. Journal of
Finance, 48(1):65–91, 1993.
[201] M. C. Jensen. Some anomalous evidence regarding market effi-
ciency. Journal of Financial Economics, 6(2-3):95–101, 1978.
[202] M. C. Jensen and G. A. Benington. Random walks and tech-
nical theories: Some additional evidence. Journal of Finance,
25(2):469–482, 1970.
[203] M. C. Jerison. Aggregation and pairwise aggregation of demand
when the distribution of income is fixed. Journal of Economic
Theory, 33(1):1–33, June 1984.
[204] P. E. Johnson. Agent-based modeling: What I learned from
the artificial stock market. Social Science Computer Review,
20(2):174–186, 2002.
[205] S. Joshi and M. A. Bedau. An explanation of generic behavior in
an evolving financial market. Santa Fe Institute Working Paper
98-12-114, 1998.
[206] S. Joshi, J. Parker, and M. A. Bedau. Technical trading creates
a prisoner’s dilemma: results from an agent-based model. Santa
Fe Institute Working Paper 98-12-115, 1998.
210 References
[252] J. Lintner. The valuation of risk assets and the selection of risky
investments in stock portfolios and capital budgets. The Review
of Economics and Statistics, 47(1):13–37, 1965.
[253] B. L. Lipman. How to decide how to decide how to ... : Modeling
bounded rationality. Econometrica, 59(4):1105–1125, 1991.
[254] R. G. Lipsey. The relation between unemployment and the rate
of change of money wages in the United Kingdom 1862-1957: A
further analysis. Economica, 27(105):1–31, 1960.
[255] A. W. Lo. The Adaptive Markets Hypothesis: Market efficiency
from an evolutionary perspective. Journal of Portfolio Manage-
ment, 30:15–29, 2004.
[256] A. W. Lo, H. Mamaysky, and J. Wang. Foundations of techni-
cal analysis: Computational algorithms, statistical inference, and
empirical implementation. Journal of Finance, 55(4):1705–1765,
2000.
[257] I. N. Lobato and C. Velasco. Long memory in stock-market
trading volume. Journal of Business and Economic Statistics,
18(4):410–426, 2000.
[258] G. Loomes and C. Taylor. Non-transitive preferences over gains
and losses. Economic Journal, 102(411):357–365, 1992.
[259] M. Loretan and P. C. B. Phillips. Testing the covariance station-
arity of heavy-tailed time series. Journal of Empirical Finance,
1(2):211–248, 1994.
[260] J. Lucas, Robert E. Expectations and the neutrality of money.
Journal of Economic Theory, 4(2):103–124, April 1972.
[261] J. Lucas, Robert E. Econometric policy evaluation: A critique. In
K. Brunner and A. H. Meltzer, editors, The Phillips Curve and
Labor Markets, volume 1 of Carnegie-Rochester Conference Series
on Public Policy, pages 19–46. North-Holland, Amsterdam, 1976.
[262] J. Lucas, Robert E. Asset prices in an exchange economy. Econo-
metrica, 46(6):1429–1445, 1978.
[263] Y.-H. Lui and D. Mole. The use of fundamental and technical
analyses by foreign exchange dealers: Honk Kong evidence. In-
ternational Journal of Finance and Economics, 17(3):535–545,
1998.
[264] T. Lux. Herd behaviour, bubbles, and crashes. Economic Journal,
105(431):881–896, 1995.
[265] T. Lux. The stable Paretian hypothesis and the frequency of
large returns: An examination of major German stocks. Applied
Financial Economics, 6(6):463–475, 1996.
214 References
Adaptive Market Hypothesis 72–74 classifier system 29, 36, 46–49, 95,
aggregation 10–11 105, 110, 115, 118, 123–125, 129,
allele 36, 44, 96, 150, 155, 157–159, 130, 137, 149, 165, 175, 185
164 Coherent Market Hypothesis 70–71
allele frequency 155, 162 complex adaptive systems 6, 14–16
AR(1)-process 94, 123, 150 conditional heteroskedasticity
ARCH models autoregressive 62
ARCH(q) model 62 consistent expectations equilibria
Exponential ARCH 63 23
Fractionally Integrated ARCH crossover 32, 33, 37, 43–45, 49, 86,
63 99–101, 110, 116, 125, 152, 155,
Generalized ARCH(p,q) 62 167, 169
artificial neural network 30, 36, 71, crowding methods 150
187 CRRA see risk aversion, constant
as-if argument 19–21 relative
aspiration level 22, 26 cybernetics 15
volatility Wetzel-ranking 99
clustering 55, 60–64, 74, 75, 82, Wright-Fisher model 157–160, 185
84, 107, 109, 122
conditional 61, 62 zero-intelligence traders 76–79, 184
Lecture Notes in Economics
and Mathematical Systems
For information about Vols. 1–512
please contact your bookseller or Springer-Verlag
Vol. 513: K. Marti, Stochastic Optimization Techniques. Vol. 534: M. Runkel, Environmental and Resource Policy
VIII, 364 pages. 2002. for Consumer Durables. X, 197 pages. 2004.
Vol. 514: S. Wang, Y. Xia, Portfolio and Asset Pricing. XII, Vol. 535: X. Gandibleux, M. Sevaux, K. Sörensen, V. T’kindt
200 pages. 2002. (Eds.), Metaheuristics for Multiobjective Optimisation.
IX, 249 pages. 2004.
Vol. 515: G. Heisig, Planning Stability in Material
Requirements Planning System. XII, 264 pages. 2002. Vol. 536: R. Brüggemann, Model Reduction Methods for
Vector Autoregressive Processes. X, 218 pages. 2004.
Vol. 516: B. Schmid, Pricing Credit Linked Financial
Instruments. X, 246 pages. 2002. Vol. 537: A. Esser, Pricing in (In)Complete Markets. XI,
122 pages, 2004.
Vol. 517: H. I. Meinhardt, Cooperative Decision Making
in Common Pool Situations. VIII, 205 pages. 2002. Vol. 538: S. Kokot, The Econometrics of Sequential Trade
Models. XI, 193 pages. 2004.
Vol. 518: S. Napel, Bilateral Bargaining. VIII, 188 pages.
Vol. 539: N. Hautsch, Modelling Irregularly Spaced
2002.
Financial Data. XII, 291 pages. 2004.
Vol. 519: A. Klose, G. Speranza, L. N. Van Wassenhove
Vol. 540: H. Kraft, Optimal Portfolios with Stochastic
(Eds.), Quantitative Approaches to Distribution Logistics
Interest Rates and Defaultable Assets. X, 173 pages. 2004.
and Supply Chain Management. XIII, 421 pages. 2002.
Vol. 541: G.-y. Chen, X. Huang, X. Yang, Vector
Vol. 520: B. Glaser, Efficiency versus Sustainability in
Optimization. X, 306 pages. 2005.
Dynamic Decision Making. IX, 252 pages. 2002.
Vol. 542: J. Lingens, Union Wage Bargaining and
Vol. 521: R. Cowan, N. Jonard (Eds.), Heterogenous Economic Growth. XIII, 199 pages. 2004.
Agents, Interactions and Economic Performance. XIV,
339 pages. 2003. Vol. 543: C. Benkert, Default Risk in Bond and Credit
Derivatives Markets. IX, 135 pages. 2004.
Vol. 522: C. Neff, Corporate Finance, Innovation, and
Vol. 544: B. Fleischmann, A. Klose, Distribution Logistics.
Strategic Competition. IX, 218 pages. 2003.
X, 284 pages. 2004.
Vol. 523: W.-B. Zhang, A Theory of Interregional
Vol. 545: R. Hafner, Stochastic Implied Volatility. XI, 229
Dynamics. XI, 231 pages. 2003.
pages. 2004.
Vol. 524: M. Frölich, Programme Evaluation and Vol. 546: D. Quadt, Lot-Sizing and Scheduling for Flexible
Treatment Choise. VIII, 191 pages. 2003. Flow Lines. XVIII, 227 pages. 2004.
Vol. 525: S. Spinler, Capacity Reservation for Capital- Vol. 547: M. Wildi, Signal Extraction. XI, 279 pages. 2005.
Intensive Technologies. XVI, 139 pages. 2003.
Vol. 548: D. Kuhn, Generalized Bounds for Convex
Vol. 526: C. F. Daganzo, A Theory of Supply Chains. VIII, Multistage Stochastic Programs. XI, 190 pages. 2005.
123 pages. 2003.
Vol. 549: G. N. Krieg, Kanban-Controlled Manufacturing
Vol. 527: C. E. Metz, Information Dissemination in Systems. IX, 236 pages. 2005.
Currency Crises. XI, 231 pages. 2003.
Vol. 550: T. Lux, S. Reitz, E. Samanidou, Nonlinear
Vol. 528: R. Stolletz, Performance Analysis and Dynamics and Heterogeneous Interacting Agents. XIII,
Optimization of Inbound Call Centers. X, 219 pages. 2003. 327 pages. 2005.
Vol. 529: W. Krabs, S. W. Pickl, Analysis, Controllability Vol. 551: J. Leskow, M. Puchet Anyul, L. F. Punzo, New
and Optimization of Time-Discrete Systems and Tools of Economic Dynamics. XIX, 392 pages. 2005.
Dynamical Games. XII, 187 pages. 2003. Vol. 552: C. Suerie, Time Continuity in Discrete Time
Vol. 530: R. Wapler, Unemployment, Market Structure Models. XVIII, 229 pages. 2005.
and Growth. XXVII, 207 pages. 2003. Vol. 553: B. Mönch, Strategic Trading in Illiquid Markets.
Vol. 531: M. Gallegati, A. Kirman, M. Marsili (Eds.), The XIII, 116 pages. 2005.
Complex Dynamics of Economic Interaction. XV, 402 Vol. 554: R. Foellmi, Consumption Structure and Macro-
pages, 2004. economics. IX, 152 pages. 2005.
Vol. 532: K. Marti, Y. Ermoliev, G. Pflug (Eds.), Dynamic Vol. 555: J. Wenzelburger, Learning in Economic Systems
Stochastic Optimization. VIII, 336 pages. 2004. with Expectations Feedback (planned) 2005.
Vol. 533: G. Dudek, Collaborative Planning in Supply Vol. 556: R. Branzei, D. Dimitrov, S. Tijs, Models in
Chains. X, 234 pages. 2004. Cooperative Game Theory. VIII, 135 pages. 2005.
Vol. 557: S. Barbaro, Equity and Efficiency Considerations Vol. 583: I.V. Konnov, D.T. Luc, A.M. Rubinov† (Eds.),
of Public Higer Education. XII, 128 pages. 2005. Generalized Convexity and Related Topics. IX, 469 pages,
Vol. 558: M. Faliva, M. G. Zoia, Topics in Dynamic Model 2006.
Analysis. X, 144 pages. 2005. Vol. 584: C. Bruun, Adances in Artificial Economics: The
Vol. 559: M. Schulmerich, Real Options Valuation. XVI, Economy as a Complex Dynamic System. XVI, 296 pages,
357 pages. 2005. 2006.
Vol. 560: A. von Schemde, Index and Stability in Bimatrix Vol. 585: R. Pope, J. Leitner, U. Leopold-Wildburger, The
Games. X, 151 pages. 2005. Knowledge Ahead Approach to Risk. XVI, 218 pages, 2007
(planned).
Vol. 561: H. Bobzin, Principles of Network Economics.
XX, 390 pages. 2006. Vol. 586: B.Lebreton, Strategic Closed-Loop Supply Chain
Management. X, 150 pages, 2007 (planned).
Vol. 562: T. Langenberg, Standardization and
Expectations. IX, 132 pages. 2006. Vol. 587: P. N. Baecker, Real Options and Intellectual
Property: Capital Budgeting Under Imperfect Patent
Vol. 563: A. Seeger (Ed.), Recent Advances in Protection. X, 276 pages , 2007.
Optimization. XI, 455 pages. 2006.
Vol. 588: D. Grundel, R. Murphey, P. Panos , O. Prokopyev
Vol. 564: P. Mathieu, B. Beaufils, O. Brandouy (Eds.), (Eds.), Cooperative Systems: Control and Optimization.
Artificial Economics. XIII, 237 pages. 2005. IX, 401 pages , 2007.
Vol. 565: W. Lemke, Term Structure Modeling and Vol. 589: M. Schwind, Dynamic Pricing and Automated
Estimation in a State Space Framework. IX, 224 pages. Resource Allocation for Information Services:
2006. Reinforcement Learning and Combinatorial Auctions.
Vol. 566: M. Genser, A Structural Framework for the XII, 293 pages , 2007.
Pricing of Corporate Securities. XIX, 176 pages. 2006. Vol. 590: S. H. Oda, Developments on Experimental
Vol. 567: A. Namatame, T. Kaizouji, Y. Aruga (Eds.), The Economics: New Approaches to Solving Real-World
Complex Networks of Economic Interactions. XI, 343 Problems. XVI, 262 pages, 2007.
pages. 2006. Vol. 591: M. Lehmann-Waffenschmidt, Economic
Vol. 568: M. Caliendo, Microeconometric Evaluation of Evolution and Equilibrium: Bridging the Gap. VIII, 272
Labour Market Policies. XVII, 258 pages. 2006. pages, 2007.
Vol. 569: L. Neubecker, Strategic Competition in Vol. 592: A. C.-L. Chian, Complex Systems Approach to
Oligopolies with Fluctuating Demand. IX, 233 pages. Economic Dynamics. X, 95 pages, 2007.
2006. Vol. 593: J. Rubart, The Employment Effects of
Vol. 570: J. Woo, The Political Economy of Fiscal Policy. Technological Change: Heterogenous Labor, Wage
X, 169 pages. 2006. Inequality and Unemployment. XII, 209 pages, 2007
Vol. 571: T. Herwig, Market-Conform Valuation of Vol. 594: R. Hübner, Strategic Supply Chain Management
Options. VIII, 104 pages. 2006. in Process Industries: An Application to Specialty
Vol. 572: M. F. Jäkel, Pensionomics. XII, 316 pages. 2006 Chemicals Production Network Design. XII, 243 pages,
2007
Vol. 573: J. Emami Namini, International Trade and
Multinational Activity. X, 159 pages, 2006. Vol. 595: H. Gimpel, Preferences in Negotiations: The
Attachment Effect. XIV, 268 pages, 2007
Vol. 574: R. Kleber, Dynamic Inventory Management in
Reverse Logistics. XII, 181 pages, 2006. Vol. 596: M. Müller-Bungart, Revenue Management
with Flexible Products: Models and Methods for the
Vol. 575: R. Hellermann, Capacity Options for Revenue Broadcasting Industry. XXI, 297 pages, 2007
Management. XV, 199 pages, 2006.
Vol. 597: C. Barz, Risk-Averse Capacity Control in
Vol. 576: J. Zajac, Economics Dynamics, Information and Revenue Management. XIV, 163 pages, 2007
Equilibnum. X, 284 pages, 2006.
Vol. 598: A. Ule, Partner Choice and Cooperation in
Vol. 577: K. Rudolph, Bargaining Power Effects in Networks: Theory and Experimental Evidence. Approx.
Financial Contracting. XVIII, 330 pages, 2006. 200 pages, 2007
Vol. 578: J. Kühn, Optimal Risk-Return Trade-Offs of Vol. 599: A. Consiglio, Artificial Markets Modeling:
Commercial Banks. IX, 149 pages, 2006. Methods and Applications. XV, 277 pages, 2007
Vol. 579: D. Sondermann, Introduction to Stochastic Vol. 600: M. Hickman, P. Mirchandani, S. Voss
Calculus for Finance. X, 136 pages, 2006. (Eds.): Computer-Aided Scheduling of Public Transport.
Vol. 580: S. Seifert, Posted Price Offers in Internet Auction Approx. 424 pages, 2007
Markets. IX, 186 pages, 2006. Vol. 601: D. Radulescu: CGE Models and Capital Income
Vol. 581: K. Marti; Y. Ermoliev; M. Makowsk; G. Pflug Tax Reforms: The Case of a Dual Income Tax for Germany.
(Eds.), Coping with Uncertainty. XIII, 330 pages, 2006. XVI, 168 pages, 2007
Vol. 582: J. Andritzky, Sovereign Default Risks Valuation: Vol. 602: N. Ehrentreich: Agent-Based Modeling: The
Implications of Debt Crises and Bond Restructurings. Santa Fe Institute Artificial Stock Market Model
VIII, 251 pages, 2006. Revisited. XVI, 225 pages, 2008