Polysemy: Brigittp Zazie Rood David D. Clarke
Polysemy: Brigittp Zazie Rood David D. Clarke
ZAZIE rOOD
        VIMAlA HERMAN
        DAVID D. ClARKE
        (Editors)
Polysemy
Editors
WaIter Bisang
(main editor for this volume)
Hans Henrich Hock
Werner Winter
Mouton de Gruyter
Berlin . New York
Polysemy
Flexible Patterns of Meaning
in Mind and Language
Edited by
Brigitte Nerlich
Zazie Todd
Vimala Herman
David D. Clarke
Mouton de Gruyter
Berlin . New York   2003
Mouton de Gruyter (formerly Mouton, The Hague)
is a Division of Waiter de Gruyter GmbH & Co. KG, Berlin.
ISBN 3-11-017616-5
© Copyright 2003 by Waiter de Gruyter GmbH & Co. KG, D-10785 Berlin
All rights reserved, including those of translation into foreign languages. No part of this
book may be reproduced or transmitted in any form or by any means, electronic or mechan-
ical, including photocopy. recording or any information storage and retrieval system, with-
out permission in writing from the publisher.
Printed in Germany.
This book is dedicated to the memory of
Andreas Blank (1961-2001), our friend
and colleague, who died far too young.
Acknowledgements
This book was written with the generous support of the Leverhulme Trust.
   The book has been long in the making. The editors want to express their
gratitude to all contributors for their patience and forbearance, to Birgit Sievert
for her help and encouragement, and to Simon Cave and Sue Lightfoot for
helping the book over its last formatting and indexing hurdles.
Contents
Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ..       vii
List of contributors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ..    xi
Cognitive approaches
Polysemy and conceptual blending ............................                                           79
   Gilles Fauconnier and Mark Turner
Reconsidering prepositional polysemy networks: the case of over                                         95
  Andrea Tyler and Vyvyan Evans
Polysemy as flexible meaning: experiments with English get
and Finnish pitiiii .......................................... 161
   Jarno Raukko
Metonymic polysemy and its place in meaning extension .......... 195
  Ken-ichi Seta
Synchrony/diachrony approaches
Polysemy in derivational affixes .............................. 217
   Adrienne Lehrer
The role of links and/or qualia in modifier-head constructions ...... 233
  Beatrice Warren
x   Colltents
Computational approaches
"I don't believe in word senses" .............................. 361
   Adam Kilgarriff
Senses and texts ........................................... 393
   Yorick Witks
This chapter has three parts. In the first part we provide a brief overview of
the varied fortunes of polysemy research since it was made popular by Michel
Breal in 1897 (Breal 1924 [1897]: Chapter 15; for more details see NerIich,
this volume), we summarize what has been done so far, what has proven
problematic in this type of research, and which gaps are still left to be
filled. We also link the recent upsurge of interest into polysemy with a
renewed interest into the cognitive bases of figurative language (metaphor
and metonymy). In the second part we explore various theories of polysemy,
proposing ourselves a graded theory of polysemy as flexible meaning, which
underlies discursive polysemy, lexicalized synchronic polysemy and dia-
chronic processes of polysemization, and we contrast this view with other
monosemous views of meaning. Throughout this introduction, we combine
the analysis of central issues with references to the topics discussed in the
articles that follow. In the last part of the introduction we shall provide an
overview of the chapters that follow and indicate the ways in which they
contribute to the overall theme of the book. In all the chapters a case will be
made for a theory of polysemy as flexible meaning.
Fifty years ago the linguist and semanticist Stephen Ullmann wrote that
polysemy, the fact that some words have a network of multiple but related
meanings, is "the pivot of semantic analysis" (Ullmann 1957 [1951]: 117).
He was referring to traditional synchronic and diachronic lexical semantics
as it had developed after Breal. Fifty years after Ullmann and a century after
Breal polysemy has become central to modem cognitive semantics of the
synchronic and diachronic type as well as to computational semantics and
AI (see Brugman 1997). It has become clear that the study of polysemy is of
fundamental importance for any semantic study of language and cognition.
Polysemy can therefore be regarded as providing a privileged access to the
4    Brigitte Nerlich and David D. Clarke
then prototype and frame semantics (see WiIliams 1992; Rueckl 1995).
Polysemy can in fact be regarded as the hub around which all the multiple
but related synchronic and diachronic theories of language and cognition
turn.
    Developmental psychologists and linguists interested in language acqui-
sition have also begun to study polysemy (see Mason, Kniseley and Kendall
 1979; Durkin, Crowther and Shire 1986; Johnson 1997,2001; Nerlich, Todd
and Clarke 1998; Israel, to appear). One contribution to this volume will
explore the acquisition of the different meanings of the polysemous verb
get from 4 to 10 years, and it will also investigate whether developmental
psychology can shed light on recent developments in cognitive linguistics
and vice versa (Nerlich, Todd and Clarke, this volume; see also Raukko, this
volume, for a parallel investigation of the "adult polysemy" of get).
    There are still many questions to be answered in this field of research.
Here are just a few of them: do children learn the various subsenses of a word
in fixed collocational patterns and only later relate them to one core meaning?
Or do they first have a very vague global meaning for one word and then
tease out the subsenses? Do they learn the most frequently used subsenses
first? How do children start to enjoy and understand jokes and puns based
on the exploitation of polysemy/ambiguity? Why does this seem to happen
only after the age of 7? And why is this difficult for autistic children? As
conceptual metaphorical mappings are not primarily matters of language,
but part of our conceptual systems which allow us to use sensory-motor
concepts in the service of abstract reason, it is assumed by many cognitive
linguists that children acquire conceptual metaphorical mappings automatically
and unconsciously via their everyday functioning in the world (see Lakoff
and 10hnson 1999: Chapter 4). All this needs to be studied empirically.
    Could it be that children put together knowledge elements into networks
that follow the same pathways as the accretion of polysemous meanings
by a word form over time? Do children's spontaneous creations of multiple
meanings match the patterns of adult polysemy, that is, follow metaphorical
and metonymical pathways? And if not, how are children's networks of
polysemous meanings adjusted to the adult ones? Another question one should
try to answer is: do children readily understand polysemous words? What
kind of polysemous words do they understand best? Does it depend on the
grammatical category? Does it depend on the frequency/familiarity or salience
(see Hughes 1989; Giora 1997)? Does it depend on the type of figurative
mapping underlying the mUltiple meanings of a word? Are metonymical
                          Polysemy and flexibility: introduction and overview        7
In the context of this book, we adopt as a working hypothesis the view that
almost every word is more or less polysemous, with senses linked to a
prototype by a set of relational semantic principles which incorporate a greater
or lesser amount of flexibility. We follow the now common practice in
poly se my research and regard polysemy as a graded phenomenon (see Cowie
1982; Lipka 1990; Gibbs 1994; Cruse 1995), ranging between what some
call contrastive polysemy and complementary polysemy (Weinreich 1966;
Pustejovsky 1995), where contrastive polysemy deals with homonyms such
as match (a small stick with a tip which ignites when scraped on a rough
surface) and match (contest in a game or sport), whereas complementary
polysemy deals with interrelated semantic aspects of a word, such as, in the
case of record, for example, the physical object and the music. Cases, such
as smart as in "a smart person" or "a smart dress", and/air as in "a fair trial"
and "fair hair" lie somewhere in between. Other polysemy researchers, such
as Cruse, have tried to deal with this semantic gradedness by invoking the
notion of sense-nodules or regions of higher semantic density within the
extreme variability of senses (Cruse 2000). The concept of gradedness also
applies to the difference between literal and metaphorical meaning, as recent
research seems to confirm (see Gibbs 1994; Giora, this volume).
    Take the following examples:
    In the interpretation of polysemous words (in fact almost any word) there
is therefore always a process of accommodation (and assimilation) involved,
between what is given semantically, syntactically, and what we infer from
the surrounding pragmatic context of discourse -leading to what Fauconnier
and Turner call "conceptual integration". As Wittgenstein said in his Philo-
.mphical Grammar: "Well, 'Understanding' is not the name of a single process
accompanying reading or hearing, but of more or less interrelated processes
against a background, or in a context" (Wittgenstein 1974, §35: 74).
    Wittgenstein, the "father" of the well-known concept of "family resemb-
lances", so central to prototype theory, also stressed the gradedness of the
multiple senses of a word in this passage from his Philosophical Grammar:
       features and be similar to each other, while distant ones belong to the same
       family without any longer having anything in common. Indeed even if a
       feature is common to all members of the family it need not be that feature
       that defines the concept.
           The relations between the members of a concept may be set up by the
       sharing of features which show up in the family of the concept, crossing and
       overlapping in very complicated ways.
           Thus there is probably no simple characteristic which is common to all the
       things we call games. But it can't be said either that "game" just has several
       independent meanings (rather like the word "bank"). What we call "games"
       are procedures interrelated in various ways with many different transitions
       between one and another. (Wittgenstein 1974, §35: 75; emphasis added)
   Geeraerts gives three reasons, all of which are linked to the functions of
the human mind, namely: storing information or accumulating world knowl-
edge and using it for cognitive purposes.
but it may also have communicational benefits, as salient meanings that have
not been deactivated may easily be reused by the discourse participants for
special purposes. This may then lead to strengthening of the semantic bonds
between the senses of a word. Becoming salient in conversation, they will
become more accessible and therefore more frequently used and usable.
However, the communicational costs of using polysemous words are also
obvious. There is always a danger of being misunderstood or of falling into
semantic traps: as when the topic of the long overdue renovation of the staff
toilets is discussed at a departmental meeting and someone says: "Any
movement on this issue would be welcome" ...
    Any theory of polysemy has to achieve a very precarious balancing act
between the maximization of polysemy, where the words themselves carry
most of the polysemous workload and speakers just have to choose correctly
in context, and a minimization of polysemy which leaves most of the work
to the pragmatic component, that is to the interpretational work done by the
speakers. One way out of this dilemma is proposed by Anna Wierzbicka
(1996) who distinguishes between semantic primes (undefinable words),
words that can be defined or paraphrased reductively using these primes and
are therefore not polysemous, and words which need several paraphrases
and can therefore be regarded as polysemous.
    Maximalist descriptions of polysemy differ from minimalist ones in their
psycho-communicational consequences. If most of the relevant meaning is
coded, it will be recoverable from memory and need not be constructed on
the spot. The consequence is reliable functioning of the communication but
lack of flexibility. If little of the relevant meaning is coded, the interpreter's
memory will not be overburdened, but his reasoning capacity will be occupied
in the situation concerned. The message will not become available indepen-
dently of the communication situation and will be less stereotyped. As a
consequence there will be less reliable functioning but more flexibility and
creativity (see Posner 1996: 236; on ways out of the minimalistlmaximalist
dilemma, see Warren, this volume; Tyler and Evans, this volume; Taylor,
this volume; Janssen, to appear).
    Related to the minimalistlmaximalist difference in approaches to polysemy
is the abstractivistlcognitivist difference (see Behrens 1999). According to
the abstractivist view advocated for example by Caramazza and Grober
(1976), each polysemous word has an abstract overall (literal) sense and the
extended senses of the word can be derived contextually. This view has been
contested in more recent work in lexical semantics which suggests that the
                         Polysemy and flexibility: introduction and ove",iew          15
   We know that the word table has a fuzzy floating set of references; but we
   also know that there is some precise limit to the range of references. The
   trouble comes in attempting to capture formally the precision and the
   fuzziness. We can use the word table to refer to all manners of things that
   have some property of tableness in a particular setting. The degree to which
   we all ascribe the property of tableness to something in any setting will vary.
   And the quality of tableness will change for each of us. What seems to be a
   table at some time in some place may not be a table to us at another time in
   another place.
       Language is inherently ambiguous and uncertain. That is the problem and
   the power of the system .... We must always creatively interpret an utterance:
   the new interpretation always has the potential of achieving a new insight.
   (Lieberman 1984: 82)
    Although there is a danger that this might be taken the wrong (namely
modular) way, one could say that it is the function of sentential context to
mediate between semantics and pragmatics and, in a sense, between maxima-
list and minimalist polysemy:
16      Brigitte Nerlich and David D. Clarke
     The rules of syntax have a functional purpose: they limit some of the semantic
     referents of a word. In the sentence It's important to bank your money, the
     syntax of English limits the semantic referents of the word bank to the range
     of concepts relevant to its functioning as a verb. (Lieberman 1984: 85-86)
This introduction will be followed by two contributions which set the scene
for the following articles, a theoretical one by Taylor and a historiographical
one by Nerlich, both exploring the puzzles that polysemy has posed to linguists
and philosophers of language over time and is still posing to linguists now.
The articles by Fauconnierffurner, TylerlEvans, Raukko, and Seto explore
the wider theoretical and methodological debates connected with issues of
polysemy, blending and flexible meaning, and the status of metonymy inside
a cognitive theory of polysemy. Fauconnierffurner and TylerlEvans focus
on polysemy as part of an exploration of human conceptualization by studying
examples of blending and the various uses of over in (American) English.
Raukko's contribution provides detailed comparative analyses of the English
word get and the Finnish word pitiiii. It establishes a direct link to the
chapters by Warren and Lehrer, where polysemous English word forms
and polysemous constructions are investigated in detail. All three articles
widen the field of polysemy research from the lexicon to morphology and
syntactic constructions and make new contributions to methodology. The
articles by AitchisonlLewis and Blank study various levels of polysemy and
bleaching from the synchronic as well as diachronic point of view, not only
                        Polysemy and flexibility: introduction and overview   17
    The paper by Andrea Tyler and Vyvyan Evans (reprinted here with the
permission of Language) explores lexical poly se my through an in-depth
examination of the English preposition over. Working within a cognitive
linguistic framework, it illustrates the non-arbitrary quality of the mental
lexicon and the highly creative nature of the human conceptual system. The
analysis takes the following as basic: (i) human conceptualization is the
product of embodied experience, i.e. that the kinds of bodies and neural
architecture humans have, in conjunction with the nature of the spatio-
physical world humans inhabit, determines human conceptual structure; and
(ii) semantic structure derives from and reflects conceptual structure. As
humans interact with the world, they perceive recurring spatial configurations
which become represented in memory as abstract, imagistic conceptualiz-
ations. The authors posit that each preposition is represented by a primary
meaning, which they term a protoscene. The protoscene, in turn, interacts
with a highly constrained set of cognitive principles to derive a set of additional
distinct senses, forming a motivated semantic network. Tyler and Evans
conclude that previous accounts have failed to adequately develop criteria to
distinguish between coding in formal linguistic expression, and the nature of
conceptualization which integrates linguistic prompts in a way that is maxi-
mally coherent with and contingent upon sentential context and real-world
knowledge. To rectify this, they put forward a methodology for identifying
the protoscene and for distinguishing among distinct senses.
    Like Seto in the following chapter, Jarno Raukko disputes some traditional
assumptions underlying polysemy research and puts forward a new experi-
mental approach focusing on flexibility. His chapter is therefore entitled:
"Polysemy as flexible meaning: experiments with English get and Finnish
pitiiii". Raukko claims that it is common to view polysemy as a collection or
network of several (interrelated) meanings that is fairly stable, fairly unprob-
lematic to segment and establish, and fairly much agreed upon by different
speakers. Yet, as he shows in his chapter, there are also opponents who wish
to reduce polysemy to patterns of contextual specifications or claim that
polysemy is irrelevant to the study of communication. One way of building
a bridge between these camps is to see polysemy as patterns of flexibility in
(lexical) meaning in much the same way as it is accepted that situational
(utterance or discourse level) meaning is nonfixed, imprecise, and negotiable.
The view about polysemy as flexible meaning does not imply that word-
specific descriptions of semantic variation would be unnecessary, but it seeks
to leave room for dynamicity and open-endedness in categorization as well
                        Polysemy and flexibility: introduction and overview   19
(5)    At least 62 people were killed and 3,000 missing last night after an
       underwater earthquake sent 50ft tidal waves crashing into the coast
       of Nicaragua. More than 227 people were injured in the disaster.
(6)    All other efforts to lose the fat from the offending areas proved to be
       a disaster. If [lost weight below 54 kg my bust disappeared, yet nothing
       went from my Legs or posterior!
    This chapter explores some of the resulting polysemy. It asks two main
questions: first, how do hearers successfully interpret such fast-moving,
polysemous words? Second, what general processes can be identified in the
development of such polysemy? The diachronic theme explored in Aitchisonl
Lewis's chapter is continued in Blank's chapter.
    In his chapter "Polysemy in the lexicon and in discourse" Andreas Blank
agrees initially with Breal who claimed that polysemy could be regarded as
the "synchronic side" of lexical semantic change. He argues that if we
conceive of semantic change as being based on associations between concepts
or concepts and linguistic signs, we can describe polysemy as the continuation
of these associative relations in synchrony. Blank warns us, however, not to
commit an easy fallacy: there is no complete isomorphism between diachronic
processes and synchronic states. His first aim is to explain the specific
                        Polysemy and flexibility: introduction and overview    21
    After having analysed some of the main synchronic, diachronic and dis-
cursive patterns of polysemy, we now come to patterns of polysemy which
emerge in and perhaps even structure the acquisition of certain lexical items.
Linking back to the articles by Seto, Blank, and Giora/Gur we put polysemy
back into the context of the study of figurative language, especially metaphor
and metonymy, but from a specifically psychological perspective. Ann Dowker
stresses in her chapter "Young children's and adults' use of figurative language:
how important are cultural and linguistic influences?" that there have been
numerous studies of the development and of the use and comprehension of
metaphors and other forms of figurative language. At least some types of
figurative language appear to develop very early: well before school age.
Figurative language seems to be a cross-cultural universal, but the precise
forms that it takes vary with language and culture. However, a recent study
by the author showed marked cross-cultural differences in the extent to which
4- to 6-year-old children used figurative language in their invented rhymes
and chants. One possible reason for cross-cultural variations in figurative
language may involve differences in the nature and extent of polysemy in
different languages. This chapter discusses (i) the broad course of develop-
ment of figurative language in early and middle childhood; (ii) cross-cultural
similarities and differences in its development and use; and (iii) the extent to
which cross-cultural variations in the development of figurative language
may reflect cross-linguistic variations concerning polysemy.
    The developmental theme is continued in the chapter "Emerging patterns
and evolving polysemies: the acquisition of get between four and ten years",
in which Brigitte Nerlich, Zazie Todd and David D. Clarke study the
acquisition of the different meanings of the polysemous verb get by children
aged between 4 and 10 years. This chapter therefore complements Raukko's
chapter devoted to the flexible use of get by adults. Fifty-nine children took
part in an experiment that involved production and ranking tasks. The pro-
duction task showed that 4-year-olds only produced the main senses of 'have',
'fetch' and 'obtain'; by 10 years a much wider array of meanings was
produced; and at all ages syntactic frames were embedded in everyday
experiences. The ranking task involved selecting the best example of get from
'obtain', 'fetch', 'go', 'become', and 'understand'. Four-year-olds' knowledge
was structured around 'obtain' as the most prototypical meaning, and 10-
year-olds' responses were best explained by a prototype model of semantic
representation. It seems that in between, that is with the 7-year-olds, knowl-
edge of meaning was not yet organized into a prototypical scene. A theoretical
                        PoLysemy and flexibility: introduction and overview   23
not at once: that sense-tagging of corpora cannot be done, and that it has
been solved.
    All the chapters in this book either propose novel approaches to polysemy,
or stringent criticisms of older approaches, or else are built on older approaches
to investigate novel problems. They all combine theoretical analysis with
empirical research and provide the reader with a multitude of examples and
a wealth of theoretical reflections. We hope that the field of poly se my research
will be enriched by them and that readers will come away with a deeper
knowledge, not only of polysemy, but of the functioning of language in mind
and in discourse, in synchrony and diachrony, in use and in acquisition.
References
Fritz, Gerd
     1998a      Historische Semantik. Stuttgart and Weimar: Metzler.
     1998b      Review of Geeraerts (1997). Lexicology 4(1): 183-192.
Geerael1s, Dirk
     1985       Cognitive restrictions on the structure of semantic change. In: Jacek
                Fisiak (ed.), Historical Semantics. Historical Word-Formation, 126-
                 153. Berlin, New York, Amsterdam: Mouton de Gruyter.
    1993        Vagueness's puzzles, polysemy's vagaries. Cognitive Linguistics 4(3):
                223-272.
    1994        Polysemy. In: R. E. Asher and J. M. Y. Simpson (eds.), Encyclopedia
                of Language and Linguistics, 3227-3228. Oxford: Pergamon.
     1997       Diachronic Prototype Semantics. A contribution to historicallexicology.
                Oxford: Clarenden.
Gibbs, Raymond w., Jr.
     1994       The Poetics of Mind: Figurative Thought, Language, and Understanding.
                Cambridge: Cambridge University Press.
Giora, Rachel
    1997        Understanding figurative and literal language: the graded salience
                hypothesis. Cognitive Linguistics 7(1): 183-206.
    1999        On the priority of salient meanings: studies of literal and figurative
                language. Journal of Pragmatics 31: 919-929.
Groefsema, Marjolein
    1995        Can, may, must and should - a relevance theoretic account. Journal
                of Linguistics 31(1): 53-79.
Heger, Klaus
    1963        Homographie, Homonymie und Polysemie. Zeitschriftfur Romanische
                Philologie 79: 471-491.
Hughes, Alva T.
    1989        Polysemy and Frequency in Word Knowledge Acquisition. Dissertation
                Abstracts International, 1989 Mar, v49 (n9-B): 4046-4047.
Hiillen, Werner and Rainer Schulze (eds.)
    1988.       Understanding the Lexicon: Meaning, Sense and World Knowledge
                in Lexical Semantics. Tiibingen: Niemeyer.
Israel, Michael
    (to appear) Systematic idiomaticity in the acquisition of English get construc-
                tions. In: Arie Vergagen and Jeroen van de Weijer (eds.), Agreement
                in Form: Conventions on Different Linguistic Levels.
Janssen, Theo A. J. M.
    (to appear) Monosemy versus polysemy. In: Hubert Cuyckens, Rem! Dirven and
                John Taylor (eds.), Cognitive Linguistic Approaches to Lexical Semantics.
                Cognitive Linguistics Research Series. Berlin: Mouton de Gruyter.
Johnson, Christopher Ronald
    1997        Learnability in the acquisition of multiple senses: SOURCE recon-
                         Polysemy and flexibility: introduction and overview      27
Raukko, Jamo
    1999        An 'intersubjective' method for cognitive-semantic research on
                polysemy: the case of get. In: Masaka Hiraga, Chris Sinha and
                Sherman Wilcox (eds.), Cultural, Psychological, and Typological
                Issues in Cognitive Linguistics, 87-105. Current Issues in Linguistic
                Theory, 152. Amsterdam and Philadelphia: John Benjamins.
Ravin, Yael and Claudia Leacock (eds.)
   2000         Polysemy and Ambiguity: Linguistic and Computational Approaches.
                Oxford: Oxford University Press.
Rueckl, Jay G.
   1995         Ambiguity and connectionist networks - still settling into a solution.
                Journal of ExperimentaL PsychoLogy: Learning, Memory and Cognition
                21(2): 501-508.
Ruhl, Charles
    1989        On Monosemy. New York: SUNY.
Schneider, Edgar
   1988         On polysemy in English, considering consider. In: Hiillen and
                Schulze (1988), 157-169.
Schulze, Rainer
    1988        A short story of down. In: Hiillen and Schulze (1988), 395-414.
   1994         Image schemata and the semantics of off In: Schwarz (1994),197-
                214.
Schwarz, Monika (ed.)
    1994        Kognitive SemantikJCognitive Semantics. Ergebnisse, Probleme,
                Pespektiven. Tiibingen: Gunter Narr Verlag.
Sinha, Chris and Tania Kuteva
   1994         Spatial and non-spatial uses of prepositions: conceptual integrity
                across semantic domains. In: Schwarz (1994), 215-238.
Sweetser, Eve
    1990        From Etymology to Pragmatics: Metaphorical and CuLtural Aspects
                of Semantic Structure. Cambridge: Cambridge University Press.
Taylor, John R.
   1992         How many meanings does a word have? Stellenbosch Papers in
                Linguistics 25: 133-168.
Traugott, Elizabeth Closs
   1994         Grammaticalization and lexicalization. In: R. E. Asher and J. M. Y.
                Simpson (eds.), The Encyclopedia of Language and Linguistics,
                Volume 3, 1481-1486. Pergamon Press. Reprinted (1999) in: Keith
                Brown and Jim Miller (eds.), Concise Encyclopedia of Grammatical
                Categories, Oxford: EIsevier Science.
Tuggy, David
   1993         Ambiguity, polysemy, and vagueness. Cognitive Linguistics 4(3):
                273-290.
30    Brigitte Nerlich and David D. Clarke
Ullmann, Stephen
    1951        The Principles of Semantics. Oxford: Basil Blackwell; Glasgow:
                Jackson, Sons and Co.
Victorri, Bernard
   1997         La polysemie: un artefact de la linguistique? Revue de Semantique
                et Pragmatique 2: 41-62.
Viegas, Evelyne and Victor Raskin
   1998         Computational Semantic Lexicon Acquisition - Methodology and
                Guidelines. Technical Report MCCS-98-315, CRL, NMSU.
Weinreich, Uriel
   1966         Explorations in semantic theory. In: ThomasA. Sebeok (ed.), Current
                Trends in Linguistics rn, 395-477. The Hague: Mouton.
Wierzbicka, Anna
   1996        Semantics: Primes and Universals. Oxford: Oxford University Press.
Wilks, Yorick
   1977        Frames for machine translation. New Scientist 76(108): 802-803.
Williams, John N.
   1992        Processing polysemous words in context: evidence for interrelated
                meanings. Journal of Psycholinguistic Research 21(3): 193-218.
Wittgenstein, Ludwig
   1974        Philosophical Grammar, edited by Rush Rhees; translated by Anthony
                Kenny. Oxford: Basil Blackwell.
Wunderli, Peter
   1995        Strukturelle Semantik, Polysemie und Architektur der Sprache. Zu
               einigen Problemen der Bedeutungsanalyse. In: Ulrich Hoinkes (ed.),
                Panorama der Lexikalischen Semantik. Thematische Festschrift aus
               Anlaft des 60. Geburtstags von Horst Geckeler, 791-806. Tiibingen:
               Narr.
Cognitive models of polysemy
John R. Taylor
l. Introduction
One of the firmest results to have come out of the Cognitive Linguistics
enterprise over the past couple of decades has been the realization that word
meanings need to be understood against broader knowledge configurations,
variously studied as "frames", "scenes", "domains", and "idealized cognitive
models" (Lakoff 1987; Langacker 1987; Croft 1993). Consider, as an illus-
tration, the word bachelor, and the fact that it would be odd to speak of the
Pope as a bachelor, even though the Pope clearly instantiates each of the
four defining features of bachelorhood, namely, "human", "male", "adult",
und "unmarried" (Fill more 1982; Lakoff 1987; Taylor 1995). The problem
arises not because the definition of bachelor in terms of these four features
is wrong in any material sense. Rather, it is because the concept "bachelor"
needs to be understood against an idealized cognitive model of bachelorhood,
und, more generally, of marriage practices. The model prescribes, for both
men and women, a "marriageable age", a stage in life at which a person is
expected to marry. Bachelor is used to designate males who have reached the
marriageable age but fail to marry, because they do not yet want the "commit-
ments" of marriage. The model is "idealized" in the sense that it offers a
simplified view of society and overlooks the many individuals and groups
who do not meet its background assumptions. For example, the model makes
!IO provision for celibate clergy. Given the presuppositions of the cognitive
model, it is quite legitimate to define a bachelor, quite simply, as a man who
husn't married. The application of the word becomes problematic with respect
10 the Pope, because the Pope lies outside the idealized situation covered by
the model.
    In this article, I examine some of the cognitive models against which we
understand the concept of polysemy. Such an exercise may be useful, for a
number of reasons. First, it may wean us away from the idea that polysemy
IR a well-defined natural category - a "brute fact" (Searle 1969: 50) about
language, as it were, which, like brute facts of the material and biological
32    John R. Taylor
ensuing ambiguities. Rather, it could be the case that the models are applicable
only to highly idealized data. Polysemy presents itself as a problematic
concept in those cases where the language data do not fit the idealized models.
    In the following sections, I discuss the models in more detail, drawing
attention to some discrepancies between certain kinds of data and the ideal
situation presupposed by the models.
(i)    Both pronunciation and conceptual content are liable to vary according
       to context of use. This kind of variation need not of itself be prob-
       lematic for the semiotic ideal. As noted, both poles of the linguistic
       sign, the "concept" and the "acoustic image", are taken to be mental
       entities, representations which are schematic for the specific semantic
       and phonological values manifest in an act of speech. Depending on
       speech tempo, degree of stress, and perhaps other factors, upper may
                                                Cognitive models of polysemy       35
   The three degrees of deviation from the semiotic ideal constitute a con-
linuum, with borderline cases often difficult to classify as, say, polysemy vs.
context-conditioned variation, or, alternatively, as polysemy vs. homonymy.
36    John R. Taylor
drivers of early generative phonology (Chomsky and Halle 1968) was the
desire to associate each morpheme of a language with a single, unique
"underlying form", from which all the surface altemants could be derived by
rule.
     Since variation is not limited to the semantic pole, the possibility arises
that deviations from the one meaning ideal may be accompanied by deviations
from the one form ideal. Langacker (1987: 398) reports that in his speech,
route is associated with two distinct phonological shapes, [raut] and [ru:t],
both pronunciations being possible for the noun, but only the latter for the
verb. Orthography suggests that we are here dealing with a single linguistic
unit (though the matter is indeed moot). Sometimes, however, orthography
is also subject to variation. (Variation at the orthographic level might be
referred to as "polygraphy".) Speakers of British-based varieties of English
often distinguish, in writing, between "program" and "programme", and
between "disk" and "disc". To be sure, the written variants may not be
differentiated in pronunciation; they are, however, likely to be differentiated
semantically, "program" and "disk" being reserved for computer-related uses.
It is again a moot point whether we are here dealing with a single linguistic
sign, unified at the phonological pole, or with two different signs differen-
tiated both semantically and orthographically (Love, to appear).
The semiotic model provides us with a set of linguistic signs, each of which
associates a meaning with a phonological form. According to the building
block metaphor, complex expressions are formed by assembling these smaller
units; conversely, complex expressions can be exhaustively segmented into
their component parts. The metaphor requires that each constituent building
block have a fixed and determinate semantic and phonological content which
it contributes to the whole. Hence, the meaning of a complex expression will
be a function of the meanings of its constituent parts, just as the phonological
form of a complex expression will result from the alignment of the phonol-
ogical forms of its constituents.
    The metaphor is pervasive in our deliberations on language and its
structure; indeed, as Langacker (1991: 186) observes, the metaphor may be
unavoidable "for expository purposes", as when we introduce beginning
linguistics students to the concept of the morpheme. Yet, paradoxically, it is
38    John R. Taylor
example, how might we characterize the concept that gets coerced into the
specific readings "novel-as-text" and "novel-as-object"?
    A more general failing of the building block metaphor is that it ignores
the possibility that the whole may be organized in ways which go beyond, or
which are even at variance with, the properties which are contributed by the
parts. Take Langacker's example of a football under the table (Langacker
 1987: 279-280). The (literal) interpretation of football under the table goes
well beyond the compositionally derived meaning of the expression. For
example, we would probably picture the football as lying on the floor within
an area circumscribed by the table's legs. It is not only that the specifics of
this scene are not encoded in the expression; the football, if we consider
the matter carefully, is not actually "under" the table at all- the football, that
is, is not located at a place that is lower than the place occupied by the
table. For the football to be "under" the table, it would have to be under
the floor on which the table is standing! Conceptualization of the global
scene has forced an elaboration and readjustment of the compositionally
derived meaning (Taylor 2002). The matter is equally evident with respect to
phonological structure. Typically, did you would have the pronunciation
[ldI.d3U], rather than the "compositional" pronunciation [dld.ju:]. It is not
just that the boundary segments of the building blocks [dId] and [ju:] have
mutually influenced each other. Rather, the complex expression has been
organized in terms of syllable and foot structure. The component syllables of
[ldI.d3U] do not match up with the units contributed by the component
structures; moreover, the two syllables have been structured in terms of a
strong-weak relation within a trochaic foot.
    The interplay of word meanings (that is, the semantic units which words,
in their status as building blocks, contribute to the whole) and the meanings
of larger expressions in which the words occur, has come to the fore in studies
of constructions. Consider Goldberg's (1995: 29) often-cited example of a
person "sneezing the napkin off the table". Sneeze does not plausibly belong
in the class of caused-motion verbs (put, push, throw, etc.). The caused-motion
sense of sneeze - "cause (a thing) to go (to a place) by sneezing on it" - is
contributed by the syntactic construction [V NP PP] in which it occurs. The
primacy of the construction over its parts is supported by the fact that in
different instantiations of the construction the meanings of the parts are
mapped in different ways onto the meanings of the complex expressions
(Mandelblit and Fauconnier 2000). Thus, in sneeze the napkin off the table,
the verb designates the causing activity, whereas in trot the horse into the
40    John R. Taylor
stabLe, the verb designates the caused activity. Appeal to constructions, then,
may again lessen the need to postulate extensive polysemy at the level of
words. On the other hand, polysemy is liable to re-emerge at the level of
constructions. As argued by Goldberg (1995), the caused-motion construction
itself has a number of distinct variants, each of which severely restricts the
range of items which are eligible to occur in it.
(i)      A language can be analysed into a lexicon and a syntax. The lexicon
         lists the words, while the syntax contains general rules for the com-
         bination of words.
(ii)     Each word associates a fixed and determinate semantic structure with
         a fixed and determinate phonological structure.
(iii)    The semantic and phonological properties of a complex expression,
         as assembled by the syntax, is a compositional function of the semantic
         and phonological properties of the component words.
     Given the extent and the ubiquity of these deviations from the idealiz-
at ions, it is not surprising that polysemy (understood as the association of
related meanings to a single linguistic form) should be such a problematic
concept, and one which has engendered so much discussion and controversy.
It was not my intention, in this article, to attempt to resolve these con-
troversies. The above discussion may, however, be useful, to the extent that
it may throw some light on why the seemingly so straightforward notion of
polysemy is so problematic. It also suggests that it may be useful to approach
the problematic aspects of polysemy, not so much by a refinement of the
concept itself, but by addressing the cognitive models which frame our
understanding of it. 2
44    John R. Tay/or
Notes
I. The point here is that. although kick the bucket can be segmented into words.
   and can even be regarded as a perfectly regular verb phrase. the (idiomatic)
   meaning of the expression cannot be distributed over the meanings of its parts.
2. Given my somewhat critical account of the cognitive models discussed in this
   chapter. it may be asked whether there are other models of language which enable
   us more accurately to conceptualize the object of our study. Here are two sugges-
   tions. which I offer without further discussion. but which I think may be worthy
   of further investigation:
   (i) The corpus model. Instead of partitioning the facts of a language into a
        lexicon and a syntax (a dictionary and a grammar book), it may be worth
        exploring the metaphor of a language as a giant corpus, and to construe
        knowledge of a language in terms of a mental corpus, comprising memory
        traces of previously encountered utterances. Words. with their phonological
        and semantic properties, would then emerge as the summed activation of
        already encountered instances.
   (ii) The cut-and-paste model. Instead of studying the production of complex
        expressions in terms of the assembly of smaller units in accordance with the
        rules of syntax, it may be more appropriate to view language production as
        a process of sticking together various bits and pieces retrieved from the
        mental corpus.
   Interestingly, neither model has a place for polysemy, as traditionally understood.
References
Brugman. Claudia
   1981       Story of over. M.A. thesis. University of California, Berkeley.
Bybee, Joan
   2001       Phonology and Language Use. Cambridge: Cambridge University
              Press.
Bybee. Joan and Joanne Scheibman
   1999       The effect of usage on degrees of constituency: the reduction of don't
              in English. Linguistics 37: 575-596.
Chomsky, Noam
   1988       Language and Problems of Knowledge. Cambridge, MA: Massachusetts
              Institute of Technology Press.
Chomsky, Noam and Morris Halle
   1968       The Sound Pattern of English. New York: Harper and Row.
                                                Cognitive models of polysemy      45
Coseriu, Eugenio
    1977        L'Etude fonctionnelle du vocabulaire: precis de lexematique. Cahiers
                de iexicoiogie 29: 5-23.
Croft, William
    1993        The role of domains in the interpretation of metaphors and metonymies.
                Cognitive Linguistics 4: 335-370.
Cuyckens, Hubert and B. Zawada (eds.)
    2001        Polysemy in Cognitive Linguistics. Amsterdam: Benjamins.
Deane, Paul
    1993        Multimodal spatial representations: on the semantic unity of "over"
                and other polysemous prepositions. Duisburg: Linguistic Agency,
                University of Duisburg.
Dewell, Robert B.
    1994        Over again: image-schematic transformations in semantic analysis.
                Cognitive Linguistics 5, 351-380.
Fillmore, Charles
    1982        Frame semantics. In: Linguistic Society of Korea (ed.), Linguistics
                in the Morning Calm, 111-138. Seoul: Hanshin.
Foolen, Ad and Frederike van der Leek (eds.)
    2000        Constructions in Cognitive Grammar. Amsterdam: Benjamins.
Geeraerts, Dirk
    1985        Cognitive restrictions on the structure of semantic change. In: Jacek
                Fisiak (ed.), Historical Semantics, ]27-153. Berlin: Mouton de
                Gruyter.
Goldberg, Adele
    1995        Constructions: A Construction Grammar Approach to Argument
                Structure. Chicago, IL: University of Chicago Press.
lackendoff, Ray
    1997        The Architecture of the Language Faculty. Cambridge, MA:
                Massachusetts Institute of Technology Press.
Kirsner, Robert S.
    1993        From meaning to message in two theories: cognitive and Saussurean
                views of the Modern Dutch demonstratives. In: Richard Geiger and
                Brygida Rudzka-Ostyn (eds.), Conceptualizations and Mental
                Processing in Language, 81-114. Berlin: Mouton de Gruyter.
Kreitzer, Anatol
    1997        Multiple levels of schematization: a study in the conceptualization
                of space. Cognitive Linguistics 8: 291-325.
Langacker, Ronald
    1987        Foundations of Cognitive Grammar, Volume 1. Stanford, CA: Stanford
                University Press.
    1988        A view of linguistic semantics. In: Brygida Rudzka-Ostyn (ed.),
                Topics in Cognitive Linguistics, 49-90. Amsterdam: Benjamins.
46     John R. Taylor
Vandeloise, Claude
   1990        Representation, prototypes, and centrality. In: Savas Tsohatzidis (ed.),
               Meanings and Prototypes, 403-437. London: Routledge.
Van der Leek, Frederike
   2000        Caused-motion and the "bottom-up" role of grammar. In: Foolen and
               van der Leek (2000), 301-331.
WunderIich, Dieter
   1993        On German um: semantic and conceptual aspects. Linguistics 31:
               II 1-133.
Polysemy: past and present
Brigitte Nerlich
1. Introduction
It is fairly well known that research into polysemy began with the work of
the French semanticist Michel Breal in the late 19th century.· It is less well
known that the "multiplicity of meaning", with or without the label "polysemy",
was quite well researched by students of literature and by lexicographers
well before Breal. It should come as no surprise that the major pathways of
polysemous sense extension, namely metaphor, metonymy and synecdoche
have also been on the philosophical and linguistic agenda for a long time.
    To fully understand the import of the concept of polysemy for contem-
porary semantic thought, it is useful to reconstruct some of the contexts in
which it appeared for the first time. We hope that such a reconstruction may
lead to a new evaluation of contemporary concepts and theories. However,
one should not expect this work to be definitive, because the field will
undoubtedly change under the pressure of new historical discoveries: the
historical survey modifying the present status of polysemy research, this latter
in turn influencing the future of the former (see Eco 1996: 744).
    The first part of this survey will trace the development of various research
traditions analysing the pathways of polysemous sense extension before the
term "polysemy" was introduced into general linguistics by Breal in 1887.
This means, effectively, looking at research done in the wide field of rhetoric,
in the philosophy of metaphor, in etymology and in lexicography, where it
was essential to find elegant and plausible ways to write lexical entries for
words with multiple meanings. The second part of this survey will be devoted
to Breal's more revolutionary treatment of polysemy and the research leading
up to it. In the last part we shall give a brief and necessarily incomplete
survey of post-Brealian research into polysemy.2
50    Brigitte Nerlich
Metaphor has had three relatively unrelated waves of fame in the 20th
century: the first brought about by Ivor A. Richards, Kenneth Burke, and
Max Black in their reflections on metaphor published between 1936 and
 1962 (Richards 1936; Burke 1962; Black 1962); the second triggered by
Roman lakobson's papers "Two aspects of language and two types of aphasic
disturbances" and ''The metaphoric and metonymic poles". published in the
1950s (Jakobson 1983 [1956b], 1956a);3 the third, more recent one, was
unleashed by the George Lakoff and Mark 10hnson book Metaphors We Live
By in the 1980s (Lakoff and 10hnson 1980; see also Lakoff and Thrner 1989;
Lakoff and 10hnson 1999). Metaphors We Live By has become a standard
text for those interested in what has now become a new paradigm in ling-
uistics, namely cognitive semantics. One of the basic claims of this new
research paradigm is that "a lexical item is typically polysemic - comprising
a family of interrelated senses, forming a network centred on a prototypical
value. Although the precise array of senses conventionally associated with
the expression is not fully predictable, neither is it arbitrary - as the net-
work evolves from the prototype, each extension is MOTIVATED in some
cognitively natural fashion, and often in accordance with a general pattern
or principle" (Langacker 1988: 392). Some of the most important patterns or
principles are metaphor, metonymy and synecdoche.
    Although well aware that modem reflections on metaphor have their roots
in Aristotle, and in Greek and Latin rhetoric, and that they have parallels in
early 20th-century developments, such as the work done by Burke or Black
(see Turner 1999), only a small number of cognitive linguists seem to ap-
preciate just how much research into metaphor, as well as metonymy and
synecdoche, was done during the 19th and early 20th centuries in the fields
of rhetoric, lexical semantics and philosophy (see Smith 1982; lakel 1999;
Nerlich 1998; Nerlich and Clarke 1997,1999, 2oo0a, 2oo0b, 2001). These
figures of speech were gradually discovered as not only having an aesthetic,
but also an essentially cognitive function.
    The foundations for this fundamentally new research into metaphor and
other core figures of speech were laid in the 17th and 18th centuries with the
work of Giambattista Vico, John Locke, lohann Gottfried Leibniz and lohann
Heinrich Lambert amongst the philosophers, and Cesar Chesneau Du Marsais
amongst the grammarians.
                                                  Polysemy: past and present         51
    Like many of his philosophical colleagues, Locke had still been sceptical
about the value of metaphors, as they seemed to obscure the direct link
between words and ideas, but at the same time he saw that metaphors were
unavoidable and deeply rooted in our language and thought. He gave the
following examples for the metaphorical grounding of mental concepts:
   It may also lead us a little towards the Original of all our Notions and
   Knowledge, if we remark, how great a Dependence our Words have on
   common sensible Ideas; and how those, which are made use of to stand for
   Actions and Notions quite removed from sense, have their rise from thence,
   and from obvious sensible Ideas are transferred to more abstruse Significations
   ... v.g. to Imagine, Apprehend, Comprehend, Adhere, Conceive, Instill, Disgust,
   Disturbance, TranquiJity, andc. are all Words taken from the Operations of
   sensible Things, and applied to certain modes of Thinking. (Locke 1975
   [1689]: Ill, i: 5)
Locke's most quoted example was perhaps that the Latin word for "mind",
that is, spiritus, had its origin in the word for "breath".4
    Leibniz, who took up this example, approached the problem not only
from a philosophical angle, but also from a decidedly etymological one
and came to the conclusion that language was in fact permeated by meta-
phors, a view that became immediately popular. The German linguist Johann
Christoph Adelung, for example, pointed out almost a century after Locke,
in his 1798 entry on "metaphor" for his grammatical critical dictionary: "In
point of fact most of our words are metaphors. The word spirit when desig-
nating a spiritual rational entity is a metaphor, as it actually means breath"
(Adelung 1798: 192).5
    In his Nouveaux essais sur l'entendement humain published in 1765
(Leibniz 1981), a reply to Locke's Essay on human understanding (1975
11689]), Leibniz also stressed the importance of tropes, especially of metaphor,
metonymy and synecdoche for the changes in meanings that one can observe
everywhere (see also Leibniz 1882, 1956).
    A contemporary of Leibniz, the French philosopher and etymologist Anne
Robert Jacques Turgot, continued the reflection on how figures of speech
can bring about changes in the meanings of words. But he also pointed out,
in a thoroughly modern way, that for these figures to work and these changes
la occur, more than just "rhetoric" is needed. The use and uptake of figures
of speech depends on our ways of conceptualizing the world, of seeing the
world in certain ways/' What one would nowadays call "polysemization" is
52      Brigitte Nerlich
for him the result of the way in which the principal idea of a word spills over
onto the secondary idea. Then the word is used in its new extension for other
ideas, based solely on the secondary meaning without regard for the primitive
one - as when we say un chevalferre d'argent (a horse "ironed" with silver).
New metaphors are coined with this new meaning in mind, and then one on
top if the other, until we reach a point when a meaning emerges which is
completely opposed to the original meaning of the word (Turgot 1756: 105b).
    Du Marsais, like Turgot a contributor to Diderot's Encyclopedie, devel-
oped a rhetoric of ordinary language use in which he studied the multiplication
of meanings as well as the forgetting of the original meaning, two processes
that go hand in hand. As early as 1730 he claimed that catachresis, or the use
of a word in a context that differs from its proper application, was funda-
mental to the extension and multiplication of meanings, and he rejected the
widespread view that catachresis was nothing but the "incorrect" (mis- or
ab-) use of a word (see Nerlich 2000). He illustrates this point of view with
the same example that Turgot used some time later.
     ... it is the ordinary costume to nail iron horseshoes under the feet of horses,
     something we call "ferrer un cheval": ... if it so happens that instead of iron
     we use silver we say that "les chevaux sont ferre d' argent" instead of inventing
     an entirely new word which nobody would understand .... And so "ferrer"
     signifies by extension shoeing a horse with silver instead of iron. (Du Marsais
     1977 [1730]: 45)
     It has however been known for a long time that we compare the visible with
     the invisible, the world of the body with the world of the spirit, sensations
     with thoughts, and that we use the same words for either. Hence words
     necessarily acquire a double and sometimes a multiple meaning. Having light
     in the room and a light in one's thoughts are examples for this way of
     speaking. (Lambert 1764: Aleth. I., §45)
   ... all the words of a language could be divided into three classes, the first of
   which doesn't need any definition, as one points to the thing itself in its
   totality, and as one can make an immediate connection between word, concept,
   and thing. Another class uses the words of the first class in a metaphorical
   way and uses a tertium comparationis instead of a definition. The third class
   comprises those words which have to be defined, and this insofar as one uses
   the words of the first two classes to do so. The words of the third class thus
   defined can again be used for definitions. It is obvious that the words of the
   third class can also be used metaphorically and that most of them are used in
   this way already. (Lambert 1764: 11, ii: 45)
In the 19th century the exploration of metaphor and other figures of speech
began to be an albeit small part of the new science of historical linguistics,
which detached it somewhat from the anthropological and philosophical
explorations of the previous centuries. Metaphor, metonymy, generalization
and specialization became part and parcel of the instruments used in histori-
cal semantics used to study and classify types of semantic change and of
meaning extension. This early form of semantics was influenced by rhetoric,
association psychology, and aesthetics (see Nerlich 1992).
                                                   Polysemy: past and present         55
   The basis for the development of ideas in words is the association of thoughts
   in the group of representations .... However, certain associations of ideas are
   preferred in human representation, and rhetoric has given them certain names,
   which however are to some extent also appropriate for semasiology, namely
   synecdoche. metonymy. and metaphor. Insofar as these so-called figures aim
   at something aesthetic. they certainly belong to rhetoric, even when individuals
   make use of them; but insofar as a usage based on these figures of speech has
   been established in a particular language, and this is particular to the nation,
   these figures should be dealt with here. (Reisig 1890 [1839]: 2; emphasis
   added)
well before Mark 10hnson (1987), proposed a close link between "the body
and the mind" as the basis for metaphor. He wrote: "metaphor is therefore
not a poetic trope, but an original form of intuition for thought". For Biese,
metaphor is the most essential inner schema of the human mind" (Biese
1893: VI). Biese was one of the most outspoken advocates of a philosophy
of the metaphoric according to which metaphor was neither just a "figure of
speech", nor just a poetic fiction or decoration, but was regarded as underlying
the structure and evolution of human thought and language.
    A more empirical study of what Eve Sweetser calls the "Mind-as-Body
Metaphor" can be found in Hans Kurath's 1921 Chicago dissertation on the
semantic sources of the words for the emotions in Sanskrit, Greek, Latin,
and the Germanic languages:
   Kurath (1921) notes that Indo-European words for the emotions are very
   frequently derived from words referring to physical actions or sensations accom-
   panying the relevant emotions, or to the bodily organs affected by those
   physical reactions. (Sweetser 1990: 28)
Although the link between the historical development of emotion words and
the psychosomatic nature of the emotions is not as fundamental as Kurath
thinks, we can see here a first insight into the metaphorical nature of expres-
sions such as "my anger boiled over" which have so frequently been studied
in modern literature on emotions and metaphors (see Koevesces 2000).
    While philosophers such as Biese explored the all-pervasive influence of
metaphor on language and thought, linguists continued to study the processes
and results of metaphorization in language, and psychologists began to ex-
plore what Gilles Fauconnier has recently called "backstage cognition", that
is, "how language works in concert with 'behind the scenes' understandings
and cognitive processes" (Fauconnier 1998: 277).
    In the following we shall first investigate how philosophers and linguists
analysed the results of the use of figurative language, namely the increase in
the spectrum of meanings of a word, what Breal called "polysemy", then
how Breal revolutionized this study.
58      Brigitte Nerlich
The modern term polysemy was popularized by Breal in 1887 (Breal 1991
[1887]). Most modern linguists dealing with the topic of polysemy refer to
this crucial date, but they rarely look further back into the past (but see
NerIich and Clarke 1997).
    The roots of the concept of "polysemy" lie in Greek philosophy, that is,
the debate surrounding the problem of the naturalness or arbitrariness of
signs as debated in Plato's (429-347 B.C.) Cratylus. In his account of Plato's
contribution to linguistics, Fred Householder points out that
     a rapid river that flows by producing new senses which, as it waves and
     whirls, come one after the other in such a way that no single one annuls the
     other; instead they accumulate and increasingly enrich this immense treasure
     of divine wisdom; everyone, according to one's own intellectual ability, can
     glean something from this inexhaustible storage of senses. (Eco 1996: 739)
  The first to use the term polysemous in a relatively modern sense was
Dante, who wrote about the polysemous character of the poem: "Istius operis
                                                     Polysemy: past and present          59
non est simplex sensus, immo dici potest polysemum, hoc est plurium
sensum" (This work doesn't have one simple meaning, on the contrary, I say
that it can be polysemous, that is can have many meanings) (quoted in OED
1989, Vol. XII: polysemous, p. 75, col. 1). As Eeo points out:
The difference between the old reading of the bible and Dante's is that Dante
is actually "taking a way of reading the bible as an example of how to read
his own mundane poem!" (Eco 1996: 741.) The debate about the literal or
figurative, the single or multiple meaning of the bible continued throughout
the 16th and 17th centuries and is still with us today (see McFague 1983).
    Thinking about meaning, language and its relation to the real and spiritual
world advanced enormously during the Renaissance (see Was wo 1987), but
real mundane research into the multiplicity of meaning only began in the
18th century, with the study of neologisms, synonyms and the figures of
speech. The emancipation of semantics or the study of meaning from religious
lhought was followed by the emancipation of semantics from etymology or
the quest for the true original meaning of a word.
    In the early 19th century the literary scholar August Wilhelm Schlegel
spoke about the difficulties encountered when dealing with "les termes
polysemantiques" in dictionaries (Schlegel 1832: 42) and the etymologist
August Friedrich Polt pointed out in 1861:
   that even without the creation of new words a language can be enriched
   through the new conceptual use of old words (Polysemantics [Polysemantie])
   alone. at least on its internal level, that is, semantically, linguistic structures
   can gain an extraordinary depth and focus. (Pott 1861: 5)
60    Brigitte Nerlich
problem for computers - see Kilgarriff, this volume; Wilks, this volume.)
The most important factor that brings about the multiplication of meanings
diachronically and that helps us to "reduce" the multiplicity of meanings
synchronically is the context of discourse, a fact already mentioned by
Leibniz. We understand polysemous words because the words are always
used in the context of a discourse and a situation, which eliminate all the
adjoining meanings in favour of only the one in question (Brea11991 [1887]:
156-157).
    However, in the constant dialectical give and take between synchrony and
diachrony, and between meaning and understanding, or what Traugott and
Blank call in Gricean terms conversational and conventional implicature
(Traugott 1988; Blank 1997: 366-369), incremental changes in the meaning
of words occur, insofar as hearers, having understood a word in a certain
context in a slightly divergent way, become themselves speakers and might
use a word in the newly understood way in yet another context, which again
brings about a different type of understanding, and so on. In the long run
these slight variations in use and uptake might lead to major semantic changes
(see Turgot 1756).
    More sudden shifts in meaning are brought about by the use of metaphor
and metonymy. Aristotle in The Rhetoric had already remarked that "strange
words simply puzzle us; ordinary words convey only what we know already;
it is from metaphor that we can best get hold of something fresh" (Smith
1982: 128). There are also shifts in meaning which have a more social than
poetic root, as when the word operation, studied by Breal, means something
different according to the social context in which it is used (by a mathema-
tician, a general, a surgeon, and so on). Analysing the multiplication of
meanings based on the speakers' and hearers' social and cognitive needs and
activities was central to Breal's semantics.
    Breal was fascinated by the fact that when talking to each other we neither
get confused by the multiplicity of meanings that a word can have, some of
which are listed in dictionaries of usage, nor are we bothered with the
etymological ancestry of a word, traced by historical dictionaries.
    Both the dictionary of usage and the historical dictionary classify the
meanings of polysemous words, which have been produced over time by a
nation. This is a social (abstract and decontextualized) classification, whereas
the classification of meanings in the head of a speaker and hearer is in each
case an individual (concrete and contextual) classification. Breal has in mind
an "isosynchronic competence", a half-conscious type of user knowledge, which
62      Brigitle Nerlich
only works inside concrete situations (see Desmet and Swiggers 1995: 283).
Modem polysemy research still debates whether it should predominantly deal
with the former type of polysemy or with the latter, and how to reconcile the
one with the other.
    Breal observed that most of the time it is the latest, most modem meaning
of the word, yesterday's or today's meaning, with which we first become fam-
iliar (Breal 1991 [1884]: 149) - something recently rediscovered in England:
     The child who hears a word first pronounced retains it with the meaning with
     which he has heard it used, and he associates it in his thought with the object
     itself to which it was applied, and to no other. We could say that for him all
     names are at first proper names. If a little later he hears the same word applied
     to other similar objects, then he learns to generalize the meaning ....
         Let us assume that the same word returns in another context, with another
     meaning [valeur]. If the two meanings are not too far apart, the child senses
     that it is the same word; he perceives the connexion between the two meanings
     and, taking stock of the distance travelled, he extends his initial concept. ... If
     on the other hand, the two meanings are too distant, he does not try to connect
                                                     Polysemy: past and present         63
     them, and he takes account of the new meaning as if dealing with a separate
     word. (Bn!al 1991 [1884]: 149)
   Breal was acutely aware of the fact that the semantic, cognitive, and
developmental side of language studies was not yet on a par with the advances
made in the study of phonetics, of the more physiological side of language.
In his article "How words are classified in our mind" (Breal 1991 [1884]),
Breal appeals to the future to supply us with insights into the cognitive aspects
of human language and writes:
With Breal semantics as a linguistic discipline made a first step into this
future, a future in which we are still participating and to which we are still
contributing beyond the end of the 20th century, the century of psycho-
linguistics, artificial intelligence, brain scanning and neuropsychology.
As we have seen, Breal had provided the conditions for new trends in
polysemy research, in general linguistics, in sociolinguistics, in psycho-
linguistics, and in the study of language acquisition. By the beginning of the
20th century a shift was noticeable from accommodating polysemy in his-
torical semantics towards observing and explaining polysemy in synchronic
semantics, as the study of la langue as well as the study of lexical fields. The
turn from the 19th to the 20th century also saw a steady influx of new types
64    Brigitte Nerlich
and similarity inherited from Hume and which had structured most of 19th-
century metaphor research up to and including Wilhelm Wundt, was replaced
by a psychology of consciousness and thinking, developed mostly in
Wiirzburg, and by a new Gestalt psychology developed mainly in Berlin.
Insights from both traditions were merged in empirical research into the
understanding of metaphors, especially in the works of Biihler and Wilhelm
Stahlin (1913), who both explored metaphor as a process of conceptual
integration or Sphiirenmischung.
    These advances in the psychology of metaphor and polysemy were accom-
panied by advances in the sociology of poly se my. The polysemy of words
was explained by looking at how different social groups use one and the
same word for different purposes. An example (already found in Breal) for
the social differentiation underlying some types of polysemy was the word
operation, as used by a mathematician, a general or a surgeon, for example
(Meillet 1921 [1905]: 193). The end-point and summa of this tradition of
studying polysemy from a linguistic, conceptual, psychological and socio-
logical point of view was the work of the semanticist Stephen Ullmann
(1951).
    There followed a period of polysemous latency, so to speak, after the
advent of transformational generative grammar with its focus on syntax and
later feature semantics. However, polysemy was not completely forgotten,
as illustrated by the research undertaken by Hans Blumenberg (1960), Uriel
Weinreich (1962, 1996), Harald Weinrich (1967), James McCawley (1968),
Jurij Apresjan (1974), Charles Fillmore (1975), Andrew Ortony (1975), and
Alfonso Caramazza and Ellen Grober (1976).
    At the same time a new wave of polysemy research, under the heading of
fJoly-isotopie and polyphonie, developed in France in the framework of
general rhetoric and text analysis. This type of polysemy research harks back
to Dante's first use of the word "polysemous" as indicating the mUltiple
readings of a poetic text. It was directly influenced by the rediscovery of
work on the multiplicity of meanings, polyphony, and heteroglossia by the
Russian philosopher of language Mikhael Bakhtin (see Todorov 1977).
    In the Anglo-American world, polysemy was rediscovered with the advent
of cognitive semantics in the 1980s (see Brugmann 1981; Lakoff 1987;
Langacker 1987; Geeraerts 1997). Cognitive linguists not only rediscovered
metaphor and metonymy as pathways and patterns which structure polysemous
sense extensions, they also began to reconnect synchronic and diachronic
research into meaning.
66      Brigitte Nerlich
     Synchronic polysemy and historical change of meaning really supply the same
     data in many ways. No historical shift of meaning can take place without an
     intervening stage of polysemy ... But if an intervening stage of polysemy was
     involved, then all the historical data, as evidence of past polysemy relations, is
     an interesting source of information about the reflection of cognitive structures
     in language. Even more crucially, the historical order in which senses are
     added to polysemous words tells us something about the directional relation-
     ship between senses; it affects our understanding of cognitive structure to
     know that spatial vocabulary universally acquires temporal meanings rather
     than the reverse. (Sweetser 1990: 9)
since the first efforts at systematizing the study of "la polysemie" were made
by Breal at the end of the 19th century.
Notes
I. Breal created the term in 1887 (Breal 1991 [1887]: 156-157) in his review of
     two seminal books: Arsene Darmesteter's book La Vie des mats (1887) and the
     second edition of Hermann Paul's book Prinzipien der Sprachgeschichte (1886
     [1880)). He used it again in his famous Essai de semantique in 1897, and from
      1900 onwards, when the Essai was translated into English, the term palysemy
     made its way into mainstream linguistics.
2.   Some parts of this chapter are based on previously published papers, such as
     Nerlich and Clarke (1997, 2000a, 2(01) and Nerlich (1998).
3.   lakobson's work launched two famous schools of thought, integrating research
     on metaphor with structuralism, hermeneutics and speech act theory via the work
     of Claude-Levi Strauss and Paul Ricoeur.
4.   In his seminal article on the inescapable metaphoricity of language, H. WaIter
     Schmitz quotes the Dutch philosopher lohann Clauberg (1622-1665) as the first
     to use this example (see Schmitz 1985). Those interested in Locke's views of
     metaphor and truth should read Oosthuizen Mouton (to appear).
5.   All translations are mine, BN.
6.   This type of argument can be found again in modern proponents of "cognitive
     rhetoric", such as Dan Sperber (1975) and Mark Turner (1991).
7.   "Explaining the meaning of a word by ostension, by pointing to something to
     which the word applies, has been variously thought to constitute (i) a form of
     explanation which provides language with a foundation, (ii) an explanation which,
     in presupposing a general grasp of language, is only secondary, and (iii) a
     procedure which does not qualify as a definition or explanation at all. While
     ostension may serve to point the learner in the right general direction, there is
     certainly a question as to how much eventual understanding may owe to any
     such procedure, and how much it requires exposure to word usage over a period
     of time." (http://www.xrefer.com/entry/55307l)
H.   This view echoes Lorenzo Valla's 15th-century saying that "housewives some-
     times have a better sense of the meaning of words than the greatest philosophers"
     (quoted in Waswo 1987: 95).
68    Brigitte Nerlich
References
Bryson, Bill
   1990         Mother Tongue: The English Language. London and New York:
                Penguin Books.
Buhler, Karl
    1990 [1934] Theory of Language: The Representational Function of Language.
                Amsterdam and Philadelphia: John Benjamins. English translation
                by Donald Fraser Goodwin, edited with an introduction by Achim
                Eschbach.
Burke, Kenneth
   1962         A Grammar of Motives and a Rhetoric of Motives. Cleveland: The
                World Publishing Company.
Cadiot, Pierre
   1992         Extensions et glissements polysemiques ... d'une langue a l'autre.
                In: Michele Lorgnet (ed.), Atth delta Fiera Internazionale della
                Traduzione J, Riccione, (decembre 1990), ateneo, Forli: 31-57.
   1993         Representations d'objets et semantique lexicale: qu'est ce qu'une
                bofte? Journal of French Language Studies 4: 1-23.
Caramazza, Alfonso and Ellen Grober
   1976         Polysemy and the structure of the subjective lexicon. In: Clea Rameh
                (ed.), Semantics: Theory and Application, 181-206. Washington, DC:
                Georgetown University Press.
Danesi, Marcel
   1990         Thinking is seeing: visual metaphors and the nature of thought.
                Semiotica 80(3/4): 221-237.
Darmesteter, Arsene
   1887         La vie des mots etudiee dans leurs signijications. Paris: Delagrave.
Desmet, Piet and Pierre Swiggers
   1995         De la grammaire comparee Cl. la semantique. Textes de Michel Breal
                publies entre 1864 et 1898. Introduction, commentaires et bibliographie
                par Piet Desmet et Pierre Swiggers. Leuven and Paris: Peeters.
Dirven, Rene and Johan Vanparys (eds.)
   1995         Current Approaches to the Lexicon. Frankfurt: Peter Lang.
Du Marsais, Cesar Chesneau
   1977 [1730] Traite des Tropes ou des differents sens dans lesquels on peut prendre
                un meme mot dans une meme langue. Postface de Claude Mouchard,
                suivi de Jean Paulhan, Traite des Figures ou La Rhetorique decryptee.
                Paris: Le Nouveau Commerce.
Eeo, Umberto
   1996         History and historiography of semiotics. In: Roland Posner, Kurt
                Robering and Thomas A. Sebeok (eds.), SemiotiklSemiotics: Ein
                Handbuch zu den zeichentheoretichen Grundlagen von Natur und
                KulturlA Handbook on the sign-theoretic foundations of nature and
                culture. Volume 1,730--746. Berlin and New York: Waiter de Gruyter.
70    Brigitte Nerlich
Jakel,Olaf
   1999        Kant, Blumenberg, Weinrich: some forgotten contributions to the
               cognitive theory of metaphor. In: Raymond Gibbs Jr. and Gerdard J.
               Steen (eds.), Metaphor in Cognitive Linguistics. Amsterdam and
               Philadelphia: John Benjamins.
Jakobson, Roman
    1956a      The metaphoric and metonymic poles. In: Roman lakobson and
               Morris Halle (eds.), Fundamentals of Language, 76--82. Gravenhage:
               Mouton and Co.
    1956b      Two aspects of language and two types of aphasic disturbances. In:
               Krystyna Pomorska and Stephen Rudy (eds.), Language in Literature,
               95-120. Cambridge, MA; London, England: The Belknap Press of
               Harvard University Press 1983.
Johnson, Mark
    1987       The Body in the Mind: The Bodily Basis of Meaning, Imagination
               and Reason. Chicago: Chicago University Press.
Kleiber, Georges
    1990       La semantique du prototype. Paris: Presses Universitaires de France.
    1999       Problemes de semantique: La Polysemie en questions. Paris: Presses
               Universitaires du Septentrion.
Knobloch, Clemens
    1988       Geschichte der psychologischen Sprachauffassung in DeutschLand
               von 1850 bis 1920. Ttibingen: Niemeyer.
Koch. Peter
    1998       Saussures mouton und Hjelmslevs trre: zwei Schulbeispiele zwischen
               Semstructur und Polysemie. In: Edeltraud Wemer, Ricarda Liver.
               Yvonne Stork, and Martina Nicklaus (eds.), Et multum et multa.
               Festschrift fur Peter Wunderli zum 60. Geburtstag, 113-136.
               Ttibingen: Gunter Narr.
Koevecses, Zoltan
   2000        Metaphor and Emotion: Language, Culture and Body in Human
               Feeling. Cambridge: Cambridge University Press.
Lakoff. George
    1987       Women, Fire and Dangerous Things. Chicago: University of Chicago
               Press.
Lakoff. George and Mark Johnson
   1980        Metaphors We Live By. Chicago: University of Chicago Press.
   1999        Philosophy in the Flesh: The Cognitive Unconscious and the Embodied
               Mind: How the Embodied Mind Creates Philosophy. New York: Basic
               Books.
Lakoff, George and Mark Turner
    1989       More Than Cool Reason. Chicago: University of Chicago Press.
72    Brigitte Nerlich
Rastier Fran~ois
    1987        Semantique interpretative. Paris: Presses Universitaires de France.
Reisig, Christian Karl
    1890 [1839] Vorlesungen uber lateinische Sprachwissenschajt. Mit den Anmerkungen
                von Friedrich Haase. Unter Benutzung der hinterlassenen Manuskripte
                neu bearbeitet von Hermann Hagen. Zweiter Bd. 1890. Lateinische
                Semasiologie oder Bedeutungslehre. Neu bearbeitet von Ferdinand
                Heerdegen. Berlin: Calvary.
Richards, Ivor Armstrong
    1936        The Philosophy of Rhetoric. Second printing 1950. New York: Oxford
                University Press.
Ritter, Joachim (ed.)
    1972        Historisches Worterbuch der Philosophie; herausgegeben von
                Joachim Ritter; unter Mitwirkung von mehr als 700 Fachgelehrten;
                in Verbindung mit Gunther Bien ... [et al.]. Base1: Schwabe 1971-
                <1998>. From Volume 4 edited by Karlfried Grunder.
Schepping, Marie-Therese
    1994        Un cas de polysemie dans le domaine verbal: arriver. In: Hiltraud
                Dupuy-Engelhardt (ed.), Actes d'EUROSEM 94. Colloque du 20 au
                23 septembre 1994, Reims.
SchlegeJ, August Wilhelm von
    1832        Rejlexions sur l'etude des langues asiatiques adressees a Sir lames
                Mackintosh, suivies d'une lettre a M. Horace Hayman Wilson. Bonn:
                Weber; Paris: Maze.
Schlieben-Lange, Brigitte
    1997        Uberrlegungen zu einer einfachen Systematik der Zuweisung von
                (polysemen) Lesarten. In: Ulrich. Hoinkes and Wolfgang Dietrich
                (eds.), Kaleidoskop der Lexikalischen Semantik, 239-247. Tiibingen:
                Narr.
Schmitz, H. WaIter
    1985        Die durchgangige Tropisierung der Sprache. Uber einen Aspekt von
                "Zeichen im Wandel". In: Klaus D. Dutz and Peter Schmitter (eds.),
                Historiographia Semioticae: Studien zur Rekonstruktion der Theorie
                und Geschichte der Semiotik, 241-270. Miinster: MAkS Publikationen.
Schumacher, Rene
    1997        "Metapher". Erfassen und Verstehen frischer Metaphern. (Basler
                Studien zur deutschen Sprache und Literatur 75.) Tiibingen und Basel:
                A. Francke.
Schwarze, Christoph
    1995        Polysemie im Sprachvergleich. In: WaIter Dahmen et al. (eds.),
                Konvergenz und Divergenz in den romanischen Sprachen, 204-219.
                Romanistisches Kolloquium vm. Tiibingen: Gunter Narr Verlag.
                                                  Polysemy: past and present      75
Turner, Mark
    1999        Contribution to cogling list, 30 January 1999 on "history of cognitive
                linguistics". X-UIDL: 284bda4409504c43ad4da2924844ab36
Ullmann, Stephen
    1951        The Principles of Semantics. Oxford: Basil Blackwell; Glasgow:
                Jackson, Sons and Co.
    1962        Semantics. Oxford: BIackwell.
Ungeheuer, Gerold
   1980         Lamberts semantische Tektonik des Wortschatzes als universales
                Prinzip. In: Gunter Brettschneider and Christian Lehmann (eds.),
                Wege der Universalienforschung. Sprachwissenschaftliche Beitriige
                zum 60. Geburtstag von Hansjakob Seiler, 87-93. Tiibingen: Gunter
                Narr.
Vico, Giambattista
   1948 [1725] The New Science of Giambattista Vico (Revised edition; T. G. Bergin
                and M. H. Fisch, editors and translators). Ithaca, NY: Corn ell
                University Press, 1948.
Victorri, Bernard and Catherine Fuchs
   1996         La Polysemie: Construction Dynamique Du Sens. Paris: Hermes.
Was wo, Richard
   1987         Language and Meaning in the Renaissance. Princeton: Princeton
                University Press.
Wegener, Philipp
   1991 [1885] Untersuchungen iiber die Grundfragen des Sprachlebens. HaIIe a.d.S.:
                Niemeyer. New edition by E. F. K. Koerner with an introduction by
                Clemens Knobloch. Amsterdam and Philadelphia: Benjamins 1991.
Weinreich, Uriel
   1962         Lexicographic definition in descriptive semantics. In: Fred W.
                Householder and Sol Saporta (eds.), Problems of Lexicography, 25-
                43. Bloomington, IN: Indiana University Press. Reprinted in:
                Weinreich (1980), 295-314.
   1966         Explorations in semantic theory. In: Thomas A. Sebeok (ed.), Current
                Trends in Linguistics, Volume 3, 395-477. The Hague: Mouton.
   1980         On Semantics. Edited by WilIiam Labov and Beatrice S. Weinreich.
                Philadelphia: University of Pennsylvania Press.
Weinrich, Harald
   1967         Semantik der Metaphor. Folia Linguistica I: 3-17.
Cognitive approaches
Polysemy and conceptual blending'"
1. Introduction
Science is an eternal battle against common sense. How can it be the earth
that moves, when we so clearly see the sun travelling majestically across the
sky? Since when do feathers fall as fast as stones? Where did imaginary
numbers ever come from?
    In thinking about meaning, common sense is no less of an obstacle. What
could be more obvious than the platitude that words have meanings, that
"dog" means dog and "house" means house? This reasonable and simple
view serves us well in everyday life and is widely shared. And yet there is
considerable evidence that it is deeply wrong - not just wrong because it is
oversimplified and in need of refinement, but more deeply wrong and mis-
leading in the very notion of "meaning" that it takes for granted.
   The clash between our common-sense, self-evident view of meaning
contained in words and the infinitely more complex and remarkable reality
of meaning construction has certainly not escaped the notice of thinkers
through the ages. But it is only recently that we have started to come to grips
directly with the dynamics of on-line meaning construction and the wealth
and variety of cognitive capacities that we bring to bear on the most ordinary,
mundane situations.
   It has been useful, in approaching such issues, to forget notions like
"meaning of an expression", "semantic representation", "truth-function", and
the like, and to think instead of the "meaning potential" of a language form.
Meaning potential is the essentially unlimited number of ways in which an
expression can prompt dynamic cognitive processes, which include concep-
tual connections, mappings, blends, and simulations. Such processes are
inherently creative, and we recognize them as such when they are triggered
or produced by art and literature. In everyday life, the creativity is hidden by
the largely unconscious and extremely swift nature of the myriad cognitive
operations that enter into the simplest of our meaning constructions. It is
also hidden by the necessary folk-theory of our everyday behaviour which is
80    Gilles Fauconnier and Mark Turner
based quite naturally on our conscious experience rather than on the less
accessible components of our cognition.
    In this article, we look at some aspects of polysemy which derive from
the power of meaning potential. More specifically, we focus on aspects linked
to the operation of conceptual blending, a major cognitive resource for
creativity in many of its manifestations.
    Poly se my is pervasive in language and appears in many forms. It is not
just an accident of history or of synchrony, but rather an essential manifes-
tation of the flexibility, adaptability, and richness in meaning potential that
lie at the very heart of what a language is and what it is for. It is also a
symptom (rather than a primitive component) of the way in which various
cognitive operations allow for creativity at many levels.
    The diversity and wide range of polysemy, and the wealth of theoretical
implications associated with it, are richly attested in the present volume. In
this article we review a number of cases of polysemy associated with concep-
tual blending. A majority of these cases have been discussed in other contexts
and for a variety of theoretical purposes. Rather than repeat the analyses in
extenso, we will frequently refer the interested reader to the appropriate source
for a more detailed treatment. In the sources, polysemy was seldom in itself
the major focus. By bringing together a large number of cases in the present
context, we hope to give an idea of the overall importance of blending in
polysemy phenomena.
On 8 July 1999, The New York Times reported that Hicham el-Guerrouj had
broken the record for the mile, with a time of 3:43.13. The illustration that
accompanied the article, reproduced in Figure 1, shows a one-quarter mile
racetrack with six figures running on it, representing el-Guerrouj in a race
against the fastest milers from each decade since Roger Bannister broke the
4-minute barrier in 1954. EI-Guerrouj is crossing the finish line as Bannister,
trailing everyone, is still 120 yards back. This illustration prompts us to
construct a conceptual packet that blends together structure from six separate
input mental spaces, each with a one-mile race in which the record is broken
by a runner. The blend places all six of the runners on a single racetrack,
with a single beginning time.
    This blend has the familiar features of conceptual integration networks.
                                                               Polysemy and conceptual blending   81
time. Subtracting the distance travelled from 1 mile yields the distance by
which the runner trails el-Guerrouj. For example, Bannister trails el-Guerrouj
by [1760 yards] - [(3:43.13/3:59.4)(1760 yards)] = 120 yards, rounding to
the nearest yard.
    To see further that there is nothing automatic or inevitable about this
blend as an instrument for highlighting competition and record-breaking, we
can compare it to the blend for the history of breaking the distance record
for a fixed time in bicycling. In the standard one-hour competition in bicycling,
the time of the performance is invariant, while the distance varies. So one
breaks the record by going farther in one hour than anyone else has ever
gone in one hour. Now, for this blend, we can project both the time and the
distance for each of the previous record holders without having to perform
any calculation. We simply place all the record-holders on the same track,
each at the distance he had achieved after one hour. In fact, the blend for the
milers looks like the blend for the bicyclists, but in the first case some
aggressive manipulations were required to achieve the blend. In the bicycle
competition, the contestants in the inputs and in the blend all do in fact stop
after an identical period of time has elapsed, namely one hour. In the mile
race, the contestants in the blend effectively stop competing to win the
moment the winner crosses the finish line, even though their counterparts in
the input spaces continue to compete to win, to finish the mile, and in fact to
break the record.
    In the rest of this article, we will show how various kinds of polysemy
occur as a result of blending. We will argue that the following principles
guide the development of polysemy and furthermore that most polysemy is
invisible:
        that it could not have been used to pick out if the blend had not been
        built.
(iv)    Blending provides a continuum for polysemy effects. Polysemy is an
        inevitable and routine outcome of blending, but it is only rarely
        noticed. The noticeability of polysemy is a function of the availability
        of certain frames through defaults, contexts, or culture.
have just seen with the example of complex numbers, category extension
occurs by blending, which is not simply an operation of adding and deleting
features. We now turn to an example where it does not seem even intuitively
that the blend arises by adding or deleting features.
     Coulson (1997) analyses caffeine headache as having two conventional
readings, one in which the headache is caused by caffeine, the other in which
the headache is caused by lack of caffeine. In this second case, we need an
integration network involving a general schema for a headache and its cause,
a present scenario in which the person with the headache has had no caffeine,
and a counterfactual scenario in which someone has had caffeine and so has
no headache. In the blend, the particular person with the headache has had
no caffeine, the lack of caffeine is the cause of the headache, and the term
"caffeine" has been projected from the all-important and desirable counter-
factual scenario in which there is no headache. Although this looks and is
intricate, it is an instance of a general pattern in which the integration network
contains a fully activated and highly important mental space that is counter-
factual to the blend, and the simple term for the blend is taken from the
counterfactual scenario. For example, a nicotine fit is a fit caused by lack of
nicotine, where the term "nicotine" is taken from the counterfactual scenario
in which the person does not have the fit. Easy error, used in tennis to describe
an error in making what should have been an easy shot, takes the modifier
for the error from the counterfactual space. Perhaps most conventionally, a
money problem activates a counterfactual space where there is money and
no problem. In the blend, there is a problem, a causal relation, and a cause,
namely, no money, but the term indicating the cause is taken from the
counterfactual space in which there is money and therefore no problem.
     We see that terms like caffeine headache have more than one meaning,
being polysemous, because there is more than one blending possibility. The
striking possibility pointed out by Coulson depends on the general availability
of blending networks that have two highly active input spaces where one is
directly counterfactual to the other in some crucial respect. For examples
like money problem and caffeine headache that are licensed by such networks,
it is very clear even at the intuitive level that the polysemy cannot be a result
of adding and deleting semantic features attached to the two words. A caffeine
headache situation, on the counterfactual reading, has no features of caffeine.
In fact, its indispensable feature is a total absence of caffeine.
86     Gilles Fauconnier and Mark Turner
3. Gradients of blending
(2)    Zeus was the father of Sarpedon. He watched from Mount Olympus
       as his mortal son met his fated death.
   This example points to the fact that there was more pattern completion
and projection from inputs in the Paul and Sally case than we had realized.
In Paul is the father of Sally, we quietly projected from Input 1 a range of
conventional knowledge, such as the mortality of the father and his normal
paternal limitations. But it is a technical fact of blending that we can project
equally from either input. So in the case of Sarpedon, the framing of the
father's powers and limitations comes not this time from Input 1, but from
Input 2.
   Now consider:
(3)    Zeus is the father of Athena. She was born out of his head. fully clad
       in armour.
her to school, reading bedtime stories. That neighbour can say to Sally: "I'm
your father for today". Like the Zeus and Joseph blends, some family structure
and genealogy is projected. As in the Joseph blend (but not the Zeus blends),
progeneration is not projected. Many of the typical aspects of the father-
offspring relationship are projected (routines, taking care, responsibility,
affection, protection, guidance, authority, and so forth). Compositionality is
no longer at all an option to account for this case. Too many properties felt
as central are missing. We have moved along the conceptual integration
network continuum from the pole of "Fregean" networks. But clearly, we
have not reached a point on the continuum that would be felt intuitively to
be metaphorical. Fatherhood is not a metaphor for what the neighbour is
doing. In fact, although some analogy has now contributed to the mapping,
the function of this blend is stronger than just analogy between the neigh-
bour's actions and a father's actions. The neighbour in this local context is
really filling in the role of the father in relevant respects, not just doing
something "similar" to what the father does. The flexibility of blending with
selective projection and contextual elaboration allows for this intermediate
kind of situation which doesn't fit a prototypical semantic or pragmatic
characterization.
    In the Zeus and Joseph cases, there are obvious Principle (i) and Principle
(ii) polysemy effects. By Principle (i), "father" is projected to the blend from
the father-ego input, but now picks out new meaning in the blend. By
Principle (ii), we can now refer in general, across all contexts, to Zeus as
"the parent of Athena", whereas, by contrast, Paul cannot be referred to in
all contexts as "the parent of Sally". We can refer to birth as "leaving Zeus's
head" in the way we normally refer to birth as "leaving the womb". Many
similar expressions, each using words that already apply to the inputs, can
be fashioned that pick out meaning only in the blend. We can also refer to
Joseph as "Jesus's mortal father", giving "mortal father" a contrastive rather
than redundant meaning, which is likewise inappropriate for the father-ego
input.
    Consider further examples linked to father:
      They are further along the continuum. The first example still has people
                                        Polysemy and conceptual blending      89
in both inputs. From the kinship input that provides the word "father", we
project not progeneration at all but instead authority, size of the family,
responsibility, leadership, social role. From the second input, we project
specific properties of catholicism.
    The second example arguably projects the role of a child to a single social
entity (the Church). The blend reflects a type of socio-cultural model, in
which a social entity (church, nation, community) is the "child" of its leader.
The word "father" is now felt to have a different meaning, but not a particu-
larly metaphoric one.
    With the George Washington sentence, we go a little bit further by high-
lighting the causality in time between the parent and child, and between the
founder and the nation. This abstraction increases the perceived difference
between the two inputs and their domains. The impression of metaphor is
undoubtedly stronger. And that subjective impression reaches a higher point
when the two domains are even more explicitly distinguished, as in Newton
is the father of physics. Physics, as opposed to church and country, does not
even stand in metonymic relation to people and groups of people. Yet Newton
and Washington as adult men have all the criterial biological features of
possible fathers plus some of the stereotypical social ones (authority, respon-
sibility ... ). The conceptual integration networks directly bring in frame
structure from both inputs.
    Even more subjectively metaphorical are cases like Pound's Fear, father
of crueLty (Turner 1987), where the two domains (emotions/qualities and
people/kinship) have no literal overlap at all, and the projected shared schema
is correspondingly abstract (causality). And finally, Wordsworth's acrobatic
metaphor The Child is Father of the man comes around almost full circle by
using background knowledge (children grow into men) to create emergent
structure in the blend giving a rich instantiation to the abstract generic causal
structure which maps kinship to the human condition in an unorthodox way.
The oddness of its counterpart connections and the extensive drawing on the
frames of both inputs to create a new organizing frame for the blend help
make Wordsworth's line feel figurative. But the syntax and mapping scheme
of The Child is Father of the man are the same as the syntax and the mapping
scheme of Paul is the father of Sally.
    The kinds of blend we have been talking about are often constructed using
language. The reason language can prompt for blends that result in the same
word's being used to pick out different meanings is that language does not
represent meaning directly; it instead prompts for the construction of meaning
90    Gilles Fauconnier and Mark Turner
situation. Here, too, the linguistic expression selects "caffeine" and "head-
ache" in order to identify the disanalogous counterparts.
    Now consider a word like "likely". Sweetser (1999) considers the case in
which likely candidate means not someone likely to become a candidate or
succeed as a candidate but, e.g., a candidate likely to grant an interview. As
she writes, "So long as we can think up a scenario relative to the candidate
in question, and evaluate that scenario for likelihood, likely candidate can
mean the candidate who figures in the scenario we have labeled as likely."
On her analysis, conceiving of such a scenario and evaluating it consists of
finding a blend of the frame for likelihood, conceived of as probability of
occurrence in a sequence, and the frame for candidate. Like "safe" above,
"likely" prompts for a blend. Sweetser's examples make the point clearly
that the scenarios necessary to do the appropriate blending mayor may not
be connected at all to the particular lexical items (e.g. "candidate"), as we
see, for example, in the case where possible textbook refers to a textbook
that might possibly be chosen as the one to be used in a college course.
Just as the different meanings of "safe" may go unnoticed, so the different
meanings of "possible" and "likely" may go unnoticed. But from a logical
standpoint, a possible textbook in the sense of one that may be adopted is not
the same as a possible textbook in the sense of one that might exist, or might
be written, or a trade book that could double as a textbook. As before, in the
cases of "likely" and "possible", blending opens the possibility of extensive
polysemy in the logical sense, but which may go unnoticed. This is no
accident, since the function of these linguistic forms, like the linguistic forms
"Adj-Noun" and "Noun-Noun", is to prompt for blending.
Paul is the father of Sally, Father of cruelty, Father of the Catholic Church,
Vanity is the quicksand of reason, Wit is the salt of conversation, and so on
does not single out any particular blend or even any particular projection; it
only prompts for finding a way to construct a conceptual network that will
have a relevant meaning. What we have to do to construct that network is
nowhere represented in the linguistic structure. The single word "of' is thus
associated with an infinity of mappings. Of course, this infinity of mappings
is anything but arbitrary. It is constrained itself by the requirements on con-
ceptual integration networks. Different grammatical forms prompt different
infinities of conceptual mappings.
    Because linguistic expressions prompt for meanings rather than represent
meanings, linguistic systems do not have to be, and in fact cannot be, ana-
logues of much richer conceptual systems. Prompting for meaning construc-
tion is a job they can do; representing meanings is not. As we have shown in
this article, a byproduct of constructing conceptual integration networks will
be massive, though often unrecognized, polysemy.
Notes
References
Coulson, Seana
   1997        Semantic leaps: the role of frame-shifting and conceptual blending
               in meaning construction. Ph.D. dissertation, University of California,
               San Diego.
Fauconnier, Gilles and Mark Turner
   1998        Conceptual integration networks. Cognitive Science 22(2): 133-187.
   2002        The Way We Think: Conceptual Blending and the Minds Hidden
               Complexities. New York: Basic Books.
Sweetser, Eve
   1999        Compositionality and blending: working towards a fuller understanding
               of semantic composition in a cognitively realistic framework. In:
               Theo Janssen and Gisela Redeker (eds.), Scope and Foundations of
               Cognitive Linguistics, 129-162. The Hague: Mouton De Gruyter.
               (Cognitive Linguistics Research Series.)
94     Gilles Fauconnier and Mark Turner
Turner, Mark
     1987     Death is the Mother of Beauty: Mind. Metaphor; Criticism. Chicago:
              University of Chicago Press.
Turner, Mark and Gilles Fauconnier
   1995       Conceptual integration and formal expression. Journal of Metaphor
              and Symbolic Activity 10(3): 183-204.
Reconsidering prepositional polysemy networks:
the case of over *
1. Introduction
recently, Chomsky has reasserted this stance: "I understand the lexicon in a
rather traditional sense: as a list of 'exceptions', whatever does not follow
from general principles" (1995: 235). Models within this framework have
tended to represent different word senses as distinct lexical items (Croft 1998).
Polysemous forms are simply represented as an arbitrary list of discrete words
that happen to share the same phonological form.
    Over the years, this stand has been criticized for failing to account for
systematic ways in which numerous forms are clearly related (Jackendoff
 1997; Langacker 1991 a; Levin 1993; Pustejovsky 1998). Croft (1998) notes
that a number of linguists have argued for some type of derivation within the
lexicon that would represent distinct senses as arising from a primary sense
via a set of lexical operations. By and large, these analyses have focused on
polysemy involving changes in the argument structure of verbs or alterna-
tively in category changes, and have had little to say about the type of
polysemy demonstrated by English prepositions in which syntactic category
changes are often not involved.
    In fact, most linguists (cognitive linguists excepted) have not paid much
attention to the phenomena of polysemy. Pustejovsky notes that "The major
part of semantic research ... has been on logical form and the mapping from
a sentence-level syntactic representation to a logical representation" (1998:
33). The lexicon has been represented as a static set of word senses, tagged
with features for syntactic, morphological and semantic information, ready
to be inserted into syntactic frames with appropriately matching features.
Within this tradition the lexicon has been viewed as "a finite set of [discrete]
memorized units of meaning" (Jackendoff 1997: 4).
    Cognitive linguistics takes a significantly different perspective on the
nature of the mental lexicon. Of primary importance is the notion of embodied
meaning: the meanings associated with many individuallexemes are instanti-
ated in memory not in terms of features, nor as abstract propositions, but
rather as imagistic, schematic representations. Such image-schemas are held
to be embodied, in the sense that they arise from perceptual reanalysis of
recurring patterns in everyday physical experience (see Johnson 1987;
Mandler 1992, 1996 for a developmental perspective). I Perceptual analysis
creates a new, abstract level of information - information tied to the spatio-
physical world we inhabit but mediated by human perception and concep-
tualization. The central assumption of embodied meaning stands in stark
contrast to approaches to the mental lexicon that represent lexical items as
bundles of semantic. syntactic and morphological features.
                                                         The case of over   97
the fact that previous approaches have not developed well-motivated criteria
for (i) distinguishing between distinct senses within a network versus interpre-
tations produced on-line and (ii) determining the primary sense associated
with a preposition.
    Our first objective in the present article is to outline what we term a
"principled polysemy framework". This will anchor the semantic network of
over to a foundational conceptual representation (our protoscene), deriving
directly from uniquely human perceptions of and experience with the spatio-
physical world. The protoscene we posit is a highly abstract representation
of a recurring spatial configuration between two (or more) objects. Hence,
details of the physical attributes of the objects involved in a particular spatial
scene will be shown not to involve distinct senses (contra Brugman/Lakoff).
We will argue that many of the distinct senses posited in previous approaches
are produced on-line, as a result of a highly constrained process of integrating
linguistic prompts at the conceptual level. Key to distinguishing our frame-
work from previous ones will be outlining clear, motivated methodology for
determining the protoscene associated with a preposition and distinguishing
between senses that are instantiated in memory versus interpretations pro-
duced on-line. Our second objective is to demonstrate the usefulness of the
framework by providing a complete account of the polysemy exhibited by
over.
2. Previous approaches
   In (3) there is contact between the TR, the boy, and the LM, the wall,
whereas in (4) there is not. For Lakoff, this distinction warranted two distinct
senses. Kreitzer, by claiming that the sense provided by an image-schema is
defined at the relational level (rather than at the component level), is able to
                                                         The case of over     101
argue that both usages represent only one sense of over. His insight is that
the basic spatial relation between the TR and LM remains unchanged in (3)
and (4), even though the components of the spatial scene may vary contextu-
ally. For Kreitzer, topographical features, such as contact and extendedness
of the LM, are situated at the component level, and consequently do not
delineate distinct senses or image-schemas.
    Consequently, Kreitzer argues that the plethora of separate image-schemas
posited by Lakoff can be represented by three image-schemas at the relational
level. The primary sense, which he terms overI, is static, over2 is dynamic,
and over3 is what Kreitzer terms the occluding sense. Examples of these are:
    Kreitzer posits that (8) has two construals as a result of his assumption
that over has both a static and a dynamic relational schema. Construal 1
stipulates that the clouds moved above and across the city, such that they
102     Andrea Tyler and vyvyan Evans
originated in a position not above the city, moved over the city, and came to
be in a position beyond the city. Construal 2 stipulates that the clouds moved
from a position in which they were not over the city, to a position such that
they came to be directly over the city. These construals are diagrammed in
Figures I and 2.
                              o--..~o
                                   .
                                     D
Figure 1.   The clouds moved over the city: construal 1 (after Kreitzer 1997: 305).
                                0---+0
                                           D
Figure 2.   The clouds moved over the city: construal 2 (after Kreitzer 1997: 305).
0-.0
   In (9) construals 1 (Fig. 1) and 3 (Fig. 3) are ruled out, not because over
has both a dynamic and a static sense, but because walls are not extended
landmarks (as noted in Lakoff's analysis), whereas cities are, and moved
codes a path schema. Thus, when the sentential elements are integrated, the
TR follows a path, as designated by moved, such that the TR occupies a
position relative to the LM, as specified by the mental representation for
104     Andrea Tyler and VYvyan Evans
over. The clouds move, neither away from the wall, nor in a vertical manner
without crossing the wall, but from a position prior to the wall to a position
beyond the wall. That this should be so follows from conceptual integration
of the cues prompted by the linguistic elements in the sentence. Accordingly,
we argue that a polysemy network needs to allow for the distributed contribu-
tion of meaning played by all sentential elements, as well as the constraints
imposed by our experience of the world and our ability to construct a rich
and highly dynamic conceptualization based on minimal linguistic cues.
    Another problem with Kreitzer's account is that in attempting to constrain
Lakoff's analysis he has significantly understated the amount of polysemy
appropriately associated with over. For instance, many senses touched on by
Lakoff are simply ignored by Kreitzer. We will provide a detailed examination
of the semantic network for over in Section 4. Finally, neither Kreitzer nor
Lakoff attempts a serious account of how he determined which sense of over
should be considered the primary sense. We address this issue in detail in
Section 3.2.
    The spirit of our model is coherent with a number of previous analyses
that have addressed the mUltiple meanings associated both with prepositions
(Herskovits 1986; Vandeloise 1991, 1994) and with other linguistic forms
(Cushing 1990, 1991). While these scholars differ from each other and from
us in several key assumptions (e.g. the nature of lexical representation), they
do entertain the possibility that the polysemy exhibited might be best modelled
in terms of a central (or ideal) sense. 7
associated with over in (10) and (11) can be derived from context. To see
why this is so, contrast this instance with (12), in which the covering meaning
is derivable from context.
    The TR, the tablecloth, is higher than (and in contact with) the LM, the
table. As tablecloths are typically larger than tables, and the usual vantage
point from which such a spatial scene would be viewed is a point higher
than the table, the result would be that a substantial part of the table would
be covered and so obscured from view. The interpretation that the table is
covered/obscured could be inferred from the fact that the tablecloth is over
and hence higher than the table, in conjunction with our knowledge that
tablecloths are larger than tables and that we typically view tables from above
the top of the table. Such an inference is not possible in (10) as the spatial
relation holding between the TR and the LM is one that would normally be
coded by below (i.e. the board is below the hole in the ceiling), rather than
by over, given the typical vantage point. Similarly, in (11) the spatial con-
figuration between the TR and LM would normally be coded by something
like next to. In short, unless we already know that over has a covering/
obscuring meaning associated with it, there is no ready contextual means of
deri ving this meaning in sentences such as (10) and (11). From this, we
conclude that the covering/obscuring meaning associated with over in (10)
and (11) constitutes a distinct sense.
    The two assessment criteria being proposed are rigorous and, in the light
of future empirical research, may be shown to exclude senses that are legiti-
mately instantiated in the language user's mental lexicon and hence would
have to be adjusted. Nonetheless, without prejudging future findings, we
suggest that this methodology predicts many findings that have already come
to light, and so represents a reasonable approximation for assessing where
we should draw the line between what counts as a distinct sense conventional-
ized in semantic memory, and a contextual inference produced on-line for
the purpose of local understanding. The appeal of such methodology is that
it provides a rigorous and relatively consistent way of making judgements
about whether a sense is distinct, and provides methodology that can be used
in an intersubjective way.
                                                          The case of over     107
and down. The linguistically coded division of space and spatial relations is
relativistic in nature, depending largely on construal of the particular scene
being prompted for (Langacker 1987; Talmy 1988,2000). To a large extent,
the label assigned to denote a particular TR-LM configuration is determined
in relation to other labels in the composite set. So, for instance, what we
label as up is partially determined by what we label as down. In this sense,
the meaning of a preposition that participates in a compositional set is partially
determined by how it contrasts with other members of the set. The particular
sense used in the formation of such a compositional set would thus seem to
be a likely candidate as a primary sense. For over, the sense that distinguishes
this preposition from above, under, and below involves the notion of a TR
being located higher than but potentially within reach of the LM. We expand
on this argument in the next section.
    The choice of a primary sense gives rise to testable grammatical pre-
dictions. So, for instance, if we recognize that what are now distinct senses
were at one time derived from and related to a pre-existing sense and became
part of the semantic network through routinization and entrenchment of
meaning, we would predict that a number of the senses should be directly
derivable from the primary sense. This is consistent with Langacker's (1987)
discussion of a sanctioning sense giving rise to additional senses through
extension. Any senses not directly derivable from the primary sense itself
should be traceable to a sense that was derived from the primary sense. This
view of polysemy explicitly acknowledges that language is an evolving,
usage-based system. Grammatically, for any distinct sense that is represented
as directly related to the primary sense, we should be able to find sentences
whose context provides the implicature that gives rise to the additional
meaning associated with the distinct sense. We have already discussed this
notion briefly (Section 3.1) when we considered the additional meaning of
covering/obscuring associated with over in (10)-(12). We argued that the
use of over in (10) and (11) revealed additional meaning that could not be
derived from sentential context, while the additional meaning of covering/
obscuring could be derived from context in (12). By the criterion of gram-
matical prediction, (12) constitutes evidence that a likely candidate for the
primary sense associated with over involves the TR being located higher
than the LM, as the distinct covering/obscuring sense can be derived from
this primary sense and certain sentential contexts. Of course, the covering/
obscuring sense is only one of 14; all other senses would have to be tested
against this same criterion.
I 10   Andrea Tyler and lryvyan Evans
                                     •
Figure 4. The protoscene for over.
affect the LM in some way and vice versa. For instance, because of an
independently motivated experiential correlation (Grady 1997), we con-
ventionally understand power and control being associated with an entity
who is higher than the entity being controlled (we will discuss this in more
detail when we deal with the control sense for over). In physical terms we
can only control someone or something, and hence ensure compliance, if we
are physically proximal to the entity we seek to control. If, then, in recurring
human experience, control, and hence the ability to physically influence
someone or something, is dependent upon being higher than and physically
close to the entity we seek to control, we would expect that these notions
can be designated by over but not above. While both over and above desig-
nate spatial relations which are higher than, only over also designates the
functional relation of influence, precisely because part of its spatial con-
figuration involves the notion of potential contact between the TR and LM.
Consider (15).
Mandler (1988, 1992, 1996) argues that a basic aspect of human cognition is
the ability to submit salient (i.e. recurring) real-world scenarios and spatial
scenes to perceptual analysis that gives rise to a new level of conceptualized
information which is stored imagistically in the fonn of an abstract schematiz-
ation, termed an image-schema. 11 Once stored, the image-schema is available
for integration with other conceptualizations, further analysis, and reconcep-
tualization.
    Earlier, we used the term conceptualization in a nontechnical way. In order
to distinguish our nontechnical usage from a more sharpened operationalization,
we here introduce the tenn complex conceptualization. A complex concep-
tualization is a constructed representation,12 typically (but not inevitably)
produced on-line. A complex conceptualization represents our projection of
reality (in the sense of lackendoff 1983), and can represent static and relatively
simple phenomena, e.g. The cloud is over the sun, or dynamic and relatively
complex phenomena, e.g. The cat ran over the hill and ended up several miles
away. Our claim is that the integration of linguistic forms with other cognitive
knowledge prompts for the construction of a complex conceptualization.
    In our model, the image-schemas representing the spatial configurations
associated with prepositions are termed protoscenes. 13 The primary scene
(i.e. the protoscene) associated with a preposition can be used, in conjunction
with other linguistic prompts (i.e. within an utterance), to prompt for recurring
spatial scenes and real-world scenarios.
    Figure 5 represents the complex conceptualization which would be con-
structed in the interpretation of the recurring scenario prompted by sentences
such as (16) and (17).
                                 /
                                     /   --   _.- -- -
                                              B
                                               I
                             /
                                                             \
                                                                 \
                                                                     •C
                                                                     I
Figure 5. Schematization of sentences of the type The cat jumped over the wall.
3.4.3. Atemporality
In advancing the model of word meaning on which we will base our analysis
of over in Section 4, we note, following Langacker (1987, 1991a, 1991b,
1992; see also Talmy 1988, 2000) that prepositions profile (i.e. designate) a
spatio-functional relation that is scanned (i.e. apprehended) in summary
fashion. 16 That is, they do not profile a relation that evolves through time, as
is the case for example with verbs. Verbs profile processes that are scanned
116      Anarea Tyler and Vyvyan Evans
in serial fashion. For instance, in the sentence The boy runs home from school,
the process profiled by run constitutes a process that integrates all the points
occupied by the TR, The boy, which intervene between school and home,
hence the process evolves through time by integrating these sequential
components. The result is a sequential process. This contrasts with the relation
described by a preposition, which does not evolve through time. Prepositions
represent a conceptualized relation holding between two entities (a TR and a
LM), independent of sequentially evolving interdependencies. In this sense,
prepositions can be considered to profile atemporal relations.
We have argued that not all meanings assigned to a preposition, which arise
from interpreting the particle within an utterance, are stored as distinct senses,
and that previous models have often failed to recognize the contribution of
encyclopaedic knowledge and inferencing involved in natural language proces-
sing. In deriving on-line interpretations we employ a number of inferencing
strategies. Because of space constraints we will mention just three of the
most important. In Section 4 we provide a detailed illustration of how these
strategies enable us to produce meaning on-line.
(i)      Best fit. Only a tiny fraction of all PQssible spatial relations are coded
         by discrete lexical items. In linguistic terms, prepositions represent a
         closed class, that is, English speakers have a limited set of linguistic
         choices to represent a virtually unlimited set of conceptual spatial
         relations. Speakers choose the preposition that offers the best fit
         between the conceptual spatial relation and the speaker's communic-
         ative needs. The notion of best fit represents a crucial means for
         allowing us to fill in information about a particular spatial scene. To
         our knowledge, no other linguist has specifically discussed this notion,
         but it seems to be a logical extension of the notion of relevance (Grice
         1975; Sperber and Wilson 1986).
(ii)     Knowledge of real-world force dynamics. Although a spatial scene is
         conceptual in nature, in the creation and interpretation of an utterance
         the speaker and hearer will assume that all elements in a spatial scene
         are subject to real-world force dynamics. 17 For instance, in the inter-
         pretation of a sentence such as The cat jumped over the wall, it is
                                                          The case of over     117
How might on-line meaning construction apply to the protoscene (or indeed
any distinct sense) to produce a contextualized interpretation of a preposition?
To illustrate this process, we will consider the path sense posited by Lakoff
(1987) and Kreitzer (1997). Lakoff termed this the above-across sense, while
Kreitzer called it over2. Both Lakoff and Kreitzer sought to capture the
118     Andrea Tyler and \ryvyan Evans
    Crucially, they suggested that over codes the trajectory or path as a distinct
sense instantiated in semantic memory. Following the methodology pre-
viously suggested for determining whether a sense is distinct or not, we posit
that in sentences such as (18) the interpretation that the TR follows a particular
trajectory described by "above and across" can be inferred from context.
Based on this methodology, over does not have a distinct above-across path
sense associated with it.
    The case for attributing an above-across sense to over in examples such
as (18) relies on implied reasoning which runs as follows: (i) a spatial scene
is conceptualized in which a cat starts from a position on one side of the
wall and comes to be in a position on the other side; (ii) there is nothing in
the sentence, other than over, which indicates the trajectory followed by the
cat; (iii) therefore, over must prompt for an above-and-across trajectory. But
this conclusion is a non sequitur. Simply because a trajectory is not prompted
for by specific linguistic forms (formal expression) does not entail that such
information is absent. To reach this conclusion is to assume that the lack of
formal expression coding trajectory information implicates a lack of trajectory
information per se. On this view, all elements that are salient in the interpre-
tation of a scene must be coded linguistically.
    We offer an alternative account that argues that the meaning assigned to
any utterance is radically underdetermined by the lexical items and the
grammatical structures in which they occur. That is, sentential interpretation
is largely the result of various cognitive/inferential processes and accessing
appropriate world knowledge. Consider the conceptualizations prompted for
by the sentence in (18) and contrast this with (19).
   Lakoff's full-specification account for over would argue that (18) and
(19) represent two different senses of over. For (19) he assumes that over
has a meaning that can be paraphrased as "above" while in (18) over has a
meaning, as already intimated, of "above and across", The implied reasoning
                                                        The case of over    119
for adducing that over in (19) is associated with a static "higher than" sense
runs as follows: in the interpretation prompted for by (19), (i) no motion is
involved hence there is no trajectory; (ii) the branch is located above the
wall; and (iii) the only element that indicates the location of the branch in
relation to the wall is the word over; hence, (iv) over must have an above
sense.
    We suggest that it is wrong to conclude that examples (18) and (19)
represent two distinct senses. Rather than representing prepositions as
carrying detailed information about each scene being described, we argue
that they prompt for schematic conceptualizations (a protoscene and other
distinct senses instantiated in semantic memory) that are interpreted within
the particular contexts in which they occur. Under our analysis, a path (or its
absence) is typically prompted for by the verb as it relates to other words in
the sentence. JK
    In (18), the verb jumped does prompt for a conceptualization involving
motion, which entails a trajectory. Hence, the interpretation of the above-
across trajectory of the movement in (18) is not prompted for by over (i.e.
the concept of the TR in motion is not a semantic attribute of the protoscene),
nor for any of the other distinct senses associated with over, but rather arises
from the integration of linguistic prompts at the conceptual level. Most of
the information required to integrate the linguistic prompts and construct a
mental conceptualization of the spatial scene is filled in by inferencing and
real-world or encyclopaedic knowledge. In turn, this knowledge constrains
the possible interpretations that over can have in this particular sentence. In
the interpretation of (18), encyclopaedic knowledge (as adduced in part by
the inferencing strategy pertaining to real-world force dynamics) includes
(at the very least): (i) our understanding of the action of jumping, and in
particular our knowledge of the kind of jumping cats are likely to engage in
(that is, not straight up in the air as on a trampoline and not from a bungee
cord suspended from a tree branch extending above the wall); (ii) our
knowledge of cats (for instance, that they cannot physically hover in the air
the way a hummingbird can); (iii) our knowledge of the nature of walls (that
they provide vertical, impenetrable obstacles to forward motion along a path);
and (iv) our knowledge of force dynamics such as gravity (which tells us
that a cat cannot remain in mid-air indefinitely and that if the cat jumped
from the ground such that the trajectory of its path at point B matches the
relation described by over the wall, then it would have to come to rest beyond
the wall, providing an arc trajectory). Thus, we argue that the interpretation
t 20    Andrea Tyler and \ryvyan Evans
sitions fail to prompt for the key spatial configuration that prompts the listener
to construct the complex conceptualization represented in Figure 5. Given
the conceptualization the speaker wishes to convey, the speaker chooses from
the closed class of English prepositions the one that best fits the relevant
conceptual spatial relation between the TR and LM at one point in the cat's
trajectory, which will, in turn, prompt the appropriate entailments or inferen-
ces. This inferencing strategy is the notion of best fit. Accordingly, we
reiterate that a serious flaw in both the full- and partial-specification ap-
proaches is that neither fully distinguishes between formal expression in
language, which represents certain information, and patterns of concep-
tualization, which integrate information prompted for by other linguistic
elements of the sentence. Over does not itself prompt for an above-across
sense, that is, for a path. We hypothesize that all path or trajectory information
in the examples discussed results from conceptual integration of linguistic
and other prompts, following the notion of best fit, which determines that
the relation designated by the protoscene (and indeed other distinct senses)
will not precisely capture a dynamic real-world spatial relation, which is
constantly changing, but will provide a sufficient cue for conceptualization.
    In order to illustrate the strategy of topological extension, we offer
example (20).
(20) There are afew stray marks just above the line.
the line and the LM is not within the sphere of influence of the TR. On the
basis of sentences such as She walked above the bridge, in which no contact
between the TR and LM is possible, we hypothesize that the functional
element of the protoscene for above places the focus on the notion of non-
bridgeable distance between the LM and TR. Thus, the relation in (14b) is
best designated by above. This analysis is supported if we attempt to use
over in place of above, as in There are a few stray marks over the line,
which presents the ambiguous interpretation that the marks are in contact
with the line and potentially obscuring parts of it. This interpretation arises
from the covering sense, which we will address later.
    Grice (1975) noted with his maxim of manner that in everyday con-
versation speakers generally try to avoid ambiguity, unless there is a purpose
for the ambiguity. To avoid possible ambiguity, the inferencing strategy of
attempting best fit in the choice of lexical item suggests that the speaker will
choose the protoscene (or particular sense) that best facilitates conceptualiz-
ation of the scene he or she intends the listener to construct. In light of the
strategies of topological extension and best fit, we argue that above is the
most felicitous choice to prompt for the complex conceptualization that
involves a LM (a line), and a TR (stray marks) that is higher than and not in
contact with the LM, as attested by (20).19
Our model takes the view that formal aspects of language, such as syntactic
configurations, have conceptual significance. As syntax is meaningful, in
principle in the same way as lexical items, it follows that differences in
syntactic form reflect a distinction in meaning (Lakoff 1987; Langacker 1987,
1991 a, 1991 b; Sweetser 1990; Talmy 1988, 2000). We are using the generic
term "preposition" to describe the linguistic forms we are studying. But this
term subsumes a number of formal distinctions characterized by prepositions,
verb-particle constructions (or phrasal verbs), adpreps (which are adverbial
in nature, and do not overtly code a LM, e.g. the race is over; they are
discussed in Section 4), and particle prefixes (bound spatial particles as in
overflow, overhead, and so on).21
    2.B            2.C
Above-and-     Completion
  beyond
 (excess I)
                    •             2.D                                    4.A
      •                                         3.                    Focus-of-
                                                                      attention
.-+----~D'_
    2.A
   On-the-
                                                               /
 other-side-    trajectory
     of           cluster
                   5.                                                      6.
                                                                  [ Reflexive
   5.A
   More
                          •
                         5.B            Preference                •      6.A
                        Control                                       Repetition
        • 5.A.l
       Over-and-above
         (excess IT)
ations is potentially recursive and that a distinct sense can be the result of
multiple instances of reanalysis. Moreover, we believe that a complex concep-
tualization, such as the one represented in Figure 5, can be submitted to
mUltiple reanalyses and thus give rise to several distinct senses. When a
complex conceptualization gives rise to mUltiple senses, we term the set of
senses a "cluster of senses". A cluster of senses is denoted in our represen-
tation of a semantic network by an open circle. A single distinct sense is
represented by a dark sphere.
126      Andrea Tyler and VYvyan Evans
    Notice in this sentence that the verb, is, fails to indicate any sense of
motion. In our model, the verb typically codes for motion and hence prompts
for a trajectory. Thus, the lack of motion coded by is, in turn, results in
failure to prompt for a trajectory. If there is no trajectory, there is no beginning
or endpoint, hence no principled way of deriving an on-the-other-side-of
sense from this sentential context. Native speakers nevertheless will normally
                                                         The case of over   127
    The sentence in (22) is felicitous only if the construer (the vantage point)
is located in the vicinity of point A (in Fig. 5) and Arlington is construed as
point C. Thus, the reanalysis of over which results in the on-the-other-side-
of sense involves two changes vis-a-vis the protoscene - the privileging of
point C and interpreting it as the point at which the TR is located, and a shift
in vantage point such that the construer is located in the vicinity of point A.
While the on-the-other-side-of component (point C in Fig. 5) is correlated in
experiential terms with arc-shaped trajectories and jumping over (i.e. higher
than) obstacles by TRs such as cats, without the shift in vantage point this
In (25) and (26) over is used as predicted by the protoscene but with the
additional implicatures that the LM represents an intended goal or target and
that the TR moved beyond the intended or desired point.
(25)     The arrow flew over the target and landed in the woods.
(26)     Lissa just tapped the golf ball, but it still rolled over the cup.
                                            - - - --~
                        "   ... I
                              I'
unique derivation for each distinct sense, based ultimately on the protoscene.
We do not want to posit a simplicity rubric which claims that there is one
correct analysis and deny that there may be many means of instantiating a
distinct sense in memory. We find no strong evidence that human concep-
tualization and cognition is constrained by such a dictum (contra the wide-
spread view adopted in formalist approaches to meaning in the generative
tradition; for a critique of such views see Langacker 1991a: Chapter 10, and
the discussion of the generality fallacy in Croft 1998).
    At this point we see no principled reason to rule out the possibility that
an excess interpretation might arise through an alternative route, as repres-
ented in the network by the over-and-above (excess IT) sense (5.A.1). We in
fact hypothesize that some speakers might derive an excess interpretation
through one route while others arrive at it through the other. Still others may
use both routes; the two resultant senses would then serve to inform each
other in various ways. We further argue that it is inappropriate to treat this
flexibility (or redundancy) as evidence that our model is flawed. Nor should
an alternative analysis of the derivation of a particular sense be taken to
constitute a counterexample to the overall model being posited. We see this
flexibility (and redundancy) as an appropriate reflection of the richness of
human cognition and the way in which experience is meaningful to us as
human beings.
                      ••
                  Beg,"mng •
                               I
                               I
                                   I
                                       I
                                           /
                                               ,-
                                                    ----"1\
                                                      I       End
(29)     SaLLy turned the keys to the office over to the janitor.
(30)     The teLLer handed the money over to the investigating officer.
from one point to another. This follows from the conceptualization schema-
tized in Figure 5, in which an implicature of transfer arises, a consequence
of understanding the scene as one involving the transfer of a TR from one
location, point A, to a new location, point C (see Fig. 10). We suggest that
change in location of an entity is experientially correlated with transfer of
the entity; change in position often gives rise to the implicature that transfer
has taken place. Via pragmatic strengthening, this implicature is convention-
alized as a distinct meaning component and instantiated in the semantic
network associated with over as a distinct sense. As with the completion
sense, the transfer sense involves the reanalysis of the trajectory or process.
Again, in formal terms, over is represented not by a preposition but by an
adprep. In Figure 10, the TR has been transferred from the left side of the
impediment to the right side, as represented by the dark sphere, which is in
focusY
   Given our normal interactions with tables and tablecloths - we sit at tables
                                                         The case of over    133
or walk past them such that both the table and the tablecloth are lower than
our line of vision - it follows that our typical vantage point is such that when
a tablecloth is over the table we perceive it as covering the table. This being
so, the vantage point is not that depicted in the default representation of the
protoscene, in which the viewer/construer is offstage. Rather the vantage
point has shifted so that the TR is between the LM and the cons truer or
viewer. The perceptual effect of having the TR physically intervene between
the viewer and the LM is that the TR will often appear to cover the LM or
some significant portion of it. 2H
    In accordance with the position outlined previously - that spatial scenes
can be viewed from different vantage points - the covering interpretation
results from having a particular vantage point from which the situation is
construed. When a shift in vantage point occurs. the conceptualization con-
structed is likely to involve an additional implicature not part of the interpre-
tation when the default vantage of the protoscene is assumed. In sum, we
are arguing that the conceptualization constructed in the normal interpreta-
tion of (31) involves two changes from the default representation of the
protoscene: first, the TR is perceived as being larger than the LM and second.
the vantage point has shifted from offstage to higher than the TR. 29
    The covering implicature has been reanalysed as distinct from the spatial
configuration designated by the protoscene (see Fig. 11). As noted with
examples (10) and (11), when over prompts for a covering sense, the TR
need not be construed as being located higher than the LM; hence. the
covering sense must exist independently in semantic memory.30
® Vantage point
.-- TR
As noted earlier, any spatial scene can be viewed from a variety of vantage
points. The construal that gives rise to the examining sense is the result of a
shift from the default (i.e. offstage) vantage point. In particular, we argue
that in the scene associated with the examining sense, the vantage point is
that of the TR, and further that the TR's line of vision is directed at the LM.
   How might this construal arise? Consider the following sentence.
between the TR and LM - which match the physical correlates necessary for
examination, over is a likely candidate for developing an examining sense.
    But this is not the entire story. Notice that the use of over in (32) does not
prompt for the interpretation that Phyllis is examining the entrance, only that
she is located such that she could examine it. For the examining sense to
arise, the scene must contextually imply examination. Put another way,
examination must be an implicature deriving from the particular linguistic
prompts in a given sentence. Consider (35).
   The normal reading is that the mechanic examined the train's under-
carriage, but for such examination to occur, the mechanic, the TR, must be
physically underneath the train. In other words, in this conceptualization, the
TR is under the LM. Clearly, in this situation, there is no way of predicting
that over has associated with it an examination reading, given that the TR-
LM spatial configuration does not correspond with that normally associated
                                           r------,
                                           ~   _____   .J
with over, the very configuration that motivated the implicature of examin-
ation in the first place. This is good evidence, therefore, that the contextual
implicature of examination has been instantiated as a distinct sense in the
network via pragmatic strengthening. Hence, examination results from con-
struing a scene in a particular way. This being so, speakers are free to use
this examination-meaning component in the absence of the TR-LM con-
figuration which gave rise to the implicature of examination initially.
Sentences (37) and (38) illustrate what we call the focus-of-attention sense.
Notice that in (37) over can be paraphrased by about.
    In (37) and (38) the LM is the focus of attention. This sense is closely
related to the examining sense from which it derives. In the examining sense,
the vantage point is that of the TR, while the LM is physically below and
proximal to the TR. We further posited that the TR must be construed as
directing attention toward the LM. A natural consequence of the examining
sense is that the object being examined, the LM, is the focus of the TR's
attention. This natural consequence of examining has been privileged and
reanalysed as distinct from the spatial scene in which it originally occurred
(see Fig. 13), and via pragmatic strengthening, conventionalized as a distinct
sense. (Fig. 13 differs minimally from Fig. 12; here the LM is in focus.)
    Once this sense has been instantiated in memory, nonphysical TRs and
LMs can be mediated by this sense.
                                    -
Figure J3. The focus-of-attention sense.
                                                          The case of over     137
Four distinct senses fall under this cluster, as can be seen in Figure 6. Each
arises from construing a TR located physically higher than the LM as being
vertically elevated, or up, relative to the LM. Being up entails a particular
construal of the scene in which upward orientation is assigned to the TR (see
Fig. 14).
j.
    The normal interpretation of over in this context is "more than". The LM,
40 kinds of shells, is interpreted as a kind of standard or measurement. The
TR is not actually mentioned; in interpreting the sentence, we infer that the
TR is shell types 41 and greater. If over were interpreted in terms of the
protoscene in this sentence, we would obtain a semantically anomalous
reading in which the additional shells would be understood as somehow being
physically higher than the 40 kinds actually mentioned in the sentence. Again,
we see no direct way in which this interpretation can be constructed from
the protoscene and the sentential context alone. Moreover, there is no direct
correlation between the concept of more types and vertical elevation. The
concept here is more variety not greater quantity of shells. We argue that
the "more" sense associated with over has arisen because of the indepen-
dently motivated experiential correlation between greater quantity and greater
elevation. Because of this experiential correlation, the implicature of greater
quantity comes to be conventionaIly associated with over (which in terms of
the designation prompted by the protoscene, has a greater height meaning,
and hence also implicates greater quantity).
   The implicature of greater quantity or more comes to be reanalysed as
distinct from the conceptualization of the physical configuration that origi-
naIly gave rise to it (see Fig. 15). Once reanalysis has taken place, the distinct
sense comes to be associated with the form over, in the semantic network.
                                                         The case of over    139
The over-and-above (excess II) sense is closely related to the more sense. It
adds an interpretation of "too much" to the "more" construal. We believe that
a likely origin for this sense is the reanalysis of scenes involving containment,
such as those described in (42) and (43).
(42)     The heavy rains caused the river to flow over its banks.
(43)     Lou kept pouring the cereal into the bowl until it spilled over and
         onto the counter.
    In these scenarios the LMs are containers and the TRs are understood as
entities held by the container. When the level of liquid or cereal (or whatever)
that has been placed in the container is higher than but within reach of the
top of the LM, then the amount constitutes more than the container can hold.
A consequence of the capacity of a container being exceeded is that more of
the TR becomes an excess of the TR, which results in spillage. In sum, more
of the TR, the water, equals a higher level of water. Too much more of the
TR results in a mess (see Fig. 16).
   This node in the semantic network represents a second potential source
for the general notion of excess associated with certain uses of over. We see
subtle but distinguishable differences between the excess I sense, which seems
to us to be more closely tied to motion along a path and the interpretation of
going beyond a designated point, and the excess 11 sense, which seems to be
more closely related to exceeding the capacity of containers and exceeding
what is normal. For instance, in a compound such as overtired, it may be
that the conceptualization involved is not that an expected level of tiredness
is a goal that is missed, but rather, an expected or normal capacity for tiredness
has been exceeded. Consider (44).
(44) The child was overtired and thus had difficulty falling asleep.
    Clearly, this sentence does not mean that the TR, she, is higher than but
within reach of me, the LM. Rather, the conventional interpretation derived
from such an example is that the TR exerts influence, or control over the
LM (as observed earlier). This meaning could not be derived from context,
and is therefore suggestive, given our methodology, that this constitutes a
distinct control sense instantiated in semantic memory. How then did the
control sense derive from the semantic network associated with over? We
suggest that this sense is due to an implicature becoming conventionally
associated with over, from an independently motivated experiential correlation
between control and vertical elevation.
                                                        The case of over    141
   For most of human history, when one person has been in physical control
of another person, control has been experienced as the controller being
physically higher. In physical combat, the victor, or controller, is often the
one who finishes standing, in the up position; the loser finishes on the ground,
physically lower than the controller. Hence an important element of how we
actually experience control (and presumably from where the concept itself is
derived) is that of being physically higher than that which is controlled.
(46) The fight ended with John standing over Mac. his fist raised.
   Further, within the physical domain, the physically bigger, up, often
controls the physically smaller, down. Within the animal kingdom, a wide-
spread signal of the acknowledgment of power or status is for the submissive
animal to adopt a position in which its head is physically lower than the
head of the dominant animal. In experiential terms then, control and vertical
elevation are correlated. We suggest that because of an independently motiv-
ated experiential association between control and being vertically elevated,
there is an implicature of control associated with over.
   Nonetheless, if control were understood only in terms of vertical elevation,
we would expect that the English preposition above should also implicate
control. But as (47) demonstrates, this is not the case.
    To exert control in order to affect the subject's actions and thus guarantee
compliance, one must be physically proximal to the subject. In experiential
terms, there are two elements associated with the concept control; the first is
up, and the second is physical proximity. As we have argued throughout this
article, while the protoscene for over designates a TR being physically higher
and proximal to the LM, there is good evidence for supposing that above
designates that the TR will be physically higher but precludes physical
proximity. In linguistic terms, we would expect over to develop a control
reading. The linguistic usage, then, accords with how we actually experience
(see Fig. 17: the spiral shape denotes that the TR [sphere] controls the LM
[vertical lineD.
    As we have been arguing, distinct senses, once instantiated in semantic
memory, can be employed in situations that did not originally motivate them,
as a consequence of being instantiated as distinct within the semantic network.
142      Andrea Tyler and vyvyan Evans
                                  A
                                  (:~:~-=~~~
                                  (-   - - -- --
                                   ---     ----
                                       -----
Figure 17.   The control sense.
(48)     Cam ilia has authority over purchasing [= the act of deciding what
         will be purchased].
(49)     Personality has more influence over who we marry than physical
         appearance.
    We suggest that the preference sense derives in the following way: being
physically up in experiential terms can implicate greater quantity, which
generally is preferred to a lesser quantity. In another experiential pattern being
                                                                         s
physically up is associated with positive states such as happiness (He feeling
up today), while being physically down is associated with being unhappy
(I'mfeeting down today) (see Lakoff and 10hnson 1980). Given that happiness
is normally preferred to unhappiness, this experiential correlation results in
states associated with positions of vertical elevation being preferred to those
associated with a lower position. Hence, being over implicates a preferred
                                                           The case %ver        143
state (see Fig. 18: the TR, which is higher, is to be preferred to the LM,
which is hence not in focus).
    This implicature of preference is conventionalized, allowing a preference
interpretation (rather than a higher-than reading) in examples (50) and (51).
                                          I   I
                                          I   I
                                          I   I
4.5. Reflexivity
The repetition sense adds an iterative meaning component to the use of over,
a meaning component that could not be predicted from the protoscene alone
(or from any other sense considered so far). In examples (55) and (56), over
can be paraphrased by again or anew.
(55)     After the false start, they started the race over.
         (Cf. After the false start, they started the race again/anew.)
(56)     This keeps happening over and over.
    Many native speakers have informed us that sentences such as (56) prompt
for a conceptualization of a wheel or cycle, which seems to be evoked by the
notion of repetition. We hypothesize that the repetition-meaning component
associated with over may be the result of iterative application of the reflexive
sense (i.e. the 90-degree arc is repeated such that the TR passes through 360
degrees returning to its original starting point).
    Such an analysis is consistent with the intuition that repetition is concep-
tualized as cyclical in nature (Fig. 20). An alternative derivation may be due
to an iterative application of the A-B-C trajectory, such that when the end-
point or completion of the trajectory is reached the process begins again.32
A third possibility may be that the notions of completion and reflexivity are
conceptually integrated forming a conceptual blend (in the sense of Fauconnier
and Turner 1994, 1998, 2002). We remain agnostic about which of these
                                                                      The case %ver   145
                             I
                                 I
                              r---
                             IL ___
                                     ~/
                                          ~
I ,
                                                    ---.,
                                                         '~
                                                    _ _ _ .JI
                                                              \
                                                                  \
                                              I I
                                              I I             I
                                              I I
                                                         ~
routes led to the instantiation of the repetition sense in the semantic network
for over.
5. Conclusion
Previous polysemy accounts of over offer analyses that are too fine-grained.
These accounts fail to distinguish between coding in formal expression and
a level of conceptualization that integrates linguistic prompts in a way
maximally coherent with sentential context and real-world knowledge. The
selection of a linguistic prompt is, we argued, motivated by a principle of
best fit. That is, given that prepositions represent a closed class they cannot
possibly code the infinite array of all conceptual spatial relations. The speaker
selects the preposition which, given the scene being described, is closest to
accurately describing the key spatial relation. Conceptual integration results
from such underspecified cues being used to construct a complex conceptual-
ization, which elaborates the relatively impoverished linguistic input. A
sentence such as The cat jumped over the wall results in a dynamic complex
conceptualization in which the cat moves above and across the wall, not
because this trajectory is coded for linguistically but because this is the most
coherent and reasonable conceptualization, given the particular prompts, and
given what we know about cats and walls.
   In addition, we distinguish between constructed meanings and senses.
The former are constructed on-line in the course of constructing a concep-
tualization of a specific scene prompted by a particular utterance, whereas
senses are instantiated in memory, and can be recruited for the process of
conceptual integration. While complex conceptualizations result from the
process of conceptual integration taking account of motion and hence
146     Andrea 1Ijler and lYvyan Evans
temporal frames, it does not follow that prepositions themselves code dyna-
mism. Accordingly, we maintain the general assumption that prepositions
code atemporal relations.
    Within the polysemy network for over set forth here, the primary sense is
termed the protoscene, and represents a highly idealized abstraction from
our rich recurring experience of spatial scenes. We set forth a set of explicit
criteria for determining the primary sense. Other distinct senses instantiated
in the polysemy network for over result from pragmatic strengthening, i.e.
reanalysis and encoding. We recognize a use as distinct only if its interpre-
tation involves a change in the spatial configuration between the TR and
LM and/or additional non spatial information is involved. The polysemy
network for over contains 14 distinct senses. Other interpretations derive
from conceptual integration constrained by the cognitive principles discussed
in Section 3.
    The results of our study provide a means for distinguishing between
distinct senses and the process of on-line meaning construction, which is
primarily conceptual in nature. Clearly, a recognition of this distinction is
imperative for future research into the nature of semantic networks, and
provides additional insight into (i) the fundamentally non-arbitrary quality
of the mental lexicon; (ii) the highly creative nature of the human conceptual
system; and (iii) the fact that the way we experience renders spatio-physical
interactions meaningful. which in turn gives rise to emergent conceptual
structure.
Acknowledgements
Our thinking has benefited greatly from conversations with a number of colleagues.
We would particularly like to thank Joseph Grady, Elizabeth Lemon and the members
of the Georgetown Metaphor Group. We also thank Mark Aronoff, Steven Cushing,
and two anonymous Language referees for their detailed comments and suggestions.
We are grateful to Angela Evans for her assistance with the diagrams throughout,
and for supplying us with many of our linguistic examples. Andrea Tyler would like
to acknowledge the many insights and persistent questions raised by the students in
her classes on pedagogical grammar and applying cognitive linguistics. Vyvyan Evans
has benefited from detailed discussions with Craig Hamilton, and owes special thanks
to Mark Turner for his insight and encouragement.
                                                                The case of over      147
Notes
*    This article was first published in slightly different form in Language, Volume
     77, Number 4 (2001), 724-765. Reprinted by kind permission.
1.   Johnson's (1987) pioneering work argues that image-schemas are representations
     of recurring aspects of bodily sensory-motor experience, such as verticality, con-
     tainment, and so on, which are stored in long-term memory. Hence, they are not
     "mental pictures", but rather abstractions from rich experience. See also Cienki
     (1998) for an analysis of a single image-schema: straight.
2.   The figure-ground notions were developed by the cognitive linguist Leonard
     Talmy (e.g. 1978), and are derived from gestalt psychology.
3.   Ruhl (1989) has elegantly argued against a polysemy position, championing
     instead a monosemy framework. Monosemy holds that each lexical item is
     associated with a single highly abstract sense. On this view, the sense is so abstract
     that its precise meaning is filled in by context in conjunction with pragmatic
     knowledge. We will demonstrate (Section 4) that some senses cannot be predicted
     by context alone, a strong argument against a monosemy position.
4.   Future empirical analysis might find that speakers make such fine-grained distinc-
     tions, but the evidence to date does not bear this out. Although we cannot
     definitively prove Lakoff's full-specification model is wrong, it does result in
     questionable consequences, both in terms of its linguistic representations and in
     terms of the little experimental evidence that is available.
5.   The variations among just the two attributes of + / - or unspecified extended,
     and + / - or unspecified vertical, result in nine distinct senses. Each time another
     attribute is added to the model, the list of distinct senses multiplies accordingly -
     consider Table 1. The predictions become even more questionable when one
     considers that five of the nine senses involve attributes being unspecified.
          Analogous arguments can be made for specification of the exact, metric
     relationship between the TR and LM in terms of the presence or absence of
     contact, as Kreitzer (1997) underscores with the example Sam went over the wall,
     in which the precise manner of passing over the wall, either jumping or crawling,
     is unspecified, therefore the presence or absence of contact is unspecified.
+ Extended                    S                    S                   U
- Extended                    S                    S                   U
Unspecified                   U                    U                   U
S = specified; U = unspecified.
148     Andrea Tyler and \ryvyan Evans
6. In order to motivate the distinction between over] and over2, Kreitzer appeals to
    Langacker's notion of summary scanning (Langacker 1987, 199Ia). Langacker
    posits that summary scanning provides a means of integrating points occupied
    by a TR along a path into a construal of motion along a path. The path is reified
    at the conceptual level, even though it never actually exists in the world. Kreitzer
    argues that the dynamic over2 describes a relation between a TR and a LM in
    which it is the path that is the TR.
7. The term ideal meaning is from Herskovits 1986: Chapter 4.
8. It is important to note that some central (= most basic, to be explicated) senses
    associated with prepositions will crucially involve a coordinate system along the
    vertical or horizontal axes, while others will not. We will argue that the primary
    sense associated with over does involve such a system in which the spatial relation
    of the TR being located higher than the LM is essential. But this should not be
    interpreted as a claim that all prepositions prompt for such a system. While the
    English prepositions over and under regularly code respectively for the TR being
    in a higher-than or lower-than position relative to the LM, the preposition out
    appears to be insensitive to this dimension. Thus, we find sentences like The
    rain poured out of the sky (in which the TR is lower than the LM) and The water
    bubbled out of the hot springs (in which the TR is higher than the LM) which do
    not affect the basic interpretation associated with out. Whether a particular
    preposition is sensitive to the horizontal or vertical dimensions is part of its basic
    lexical entry.
9. Although there has been disagreement about the appropriate representation of
    the primary sense associated with over, all published analyses accept these two
    basic assumptions. Synchronically, evidence that the basic spatial configuration
    prompted for by over is something like a TR in a higher-than position relative to
    the LM comes from sentences with clearly contrasting interpretations: Nicole
    decided to walk over the bridge versus Nicole decided to walk under the bridge.
    Having argued that the primary sense for over involves a spatial configuration in
    which the TR is higher than the LM, we readily acknowledge that in many
    instances this spatial configuration is not prompted for by over. Our analysis
    attempts to model how these non-canonical spatial configurations have come to
    be associated with the form over.
10. It should be noted that our diagrammatic representations of protoscenes are made
    for ease of explication. They should not be interpreted as making any serious
    claim about the neurological nature of imagistic representation.
11. An image-schema, as Mandler uses the term, constitutes a representation distinct
    from purely perceptual information. As such, it constitutes a rudimentary "theory"
    as to the nature of a particular object or relation between objects. The image-
    schema relating to containment, for instance, is a concept as opposed to a
    percei ved entity, insofar as it constitutes a means of understanding the functional
    aspects of a particular spatial configuration.
12. This is akin to what Iackendoff (1983: 29) refers to as the projected world, and
                                                                The case of over       149
     is constructed at what Fauconnier (1997: 36) tenns the cognitive level or level C.
13. In terms of specifics our claim is as follows: a particular spatial scene is a rich
     real-world scenario. mediating two objects (TR and LM) via a conceptual spatial
     relation. Recurring spatial scenes perceived as resembling each other are stored
     as an abstract protoscene. The aspect of the protoscene coded by a preposition is
     the spatial relation mediating the TR and LM. and not the whole protoscene.
     From this. it follows that a preposition presupposes a TR and a LM (as the
     conceptual spatial relation holds by virtue of mediating a relation between a TR
     and a LM). In minimal terms. a preposition prompts for a TR and LM. which
     are typically supplied linguistically. e.g. The picture [TR] is over the mantel
     [LM].
14. The reanalysis of an aspect of a particular complex conceptualization results in
    privileging a different aspect or perspective on the complex conceptualization.
     Yet. because the pertinent complex conceptualization is first prompted for by the
    use of over. as in Figure 5. the derived sense is coded by the same linguistic
    form. namely over.
IS. eruse (1986) discusses this in terms of modulation of a lexical item. For instance.
    various parts of the car are highlighted in the following sentences: The car needs
    to be washed (where car is interpreted as the exterior body of the car) versus The
    car needs to be serviced (where car is interpreted as the engine) versus The car
    needs vacuuming (where car is interpreted as the interior). This constitutes
    modulation or highlighting different parts and backgrounding others.
16. Langacker ([ 992) discusses the atemporal nature of prepositions in terms of the
    relationships they profile. "With before and after. time functions as the domain
    in which the profiled relationship is manifested. Its role is consequently analogous
    to that of space in the basic sense of in. on or near. A verb. on the other hand, is
    said to be temporal in a very different way ... the profiled relationship is conceived
    as evolving through time and is scanned sequential\y along this temporal axis. It
    is by incorporating this further level of conceptual organization that precede and
    follow differ from the prepositions before and after ... [Verbs] specifical\y track
    [a process] through time ... A preposition can thus be characterized as profiling
    an atemporal relation that incorporates a salient landmark" ([ 992: 292).
17. Unless the world being discussed is explicitly designated as science fiction.
18. In sentence ([ 9) the lack of motion is the result of integrating what is coded by
    the verb extended with our knowledge of trees. In particular. the interpretation of
    lack of motion depicted by (19) is the result of the interpretation of extended as
    it relates to a tree branch. We understand trees to be slow-growing plants such that
    humans do not perceive the growth of a branch as involving motion. Thus. we
    interpret extended to depict a state. Notice that the stative interpretation of extended
    is contingent upon the precise sentential context in which it occurs. Extended
    can also be interpreted to convey motion as in He extended his arm towards the
    door. Since there is no sense of motion prompted for by the verb in the sentential
    context provided in (19). no path or trajectory is projected for the TR.
150     An.drea Tyler and \ryvyan Evans
19. We hasten to acknowledge that there are contexts in which two prepositions
    appear to be interchangeable and virtually synonymous: Susan hung the picture
    over the mantel versus Susan hung the picture above the mantel. We hypothesize
    that such substitutability arises because the semantic networks associated with
    each preposition represent continuums and at certain points the interpretations of
    two continuums can overlap. In addition, for over and above we find a close
    diachronic relationship, with over initially being used as the comparative form
    of above. The diachronic link may surface in these overlapping uses.
20. In terms of synchronic polysemy networks, the empirical work by Sandra, Rice,
    and their colleagues suggests that it may not be the case that a particular lexical
    form has a single primary sense from which language users perceive all other
    senses being derived. Their empirical work raises questions about the view that
    we can define polysemy as a strictly synchronic phenomenon in which speakers
    are consciously aware of a relationship holding between distinct senses of a
    particular lexical form. This is an empirical question for which we do not yet
    have sufficient evidence to determine the answer. If extensive experimental
    evidence shows that language users systematically and consistently fail to perceive
    some senses as being related, then we must question whether what we term
    polysemy constitutes a phenomenon that is wholly synchronic in nature. While
    we believe all the senses in a particular semantic network are diachronically
    (and perhaps developmentally) related, in terms of the adult lexicon, there may
    be differences in the perceived relatedness between distinct sets of senses, due to
    routinization and entrenchment, obscuring the original motivation for the deriv-
    ation of senses from pre-existing senses such as the protoscenes for language
    users (see in particular Rice, Sandra and Vanrespaille 1999).
21. In formal terms, the particle in a verb-particle construction (VPC) is a more
    grammaticized preposition in that the LM is linguistically covert, that is, it is contex-
    tually understood without being linguistically coded (Lindner 1981; O'Dowd
    1998). Such particles form part of a verb-particle construction with a verbal element,
    and each unit (the particle and the verb) contributes to the meaning of the whole
    unit (see Goldberg 1995 for a construction grammar approach, Morgan 1997 for
    a study of verb-particle constructions). We introduce the term adprep to describe
    a spatial particle which has adverbial meaning, that is, certain usages of the form
    over are adverbial in nature, describing an aspect of a conceptual process, as in
    The movie is over (= finished). Each formal component - preposition, particle (in
    a VPC), particle prefix, or adprep - contributes different kinds of meaning.
22. Recall that we are using the term "sense" for distinct meanings instantiated in
    memory (i.e. in the semantic network associated with each preposition).
23. It is worth pointing out that sentences such as (21 )-(25) offer strong evidence
    against a monosemy theory of word meaning. Monosemy (see Ruhl 1989), as
    noted previously, posits that all interpretations of a linguistic form, such as a
    preposition, are contextually derivable from a highly abstract primary sense.
    However, as can be seen from the on-the-other-side-of sense, neither of the
                                                              The case of over      151
    original aspects of the spatial configuration hold - the TR is not above the LM
    and the TR is not proximal to the LM. The nature of a primary sense that would
    derive both these senses simply from contextual cues would need to be extremely
    abstract. We cannot see how a representation so abstract would also be constrained
    enough to distinguish among many other English prepositions.
24. Lakoff (1987: 422-423) represents sentences such as Sam lives over the hill as
    an example of schema I.VX.C.E. (above-across, with a vertical, extended LM,
    contact between the TR and LM, and endpoint focus).
25. There is arguably a distinct sense which is derived from the on-the-other-side-of
    sense. In examples such as
   (60)     ?The small handkerchief was spread out over the tabLe.
   (61)     The small handkerchief was spread out on the tabLe.
        However, there are also many real-world scenarios in which the TR is actually
    smaller than the LM but because of the construer's vantage point (the TR inter-
    venes between the viewer and the LM), the TR appears larger than the LM. For
    instance, in The dark clouds moved over the sun, the clouds are not physically
    larger than the sun, but they appear larger to the earthbound viewer.
30. Lakoff (1987: 429) accounted for cases of the covering reading in which the TR
    is not higher than the LM by positing a rotation transformation. The covering
    schemas all have variants in which the TR need not be above (that is, higher
    than) the LM. In all cases, however, there must be an understood viewpoint from
    which the TR is blocking accessibility of vision to at least some part of the LM.
    We will refer to these as rotated (RO) schemas, though with no suggestion that
    there is actual mental rotation degree-by-degree involved. This is an extremely
    powerful transformation, potentially affecting all prepositions whose primary
    sense involves either a vertical or horizontal orientation. In a number of instances,
    the protoscenes for over, wider, before, and after would be essentially indisting-
    uishable. And this analysis offers no explanation for why TR-LM configurations
    that do not match the protoscene would develop this reading.
        A common consequence of the LM being covered by the TR is that the LM
    is occluded from the construer's view. Typically the scene described in (31) is
    that the tablecloth occludes the tabletop from the observer. As we see in examples
    such as the following, occlusion is not an inevitable consequence of covering.
   (66)    Hey! Why are you bringing in so many cases of motor oil? There must be
           a dozen cases here. That's well over the two cases I ordered.
    is that the model predicts that a particular sense may arise from more than one source.
         In forms such as overachieve, overkill, overdo, and overdress we do not see
    a clear basis for arguing for the superiority of the above-and-beyond interpretation
    versus the over-and-above interpretation. As noted earlier, we do not consider
    this a flaw in our model; rather we see it as testimony to the richness and
    complexity of conceptualization. We also hypothesize that native speakers are
    likely to vary in their intuitions about these cases.
32. Lindstromberg (1997) offers a very similar explanation.
References
Aronoff, Mark
    1994        Morphology By Itself: Stems and Inflectional Classes. Cambridge,
                MA: MIT Press.
Bloom, Paul, Mary A. Peterson, Lynn Nadel and Merrill F. Garrett (eds.)
    1996        Language and Space. Cambridge, MA: MIT Press.
Bloomfield, Leonard
    1933        Language. New York: Holt, Rinehart and Winston.
Bolinger, Dwight
    1971        The Phrasal Verb in English. Cambridge, MA: Harvard University
                Press.
Brisard, Frank
    1997        The English tense-system as an epistemic category: the case of
                futurity. In: Marjolijn Verspoor, Kee Dong Lee and Eve Sweetser
                (eds.), Lexical and Syntactical Constructions and the Construction
                of Meaning, 271-286. Amsterdam: John Benjamins.
Brugman, Claudia
    1981 [1988] The story of over. M.A. thesis. Linguistics Dept. University of
                California. Berkeley. Published [1988] as The Story of Over:
                                                           The case of over     155
O'Dowd, Elizabeth
    1998        Prepositions and Particles in English: A Discourse-Functional
                Account. New York: Oxford University Press.
Pustejovsky, James
    1998        The Generative Lexicon. Cambridge, MA: MIT Press.
Rice, Sally
    1993        Far afield in lexical fields: the English prepositions. In: Michael
                Bemstein (ed.), ESCOL '92: 206-217.
Rice, Sally, Dominiek Sandra and Mia Vanrespaille
    1999        Prepositional semantics and the fragile link between space and time.
                In: Masako Hiraga, Christopher Sin ha, and Sherman Wilcox (eds.),
                CuLtural, PsychoLogical and TypoLogicaL Issues in Cognitive Ling-
                uistics, 107-127. Amsterdam: John Benjamins.
Rosch, Eleanor
    1975        Cognitive representations of semantic categories. Journal of Experi-
                mental Psychology: General 104: 192-233.
Ruhl, Charles
   1989         On Monosemy: a Study in Linguistic Semantics. Albany: State
                University of New York Press.
Sand ra, Dominiek
   1998         What linguists can and can't tell you about the human mind: a reply
                to Croft. Cognitive Linguistics 9(4): 361-378.
Sandra, Dominiek and Sally Rice
   1995         Network analyses of prepositional meaning: mirroring whose mind -
                the linguist's or the language user's? Cognitive Linguistics 6(1):
                89-130.
Sperber, Daniel and Deirdre Wilson
   1986         Relevance: Communication and Cognition. Cambridge, MA: Harvard
                University Press.
Svorou, Soteria
   1993         The Grammar of Space. Amsterdam: John Benjamins.
Sweetser, Eve
   1990         From Etymology to Pragmatics: Metaphorical and CulturaL Aspects
               of Semantic Structure. Cambridge, England: Cambridge University
               Press.
Talmy, Leonard
   1978        Figure and ground in complex sentences. In: Joseph H. Greenberg
               (ed.), Universals in Human Language, Volume 4, 625~49. Stanford,
               CA: Stanford University Press.
   1983        How language structures space. In: Herbert Pick and L. Acredelo
               (eds.), SpatiaL Orientation: Theory, Research and Application, 225-
               320. New York: Plenum.
                                                           The case of over     159
Jarno Raukko
1. Introduction
are two verbs, one from English (get, meaning e.g. 'obtain', 'receive', 'arrive',
'take', 'become', 'make', 'have', 'understand', 'answer', 'must' and 'be
allowed to') and the other from Finnish (pitiiii, meaning e.g. 'like', 'must',
'hold', 'keep', 'consider', 'organize', 'wear' and 'not leak'). In most cases
these verbs are not translational counterparts; the reason for selecting them
lies mainly in the shared complex type of polysemy. The section on get
concentrates on the results from an experiment called the production test
and uses some secondary findings from the results of a so-called difference
evaluation test (also known as the similarity rating test). The section on
pitiiii focuses on the results of a third type of test, the sorting test.
A new dynamic view of meaning gained ground by the end of the 19th century
(Nerlich 1992: 100). Instead of a one-to-one relationship between word and
idea, meaning was already seen as flexible, elastic, adaptable and open to
change; "imperfect", so to speak. Such notions were at least presented in
Germany (Erdmann 1910; Paul), France (BreaI1868; Nyrop 1913; Paulhan),
England (Gardiner) and the United States (Whitney 1867).
    It could be said that mainstream semantics in the 20th century lost track
of the notion of flexible meaning as the driving force of semantics in general
and polysemy in particular. Structuralist semantics paid more attention to
the difference between homonymy and polysemy2 than to the Ubiquity of
polysemy. Indeed, many semanticists, including so-called cognitive semanti-
cists (e.g. Geeraerts 1993; Tuggy 1993), have paid much attention to another
tripartite division, namely ambiguity, polysemy and vagueness. Still, much
of that literature rests on a notion of vagueness as a fairly static property of
lexical meaning.
    One recent addition is offered by Zhang (1998), who draws attention to
the concept ofjuzziness in this four-partite framework:jitzziness, vagueness,
generality and ambiguity. (For Zhang, polysemy is closest to vagueness.)
Her framework is Gricean, and her view of polysemy does not seem to allow
for ambiguities that Bre left unresolved by the context. In her view only
fuzziness is not "resolvable with resort to context" and "is closely involved
with language users' judgments" (Zhang 1998: 13), After discussing syntactic
                           Experiments with English get and Finnish pitaa     163
and semantic tests for fuzziness, Zhang introduces some pragmatic tests.
Whereas for Zhang (i) bank is ambiguous irrespective of people's judgements;
(ii) person is general in that it does not specify sex, height or nationality;
(iii) good is vague as to whether it refers to the goodness of a student, food
or legs; (iv) beauty, by contrast, is fuzzy owing to the fuzziness of the concept
it denotes, to the fuzzy nature of language users' perceptions of its referential
boundary (1998: 27). Zhang calls such factors psychological, but does not
explicate if her understanding of "semantic cognition" would also involve a
more socially based procedure of "wanting to believe that interlocutors agree
on meanings" in place of truth conditions and Grice's maxims.
(i)     Nonfixedness. We cannot claim that language has a lexicon with ready-
        made meanings and polysemous categories for words; nor does a
        person's cognition contain a mental lexicon in the sense of a store-
        house of ready-made meanings and categories.
(ii)    Fuzziness, unclarity, non-pinpointability. We cannot aim at preciseness
        in communication nor in the analysis of meaning.
(iii)   Flexibility. Language is constituted by social conventions. These social
        conventions allow for change and flexibility, and some disagreement
        between speakers. Still, it is not a dead end to try to describe seman-
        tics; we just have to allow for loose ends, and for the possibility of
        change and flexibility.
(iv)    Ambivalence. Polysemous situations are not about two competing
        meanings at a time but (in principle) about an indefinitely large
        "number" of meanings. Whereas ambiguity is usually tied to intention,
        ambivalence is the idea that even speakers themselves do not (always)
        know what they intend to communicate (Ostman 1988, 2000).
(v)     Intersubjective differences (this can also be about variation). When
        we describe semantics, it is wiser to be descriptive and open-ended
        than prescriptive and definitive. Of course the social conventions
        themselves are (governed by) norms, but it is a different thing for
        linguists to decide what is the correct meaning - or set of meanings -
        ofa word.
(vi)    Polysemy (in "static semantics" this is about variation). The default is
        that words do not have just one fixed meaning. Therefore many
        practical things that we do, such as provide glosses for foreign lang-
        uage items, or make lists of contrastive vocabularies in different
        languages, are very problematic endeavours.
   These six shades of variability cover and partly explain the meaning and
place of polysemy and flexible meaning in semantics. The four Subsections
2.3, 2.4, 3.1 and 3.2 will show how I have operationalized variability and in
particular flexibility in my overall methodology and in my experiments.
potentials of that polysemy?", and take into account the properties of seman-
tics that I have spelt out above, we have to choose some methodology, and
choose some concrete methods.
    One method that has become something of a default in linguistics is the
corpus method. The linguist looks at a large and somewhat pre-processed
selection of text material and tries to find the relevant instances (instantiations,
specimens) of the item that slhe wants to study. However, problems arise,
because most semantic questions would require some knowledge about the
meaning-maker, the human beings producing and receiving and under-
standing the text and all its bits. When using a corpus, one is basically by
oneself when trying to determine what the bits mean. Of course one can use
one's linguistic knowledge - as a native or fluent speaker and as a linguistic
expert - in order to determine the meanings. However, it is not necessarily a
very interesting, reliable, nor "scientific" way to deal with semantic questions.
One reason is the ever-presence of semantic variability and flexibility. If
meanings are not out there as fixed entities, but instead emerge in language
use situations, we need to know more about the context and the participants
of the communicative situation in order to say what is being meant. Also, if
meanings are fuzzy, it is even more difficult to determine the meaning if we
are not ourselves part of the context of the communicative situation. More-
over, if people can have different opinions on what something means, why
would it be enough for a linguist just to look at a corpus and say what he or
she finds there?
    Another way might be to turn to a different scientific ideal, something
closer to the natural sciences, for instance cognitive neuroscience. So, for
instance, one could forget about questions that semanticians, pragmaticians
and discourse analysts are interested in, and concentrate on things that you
can see in the curves and measures that show features in the electrical activity
of the brain. Or feed a computer large masses of text and see how it categ-
orizes words and texts according to distributions. The problem here is that
we are quite far from linguistically interesting questions and methods. And
unlike natural scientists, semanticists should not even dream of finding
definitive or objective answers.
    Then there is a middle way: do something about the tacit assumptions
and hence the procedure of scientific discovery and linguistic analysis. This
is the road I am taking. I have chosen to try out what certain kinds of
experimental methods, experiments, experimentations can do for us.
    One field of linguistics where it has been customary to use experiments
166    larno Raukko
is psycho linguistics. Although two out of three test types that I am using
have been employed by semantically-oriented psycholinguists, my experi-
mental methodology as a whole is not exactly of psycholinguistic nature. I
am trying out experiments that fall between pure armchair speculation and
hard-core experimental methods with an ideal of objectivity. Some of these
experiments leave room for the creativity of the informant who is participating
in the experiment (and hence bear similarities with elicitation and inter-
viewing). Although these experiments use artificial settings, an open-minded
and non-objectivist semantician can use the experiments to make discoveries
about semantic flexibility and variability, as there is openness in the questions
and room for the informant's personal effort.
   Section 3 will introduce the three types of experiments that I have been
using in my research into polysemy. In Section 3.2 I will show how the six
shades of variability mentioned in Section 2.2 are operationalized in the
experiments.
For me polysemy, flexibility and variability are broad cover tenns, and I do
not even mind using them as covering one another. Polysemy is flexible
meaning, and it covers referential polysemy, regular polysemy, lexical poly-
semy (see Deane 1988), vagueness, generality and fuzziness. I prefer to
investigate all variants of "one form - not just one fixed meaning" with the
same tool pack, rather than concentrate on a conceptual typology of these
variants.
   Cognitive linguists such as Langacker (1988a) use a specific network
model to describe polysemy with nodes that are joined by semantically
plausible links (see Seto, this volume). I have tried to develop a more flexible,
loosely structured, and "contiguous" version ofthe model (Raukko 1994,1997)
where the nodes are only used as methodological tools and, in principle, a
polysemous semantic range is (ontologically) more like a flexible "mass"
than a "network".
   Flexible meaning is not only about graded membership of categories,
fuzzy boundaries of categories and continuum effects - characteristics of (at
least ideal) prototype theory. It is also about the nonspecificity of meaning
and meaning type category membership. In Section 4.2 we will see how
within a given range of the polysemy of get some instances can be more
                           Experiments with English get and Finnish pitlili    167
reliably allotted to a category than others. Some instances are vague or fuzzy
enough to simultaneously match two different categories, but we will find at
least two versions of this phenomenon. In some cases one can interpret the
same meaning in two methodological ways, and therefore the semantic
instance falls into a double membership of these (contradictory) categories.
In other cases one does not need to decide the proper category, although
related examples would fall into distinctive (neighbouring but still contra-
dictory) categories.
    The difference evaluation test is also called the similarity rating test, and
 my version follows the traditional setup (Lehrer 1974; Caramazza and Grober
 1976; Colombo and Flores d'Arcais 1984; Gibbs et al. 1994; Sandra and
 Rice 1995). The idea is to go through pairs of sentences which each contain
two uses of get and evaluate the semantic distance between the two uses in
each case. The function of this test is more confirmative in nature, and I
consider its reliability far lower than that of the production test. The reason
is that multiple choice tests require far less commitment, creative effort and
explicit signs of successful understanding of the instructions from the infor-
mant than tests where one has to write down one's own sentences, comment
on them and paraphrase them. Another reason is that when the informant
faces pairs of two stimulus sentences one pair at a time, it is much easier not
to take the question of semantic distance seriously than in the case of the
sorting test (introduced next below), where one needs to look at the whole of
50 stimuli from the outset and work much harder on them. It is no surprise
that the sorting test requires much more time and expertise from the informant
than the difference evaluation test. However, Section 4 will make two short
references to the results of the get difference evaluation test. The locus of
the experiment was the same as in the production test, i.e. USA 1994, and I
had 79 informants (see Appendix 2).
    The sorting test also follows a tradition, although I have devised my own
elaborate version for my research on the Finnish verb pitiiii. The idea of
sorting stimuli is familiar from psychology and psycholinguistics, and for
polysemy I have seen it used by at least Colombo and Flores d' Arcais (1984)
as well as Sandra and Rice (1995). I have performed the production test and
the difference evaluation test for pitiiii (in Helsinki, during 1995-1999) as
well, but the main emphasis in Section 5 will be on the results of the sorting
test (in Helsinki, during 1998). The design and usability of the sorting test
draws heavily on the production test, starting from the fact that the stimulus
sentences originate in the production test responses. As the two tests address
the same problem of "how to categorize meanings of pitiiii" from two different
perspectives, the analyses of the two types of experiments are necessarily
interlinked. The pitiiii sorting experiment was conducted in October 1998 in
Helsinki with 21 informants who were university students, either majoring
in general linguistics or taking an introductory course in general linguistics.
The English translation of the instructions part from the original Finnish
questionnaire can be found in Appendix 3.
                           Experiments with English get and Finnish pitiiii   169
Let us see how the experiments, or tests, help us grasp the six shades of
variability (listed in Section 2.2), or help us operationalize the research
questions pertaining to flexible meaning. The following is an illustrative list
of examples, not an exhaustive analysis.
neatly. Categories can be seen as continuums, but they also overlap and yield
insoluble categorization situations. And as pointed out in Section 2.4, flexible
meaning is not only about borderlines of categorization: in many cases the
semantics itself of the instances is so fuzzy that it is difficult to interpret in
the context of the polysemy of a word. We shall view these "problems" as
supporting the theory of flexible meaning instead of leaving us with an
unsatisfactory outcome.
    The polysemy of get forms an exceptionally abundant and complex net-
work of meanings and ranges of flexible meaning. This abundant poly se my
of get has been discussed by e.g. Kimball (1973), Niedzielski (1976) and
Lindstromberg (1991), whose main goal has been to provide a neat reduction-
istic representation (or even matrix).4 Lindstromberg points out (1991: 289),
however, that while his analysis mainly aims at simplicity (i.e. the polysemy
of get can be explained via extensions from the meanings 'obtain' and 'seize' /
'take hold of'), his approach is also more prototype-based and cognitive-
semantic in nature than that of Kimball and Niedzielski. I find Lindstromberg's
analysis of fairly little use, because my aim is not explanatory simplicity but
descriptive "relevance": I aim at concentrating on "central" and "crucial"
features in the polysemy of get partly from the perspective of a speaker who
has no training in linguistics (i.e. a typical informant of mine). Needless to
say, I am looking at colloquial and nonstandard (American) English, and
because my informants are mostly American teenagers, I do not make any
claims with respect to regional and social coverage and generalizability (see
Raukko (999).5
    To get a first taste of some delicious details in the polysemy of get, here
are some introductory examples that in themselves are flexible in meaning.
(0)    a.   I didn't get that. What did you say? (Lindstromberg 1991: 289)
       b.   Part of being a nation is getting the history wrong. 6
   The meaning types behind these examples are the top salient meaning
types in the polysemy of get. Some others, comparable to these but which
are not part of the salient higher-level groups (a) through (d) are: (e) UNDER-
STANDING; (f) STABLE POSSESSION; (g) OBLIGATION; and (h) ABILITY (or 'pennission').
Examples of these are given in (5) to (8) respectively.
                           Experiments with English get and Finnish pitiiii   173
Table J. Types of RECEIVING and OBTAINING, in the core of CHANGE OF POSSESSION. All
           example sentences are taken from the production test responses.
    The actual responses of the informants do not form such a neat picture as
Table 1 does. In particular, there were several informants who not only made
differentiations among these five types, but also differentiations within one
or more of these types. The clearest trend of all is that within METAPHORICAL
RECEIVING there are three sUbtypes which are sometimes distinguished from
one another rather systematically, namely 'receiving a grade', 'catching an
illness' and 'other types'. The strong representation of 'receiving a grade' is
of course explained by the composition of the informant population. Apart
from METAPHORICAL RECEIVING, we find type-internal differentiations in many
responses, but no clear patterns among informants.
    The problem is more general: how do we find organization in the rather
chaotic-looking, varied outcome of the responses? When addressing the prob-
lem of motivating the distinction between OBTAINING and RECEIVING, it is safe
to base the hypothesis on the concrete basic-level types, CONCRETE OBTAINING
FOR ONESELF and CONCRETE RECEIVING, which are often the source of vague
interpretations.
    In the data, as many as 199 informants (61 % of the total population)
                          Experiments with English get and Finnish pitiil:i   175
classify, some of them because they lack a paraphrase, others because the
object of getting, the paraphrase and the encyclopaedic knowledge about
the things talked about are necessarily unclear with respect to the com-
ponent of agentivity. Consider examples (11)-(13) (and the corresponding
paraphrases) which were the only instances of any type of (CONCRETE) CHANGE
OF POSSESSION that the particular informant (Nos. 124, 171 and 276 respec-
tively) produced.
(11)      Let'sgetthis.
( 1I ')   Let's acquire this.
(12)      Igetacoke.
(12')     I picked up [sic: past tense] a coke.
(13)      I need to get a new car.
( 13 ')   I need to acquire a new car.
we the analysts may be led to search only for one minimal difference in each
pair of sentences that the informants have produced.
   The other test type, difference evaluation, gives us some additional evi-
dence for the fact that the distinction is a difficult one. The pair in (14),
which contained CONCReTE OBTAINING FOR ONESELF and CONCRETE RECEIVING,
was not only judged as relatively different in meaning (the tenth highest
"different" rating among the tested 31 pairs, with average rating of 2.13 on
a scale from 0 to 4), but the judgements also deviated quite remarkably: the
standard deviation for this pair was the fifth highest among the tested pairs. 10
her have) the letter", but we usually tend to think that if one of them had
more to do with 'motion' and hence CARRYING, it would be (15), maybe
because of the partly locative preposition to.
   Some of my informants produced examples and paraphrases that portray
a possible difference between CONCRETE OBTAINING FOR ONESELF and CONCRETE
CARRYING; the following come from my get informants Nos. 6 and 77. (17a)
and (18a), which are incidentally both paraphrased with take, are the ones
that I would like to classify as CARRYING.
    The average and standard deviation figures are almost identical to the
ones for the evaluated pair OBTAINING vs. RECEIVING discussed in Section 4.1.
Thus, informants again differ greatly in their opinion about the similarity or
difference of the meanings of get in (l9a) and (l9b), which supports the
analysis of OBTAINING and CARRYING as being idealized abstractions forming
the ends of a continuum of flexible meaning where 'change of possession'
                           Experiments with English get and Finnish pitaa   179
and 'motion' are present to varying degrees. Even the component 'change of
state' is sometimes present as well, but in cases like getting something into
shape we are already dealing with METAPHORICAL CARRYING instead of the
concrete one.
    There are also situations where the informant has produced three related
instances, and while two represent OBTAINING and CARRYING, the third one can
be classified as CATCHING ETC. - e.g. Get the ball, or I will get you.
    Yet there are many cases where the informant has produced only one
example relating to our present discussion, and it is relatively impossible to
try and firmly classify the meaning type as either OBTAINING, CARRYING or
CATCHING ETC.
There are three basically distinct meaning types of get that all take an
infinitival complement: ABILITY, REACHING A STAGE and OBLIGATION. The first is
a technical term for meanings like 'permission' and 'ability', which were not
usually distinguished by informants and therefore do not occur as separate
meaning types in the analysis. REACHING A STAGE belongs to the macro-type
CHANGE OF STATE, and hence its meaning is more dynamic than that of the two
modals (OBLIGATION and ABILITY). Besides having obvious differences as to
aspects of modality and semantics, the three types favour different verb forms
of the main verb get. While OBLIGATION is supposed to be bound to the form
got (which appears as have got to, has got to, 've got to, S got to, got to,
gotta, gots to or don't got to), it is prototypical for ABILITY to occur in the
simple present tense (or future), while REACHING A STAGE is the most flexible
180     lama Raukko
go to school today, and although there is some confusion with the tense
choice, "going to school" would at least be a likely candidate for a complement
of OBLIGATION. Moreover, although an example like Why do I always get to
do the dishes? did not occur in the production test data, it could be seen as
marking the path between REACHING A STAGE and OBLIGATION. I claim that these
examples suggest variability for OBLIGATION, although the standard view limits
this meaning type to the form got.
    I hope to have shown that despite the significant semantic differences (on
the surface) between the three meaning types with an infinitival complement,
there is a continuum from ABILITY via REACHING A STAGE to OBLIGATION. Once
we have a continuum, it will be difficult to neatly classify instances of get
into meaning types. At least the distinction between ABILITY and REACHING A
STAGE is hard to establish solidly, even if we have prototypical instances of
both and see informants distinguishing the two meaning types. We have seen
again that the relationship between these two meaning types is best understood
in terms of flexible meaning, where there is no strict division into two isolated
meaning types.
portrays a concise list of the most crucial meaning types in the polysemy of
pitaa. By "crucial" I refer to such salience factors as frequency and earliness
of occurrence in the production test, while the indicators for distinguishing
meaning types now mainly derive from the results of the sorting test. 12 The
list starts with LIKING (in [25]) and OBLIGATION (in [26]) (one of the meanings
shared by get and pitaa) and ends with six types (30)-(35) that belong to the
problematic area of very flexible meaning.
    Sentence (36) was grouped with (34) by as many as 17 informants (81 %),
but with (31) only by nine informants (43%); the difference is statistically
significant. In contrast, there were ten stimuli (out of 50) that were sometimes
sorted with (31) but never with (34), and conversely, only two that were
sometimes sorted with (34) but never with (31). I think all this shows that
(31) is a much more difficult stimulus sentence than (34), even if on the
surface (31) would seem less idiomatic and more "basic" in meaning than
(34). Sentence (36), for its part, was surprisingly sorted together with (33)
by as few as two informants, although "having fun" and "having a party"
could have been easily thought of as attracting similarity judgements. One
possible reason is that there were two other stimuli that were thought of as
matching (33) so well that there was no demand for other group members.
    Let us look at one more example, namely (37). The interesting finding is
that (37) was sorted together with each of (38) through (40) by eight to ten
informants, but (38)-(40) in turn were sorted with one another by only two
to five informants. The hypothesis is that (37) seems to be a combining link
between the other three instances (38), (39) and (40), while these three are
not so easily linked without the presence of (37).
    Thus (37) can be seen as a flexible channel or path between the other
three. Moreover, (39) was sorted together with (30) by half of the infor-
mants - an expected result considering the seeming similarity between the
two sentences - but (40) was grouped with (31) by only one informant,
despite the seeming similarity between the two instances.
    We are left with a very unpredictable and intricate network of inter-
relations within the area of flexible meaning which cannot be reduced to
established categories.
6. Conclusions
                                              April _, 1994
                                              female/male
                                              Senior/Junior/SophomorelFreshman
                                              age_
186     lamo Raukko
Dear student of Putnam City North High School [Oklahoma City]. My name is
larno Raukko; I am a researcher of linguistics at the University of Helsinki, Finland,
and I am studying some aspects of everyday English. I very much appreciate your
participation in this test.
(I) The verb get (get, getting. got. gotten) is very common in spoken American
   English. and it seems to be used in very many different ways. You might say that
   get has different meanings. Could you please write down examples of the ways
   in which you use the word get differently. Your examples can be full sentences,
   but they do not need to be. You do not have to fill in every blank from A to K;
   just produce as many examples as you can think of where the use of get still
   differs from every other use you have mentioned.
Now please write down examples with get. Feel free to use casual and colloquial
style! These examples will not be corrected!
   A. ____________________________________________
   B.
   C.
   D. __________________________________________
   E.
   F.
   G. __________________________________________
   H. __________________________________________
   I.
   l.
   K. ________________________________________________
Please turn over when you have finished answering question No. 1.
(2) In your examples, would you say that one or some of the uses of get is more
    typical or central or important than the others? Is one of them the most central
    one? If you think so, please write down the letter(s) of the example(s). Also
    explain briefly why you think so.
(3) Do you think that the uses of get in your examples are somehow connected or
                             Experiments with English get and Finnish pitiiii      187
   linked to one another? Is there a common idea? If you think so, please explain or
   specify briefly.
(4) Finally, would you please once again read each of your examples for question
    No. I and try to express the same idea in another way. In other words, for each
    example, write down an alternative way of saying the same thing so that you
    don't use get any longer.
       [If I had to do this for "She followed the man along the river bank" and I had
   to avoid using the word bank again, I could write "She followed the man along
   the side of the river".]
April_, 1994
Now I would like you to estimate the meaning difference between the uses of get in
the following pairs of sentences:
                                                   MD
       I got home late.
                                                 01234
       I got a new car.
[etc.)
                                                           initials _ _ _ _ __
                                                           year of birth _ _ __
                                                           female/male
                                                           main subject _ _ __
\.(a) In the enclosed envelope you will find 50 cards, each with a sentence containing
    the word pitiiii. Could you please sort these instances of the meanings of the
    verb pitiiii into groups where each member shares a "similar enough" meaning.
    Let the sorting be solely based on the meaning of the word pitiiii, not on the
    meaning of the sentence as a whole - even if the meaning of pitiiii is understood
    on the basis of the sentential context. For the purposes of this research, it is not
    essential, either, e.g. which form the verb pitiiii occurs in.
        (The sentences are numbered only in order for you to be able to refer to those
    numbers in some of the questions to follow.)
        Sort the sentence cards into piles. A pile can consist of any number of
    sentences, and a pile can also consist of one sentence alone. When you have
    finished piling, join the cards in each pile with a small paper clip (found in the
    envelope), so that it will be easier for me to collect the piles in the end.
(b) If some sentences are especially difficult to sort, mark a "D" on such sentence
    cards.
(c) Can you think of names for these piles - i.e. for these types or classes of
    meanings? (For example, you can use some other Finnish verb, which roughly
                            Experiments with English get and Finnish pitiia      189
   means the same as the meaning of the pitiiii in question.) In that case, write that
   name on the topmost card.
(d) When you were doing the pile sorting, did it occur to you that some meanings
    were quite close to one another but not as close as the members of one pile?
    Would you be able to combine some piles into classes at an upper level? Use
    the bigger paper clips for combining.
2. The piles that you sorted can be said to correspond to meaning classes. In your
   opinion, what are the three most important/central meaning classes? Refer to the
   names you used or to the number of the topmost card.
_ _ _ _ _ _ and _ _ _ _ _ _ and _ _ _ __
3. Continuation to task No. 2: please choose one sentence (i.e. one card) out of
   each of the three most important classes which best represents the respective
   meaning class, in your opinion. Refer to the numbers on the cards.
_ _ _ _ _ _ and _ _ _ _ _ _ and _ _ _ __
4. The term polysemy is usually used to mean that one word (or some other item)
   has several meanings which are "related to one another" in a plausible way. The
   meanings have in other words developed from one another in the course of time,
   and even the present-day speaker can in principle think of the connection between
   the meanings. Do you feel that all of the meanings of pittltl that you dealt with
   are in this manner "related to one another", or is one of them / are some of them
   very much apart from the rest?
Could you now place the card piles into the envelope together with this questionnaire.
Notes
 I. Roy HaITis (personal communication) belongs to the latter type, who commented
    on Raukko (1997) that polysemy is a pseudo-problem in integrationist linguistics,
    because it is only a feature of langue. Although a possibly relevant point, the
    consequent discussion would ultimately concern the extension of polysemy, and
    I see my attempt to group together polysemy, flexibility, fuzziness, and situational
    variability as an attempt to bring polysemy also very much into the realm of
    parole.
2. Langacker for instance (1988a: 136) argues that the distinction between polysemy
    and homonymy is a matter of degree and cannot be reduced to a simple dichotomy.
3. I wish to express my gratitude to all the 329 anonymous informants as well as
    the people who helped me in organizing the 1994 experiments, especially WaIter
    Bower, Harry Brown, Kit Johnson, Melanie McGouran and Jackye Plummer.
4. I would also like to point out interesting ongoing research on the development of
    the polysemy of get in children, done by Simone Duxbury (Monash University,
    Melbourne, Australia) and Brigitte Nerlich, Zazie Todd and David Clarke (see
    Nerlich, Todd and Clarke, this volume).
5. The American flavour is represented for example in the distinction between have
    got ('possess', 'must') and have gotten (for all other meanings) and in the
    American spelling used in all of the examples, which authentically come from
    my American informants.
6. This sentence was used as an article title by my Helsinki colleague Pekka Kuusisto,
    and the interpretations below reflect the discussion we had over the title.
7. Henceforth I will use SMALL CAPITALS to indicate technical terms that function as
    names of meaning types that my experiments and my analyses suggest. Less
    technical descriptions are given in single quotes, such as 'permission' a few lines
    below.
8. METAPHORICAL OBTAINING POR SOMEONE ELSE is a fairly unimportant meaning type,
    if not virtually non-existent, and in this matrix it mainly serves analogy. None of
    my informants (neither [1991] in Helsinki nor [1994] in the USA) produced
    examples that I would like to interpret and classify as METAPHORICAL OBTAINING
    FOR SOMEONE ELSE; yet in dictionaries there are some possible candidates, such as
    He got her a job with the telephone company (Collins Cobuild English Language
    Dictionary [1987]: Reading 8.4), whose metaphoricity, however, is not very strong.
9. I have to explain an additional feature of informant No. 19's response, because
    it functions as an explicit proof that American high school students seem to have
                             Experiments with English get and Finnish pitaa        191
    a clue of what they are doing in my production test. Namely, she not only
    produced examples, but dictionary-type definitions. She explicated her first
    example (9a) as To get is to receive something and her example (9b) as To get
    something is to go buy something or accomplish something.
10. The average ratings ranged from 0.33 to 3.13, so that 2.13 is a relatively high
    difference rating. The standard deviation in the ratings for this pair was 1.35.
    The standard deviations ranged from 0.94 to 1.46.
11. Two orthographicaIly identical cases of, e.g., I'm glad J got to know you can
    mean either ABILITY or REACHING A STAGE depending on the intonation: ABILITY has
    stress on got, REACHING A STAGE on know. (I thank Diana ben-Aaron for pointing
    this out.) Needless to say, my method fails to account for prosody.
12. As in the case of the get production test, most pitiiii production test infonnants
    provided me with paraphrases of these examples (e.g. where they had to replace
    pitiiii with another word) so that I did not have to rely only on my native speaker
    intuition in order to interpret the meanings of pitiiii from the sentential context,
    but to save space I will not show these paraphrases and their glosses.
13. Explanations of the abbreviations for the suffixes referred to in examples:
          NOM      nominative                 INF          infinitive
          GEN      genitive                   IMPER        imperative
          PART     partitive                  PAST         past tense
          ACC      accusative                 SG           singular
          ESS      essive                     PL           plural
          INESS    inessive                                first person
          ELAT     elative                    2            second person
          ILLAT    iIlative                   3            third person
          ALLAT    aIlative
References
Bn!al, Michel
   1868       Les idees latentes du langage. Le~on jaite au College de France
              pour la reouverture du cours de grammaire comparee,le 7 dec. 1868.
              Paris: Hachette.
Caramazza, Alfonso and Ell en Grober
   1976       Polysemy and the structure of the subjective lexicon. In: Clea Rameh
              (ed.), Semantics: Theory and Application. Georgetown University
              Round Table on Languages and Linguistics 1976, 181-206.
              Washington, D.C.: Georgetown University Press.
Colombo, Lucia and Giovanni B. Flores d' Arcais
   1984       The meaning of Dutch prepositions: a psycholinguistic study of poly-
              semy. Linguistics 22: 51-98.
192     larno Raukko
Deane, Paul
   1988         Polysemy and cognition. Lingua 75: 325-361.
Erdmann, Karl OUo
   1910         Die Bedeutung des Wortes. Aufsatze aus dem Grenzgebiet der Sprach-
                psychologie und Logik. 2. Aufl. Leipzig: Avenarius.
Geeraerts, Dirk
   1993         Vagueness's puzzles, polysemy's vagaries. Cognitive Linguistics 4:
                223-272.
Gibbs, Raymond w., Jr., Dinara A. Beitel, Michael Harrington and Paul E. Sanders
   1994         Taking a stand on the meanings of stand: bodily ex.perience as moti-
                vation for polysemy. lournal of Semantics 11: 231-251.
Givon, Talmy and Lee Yang
   1994         The rise of the English GET-passive. In: Barbara Fox. & Paul J.
                Hopper (eds.), Voice: Form and Function, 119-149. Amsterdam and
                Philadelphia: John Benjamins.
Kimball, John
   1973         Get. In: John Kimball (ed.), Syntax and Semantics 2,205-215. New
                York: Seminar Press.
Langacker, Ronald
   1988a        A usage-based model. In: Rudzka-Ostyn (1988), 127-163.
   1988b        A view of linguistic semantics. In: Rudzka-Ostyn (1988), 49-90.
Lehrer, Adrienne
   1974         Homonymy and polysemy: measuring similarity of meaning. Language
                Sciences 32: 33-39.
Lindstromberg, Seth
   1991         Get: not many meanings. International Review ofApplied Linguistics
                in Language Teaching 29: 285-302.
Nerlich, Brigitte
   1992         Semantic Theories in Europe 1830-1930: From Etymology to Con-
                textuality. Amsterdam and Philadelphia: John Benjamins.
Niedzielski, Henry
   1976         Semantic considerations of get and some of its Polish equivalents.
                Papers and Studies in Contrastive Linguistics 5: 219-238.
Nyrop, Kristoffer
   1913         Semantique. Vo\. IV of his Grammaire historique de la langue
                franraise. Copenhagen: Gyldendalske Boghandel Nordisk Forlag.
6stman, Jan-Ola
   1988         Adaptation, variability, and effect: comments on IPrA Working Docu-
                ments I and 2. IPrA Working Document 3: 5-40. University of
                Antwerp: International Pragmatics Association.
   2000         Merkityksen ajankohtaiset haasteet yleiselle kielitieteelle [The current
                challenges of meaning for general linguistics]. In: Anu Airola, Heikki
                            Experiments with English get and Finnish piiliii     193
Ken-ichi Seto
1. Introduction
The last quarter of a century has seen a rising concern with metonymy. The
literature, from Nunberg's (1978) pioneering exploration to Radden and
Koevecses' (1999) penetrating investigation, by way of Lakoff and Johnson's
(1980) influential work, has shown that metonymy is not only as pervasive
as metaphor but also no less important in the daily use of language. In spite
of the continuing expansion of the metonymic realm as is shown in Panther
and Radden (1999) and Barcelona (2000), however, the core notion of
metonymy is, so far as I can see, not yet established. I shall argue specifically
(i) that there has been no satisfactory definition of metonymy yet, owing
to confusions about the difference between entities and categories; (ii) that
the ultimate reason why those confusions so often occur resides in the
(inevitable) spatial representation of categorical relations; and (iii) that the
network model (the prototype-extension-schema triangle), which is supposed
to deal with polysemy, does not work because metonymy has no proper place
in the model. After these arguments I shall offer a new way of looking at
polysemy: a cognitive triangle whose vertices are metaphor, metonymy, and
synecdoche.
define metonymy. But what does "contiguity" involve? So far as I can see,
this is the most important question to ask when defining metonymy. Does it
mean the contiguity of entities alone? Or does it also mean the contiguity of
categories? If the inclusive definition were taken, what would the contiguity
of categories mean? In other words, is metonymy concerned only with a
referential transfer between entities, or also with a nonreferential transfer
between a more comprehensive category and a less comprehensive category?
On the other hand, if one chose not to use the tenn "contiguity", at least in
some cases, then what should be the defining characteristic of metonymy?
To answer these questions, I shall first offer a new definition of metonymy
which departs in some crucial respects from traditional views.
(i)       While the traditional notion of metonymy has kept synecdoche as its
          servant, under the new definitions (1) and (2), synecdoche is indep-
          endent of metonymy.
(ii)      Whereas traditionally metonymy is a mixed category, in that it covers
          entities and categories alike, the definition of metonymy in (1) makes
          it clear that metonymy is a coherent category that only comprises
          entity-based referential transfers and excludes categorical transfers.
                   Metonymic polysemy and its place in meaning extension        197
    This division of work is not just notational but also substantial, because it
is supported on lexico-semantic grounds. Metonymy is concerned with ref-
erence; synecdoche is associated with (extensional and intensional) sense;
"partonomy" denotes a referential relation which holds, for example, between
a windmi\l and its sails (the sails are a part of a windmill); and "taxonomy"
(hyponymy) signifies a categorical relation which holds, for example, between
a ticket and a traffic ticket (a traffic ticket is a kind of ticket). Thus under the
new definitions (3a) is a metonymy, and (3b) is a synecdoche (in a parking
situation).
B C
                  D
                       ~        E
Figure 1.
Figure 2.
thing
human animal
Figure 3.
a tree
branches a trunk
twigs leaves
Figure 4.
Hence, Figure I is called a "tree" diagram. Note that once the taxonomy of
things is represented by Figure I, it would become "a tree". And a tree as an
entity, not tree as a category, is composed of different parts such as a trunk,
branches, roots, and so on. Likewise, each node below the top (thing) in
Figure 3 would be interpreted as one of these parts. Therefore, taxonomy
would become partonomy! But, of course, this is fallacious reasoning. Where
are we going wrong? A pitfall is contained in the representation of Figure 3
itself. The moment the framework of Figure 1 is adopted to represent a
taxonomical relation, it automatically forces us to reason along the above
line and to conclude that there is no substantial difference between taxonomy
and partonomy because each node is a part of a whole tree. Confusion of
this kind, which may be called the partonomy-taxonomy fallacy (or the PT
fallacy), is pervasive and it is caused by a specific spatial metaphor such as
a tree diagram, as in Figure 1.
200     Ken-ichi Seto
    Perhaps this is one of the reasons why it appears that categories are better
represented by rings within a circle (Fig. 2). Consider Figure 5, but note
again that although human beings are a kind of (living) thing, the small ellipse
of human beings could also be said to be a part of the larger ellipse of (living)
things in so far as the represented forms (figures) are concerned. Here taxon-
omy once again would look like partonomy. Or, one might take a slightly
different line of reasoning: in Figures 2 and 5, rings and smaller ellipses are
rather contained in a circle and a large ellipse, respectively, so in I got another
ticket, ticket (container) stands for "traffic ticket", contained, just as in The
kettle is boiling, kettle (container) stands for "the water in the kettle", contained.
Container-for-contained is a regular pattern of metonymy. Consequently, one
might conclude that (3b) is an instance of metonymy. But this reasoning,
too, is fallacious, because whereas "traffic ticket" is a kind of ticket, "the
water in the kettle" is not a kind of kettle.
Figure 5.
    Lakoff and Johnson (1999: 51) apparently succumb to the force of the PT
fallacy (see also Lakoff 1987, Chapter 5). They point out that there is a
conceptual metaphor A CATEGORY IS A CONTAINER. The primary experiential
basis of the metaphor is, according to them, that "things that go together
tend to be in the same bounded region (correlation between common location
and common properties, functions, or origins)." I do not deny that there is
such a conceptual metaphor because in fact there is one, not only in English
but also in Japanese and perhaps many other (possibly all) languages. Thus
we ask, for example, "Are tomatoes in the fruit or the vegetable category?"
However, it is this conceptual metaphor that is the ultimate cause of the PT
fallacy because it blurs the distinction between folk understanding and expert
knowledge. It is necessary in the analysis of metonymy (as a piece of expert
                 Metonymic polysemy and its place in meaning extension    201
raised in (2) also makes use of the metaphor A CATEGORY IS A CONTAINER. This
is true, for "inclusion" and "comprehensive" are based on the A CATEGORY IS
A CONTAINER metaphor. The two words might therefore better have been
omitted in the definition. Compare a revised definition (4) with (2):
(i)     Categories are in the mind (i.e. represented in the brain) but are very
        hard to access by way of (meta)language. perhaps because the nature
        of categories resists verbalization. Categorization (e.g. edible vs. non-
        edible) is an essential function of living things from human beings
        down to amoebas. There is no doubt that amoebas can categorize for
        this feature, edible vs. non-edible, but from this no-one would infer
        that they have the A CATEGORY IS A CONTAINER metaphor. Categorization
        itself is possible without language. 4
(ii)    The force of the A CATEGORY IS A CONTAINER metaphor is compelling.
        More generally, the pressure of spatialization metaphors is so high
        that the moment one tries to talk about categories, the A CATEGORY IS A
        CONTAINER metaphor or any other closely related spatial metaphor is
        ready on the tip of the tongue. s
(iii)   Perhaps a rare linguistic tool that is appropriate for describing the
        true nature of categories, and is relatively free from the bias of folk
        understanding, is "kind of'. "Kind of' can be used to distinguish
        between taxonomy and partonomy (e.g. a traffic ticket is a kind of
        ticket, not a part of a ticket; a leg is a part of a table, not a kind of
        table). Thus (4) may again be paraphrased: synecdoche is a categorical
                  Metonymic polysemy and its place in meaning extension       203
Figure 6.
it is clear that whereas metaphor and synecdoche hold their places in the
network model, there is no place for metonymy.
    Langacker started to change his position later, gradually and tantalizingly,
until he finally decided to give metonymy a place in the network model
when he referred to "extension (generally metaphorical and metonymic)"
(Langacker 1995b: 111).7 A new version of the network model may be shown
in Figure 7.
schema
                   "he~~~=atk
                       A   --------------~~~        B
                                 extension       metaphor
                   prototype
                                                 metonymy
Figure 7.
    "Skirts" are not similar to "the girls who wear them", but only contiguous
with them. Of course, ultimately any two things can be said to be similar in
one respect or another, but that is not the point here.
    One merit of the network model in Figure 7 is that three major extension
patterns, metaphor, metonymy, and synecdoche, are all present, provided that
synecdoche is substituted for schema. Metaphor is based on the similarity
relation (the S-relation), metonymy on the E-relation, and synecdoche on the
C-relation. They make up the three vertices of what is called the cognitive
triangle.
                                      t
                           ~~
                         t/p~t
                          C                       E
                ~ecd~~OnY~
Figure 8.
which together make up the cognitive triangle. And each vertex extends a
newly acquired meaning again in the three directions. Metaphor is similarity-
based, metonymy is entity-based, and synecdoche is category-based. The
extension to C is of two kinds: one is the species-to-genus type as in (8a)
and the other is the genus-to-species type as in (8b).
    "Ship" in (8a) extends its sense from "send something by ship" to "send
something by any means of transportation". ''Temperature'' in (8b) extends
its sense from "the temperature of the body" to "fever". In the latter example,
"extends" is used in an extended (Le. species-ta-genus, or schematic) sense.
Metonymy is of three types depending on the kinds of entities: spatial (e.g.
book), temporal (e.g. earthquake), and abstract (e.g. beauty) (for details and
examples, see Seto 1999).9
    Another important point to make about the cognitive triangle is that
metonymy and synecdoche belong to different cognitive domains. to See
Figure 9.
    Metonymy works in the E-domain, which is associated, typically, with
the real world where entities are arranged concentrically. We tend to see two
entities as closely connected if they are in a closer position than others. The
                  Metonymic polysemy and its place in meaning extension   207
metaphor
synecdoche metonymy
Figure 9.
The cognitive triangle also explains semantic changes along the time axis.
When speaking of the classification of diachronic semantic changes, Geeraerts
(1994) refers to "the classical quartet of specialization, generalization,
metonymy, and metaphor." Since specialization and generalization largely
correspond to the two aspects of synecdoche used in our sense, that is, the
genus-to-species transfer and the species-to-genus transfer respectively, the
classical quartet can be collapsed into the modern trio of synecdoche
(specialization and generalization), metonymy and metaphor. The cognitive
triangle works both for diachronic semantic development and for synchronic
semantic extension.
208     Ken-ichi Seto
4. Conclusion
Notes
     cognitive models, such cases are represented by metonymic models." For him,
     things that go together referentially and things that go together categorically can
     both be the basis of metonymy. Thus Lakoff (1987: 90) claims: "In short, a
     cognitive model may function to allow a salient example to stand metonymically
     for a whole category". Radden and Koevecses (1999), following this statement,
     refer to specific "category" metonymies such as STEREOTYPICAL MEMBER FOR A
     CATEGORY, CATEGORY FOR THE STEREOTYPICAL MEMBER, IDEAL MEMBER FOR THE
     CATEGORY, and CATEGORY FOR THE IDEAL MEMBER. All these fit the definition of
     synecdoche given in (2).
3.   Cruse (2000: 50) notes further that "Pustejovsky does not really explain why he
     opts for four qualia roles ... Croft (p.c.) sees no justification for four roles, or
     any definite number; if there were any super-domains, he would opt for just
     two, the taxonomic and the meronymic, the latter subsuming the constitutive,
     telic and agentive roles." While I agree with Cruse (and Croft) in dividing
     Pustejovsky's four qualia roles into two "super-domains" (the formal qualia and
     the other three), it seems hardly justified to squeeze the constitutive, telic and
     agentive roles into one and the same category of meronymy, at least under Crose's
     definition of the term. Rather, these three roles may naturally be put into the
     category of spatio-temporal contiguity which essentially characterizes metonymy,
     with the constitutive qualia related to spatial contiguity, and the agentive and
     telic qualias to temporal (i.e. causal) contiguity (see Seto 1999 for further dis-
     cussion and examples). Accordingly, Pustejovsky's formal qualia role may be
     associated with synecdoche in my sense, and the other three with metonymy.
     Although the latter three qualia roles do not exhaust the metonymic resources,
     the formal qualia does exhaust the synecdochic relations (genus-for-species, and
     species-for-genus).
4.   Since Rosch (1973, 1978), the term "prototype" has become very popular. In
     fact there may be little doubt of the psychological reality of the prototype structure
     of categories. But how are categories prototypically structured? Taylor (1995)
     points out that "[t]here are two ways in which to understand the term 'prototype'.
     We can apply the term to the central member, or perhaps to the cluster of central
     members, of a category.... Alternatively, the prototype can be understood as a
     schematic representation of the conceptual core of a category" (emphases added).
     Note that the prototypical understanding of the term "prototype" is based on the
     centre-periphery metaphor. This may have partly contributed to the genesis of
     the A CATEGORY IS A CONTAINER metaphor even on the expert knowledge level.
5.   Barcelona (personal communication) points out that another undesirable conse-
     quence of the A CATEGORY IS A CONTAINER metaphor is that it may mislead one into
     a belief that categories are discrete, since containers such as bowls and cups
     have discrete contours.
6.   Langacker's (1990: 271) original figure is shown in Figure 10. In this, "[t]he
     process of extension occurs because a speaker perceives some similarity between
     the basic value (i.e. the local or global prototype) and the extended value. This
                   Metonymic polysemy and its place in meaning extension         211
SCHEMA
Figure 10.
    similarity perception represents the commonality of the basic and extended values,
   so it constitutes a schema having the two for instantiations" (p. 271, emphases
    added). From this it is clear that Langacker's extension in the network model is
    exclusively metaphoric in nature (at the time the paper was written) because it is
   only in metaphoric extension that some schematic commonality can be extracted
   from a prototype and its extension, such as a real mouse and a computer mouse;
   on the other hand, it is virtually impossible, unless by force, to draw similarity
   perception from one sense - whether it is a prototype or not - and its metonymic
   extension, because a metonymic extension is associated with a prototype by way
   of contiguity, not similarity. It would be absurd, for example, to see some
   meaningful similarity between "long hair" per se and "a person with long hair"
   or between "a kettle" and "water in the kettle".
7. Langacker's (1984) "active zone" clearly constitutes part of metonymy, but there
   is no mention of metonymy there. The relationship between "active zone" and
   metonymy seems to have grown slowly. The first reference to metonymy is
   made in Langacker (1991: 456 footnote). Then Langacker (1993: 31) observes:
   "Metonymy largely overlaps with what I have called active zone phenomenon."
   The relationship between the two then develops rapidly into the total inclusion
   of "active zone" in the metonymy category in Langacker (1995a: 27): "Specifically,
   they [active-zone/profile discrepancies] represent a special case of METONYMY".
8. When dictionaries are compiled on the principle of clarifying the semantic
   networks of lexical entries, instead of describing their historical sense develop-
   ments or current sense frequencies, they should seriously take into account the
   distinction between metaphor, metonymy, and synecdoche. In this regard, The
   New Oxford Dictionary of English is worth mentioning because it identifies three
   major extension patterns: "figurative extension of the core sense", "specialized
   case of the core sense", and "other extension or shift in meaning, retaining one
   or more elements of the core sense". It is obvious that "figurative extension" is
   metaphoric and that "specialized case" corresponds to one of the two kinds of
   synecdoche (unfortunately, no mention is made of generalization). However, the
   third category is plainly miscellaneous, although, judging from the examples
   cited there (p. x), there is no question that metonymy (along with some others)
   comes into this third category.
9. An earthquake is a temporal entity, so we can say, for example, that "there was"
212     Ken-ichi Seto
References
Barcelona, Antonio
   2000         (ed.) Metaphor and Metonymy at the Crossroads: A Cognitive Per-
                spective. Berlin: Mouton de Gruyter.
   2002         Clarifying and applying the notions of metaphor and metonymy
                within cognitive linguistics: an update. In: Rene Dirven and RaJph
                Poerings (eds.), Metaphor and Metonymy in Comparison and Con-
                trast. Berlin: Mouton de Gruyter.
Browm, Lesley (ed.)
   1993         The New Shorter Oxford English Dictionary. New York: Oxford
                University Press.
Chomsky, Noam
   1965         Aspects of the Theory of Syntax. Cambridge, MA: MIT Press.
Croft, William
   1993         The role of domains in the interpretation of metaphors and meton-
                ymies. Cognitive Linguistics 4: 335-370.
Cruse, David A.
   1979         On the transitivity of the part-whole relation. Journal of Linguistics
                1: 29-38.
   1986         Lexical Semantics. Cambridge: Cambridge University Press.
   2000         Aspects of the micro-structure of word meanings. In: Yael Ravin
                and Claudia Leacock (eds.), Polysemy: Theoretical and Compu-
                tational approaches, 30-51. New York: Oxford University Press.
Geeraerts, Dirk
   1994         Historical semantics. In: Ron E. Asher (ed.), The Encyclopedia of
                Language and Linguistics, 1567-1570. Oxford: Pergamon Press.
                   Metonymic polysemy and its place in meaning extension       213
Jakobson, Roman
    1971 [1956] Two aspects of language and two types of aphasic disturbances. In:
                Roman Jakobson (ed.), Roman lakobson Selected Writings Volume
                2: Word and lAnguage, 239-259. The Hague: Mouton.
Lakoff, George
    1987        Women, Fire, and Dangerous Things: What Categories Reveal About
                the Mind. Chicago, IL: The University of Chicago Press.
Lakoff, George and Mark Johnson
    1980        Metaphors We Live By. Chicago, IL: The University of Chicago Press.
    1999        Philosophy in the Flesh: The Embodied Mind and its Challenge to
                Western Thought. New York: Basic Books.
Langacker, Ronald W.
   1984 [1990] Active zone. In: Ronald W. Langacker, Concept, Image, and Symbol:
                The Cognitive Basis of Grammar, 189-201. Berlin: Moulon de Gruyter.
   1990         A usage-based model. In: Ronald W. Langacker, Grammar and
                Conceptualization. Berlin: Mouton de Gruyter.
   1991         Foundations of Cognitive Grammar. Volume 11: Descriptive Applic-
                ation. Stanford, CA: Stanford University Press.
   1993         Reference-point constructions. Cognitive Linguistics 4: 1-38.
   1995a        Raising and transparency. lAnguage 71: 1--62.
   1995b        Cognitive grammar. In: Jan-Ola OSlman, Jef Verschueren, Jan
                Blommaert and Chris Bulcaen (eds.), Handbook of Pragmatics, 105-
                Ill. Amsterdam: John Benjamins.
Lyons, John
   1977         Semantics. 2 volumes. Cambridge: Cambridge University Press.
Nerlich, Brigiue
   (to appear) Synecdoche: a trope, a whole trope, and nothing but a trope. In:
                Armin Burkhardt and Brigiue Nerlich (eds.), Reflections on Tropes
                and Truth. Berlin: Mouton de Gruyter.
The New Oxford Dictionary of English
   1998         The New Oxford Dictionary of English, edited by Judy Pearsall. New
                York: Oxford University Press.
Nunberg, Geoffrey D.
   1978         The Pragmatics of Reference. Bloomington: The Indiana University
                Linguistics Club.
Panther, Klause-Uwe and Gtinter Radden (eds.)
   1999         Metonymy in lAnguage and Thought. Amsterdam: John Benjarnins.
Radden, Gtinter and Zoltan Koevecses
   1999         Towards a theory of metonymy. In: Panther and Radden (1999),
                17-59.
Reddy, Michael J.
   1979         The conduit metaphor: a case of frame conflict in our language about
214     Ken-ichi Seto
Adrienne Lehrer
1. Introduction
2(02). In some cases, however, the relationship of the senses has a different
"shape", as Joos (1958) observed. His analysis of the word code had circular
shape, and my analysis of -ist in Section 4 exhibits a chain-like structure,
where sense A gives rise to sense B, which in turn gives rise to sense C.
   Although this article deals only with English, it may serve as an example
for similar studies of other languages. My main source is the classic study
by Marchand (1966), supplemented by recent neologisms, which often reveal
aspects of productive senses of words and morphemes that are not necessarily
revealed in the conventional lexicon.
2. English prefixes
    Pre- and post- are predominantly temporal senses. The spatial senses are
restricted to anatomy and zoology (Marchand 1966: 134), and this sense
emerged in English later than the temporal one. The temporal sense of pre-
also has a range of meaning, influenced in part by the word-class of the
base. With nouns (prewar, precontact, pre-election) the time denoted is before
the event or period named. With verbal bases (preheat, preshrink, prepay),
the sense is "to perform some action before some pragmatically interpreted
action". To preheat the oven is "to turn it on to a certain temperature before
putting the food in". To prepay is "to pay [a bill] in advance of the due date
or the service performed". There has been a gradual drift in words like
preboard [an airplane] and preregister [for a class], where some individuals
act before other individuals do the same thing. To preboardlpreregister is
                                          Polysemy in derivational affixes   219
"to board or register, not before some other action, but before the regular
boarding or registration takes place".
   Another domain in which we find a familiar instance of polysemy is in
prefixes that denote hierarchies. Here the meanings include those of generality
(versus specificity), quantity (more or less), and, in the case of super- and
sub-, spatial orientation. See Table 2.
    The sense involving quantity implies a contextual norm, and all four of
the prefixes in Table 2 denote a quantity falling above or below that norm.
    The prefix arch- shares some of the senses of the four items in Table 2.
Its primary sense has to do with a social hierarchy, as in archbishop and
archduke, but it has developed a sense of quantity as well. Here we find
arch-heretic and arch-villain, where the meaning is "excessive". Marchand
(1966: 144) suggests that the use is pejorative, which may explain why he
has no examples like arch-hero and arch-saint.
    Generalizing over the nine items above, we see a set of meanings ranging
from space and time, and metaphorical extensions of spatial notions like
over and under, to relationships of power and to quantity. Generality is a
special case of quantity, since "more general" = "more inclusive".
3. Diminutives
    The metonymy of PLACE meanings for some of the items above are
transfers of the whole word, not the suffix. -ling, with the exception of
darling, is affectively pejorative, as in weakling, trivling "trivial, worthless",
giftling "trivial gift", witling "one with small wit" (examples and glosses are
from Walker [Dawson] 1936: 170-171). The meaning of the suffix -y is
primarily "like", "characterized by", "full of' (Marchand 1966: 352); the
diminutive sense is secondary. 1\vo diminutive suffixes are widely found in
personal names for girls, although speakers probably do not interpret them
as diminutives: Annette and Nancy. However, the -y is productively used for
nicknames: Barby, Keithy, Tommy.
                                               Polysemy in derivational affixes       221
    The suffix -er displays a traditional radial structure, with the AGENT sense
in the centre (see Fig. 1). (See Panther and Thomburg 2002 for a more detailed
analysis of the various senses of -er and their relationship.)
    Of the agent suffixes other than -er, the agent sense is not necessarily
central. Beard (1990) has described the frequency of agentive and instru-
mental polysemy in a number of languages, and such polysemy generally
occurs by means of -er and -ant/ent. The technical senses of -ite are mostly
from chemistry.
    The suffix -ist, which is attached to nouns or bound bases, has three
overlapping senses. The first is the agentive one, such as violinist or physicist.
The second, which is related to -ism, describes a proponent of an ideology,
222    Adrienne Lehrer
blender
                                                                 slipper,
                                                                 wrapper
                                                               clincher,
                                                               finisher
commissioner
such as federalist or capitalist. The third is like the second, but it additionally
adds a connotation of prejudice (Lehrer 1988, 1999). Historically, this sense
evolved from the word racist, "one who believes in innate difference among
races". But racist soon took on the sense "being unjustifiably prejudiced
against a group on the basis of race" and the negative connotation transferred
to the -ist, which expanded to sexist, ageist, speciesist, and other terms. This
shift illustrates Traugott's analysis, where pragmatic implicatures give rise
to conventional uses. "Semantic change ... involves specification, achieved
through inferencing" (1988: 413). More interesting, however, is her obser-
vation that (in the case of metonymy) generally the meaning shifts to "the
subjective belief state or attitude toward the situation" (p. 414). With -ist, the
earlier sense, as in federalist, has a neutral connotation, while the newer
sense, as in sexist, is negative and reveals the speaker's evaluation.
                                         Polysemy in derivational affixes   223
metamathematics
                                                      metaquestion.
                                                       metafiction
metagalru:y
that they have a sense STATE or CONDITION, but in addition each has its own
range of meanings.
   -dom (see Fig. 3) is in fact a reduced form of a formerly independent
word, with a central sense of "state" or "jurisdiction", as in kingdom or
Christendom. This sense includes both concrete and abstract meanings, so
we can have a phrase like kingdom of God, referring not to a physical place
but to an abstract realm. From this meaning it is a short step to the sense of
"territory", as in kingdom "place where the king rules", dukedom, and
sheikdom. Also from the general sense of "state, condition" it is a natural
shift to a special kind of condition illustrated by stardom, having the rank or
being in the class of stars.
                                                       dukedom
                         /TERRITORY
                STATE!
              CONDITION         •   STATUS, RANK       earldom
~colTIVITY clerkdom
  STATE!
CONDITION            •   STATE                       ~ STAGE OF LIFE
                                   motherhood - - -...                            babyhood
                    /"
                     .----r ROLE, POSITION                   leadership
STATE OF BEINGI                     t
 RELATIONSHIP ~                     •
                                                             kinship
                  ~ RESPECTFUL DESIGNATION                   ladyship. lordship
                                   ~
                                 STIPEND                     fellowship, scholarship
other cases, decisions are hard to make because the ambiguity tests are
indeterminate. Ambiguity tests often fail where a general sense includes a
specific sense, such as egg "reproductive body from various animals", "egg
from a bird, with a yellow yolk, surrounded by a hard shell" [and other
senses]. This type of polysemy has been termed autohyponymous polysemy
by eruse (1995).
    In some of the affixes I have examined, neither solution - monosemy nor
polysemy - seems optimal. Selecting monosemy leaves too much to the
pragmatics and ignores the rather specific, contextually determined senses;
speakers do not need to figure out that specific meaning each time they
encounter a new use of the affix. Yet postulating many specific senses fails
to capture the fact that there is a unity that connects the various meanings. In
Figures 1 and 2, I have tried to capture both the general and specific senses.
The general meaning includes the specific ones, but not necessarily exhaus-
tively, and this in turn allows for the generation of nuances and innovations
lying outside the specific senses. Tuggy's model (1993) is the closest one I
have found to resolving the dilemma.
    Tuggy, like Geeraerts (1993), shows that there is a continuum, not a sharp
break, between ambiguity and vagueness. Moreover, diachronic factors blur
the distinction even more. Tuggy shows how general schemas [general senses]
and their elaborations [specific senses] co-exist in languages. Schemas or
elaborations, or both, can become well-established through repeated usage
and thereby become salient (Thggy 1993: 279). When an elaborated sense
becomes salient, it is considered prototypical. If the general sense is not
salient but some of the elaborations [specific senses] are salient, the senses
strike us as a case of ambiguity. However, if the general sense is salient, but
the specific senses are not, we judge the item as exhibiting vagueness. When
the general sense and at least one of the elaborated senses are salient, we are
inclined to judge ambiguity. Tuggy's model of schema and elaborations does
not require us to choose vagueness or ambiguity - we get both, along with
an account of context and conditions that explain our judgements in clear
cases and borderline cases.
    Although the concepts represented by English affixes are limited, they
show polysemy that is similar to that found in lexemes. There are meta-
phorical transfers of "space" and "time", "vertical position (superiority)",
"generality", and "rank", and there is some metonymy, as in "a group or
class of people" and "inhabitant of'. Also there is the metonymic shift from
"membership of a group" to "award for election to the group" to "financial
230     Adrienne Lehrer
Notes
I. Mithun (1996) provides data from American Indian languages showing that the
   range of meanings of derivational affixes is larger than Beard claims.
2. Marchand (1966) lists all the items I discuss as prefixes, and I will follow him in
   this point. However, these morphemes differ as to their boundedness. Unfor-
   tunately, English spelling is no guide. Some items are usually spelt with a hyphen,
   some with a space before the base, and some are always part of the following
   word. Moreover, there are often inconsistencies in a given item.
3. Some opera companies call the translations of the text appearing above the stage
   supertitles; other call them surtitles. The prefix sur- is much less productive in
   English than super-.
4. Starlet suggests quantity - "a little bit of stardom" and possibly mild derogation.
   It also seems restricted to females, reinforcing an association of feminine and
   diminutive senses.
5. There are other agent/subject suffixes not dealt with here, some of which are
   discussed in Lehrer (1999).
6. The OED provides an 1853 citation of metamathematics as "beyond the scope
   of', but a 1926 citation is glossed as "pertaining to".
7. There is a whole scale from very concrete concepts expressed in words like
   table to extremely abstract ones, such as the meaning of metaphysics, with an
   indefinite number or points in between. Moreover, it is not always clear how to
   arrange concepts on this scale. Is time more abstract than space?
                                            Polysemy in derivational affixes    231
References
Lehrer. Adrienne
    1988        A note on the semantics of -ist and -ism. American Speech 63: 181-185.
   1993         Prefixes in English word formation. Folia Linguistica 2: 133-148.
    1996        Why neologisms are important to study. Lexicology 2: 63-73.
    1999        Are affixes signs?: The semantic relationships of English derivational
                affixes. In: Wolfgang U. Dressier. Oskar E. Pfeiffer. Markus A.
                Pochtrager and John R. Rennison (eds.), Morphologica 1996, 143-
                 154. (Papers from the Seventh International Morphology Conference,
                Vienna, Austria, February 1996.) The Hague: Holland Academic
                Graphics.
Lieber, Rochelle
    1992        Deconstructing Morphology. Chicago: University of Chicago Press.
Marchand, Hans
   1966         The Categories and Types of Present-Day English Word Formation.
                Birmingham: University of Alabama Press.
Mithun. Marianne
    1996        The meanings of roots and affixes. Papers from the Seventh Inter-
                national Morphology Conference, Vienna, Austria, February 1996.
Nerlich. Brigitte and David D. Clarke
   1992         Semantic change: case studies based on traditional and cognitive
                semantics. Journal of Literary Semantics 21: 204-225.
Norrick, Neal R.
   1981         Semiotic Principles in Semantic Theory. Amsterdam: John Benjarnins.
Oxford English Dictionary
   1989         Oxford: Clarenden.
Panther, Klaus-Uwe and Linda L. Thornburg
   2002         The roles of metaphor and metonymy in -er nominals. In: Rene
                Dirven (ed.), The Metaphor-Metonymy Continuum: The Ongoing
                Debate, 279-319. Berlin: Mouton de Gruyter.
Ruhl, Charles
   1989         On Monosemy. New York: State University of New York Press.
Traugott, Elizabeth C.
   1988         Pragmatic strengthening and grammaticalization. Berkeley Linguistics
                Society 14: 40~16.
Tuggy. David
   1993         Ambiguity, polysemy, and vagueness. Cognitive Linguistics 4:
                273-290.
Walker, John
   1936         Rhyming Dictionary, Revised and enlarged by Lawrence H. Dawson.
                New York: Dutton.
Wilbur. Ken
   1977         The Spectrum of Consciousness. Wheaton, fi..: Theosophical Publishing
                House.
The role of links and/or quaUa in modifier-head
constructions
Beatrice Warren
2. Empirical evaluation
Some years ago Geeraerts presented ingenious studies of the particular features
that are salient in naming new objects. We get a detailed description of one
of these concerning a garment first used in Belgium in 1987, frequently
referred to by the English loan word leggings (1997: 32-47). The occurrence
                         Links and qualia in modifier-head constructions    235
cart", etc. Ljung's study, however, showed that adjectives also contain a link,
or, more precisely, that the definition of adjectives typically involves a
predicative verb and its complement, which is normally identical to the
nominal stem in the case of denominal adjectives: stony "abounding in
stones", beautiful "possessing beauty", noisy "producing noise". Aarts and
Calbert (1979) formalized this mode of defining adjectives and stipulated
that there are at least two types of semantic elements in adjectives: one with
predicative and one with referential force. 3 The predicate links the referential
content of the adjective with its head: angry man "man experiencing anger",
sad event "event causing sadness". There is, they also postulated, a limited
number of preferred links. Which of these is relevant depends on the nature
of the referential content and the head of the adjective. Warren (1988) later
illustrated that the variability of the link accounts for a certain type of
polysemy in the case of adjectives. Consider Table 1.
    The studies listed above did not only have in common the assumption
that the semantics of nominal modifiers could involve implicit links. As just
indicated, they also postulated that there exists a list of preferred relations
or, to use my terminology, default relations, which would naturally be invoked
unless there was strong contextual evidence to the contrary (Warren 1985:
378-380, 1988: 132-134). Compare Pustejovsky's admission that a particular
interpretation specified by the qualia structure may be overridden in certain
contexts. For instance, Bob finished the novels could have two possible regular
readings, i.e. "Bob finished reading the novels", in which case a feature in
the TELlC quale would be activated (novels are for reading), or "Bob finished
                              Links and qualia in modifier-head constructions   237
writing the novels", in which case a feature in the AOENTlvE quale is activated
(novels come about by being written). However, the interpretation "Bob
finished wrapping up the novels" could be triggered in a case such as the
following: Bob, we know, works at a publisher's firm and one of his jobs is
to wrap and send various types of books - novels, textbooks, children's books,
etc. Linguists have in the past not been interested in default values, possibly
for ideological reasons, "all and only the grammatical constructions" being
the goal. If Pustejovsky succeeds in persuading the linguistic guild of the
validity of default values, this would amount to a minor breakthrough in my
view, in particular in semantics. 4
    The lists of links presented in the studies cited are not identical, but
strikingly similar, and could be said to be variations on a theme. My own
version is shown in Table 2.
    As we see from the above, the role combinations are reversible with the
                                                          Compositional
consist of/constituted by        electric power           SOURCE-RESULT
constituting                     tragic event             RESULT-SOURCE
                                                          Possessive
belonging to                    facial skin               WHOLE-PART
having                          rational creature         PART-WHOLE
                                                          Locative
occurring in/at/on              polar climate             PLACE-oBJ
                                medieval games            TIME-oBJ
containing                      poisonous plant           OBJ-PLACE
during which ... prevails       happy times               OBJ-llME
                                                          Causative
caused by                        nervous breakdown        CA USER-RESULT
causing                          noisy children           RESULT-CA USER
                                                          Purpose
be for                           culinary utensils        GOAL-INSTRUMENTS
                                                          Resemblance
be like                          golden hair              COMPARANT-COMPARED
be in accordance with            normal behaviour         NORM-ADHERENT
238    Beatrice Warren
exception of the resemblance and the purpose relations. The reason for this,
in the case of resemblance, is that although it is possible to compare the
shape of a tulip to that of a bell (bell tulip) or vice versa (tulip bell), the
referring item in this role combination automatically assumes the role COM-
PARED. In the case of the purpose compounds, I found that they could be
assigned two links and two role combinations: tablecloth: "cloth (OBlIINS)
which is for putting on tables (LOC/GOAL)"; card table: "table (LOc/INS) which
is for playing cards (OBI/GOAL) at"; night-dress: "dress (OBI/INS) which is for
wearing at night (TIME/GOAL)"; dinnertime: "time (TIME/INS) which is for having
dinner (OBI/GOAL) at"; ball bat: "bat (CAUSERlINS) which is for hitting balls
(OBI/GOAL) with"; football: "ball (OBI/INS) which is for hitting with foot
(CAUSERlGOAL)". As we see, the basic roles are reversible, but the superimposed
purpose role is not.
    There is quite substantial empirical support for the existence of these links.
Ljung (1970), whose aim was to establish the meanings that denominal
adjectivalization could express, examined the definitions of 218 adjectives
ending in -al, -Jul, -ic, -ish, -ly, -ous or -y in Websters Seventh New Collegiate
Dictionary and Webster's Third New International Dictionary. He found it
possible to reduce the numerous definitions he had collected into a small
number of types (gross definitions), either because they were synonymous
or hyponymous. The present writer analysed 4,500 noun-noun compounds
in context (types) and 291 adjectives (types) in altogether 12,890 contexts
(Warren 1978, 1984) and found that, if there was a covert link,5 it would
with very few exceptions be one of those listed above.
    An important discovery made by Ljung was that certain links ("meanings"
in his terminology) are incompatible with bona fide adjectives, i.e. adjectives
that could occur attributively (sad girl), predicatively (girl is sad), which
could be graded (very sad girl), and which could tolerate insertions (sad,
anxious girl). These links could, however, be found with modifiers which
had the morphological make-up of adjectives, but which in all other respects
were nouns: glandular pain, sulphuric acid, continental capital. These con-
structions he called N+aff N (noun+affix noun) compounds6 and (obviously)
he considered them types of compounds. He therefore suggested that certain
links were restricted to bona fide adjectives and others to compounds, whether
of the noun-noun or of N+aff N type. My own research into the semantics
of nominal modifiers did not completely confirm this surmise. Instead I found
that all the links that could occur in adjectives - also those compatible with
bona fide adjectives - could occur in noun-noun compounds. The restriction
                            Links and qualia in modifier-head constructions       239
was that not all links found in compounds could occur with bona fide
adjectives, although they could be found in the so-called N+aff adjectives.
Consider Table 3.
    I also found that the function of the modifier was a crucial factor as far as
stress, syntactic behaviour, morphology and the semantics of the modifier
are concerned. This was so in particular if the modifier was a classifier; in
which case it combines with the head to form a referring unit, or if it was
purely descriptive, in which case the modifier does not affect the meaning or
reference of the head but contributes additional information about it (i.e.
strictly speaking about the referent(s) of the head). Bolinger (1967) referred
to this distinction as reference-modifying and referent-modifying, respec-
tively. Referent-modifying modifiers, I found, are invariably bona fide
adjectives, which - as just indicated - could only occur with certain links,
viz. those in Table 4.
240        Beatrice Warren
    As I think the examples just given show, the meaning of the whole phrase
is influenced by the function of the modifier. I will expand on this point
later.
    Another consequence of the distinction between basically bona fide adjec-
tives (i.e. non-derived adjectives and those in -y, -Jul, -ish etc.) and the
potential N+aff adjectives (i.e. those in classical suffixes) is that the latter-
being compatible with a greater number of links - should be polysemous to
a greater extent than the former. This appears to be true, at least in principle.
Consider Table 7, which deals with musical.
    However, it should be pointed out in this connection that not all the -ai,
-an, -ar, -ic etc. adjectives occur with all the links in principle available for
classifiers. Nor can all bona fide adjectives occur with all the links in principle
available for these adjectives. There are of course natural notional restrictions:
the referential content of dusty could hardly take on the role NORM, musical
or nervous are hardly compatible with the role TIME, etc., but besides such
natural limitations (which would be taken care of by qualia), there are numerous
restrictions which must be characterized as idiosyncratic. For instance, sad
is compatible with the PART role (sad child) and with the RESULT role (sad
news "causing sadness"), but hungry is compatible only with the PART role
(hungry child), not with the RESULT role (?hungry smells "causing hunger");
nervous can occur with the PART role (my nervous sister) and the CAUSER role
(nervous breakdown), but not with the RESULT role (?nervous news "causing
nervousness"); sensational is - contrary to expectation - incompatible with
the experience link (?sensational being "having sensations").
compassionate                          pathetic
envious                                enviable
hungry                                 appetizing
contemptuous                           contemptible
furious                                infuriating
                          Links and qualia in modifier-head constructions   243
4. Referential metonymy
s. Evaluation
Links Qualia
    Admittedly, the FORMAL qualia seem poorly represented among the links
since only one of the features specified by Pustejovsky is included. It should
be remembered, however, that the other features would be involved in many
PARr-WHOLE combinations: tall building "building having great vertical length" ,
small house "house having little size", etc. The resemblance relation, however,
truly lacks a matching quale. A moment's reflection will establish that this is
natural. Resemblance arises because some feature or features of COMPARED -
one's intended referent - is reminiscent of some feature or features of COM-
PARANT - some other named entity: foothill "hill positioned like a foot",
frogman "man having shape and function (i.e. diving) of a frog", peanut
"nut growing (in pods) like peas", puppet government "government being
manipulatable like puppets". As we see, resemblance in these constructions
presupposes already-named entities which belong to distinct categories. It
246    Beatrice Warren
can therefore only be a relation, not a quale, which is what causes us to refer
some entities or phenomena to a single category. (It is possible to make use
of the resemblance relation to name something unnamed, in which case we
have a non literal metaphorical use of a word. That is to say, the COMPARED
(the intended referent) is implied and the COMPARANT is mentioned, assuming
- but only apparently - the role of a referring item.)
    This brings us to the question of whether we are dealing with a set of
implicit links or latent qualia roles. Pustejovsky's position ought to be that
the qualia give rise to the links: combining silver with spoon will highlight
the FORMAL role of matter in silver and the AGENTlVE role of artefact in
spoon, producing "made of'; combining tea and spoon will highlight the
CONSTITUTIVE role of "liquid" in tea and the TELIC role of "for stirring" in
spoon, giving the link "for stirring"; combining apple with tree will
highlight the AGENTIVE role of "produced by tree" in apple, which then
naturally links apple and tree; combining seal with skin will highlight the
CONSTITUTIVE role of skin, i.e. part of animate, giving "skin of seal". Whether
qualia roles could always be appealed to in this manner is an open question.
What latent qualia in apple and cake are activated to give a locative have
relation in apple cake, and similarly, in victory and garden to give "for
promoting victory" in victory garden, or frog and man to produce "resembling"
in frogman?
    Assuming that we accept the existence of both default links and qualia,
there seem to be the following possible hypotheses as to their relation: (i)
links and qualia just happen to coincide, but are in fact quite unrelated; (ii)
qualia are basic and give rise to links, which are not listed but completely
dependent on qualia to be activated; (iii) qualia give rise to implicit links,
but these, being frequently activated, become abstracted as grammatical types
of meanings, and also become memorized and no longer necessarily depen-
dent on qualia to be suggested; (iv) qualia and links are distinct because they
have distinct functions but have a common conceptual basis.
    Of these, I find the last two equally plausible, the second somewhat less
plausible and the first implausible for the following reason: in view of the
fact that modifier-head constructions pick out a referent - something we
conceive of as a unit - just as single nouns do, the links and the qualia ought
to reveal what constellations of entities or phenomena we tend to recognize
as potentially unit-forming. Those which coincide in time and/or place (i.e.
the relations involving composition, possession and location) are natural
candidates. In the case of causal relations we seem to recognize that producer
                         Links and qualia in modifier-head constructions    247
and product were once a unit. Consider moonlight, piano sound. We here
seem to have revealed the cognitive basis for perceiving distinct elements
which nevertheless are accepted as units. If this is indeed so, then links in
modifier-head constructions, which denote units, and qualia in single nouns,
which also denote units, ought to be of the same kind (with the natural
exception of resemblance links). The fact that they are strengthens this
surmise. It is also natural that we think of entities which are perceived as
units for the same reason as forming a category.
    Links and qualia are similar in that they become activated when words -
in particular nouns - are combined. As we have seen, they are also strikingly
similar as to type. Indeed it is remarkable that a deductively constructed
model happens to agree so well with the results of empirical studies, the
existence of which the constructor of the model was probably quite unaware
of. Our conclusion must be that this agreement lends considerable support to
the theory of qualia.
    It would, however, be a mistake to think of Pustejovsky's model as a
"finished product", as he himself points out. Above I have pointed out some
weaknesses, e.g. that it seems to be restricted to first-order entities and that
the fact that defining features may vary as to degree of salience is ignored.
The main omission of the model is, however, that it fails to allow for the
influence of contextual referents on word meaning. The fact is that the
interpreter does not expect contextual referents invariably to agree with stored
meanings and denotata. 12 There may be good enough matches, in which case
we have non-prototypical senses. It may be possible to include the contextual
referent in the conventional denotatum and yet other salient class-distinctive
features suggest themselves, in which case we have the non-compositional
meanings I have repeatedly exemplified above, i.e. gold fish, glamour girl,
beauty spot, black board, compact disc, etc. It may be impossible to include
the contextual referent in the conventional denotatum, in which case we have
different types of nonliteral meanings. New meanings arise not only when
words are combined, but above all when words are matched with referents.
As we have seen, non-prototypical meanings, literal non-compositional
meanings and most types of nonliteral meanings,l) i.e. meanings which are
truly polysemy-forming and invigorating factors in lexical semantics, are as
yet not accounted for in Pustejovsky's model. Nevertheless, since Pustejovsky's
model has opened our eyes to a possible basis for the semantic regularity we
find in modifier-head constructions, to the distinction between listable and
nonlistable polysemy, and to the difference between polysemy which arises
248     Beatrice Warren
when words are combined and the polysemy which arises when words are
matched with contextual referents, it may very well turn out to be a substantial
push forward within semantics.
Notes
     I see default values as the outcome of two tendencies in human language, striving
     in opposite directions: constraints and rules on the one hand, which facilitate
     interpretation, and flexibility on the other, which promotes expressiveness.
5. Far from all adjective-noun combinations contain a covert link. For instance,
     there is no implicit predicate in nominalizations such as national leader ("some-
     body who leads nations") or presidential appointees ("people whom the president
     appoints"). In some adjectives functioning as adverbials, there is no link at all:
    perfectfool ("somebody who is perfectly foolish") (see Paradis 2(00). (Pustejovsky
     accounts for these by means of selective binding [1996: 127-131].) Also, there
     are quite a number of adjectives which have conventionalized implied or fig-
     urative meanings. Familiar, e.g., can no longer mean "consisting oflbelonging
     t%ccurring in etc. family".
6. The term was originally coined by Lees (1960: 181).
7. That bona fide adjectives are restricted to these links is an intriguing discovery
    for which I have no good explanation.
8. Basically is a necessary qualification in this context, since it is possible to classify
     by describing (short story, black coffee, white people) or by identifying (writer's
    cramp, clown s attire), or identify by describing (bring me the red box), etc.
9. Experiencing is a type of possessive link which requires that the referential content
    of the modifier denotes some sensation and that the head denotes an animate.
    Manifesting is dependent on the experience link and requires that the referent of
    the head is thought of as a natural mediator of sensations: (sad) eyes, (angry)
    face, (nervous) hands, (happy) smile, (furious) letter, (bitter) comments. If this
    restriction is violated, we have what is sometimes referred to as transferred
    epithets or the Wodehouse effect, e.g. a sad cigarette, a contemplative lump of
    sugar.
         An alternative account of combinations of this kind sometimes suggested is
    to look upon them as metonymic: happy face "the possessor of the face experi-
    ences happiness"; angry letter "the author of the letter experiences anger".
10. For reasons of space I have refrained from discussing the semantics of genitive
    constructions. I will be content with the comment that the same type of default
    relations appear to be involved. Consider loan s picture "picture that Joan owns"
    (possessive link), "that Joan made" (causal link), "that represents Joan" (com-
    positional link).
11. For Pustejovsky's account of referential metonymy, see Pustejovsky (1996:
    90-95).
12. Compare NerIich and Clarke (200l), who point out that words keep recharging
    their semantic batteries because there always is and always will be a discrepancy,
    a fundamental incongruence, between the supply of words and our communicative
    demands.
13. Compare Kilgarriff (to appear), who also found that Pustejovsky's model failed
    to account for metaphors.
250     Beatrice Warren
References
1. The problem
   This tendency of words to lose the sharp, rigidly defined outline of meaning
   which they once possessed, to become of wide, vague, loose application
   instead of fixed, definite, and precise, to mean almost anything, and so really
   to mean nothing, is ... one of those tendencies, and among the most fatally
   effectual, which are at work for the final ruin of a language, and, I do not fear
   to add, for the demoralization of those that speak it (Trench 1856: 192).
    According to Trench: "The causes which bring this mischief about are
not hard to trace": words which get into general use are caught up "by those
who understand imperfectly and thus incorrectly their true value". Con-
sequently, words "become weaker, shallower, more indefinite; till in the end,
as exponents of thought and feeling, they cease to be of any service at all"
(1856: 193).
    This disapproval has continued. In a guide for radio published in Britain
in 1981 by the editor of the Oxford English Dictionary, for example, broad-
casters were advised in a section labelled "Inflation or modishness of diction"
to avoid the use ofthe word tragedy in sport by rephrasing (Burchfield 1981).
Similarly, the author of a book on the social history of English vocabulary
254    Jean Aitchison and Diana M. Lewis
2. Disaster nouns
       calamity, noun, an event causing great and often sudden damage or distress;
       a disaster.
       catastrophe, noun, an event causing great and often sudden damage or
       suffering; a disaster.
       disaster, noun, a sudden event such as an accident or a natural catastrophe
       that causes great damage or loss of life ...
       tragedy, noun, an event causing great suffering, destruction, and distress,
       such as a serious accident, crime or natural catastrophe.
    Of these words, only the entry for disaster includes the information that
it may be used in less dire circumstances: "an event or fact that has unfor-
tunate consequences ... informal, a person, act or thing that is a failure".
    Yet investigation of the British National Corpus (BNC), a database which
includes both spoken and written language, suggests that informal, humorous
or figurative uses of all these words are extensive, alongside their more
serious, older meanings. (The BNC contains approximately 100 million
"w-units", roughly, lexical items, of which approximately 90 million are
written, and around 10 million spoken.) Examples of (a) serious, and (b)
trivial usages are given below:
( I)       a.   This will make it much less likely that the entire human race will
                be wiped out by a calamity such as a nuclear war.
           b.   But what if ... you're out on the floor at some gay club, and you
                spot or (calamity) are spotted by some workmates ... what then?
(2)        a.   A large comet hitting the earth would mean catastrophe.
           b.   To fall in love with Alexander would be a catastrophe.
256        Jean Aitchison and Diana M. Lewis
(i)     The scale of the disaster: what is the proportion of serious "real"
        disasters, to trivial inconveniences?
(ii)    The interpretation problem: how do speakers/hearers distinguish
        between the various meanings of the word?
(iii)   The development of polysemy: to what extent can this process be
        successfully characterized?
                                                 Polysemy and bleaching     257
3. Scale of disaster
Disaster is widely used in both spoken and written language: the BNC
contains over 3,000 examples. Of these, all the spoken ones were analysed
for this paper, as well as 600 randomly selected written ones. Removal of
unclear occurrences left 185 spoken and 589 written examples.
    The proportion of "real" disasters to trivial inconveniences was assessed
by dividing the BNC examples of disaster into serious (S), medium (M) and
trivial (T) events. As a rough rule of thumb, an event which caused multiple
deaths was classed as (S); one which involved one or two deaths, inflicted
severe suffering on a small number of people, or caused environmental
damage was listed as (M); one which caused social inconvenience was classed
as (T). For example:
(5)    At least 62 people were killed and 3,000 missing last night after an
       underwater earthquake sent 50ft tidal waves crashing into the coast
       of Nicaragua. More than 227 people were injured in the disaster. (S)
(6)    But if all 22 million gallons escape, the disaster will be twice as bad
       as the 1988 Exxon Valdez spill off Alaska. (M)
(7)    All other efforts to lose the fat from the offending areas proved to be
       a disaster. If I lost weight below 54 kg my bust disappeared, yet nothing
       went from my legs or posterior! (T)
   Table 2 shows how instances of disaster fell into the (S), (M) and (T)
categories.
in the singular, but only 0.1: I in the plural. This imbalance was less noticeable
in the written corpus, where the ratio of trivial to serious disasters was 1: 1 in
the singular, and 0.4: 1 in the plural.
    Categorization into (S), (M) and (T) was a useful step, but proved to be
oversimple: the (S), (M), (T) categorizations sometimes overlapped, in that
the same event could seriously affect a huge number of people in one area,
by definition (S), yet only a single person in some other area, by definition
(M). We therefore subdivided the disasters into types which partly cut across
the (S), (M), (T) categories.
    The largest single group involved wars or serious accidents, which killed
a large number of people. These were categorized as (S), as in (3a) and (5).
Environmental disasters were mostly classed as (M), as in (6).
    Social, political and economic disasters were split between the (M) and
(T) classifications:
(8)       If the tests prove positive, the flock is slaughtered: "For many farmers
          it spells financial disaster ... " the association s chairman said. (M)
(9)       The Tories were heading for disaster if they continued to delude
          themselves that only a little fine tuning of presentation was required
          to secure a fourth consecutive general election victory. (T)
Personal disasters were (M) when one or few people died or were injured:
( 11)     Then disaster struck. The first two men to exit from the following party
          were killed when their parachutes failed to open.
   But the majority of personal disasters were quite trivial (T), covering
household inconveniences, sports losses, or minor social difficulties, as in
(3b) and (7), and also (12), (13) and (14):
                                                   Polysemy and bleaching      259
(12)    The only reason I'm running up these debts is that now I've got so
        little capital left, that I've got to keep the capital for sheer disasters
        Like the boiler.
(13)    The last wicket fell ... So it was another bLackwash, another disaster
        for EngLand.
(14)    There have been many disasters along the road, Yorkshire puddings
        you couLd sole your shoes with ... and last Christmas a chocolate log
        that disintegrated, the proud little Santa on top sinking without trace
        in a sea of chocolate gunge.
4. Interpretation problem
How, then, do speakers use these words appropriately, and how do hearers
interpret them in the way intended? Broadly speaking, the surrounding context
clarified the level of disaster. Partly, this was explicitly specified, and partly,
covert conventions operated, which are implicitly understood by English
speakers.
    The naming of a geographical location was the major clue that a serious
incident involving multiple deaths had taken place, sometimes (though not
necessarily) accompanied by further explanation of the type of disaster, as
in: the BradfordfootbaLl disaster, the 1986 Chernobyl nuclear disaster, the
Clapham disaster, the Clapham rail disaster, the HilLsborough disaster, the
Hillsborough football crowd disaster, the Kegworth air disaster, the Lockerbie
disaster, the Siberian pipeline disaster, the Stalingrad disaster, the Zeebrugge
ferry disaster.
    For lesser events, the type of problem was often appended, as: ecological
260     Jean Aitchison and Diana M. Lewis
(15)    The gravys a disaster. Its got too much fat in it.
(16)    Newspapers love a good disaster, and a wedding where the bride-
        grooms trousers fell down at the altar would stand afar better chance
        of being reported than one that went without a hitch.
(17)    My own efforts at making custard ... may have been forgotten by those
        who subsequently had to eat the awful stuff, but I am not likely to
        forget the watery culinary disaster.
     Sports losses were another common trivial disaster, as in (13), (18) and
(19):
(18)    His spell in Italian football was a disaster and he eagerly accepted
        the chance to join Manchester United for a record fee.
(19)    X must get through to him in the first six rounds or face disaster.
   The topics of food or sport therefore flagged that the disaster was a trivial
event. The origin of food disasters is unclear, though use of disaster in sport
has been attributed to Rudyard Kipling's If One sports report explained this:
(20)    Behind many afootball club dressing room door, you '11 find a copy of
        Rudyard Kipling's If pinned up, the poem that talks of "meeting
        triumph and disaster and treating those two imposters just the same".
(24)   Even if disaster strikes, as it seemed to for one student of mine who
       dropped her nearly completed head on the concrete floor and an ear
       snapped off [the wooden rocking horse she was carving], don't let it
       worry you unduly. We simply glued the broken ear back in place.
(25)   Disaster struck again for the home side after 57 minutes.
A prerequisite for the development of polysemy may be that the word must
be widely used. At first sight, this is a puzzle for disaster, in that serious
disasters are relatively rare. Yet the word disaster was frequently used,
primarily because many disasters discussed were potential, rather than actual.
Only a proportion (about halt) had happened, and the remainder were
impending, avoided, or hypothetical, as in:
(26)   The research station's scientists comment privately that only a big
       fire disaster will make the government look harder at fire research.
262     Jean Aitchison and Diana M. Lewis
     disaster area, noun, an area in which a major disaster has recently occurred:
     the vicinity of the explosion was declared a disaster area. n [in sing] informal,
     a place, situation, person or activity regarded as chaotic, ineffectual, or failing
     in some fundamental respect: the room was a disaster area. stuff piled every-
     where I she was a disaster area in fake leopard skin and stacked heels.
6. Conclusion
This paper has pointed out that words signifying catastrophic events are
subject to bleaching, and consequently, the development of polysemy. It has
provided a case study of the word disaster, noting that polysemy develops
from layering, simultaneous different usages of the same word. Speech and
writing, different forms of the same lexeme, different topic areas, and habitual
collocations were explored and compared. These enabled the sources of the
bleaching of disaster to be narrowed down and partially pinpointed. Only a
fine-grained approach of this type can lead to a full understanding of the
process of polysemy.
References
Aitchison, Jean
   2003         Words in the Mind: An Introduction to the Mental Lexicon. 3rd
                Edition. Oxford: Blackwell.
Aitchison. Jean and Diana M. Lewis
    1996        The mental word web: forging the links. In: Jan Svartvik (ed.), Wonts,
                39-47. Stockholm: Swedish Academy.
Burchfield, Robert
    1981        The Spoken Word: A BBC Guide. London: British Broadcasting
                Corporation.
Bybee, Joan L., Revere D. Perkins and WiIliam Pagliuca
    1994        The Evolution of Grammar: Tense. Aspect and Modality in the
                Languages of the World. Chicago: University of Chicago Press.
264     Jean Aitchison and Diana M. Lewis
Fischer, Olga
    1997        On the status of grammaticalisation and the diachronic dimension in
                explanation. Transactions of the Philological Society 95(2): 149-187.
Gabelentz, Georg von der
    1891        Die Sprachwissenschaft. Ihre Aufgaben, Methoden und bisherigen
                Ergebnisse. Leipzig: Weigel.
Geeraerts, Dirk
    1993        Vagueness's puzzles, polysemy's vagaries. Cognitive Linguistics 4(3):
                223-272.
Geeraerts, Dirk, Stefan Grondalaers and Peter Bakema
    1994        The Structure of Lexical Variation: Meaning, Naming and Context.
                Berlin: Mouton de Gruyter.
Gibbs, Raymond W.
    1994        The Poetics of Mind: Figurative Thought, Language, and Under-
                standing. Cambridge: Cambridge University Press.
Heine, Bernd, Ulrike Claudi and Friederike Hiinnemeyer
    1991        Grammaticalization: A Conceptual Framework. Chicago: University
                of Chicago Press.
Hopper, Paul J. and Elizabeth Closs Traugott
    1993        Grammaticalization. Cambridge: Cambridge University Press.
Hughes, Geoffrey
    1988        Words in Time: A Social History of the English Vocabulary. Oxford:
                Blackwell.
Lightfoot, David
    1979        Principles of Diachronic Syntax. Cambridge: Cambridge University
                Press.
Meillet, Antoine
    1912 [1948] Linguistique historique et linguistique generale. Paris: Champion.
Nerlich, Brigitte and David D. Clarke
    1997        Polysemy: patterns in meaning and patterns in history. Historiographia
                Linguistica 24(3): 359-385.
Pustejovsky, James
    1995        The Generative Lexicon. Cambridge, MA: MIT Press.
Pustejovsky, James and Bran Boguraev (eds.)
    1996        Lexical Semantics: The Problem of Polysemy. Oxford: Clarendon.
Scott, Biljana
    1996        Aspectogenesis and the categorisation of directionals in Chinese.
                D.Phil. dissertation, University of Oxford.
Sweetser, Eve E.
    1988        Grammaticalization and semantic bleaching. Proceedings of the 14th
                Annual Meeting, Berkeley Linguistics Society: 389-404.
    1990        From Etymology to Pragmatics: Metaphorical and Cultural Aspects
                of Semantic Structure. Cambridge: Cambridge University Press.
                                                 Polysemy and bleaching      265
Taylor, John R.
   1995        Linguistic Categorization: Prototypes in Linguistics. 2nd Edition.
               Oxford: Clarendon.
Traugou, Elizabeth Closs and Bernd Heine (eds.)
   1991       Approaches to Grammaticalization. Volumes 1 and 2. Amsterdam:
              John Benjamins,
Trench, Richard Chenevix
   1856        English, Past and Present. 3rd Edition. London: John Parker and
               Son.
Tuggy, David
   1993        Ambiguity, polysemy, and vagueness. Cognitive Linguistics 4(3):
               273-290.
Polysemy in the lexicon and in discourse
Andreas Blank
1. Introduction
   Le sens nouveau, quel qu'il soit, ne met pas fin a I'ancien. IIs existent tous
   les deux I'un a cote de I'autre. Le meme terme peut s'employer tour a tour au
   sens propre ou au sens metaphorique, au sens restreint ou au sens etendu, au
   sens abstrait ou au sens concret ... A mesure qu'une signification nouvelle est
   don ne au mot, il a I'air de se multiplier et de produire des exemplaires
   nouveaux, semblables de forme, mais differents de valeur. Nous appelons ce
   phenomene de multiplication la polysemie. (Breal 1897: 154-155)
4. Semantic restriction
VulgL homo 'human being' > 'man'                                 F homme, It uomo, Sp hombre etc. 'human being', 'man'
F ble 'corn' > 'wheat'                                           F ble 'corn', 'wheat'
7. Popular etymology
F forain 'non-resident' > 'belonging to the fair' «foire)    F forain 'non-resident', 'belonging to the fair'
Lat somnium 'dream' > Sp 'sleep' « somnus)                   Sp sueflO 'dream', 'sleep'
L levare 'to lift up', 'to erect', Sp alzar, It alzare Sp alzar, It alzare 'to lift up', 'to erect'
                                                                                                                         -
                                                                                                                         IV
   'to lift up' > 'to erect'                                                                                             ......:J
272       Andreas Blank
METON
Figure J.     E man n.
                                    Polysemy in the lexicon and in discourse          273
In a paper entitled "Polysemy and cognition", Paul Deane (1988) adopts the
so-called "standard version" (Kleiber 1990) of prototype-theory - which was
abandoned by mainstream Cognitive Linguistics at about the same time. 7
According to Deane, the prototypical structure of categories allows speakers
to utter sentences like (1a-d):
   There is, indeed, a kind of PART-WHOLE relation between (2a) and (2b), or
rather between a general conception of a car and (2a) and (2b). On the other
hand, there are no proper senses of E car corresponding to the exact readings
in (2). They are, according to Cruse, "not even clearly conceptualizable"
(1995: 33). In these cases the context allows us to focus on the relevant part
or sub-domain of a complex concept.
                                Polysemy in the lexicon and in discourse   275
U sing both the referential and the semantic criteria we can now distinguish
polysemy from homonymy. Consider (4):
(4)    a.   The arm on that statue looks better than yours. (Deane 1988: 347)
       b.   A special arm of the government is to investigate the matter.
            (WEUD, s.v. "arm2")
       c.   His religious convictions kept himfrom bearing arms, but he served
            as an ambulance driver with the Red Cross. (WEUD, s.v. "arm2")
       d.   Three lions passant gardant ... the Royal Arms of England. (Pomy,
            Heraldry [1787J; see OED, s.v. "arm2, IV014")
(6)    a.   OF voler 'to fly' (itr.) [>METON> 'to hunt with falcons' (tr.)
            >METON> 'to catch the prey' >META>J 'to steal' (hence ModF
            volerl 'to fly' - voler2 'to steal')
                                    Polysemy in the lexicon and in discourse   277
    In (6) we should, diachronically, have polysemy, but the senses that linked
the first and the last acceptation disappeared from usage (put into square
brackets), so that there is no semantic relation between the remaining senses:
this is "secondary homonymy". In (7), original homonymy is re-interpreted
as polysemy by speakers who feel a semantic relation between the two senses
in question: this is "secondary polysemy".
3.3. Referential class and semantic relation: a double test for polysemy
The last four examples have demonstrated that we need criteria that work
exclusively in synchrony. A typology based on the features of referential
class and semantic relation fulfils this need and helps us to separate homonymy
from polysemy and the latter from mere contextual variation. See Table 2.
    Contextual variation describes the semantic range of one lexicalized sense;
thus, if there is a semantic difference, it is below the level of the semantic
relations introduced in Section 2. Polysemy is a property of the semantic
status of a word and describes a network of related senses of this word, as
illustrated above with E man and F parler. Homonymy means that two words
Table 2.
are only phonetically identical; we are beyond the level of semantic relations. 11
The decision as to whether there is a semantic link or not is by no means
fortuitous - as it sometimes may appear - but can, in most cases, be found
by using the typology of semantic relations given above. This typology may
be based on diachronic processes, but it is essentially a description of syn-
chronic relations.
    Considering finally the terms "vagueness" and "ambiguity", I would like
to stress that it is not the senses of a word which are vague, but that sometimes
contexts are vague when they allow different readings, and that often referents
are vague insofar as they are difficult to attribute to an extensional category.
But once we decide to attribute the "poor little thing on one's stomach" to
the class of human anns, the word becomes fully linked to the corresponding
concept and linguistic content. PrototypicaIity is thus a property of ex tensional
classes and not of concepts or senses.
Let us first investigate the best-known monosemous approach to (8) and (9).
In his "two-level-semantics", Bierwisch treats examples such as E school as
conceptual shifts away from an abstract semantic representation, as a kind of
world-knowledge guided semantic extension of a core sense in specific
contexts (Bierwisch 1983: 85-88).12 Bierwisch's monosemous interpretation
leads to very complex or very abstract lexical entries (see his formalized
notation). This is a problem at the level of metalanguage, but does not
contradict monosemy as such. The "hard" problem with monosemy is that if
we really had a core sense that is extended by conceptual information, the
extension should work in all similar cases in one language and it should
even work universally. This is not the case, as e.g. (8e) would not be possible
in German, and (8d) would not allow insertion of E police station meaning
'duty hours':
(12)   a.   E John sleeps in this hotel. - This hotel sleeps 100 guests.
       b.   F Jean dort dans cet hOtel. - ?Cet hOtel dort 100 clients.
       c.   G Hans schliift in diesem Hotel. - ?Dieses Hotel schliift 100
            Giiste.
(13)   a.   F Marie sort de la maison. - Marie sort un pistolet de son sac.
       b.   E Mary comes out of the house. - ?Mary comes a pistol out of
            her handbag.
       c.   G Maria kommt aus dem Haus. - ?Maria kommt eine Pistole
            aus der Handtasche.
    While E lamb admits the polysemy of 'animal' and 'meat prepared for
cooking/eating', this is not allowed with cow/beef, and in (15b) a reference
to the whole animal is excluded, since usually only hindlegs of frogs are
eaten. If all these words had, on the intralinguistic level, a rather abstract
core sense which allowed semantic extension based on encyclopaedic, extra-
linguistic knowledge, the different idiosyncrasies would be very hard, if not
impossible, to explain. Monosemy thus raises serious methodological prob-
lems and should be rejected as a semantic model.
    We can conclude that the types of polysemy described in (8)-(14) result
from profiling against a cognitive background, a process which can be used
in similarly construed domains. Nevertheless these culture-specific rules are
only actualized on the level of one language and thus have to be learned one
by one (Schwarze and Schepping 1995). We have to learn their language-
specific restrictions and their individual "areas of non-application".
                                 Polysemy in the lexicon and in discourse    281
We can now understand why polysemy in Breal's sense does not provide a
satisfying explanation for this kind of polysemy, as it is indeed difficult to
explain the apparent regularities by pointing to a number of parallel semantic
changes. Here again, an interesting distinction can be found in Deane (1988):
Deane calls the nonpredictable, idiosyncratic polysemy (the "Breal-type")
"lexical polysemy", while polysemy as based on typical semantic relations,
as discussed throughout the present section, is called "regular polysemy"
(1988: 349-350). Deane's distinction in itself is ingenious; his terminology,
however, is problematic as all polysemies treated here are "lexical" by nature.
    Despite this imprecision, Deane was pointing in the right direction, as
what he calls "lexical polysemy" is, as defined above in Section 2, a syn-
chronic consequence of "lexicalized semantic change", while "regular poly-
semy" is obviously different. The first type can be illustrated with examples
such as E mouse 'small rodent', 'computer mouse' or G Schirm 'shelter',
'umbrella' and will henceforth be called "idiosyncratic polysemy". Examples
of the second type include E book 'printed work', 'contents of this work',
and this type of polysemy will be called "rule-based polysemy", as the
polysemy of book arises from the rule that metonymic transfers from the
CONTAINER to the CONTENT are widespread in discourse. Using the term "con-
ceptual metaphor" introduced two decades ago by Lakoff and 10hnson (1980),
we can call the CONTAINER-CONTENT relation a "conceptual metonymy" or a
"contiguity schema" (see Blank 1999b). In fact, all examples cited in this
section are instances of different conceptual metonymies, e.g. BUILDING-
FUNCTION/GOAL (8b), BUILDING-AFFECTED PERSONS (8c), CAUSE-RESULT (10), CON-
TAINER-CONTENT (9, 12a), OBJECT-ACTOR (l3a), ANIMAL-MEAT OF THIS ANIMAL (14).
   We can now evaluate our typology of poly se my in the greater context of
cognitive semantics. It is clear that not only conceptual metonymies but
also conceptual metaphors lead to rule-based polysemy, as shown in the
"extension" of the BUILDING-THEORY metaphor in (l6b).
   The central tenet of Metaphors We Live by, viz. that "our ordinary con-
ceptual system, in terms of which we both think and act, is fundamentally
282     Andreas Blank
metaphorical in nature" (Lakoff and Johnson 1980: 3), can easily be extended
to metonymy. Conceptual metaphors are one way of structuring the world
via language; conceptual metonymies represent another, maybe even more
important way (see Seto, this volume). Less often, it seems, do we use
taxonomic relations deriving from the relation between a prototypical instance
of a category and the category itself. IS It is obvious that we use all major
strategies, metaphor, metonymy and hyponymy (what Seto, this volume, calls
synecdoche) when we think and talk about the world.
    This means that on the level of discourse we use conventional metaphors,
metonymies (and taxonomies) and introduce innovations based on con-
ventional conceptual metaphors and metonymies and on prototype-<ategory
relations. It is also rather uncontroversial that metonymies rely on funda-
mental contiguities - often anchored in frames and scenarios - and that
metaphors result from perceptual or functional similarities between concepts
or domains that are not directly related (see Croft 1993; Koch 1995; Blank
 L997).
    Two questions remain, however. On which Level of knowledge are these
conceptual metaphors and metonymies stored in our mind? And, why are
there language-specific restrictions for the use of some words, while others
do not have these restrictions, despite the fact that they are embedded in the
same conceptual background (see the examples cited above)? It is obviously
not enough to say that the latter are based on encyclopaedic knowledge and
that the former is somehow idiosyncratic. At this point, it is necessary to add
two further levels of linguistically relevant rules, i.e. the level of rules
governing so-called "discourse types" or "discourse traditions" and the level
of language rules that are specific to a particular language.
    Discourse traditions are sets of rules for the correct production of a specific
discourse, e.g. the set of metaphors and cliches of Renaissance poetry or the
typical phrases and strategies used when buying a used car or presenting a
paper at a conference (for details see Koch 1997). Although realized, of
course, in a concrete language, a discourse rule is not language-specific: it
characterizes a type of discourse and is common to speakers of all languages
who use a discourse type in a specific way (e.g. the typical Petrarchian
metaphors were known all over Europe during the 16th century, but only
within a small group of connoisseurs). Language rules, on the other hand,
are idiosyncratic and only govern the language use of a certain speech
community or a geographically or socially defined group.
   These two levels interact with encyclopaedic or conceptual knowledge,
                                 Polysemy in the lexicon and in discourse   283
on the one hand, and with the actual discourse, on the other, as represented
in Figure 3.
    According to this model, innovations are either based on prominent
encyclopaedic knowledge and have their psychological foundation in salient
associations (e.g. salient similarity, contiguity or contrast), or speakers model
them analogically on already-existing conceptual metaphors, metonymies or
taxonomies which, of course, themselves have cognitive foundations.
Innovations can be lexicalized directly as a specific language rule - an
idiosyncratic metaphor, metonymy etc. (represented by the broken arrow) -
or, and this seems to be more common, as a rule of the specific discourse
type in which the innovation was first used. This innovation may later become
lexicalized as a proper language rule.
    In the case of metonymy, for instance, several conceptual contiguities are
permanently highlighted and enable us to construe analogous metonymies
for words that access the same, or a similarly construed, frame or the same
discourse type. As we have probably not learned all these potential meton-
ymies before, we must have learned the rules that underlie them, and their
specific restrictions. Waltereit (1998: 14-19, 26-28) has demonstrated that
these rules are mainly "discourse rules": every discourse type activates a set
of conceptual metonymies that can be filled with concrete lexical metonymies.
This explains why, e.g., (9) and (l0) are very widespread types of contiguity
corresponding to a rule in a large number of discourse types. The range of a
conceptual metonymy is thus limited by the range of its discourse type(s), as
shown in (17).
(17)   a.   The ham sandwich is waiting for his check [ORDERED DISH-CUSTOMER
            in waiters ' discourse].
       b.   G Ich werde verliingert [EMPLOYEE-CONTRACT in discourses con-
            cerning work].
     We can now understand why (lOb) was odd: the contiguity schema PLACE
OF WORK-TIME OF WORK       is not a usual discourse rule, although it is fully
understandable. The conceptual metonymy can be used creatively to produce
analogous metonymies, but only when based on an activated discourse type.
The same holds true for typical conceptual metaphors: once a similarity
scheme is anchored in the mind as a rule of discourse types it can be produc-
ti vely filled with analogous metaphors, as exemplified in (18).16
284    Andreas Blank
                                        lexicalized as
   Language-specific rules
                             I........ . _ - - - - - - - - - -     Discourse rules
                                                                 "-,..-------'
                             , ,lexicalized
                    in            , as
,~----
                                              i
                                              reolired "
Encyclopaedic knowledge
(19)   For since mad March great promise made of me; If now the May of
       my years much decline, What can be hoped my harvest time will be?
       (Sidney, Astrophel and Stella [-1580]. Cited after: Shakespeare, Sonnets,
       edited by C. Knox. London: Methuen 1918: 6 fn.)
s. Conclusion
In the previous section of this article I have tried to show that not all types
of polysemy can be derived from types of semantic change. I had to modify
Bn!al's view in this respect. Polysemy as the direct lexicalized consequence
of semantic innovation is only one type in a set of possibilities. Indeed,
speakers seem to prefer to apply conceptual patterns anchored in discourse
traditions and to create semantic innovations based on these patterns. Inno-
vations can become conventional on the level of a discourse type and can
eventually become lexicalized as a language rule with or without specific
idiosyncratic restrictions of both use and transferability.
    While even lay people can decide whether there is a semantic relation or
not, polysemy has puzzled specialists for a long time. In this paper I have
tried to develop a referential and semantic test that helps us to distinguish
polysemy from contextual variation and homonymy. I have also discovered
seven types of semantic relations between lexicalized senses of a word. This
allows us to define lexical polysemy positively as a theoretical concept in
semantics, and it can now be used with much more precision as a tool of
semantic and lexicographic description. It should be noted, though, that the
seven types of semantic relations and the distinction between polysemy,
288     Andreas Blank
vagueness and ambiguity can obviously not be used to explain all concrete
cases and to deliver explanations on which all those working in the field of
semantics could agree. This is, in fact, due to the "interpretative nature of
linguistic semantics" (Geeraerts 1993: 263).
Notes
1. See e.g. Cruse (1986: 80); Saeed (1997: 64); Taylor (1995: 99).
2. This type of relation can often be found across different dialects of one language,
    e.g. EurSp tigre 'tiger' vs. AmerSp tigre 'jaguar', or EurSp le6n 'lion' vs. AmSp
     'puma'. These are, however, marginal cases of polysemy because they have no
    reality for speakers in ordinary life situations.
3. For a controversial discussion of this problem see Blank (1997: 191; 1999a:
    Section 3; to appear); Gevaudan (1997); Koch (2001).
4. See Blank (l999b); Koch (1999). For a different view of metonymy, see Seto
    (this volume).
5. Synecdochical polysemy according to the narrow definition of synecdoche in
    Nerlich and Clarke (1999).
6. Example taken from WEUD and PR. Abbreviations: meta = metaphor; meton =
    metonymy; tax = taxonomic relation; ag = agent; coag = counter-agent.
7. Polysemy is a core concept in newer studies in Cognitive Linguistics. According
    to Lakoff (1987), the different senses of a word correspond to the members of
    one single extensional cognitive category which, instead of having one proto-
    typical member, shows a certain number of "prototype effects". This rather naiVe
    equation of "one word-one concept" not only turns out to be a typical mono-
    semous view of semantics, but leads to the puzzling interpretation of cases of
    clear homonymy, such as F louerl 'to rent' - louer2 'to praise', as a kind of
    monosemously conceived polysemy (e.g. in Geeraerts 1993: 234)! Cognitive
    semantics thus simply misses the main task of lexical semantics, i.e. to describe
    the senses of a word and the relations between them. For more details see Blank
    (2001b: Section 11).
8. This is probably the difference between (3b) and the robotic arm in (Id), which
    in fact is the arm of the robot, as it functions (and maybe looks) like a human
    arm, while the arm of the record player is not an arm but an artefact that can be
    perceived as an arm.
9. It is important not to equate word, concept and extensional category as is done
    in some directions of Cognitive Linguistics (especially Lakoff 1987; see the
    critique in Brown 1990: 23; Kleiber 1990: 147; Koch 1995: 37).
10. For further examples see Blank (1997: 430-433); Fritz (1998: 58); Ullmann
    (1962: 164-179); Widlak (1992).
11. The same set of criteria can be used to define other lexical relations such as
                                    PoLysemy in the Lexicon and in discourse        289
    paronymy (word fonn: similar; referential class: different; semantic relation: no),
    synonymy (word form: different; referential class: identical; semantic relation:
    no, but senses differ only on the connotational level), hyponymy (word fonn:
    different; referential class: indifferent; semantic relation: yes).
12. For a more detailed discussion of (newer) approaches to monosemy see Koch
    (1998: 126-130). Other possibilities used in structural approaches are maximizing
    homonymy, which would override the semantic relations between the sense, or
    the shift of polysemy from the level of the "system" to the level of the "norm",
    as defined by Coseriu (1967). This direction is chosen by Dietrich (1997). In this
    case, however, the semantic system of the language degrades from a network of
    lexicalized word meanings to a mere "semantic potential". This systematic
    semantic potential would not really contain language-specific semantic knowledge
    but encyclopaedic knowledge, conceptual frames, rules of presuppositions etc.
    which pennit deployment of this semantic potential on the level of lexicalized
    meanings. Such a "minimalist program of semantics" is suggested by Schlieben-
    Lange (1997: 244-247) (see the critique in Koch 1998). For a general critique of
    the monosemous view of word meaning in structural semantics as developed by
    Eugenio Coseriu and his disciples see Taylor (1999).
13. For further examples see Schwarze and Schepping (1995).
14. Examples of this kind not only challenge monosemous approaches to word
    meaning but also the decidedly polysemous approach advocated by Pustejovsky
    (1995): he has developed a complex semantic theory centred around four types
    of semantic features called qualia, which are modelled on the four types of causes
    as defined by Aristotle (see Organon IV, 2: Ch. 11). These qualia explain regular
    metonymic and taxonomic polysemy very well, but they do not explain exceptions
    as in example (13), and they seem to provide no explanation of metaphoric
    polysemy, as this type would require qualia or argument-structure mapping. Indeed,
    as Pustejovsky and Boguraev (1996: 4) admit: "the polysemy [in the case of cow
    or frog; AB] is the result of lexical rules rather than of alternations within the
    qualia of a single lexical item, such as with door [oo.]." An interpretation of these
    cases as "semi-productive polysemy" is given by Copestake and Briscoe (1996).
15. See Nerlich and Clarke (1999), who describe typical discourse types where
    taxonomic strategies are exploited.
16. There is, however, a greater number of very open conceptual metaphors (e.g.
    "orientational metaphors"); see Lakoff and 10hnson (1980: 14).
17. According to Schwarze and Schepping (1995) this is even a necessary development.
18. Imagine someone talking about a restaurant where he/she has been for dinner
    saying: "And then this ham sandwich sitting next to us went off without paying."
19. This was the finding of the empirical study by Soares da Silva (1992): his
    Portuguese subjects had to classify the semantic relation of marked words in
    pairs of sentences from "0" (no relation) to "4" (identical). The highest scores in
    his test were actually given to Pg livro 'book' (CONTAINER-CONTENT) (3.92) and
    triste 'sad' (RESULT-CAUSE) (3.62), which means that many test subjects were not
290     Andreas Blank
    fully aware of the polysemy, i.e. of the semantic difference. Instances of type
    (iii) were usually better classified as being polysemous (score between 1.5 and
    3.5). See also Blank (1997: 418-419).
References
Aristotle
   1995        Categories; Organon IV. Both in: Philosophische Schriften. Volume I.
               Darmstadt: Wissenschaftliche Buchgesellschaft.
Bierwisch, Manfred
    1983       Semantische und konzeptuelle Reprasentation lexikalischer Einheiten.
               In: Rudolf Ruzicka and Wolfgang Motsch (eds.), Untersuchungen
               zur Semantik, 61-99. Berlin: Akademie.
Blank, Andreas
    1997       Prinzipien des lexikalischen Bedeutungswandels am Beispiel der
               romanischen Sprachen. Tiibingen: Niemeyer.
    1999a      Les principes d'association et leur importance pour la structure du
               lexique. Studi italiani di linguistica teorica ed applicata XXVID.
    1999b      Co-presence and succession: a cognitive typology of metonymy. In:
               Giinter Radden and Klaus-Uwe Panther (eds.), Metonymy in Language
               and Thought, 169-192. Amsterdam: Benjamins.
   1999c       Why do new meanings occur? A typology of the motivations for
               lexical semantic change. In: Andreas Blank and Peter Koch (eds.),
               Historical Semantics and Cognition, 61-89. Berlin, New York: Mouton
               de Gruyter.
   2oo1a       Pathways of lexicalization. In: Martin Haspelmath, Ekkehard Ktinig,
               Wulf Oesterreicher and Wolfgang Raible (eds.), Language Typology
               and Universals: An International Handbook, 11, 1596-1608. Berlin,
               New York: De Gruyter.
   2oo1b       Semantik 11. Neuere Entwicklungen in der lexikalischen Semantik.
               In: Giinter Holtus, Michael Metzeltin, and Christian Schmitt (eds.),
               Lexikon der Romanistischen Linguistik. I, 1,918-939.
   (to appear) Fondamenti e tipologia del cambio semantico. In: Giampaolo Salvi
               (ed.), Semantica e lessicologia storica. Rome: Bulzoni.
Breal, Michel
   1897        Essai de semantique: Science des significations. Paris: Hachette.
Brown, Cecil H.
   1990        A survey of category types in natural language. In: Savas Tsohazidis
               (ed.), Meanings and Prototypes: Studies in Linguistic Categorization,
               17-47. London: Routledge.
                                   Polysemy in the lexicon and in discourse       291
1. Introduction
    Iddo and Omri (aged 7 years and 8 months, native speakers of Hebrew)
are playing together. Iddo fetches himself a glass of juice out of the
refrigerator.
Forster 1989), it can hardly inhibit activation of salient meanings (cf. Tabossi
 1988; Titone 1998; but see Martin et al. 1999; and Vu, Kellas and Paul 1998
for a different view). Salient meanings should always be accessed and always
initially, regardless of contextual bias.
    The question may need rephrasing, then. If contextual information is less
able to affect preselection of the appropriate meaning when there are other
equally or more salient competitors, why doesn't it suppress irrelevant mean-
ings that surface only because of their availability? This may not be the right
question either, because contextually inappropriate information does get
suppressed (e.g. Gemsbacher 1990; Onifer and Swinney 1981; Swinney
 1979). However, Gemsbacher and Robertson (1995) also showed that skilled
and less skilled readers differ in their suppression abilities. While skilled
readers did not outperform less skilled readers at the initial, access stage,
they did at the later, suppression stage: Less skilled comprehenders were
less capable of suppressing contextually irrelevant information (which must,
therefore, interfere with their comprehension). But this will not explain the
use Iddo made of the irrelevant meaning (in [1]). His laughter suggests that
he computed both meanings (noticing their incongruity), deliberately choosing
to use the contextually inappropriate meaning for a special purpose: to crack
a joke.
    Here, then, is an instance of an idiomatic meaning of an utterance, which
though inconsistent with contextual information, was neither inhibited nor
suppressed. Such behaviour is not explainable by an interactive approach to
discourse comprehension, which allows contextual information to affect
processing very early on so that only the contextually appropriate meaning
of words and sentences becomes available (e.g. the Relevance Theoretic
account proposed by Sperber and Wilson 1986/1995; Carston 1999; but see
Giora 1998a for an alternative interpretation of this account). Moreover,
activating inappropriate meanings after the appropriate meaning has been
captured is not motivated by such models. Such behaviour is not explainable
by the standard pragmatic model (Grice 1975; Searle 1979) either. According
to this model, language comprehension always begins with the literal interpre-
tation. If it is compatible with contextual information it is not revisited, and
search is terminated. Upon this view, then, literally intended language is
processed only literally. Since Mira's utterance was intended literally, this
model cannot account for the involvement of the contextually irrelevant
nonliteral meaning in Iddo's comprehension process (see also Gibbs 1980).
    According to the graded salience hypothesis (Giora 1997, 2003), however,
                  Irony in conversation: salience. role. and context effects   299
so-called irrelevant meanings are activated because they are salient. According
to the retention hypothesis (Giora 1995, 2003; Giora and Fein 1999a), such
meanings would, indeed, be attenuated or fade if they have no role in con-
structing the intended meaning. However, if they can be allocated some
function in the construction of the discourse interpretation, they would be
retained. Such is the case of the literal meaning of ironic utterances (e.g.
Giora 1995; Giora, Fein and Schwartz 1998). According to the graded salience
hypothesis, the contextually incompatible literal meaning of (the critical word
of nonconventional) irony is activated because it is salient, i.e., stored in the
mental lexicon. It is not dispensed with by contextual information, because
it has a role in constructing the ironic meaning: it allows the comprehender
to compute the difference between what is expected and what is (Giora 1995).
For instance, when I say What a lovely day for a picnic on a stormy day, the
literal meaning of the critical word lovely would be retrieved directly from
the mental lexicon on account of its salience. Despite its contextual misfit, it
would not be suppressed, because it is instrumental in deriving the ironic
meaning ('lousy' or 'far from being lovely'). Indeed, in Giora, Fein and
Schwartz (1998) we showed that the literal meaning of irony is made available
immediately, and remains active even 2000 msec after offset of the target
(ironic) sentence. However, after such a long delay, the same literal meaning
is no longer active in literally biasing contexts. Having been accessed and
integrated with contextual information, the literal meaning (of lovely) in the
literal context (e.g. when What a lovely day for a picnic is said on a sunny
day) has no further functions. It, therefore, begins to fade.
    Similar findings are reported by Giora and Fein (1999a) regarding contex-
tually inappropriate meanings of salient/conventional metaphors. In Giora
and Fein's studies, the salient, literal meaning of utterances embedded in
metaphorically biasing contexts was retained in spite of contextual misfit.
However, the reverse did not hold: the salient metaphoric meaning of the
same utterance embedded in a literally biasing context showed deactivation.
For instance, concepts (e.g. 'rise') related to the literal meaning of conven-
tional (Hebrew) metaphors (e.g. Only now did they wake up. meaning 'only
now did they start doing something') were not suppressed in the metaphori-
cally biasing context, even though they were contextually incompatible in
that context. In contrast, the salient metaphoric meaning ('do') of the same
conventional metaphor was less active in the literally biasing context, where
it had no role in constructing the literal meaning of the utterance.
    Such asymmetry has been shown to hold for balanced polysemous words
300     Rachel Giora and lnbal Gur
as well (words having related meanings that are similarly salient). Where a
contextually inappropriate meaning of a polysemous word was instrumental
in sustaining the contextually appropriate meaning, it was retained. Where it
was not, it was deactivated. In Williams (1992), (salient) central meanings
(e.g. 'solid') of polysemous words (e.g. firm) were activated immediately
and retained even after a long delay (of 859 msec) despite contextual misfit
(e.g. The school teacher was criticized for not being firm). Such meanings
are indeed conducive to the interpretation of less central interpretations.
(Salient) less central meanings (e.g. 'strict'), however, were not found to be as
context-resistant. Having been activated immediately, they were not retained
after a long delay, when they were not compatible with contextual information
(e.g. The couple wanted a bed that was firm). Though 'strict' may be related
tofirm, it is not conducive to the construction of its central meaning ('solid').
Therefore, it was not retained after it had been activated. However, 'solid' is
conducive to the 'strict' interpretation of firm, and therefore it was not
suppressed after being activated. This finding is consistent with the hypothesis
that contextually incompatible meanings that are instrumental in the interpre-
tation of the intended meaning are not suppressed (Giora 2003; Giora and
Fein 1999a).
    Salient meanings that have not been deactivated because they have some
role in constructing the meanings currently being built, may be easily reused
by the discourse participants on account of their availability (e.g. the idiomatic
meaning in [1] above). To testify to the availability of the salient, though
contextually incompatible, literal meaning of ironic discourse, we examine
here the kind of response irony elicits in naturally occurring conversation. If
irony is responded to literally, this would be consistent with both the graded
salience hypothesis (Giora 1997) and the indirect negation view of irony
(Giora 1995; Giora and Fein 1999b, 1999c) which assume that irony compre-
hension involves activation and retention of its salient, though contextually
inappropriate, literal meaning. Such behaviour, however, is less compatible
with the direct access view (e.g. Gibbs 1986) which assumes that in a rich
context, irony is processed more or less directly, without having to go though
a contextually incompatible interpretive phase. Such behaviour is also less
compatible with the standard pragmatic model (Grice 1975; Searle 1979)
which posits that the contextually inappropriate literal meaning activated
initially should be rejected and replaced by the intended ironic meaning. I
This suppression hypothesis (see also Gernsbacher, Keysar and Robertson
2001) predicts that immediately after the utterance has been comprehended,
                  Irony in conversation: salience. role. and context effects   301
the direct access view, the modular view, and standard pragmatic model?
    To illustrate the way the contextually incompatible literal meaning of irony
may avail itself for further elaboration, consider the following example (taken
from Drew 1987) cited in Clark (1996: 374), in which the ironic/teasing turn
is responded to literally (both in bold). In this example, Gerald has just bought
a brand-new sports car, and is late for a meeting. He could respond to Lee's
ironic reprimand by addressing the ironic meaning (the reprimand), saying,
e.g., "I am sorry" or coming up with a real explanation. Instead, he proceeds
along the lines proposed by Lee, thereby elaborating on the tease or irony (in
bold), i.e., on the literal meaning of the ironic utterance (in bold):
    In our data, we looked for such responses to irony. Responses to the literal
meaning of irony are predicted by the graded salience hypothesis (Giora
1997, 2003) and the retention hypothesis underlying the indirect negation
view (Giora 1995). On the assumption that the literal meaning of
(nonconventional) irony is the only one coded in the mental lexicon, it should
be accessed once the irony is encountered. It should not be suppressed even
when the ironic interpretation is derived, because it facilitates the computation
of the difference between the expected and the derided situation.
2.1. Method
Participants: The participants were five Israeli friends (two women, three
men) who spent a Friday evening together. They were 29-33 years old, native
speakers of Hebrew, living in TeI Aviv.
    Materials: Our data come from one-hour tape-recorded conversations
among the participants. The conversations took place in Tel Aviv, in October
1997. They comprise 9,380 words. For illustration, consider an English
translation of a Hebrew extract of the conversation (example [3] below, ironic
utterances underlined), containing about 300 words, which partly revolves
304   Rachel Giora and lnbal Gur
around Sara and Benjamin (Bibi) Netanyahu (the then Israeli Prime Minister
and his wife), who, in a newspaper interview, complained about the press
harassing them. The literal responses are in bold. 2
(3)    1.   A: You don't understand one thing. You think they initiate these
               things? This is maybe the first article they [wrote].
      2.    B:                                                      [They] stay home
               and after them the paparazzi come ... and they are simply
               [miserable].
      3.    A: [(They) ruin] their lives, what do you want?
      4.    B: I want to cite the last sentence of the article, yes, out of the
               potency's (the word used in Hebrew is heroism) mouth
               (meaning 'out of God's mouth', equivalent to the English
               idiom 'out of the horse's mouth') [as they say].
      5.    A:                                          [Out of the baby's mouth].
      6.    B: Out of the hero's mouth. In fact, it's not in this article, it's
               in the, it's in the article that appeared in Ma'ariv (an Israeli
               daily), but x Sara says that maybe following the tragedy of
               Princess Diana they will begin to understand ... she and Diana
               on the same level!
      7.    C: So, the last photograph, they chose the picture of Bibi and
               Sara and their two kids on the beach in Caesaria and <xx>
               ya'eni (a marker for irony) a spontaneous picture, but it's
               obvious that this picture is carefully arranged: the big boy
               with the father - the small one with the mother, and all this.
      8.    A: (0.1 second later) It's as if I take a picture of you (C) now,
               say, you are sitting next to D (C's wife), because ... really,
               come on!
      9.    C: One thing is certain, then ... th~ paparazzi photQ~raph~rs will
               not catch them at the sp~ed Qf 160 kph,
      10.   Everybody: @ (2 seconds).
      11.   D: So she cannot compare herself to Diana.
      12.   B: <xx> She is really miserable because th~y do her injJ,!stice.
      13.   A: Diana, my ass, this entire story, believe me, I feel like retching.
      14.   D: Yes.
      IS.   A: Bi~ deal! [<xx>]
      16.   C:             [1 dQD't f~~l] lik~ 1ll1!ObiD~ ill all. 1f~~llik~ libariD~
               lbe profits.
                  Irony in conversation: salience, role, and context effects   305
        17. D: [@@@@]
        18. C: [I feel like] sharine the profits. Throw me some bone
               (equivalent to the English idiom 'throw me a scrap').
        19. D: Open a florist shop.
Speaker A (in [3.1]) sympathizes with the Netanyahus, while speaker B (in
[3.2]) is critical of them. Speaker B describes them ironically as "miserable"
- the literal sense - echoing their complaint and indicating that they are far
from being miserable, and that they must be happy - the ironic meaning -
about being so popular in the press. However, B (in [3.3]) elaborates on their
"misery" - the literal sense of the irony - by retorting that their life is
"ruined". Whether this response can be viewed as resonating with the literal
meaning is dubious: it could be a repetition of that person's belief rather
than a response to the previous utterance.
    In (3.4), the reference to Bibi as God ("potency"/"heroism") is ironic.
Both the following responses referring to him as either "baby" (sort of
opposite) or "hero" in (3.6) are echoes of the literal meaning of the irony.
The irony at the end of (3.6) remains uncommented on at this stage. D is
going to refer to it later (in [3.11]), but it is hard to tell whether this is a
response to the literal or ironic meaning of the irony here.
    The ironical meaning 'spontaneous' in (3.7) is responded to by A in (3.8).
A disagrees with C, i.e. with the ironical meaning, and attempts to defend
the genuine spontaneity of the photograph of the Netanyahus (appearing in
the press to affect "spontaneity" in order to support their claim that they are
haunted by the press).
    The irony in (3.9), which suggests the couple will never really run away
from the paparazzo photographers, was responded to by a 2-second laughter.
    The irony in (3.12) is a repetition of the topic of this conversation. It is a
repetition of the literal meaning of the previous ironies, particularly those
generated by the same speaker himself.
    The irony in (3.16) is responded to by laughter in (3.17).
    In (3.18), C echoes the literal meaning of his own irony, and in (3.19) the
literal meaning of his irony is elaborated on by D.
306     Rachel Giora and Inbal Gur
Fifty-six ironic utterances were selected, on which there was 100% agree-
 ment (as to their ironiness, reached at times after a discussion) between two
native speakers of Hebrew who listened to the recording. One judge was a
participant in the conversations. Of these 56 ironic utterances, 42 (75%) were
responded to by reference to their literal meaning. The responses were judged
as literal by the two judges as above. Only those on which there was 100%
agreement that they were indeed responses to the literal meaning of the irony
were counted.
    These results suggest that the literal meaning of irony is accessed and
retained by both speakers and addressees. The occurrence of irony in the
conversations made its literal meaning available for further discussion and
elaboration, as predicted by the graded salience hypothesis (Giora 1997, 2003)
and indirect negation view (Giora 1995). However, they are partly inconsistent
with the standard pragmatic model (Grice 1975; Searle 1979) and modular
view (Onifer and Swinney 1981; Picoult and 10hnson 1992; Seidenberg et
al. 1982; Swinney 1979), attesting that the literal meaning was not suppressed
as irrelevant. Further, they are incompatible with the direct access view,
demonstrating that irony did not avail the ironic meaning exclusively.
    It could be argued, of course, that tapping processes involved in under-
standing and producing naturally occurring conversations is not comparable
to testing comprehension on-line. While the direct access view may be chal-
lenged on the basis of evidence about processes, evidence accumulated from
conversational discourse may be telling only about products. While the
challenge is valid, it is still important to note that these products are better
explained by a graded salience view of comprehension rather than by a direct
access view, which does not allow for any contextually incompatible
interpretation to be activated initially. Recall, further, that these findings do
not stand in isolation, but are consistent with previous findings attesting to
processes attained by on-line measures such as reading times (Giora, Fein
and Schwartz 1998; Pexman, Ferretti and Katz 2000; Schwoebel, Dews,
Winner and Srinivas 2000) and lexical decision tasks (Giora and Fein 1999c;
Giora, Fein and Schwartz 1998), showing that irony is not accessed ironically
first.
    Our findings can also be viewed as an instantiation of a more general
phenomenon of "dialogic syntax" (Du Bois 1998). Dialogic syntax occurs
when a speaker constructs an utterance based on an immediately co-present
                   Irony in conversation: salience, role, and context effects   307
3. Summary
 l00{}-2000 msec after their offset. These findings support the graded salience
hypothesis, but are inconsistent with the view that context affects comprehen-
sion significantly (e.g. Glucksberg, Kreuz and Rho 1986; Sperber and Wilson
 1986/1995, but see Burgess, Tanenhaus and Seidenberg 1989 for a critique).
    Previous research (Giora and Fein 1999c; Giora, Fein, and Schwartz 1998)
has also demonstrated that, contra the standard pragmatic model (Grice 1975;
see also Searle 1979) and the modular view (Swinney 1979), the contex-
tually incompatible meaning of irony is not suppressed by contextually biased
infonnation. Both salient and nonsalient ironies retained their contextually
incompatible literal meaning in spite of the availability of the ironic meaning
(that emerged at a different temporal stage for the two types of irony). These
findings support the indirect negation view (Giora 1995) which maintains
that the contextually incompatible literal meaning of irony should be retained
because it has a role in irony interpretation - it provides a reference point
relative to which the ironicized situation is evaluated. Making the expected
explicit allows for the computation of the difference between what is and
what is looked for.
    Spontaneous face-to-face talk can lend support to the retention hypothesis
(Giora 1995, 2003; Giora and Fein 1999a) if it is found to abound in ironic
utterances that get responded to literally. Indeed, having investigated irony
reception in a spontaneous environment, we found that more often than not,
irony is responded to by resonating with its salient, literal interpretation.
These findings corroborate those of Kotthoff (1998), who shows that in
friendly conversations, listeners very often respond to the literal meaning of
the ironic utterance while at the same time making it clear that they have
also understood the implicated meaning. Responding to the literal meaning
demonstrates that this contextually incompatible meaning has neither been
inhibited nor suppressed by contextual infonnation.
    Empirical research supporting the graded salience hypothesis (e.g. Giora
1998a, 1998b, 1999, 2003; Giora and Fein 1999a, 1999b, 1999c; Giora, Fein
and Schwartz 1998; Pexman, Ferretti and Katz 2000; Schwoebel, Dews,
Winner and Srinivas 2000) and indirect negation view (Giora 1995; Giora,
Fein and Schwartz 1998) has so far focused on the processes involved in
comprehension of written, often contrived discourses tested in the laboratory.
In this study, we provide evidence in favour of the claim that salient meanings
are involved in spontaneous discourse. In particular, we demonstrate that
salient meanings are involved in text comprehension and production even
when they are incompatible with the context or the intended meaning.
                   Irony in conversation: salience. role. and context effects   309
Acknowledgements
Support for this research was provided by grants from The Israel Science Foundation
and Lion Foundation to the first author. Thanks are also extended to Ray Gibbs,
John Du Bois and Brigitte Nerlich for their very helpful comments.
Notes
I. Even though Grice (1975) is not explicit about it, the processing model that
   follows from his assumptions is taken to be a replacement or substitution account
   (see, e.g .• Levinson 1983: 157).
2. Legend (following Du Bois et al. 1993):
                          half a second break
                          a shorter break
        []                overlap
        x                 unclear word
        <xx>              unclear utterance
        @                 laughter
        ()                for words not appearing in the Hebrew text
        underlinin~       ironic utterances
        bold              responses to the literal meaning
References
Clark, Herbert, H.
    1996        Using Language. Cambridge: Cambridge University Press.
Conrad, Carol
    1974        Context effects in sentence comprehension: a study of the subjective
                lexicon. Memory and Cognition 2: 130-138.
Dews, Shelly and Ellen Winner
    1997        Attributing meaning to deliberately false sentences: the case of irony.
                In: Charlotte MandelI and Alyssa McCabe (eds.), The Problem of
                Meaning: Behavioral and Cognitive Perspectives, 377-414. Amsterdam:
               Elsevier.
Drew, Paul.
    1987       Po-faced receipts of teases. Linguistics 25: 219-253.
Du Bois, John W.
   1998        Towards a dialogic syntax. Ms. UC Santa Barbara.
Du Bois, John W., Stephan Schuetze-Coburn, Susanna Cumming and Danae Paolino
    1993       Outline of discourse transcription. In: Jane A. Edwards and Martin
               D. Lampert (eds.), Talking Data: Transcription and Coding in
               Discourse Research, 45-87. HilIsdale, NJ: Lawrence Erlbaum
               Associates.
Duffy, Susan A., Robin K. Morris and Keith Rayner
   1988        Lexical ambiguity and fixations times in reading. Journal of Memory
               and Language 27: 429-446.
Fodor, Jerry
   1983        The Modularity of Mind. Cambridge: MIT Press.
Forster, Kenneth I.
   1979        Levels of processing and the structure of the language processor. In:
               William E. Cooper and E. C. T Walker (eds.), Sentence Processing:
               Psycho linguistic Studies Presented to Merrill Garrett, 27-65.
               HilIsdale, NJ: Erlbaum.
    1989       Basic issues in lexical processing. In: William Marslen-Wilson (ed.),
               Lexical Representation and Process, 75-107. Cambridge: MIT Press.
Gernsbacher, A. Morton
   1990        Language Comprehension as Structure Building. Hillsdale NJ:
               Erlbaum.
Gernsbacher, A. Morton, Boaz Keysar, Rachel W. Robertson and Necia K. Werner
   200 I       The role of suppression and enhancement in understanding metaphors.
               Journal of Memory and Language 45: 433-450.
Gernsbacher, A. Morton and Rachel W. Robertson
   1995        Reading skill and suppression revisited. Psychological Science 6:
                165-169.
Gibbs, Raymond w., Jr.
   1980        Spilling the beans on understanding and memory for idioms in
               conversation. Memory and Cognition 8: 449-456.
                   Irony in conversation: salience. role. and context effects     311
Grice, H. Paul
   1975         Logic and conversation. In: Peter Cole and Jerry Morgan (eds.),
               Speech Acts. Syntax and Semantics Volume 3, 41-58. New York:
                Academic Press.
Hogaboam, Thomas W. and Charles A. Perfetti
    1975        Lexical ambiguity and sentence comprehension. Journal of Verbal
                Learning and Verbal Behavior 14: 265-274.
Horton, WiIliam S. and Boaz Keysar
   1996        When do speakers take into account common ground? Cognition 59:
               91-117.
Inhoff, Alber Werner, Susan D. Lima and Patrick J. CarroIl
   1984        Contextual effects on metaphor comprehension in reading. Memory
               and Cognition 12: 558-567.
Jones, Janet Lee
   1991        Early integration of context during lexical access of homonym
               meanings. Current Psychology: Research and Review 10: 163-181.
Kemper, Susan
   1981        Comprehension and interpretation of proverbs. Journal of Psycho-
               linguistic Research 10: 179-183.
Keysar, Boaz
   1989        On the functional equivalence of literal and metaphorical interpre-
               tations in discourse. Journal of Memory and Language 28: 375-385.
Keysar, Boaz, Dale J. Barr, Jennifer A. Balin and Timothy S. Paek
   1998        Definite reference and mutual knowledge: process models of common
               ground in comprehension. Journal of Memory and Language 39:
                1-20.
Keysar, Boaz, Dale J. Barr and William S. Horton
   1998        The egocentric basis of language use: insights from a processing
               approach. Current Directions in Psychological Sciences 7: 46--50.
Kotthoff, Helga
   1998        Irony, quotation, and other forms of staged intertextuality: double or
               contrastive perspectivation in conversation. Interaction and Linguistic
               Structures 5: 1-27.
Levinson, Stephen
   1983        Pragmatics. Cambridge, UK: Cambridge University Press.
Lucas, Margery M.
   1987        Context effects on the processing of ambiguous words in sentence
               context. Language and Speech 30: 25-46.
Martin, Charles, Hoang Vu, George KeIlas and Kimberly Metcalf
   1999        Strength of discourse context as a determinant of the subordinate
               bias effect. The Quarterly Journal of Experimental Psychology 52A:
               813-839.
                    Irony in conversation: salience, role, and context effects     313
Onifer, William and David A. Swinney
    1981        Accessing lexical ambiguities during sentence comprehension: effects
                of frequency of meaning and contextual bias. Memory and Cognition
                9: 225-236.
Ortony. Andrew, Diane L. Schallert, Ralph E. Reynolds and Stephen J. Antos
    1978        Interpreting metaphors and idioms: some effects of context on
                comprehension. Journal of Verbal Learning and Verbal Behavior 17:
                465-477.
Pexman Penny, Todd Ferretti and Albert Katz
    2000        Discourse factors that influence irony detection during on-line reading.
                Discourse Processes 29: 201-222.
Picoult, Jonathan and Marcia K. Johnson
    1992        Controlling for homophone polarity and prime target relatedness in
                the cross-modal lexical decision task. Bulletin of the Psychonomic
                Society 30: 15-18.
Rayner, Keith and Lyn Frazier
    1989        Selection mechanisms in reading lexically ambiguous words. Journal
                of Experimental Psychology: Learning, Memory, and Cognition 15:
                779-790.
Rayner, Keith and Robin K. Morris
    1991        Comprehension processes in reading ambiguous sentences: reflections
                from eye movements. In: Greg B. Simpson (ed.), Understanding
                Word and Sentence, 175-198. Amsterdam: North Holland.
Rayner, Keith, Jeremy M. Pacht and Susan A. Duffy
    1994        Effects of prior encounter and global discourse bias on the processing
                of lexically ambiguous words: evidence from eye fixations. Journal
                of Memory and Language 33: 527-544.
Schvaneveldt, Roger W., David E. Meyer and Curt is A. Becker
    1976        Lexical ambiguity, semantic context, and visual word recognition.
               Journal of Experimental Psychology: Human Perception and Perfor-
               mance 2: 243-256.
Schwoebel, John, Shelly Dews, Ellen Winner and Kavitha Srinivas
   2000         Obligatory processing of the literal meaning of ironic utterances:
               further evidence. Metaphor and Symbol 15: 47-61.
Searle, John
    1979        Expression and Meaning. Cambridge: Cambridge University Press.
Seidenberg, Mark S., Michael K. Tanenhaus, James M. Leiman and Marie Bienkowski
    1982       Automatic access of the meaning of ambiguous words in context:
               some limitations of knowledge based processing. Cognitive Psychology
                14: 489-537.
Sereno C. Sara, Jeremy M. Pacht and Keith Rayner
    1992       The effect of meaning frequency on processing lexically ambiguous
314     Rachel Giora and Inbal Gur
Ann Dowker
For many years, it has been known that children - and adults - do not use
language only for communication, or for referring to objects and events in
the environment. Children's playful and nonliteral uses of language have
been the subject of study at least since the end of the 19th century (e.g.
Lukens 1894; Chamberlain and Chamberlain 1904; Trettian 1904; Jespersen
1922; Chukovsky 1925 [1968]; Weir 1962; Aimard 1977; Garvey 1977;
Iwamura 1980; Sutton-Smith 1980; Schwartz 1981; Kuczaj 1982, 1983; Fox
1983,1993; Schieffelin 1983; Nelson 1987; Dowker 1989,1991,1998; Joffe
and Shapiro 1991; lohnson 1991; Crystal 1998). These non literal uses of
language are very diverse, but one important feature that they share is their
metalinguistic nature: they involve attention to and manipulation of aspects
of language, rather than treating language as a means to the end of trans-
mitting information. Children manipulate linguistic patterns and relationships,
including phonological relationships (e.g. rhyme and alliteration); syntactic
relationships (e.g. "modified repetition" as defined by Dowker 1991) and
semantic relationships (e.g. simile and metaphor).
    Metaphor and other forms of figurative language have been the object of
considerable interest for researchers: partly because of their importance in
the poetry and other literature - oral and written - of seemingly all cultures
(Finnegan 1977); partly because of their relevance to the ways in which
people form conceptual categories.
   Metaphor subsumes a variety of devices. Simile and metaphor are disting-
uished for several literary and psychological purposes (Kennedy and Chiappe
1999); and some researchers (Happe 1991; Nerlich, Todd and Clarke 1998)
suggest that true metaphor emerges later and makes greater demands on
theory of mind than does simile; but simile will here be treated as just one
form of metaphor.
   There have been numerous studies of the development and early use of
metaphor (Gardner et al. 1978; Billow 1981; Fourment, Emmenecker and
318    Ann Dowker
Pantz 1987; Gentner, Falkenhainer and Skorstad 1988; Winner 1988; Caramelli
and Montanari 1995).
    Most of these studies suggest that some forms of metaphor begin very
early. Billow (1981) found that it occurred frequently in the spontaneous
play and conversation of children from 2112 to 6; and that it did not increase
in frequency with age within that age range.
    Gardner et al. (1978) and Winner (1988) also report extensive use of
metaphor during the preschool period; though unlike Billow they found
significant age differences. Metaphor increased during the preschool period
and then declined with the onset of a "literal stage" at the age of about 6
or 7. It was suggested that during this period, children become more aware
of linguistic rules and category boundaries and are reluctant to violate them.
By adolescence, these rules and boundaries are more firmly defined, and the
children are in complete control of them, and thus may be more ready to
violate them for particular purposes. This may parallel the U-shaped rel-
ationships between expertise and strategy variability that Dowker et al. (1996)
have proposed for other domains such as mathematics.
    Later studies have mostly supported the view that young children are
capable of comprehending and using metaphors, though there are some age
differences in their frequency and predominant types. Numerous studies
(Gardner et al. 1978; Dowker 1986; Gentner, Falkenhainer and Skorstad
1988) indicate that young children tend to use predominantly perceptual meta-
phors, e.g. They [children walking around the school] sound like horses. Older
children and adults use a relatively larger number of cross-sensory metaphors,
e.g. Her dress is so loud that it shouts, and psychologicaVphysical metaphors,
e.g. fear like a seep of water (from a poem by "Emma", quoted by Dowker,
Hermelin and Pring 1996), and I grasp your meaning. For example, Caramelli
and Montanari (1995) studied Italian children's use of animal terms as meta-
phors and found that they moved from basing metaphors predominantly on
visual resemblance at age 6 to basing them predominantly on moral judge-
ment at age 12.
    There also appear to be age differences in the contexts in which people
use metaphor, though this has not been studied as systematically. Children
frequently use metaphor in the context of pretend play (Stross 1975; Billow
1981); while adults associate it strongly with poetry: an association that is
possibly less strong for young children (Dowker 1986; Dowker et al. 1998).
    When researchers attempt to distinguish between literal and figurative,
including metaphorical, language, the distinction is often difficult to make.
                    Young children's and adults' use offigurative language   319
Fass (1997: 2) points out that "it has proved extremely hard to develop precise
criteria for distinguishing literal from non literal language". Lakoff and Johnson
(1980) and Gibbs (1994) put forward the view that, in both children and
adults, the distinction between figurative and literal language is less sharp
than has sometimes been assumed. Gibbs (1994: 435-436) stated that "similar
cognitive mechanisms drive our understanding of both literal and figurative
speech"; that "people need not recognize figurative language as violating
communicative norms and maxims in order to understand what those expres-
sions figuratively mean"; and that "a great deal of our thinking is constituted
by metaphorical mappings from dissimilar source and target domains".
Examples of the latter include thinking of understanding as seeing (That's
very clear; I've got the picture; etc.) or anger as heated fluid in a container
(He got all steamed up; I was boiling over; She hit the ceiling).
    Cross-cultural differences in metaphor use have only been studied to a
limited extent from a psychological point of view, though they have also
received attention from some other perspectives, e.g. the difficulties that they
pose for translators. The ways in which metaphor is used are certainly
influenced by pragmatics and cultural convention: for example, a British
person would use the term fox to mean 'sly and cunning person', while a
Canadian might use the same term to mean 'attractive woman'.
    There have been some studies of the ways in which differences in meta-
phoric usage may reflect cultural differences in attitudes to and concepts of
certain emotional, social and cognitive domains, such as anger (Gibbs 1994;
Koevecses 20ooa), emotions generally (Palmer, Bennett and Stacey 1999;
Koevecses 2000b), and time (Zhou and Huang 2000; Moore 2001). Such
studies have revealed considerable commonalities, revealing universal cog-
nitive structures; for example both English and Wolof speakers have a
tendency to map spatial vocabulary onto temporal concepts (Moore 2001).
However, cultural differences are also found, and Moore concludes that
"While there is a substantial amount of metaphor structure that is shared
cross-linguistically, a full understanding of conceptual metaphor depends on
properties of particular languages, communities of speakers, or individuals".
Sometimes, even when a given type of metaphorical construction is found
generally across cultures, it is given more emphasis in some cultures than
others. For example, Yu (2001) points out that terms for the face are used
metaphorically in both English and Chinese to refer to broader aspects of
physical appearance (she's not just a pretty face), to emotion and character
(we mustface up to this), to interactions and relationships (we had aface-to-
320    Ann Dowker
face discussion), and to prestige and dignity (he lost face). However, such
metaphors were used more richly in Chinese than English, reflecting cultural
differences in the values attributed to the relevant concepts.
    Far fewer studies have been carried out on the ways in which language or
culture may affect the actual frequency of figurative language.
    Some years ago, I carried out an analysis of spontaneous poems produced
by children between the ages of 2 and 6 (Dowker 1986). The sources included
Timothy Rogers' book Those First Affections, his extensive unpublished
collection of poems by very young children, other published sources (e.g.
Trettien 1894; Dixon 1930; Britton 1970; Griffin 1981; Heath 1983; etc.),
and numerous personal communications from friends and acquaintances. In
all, there were nearly 400 poems by English children: 398 to be precise,
including 26 by 2-year-olds, 57 by 3-year-olds, 73 by 4-year-olds, 99 by 5-
year-olds, 99 by 6-year-olds, and 44 by nursery-school children of unspecified
age.
    The English poems were compared with poems by French, Hebrew and
Russian-speaking children of similar age-range. The 37 French poems were
taken from various published sources (e.g. Aimard 1977; Chevalley 1982).
Most of the 66 Russian poems come from Chukovsky (1970) with a few
from other published sources. Some of the 19 Hebrew poems were unpub-
lished and sent to me by friends and acquaintances; the rest come from
Goldberg (n.d.) and Rivkai (1937).
    The findings with regard to metaphor were as follows. One-sixth of the
English poems contained metaphor. There was some increase with age from
less than 10% of poems by 2- and 3-year-olds to 22% - just over a fifth - of
poems by 6-year-olds. In all, 78 metaphors were produced. Most of the
metaphors were similes; and about two-thirds of them were perceptually
based; e.g. When we walk around we look like shadows; The moon is just
like a little white bird/Except that it hasn't got a little face; This man was so
tall as the clouds; The sun is like a stove/Boiling water; Hands as soft as
seal. By contrast, only 7% were functionally or behaviourally based.
    The poems by French children were more likely than those of the English
children to contain metaphor. One third of their poems did so. Over half of
the French metaphors were perceptually based (e.g. Lune d'argent ... Etoiles
d'or [Moon of Silver ... Stars of gold]), though a somewhat higher proportion
of the French metaphors did come into other categories, e.g. Les oiseaux
sont mignons/MalheureuxlComme les pauvres gens [The birds are sweetJ
UnluckylLike poor people].
                   Young children's and adults' use o/figurative language   321
    If metaphors were commoner in the French than the English poems, they
were much less common in the Russian and Hebrew poems. Only 8% of the
Russian poems and none of the 19 Hebrew poems contained any metaphor.
    The study of children's spontaneous poems is important in showing what
they produce when not being actively directed by adults. However, there are
some disadvantages in confining oneself to the study of spontaneous poems.
They had to be noted down by adults, which may lead to selection bias:
adults may choose to record poems which they consider to be particularly
good or particularly funny or just particularly interesting. Moreover, these
poems were produced over a very wide timescale, from the 1890s to the 1980s,
and it is possible that children's poems could have changed somewhat over
this period, perhaps in response to changes in children's literature. Therefore,
a somewhat more controlled study was carried out, which involved eliciting
poems from children.
    Dowker et al. (1998) carried out a cross-linguistic study of poem pro-
duction by young children. The participants were 122 English children, 59
French children, 148 Italian children, 118 Polish children and 118 Brazilian
(Portuguese-speaking) children between the ages of 4 and 6 years.
    The basic task involved the successive presentation of three pictures. After
each picture was presented, the child was asked: "Could you tell me a story
about this picture?" and hislher response was recorded on a tape-recorder
and then played back to the child. The child was then told: "Now I'm going
to tell you a poem, which is a bit like a story but not quite. And I'd like you
to make up something like that." One of three poems dealing with the picture
(a rhyming poem, an alliterative poem and a simile poem) was then presented
as a stimulus, and the child was asked: "Can you make up something like
that?" Once again, the child's response was recorded and then played back
to himlher on the tape-recorder. The same procedure was followed with the
next two pictures. (The instructions were, of course, translated into the
relevant languages for the non-English children.) During this task, each child
heard one rhyming poem, one alliterative poem and one simile poem.
    Differences between the procedure as applied to the different groups were
(i) that of necessity the stimulus poems were different, as the children of
each group heard stimulus poems in their own language; (ii) the pictures
sometimes differed between languages, as they had to permit the production
of stimulus poems with the appropriate devices in a particular language.
    There were some differences in the use of phonological devices. The
Italian children made extensive use of both rhyme and alliteration; while
322      Ann Dowker
the Polish children made considerable use of alliteration but less use of
rhyme, and the English children used much rhyme but little alliteration. The
French and Brazilian groups were the only groups in the study to use phono-
logical devices in less than half of their poems. However, by far the most
marked differences were in the use of simile and metaphor. Metaphor, almost
always consisting of simile, was in general less common than the phonological
devices; but there were extremely striking group differences. Only a tiny
proportion of English, Italian or Polish children produced poems containing
similes, whereas similes were much more common in the French and Brazilian
poems.
   It was inappropriate to investigate age differences in the use of simile in
the English, Italian or Polish group, because of the very small numbers
involved. They were nonsignificant in the Brazilian group <X2 = 0.45; df = 2;
p n.s.). In the French group, there was a significant increase with age <X2 =
8.82; df = 2; p < 0.05).
    Thus, contrary to what might have been predicted on the grounds that
there are much greater phonological than semantic differences between lang-
uages, group differences were greater with regard to the use of metaphor and
simile than with regard to the use of phonological devices. Whereas all the
groups did make significant use of phonological devices in their poems, three
of the five groups (English, Polish and Italian) used almost no metaphor in
their poems.
    The findings that metaphor was used more often, and phonological devices
less often, by French than English children are in agreement with the results
of the study of spontaneous poems. The spontaneous poems were not elicited
by means of examples, making it unlikely that the cross-cultural differences
in the studies were the artifacts of a particular methodology.
    Both studies suggested a possible trade-off between the use of phono-
logical devices and of semantic devices (e.g. simile and metaphor), in that
the groups (French and Brazilian) that made the least use of phonological
devices made the greatest use of metaphor. Dowker et al. (1998) also found
that French and Brazilian children's poems outnumbered their stories, whereas
the reverse was true of the other groups. This may indicate differences in the
nature and sharpness of the division between the genres of stories and poems.
Perhaps to French and Brazilian children, poems are "stories with rhythm",
featuring semantic devices. For the other groups, poems may be more sharply
distinguished from stories through their emphasis on phonological as opposed
to semantic devices.
    Do the group differences in poem production reflect differences in actual
capacities for understanding or producing metaphor? Though it is difficult to
answer this question with certainty, it appears unlikely that this is the case.
Cross-cultural and cross-linguistic differences in metaphoric development
have not been much studied; but much of the research on metaphoric develop-
ment has been carried out with English-speaking children (e.g. Gardner et
al. 1978; Billow 1981), and there is ample evidence that such children are
capable of using metaphor and simile from an early age.
324    Ann Dowker
that (s)he lacks, then (s)he is doing one of the things that adults do when
they use metaphor: expressing "the otherwise inexpressible". The adult who
describes a state of depression as "a fog surrounding me", or who says "My
love is like a red red rose" is presumably using the figurative expressions
because (s)he cannot convey the meaning equally well by means of literal
description. Indeed, Carabine (1991) has pointed out that both children and
adults have "fuzzy boundaries" to the extensions of common word meanings,
and that there is far more similarity between children and adults in their
word extensions than is often recognized.
    When word meaning extensions are or become standardized within a
language, rather than idiosyncratic to a particular individual or occasion, they
may be described as polysemy.
    Polysemy has been defined (e.g. Ullman 1951) as the property of a word
that has multiple but related meanings (e.g. head can mean 'upper part of the
body', 'chief person in an organization', etc.). It is sometimes difficult to
distinguish it from homonymy (the property of a word that has multiple unrel-
ated meanings; e.g. bank can mean 'edge of a river' or 'financial institution ').
    Some linguists (Ullman 1951; Kinberg 1991) have suggested a continuum
from figurative uses through polysemy to homonymy. Taking into account
the above-mentioned findings concerning children's early semantic develop-
ment, the following continuum is here proposed: overextension - figurative
uses - polysemy - homonymy.
    Such a view implies a close relationship between polysemy and figur-
ative language; and indeed polysemy may be both a consequence and cause
of figurative language use. Many of the extra, polysemous meanings that a
word develops begin as metaphorical extensions of a primary meaning of
that word: e.g. head in the sense of 'chief person' presumably originated as
a metaphorical extension of head in its primary sense of 'upper part of the
body'. This is not to say that all polysemous words have one clearly definable
primary meaning; this is obviously not the case (Gibbs 1994). Other forms
of figurative language such as metonymy, a form of indirect reference in
which one word is replaced by a word referring to an entity closely associated
with it, are also the sources of polysemous meaning (e.g. 'face' for 'person').
   Polysemy may also influence non literal language use: especially in puns
and riddles (Nerlich, Todd and Clarke 1998), but also metaphors and similes.
   Since there appear to be cross-linguistic differences in metaphor use, and
since polysemy and metaphor seem to be closely related, are there also
cross-linguistic differences in polysemy? Do some languages have a greater
326    Ann Dowker
the development and use of polysemy. Only when both the developmental
and linguistic influences on both polysemy and metaphor are better under-
stood can we begin to fully understand the nature of these phenomena. and
the undoubtedly close relationships between them.
References
Aimard, Paul
    1977        Les Jeux de Mots de l'Enfant. Villeurbanne: Simep-Editions.
Alland, Alexander
    1983        Playing with Form: Children Draw in Six Cultures. New York: Columbia
                University Press.
Baroody, Arthur and Ann Dowker (eds.)
    2003        The Development of Arithmetical Concepts and Skills. Mahwah, NI:
                Erlbaum.
Billow, Richard
    198 I       Observing spontaneous metaphor in children. Journal of Experimental
                Child Psychology 3 I: 430-445.
Britton, lames
    1970        Language and Learning. London: Alien Lane.
Carabine, B.
    1991        Fuzzy boundaries and the extension of object-words. Journal of Child
                Language 18: 355-372.
Caramelli, Nicoletta and A. Montanari
    1995        Animal Terms in Children's Metaphors. Lund University, Sweden:
                Cognitive Science Research.
Chamberlain, Alexander and Isabel Chamberlain
    1904        Studies of a child, I and n. Pedagogical Seminary 9: 264-291 and
                452-483.
Chevalley, Etienne
    1982        Miracles de l'Enfance. Paris: Guilde du Livre.
Chukovsky, Kornei
    1925 [1968] From 'TWo to Five. Berkeley: University of California Press. 1968
                translation by Miriam Morton of 1959 edition. First Russian edition
                published in 1925.
    1970 [1925] Ot Dvukk do Pyati [From Two to Five]. Moscow: NarodnayaAsveta.
Crystal, David
    1998        Language Play. Harmondsworth: Penguin.
Deane, Paul
    1988        Polysemy and cognition. Lingua 75: 325-361.
                    Young children's and adults' use offigurative language     329
Dixon, Catherine
    1930       Children Are Like That. New York: John Day.
Dowker, Ann
    1986       Language play in young children. Ph.D. thesis. University of London
               Institute of Education.
    1989       Rhyme and alliteration in poems elicited from young children.
               Journal of Child Language 16: 181-202.
    1991       Modified repetition in poems elicited from young children. Journal
               of Child Language 18: 625~39.
Dowker, Ann (ed.)
    1998       Language play in children. Special Issue of Cahiers de Psychologie
               Cognitive/Current Psychology of Cognition 17.
Dowker, Ann, Amanda Flood, Helen Griffiths, Louise Harriss and Lisa Hook
    1996       Estimation strategies of four groups. Mathematical Cognition 2:
                113-135.
Dowker, Ann, Beate Herrnelin and Linda Pring
    1996       A savant poet. Psychological Medicine 26: 913-924.
Dowker, Ann, Grazyna Krasowicz, Giuliana Pinto, Antonio Roazzi and Anita Smith
   1998        Phonological and semantic devices in very young children: a cross-
               cultural study. Cahiers de Psychologie Cognitive/Current Psychology
               of Cognition 17: 389-416.
Fass, Dan
   1997        Processing Metonymy and Metaphor. London: Ablex.
Finnegan, Ruth
   1977        Oral Poetry. Cambridge: Cambridge University Press.
Fourment, Marie-Claude, Nelly Emmenecker, and Valerie Pantz
   1987        Etudes de la production de metaphores chez les enfants de 3 a 7 ans.
               Annee Psychologique 87: 535-551.
Fox, Carol
   1983        Talking like a book: young children's oral monologues. In: Margaret
               Meek (ed.), Opening Moves: Work in Progress in the Study of
               Children's Language Development. University of London Institute
               of Education: Bedford Way Papers.
   1993        At the Very Edge of the Forest: The Influence of Literature on Story-
               telling by Children. London: Cas sell.
Fremgen, Amy and David Fay
   1980        Overextensions in production and comprehension: a methodological
               clarification. Journal of Child Language 7: 205-211.
Gardner, Howard, Ellen Winner, Robin Bechhoffer and Dennie Wolf
   1978        The development of figurative language. In: Katherine Nelson (ed.),
               Children's Language, Vo!. 1. Cambridge, MA: Gardner Press Inc.
330     Ann Dowker
Garvey, Catherine
    1977         Play with language and speech. In: Susan Ervin-Tripp and Claudia
                 Mitchell-Kernan (eds.), Child Discourse. Academic Press.
Gentner, Dedre, Brian Falkenhainer and Janice Skorstad
    1988         Viewing metaphor as analogy. In: David Helman (ed.), Analogical
                 Reasoning. London: Kluwer.
Gibbs, Raymond w., Jr.
    1994         The Poetics of Mind. Cambridge: Cambridge University Press.
Goldberg, Lea
    (not dated) Between a Children s Writer and his Reader. Tel Aviv: Sifriyat Hapoalim
                 (in Hebrew).
Griffin, Elinor
    1981         Island of Childhood: Education in the Special World of Nursery
                 School. New York: Teachers' College Press.
Happe, Francesca
    1991         The autobiographical writings of three Asperger syndrome adults:
                 problems of interpretation and implications for theory. In: Uta Frith
                 (ed.), Autism and Asperger Syndrome. Cambridge: Cambridge University
                 Press.
Heath, Shirley
    1983         Ways with Words. Cambridge: Cambridge University Press.
Hunt, Earl and Franca Agnoli
    1991         The Whortian hypothesis: a cognitive psychology perspective. Psycho-
                 logical Review 98: 377-389.
Iwamura, Susan
    1980         The Verbal Games of Preschool Children. London: Croom Helm.
Jespersen, Duo
    1922         Language: Its Nature. Development and Origin. London: Alien and
                 Unwin.
Joffe, Victoria and G. Shapiro
    1991        Sound-based language play in four language-impaired subjects. The
                South African Journal of Communication Disorders 38: 119-128.
Johnson, Janice
    1991        Developmental versus language-based factors in metaphor interpre-
                 tation. Journal of Educational PsychoLogy 83: 470-483.
Karmiloff-Smith, Anneue
    1990        Constraints on representational change: evidence from children's
                drawings. Cognition 34: 57-83.
Kennedy, John and Dan Chiappe
    1999        What makes a metaphor stronger than a simile? Metaphor and Symbol
                 14: 63-69.
                     Young children's and adults' use offigurative language      331
Kinberg, Naphthali
   1991         Figurative uses, polysemy and homonymy in systems of tense, mood
                and aspect. Lingua 83: 319-338.
Koevecses, Zoltan
   2000a        The concept of anger: universal or culture-specific? Psychopathology
                33: 159-170.
   2000b        Metaphor and Emotion: Language, Culture and Body in Human
                Feeling. Cambridge: Cambridge University Press.
Krutetskii, Vadim
   1976         The Psychology of Mathematical Abilities in Schoolchildren. London:
                University of Chicago Press.
Kuczaj, Stan
   1982         Language play and language acquisition. In: Hayne Reese (ed.),
                Advances in Child Development and Child Behaviour 17. London:
                Academic Press.
   1983         Crib Speech and Language Play. Berlin: Springer-Verlag.
Lakoff, George and Mark 10hnson
   1980         Metaphors We Live By. Chicago: University of Chicago Press.
Lukens, Herman
   1894         Report on the learning of language. Pedagogical Seminary 3: 419-
                460.
Moore, Kevin
   2001         Spatial experience and temporal metaphors in Wolof: point of view,
                conceptual mapping and linguistic structure. Dissertation Abstracts
                International, Section A: Humanities and Social Sciences 61: 2685.
Nelson, Katherine (ed.)
   1987         Narratives From the Crib. Cambridge, MA: Harvard University
                Press.
Nelson, Katherine, Leslie Rescorla, lanice Gruendel and Helen Benedict
   1978         Early lexicons: what do they mean? Child Development 49: 960-968.
Nerlich, Brigitte and David Clarke
   1997         Polysemy: patterns of meaning and patterns of history. Historiographia
                Linguistica 25: 349-385.
Nerlich, Brigitte, Zazie Todd and David Clarke
   1998         The function of polysemous jokes and riddles in semantic development.
                Cahiers de Psychologie Cognitive/Current Psychology of Cognition
                 17: 343-365.
Palmer, Gary, Heather Bennett and Les Stacey
   1999         Bursting with grief, erupting with shame: a conceptual and gram-
                matical analysis of emotion-tropes in Tagalog. In: Gary Palmer and
                Debra Occhi (eds.), Languages of Sentiment: Cultural Constructions
                of Emotional Substrates, 171-200. Amsterdam: Benjamins.
332     Ann Dowker
Rivkai, Yehuda
    1937        Letters to Parents. Jerusalem: Achiasaf (in Hebrew).
Schieffelin, Bambi
    1983        Talking like birds: sound play in a cultural perspective. In: Elinor
                Ochs and Bambi Schieffelin (eds.), Acquiring Conversational Com-
                petence. London: RoutJedge and Kegan Paul.
Schwarz, Judith
    1981        Children's experiments with language. Young Children 36: 16-26.
Siegler, Robert
    1996        Emerging Minds: The Process of Change in Children's Thinking.
                Oxford: Oxford University Press.
Siegler, Robert and Eric Jenkins
    1989        How Children Discover New Strategies. Hillsdale, NJ: Erlbaum.
Sinha, CJrris and Tania Kuteva
    1995        Distributed spatial semantics. Nordic Journal of Linguistics 18:
                167-199.
Stross, Brian
    1975        Metaphor in the speech play of Tzeltal children. Anthropological
                Linguistics 17: 305-323.
Sutton-Smith, Brian
    1980        The Folkstories of Children. Philadelphia: University of Pennsylvania
                Press.
Trettien, August
    1894        Psychology of the language interest of children. Pedagogical Seminary
                3: 419-466.
    1904        Psychology of the language interest of children. Pedagogical Seminary
                23: 113-177.
UlImann, Stephen
    1951        The Principles of Semantics. Oxford: Blackwell.
Vosniadou, Stella
    1987        Children and metaphors. Child Development 58: 870-885.
Weir, Ruth
    1962        Language in the Crib. The Hague: Mouton.
Winner, Ellen
    1988        The Point of Words: Children's Understanding of Metaphor and Irony.
                Cambridge, MA: Harvard University Press.
Yu, Ning
   2001         What does our face mean to us? Pragmatics and Cognition 9: 1-36.
Zhou, Rong and Xiteng Huang
   2000         A cross-cultural study of the metaphorical representation of time.
                Psychological Science - China 23: 141-145.
Emerging patterns and evolving polysemies: the
acquisition of get between four and ten years
1. Introduction
first and most commonly used verbs in children's speech, along with other
"general-purpose" or "light" verbs such as put, make, go, and do (Clark 1978).
    Get is also one of the many highly polysemic verbs found in English.
The New Oxford Dictionary of English (1998), a dictionary of modern
usage, lists the following main senses and subsenses of get in current English
usage:
    Some questions that might be asked are: which senses are first learned by
children and in which sequence? Does the developmental sequence of
acquisition map onto the most prototypical synchronic spread of senses and/
or the diachronic spreading of senses?
    In this paper, we use recent developments in cognitive linguistics to
explore these questions. Some central tenets of cognitive linguistics and their
implications for the developmental study of get are outlined below. This will
provide the theoretical background for the study, but it will also allow us to
spell out more clearly the various hypothetical scenarios which could be
used to explain the acquisition of polysemous words by children.
2. Theoretical background
2.1. Embodiment
In recent years it has been proposed that meaning is grounded in the nature
of our bodies and in our interaction with the physical, social and cultural
environment we live in (see Lakoff and 10hnson 1999), It is claimed that
concepts are grounded in our bodily experience and then elaborated by struc-
tures of imagination, i.e. metaphor and metonymy. This means that metaphor
and metonymy motivate sense extension in some cognitively natural fashion,
as for example in expressions like 1 got it meaning 'I have understood it'.
   The prototypical cognitive linguistic investigation focuses on the per-
ceptual, visual bases of the emergence and change of word meanings.
Meanings are seen as perceptually grounded. More recently researchers have
also begun to study hearing and smell (see Ibarretxe-Antuiiano 1999) as the
source of our embodied conceptual structures which are again the basis for
our construction of meaning. But the body's input into the construction of
meaning is not only limited to our senses of vision, hearing and smell; we
also interact with the world, and this much more actively, through our hands,
through touch, grasping, gathering, giving and receiving. The prototypical
and primordial experiential scene in which the word get is embedded can be
said to be a situation in which somebody comes to have or hold something.
From this underlying matrix get's polysemous sense extensions seem to flow.
In the case of get the scene, as illustrated in Figure I, corresponds to the
experiential Gestalt of a basic causal event in which an agent receives an
object from another agent, which results in grasping or holding something:
336      Brigitte Nerlich, Zazie Todd and David D. Clarke
GET
                                                  terminal motion,
                                                                                 'arrive'
                                                  initial motion
 'become'
  ...
  'cause'       'capture'                                             'obtain'
 'induce'
                                                                  acquisitjon
                              'receive'
                               'eating'
                            (preparation)
CONCRETE or ABSTRACT
2.2. PoLysemy
Another central tenet of cognitive linguistics has been the poly se mic structure
of semantic categories. It has become clear that the study of the network of
senses surrounding a word form is of fundamental importance for any study
                         The acquisition of get between four and ten years      337
   that ultimately what children learn in acquiring these uses is a full set of
   constructions and not just a few idiomatic senses for a single verb. The study
   traces the emergence of these uses from a few verb-based formulae to a fully
   productive family of constructions. The evidence shows that these uses are
   not acquired in a random order, but rather are systematically linked to one
   another, with more complex constructions learned as extensions from simpler
   ones. (Israel, to appear)
tests (Raukko 1999; see Raukko 1997; see also Lehrer 1974). Others still are
investigating the acquisition of the get-passive using standard psychological
experiments (Kerstin Meints, personal communication; see also Giv6n and
Yang 1994).
    Raukko (1999; see also Raukko, this volume) analysed the polysemous
structure of the verb get by asking adult informants to fill in questionnaires
where they were asked to produce sentence examples that manifest different
meanings of get. His aim was to build up a picture of the intersubjective
perception of the polysemy of get. He established that for young adults the
senses of get cluster approximately around the following base senses:
Obtaining, Receiving, Change of State, Motion, Understanding, Catching,
Obligation, Stable Possession, Carrying, Ability. This conforms grosso modo
to the main dictionary meanings of get, as well as to the diachronic evolution
of get (except for get used in the sense of understanding, which had quite a
high frequency in the examples produced by American high-school students;
on the difficulties in categorizing the senses of get, see Raukko, this volume).
    However, one should be aware of the fact (already noted by the father of
the term polysemy, Michel Breal, in 1884 - see Breal 1991 [1884]):
   that the real meanings are often far more complex than the simple dictionary
   definitions would lead us to suppose .... The primary dictionary meaning of
   words was often far adrift from the sense in which they were actually used.
   Keep, for instance, is usually defined as to retain, but in fact the word is
   much more often employed in the sense of continuing, as in keep cool and
   keep smiling . ... Give, even more interestingly, is most often used ... as 'mere
   verbal padding', as in give it a look or give a report. (Bryson 1990: 143)
   Are we dealing here with four different senses of the verb sneeze? Goldberg
(1998; see also Goldberg 1995) thinks not:
340        Brigitte Nerlich, Zazie Todd and David D. Clarke
    In her view "each of these formal patterns and its associated meaning(s)
form a construction of the language" (Goldberg 1998: 205) and (argument
structure) constructions encode certain archetypal forms of human experience.
This has important implications for the study of language acquisition and
the study of get.
    Prefabricated chunks help us to build sentences up more quickly; argument
structure construction provides us with ready-made blueprints for constructing
sentences. Both should be important in language acquisition (see Hopper
1998: 166).
    Goldberg (1998) has argued that light verbs such as put, go, make, do and
get can be seen as "path breaking verbs", that is, as laying the foundations
for the acquisition of more and more complex grammatical constructions.
Some of these verbs are listed in Table 1.
    We shall explore whether the same applies to the acquisition of verbs like
get.
In order to study the acquisition of the polysemous verb get, one has to be
aware of three levels of polysemy:
3.1. Hypotheses
(i)     The first senses to be acquired will be the most prototypical values of
        the word get related to the primary scenario illustrated in Figure 1; then
        children will gradually work their way outwards to the more meta-
        phorically and metonymically motivated meanings. This would be in
        contrast to a pattern in which random chunks are only later integrated
        into a conceptual whole roughly structured like the prototypical scene.
(ii)    The first senses to be acquired will stay on the primary level of
        polysemy.
342      Brigitte Nerlich. Zazie Todd and David D. Clarke
(iii)    The synchronic spread of polysemes which cover the semantic area
         outlined by the word get reflects to a certain extent the diachronic
         spread of polysemes over time, and the developmental sequence in
         which get is acquired by children follows the same path.
(iv)     The conceptual frames associated with the various meanings of get
         are closely linked to children's primary situational knowledge in which
         the process of "getting" is integrated, of which the frame "obtaining
         a present" is the most central. This will be the starting point for a
         gradual exploration of the semantic and syntactic (argument structure)
         framework surrounding get.
    This means that children should learn the major senses of get in the
following sequence: 'obtain/receive', 'have', 'fetch', 'become', 'go/arrive',
'induce', 'having permission', 'understand'. We predict that the beginning
and the endpoint in the sequence will be stable, whereas there will be quite
a range of variation in the acquistion of the "intermediate" senses. We also
predict that the children will produce only a small number of get instances
from polysemy levels (ii) and (iii).
3.2. Method
This study uses elicitation and ranking tasks in the hope of tapping com-
petencies which do not normally show up in naturalistic tasks. We would
like to stress, however, that analysing the use of get by children in natural
settings is as important in the study of the acquisition of polysemy as using
experimental data (see Duxbury 1999 ms.).
3.2.1. Participants
Participants were recruited from local infant and junior schools. A letter was
sent to all parents of children in the selected age groups, and all children
who were given permission to take part in the study were used as participants.
There were eleven 4-year-olds, twenty 7-year-olds, sixteen 8-year-olds, and
twelve lO-year-olds.
                       The acquisition of get between four and ten years   343
3.2.2. Materials
One Panasonic portable cassette recorder, cassette tapes, and some cards of
postcard size on which the experimental sentences were typed.
3.2.3. Design
The experimental stimuli consisted of a set of sentences using the word get
that were chosen to represent the following core senses of the word: OBTAIN,
FETCH, BECOME, Go, UNDERSTAND. I In order to keep the procedure simple
enough for the 4-year-olds to understand, only one sentence was used to
illustrate each meaning. The sentences were tested in a pilot study with ten
children, as a result of which one of the sentences (OBTAIN) was changed to
remove a possible desirability bias. The initial sentence involved getting a
bicycle as a birthday present; many of the children commented that this was
very boring and the sentence was replaced with a more neutral sentence
about getting a cake. The letter to parents explained that the experimental
sessions would be tape recorded and that all data would be kept anonymous
and confidential. All children saw the same set of experimental sentences.
The order of presentation of the sentences was randomized to prevent any
order effects. The independent variable was age and the dependent variables
were the sentences produced in the elicitation task and the rankings given
for the experimental stimuli in the ranking task.
3.2.4. Procedure
The experiment was conducted over a period of several weeks on the school
site. Children were taken from the classroom one at a time to participate in
the experiment. Children were told that the purpose of the experiment was to
study the meaning of words and that there were no right or wrong answers.
They were asked if they minded the cassette recorder being used and the
machine was then turned on. Children were given an initial warm-up task to
reduce shyness and then the experiment proper began. The production test
method used by Raukko (1999, this volume) was adapted for use with young
children and supplemented by a ranking task. Children were told that we
were interested in the meaning of the word get, and asked if they could think
344       Brigitte Nerlich, Zazie Todd and David D. Clarke
of any sentences that used the word. (Raukko had asked participants to
produce sentences showing as many different senses of get as possible; this
was felt to be too difficult for young children and instead they were just
asked to produce five different sentences.)
    After the child had either produced five sentences or run out of ideas, the
ranking task began. Children were told that they were going to hear five
sentences using the word get and they would have to choose the one which
was the best example of what the word meant. After the sentences were read
out in a random order, the children were given the cards (in the same random
order) to choose from. Once they had selected the best example, they were
then asked to choose the best one from the remaining sentences, and so on
until all five sentences had been ranked. Once all the sentences had been
ranked, children were asked to give a reason for their best and worst choices.
Experimental responses were recorded both on paper and on cassette. Once
the experiment was over, children were thanked for their time, complimented
on their answers, and returned to the classroom.
3.3. Results
The results fall into two sections - the qualitative production data and the
quantitative rankings data.
3.3.1. Productions
To even out the numbers in each age group, data from five children were
selected at random from the 7-year-old group, and discarded. (N.B. This
applies to the production data only.) The productions were then categorized
according to the kind of get produced and the syntactic frames used.
   The typical frames in which utterances were produced are shown in
Table 2. The numbers of utterances falling into each of the main senses is
given in brackets. Not surprisingly, the 4-year-olds produced fewer utterances
and also used a smaller range of types of get. This was also the only age
group in which 'obtain/receive' was not the most popular get production.
The IO-year-olds were the most creative in their production of sentences
containing get. They produced nine different senses of get, including get as
meaning 'induce somebody to do something', embedded in a wide variety of
                        The acquisition of get between four and ten years   345
speech act types and argument structure frameworks. They are closely fol-
lowed, however, by the 7-year-olds who produced eight different senses of
get, including get as meaning 'become' and 'find'. The 8-year-olds produced
seven senses of get and used less syntactic variety. However, they produced
the only instances of get as idiomatic expressions, e.g. Get a life! and Get
lost. This shows a gradual progression towards a mature competence in the
use of get in its core senses, which accelerates at age 6-7. The use of get as
"verbal padding", in phrasal collocations as well as idioms was, however,
extremely rare.
    As one can see from the prototypical objects and events listed in Table 2,
the frames and scenes in which the utterances were implicitly embedded
come mainly from the children's experiences with obtaining (more and more
sophisticated) presents on the occasion of birthdays and at Christmas, of
receiving pocket money and sweets, and of having, being in possession of,
certain special toys or pets. This also ties in with the cognitive linguistic
hypotheses about the embodiment of meaning, as the first senses children
learn are related to their pleasure in having and holding something which is
precious to them in some way.
3.3.2. Rankings
The rankings gi ven to the different sentences by the children were collated.
Since there were different numbers of children in the groups, the data is
expressed as percentages in the graphs. The 20% level is equivalent to chance,
since there were five choices.
    Initially, the data were collated to see which sentences were ranked first,
that is, as the best example of what the word get means. For the 4-year-olds,
we can see (Fig. 2) that 'obtain/receive' was overwhelmingly selected as the
first choice (by 45% of children in this age group). This is significantly
different from chance (p < 0.05). This is not the case with the other age
groups, where there is no sentence emerging as a clear first choice. We can
see that for the 7-year-olds there is no clear leader. For the 8-year-olds,
'become' was never selected as first choice, but the others are ranked pretty
much equal. For the lO-year-olds, again there is no overwhelming first choice.
    The 4-year-olds' overwhelming choice of get 'obtain' as first choice
becomes even clearer when we look at all the rankings that this sentence
received. We can see (Fig. 3) that it was never ranked lower than in the top
                                                                                                                          w
Table 2. Typical frames of productions by children in the different age groups.                                           ~
                                                                                                                          C\
Prototypical      sweets, car, cat   sweets, toy, pocket money,    sweets, pocket money,        sweets, pocket money,
objects/events,                      play station, drink, shoes,   pet, toy, ice-cream,         shoes, video, TV, drink,
prototypical                         cat                           bike, cat, drink,            Christmas, birthday
conceptual                                                         Christmas, birthday
frame
                                                                                                                              ;l
Others            Get up!            get away from, get weighed,   Get lost! Get a life!        get told off, Got you!        "'
                                                                                                                              I:l
                                     get ready                     get ready, I got a long                                   .fll::
                                                                                                                              1:;0
                                                                   way to my birthdayo                                        ::0,0
                                                                                                                              go
                                                                                                                             ~
                                                                                                                             ~
                                                                                                                              0-
                                                                                                                             ~
                                                                                                                              i
                                                                                                                              l::
                                                                                                                              ....
                                                                                                                              I:l
                                                                                                                              5.
                                                                                                                               ~
                                                                                                                             1 <:J
                                                                                                                               ~
348         Brigitte Nerlich, Zazie Todd and David D. Clarke
50
40
      Q,)
      bO
      E
      Q,)
              30
      ~
               20
10
50
40
      Q,)
      bJl
      «I
      C
      Q,)
               30
      ~
               20
10
three. This is quite unusual when compared to all the other rankings given
across all age groups. The probability of this occurring by chance as first
choice this many times is less than 0.005. Four-year-olds also tended to place
'understand' in the last two at a level significantly different from chance
(p < 0.05).
    For the 7-year-olds, the distributions are much flatter; there is no clear
order to their rankings at all (see Fig. 4). In fact, the 7-year-olds gave the
most undifferentiated responses out of the age groups surveyed, and none of
the results are significantly different from chance. This could mean that
7-year-olds are simply guessing. It is, however, also possible that this result
simply reflects the fact that they are aware of all the different meanings of
get, and have no overall preferences about which is most central. In other
words, they have learnt that 'obtain' is not necessarily a "better" example of
get, but they have a pluralistic sense of the meaning of get and have learnt to
embrace its alternative meanings without establishing particular preferences.
This might also show that the process the acquisition of get is relatively
slow but seems to accelerate around age 6, an age when children start to
immerse themselves in word play, jokes, and metaphors, which for the most
30
10
30
10
30
10
3.4. Discussion
Notes
References
Bn!al, Michel
   1991 [1884] How words are organized in the mind. In: M. Bn!al, The Beginnings
               of Semantics. Essays, Lectures and Reviews, 145-151. Edited and
               translated by George Wolf. London: Duckworth.
Brugman, Claudia M.
   1981        The Story of over. Trier: LAUT.
Bryson, Bill
   1990        Mother Tongue. The English Language. London and New York:
               Penguin Books.
354     Brigitte Nerlich, Zazie Todd and David D. Clarke
Adam Kilgarriff
1. Introduction
   Many words have more than one meaning. When a person understands a
   sentence with an ambiguous word in it, that understanding is built on the
   basis of just one of the meanings. So, as some part of the human language
   understanding process, the appropriate meaning has been chosen from the
   range of possibilities.
    Stated in this way, it would seem that WSD might be a well-defined task,
undertaken by a particular module within the human language processor.
This module could then be modelled computation ally in a WSD program,
and this program, performing, as it did, one of the essential functions of the
human language processor, would stand alongside a parser as a crucial
component of a broad range of Natural Language Processing (NLP) appli-
cations. This point of view is clearly represented in Cottrell (1989):
2.1. Thesis
started examining it in the 1950s (see, e.g., Sparck Jones 1986), people have
been writing programs to solve it.
    NLP has not found it easy to give a very principled answer to the question,
"what goes in the lexicon". Before the mid-1980s, many systems made no
claims to wide coverage and contained only as many words in the lexicon as
were needed for the "toy" texts that were going to be analysed. A word was
only made ambiguous - that is, given multiple lexical entries - if it was one
that the researchers had chosen as a subject for the disambiguation study.
This was clearly not an approach that was sustainable for wide coverage
systems, and interest developed in dictionaries, as relatively principled, wide-
coverage sources of lexical information.
    As machine-readable versions of dictionaries started to become available,
so it became possible to write experimental WSD programs on the basis of
the dictionary's verdict as to what a word's senses were (Lesk 1986; Jensen
and Binot 1987; Slator 1988; Guthrie et al. 1990; Veronis and Ide 1990; Guthrie
et at. 1991; Dolan 1994). Looked at the other way round, WSD was one of
the interesting things you might be able to do with these exciting new resources.
    Since then, with the advent of language corpora and the rapid growth of
statistical work in NLP, the number of possibilities for how you might go
about WSD has mushroomed, as has the quantity of work on the subject
(Brown et at. 1991; Hearst 1991; Gale, Church and Yarowsky 1992, 1993;
McRoy 1992; Yarowsky 1992). Clear (1994), Schiitze and Pederson (1995)
and Yarowsky (1995) are of particular interest because of their approach to
the issue of the set of word senses to be disambiguated between. Schiitze
and Pederson devised high-dimensionality vectors to describe the context of
each occurrence of their target word, and then clustered these vectors. They
claim that the better-defined of these clusters correspond to word senses, so
a new occurrence of the word can be disambiguated by representing its
context as a vector and identifying which cluster centroid the vector is closest
to. This system has the characteristic that a context may be close to more
than one cluster centroid, so at times it may be appropriate to classify it as
more than one sense.
    Both Clear (1994) and Yarowsky (1995) provide a mechanism for the
user to input the senses between which they would like the system to disam-
biguate. They ask the user to classify a small number of statistically selected
collocates which the algorithm can use to "seed" the process of learning the
patterns which each sense is associated with. The user determines what the
relevant set of senses is when deciding on the senses he or she will assign
364     Adam Kilgarriff
seed collocates to.2 Clear then finds all the words which tend to co-occur
with the nodeword in a large corpus, and quantifies, for a very large number
of words, the evidence that it occurs with each of the seeds, and thus indirectly,
with each sense of the nodeword. Disambiguation then proceeds by summing
the evidence for each sense provided by each context word.
    Yarowsky's method is iterative: first, those corpus lines for the nodeword
which contain one of the seed collocates are classified. Then the set of corpus
lines so classified is examined for further indicators of one or other of the
senses of the word. These indicators are sorted, according to the strength of
evidence they provide for a sense. It will now be possible to classify a larger
set of corpus lines, so producing more indicators for each sense, and the
process can be continued until all, or an above-threshold proportion, of the
corpus lines for the word are classified. The ordered list of sense-indicators
will then serve as a disambiguator for new corpus lines.
    In the Semantic Concordance project at Princeton a lexicographic team
has been assigning a WordNet (Miller 1990) sense to each noun, verb, adjec-
tive and adverb in a number of texts, thus providing a "gold standard"
disambiguated corpus which can be used for training and evaluating WSD
programs (Landes, Leacock and Tengi 1996).
    In 1994--1995, there was an extended discussion of whether WSD should
be one of the tasks in the MUC (Message Understanding Conference) program. 3
This would have provided for competitive evaluation of different NLP groups'
success at the WSD task, as measured against a "benchmark" corpus, in
which each word had been manually tagged with the appropriate WordNet
sense number (as in the Semantic Concordance). Some trials took place, but
the decision was not to proceed with the WSD task as part of the 1996
MUC-6 evaluation, as there was insufficient time to debate and define detailed
policies. The theme has recently been taken up by the Lexicons Special
Interest Group of the Association for Computational Linguistics (ACL), and
a pilot evaluation exercise is taking place in 1998: a milestone on the road
from research to technology.
2.2. Antithesis
or less, as the dictionary says. Sometimes not all the sense distinctions
recognized in the dictionary are viewed as salient to the program. WSD
researchers tend to be "lumpers", not "splitters"S (Dol an 1994). (This is not,
of course, to say that WSD authors have not noted the theoretical problems
associated with dictionarys' word senses.) WSD research has gone a long
way on this basis: it is now common for papers to present quantitative
comparisons between the performance of different systems. Meanwhile, the
theoreticians provide various kinds of reason to believe there is no such set
of senses. To get beyond this impasse, the question "what is a word sense?"
needs to be considered more closely.
Or, to know what something is, is to know when something is it. To know
what a word sense SI is, is to know which uses of the word are part of s I and
which are not, probably because they are part of Si where i :;t. 1. If we are to
know what word senses are, we need operational criteria for distinguishing
them.
different aspects of the bicycle - its mechanical parts; its frame, saddle and
other large surfaces; its (and its rider's) motion - are highlighted in each
case. The meaning of bike is modulated differently by each context. 6
(10)   Bother! I was about to talk to John, but now he's disappeared!         (NOT-
       HERE)
(11)   I can'tfind it anywhere, it seems to have disappeared.      (CAN'T-FIND)
could mean that each arrived with a carnivorous fish, or that each arrived
bearing a long-handled medieval weapon, but not that the one arrived with
the fish and the other with the weapon. On the other hand, in
each might have raised a right hand, each might have raised a left, or one
might have raised his right, and the other, his left. The question now is, in
is it possible that Ellen bought plants and Harry, food? If so, then the con-
clusion to be drawn from the test is that bean is ambiguous between the
readings, and if not, then it is not. IO
The tests are generally presented with the aid of an un problematic example
of ambiguity and an unproblematic example of vagueness. This is done in
order to demonstrate what the test is and what the two contrasting outcomes
are. However, this is not to use the tests in anger. What we want of a test is
that it is consistent with our intuitions, where our intuitions are clear, and
that it resolves the question, where our intuitions are unclear. The cross-
reading test fares tolerably well in meeting the consistency condition (though
see Geeraerts 1993 for a contrary view). But do the tests help where intuitions
are unclear? There is little if any evidence that they do. Here I discuss three
classes of problems.
    Firstly, it must be possible to construct a plausible test sentence. The
word in its two uses must be able to occur with the same syntax and the
same lexico-grammatical environment. Consider the transitive and intransitive
uses of eat, as in John ate the apple and John ate. Is this a case of ambiguity
or vagueness?
is unacceptable, but the reason is that elided constituents must have the same
syntax and subcategorization in both their expressed and elided occurrences.
It might be desirable to treat all words with alternative subcategorization
possibilities as ambiguous. But whether or not that is done, the test still fails
to elucidate on the topic of a word's meaning, where the word has different
syntax in different uses. The test can only be posed where the two uses are
syntactically similar.
    The disappear example displays a different variant of this problem. The
CAN'T-FIND and NOT-HERE readings have different aspectual characteristics:
CAN'T-FIND is stative while NOT-HERE is a punctual "achievement" verb.
does not permit a crossed reading, but that is because we cannot construct a
viable aspectual interpretation for the conjoined sentence. Compare
(18) ?The newspaper costs 25p and sacked all its staff.
unless we grant newspaper two lexical entries, one for a copy of the newspaper
and one for the owner or corporate entity. Then the size of our lexicon will
start to expand, as we list more and more of the possible kinds of referent for
the word, and still it will never be complete. So the origin of the anomaly
must be the interpretation process. But the anomaly seems similar to the
anomaly that occurs with bank. In a case lying between newspaper and bank,
how would we know whether the source of the anomaly was the lexicon or
the interpretation process? In the general case the point at which the lexical
process becomes a general-purpose interpretative one cannot be identified.
There is no accessible intermediate representation in which lexical ambi-
guities are resolved (for acceptable sentences) but in which the contents of
the sentence has not been incorporated into the hearer's interpretation of the
discourse. Geeraerts (1993) presents an extensive critique of the tests along
these lines, presenting evidence that the different tests give contradictory
results, and that even if we constrain ourselves to looking at just one of the
tests, they can all be made to give contradictory results by manipulating the
context in which the item under scrutiny is set.
    The third problem is simply the lack of evidence that the tests give stable
results. It will sometimes happen that, for the same reading-pair, an informant
will deem crossed readings possible for some test sentences and not for others.
Or different informants will have conflicting opinions. There are, remarkably,
no careful discussions of these issues in the literature. The merit of the method
of acceptability judgements for syntax rests on the relative stability of their
outcomes: they work (to the extent they do) because linguists agree where
the question marks belong. Preliminary investigations into the stability of
outcomes in lexical semantics suggest that it is severely lacking.
    In the middle of this debate stand the lexicographers. The word senses
that most WSD researchers aim to discriminate are the product of their
intellectual labours. But this is far from the purpose for which the dictionary
was written.
    Firstly, any working lexicographer is well aware that, every day, they are
making decisions on whether to "lump" or "split" senses that are inevitably
sUbjective: frequently, the alternative decision would have been equally valid.
In fact, most dictionaries encode a variety of relations in the grey area between
"same sense" and "different sense": see Kilgarriff (1993) for a description of
the seven methods used in LDOCE (1987).
    Secondly, any particular dictionary is written with a particular target
audience in mind, and with a particular editorial philosophy in relation to
debates such as "lumping vs. splitting", so the notion of specifying a set of
word senses for a language in isolation from any particular user group will
be alien to them.
    Thirdly, many are aware of the issues raised by Lakoff, Levin, Pustejovsky
and others, with several lexicographers bringing valuable experience of the
difficulties of sense-division to that literature (see below).
    Fourthly, the weight of history: publishers expect to publish, bookshops
expect to sell, and buyers expect to buy and use dictionaries which, for each
word, provide a (possibly nested) list of possible meanings or uses. Large
sums of money are invested in lexicographic projects, on the basis that a
dictionary has the potential to sell hundreds of thousands of copies. Investors
will not lightly adopt policies which make their product radically different to
the one known to sell. However inappropriate the nested list might be as a
representation of the facts about a word, for all but the most adventurous
lexicographic projects, nothing else is possible. J3
    The division of a word's meaning into senses is forced onto lexicographers
by the economic and cultural setting within which they work. Lexicographers
are obliged to describe words as if all words had a discrete, non-overlapping
set of senses. It does not follow that they do, or that lexicographers believe
that they do.
                                            "[ don't believe in word senses"     373
4.1. Lexicographicalliterature
   It is precisely the lack of clarity in our use of the word culture which makes
   it such a handy word to have at one's disposal. It offers, as it were, semantic
   extras just because in most uses its possible meanings are not clearly disam-
   biguated. What can the dictionary maker do to reflect this state of affairs?
   They do not, cannot by their very structure, show that there is slippage
   between some of the senses that they give but not between others. (p. 139)
   Hanks (1994), looking at climb, and Fillmore and Atkins (1992), studying
the semantic field centred on risk, make similar comments about the
inadequacies of dictionary conventions, and appeal to prototype theory and
374      Adam Kilgarriff
This suggests a quite different answer to the question "what is a word sense?"
Corpus lexicography proceeds approximately as follows. For each word, the
lexicographer
senses that eventually appear in the dictionary are the result, at several
removes, of the basic clustering process.
    Ambiguity tests failed to provide us with an account of what it meant for
two uses of a word to belong to the same word sense. Once we operationalize
"word sense" as "dictionary word sense", we now have a test that meets the
challenge. The identity test for a word sense in a particular dictionary is that
two usages of the word belong to it if and only if the lexicographer would
have put them in the same cluster. 15
    We can now present a different perspective on the ambiguity/generality
debate. Where a word's uses fall into two entirely distinct clusters, it is
ambiguous, but where the clusters are less well-defined and distinct, "vague"
or "unspecified" may be a more appropriate description. There is no reason
to expect to find any clear distinction between the two types of cases.
   As the 715 examples in the British National Corpus (BNC)17 make plain,
typical uses involve things being put into, or taken out of, or looked for in
handbags, or handbags being lost, found, stolen, manufactured, admired,
bought or sold. But a couple of dozen examples stretch the limits of the
definition or fall outside it altogether.
   First, a proper name, and a reference to a unique object:
(19)   the "Drowning Handbag ", an up-market eatery in the best part of
       town
(20)   an inimitable rendering of the handbag speech in The Importance of
       Being Earnest
(21)   She moved from handbags through gifts to the flower shop
(22)   "How about you? Did the bouncing handbag find yoU?"IH
(23)   a weird, menacing building with bats hanging in the trees like
       handbags
(24)   Skin generally starting to age like old handbag or bodywork of car
The final, quite distinct group relates to discos, and the lexical unit dance
round your handbag, a pejorative phrase for the behaviour of certain exclu-
sively female groups at discotheques and dances where - prototypically -
they dance in a circle with their handbags on the floor in the middle. The
conversational speech subcorpus of the BNC provides two instances of the
full form while in the written corpus, the two related corpus lines, both from
music journalism. make only fleeting references to the collocation. and
strikingly indicate a process of lexicalization:
(37)   The shoot was supposed to be a secret, but word got out and Hitman
       regulars travelled down to Manchester. Two thousand couldn't get
       into the club, and tension mounted between trendy regulars (locked
       out of their own club) and the Hitman s handbag brigade (shut out of
       their programme).
(38)   New Yawk drawling rap over Kraftwerks "The Model" just does not
       work, no way, no how. Handbag Dis will love it.
All these uses can be traced back to the standard sense: the potential for
378     Adam Kilgarrijf
using the word in the nonstandard way, is (in varying degrees) predictable
from
These five knowledge sources define the conceptual space within which
lexical creativity and productivity, and the idea of a "word sense", are located. 19
    Needless to say, they frequently interact in complex ways. In handbags
at ten paces, the speakero assumes the addressee's awareness of handbag-
as-weapon. Note that ?briefcases at ten paces and ?shoulder-bags at ten paces
do not carry the same meaning. Although briefcases and shoulder-bags are
just as viable weapons as handbags, the words briefcase and slwulder-bag do
not carry the "weapon" connotations which make the citation immediately
understandable. Handbag-as-weapon is a feature of the word, over and above
the extent to which it is a feature of the denotation.
    In the citation's context, there is no overt reason for a reference to hand-
bag; the people involved are men, not women, so not prototypical handbag-
users, and there is no other reference to femininity. It would appear that the
speaker is aiming to both distance himself from and minimize the significance
of the incident by treating it as a joke. The "duel" metaphor is itself a joke,
and the oddity of handbag in the context of either football or duel, along
with its associations with femininity and Mrs. Thatcher, contributes to the
effect. Moreover, there is a sexist implication that the men were behaving
like women and thereby the matter is laughable.
    Interpreting handbags at ten paces requires lexical knowledge of handbag-
as-weapon, collocational knowledge of both form and meaning of "WEAPON
at NUMBER paces", and (arguably) knowledge of the association between
handbags and models of masculinity and femininity.
    The "music journalism" use displays some further features. Handbag was
                                         "[ don't believe in word senses"   379
lexicalized in the clubbing world in circa 1990 as a music genre: the genre
that, in the 1970s and 1980s, certain classes of young women would have
danced round their handbags to.2 1 The coinage emanates from the gay and
transvestite club scene and is redolent with implications, from the appropri-
ation of the handbag as a symbol of gay pride, to changes in the social
situation of women over the last 20 years (and its expression in fashion
accessories), to transvestite fantasies of being naIve 17-year-old girls in a
more innocent age.
    To restrict ourselves to more narrowly linguistic matters: the license for
the coinage is via the dance round your handbag collocation, not directly
from handbags. As shown by the spoken corpus evidence, the regular, non-
ironic use of the collocation co-exists with the music-genre use. It is of much
wider currency: all but two of a range of informants knew the collocation,
whereas only two had any recollection of the music-genre use. Also, "handbag"
music (or at least the use of that label) was a 1990-1991 fashion, and the
term is no longer current: 1996 uses of it will probably refer back to 1990-
1991 (as well as back to the 1970s and 1980s). Syntactically, the most
information-rich word of the collocation has been used as a nominal pre-
modifier for other nouns: in the music-genre sense, it is used as other music-
genre words, as an uncountable singular noun, usually premodifying but
potentially occurring on its own: "Do you like jazz/house/handbag?"
5.1. Frequency
These arguments make clear that there is a prima facie case for including
handbag-as-weapon and handbag-as-music-genre as dictionaries senses, and
dance round your handbag as an only partially compositional collocation.
Each exhibits lexical meaning which is not predictable from the base sense.
So why do the dictionaries not list them? The short answer is frequency.
Around 97% of handbag citations in the BNC are straightforward base-sense
uses. The music-genre sense is certainly rare, possibly already obsolete, and
confined to a subculture. The collocation is partially compositional and occurs
just twice in the corpus: for any single-volume dictionary, there will not be
space for vast numbers of partially compositional collocations. Not only is a
lex.icographer "a lex.icologist with a deadline" (Fill more 1988) but also a
lexicologist with a page limit.
380     Adam Kilgarriff
    Here the context does not particularly favour either reading against the
other. In the second case, the co-ordination with both an END word (vindi-
cation) and a PUN one (enforcement) supports both readings simultaneously.
How is this possible, given their antagonism? How come these uses do not
result in ambiguity and the potential for misinterpretation? The answer seems
to be that
Citations where sanction is unspecified for either PUN or END are rare, and
there is no case for including the unspecified "control" sense in a dictionary.
    The example demonstrates a relationship between a lexicographer's
analytic defining strategy and the interpretation process. There are occasions
where a "lowest common denominator" of the usually distinct standard uses
of a word will be the appropriate reading, in a process analogous to the way
an analytically-inclined lexicographer might write a definition for a word
like charming or knife, which would cover the word's uses in two or more
distinct corpus clusters. Some dictionaries use nested entries as a means of
representing meanings related in this way.
7. Conclusion
Acknowledgements
This research was supported by the EPSRC Grant K 18931, SEAL. I would also like
to thank Sue Atkins, Roger Evans, Christiane Fellbaum, Gerald Gazdar, Bob Krovetz,
Michael Rundell, Yorick Wilks and the anonymous reviewers for their valuable
comments.
384     Adam Kilgarriff
Notes
*   This article was first published in slightly different form in Computers and the
    Humanities 1997; 31(2): 91-113. Reprinted by permission. © 1997 Kluwer
    Academic Publishers.
        The title of this article is quoted from Sue Atkins (past President, European
    Association for Lexicography; General Editor, Collins-Robert English/French
    Dictionary) responding to a discussion which assumed discrete and disjoint word
    senses, at The Future of the Dictionary Workshop, Uriage-Les-Bains, October
    1994.
1. Since the paper was written, seven years ago, the exercise, christened
    SENSEVAL, has now taken place not once but twice, SENSEVAL-l in 1998
    (Kilgarriff and Palmer 2000) and SENSEVAL-2 in 2001 (Cotton et al. 2001;
    Edmonds and Kilgarriff 2002).
2. In Yarowsky's work, this is just one of the options for providing seeds for the
    process.
3. MUC is a series of US Government-funded, competitive, quantitatively-evaluated
    exercises in information extraction (MUC-5, 1994).
4. The AAAI Spring Symposium on Representation and Acquisition of Lexical
    Information, Stanford, April 1995 and the ACL SIGLEX Workshop on The
    Breadth and Depth of Semantic Lexicons, Santa Cruz, June 1996.
5. "Lumping" is considering two slightly different patterns of usage as a single
    meaning. "Splitting" is the converse: dividing or separating them into different
    meanings.
6. Cruse identifies two major varieties of modulation, of which highlighting is one.
7. See Zwicky and Sadock (1975) for a fuller discussion of the terms and their
    sources.
8. Also related to this distinction is the polysemy/homonymy distinction: when do
    we have two distinct words, and when, one word with two meanings? Most
    commentators agree that there is a gradation between the two, with the distinction
    being of limited theoretical interest. For some purposes, the distinction may be
    more useful than the vagueness/ambiguity one (Krovetz 1996). In practice, similar
    difficulties arise in distinguishing homonymy from polysemy, as in distinguishing
    vagueness from ambiguity.
9. Examples (6)-(11) are taken by comparing four state-of-the-art English learners'
    dictionaries (c/DE 1995; COBUILD 1995; WOCE 1995; OAWCE 1995) and
    finding words where the lexicographers in one team made one decision regarding
    what the distinct word senses were, whereas those in another made another. This
    immediately has the effect of introducing various factors which have not been
    considered in earlier theoretical discussions.
10. For many putatively ambiguous reading-pairs, there are intermediate cases. A
    sprouting bean, or one bought for planting, is intermediate between food and
    plant. But the possibility of intermediate cases does not preclude ambiguity: whether
                                             "/ don't believe in word senses"      385
    an actual or idealized dictionary. Here and throughout the paper we use "use" for
    the former, and "sense" for the latter.
17. For the BNC see http://info.ox.ac.uklbnc. Counts were: handbag 609, handbags
     103, handbagging I, handbagged 2.
18. This turns out to be a (sexist and homophobic) in-group joke, as well as a case
    of both metonymy and of a distinct idiomatic use of the word. Interestingly, in
    the text, "the bouncing handbag" succeeds in referring, even though the idiom is
    not known to the addressee, as is made explicit in the text.
19. In my thesis, in the context of an analysis of polysemy, I call the first four
    knowledge types homonymy, alternation, analogy and collocation. (Taxonomy
    is addressed separately.)
20. This is presented as a quotation of a football manager's spoken comment; quite
    whether it is verbatim, or the Daily Telegraph journalist's paraphrase, we shall
    never know.
21. Thanks to Simon Shurville for sharing his expertise.
22. Kjellmer implies that the further specification is a temporal process, there being
    a time in the interpretation process when the lexical meaning of the word is
    accessed but specified for "control" but not for either PUN or END. I see no grounds
    for inferring the temporal process from the logical structure.
23. A well-organized, hierarchical lexicon will mean that this need not introduce
    redundancy into the lexicon.
References
Apresjan, Juri D.
    [974       Regular polysemy. Linguistics 142: 5-32.
Atkins, Sue
    1993       Tools for computer-aided corpus lexicography: the Hector project.
               Acta Linguistica Hungarica 41: 5-72.
Ay to, John R.
    [983       On specifying meaning. In: Reinhard R. K. Hartmann (ed.), Lexicog-
               raphy: Principles and Practice, 89-98. Sidcup, UK: Academic Press.
Bejoint, Henri
    [994       Tradition and Innovation in Modern English Dictionaries. Oxford:
               Oxford University Press.
Briscoe, Edward J., Ann A. Copestake and Branimir K. Boguraev
    1990       Enjoy the paper: lexical semantics via lexicology. In: COLlNG 90(2):
               42--47, Helsinki.
Brown, Peter, Stephen Della Pietra, Vincent 1. Della Pietra and Robert L. Mercer
    199 [      Word sense disambiguation using statistical methods. In: Proceedings,
               29th ACL, Berkeley, California.
                                             "I don't believe in word senses"     387
ClDE
   1995         Cambridge International Dictionary of English. Cambridge: Cambridge
                University Press.
Clear, Jeremy
    1994        I can't see the sense in a large corpus. In: Ferenc Kiefer, Glibor Kiss
                and Julia Pajzs (eds.), Papers in Computational Lexicography:
                COMPLEX '94, 33-48, Budapest.
COBUlLD
    1995        The Collins COBUlW English Language Dictionary. 2nd Edition.
                Edited by John McH. Sinclair et al. London.
Copestake, Ann A. and Edward J. Briscoe
    1995        Semi-productive polysemy and sense extension. In: James Pustejovsky
                and Branimir Boguraev (eds.), Lexical Semantics: The Problem of
                Polysemy, 15-67. Oxford: Clarendon.
CorpusBench
    1993        CorpusBench Manual. Textware AlS, Copenhagen, Denmark.
Cotton, Scott, Philip Edmonds, Adam Kilgarriff and Martha Palmer (eds.)
    200 1       Proceedings of SENSEVAL-2, Second International Workshop on
                Evaluating Word Sense Disambiguation Systems. ACL, Toulouse,
                France.
Cottrell, Garrison W.
    1989        A Connectionist Approach to Word Sense Disambiguation. London:
                Pitman.
Cruse, David A.
    1986        Lexical Semantics. Cambridge: Cambridge University Press.
    1995        Polysemy and related phenomena from a cognitive linguistic view-
                point. In: Patrick Saint-Dizier and Evelyne Viegas (eds.), Compu-
                tational Lexical Semantics, 33-49. Cambridge: Cambridge University
                Press.
Dolan, WiIIiam B.
    1994        Word sense disambiguation: clustering related senses. In: COLlNG
                94, Tokyo.
Edmonds, Philip and Adam Kilgarriff (eds.)
    2002        Evaluating word sense disambiguation systems. Journal of Natural
                Language Engineering 8(4): 279-292.
FiIImore, Charles
    1988        Keynote lecture, British Association of Applied Linguistics conference,
                Exeter, September 1988.
Fillmore, Charles J. and Beryl T. S. Atkins
    1992        Towards a frame-based lexicon: the semantics of risk and its neighbors.
                In: Adrienne Lehrer and Eva Kittay (eds.), Frames, Fields and
                Contrasts, 75-102. New Jersey: Lawrence Erlbaum.
388     Adam Kilgarriff
McRoy. Susan W.
    1992        Using multiple knowledge sources for word sense discrimination.
                Computational Linguistics 18(1): 1-30.
Miller. George
    1990        Wordnet: An on-line lexical database. International Journal of
                Lexicography (special issue) 3(4): 235-312.
Moon. Rosamund
    1989        Objective or objectionable? Ideological aspects of dictionaries. English
                Language Research, 3: Language and Ideology. 59-94.
MUC-5
    1994        Proceedings, Message Understanding Conference. DARPA.
Nunberg, Geoffrey
    1978        The Pragmatics of Reference. Bloomington. IN: University ofIndiana
                Linguistics Club.
    1994        The once and future dictionary. Presentation at The Future of the
                Dictionary Workshop. Uriage-Ies-Bains. France, October 1994.
OAWCE
    1995        Oxford Advanced Learner's Dictionary of Current English. Fifth
                Edition. Oxford: Oxford University Press.
Ostler, Nicholas and Beryl T. S. Atkins
    1991        Predictable meaning shift: some linguistic properties of lexical impli-
                cation rules. In: lames Pustejovsky and Susan Bergler (eds.), Lexical
                Semantics and Knowledge Representation: ACL SIGLEX Workshop,
                Berkeley, California.
Pustejovsky, lames
    1991        The generative lexicon. Computational Linguistics 17(4): 409-441.
Quine, Willard van O.
    1969        Speaking of objects. In: Quine, Willard van 0 .• Ontological Relativity,
                1-25. New York: Columbia University Press.
Schulze, Bruno and Oliver Christ
    1994        The IMS Corpus Workbench. Institut fur maschinelle Sprachverarbeitung,
                Universitat Stuttgart.
Schiitze, Hinrich and lan O. Pederson
    1995        Information retrieval based on word senses. In: Proceedings, ACM
                Special Interest Group on Information Retrieval.
Sinc1air, John M. (ed.)
    1987        Looking up: An Account of the COBUILD Project in Lexical Com-
                puting. London: Collins.
Slator, Brian M.
    1988        Lexical semantics and a preference semantics pars er. Technical Report
                MCCS-88-16. Computing Research Laboratory, New Mexico State
                University, New Mexico.
                                           "[ don't believe in word senses"   391
Yorick Wilks
1. Introduction
   ... it is very difficult to assign word occurrences to sense classes in any manner
   that is both general and determinate. In the sentences "I have a stake in this
   country" and "My stake on the last race was a pound" is "stake" being used
   in the same sense or not? If "stake" can be interpreted to mean something as
   vague as "stake as any kind of investment in any enterprise" then the answer
   is yes. So, if a semantic dictionary contained only two senses for "stake":
   that vague sense together with "Stake as a post", then one would expect to
   assign the vague sense for both the sentences above. But if, on the other
   hand, the dictionary distinguished "Stake as an investment" and "Stake as an
   initial payment in a game or race" then the answer would be expected to be
   different. So, then, word sense disambiguation is relative to the dictionary of
   sense choices available and can have no absolute quality about it.
    QED, one might say, since the last sentences seem to show very much
the awareness (a quarter of a century ago, but in the context of a computer
program for sense-tagging) that sense choice may not be exclusive if defined,
as it must be, with respect to a particular dictionary. Hence, in my view, the
BM is no more than a straw man because writers of the dark ages of CL
were as aware as Kilgarriff of the real problems of dictionary senses versus
text occurrences.
    In general, it is probably wise to believe, even if it is not always true, that
authors in the past were no more naive than those now working, and were
probably writing programs, however primitive and ineffective, to carry out
the very same tasks as now (e.g. sense-tagging of corpus words). More
importantly, Wilks (1972), which created an approach called preference
semantics, was essentially a study of the divergence of corpus usage from
lexical norms (or preferences), and developed in the 1970s into a set of
processes for accommodating divergent/nonstandard/metaphorical usage to
existing lexical norms, notions that Kilgarriff seems to believe only developed
in a much later and smarter group of people around 1990, which includes
himself, but also, for example, Fass, whose work was a direct continuation
of that quoted above. Indeed, in Wilks (1972) procedures were programmed
(and run over a set of newspaper editorials) to accommodate such "divergent"
396    Yorick Wilks
(i)    Text usage different from that shown in a whole list of stored senses
       for a given word e.g. in a dictionary (which is what his later experi-
       ment will be about), with
(ii)   Text usage divergent from some "core" sense in the lexicon.
senses to a corpus. In that work, it was shown that about 75%-80% of word
usage could be correctly associated with LDOCE senses, as compared with
hand-tagged control text. That figure was subsequently raised substantially
by additional filtering techniques (particularly in Stevenson and Wilks 2001).
   The two considerations above show, from quite different sources and
techniques, the dubious nature of Kilgarriff's claim. Antal (1963) argued
long ago (and some believe Wierzbicka 1989 maintains some version of such
a position) that words have only core senses. On such a view, dictionariesl
lexicons should express that single sense and leave all further sense refine-
ment to some other process, such as real world knowledge manipulations,
AI if you wish, but not a process that uses the lexicon.
   Since the CRL result suggested that the automatic sense-resolution pro-
cedures worked very well (near 80%) at the homograph level (i.e. the sense-
cluster level, such as the LDOCE dictionary uses) rather than the ordinary
sense level (the latter being where Kilgarriff's examples all lie), one possible
way forward for NLP would be to go some of the way with Antal's views
and restrict lexical sense distinctions to the homograph level. Then sense-
tagging could perhaps be done at the success level of part-of-speech tagging.
Such a move could be seen as changing the data to suit what you can
accomplish, or as reinstating AI and pragmatics within NLP for the kind of
endless, context-driven, inferences we need in real situations.
   This suggestion is rather different from Kilgarriff's conclusion. He pro-
poses that the real basis of sense distinction be established by usage clustering
techniques applied to corpora. This is an excellent idea and recent work at
IBM (Brown et al. 1991) has produced striking non-seeded clusters of corpus
usages, many of them displaying a similarity close to an intuitive notion of
sense (or you can now explore Google's own version of the very same
methodology at labs.google.com!sets).
    But there are serious problems in moving any kind of lexicography,
traditional or computational, onto any such basis. Hanks (p.c.) has claimed
that a dictionary could be written that consisted entirely of usages, and has
investigated how those might be clustered for purely lexicographic purposes,
yet it remains unclear what kind of volume could result from such a project
or who would buy it and how they could use it. One way to think of such a
product would be the reduction of monolingual dictionaries to thesauri, so
that to look up a word becomes to look up which row or rows of context-
bound semi-synonyms it appears in. Thesauri have a real function both for
native and non-native speakers of a language, but they rely on the reader
                                                          Senses and texts   399
knowing what some or all of the words in a row or class mean because they
give no explanations. To reduce word sense separation to synonym classes,
without explanations attached would limit a dictionary's use in a striking
way.
    If we then think not of dictionaries for human use but NLP lexicons, the
situation might seem more welcoming for Kilgarriff's suggestion, since he
could be seen as suggesting, say, a new version of WordNet (Miller 1985)
with its synsets established not a priori but by statistical corpus clustering.
This is indeed a notion that has been kicked around in NLP for a while and
is probably worth a try. There are still difficulties: firstly, that any such
clustering process produces not only the clean, neat, classes like mM's
(Hindullew/ChristianlBuddhist) example but inevitable monsters, produced
by some quirk of a particular corpus. Those could, of course, be hand weeded
but that is not an automatic process.
    Secondly, as is also well known, what classes you get, or rather, the
generality of the classes you get, depends on parameter settings in the clustering
algorithm: those obtained at different settings mayor may not correspond
nicely to, say, different levels of a standard lexical hierarchy. They probably
will not, since hierarchies are discrete in terms of levels and the parameters
used are continuous but, even when they do, there will be none of the
hierarchical terms attached, of the sort available in WordNet (e.g. ANIMAL or
DOMESTIC ANIMAL). And this is only a special case of the general problem of
clustering algorithms, well known in information retrieval, that the clusters
so found do not come with names or features attached.
    Thirdly, and this may be the most significant point for Kilgarriff's pro-
posal, there will always be some match of such empirical clusters to any
new text occurrence of a word and, to that degree, sense-tagging in text is
bound to succeed by such a methodology, given the origin of the clusters
and the fact that a closest match to one of a set of clusters can always be
found. The problem is how you interpret that result because, in this
methodology, no hand-tagged text will be available as a control since it is
not clear what task the human controls could be asked to carry out. Subjects
may find traditional sense-tagging (against e.g. mOCE senses) hard but it
is a comprehensible task, because of the role dictionaries and their associated
senses have in our cultural world. But the new task (attach one and only one
of the classes in which the word appears to its use at this point) is rather less
well defined. But again, a range of original and ingenious suggestions may
make this task much more tractable, and senses so tagged (against WordNet
400     Yorick Wilks
style classes, though empirically derived) could certainly assist real tasks
like MT even if they did not turn out wholly original dictionaries for the
book-buying public.
    There is, of course, no contradiction between, on the one hand, my sugges-
tion for a compaction of lexicons towards core or homograph senses, done
to optimize the sense-tagging process and, on the other, Kilgarriff's suggestion
for an empirical basis for the establishment of synsets, or clusters that
constitute senses. Given that there are problems with wholly empirically-
based sense clusters of the sort mentioned above, the natural move would be
to suggest some form of hybrid derivation from corpus statistics, taken
together with some machine-readable source of synsets: WordNet itself,
standard thesauri, and even bilingual dictionaries which are also convenient
reductions of a language to word sets grouped by sense (normally by reference
to a word in another language, of course). As many have now realized, both
the pure corpus methods and the large-scale hand-crafted sources have their
virtues, and their own particular systematic errors, and the hope has to be
that clever procedures can cause those to cancel, rather than reinforce, each
other. But all that is future work, and beyond the scope of a critical note.
    In conclusion, it may be worth noting that the BM, in some form, is
probably inescapable, at least in the form of what Pustejovsky (1995) calls a
"sense enumerative lexicon", and against which he inveighs for some 20
pages before going on to use one for his illustrations, as we all do, including
all lexicographers. This is not hypocrisy but a confusion close to that between
(i) and (ii) above: we, as language users and computational modellers, must
be able, now or later, to capture a usage that differs from some established
sense (problem [iiJ above), but that is only loosely connected to problem (i),
where senses, if they are real, seem always to come in lists and it is with
them we must sense-tag if the task is to be possible at all.
We now turn to the claims (Gale et al. 1992; see also Yarowsky 1992, 1993,
1995) that:
(i)    Word tokens in a single text tend to occur with a smaller number of
       senses than often supposed; and, most specifically,
(ii)   In a single discourse a word will appear in one and only one sense,
                                                            Senses and texts    401
     These are most important claims if true for they would, at a stroke, remove
a major excuse for the bad progress of MT, make redundant a whole sub-
industry of NLP, namely sense resolution, and greatly simplify the currently
fashionable NLP task of sense-tagging texts by any method whatever (e.g.
Cowie, Guthrie and Guthrie 1992; Bruce and Wiebe 1994).
     Gale et al.'s claim would not make sense-tagging of text irrelevant, of
course, but it would allow one to assume that resolving any single token of
a word (by any method at all) in a text would then serve for all occurrences
in the text, at a high level of probability.
     Gale et al.'s claims are not directly related to those of Kilgarriff, who
aimed to show only that it was difficult to assign text tokens to any lexical
sense at all. Indeed, Kilgarriff and Gale et al. use quite different procedures:
Kilgarriff's is one of assigning a word token in context to one of a set of
lexical sense descriptions, while Gale et al.'s is one of assessing whether or
not two tokens in context are the same sense or not. The procedures are
incommensurable and no outcome on one would be predictive for the other:
Gale et al.'s procedures do not use standard lexicons and are in terms of
closeness-of-fit, which means that, unlike Kilgarriff's, they can never fail to
match a text token to a sense, defined in the way they do (see below).
     However, Gale et al.'s claims are incompatible with Kilgarriff's in spirit
in that Kilgarriff assumes there is a lot of polysemy about and that resolving
it is tricky, whereas Gale et al. assume the opposite.
     Both Kilgarriff and Gale et a1. have given rise to potent myths about
word-sense-tagging in text that I believe are wrong, or at best unproven.
Kilgarriff's paper, as we saw earlier, has some subtle analysis but one
crucial statistical flaw. Gale et al. 's is quite different: it is a mush of hard-to-
interpret claims and procedures, but ones that may still, nonetheless, be
basically true.
    Gale et al.'s methodology is essentially impressionistic: the texts they
chose are, of course, those available, which turn out to be Grolier's Encyclo-
pedia. There is no dispute about one-sense-per-discourse (their name for claim
[ii] above) for certain classes of texts: the more technical a text the more
anyone, whatever their other prejudices about language, would expect the
claim to be true. Announcing that the claim had been shown true for
402    Yorick Witks
the obligation/tax senses of the English word, which was indeed the criterion
for sense argued for in Dagan and Itai (1994). It has well-known drawbacks:
 most obviously that whatever we mean by sense distinction in English, it is
unlikely to be criterially revealed by what the French happen to do in their
language.
    More relevantly to the particular case, Gale et al. found it very hard to
find plausible pairs for test, which must not of course share ambiguities across
the FrenchlEnglish boundaries (as interest/interet do). In the end they were
reduced to a test based on the six (!) pairs they found in the Hansard corpus
that met their criteria for sense separation and occurrence more than 150
times in two or more senses. In Gale et al.'s defence one could argue that,
since they do not expect much poJysemy in texts, examples of this sort would,
of course, be hard to find. Taking this bilingual method of sense-tagging
for the six-word set as criterial they then run their basic word sense dis-
crimination method over the English Hansard data. This consists, very
roughly, of a training method over lOO-word surrounding contexts for 60
instances of each member of a pair of senses (hand selected), i.e. for each
pair 2 x 60 x 100 = 12,000 words. Notice that this eyeballing method is not
inconsistent with anything in Kilgarriff's argument: Gale et al. selected 120
contexts in Hansard for each word that did correspond intuitively to one of
the (French) selected senses. It says nothing about any tokens that may have
been hard to classify in this way. The figures claimed for the discrimination
method against the criteria} data vary between 82% and 100% (for different
word pairs) of the data for that sense correctly discriminated.
    They then move on to a monolingual method that provides sense-tagged
data in an unsupervised way. It rests on previous work by Yarowsky (1992)
and uses the assignment of a single Roget category (from the 1,042) as a
sense-discrimination. Yarowsky sense-tagged some of the Grolier corpus in
the following way: lOO-word contexts for words like crane (ambiguous
between 'bird' and 'machinery') are taken and those words are scored by
(very roughly, and given interpolation for local context) which of the 1,042
Roget categories they appear under as tokens. The sense of a given token of
crane is determined by which Roget category wins out: e.g. category 348
(TOOLS/MACHINERY) for the machinery contexts, and category 414 (ANIMALS/
INSECTS) for the bird contexts. Yarowsky (1992) claimed 93% correctness for
this procedure over a sample of 12 selected words, presumably checked
against earlier hand-tagged data.
    The interpolation for local effects is in fact very sophisticated and involves
404    Yorick Wilks
training with the lOO-word contexts in Grolier of all the words that appear
under a given candidate Roget head, a method that they acknowledge intro-
duces some noise, since it adds into the training material Grolier contexts
that involve senses of a category 348 word, say, that is not its MACHINERY
sense (e.g. crane as a bird). However, this method, they note, does not have
the problems that come with the Hansard training method, in particular the
notion, unacceptable to many, that sense distinctions in a source languge are
defined by those in the target language (advocated in e.g. Dagon and Itai
 1994), in that, for any word pair where two French and English words had
identical senses, there could be no sense distinction at all.
    In a broad sense, this is an old method, perhaps the oldest in lexical
computation, and was used by Masterman (reported in Wilks 1972) in what
was probably the first clear algorithm ever implemented for usage dis-
crimination against Roget categories as sense-criterial. In the very limited
computations of those days the hypothesis was deemed conclusively falsified;
i.e. the hypothesis that any method overlapping the Roget categories for a
word with the Roget categories of neighbouring words would determine an
appropriate Roget category for that word in context.
    This remains, I suspect, an open question: it may well be that Yarowsky's
local interpolation statistics have made the general method viable, and that
the lOO-word window of context used is far more effective than a sentence.
It may be the 12 words that confirm the disambiguation hypothesis at 93%
would not be confirmed by 12 more words chosen at random (the early
Cambridge work did at least try to resolve by means of Roget all the words
in a sentence). But we can pass over that for now, and head on, to discuss
Gale et al.'s main claim (ii) given the two types of data gathered.
    Two very strange things happen at this point as the Gale et al. paper
approaches its conclusion: namely, the proof of claim (ii) or one-sense-per-
discourse. Firstly, the two types of sense-tagged data just gathered, especially
the Roget-tagged data, should now be sufficient to test the claim, if a 93%
level is deemed adequate for a preliminary test. Strangely, the data derived
in the first part of the paper are never used or cited and the reader is not told
whether Yarowsky's Roget data confirm or disconfirm (ii).
    Secondly, the testing of (ii) is done purely by human judgement: a "blind"
team of the three authors and two colleagues who are confronted by the
Oxford Advanced Learners Dictionary (OALD) main senses for one of nine
test words, and who then make judgements of pairs of contexts for one of
the nine words drawn from a single Grolier article. The subjects are shown
                                                        Senses and texts   405
(l)    Plastic plants can fool you if really well made [= organic]
(2)    Plastic plants can contaminate whole regions [= factory]
4. Conclusion
Acknowledgements
Work referred to was supported by the NSF under grant #IRI 9101232 and the
ECRAN project (LE-2110) funded by the European Commission's Language
Engineering Division. The paper is also indebted to comments and criticisms from
Adam Kilgarriff, David Yarowsky, Karen Sparck Jones, Rebecca Bruce and members
408     Yorick Witks
of the CRL-New Mexico and University of Sheffield NLP groups. The mistakes are
all my own, as always.
References
Antal, Laszlo
    1963        Questions of Meaning. The Hague: Mouton.
Brown, Peter E, Stephen A. Della Pietra, Vincent J. Della Pietra and Robert L. Mercer
    1991        Word sense disambiguation using statistical methods. In: Proceedings.
                ACL 91, 142-146.
Bruce, Rebecca and Janyce Wiebe
    1994        Word-sense disambiguation using decomposable models. In: Pro-
                ceedings. ACL 94, 251-256.
Copes take, Ann and Ted Briscoe
    1991        Lexical operations in a unification-based framework. In: Proceedings.
                ACL Siglex Workshop, 61-68, Berke1ey, CA.
Cowie, Jim, Joe Guthrie and Louise Guthrie
    1992        Lexical disambiguation using simulated annealing. In: Proceedings,
                COLlNG 92, 903-909.
Dagan, Ido and Alon Itai
    1994        Word sense disambiguation using a second language monolingual
                corpus. Computational Linguistics 20(4): 563-596.
Gale, William A., K. Church, Kenneth Ward and David Yarowsky
    1992        One sense per discourse. In: Proceedings, Fourth DARPA Speech
                and Natural Language Workshop, 61-69, Monterey, CA.
Givon, Talmy
    1967        Transformations of Ellipsis, Sense Development and Rules of Lexical
                Derivation. SP-2896, Systems Development Corp., Santa Monica,
                CA.
Green, Georgia M.
    1989        Pragmatics and Natural Language Understanding. Hillsdale, NJ:
                Erlbaum.
Ide, Nancy and Jean Veronis
    1994        Have we wasted our time? In: Proceedings, International Workshop
                on the Future of the Dictionary, 103-107, Grenoble.
Jorgensen, Julia C.
     1990       The psychological reality of word senses. Journal of Psycho linguistic
                Research 19(3): 167-190.
Kilgarriff, Adam
    1993        Dictionary word-sense distinctions: an enquiry into their nature.
                Computers and the Humanities 26(1-2): 365-387.
                                                           Senses and texts    409
Miller, George
   1985        WordNet: a dictionary browser, In: Proc. First Internat. Cont on
               Information in Data. Waterloo OED Centre, Canada.
Pustejovsky, James
    1995       The Generative Lexicon. Cambridge, MA: MIT Press.
Schutze, Hinrich and Jan O. Pederson
    1995       Information retrieval based on word sense. In: Proceedings, Fourth
               Annual Symposium on Document Analysis and Information Retrieval,
               86-94, Las Vegas, NV.
Small, Steven I., Garrison W. Cottrell and Michael K. Tanenhaus
    1988       Lexical Ambiguity Resolution: Perspectives from Psycholinguistics,
               Neuropsychology, and Artificial Intelligence. San Mateo, CA: Morgan
               Kaufmann.
Stevenson, Mark and Yorick Wilks
   2001        The interaction of knowledge sources for all-word word-sense
               disambiguation. Computational Linguistics 27(3): 44-62.
Wierzbicka, Anna
    1989       Semantics, Culture and Cognition. Oxford: Oxford University Press.
Wilks, Yorick
    1972       Grammar; Meaning and the Machine Analysis of Language. London:
               Routledge.
    1978       Making preferences more active. Artificial Intelligence 11: 55-75.
Wilks, Yorick, Dan C. Fass, Cheng Ming Guo, lames E. McDonald, Tony Plate
               and Brian M. Slator
    1990       Providing machine-tractable dictionary tools. Journal of Machine
               Translation 5: 183-199.
Wilks, Yorick, Brian M. Slator and Louise M. Guthrie
    1996        Electric Words. Cambridge, MA: MIT Press.
Yarowsky, David
    1992        Word-sense disambiguation using statistical models of Roget's categ-
               ories, trained on very large corpora. In: Proceedings, COLlNG 92,
                801-809.
    1993        One sense per collocation. In: Proceedings, ARPA Human Language
                Technology Workshop, 112-117, Princeton, NI.
    1995        Unsupervised word-sense disambiguation rivalling supervised methods.
                In: Proceedings, ACL 95, 189-196, MIT.
Index