X (Bar) Syntax - A Study of Phrase Structure
X (Bar) Syntax - A Study of Phrase Structure
A Study of
Phrase Structure
Linguistic Inquiry Monographs
Samuel Jay Keyser, general editor
Ray Jackendoff
ISBN 0-262-60009-9
Foreword
We are pleased to present this monograph as the second in the series Linguistic Inquiry
Monographs. These monographs will present new and original research beyond the
scope of the article. Because of their originality it is hoped they will benefit our field by
bringing to it perspectives that will stimulate further research and insight.
Preface. xi
1. Preliminaries . 1
1.1. Goals. 1
1.2. Assumptions about the Structure of Linguistic Theory. 2
1.3. Lexicalism and Interpretivism . 5
4. Complements. 57
4.1. The Three Levels of Complements. 57
4.2. The V' and N' Complements . 64
4.2.1. Easy Parts of V'. 64
4.2.2. Particles and Predicates. 66
4.2.3. The N' Complement. 69
4.2.4. Generalization of V' and N' and the Definition of Object. 71
4.3. The V" and N" Complements . 72
4.4. Complements of [ — Subj] Categories. 76
4.4.1. Adjectives. 76
CONTENTS
4.4.2. Adverbs. 78
4.4.3. Prepositions. 78
4.4.4. Particles. 79
4.4.5. Case Markers and Prepositions. 80
4.5. Generalization of Phrase Structure Rules. 81
4.6. Summary. 85
4.7. Appendix: The Passive and Related Rules. 86
4.7.1. NP-Preposing . 86
4.7.2. Agent-Postposing. 88
4.7.3. NP-Postposing . 90
4.7.4. The Determiner . 92
4.8. Appendix: Arguments for Intraposition. 95
4.9. List of Transformations. 101
5. NP Specifiers. 103
5.1. Introduction . 103
5.2. Articles and Quantifiers. 103
5.2.1. Semantic Roles of NP Specifiers . 103
5.2.2. Phrase Structure Rules. 104
5.2.3. We and You. 106
5.3. The Partitive. 106
5.3.1. Partitives as N' Complements. 107
5.3.2. Position of the Quantifier in Partitive Constructions . 110
5.3.3. The Partitive Constraint. Ill
5.3.4. Substantivization. 114
5.4. Pseudopartitives. 119
5.4.1. Arguments for the Construction. 119
5.4.2. Pseudopartitive Specifier Constraints . 123
5.5. Numerals. 126
5.5.1. Comparison to Quantifiers and Group Nouns. 126
5.5.2. Numerals as Nouns. 128
5.5.3 Three Adjustment Rules . 130
5.6. Summary. 133
References 245
Preface
In the fall of 1967, Noam Chomsky gave the series of lectures at MIT in which he
introduced the Lexicalist Hypothesis, one of the first suggestions that the expressive
power of a generative grammar is not concentrated primarily in its transformations.
Many of us who heard those lectures found the new theory tremendously exciting, and
we looked forward to its impact on the study of syntax.
Its immediate effect was more one of spirit than of substance. "Remarks on
Nominalization”, the written form of the lectures, was the first authoritative stand
against the growing school of Generative Semantics; "lexicalist" came to be a term
describing anyone who wanted to retain deep structure in transformational grammar, or
who opposed transformational derivation of lexical items, or who proposed surface
structure rules of semantic interpretation. "Remarks" and Chomsky’s next major
paper, "Deep Structure, Surface Structure, and Semantic Interpretation", became
rallying points for a lexicalist-interpretivist school, mostly consisting of students of
Chomsky’s from 1965 and later.
But although "Remarks” had a great effect on the approach to syntactic descrip¬
tion, the most important issues it raised received surprisingly little attention for several
years. Chomsky’s arguments against the transformational derivation of lexical items
and for the cross-categorial application of transformations were widely appreciated and
extended. But his proposals on the organization of the lexicon and the theory of phrase
structure, although intriguing, were sketchy and programmatic, and this seems to have
discouraged further investigation. Many of us, of course, had inherited a misplaced bias
against phrase structure from the early work in generative grammar, which had little of
interest to say about phrase structure grammar, stressing only its insufficiency for
linguistic description; this prejudice too stood in the way of serious research. More
glamorous issues were demanding attention as well: in the late sixties, the relation of
syntax to semantics was the main arena; in the seventies, constraints on the transfor¬
mational component.
In the past few years, though, the structure of the lexicon has come to be of
growing concern, as phonology, syntax, and semantics have all put pressure on it to
solve their problems. And the gradual reduction in the power of transformations has led
syntactic theory, like other aspects of our lives these days, towards simpler ways—in
this case back to phrase structure. Thus the scope of Chomsky’s original proposals is
only just now beginning to be appreciated.
My original conception of this monograph was as a general overview of the
XII PREFACE
Lexicalist Hypothesis and of its implications for all components of the grammar. But as
I got involved in working it out, the importance of phrase structure stood out more and
more strongly, until it became clear to me that I would have to devote the entire book
to it. And still it kept growing, until the original four or five chapters became the
present ten, taxing my patience to a degree I hope I have spared the reader.
In its present form, the first two chapters of the book introduce the motivations for
a richer account of the base. Chapter 3 is the conceptual core of the book: it elaborates
Chomsky’s sketches in “Remarks” into a highly constrained and explicit theory of
constituent structure. The remainder of the book justifies the theory and works out its
consequences through a detailed examination of English phrase structure, giving a
much more wide-ranging and systematic survey of this aspect of the language than has
to my knowledge appeared in recent years. Thus the book is conceived of as a
contribution both to the study of English syntax and, because of the implications of
English for universal grammar, to linguistic theory as a whole.
As is usual for a book that has been so long in preparation, parts of it have
appeared in various forms elsewhere. The most rudimentary version was a paper called
“Speculations on Presentences and Determiners”, published by Indiana University
Linguistics Club and now mercifully out of print. “Introduction to the X Convention”,
also published by IULC, was the real germ of the book; readers familiar with that paper
will find passages from it scattered throughout chapters 2, 3, and 4, though they will
notice that I have since adopted three levels of bars or primes in major phrasal
categories rather than two. A paper published in Language, “Morphological and
Semantic Regularities in the Lexicon”, was originally intended as a chapter of this
book, but it was crowded out by the insidiously encroaching demands of the phrase
structure component. Finally, a paper entitled “Constraints on Phrase Structure
Rules”, consisting mostly of excerpts from chapters 3, 4, and 9, is appearing in Studies
in Formal Syntax, edited by Akmajian, Culicover, and Wasow (1977).
This monograph has benefited from the advice of numerous colleagues and
students over the years. My discussions on this material with Adrian Akmajian, John
Bowers, Joan Bresnan, Noam Chomsky, Peter Culicover, Joe Emonds, Jay Keyser,
Joan Maling, Ann Reed, Lisa Selkirk, and Henk van Riemsdijk have been particularly
fruitful, and I am grateful for their suggestions and encouragement. I especially want to
thank Akmajian and Culicover, in whose homes I precipitously arrived as a houseguest
in the summer of 1975 when chapters 6 and 7 were at white heat, for their patience and
forbearance in the face of my obsessive enthusiasm.
I also need to express my gratitude to my wife Elise, who, though she arrived on
the scene after the book was already well along, quickly came to an understanding of
what I found fascinating in this stuff and found herself beginning to care that I cared
about it. I have an overwhelming amount to thank her for, perhaps not least of which is
her sobering comment on the whole enterprise: “Impressive, but I'd rather read
Newsweek. ...”
1.1. Goals
Like all work in generative grammar, this study is concerned with describing what it is
that a human being knows when he knows how to speak a language. In particular, we
take it to be essential to separate those aspects of a speaker’s knowledge that are
universal from those that are language-particular, under the hypothesis that the
language-particular features must be learned by the speaker, but the universal parts
may be innate, i.e. determined by the structure of the human organism itself. Success
in separating universal from language-particular components enables us to make
interesting claims about the nature of the mind.
Given the fact that children learn languages in a relatively short period of time and
from rather fragmentary data, we would prefer a theory of linguistic structure in which
the job of the language learner is as easy as possible, that is, in which the universal
component is maximized. The extreme case, of course, would be if the entire language
were innate; but this hypothesis is clearly false, since children learn different languages
depending on the linguistic environment in which they are raised. Therefore, the theory
must strike the proper empirical balance between the diversity of human languages and
the need for them to be leamable.
This vision of the goals of linguistic theory originates in the work of Chomsky; it is
explicated in great detail in many of his writings, particularly Aspects of the Theory of
Syntax. What distinguishes the school of generative-transformational grammar from
other extant linguistic theories is the emphasis given to these goals at every level of
linguistic description, from the high-level formulation of general theory to the most
intricate low-level details of particular languages. The present study, it is hoped, is no
exception.
The particular area of inquiry we will address here is the theory of phrase
structure. Traditional grammar is concerned with defining parts of speech and constitu¬
ent structure, with such concepts as the grammatical relations subject and object, and
with the relations between modifiers and heads. The present study is an attempt to deal
with these traditional issues in the framework provided by the goals and methodology
of generative-transformational grammar.
7 RAY JACKENDOFF
The major questions we will be attempting to answer are these: What are the
lexical categories (parts of speech) available to linguistic theory? How are they
associated with syntactic structures, and what is the relationship between lexical
categories and the syntactic constructions in which they take part? What relationships
exist among lexical categories and how can they be expressed? What relationships exist
among larger syntactic categories and how can they be expressed?
These questions will be dealt with in the context of an attempt to describe the
grammar of English. Of course, in some respects, this choice of data severely limits the
range of possible linguistic phenomena to be used as input in formulating the theory. In
taking this approach rather than a relatively superficial survey of the grammars of many
languages, I am presupposing two methodological biases: first, that it is only by
investigating a system very thoroughly that one can begin to understand it; second, that
evidence from a single language can in fact provide substantial evidence for linguistic
universal. These biases are of course not empirical claims, but only strategies for
arriving at empirical claims. No matter how a claim is arrived at, it must of course be
tested against a great many details of a great many languages.
By the time of Aspects of the Theory of Syntax (1965), it was generally agreed that
linguistic theory must provide five distinct components, each with its own characteris¬
tic types of rules and characteristic interactions with the other components. In other
words, the organization of the grammar into components, the rule types, and their
interactions are not something that the child must learn in learning a particular
language, but rather are part of his hypothesis about what he is to learn. The field has
seen many disputes as to exactly the nature of these components, what aspects of the
language each accounts for, how the rules are formalized and constrained, and how the
rules interact within components and across components; the settling of these questions
is an empirical matter. But the general separation of the theory into components will be
taken for granted here.
The five components of the grammar are the lexicon, the categorial or phrase
structure component, the transformational component, the phonological component,
and the semantic component. The first two are often considered subcomponents of a
larger component called the base component; the first three, or the second and the
third, are often lumped together as the syntactic component. Because of the concern of
this study with the nature of phrase structure, we will be dealing primarily with the
form of the categorial component. We will make no attempt to discuss the other
components at all exhaustively. But since any decision about the form of the base is
nearly always intimately tied up with the other components, particularly the transfor¬
mations and the semantics, we will feel no obligation to ignore them.
The lexicon is the repository of idiosyncratic information about individual forma-
tives of the language. Each formative is assigned a lexical entry which describes its
PRELIMINARIES
phonological, semantic, and syntactic properties. The lexicon also contains a class of
rules called lexical redundancy rules, which may include word formation rules,
morpheme structure rules, and rules of allomorphy.
The categorial component includes a set of context-free phrase structure rules,
which define the hierarchical syntactic categories of the language and simultaneously
determine the left-to-right order of the immediate constituents of each category. The
phrase structure rules generate a set of phrase markers whose initial symbol is S and
whose terminal symbols are the lexical categories of the language.
The base also includes a lexical insertion rule which substitutes lexical formatives
for terminal symbols in the phrase markers generated by the categorial component. In
the Aspects theory, lexical insertion is constrained by the strict subcategorization and
selectional restrictions of the heads of phrases. We will assume here a less restricted
lexical insertion rule which checks only the lexical category and the strict subcategori¬
zation restrictions; selectional restrictions are regarded as well-formedness conditions
on semantic interpretations, as argued in Jackendoff (1972, chapter 1). We will also
assume that terminal symbols can be left unfilled by lexical items, resulting in an
“empty node” in the sense of Emonds (1970); and that a terminal symbol may
dominate a phonologically null item PRO, used as an anaphoric element in elliptical
constructions.1 An empty node must be filled at some point in the derivation in order
for the derivation to be well-formed; PRO must receive a reading by the rules of
anaphora or other specific rules of interpretation, in order for the semantic representa¬
tion to be well-formed. The phrase markers with lexical formatives generated by the
base component of a language L constitute the possible deep structures of sentences of
L.2
The deep structures are taken as input to the transformational component, which
relates them to the surface structures of L, and thence via the phonological component
to the phonetic representations of L.3 We will assume here that the transformations of
L are extrinsically ordered and that their application is governed by the principle of the
transformational cycle, as follows: certain syntactic categories are designated by
linguistic theory as cyclic nodes. In the Standard Theory of Aspects, the cyclic nodes
are the S nodes; it is now generally acknowledged that at least NPs are also cyclic
nodes. In applying the transformations to a sentence, apply in order Tj, . . . , Tn first to
the structures dominated by the most deeply embedded cyclic nodes, that is, to those
1 PRO takes over the function of the empty node A used in Jackendoff (1972), which did not have the
same meaning as Emonds’s empty node. However, Wasow (1975) argues that they need not be distinct
entities.
2 It should be clear from these assumptions that I will not be concerned with theories of transforma¬
tional grammar which use unordered base strings or posttransformational lexical insertion. Nor will I concern
myself here with providing explicit arguments against such theories. However, the analysis arrived at here
constitutes an implicit challenge to their proponents, in that it deals with a range of facts and issues generally
ignored in discussion of the significance of deep structure in generative grammar.
3 There has been considerable evidence that earlier levels of syntactic derivation also serve as input to
the phonology. See Bresnan (1971), for example.
4 RAY JACKENDOFF
which do not dominate any other cyclic node. After T,, . . . , T„ have applied to these
domains, they apply again in order to the structures exhaustively dominated by cyclic
nodes, all cyclic nodes dominated by which have already governed an application of the
cycle. The cycle is repeated under the same conditions until the highest S node is
reached.
We will assume furthermore the principle of the strict cycle (Chomsky (1973, 243)):
“No rule can apply to a domain dominated by a cyclic node A in such a way as to
affect solely a proper subdomain of A dominated by a node B which is also a cyclic
node.’’ Without this convention, it would be easy to circumvent the ordering of
transformations within embedded contexts.
We will take it for granted that there is a class of transformations which take place
on the final (highest) cycle only. In the best possible case, the last-cyclic transforma¬
tions have the same conventions on application as the cyclic transformations, i.e. they
apply in the highest clause only and cannot affect only a subordinate cyclic domain.
However, last-cyclic transformations which violate this condition have appeared in the
literature, and I will leave the issue open.
It has often been suggested that there is also a class of precyclic transformations,
which naturally must have different conventions of application than the cyclic rules.
However, no convincing arguments for such rules have appeared within the particular
variant of transformational grammar being assumed here; clearly the theory would be
more highly constrained if they were unnecessary.
In the Standard Theory of Aspects, the semantic component is taken to be an
interpretive component which derives semantic representations from the deep struc¬
tures of L. However, if taken literally, this assumption turns out to lead to the
Generative Semantics position that deep structures and semantic representations are
isomorphic, a position advocated by Lakoff (1971a), McCawley (1971), and others.
Since in this theory the form of underlying phrase markers must reflect purely semantic
considerations, all the aspects of syntactic generality in a language expressible by the
phrase-structure rules of the Standard Theory are lost.
Alternatively, one can give up the assumption that deep structure is the only
syntactic source of semantic information, adopting instead what Chomsky has called
the “Extended Standard Theory”. Arguments for this modification to the Standard
Theory appear in Chomsky (1970b) and Jackendoff (1972), among others, and I will not
go into those arguments here. In the Extended Standard Theory, various aspects of the
interpretation of a sentence are derived from its deep structure, from its surface
structure, and possibly from its end-of-cycle derived structures. Although this theory,
with its variety of rule types and rather complex organization, is less immediately
attractive than the Standard Theory or Generative Semantics, it appears to be rather
tightly constrained, in that the formal universals organizing the grammar are considera¬
ble.
A recent alternative to the Extended Standard Theory has been provided by the
PRELIMINARIES
"trace theory" of movement rules, developed in Chomsky (1973; 1975), Wasow (1971),
and Fiengo (1974). In this theory, all movements of constituents leave behind anaphoric
"traces" which serve to mark the previous position of the moved constituent. Certain
very general constraints on movement rules follow immediately from independently
motivated conditions on anaphora which are imposed on the traces. While the
consequences of this theory for the transformational component are still not well
understood, it has been pointed out by John Goldsmith that the trace theory permits all
semantic interpretation to take place at the level of surface structure. Jackendoff
(1975b) shows that this property permits a number of disparities in the semantic
component of the Extended Standard Theory to be resolved. Furthermore, it restores
to the semantic component the conceptual simplicity of the Standard Theory. We will
leave open which version of the Extended Standard Theory is to be preferred, as it is
largely irrelevant to our concern with phrase structure.
who have adopted the lexicalist and surface structure interpretivist positions, however,
are willing to accept a wider range of rule types, categories, and rule interactions within
the theory of grammar, provided each component can be sufficiently constrained. They
often are rather skeptical about the ability of traditional logic to provide a model of any
part of semantic representation. This methodology turns out to be congenial both to the
Lexicalist Hypothesis and to the Extended Standard Theory of semantic interpretation.
But there are empirically based ties between the two hypotheses as well. For
instance, if one wants to maintain that the interpretation of sentences containing
negation is determined solely from their deep structure, one must permit transforma¬
tions which drastically alter the shape of lexical formatives; Klima (1964) posits such
changes as neg + any —> no, neg 4- often —* seldom, and neg + many —> few. Though
one could maintain a narrow Lexicalist Hypothesis for nominalizations and still admit
such derivations, they certainly violate the spirit of the Lexicalist Hypothesis, in that
they account for what are at best semiproductive lexical processes by means of a
transformation. Only by adopting a surface structure interpretive theory of negation
(e.g. Jackendoff 1969a; 1972, chapter 8) can we maintain a lexicalist theory of negative
morphemes. Hence surface structure interpretation is a necessary condition for a
desirable extension of the Lexicalist Hypothesis; the two theories thus complement
each other empirically as well as methodologically.
2: Motivation for the X
Convention
This chapter provides the background against which the rest of this study is set.
Section 2.1 is concerned with Chomsky’s treatment of the relation between verbs and
their nominalizations; section 2.2 with adjectives and their nominalizations; and section
2.3 with adjectives and adverbs. Our approach is to show that none of these
relationships among lexical items can be accounted for transformationally. The task of
expressing these relationships must therefore fall on other components of the grammar.
One kind of relationship which it becomes necessary for the grammar to express is
cross-category generalization. We recapitulate Chomsky’s development of the X
Convention as a way to incorporate cross-category generalization, and we show how
this leads to a theory of the base.
Chomsky’s paper “Remarks on Nominalization’’ has as its most immediate concern the
syntax of derived nominals such as John’s criticism of the book and John’s refusal of
the offer. The point of departure is the considerable number of differences between
derived nominals and “gerundive nominals” such as John’s criticizing the book and
John’s refusing the offer. Basically, gerundive nominals behave like regular transforms
of sentences, but derived nominals do not.
Here are some of the differences. Gerundive nominals occur with aspectual verbs,
but derived nominals do not.
Gerundive nominals use adverbs before the gerund; derived nominals use adjectives
before the nominalization.
Derived nominals can take part in partitive expressions and can be pluralized;
gerundive nominals cannot.
But gerundive nominals can take a range of adverbials that are prohibited in derived
nominals.
Gerundive nominals, but not derived nominals, can undergo Subject Raising, Dative
Shift, and Particle Movement.
Subject Raising
Derived nominals may use the in the absence of a subject, while gerundives may not;
compare (2.10) with *the being certain that John will win.
The properties of gerundive nominals are exactly what one would expect if they
were transforms of sentences. But transformations which predict this behavior in
2: MOTIVATION FOR THE X CONVENTION
producing derived nominals from sentences will necessarily be rather difficult to state
and order.1
A second kind of data distinguishing gerundive from derived nominals concerns
syntactic productivity and regularity. While all sentences (except those whose subjects
are too complex to have a possessive form) have gerundive nominals, many, for
example those in (2.11), have no derived nominals.
1 In particular, Ross (1967) argues from the existence of a transformation producing derived nominals to
the conclusion that Dative Shift and Particle Movement are last cyclic: if they were cyclic, they would have
no way of knowing that the sentence they operate in was to be turned into a derived nominal on the next
cycle. Jackendoff (1972, chapter 9) argues that this conclusion leads to an ordering paradox, for Dative Shift
must take place in cyclic derivations such as John is believed by Harry to have been given a book by Bob.
21 will give the page references on Chomsky’s Studies on Semantics in Generative Grammar followed
by those in Jacobs and Rosenbaum (1970).
10 RAY JACKENDOFF
217). Other examples are easy to find. The point of these examples is that a
transformational derivation of at least some readings of these words, from sentences
involving the verbs laugh, marry, etc., must involve the presence of semantic material
idiosyncratic to the particular nominalization. For example, John's deeds does not
mean ‘things which John did’ but rather at least ‘fairly significant things which John
did’. The derivation of the political sense of revolution from revolve is fairly obscure;
similarly for the sense of construction in the verb-particle construction in English. In
short, a transformational derivation of these nominals entails an incredible proliferation
of idiosyncratic transformations.
Finally, there are a great number of nouns which occur in structures similar to
those observed in derived nominals, but for which there is no corresponding verb at all.
Chomsky cites constructions like the weather in England, the story of Bill's exploits,
the message from Bill to Tom about the meeting, a war of aggression against France,
John's attitude of defiance towards Bill, his habit of interrupting (p. 30/196), the author
of the book, the assistant vice-chancellor of the University (p. 32/196), and John's
poems/sonnets/Alexandrines (about Homer) (note 10, p. 20/note 11, p. 217). A
consistent transformationalist approach must derive these nouns from verbs too.
Following such an approach, Lakoff (1971b) comes to derive king from a verb by
analogy with ruler; he then invokes the notion of “positive absolute exception” to the
nominalization rule, in order to guarantee that the verb underlying king never appears
at the surface. Such a solution is indicative of the unnaturalness of the transformational
theory.
To describe the differences between gerundive and derived nominals in a system¬
atic way, Chomsky claims that gerundives have an underlying structure like (2.17),
containing an S,3 but that derived nominals have an underlying structure like (2.18), in
which the head is a noun and no S node appears.
(2.17)
3 We will propose a somewhat different structure in section 3.6. but for the moment we adopt
Chomsky’s formulation.
2: MOTIVATION FOR THE X CONVENTION
(2.18) NP
permit them. We expect Subject Raising, Dative Shift, and Particle Movement in Ss but
not in NPs, hence the differences in (2.9)—(2.10). Only NPs are expected to use the
definite article the, so only derived nominals can use it.
The differences in productivity and regularity also make more sense within the
Lexicalist Hypothesis. If gerundives are simple transforms of sentences, we correctly
expect unfailing productivity of forms and a systematic semantic relationship to
sentential forms. On the other hand, lexical relationships are notoriously irregular and
idiosyncratic, and it is therefore not at all surprising that derived nominals should be as
erratic as they are. Furthermore, if derived nominals are nouns, we would expect (as
observed) that there are NPs of the same structure as derived nominals, not related in
any way to sentences, simply because there happen to be no verbs lexically related to
their head nouns.
Thus the Lexicalist Hypothesis makes a principled differentiation between the two
types of nominalizations, explaining their differences as an automatic consequence of
their structures. Given the proposed structures, the properties of the two types of
nominalizations could not be too much different than they are without complicating the
grammar. On the other hand, the Transformationalist Hypothesis must resort to a great
deal of descriptive complexity to yield the observed properties of derived nominals.
These properties are thus taken to be accidental, in that there is no logical connection
between them, and in that the grammar would be much simpler if in fact the differences
between gerundive and derived nominals did not exist. Stated differently, in the
Transformationalist Hypothesis, the language learner must learn a large number of
related transformations to produce derived nominals; but in the Lexicalist Hypothesis,
the language learner needs to learn only the differences in deep structures (2.17) and
(2.18) in order to produce all the rest of the differences from independently learned
rules.
To make this argument fully convincing, we must of course show that all the rules
applying to derived nominals in the lexicalist account are independently motivated. To
begin with, examine the alleged base form (2.18). It has often been claimed that the
possessive form in John's house, for example, is derived by reduction from a deep
structure relative clause construction such as the house which John has. Under this
theory, possessive NPs do not appear in the base. However, (2.18) claims that the NP
John occupies prenominal position in the base form, so the base must apparently be
enriched to accommodate this structure. Chomsky shows, however, (pp. 37-38/200-
201) that this enrichment of the base is independently necessary: the “inalienable”
sense of John's leg, in which the leg is part of John’s body rather than a wooden leg he
happens to own, cannot be paraphrased by the leg that John has. For example, we
cannot say The leg that John has hurts meaning John's leg hurts. Further, expressions
such as John’s forte have only an inalienable sense; *the forte which John has is
nonsensical. This suggests that at least the inalienable sense of possession is not
derived from a reduced relative clause, but rather is generated in the base.
Further evidence comes from phrases like John’s book and John’s proof of the
2: MOTIVATION FOR THE X CONVENTION
theorem. Here appropriate relative clause paraphrases are the book which John wrote
and the proof of the theorem that John discovered. Insofar as the relative clause
describing John’s relationship to the object is dependent on the nature of the object and
is often still open to variation, a transformation reducing the relative clause to a
possessive will violate the recoverability requirement on deletion transformations.
Suppose, however, that the possessives are generated in the base. One projection rule
for the possessive position can specify a rather loose notion of “intrinsic connection’’
between the possessive NP and the object denoted by the larger NP. This notion would
be sharpened by the semantic nature of the larger NP: if it is written material, intrinsic
connection denotes the writer; if it is an idea, intrinsic connection denotes the
discoverer; and so forth. Placing the burden of specifying intrinsic connection on the
semantic component (or preferably on real-world knowledge) instead of on the syntax
eliminates the violation of recoverability. It simultaneously explains the creativity in
the use of intrinsic connection: for example, John's chair may denote the chair that
John owns (alienable possession), or, by intrinsic connection, the chair that John built,
designed, or habitually sat in.4 Of course, a complete account must be much clearer
about the nature of intrinsic connection that I can be here. My conjecture, however, is
that it is a rather fundamental cognitive function, of which the examples above are
simply special cases.
Thus there is some justification for including the possessive position in the base,
independently of its use in derived nominals. What about the prepositional phrase
following the head noun in (2.18)? Again, it is often claimed that PPs in this position are
reduced relative clauses, and so this position is unnecessary in the base. For example,
a book about Harry and a picture of the lizard are easily derived from a book which is
about Harry and a picture which is of the lizard. Furthermore, the reduction rule has
good independent motivation, since it provides a straightforward source for such
phrases as the child sleeping in the alley and a book yellow with age. Chomsky points
out, however, that at least some PPs in NPs cannot be derived from plausible relative
clauses, for example the weather in England, the weather in 1965, the author of the
book, his attitude of defiance, his advantage over his rivals, the reason for his refusal,
and his habit of interrupting. Since there are no constructions such as *the weather
which is in England, for example, it seems plausible simply to expand the base rules to
permit certain PPs to be generated under NP. But this extension is precisely what we
need to generate (2.18) as a base form.
As further justification for this expansion of the base rules, we observe that many
adjective phrases can include prepositional phrases or complement clauses which
cannot be reduced relative clauses, for example afraid of Bill, eager to please, pleased
with his work, apprehensive about the job, and sorry to be of trouble. This suggests that
4 Cf. “Remarks”, pp. 45/205-206. One might with good reason try to interpret possession as a special
type of intrinsic connection: there seems little justification to syntactically distinguish own or have from build
or write purely for the purposes of stating a reduction transformation, when a suitable semantic generaliza¬
tion might eliminate the reduction rule altogether.
14 RAY JACKENDOFF
all three categories N, V, and A permit a range of following phrases. We will show in
chapter 4 that prepositions likewise exhibit a range of associated structures.
To express this generalization, Chomsky introduces the variable X to stand for any
lexical category symbol, and uses the notation X (henceforth here X') to denote the
node immediately dominating X. The general base rule schema which expresses the
existence of material to the right of the head of a phrase is (2.19).
Chomsky further suggests that the material preceding the head of a major category
phrase is introduced by a base rule of the form (2.21). X (henceforth here X')
designates the node which is two nodes above a major category node, and [Spec, X']
(the specifier of X', henceforth noted as Specx) is an abbreviation for the material
preceding the head.
Again, the material in the specifier varies from category to category. Chomsky
associates Specv with the traditional auxiliary node, SpecN with the traditional
determiner, and SpecA with traditional degree phrases. We will discuss SpecP in
chapters 4 and 6. Chomsky does not say explicitly whether he considers the specifier to
be a constituent or, like Comp, an abbreviation for a sequence of constituents. His
diagrams show it as a constituent; however, we will argue here that it should not be.
To complete the base rules, Chomsky uses the initial rule (2.23).
Thus the trees (2.17) and (2.18) can be rewritten as (2.24) and (2.25). 1 enclose the
traditional names of the nodes in parentheses.
5 Chomsky does not subscript the various Comps by category. However, since the the Comp varies
from one category to another, I have included the subscript to indicate the possibility of variation.
(2.24) N" (NP)
the play
16 RAY JACKENDOFF
In order to capture the similarity of the two relations, the grammar should not have to
state, in addition to the standard Passive, a transformation having very similar effects
to the Passive but in the domain NP. Within the Transformational Hypothesis this
generalization is quite simple; all four phrases above are derived from the same
underlying form, but (2.26a) undergoes Nominalization, (2.27b) undergoes Passive, and
(2.26b) undergoes Passive followed by Nominalization. (2.26b) thus is the nominaliza¬
tion of a passive sentence.
In the Lexicalist Hypothesis this solution is not available, since there is no
Nominalization transformation. Instead Chomsky suggests that, like the base rules and
the projection rules, some transformations, in particular the Passive, may apply over
the domain NP as well as S. If this is the case, (2.26b) is derived from the same
underlying form as (2.26a), but it undergoes the generalized form of the Passive. Hence
its relation to (2.26a) is the same as the relation of (2.27b) to (2.27a), precisely as
required, and no extra rules need be added to the grammar. Section 4.7 will discuss the
generalized Passive in more detail.
The general nature of the claims made by the X Convention are now clear. The
structural schema (2.28), in which X represents any lexical category, is claimed to
constitute a linguistically significant generalization of the structures associated with
major categories.
(2.28) X"
That is, we expect there to exist rules whose structural descriptions refer to a
range of structures including more than one value of X. For example, we expect to find
rules whose domains include V" and N" or perhaps A' and N': but we do not expect to
find rules whose domains include nodes at different levels, for example P' and A".
The fact that some rules generalize to more than one major category of course does
not mean that all rules generalize over more than one major category. If that were the
case, there would be little reason to distinguish among major categories. Rather, we use
nongenerality of rules to distinguish one category from another. For example, the rule
of Cleft Sentence Formation can be used as a test to distinguish N" and P" from A" and
V".
The existence of cross-category generalizations is therefore not taken to be
evidence that one category is a subclass of some other category, as suggested by some,
for example, Lakoff (1971b), Ross (1969), and Postal (1971, chapter 18). Such a tactic
may merely shift around the names assigned to categories, since the distinctions must
18 RAY JACKENDOFF
still be made. For example, if we claim that all adjectives are deep structure verbs, we
must still assign a syntactic feature ±F which distinguishes surface structure verbs
from surface structure adjectives, and which is referred to by all rules in which
adjectives and verbs differ. In effect, then, we may have simply changed the traditional
name “adjective” into “ + F” and the traditional name “verb” into “ —F”, not a very
insightful proposal.
Of course, in a theory of grammar which does not include the X notation or some
similar device, there is no way to express cross-category generalization directly. Thus
one is forced to resort to a supercategory, with all the syntactic difficulties such a
proposal entails. Chomsky’s claim, then, is that cross-category generalizations play an
important role in grammar, and that the theory of grammar thus must contain a way of
expressing them.
The traditional method of expressing a classificatory system which includes cross-
classification is with a set of distinctive features. For the purpose of stating generalized
rules in the X notation, chapter 3 will introduce a set of syntactic distinctive features.
But first, let us examine other evidence for the Lexicalist Hypothesis, of a similar
nature to Chomsky’s but involving different categories.
Bowers (1968a), from which much of the material in this section is derived, discusses
the relationship between adjectives such as high, long, and deep, and their nominals
height, length, and depth, comparing the transformationalist and lexicalist positions.
The basic fact to be accounted for is that the genitive phrase in nominals like the
building’s height, the railroad’s length, the lake’s depth, the slope’s steepness, the
table’s width, the paper’s thickness, and the plane’s lateness is restricted in precisely
the same way as the subject of the corresponding sentences The building is high. The
railroad is long, The lake is deep. The slope is steep, The table is wide, The paper is
thick, and The plane is late. Likewise for nonmeasure adjectives and their nominals:
the genitive NPs in Mary’s beauty, John's perversity, the rock’s whiteness, the table’s
flatness, the desert's dryness, the sky’s cloudiness, that proposal's absurdity, and
Sarah’s diffidence are restricted in precisely the same way as the subjects of the
sentences Mary is beautiful, John is perverse, The rock is white, The table is flat, The
desert is dry, The sky is cloudy, That proposal is absurd, and Sarah is diffident. Under
a transformationalist theory these relations are expressed by a transformation which
converts embedded predicative Ss into the corresponding NPs, the predicate adjective
becoming a noun. Under the Lexicalist Hypothesis the adjectives and nouns are given
separate but related lexical entries; the noun phrases are generated by the base rather
than by the transformational component.
For purposes of comparison, we will again contrast the derived nominals with
gerundive nominals. Gerundive nominals occur with degree adverbs, but derived
2: MOTIVATION FOR THE X CONVENTION
The characteristic equative construction for gerundive nominals is as ... as-, for
derived nominals the same ... as is sometimes applicable.
Subjectless gerundive nominals cannot use the definite article; subjectless derived
nominals may.
Thus the adjective phrases in adjectival gerundive nominals have all the properties one
would expect of adjective phrases; the corresponding derived nominals have the
internal structure of noun phrases, taking all the characteristic noun modifiers such as
the definite article, the same, and prenominal adjectives. Such behavior is consistent
with the Transformational Hypothesis, in that one can always state transformations to
make all the necessary adjustments; but the existence of these transformations is taken
to be an accident of English syntax—things could just as well be otherwise. Under the
Lexicalist Hypothesis, however, only the gerundive nominals contain an underlying
AP; the derived nominals are deep structure NPs. This theory thus predicts the
difference between the two types of nominals and describes the behavior of derived
nominals as an automatic consequence of the behavior of ordinary NPs. Since ordinary
lexical properties determine syntactic behavior, the Lexicalist Hypothesis says that
things would be less general if they were otherwise, for instance if (2.31) instead of
(2.32) were ungrammatical and everything else were the same.
20 RAY IACKENDOFF
As with the deverbal nominals, there are irregularities in the relationship between
adjectives and nouns. Height does double duty for tall and high, there being no
*tallness. In addition, the nouns elevation and altitude cover part of the same semantic
area but are unrelated to adjectives. *Fastness is supplanted by speed (which is in turn
related to an adjective speedy, which is in turn related to a noun speediness), *oldness
is supplanted by age, and *afraidness is supplanted by fear (which is related to an
adjective fearful). Ability covers a much wider semantic range than its adjective root
able', the semantic relation between clean and cleanliness is irregular. Adjectives which
take infinitive complements (She is pretty to look at; It is easy to please Bill) do not
have direct nominal counterparts *her prettiness to look at, *the ease/easiness to
please Bill, though the latter type is related to a gerundive complement construction the
ease/easiness of pleasing Bill. In a transformational theory, each of these irregularities
must be derived by a lexically specific transformation or constraint. In the Lexicalist
Hypothesis, they simply represent deviations from the most ideal relationship; irregular
pairs contain more independent information than regular ones, but do not otherwise
complicate the grammar.
Under the Lexicalist Hypothesis, gerundive nominals and derived nominals will
have structures (2.37) and (2.38), respectively.
(2.37) N"
perverse
MOTIVATION FOR THE X CONVENTION
As with the nominals of section 2.1, we must account for the generality of selectional
restrictions, e.g. for the fact that perverse restricts the subject in (2.37) in precisely the
same way that perversity restricts the genitive phrase in (2.38). The transformationalist
theory of course explains this relationship as an automatic consequence of the deep
structure ascribed to (2.38), which, like (2.37), will contain the S John be perverse.
In the lexicalist framework, we must again appeal to the use of redundancy rules in
the lexicon to express the similarity of the selectional restrictions of perverse and
perversity. As for the rules which enforce selectional restrictions, we can appeal to the
extension to NPs of the notion “subject-of ’ proposed in section 2.1. Since the genitive
NP in (2.38) is to be considered the subject of (2.38), the rules which impose selectional
restrictions on grammatical subjects will apply equally in (2.37) and (2.38). Thus, as in
section 2.1, the Lexicalist Hypothesis requires a cross-category generalization to be
expressed, but this time in terms of adjectives and nouns.
Bowers points out that there is a similarity in the semantic functions of various
subordinate clauses in APs and NPs.
(2.39) a. The table is so wide that the cloth won’t cover it.
b. The table is as wide as I thought it was.
c. The table is wider than I thought it was.
d. The table is too wide to be useful.
RAY JACKENDOFF
I has
(2.40) a. The table such width that the cloth won’t cover it.
lis
b. The table has the width 1 thought it had.
c. The table has more width than I thought it had.
• ''T'l . li (doesn’t have the
d. The table 1. _ width to be useful.
fisn t ot a
While there is not a clear one-to-one correspondence between these phrases, there is
enough similarity to require explanation. In particular the relation between the equative
(2.39b), a comparative clause, and (2.40b), a relative clause, is striking. Bowers
suggests a similarity in the structures of comparative and relatives roughly like (2.41)
and (2.42).
An extension of the grammatical relation obtaining between a noun and its relative
clause to that between an adjective and its comparative clause is called for. The
Lexicalist Hypothesis by now leads us to expect such extensions. We will deal with
this particular relation in chapter 8.
Throughout this section we have ignored an alternative form for the derived
nominals: the height of the building, the width of the table, etc. Bowers argues that this
form is transformationally derived, and that the rule deriving it from the form with a
genitive NP accounts for some of the difficulties in Chomsky’s treatment of the passive.
We will deal with this rule in connection with the passive, in section 4.7.
MOTIVATION FOR THE X CONVENTION
In sum, then, adjectives and their nominalizations present a picture quite similar to
verbs and their nominalizations. The same difficulties accrue to the Transformationalist
Hypothesis, and the same problems of lexical relations and cross-category generaliza¬
tions must be solved by the Lexicalist Hypothesis.
The similarities between adjectives and adverbs in English are far more striking than
those between other pairs of categories we have considered, and many people have
considered it self-evident that adverbs should be derived transformationally from
adjectives. Nevertheless, we will show here that a theory which does not distinguish
between the two categories in deep structure leaves something to be desired.
First observe why a transformational theory is so tempting. Nearly all adverbs
consist morphologically of an adjective plus the -ly suffix, and the meanings of
adjective-adverb pairs are usually quite close. Furthermore, the modifier systems of
the two categories are identical, as pointed out by Bowers (1968b), among others.
rather
(2.43) a. John’s car is incredibly ' fast.
very
b. Its acceleration is so rapid that you’ll never catch him.
c. That quartet is more beautiful than you can imagine.
d. The stop was too sudden to react to.
e. The sun was not as bright as the sunlamp.
But deriving adverbs from adjectives in any semantically enlightening way turns
out to be a process conspicuously lacking in generality: observe the variety of sources
needed (examples from Jackendoff (1972, 52-55)).
(2.45) John is careless at driving his car. —» John drives his car carelessly.
The manner in which John disappeared was elegant. —> John disap¬
peared elegantly.
It was easy for Stanley to win the race. —» Stanley won the race easily.
I am frank in saying there is no reason for it. —> Frankly, there is no
reason for it.
The time at which Harry was known as "The Red Death” was a former
time. —> Harry was formerly known as "The Red Death”.
24 RAY JACKENDOFF
There are many cases where the related adjective exists but provides no convincing
paraphrase for the adverb.
Thus the transformational theory is again faced with the necessity for a large number of
more or less lexically specific transformations. For cases like (2.46), a transformational
derivation cannot even succeed in properly relating the adjective and the adverb forms,
so the motivation for the derivation is vitiated. These are just the kinds of semantic and
syntactic irregularities we found in the previous sections.
Because of the great similarity in the syntax of adjectives and adverbs, it is difficult
to find arguments against the transformational analysis which are based on the contrast
in structure between APs and AdvPs (like the arguments based on (2.1)—(2.10) and
(2.29)—(2.36)). However, there is one major syntactic difference between the two kinds
of phrases: many adjectives strictly subcategorize a PP or an S in their complements,
but the related adverbs do not.
MOTIVATION FOR THE X CONVENTION
d. Happy
I at their departure
[that they were leaving
j, John waved goodbye.
Happily
*at their departure
*that they were leaving
j, John waved goodbye.
I with anticipation
e. Eager ( John chewed his nails.
[to leave
*with anticipation
Eagerly ( John chewed his nails.
*to leave
In fact, Emonds’s theory does not differ from Chomsky’s theory of derived
nominals, except in one crucial respect: Chomsky distinguishes the two categories verb
and noun, for obvious reasons, but Emonds collapses adjectives and adverbs into a
single category, their differences being less apparent. What Emonds’s theory and the
original transformationalist theory account for is the almost complete similarity of
modifier structure and the morphological similarity.
The Lexicalist Hypothesis, on the other hand, is based on expressing the
differences between categories. It thus must find a new way to express the similarities,
since the use of a common category label is not possible. Again, we need a way to
express cross-category generalizations, just as in the previous two cases.
This chapter has shown that the Transformationalist Hypothesis runs up against serious
difficulties in dealing with a number of different problems, and that there is a potential
solution to these difficulties called the Lexicalist Hypothesis. Chomsky’s article
“Remarks” sketches a great number of suggestions concerning the formulation of the
Lexicalist Hypothesis and its consequences, but does not deal with any of the issues in
much detail.
A great deal of subsequent work has verified the need for the theory to express
cross-category generalizations, as predicted by the X Convention. For instance.
Bowers (1968b), Selkirk (1970), Bresnan (1973), and Milner (1973) explore the generali¬
zations among the determiner systems of NPs, APs, and AdvPs that we will address in
chapter 6. Dougherty (1970-1971) shows that the generalizations about surface con¬
junction of various constituents are not to be accounted for by Conjunction Reduction
from conjoined sentences, but must be treated as cross-category generalizations in the
base. Van Riemsdijk (1973; 1976) and Jackendoff (1973) show that prepositions form a
lexical category that participates in the X Convention.
Outside the categorial component, Selkirk (1974) shows that the X Convention
plays a role in determining liaison in certain dialects of French. Akmajian (1975) argues
that NP is a cyclic node, as part of an account of an extraposition rule that applies in
both Ss and NPs. Jackendoff (1971) shows that the rules of Gapping and VP-Deletion
have close NP analogues. Horvath (1976) demonstrates how a topicalization process in
Hungarian appears within several different syntactic categories. Chomsky (1973) uses
the cross-categorial notion of subject in defining his Specified Subject Constraint on
rule application. Perhaps most striking, Bresnan (1976a) revises Chomsky’s A-over-A
Condition into a cross-categorial “X-over-X” Condition, showing that constraints on
rule application depend in part on how cross-categorially the rules are stated.
The Lexicalist Hypothesis has also sparked an interest in the structure of the
lexicon. Such works as Halle (1973), Aronoff (1974; 1976), Siegel (1974), Rardin (1975),
and Jackendoff (1975a) have begun to open up the lexicon as a rich area of inquiry
independent of syntax and phonology.
2: MOTIVATION FOR THE X CONVENTION
What has remained relatively neglected is the theory of phrase structure. Though
all the works just cited make reference to the X Convention, none attempts to explore
systematically its implications for questions such as these: How is the X Convention to
be formulated so as to provide the best account of the syntactic structures of the
language? What is the system of syntactic features relating lexical and syntactic
categories? Can the notions “possible syntactic category” and “possible phrase
structure rule” be characterized?
The rest of this study will be concerned with these questions, and how the
grammar of English can be brought to bear on them. Chapter 3 restates the X
Convention, proposes a theory of syntactic features and a highly restrictive phrase
structure schema, and uses them to reanalyze the grammar of the subject and the
auxiliary. Chapters 4 through 8 apply the theory to substantial fragments of English
phrase structure, defending detailed solutions to problems raised by complements, NP
specifiers, the degree system, relative clauses, and degree clauses. Chapter 9 investi¬
gates a second phrase structure rule schema, one which generates a previously
unremarked class of “deverbalizing rules”; chapter 10 restates a general overview, lest
it have been lost in the intervening furor.
3: A Theory of Phrase
Structure
Chapter 2 presented arguments that the X Convention plays a role in the the theory of
syntax. This chapter will formalize what I take to be a quite conservative version of the
X Convention, the formally simplest and most restrictive statement of the theory that
appears adequate for the description of English. In adopting this approach and thus
requiring reformulation of many well-accepted analyses which would weaken the
theory, I am consciously adhering to the research strategy recommended by Chomsky
in the preface of Syntactic Structures:
Precisely constructed models for linguistic structure can play an important role, both
negative and positive, in the process of discovery itself. By pushing a precise but inadequate
formulation to an unacceptable conclusion, we can often expose the exact source of this
inadequacy and, consequently, gain a deep understanding of the linguistic data. More
positively, a formalized theory may automatically provide solutions for many problems
other than those for which it was explicitly designed.
In attempting to flesh out the X Convention into a full theory, I will separate
Chomsky’s proposal into a number of parts, some of which will be supported here,
others of which will not. Essentially, I will retain what I take to be the conceptual core
of Chomsky’s proposal, but I will modify most of the details.
(3.2) Xfc
The head of a phrase of category X” can be defined in two different ways, either as
the X"-1 that it dominates or as the lexical category X at the bottom of the entire
configuration. We make use of both senses in this study, making it clear when
necessary which sense is meant. Both reflect traditional usages of the term. Refine¬
ments of (3.1) in section 3.3 will guarantee that the head is unique, that is, each X" will
dominate one*and only one X”-1. Section 3.6 will take up a number of exceptions to this
generalization.
The third claim of the X Convention is that rules of grammar are stated in terms of
syntactic feature complexes and the prime notation. To be more precise, each term of
any rule of grammar must be either a specified lexical or grammatical formative or of
aFiT
the form . This third claim gives substance to the other two claims, since it is by
attempting to write rules of grammar in these terms that we test hypotheses about the
feature system and the hierarchical organization of categories.
To flesh out these claims, we must answer at least these questions: What are the
syntactic distinctive features? Is there a maximum value for n in (3.1), and is it the same
for each category? Are there any constraints on what may appear in place of the
ellipses in (3.1)? Does (3.1) provide the only phrase structure rules of the language or
1 Ivan Sag has pointed out (personal communication) that this particular aspect of the X Convention, the
notion that syntactic categories are projections of lexical categories, is a fundamental assumption of Harris’s
approach to syntax (Harris (1946; 1951)). On the other hand, Harris does not (at least explicitly) use this
notion to get at cross-category generalization, nor does he impute any significance to the number of
supercategories associated with a lexical category.
3: A THEORY OF PHRASE STRUCTURE
are there other types? The rest of this chapter is devoted to formulating one possible
set of answers to these questions; subsequent chapters defend these solutions on the
basis of the grammar of English.
The choice among competing systems of distinctive features should be made on the
basis of how easy it is to state actual rules of the language in terms of the proposed
systems. One presumes that rules are more likely to generalize to “natural classes”,
those that take fewer features to pick out. This criterion is essentially the same as the
one used in justifying phonological feature systems, and should therefore be altogether
familiar.
In “Remarks on Nominalization” Chomsky analyzes the major lexical categories
N, A, and V into the features ±N and ±V in the following way:
(3.3) +N -N
+V A V
-V N
A fourth major syntactic category is P(reposition), about which we will have much to
say in this study. Presumably Chomsky’s system would analyze it as [—N, — V],
completing the matrix in (3.3).
In distributing two distinctive features over four major categories, there are
actually only two systems possible other than (3.3):
(3.4) a. N V b. N V
A P P A
The choice among the three must be made on the basis of what categories go together
in the rules of English (and other languages, of course). Anticipating the generalizations
that appear in the rest of this study, we will adopt (3.4a) as our feature system. It will
emerge that there are many rules which generalize across supercategories of N and V,
and this is not expected in a feature system like (3.3). Similarly, there are many rules
which generalize across supercategories of N and A, and many to V and P, and this is
not expected in (3.4b). About the only rule that makes (3.3) and (3.4b) look more
natural than (3.4a) is the formation of cleft sentences, in which the clefted item can be
only an NP or a PP. Since the combination of N and P is so rare, and the combination
of V and A at least equally rare, we will feel justified in provisionally accepting (3.4a)
as the major division of lexical categories. As it would be too laborious to test the three
systems continually throughout this study, I leave it to the interested reader to check
the adequacy of this choice.
The classification of categories made by feature system (3.4a) corresponds to some
32 RAY JACKENDOFF
traditional divisions of categories: N and A are often called “substantives”, and A and
P are often thought of as “modifiers”. (3.4a) also has the interesting property of
corresponding to two significant divisions in the phrase structure rules of English.
Though these divisions may not strictly obtain in other languages, we will for
convenience name the features after the corresponding properties of English phrase
structure.
The first division is between those categories which have a subject, N and V, and
those for which no syntactic subject relation exists, A and P. We will accordingly call
the feature ±Subj, and designate N and V as +Subj and A and P as -Subj.2
To emphasize the heuristic, nontheoretical significance of the names for the
features, it is worth pointing out that in French, for example, nouns cannot take NP
subjects (other than possessive pronouns, which may be articles). Nevertheless, we
assume that French nouns and English nouns have the same syntactic features, and
that it is only the way these features appear in the grammar that differs from one
language to another.
The second major division in the phrase structure of English is between those
categories whose complements may include a surface NP direct object after the head,
i.e. V and P, and those categories whose complement cannot contain a surface NP, i.e.
N and A. The feature will be named ±Obj; V and P are +Obj and N and A are -Obj.
Again, the use of the term “direct object” is strictly syntactic: since there is no NP *his
consideration the offer and no AP *afraid Bill, but only his consideration of the offer
and afraid of Bill, these categories are not considered to have syntactic direct objects.
Next consider the “minor” lexical categories of English, starting with adverbs.
They clearly must be related to adjectives, and we will designate them both as [-Subj,
—Obj], Since their major difference with respect to associated phrase structure is in the
ability to take a complement, we will invent a feature to distinguish them and call it
±Comp: adjectives are +Comp and adverbs are -Comp.
We will also use the feature Comp to describe other minor lexical categories in the
language. The clearest case is the category Particle: particles such as up, on, and away
are morphologically identical to prepositions but permit no complements. To express
this relationship, we designate both particles and prepositions as [-Subj, -t-Obj];
prepositions are +Comp and particles are —Comp.
The difference between modal verbs and ordinary verbs must be stated somewhere
in the grammar, and ±Comp is a good place to localize it, calling verbs [ + Subj, +Obj,
+ Comp], and modals [ + Subj, +Obj, —Comp]. This analysis makes them separate
2 One might question the claim that adjectives do not have a subject, since it is often assumed that John
in John is tall is the subject of tall. This assumption is incorrect. Although tall imposes a selectional
restriction on the NP in NP is tall, the NP bears the grammatical relation "subject-of’ to the verb he, not to
the adjective. This becomes clearer if other verbs are substituted for he, e.g. John became tall. John made it
tall, etc. An NP bearing the “subject-of’ relation to an adjective would have to be contained in the AP, as
the subject of a noun is contained in the NP. Since there is no AP *John('s) fearful corresponding to the NP
John's fear, for example, we conclude that adjectives do not have syntactic subjects. For a discussion of how
the selectional restriction is imposed without a grammatical relation, see Jackendoff (1974b), especially
section 5.
3: A THEORY OF PHRASE STRUCTURE
categories, yet their close feature relationship makes plausible the historical creation of
the category of modals in English (cf. Lightfoot (1974)): a particular class of verbs
underwent change in one syntactic feature, moving into a category available in
universal grammar but previously unrealized in English. We will deal with the English
auxiliary in somewhat more detail in section 3.5.
Finally, there is a clear candidate for a -Comp category associated with nouns:
the rather heterogeneous system of articles and quantifiers. Chapter 5 will further
divide these into two classes, called Articles (Art) and Quantifiers (Q). Although the
actual division will not be determined until chapter 5, we introduce the relevant feature
±Det here for completeness: articles are [ + Subj, — Obj, -Comp, +Det]; quantifiers are
[ + Subj, —Obj, —Comp, -Det],
We will use the feature ±Det also to pick out a special class of adverbs, the
"degree words” so, too, as, etc. We will call this category “Degree” (Deg) and
designate it by the features [-Subj, -Obj, —Comp, + Det]; ordinary adverbs will be
[-Det], This choice will be justified in chapter 6; it depends primarily on parallelisms
between Art and Deg, which we therefore wish to analyze as differing in only one
feature.
There is no place in this system for coordinating conjunctions and complementiz¬
ers. However, they do not participate in the grammar in the same way as the other
categories, in that they do not strictly subcategorize complements and specifiers, so
their exclusion is motivated. Section 3.6 will suggest places for them in the general
theory. Subordinating conjunctions, on the other hand, can be described as preposi¬
tions with sentential complements, so they do participate in the feature system.
The feature system thus looks like this:
We will weaken this claim slightly in section 3.6. But aside from minor exceptions, we
will claim that (3.1) represents the canonical form for all phrase structure rules.
Notice what sorts of rules (3.1) excludes. One kind of phrase structure rule that
has been widely accepted in the literature is so-called Chomsky-adjunction,3 which
appears in a popular source for relative clauses (3.6a), one account of the verb-particle
construction (3.6b), one theory of manner adverbs (3.6c), and many other places.
(3.6) a. NP —* NP - S
b. V -* V _ Prt
c. VP —» VP - Adv
Since these rules generate no category on the righthand side that is one level lower
than the category on the lefthand side, they are impossible within schema (3.1).
In Vergnaud’s (1974) account of relative clauses, there is a phrase structure rule of
the following form:
Such a rule is impossible within schema (3.1), since (3.1) requires a rule expanding N"
to contain an N' somewhere on its right-hand side.
Berman (1974, 109) proposes a number of structures for comparative constructions
which illustrate violations of (3.1) and which seem in general counterintuitive. There is
nothing in the standard theory of phrase structure to rule them out, though:
QP
3 This term has nothing to do with Noam Chomsky, and he regrets its existence (personal communica¬
tion). However, since it seems firmly established in common linguistic usage, I will retain it, sparingly, with
apologies.
3: A THEORY OF PHRASE STRUCTURE
other categories as well, including the minor lexical categories Q and Adv. Thus, at the
expense of some otherwise superfluous structure, the most uniform hypothesis is that n
equals 3 for all categories. We will call an X'" a major phrasal category.
This hypothesis makes possible an answer to the second question above, close to
the proposal of Emonds (1976, chapter 1). We will claim that every category to the left
or right of X"-1 in (3.1) is either a major phrasal category or a specified grammatical
formative such as have, number, case, or tense, and probably that it is optional. Thus
the canonical form (3.1) for phrase structure rules can be refined to (3.9).
(3.9) Xn (Cj). . .(Q) - X"-1 - (Cj+1). • .(Cfc), where 1 < n < 3, and for all C,,
either C{ = Y'" for some lexical category Y, or Cj is a specified
grammatical formative.
We combine the feature notation of section 3.2 with the prime notation to arrive at
distinctive feature analyses of all syntactic categories. For example, NP (now N'") is
designated as [ + Subj, -Obj, +Comp]'"; a rule applying either to P' or V' will mention
the feature complex [ — Obj, +Comp]'. The combination of the two notations makes the
desired claim about possible cross-category syntactic generalizations: generalizations
must be across categories of the same level, so it is impossible, for example, to pick out
both N" and Q' with a single term of a structural description. This hypothesis, as will be
seen, is often crucial in deciding which of a number of possible structures to assign to a
construction, in that parallelisms with the structures associated with other lexical
categories constrain the choices in interesting ways.
Before going on, let us clarify our terminology. We will continue to use the
traditional terms S, NP, AP, AdvP, PP, and QP informally for the major phrasal
categories associated with V, N, A, Adv, P, and Q, respectively; the Three-Level
Hypothesis claims that these traditional symbols for the major phrasal categories are
equivalent to what rule schema (3.9) defines as V'", N"', A'", Adv'", P'", and Q'". A
glossary of all category names and their feature analyses appears at the end of this
chapter.
3: A THEORY OF PHRASE STRUCTURE
Chomsky uses the term specifier to refer to the material in a phrase to the left of
the head, and complement to refer to the material to the right of the head. According to
the tree structures in “Remarks”, Chomsky considers specifier to represent a syntactic
category, but complement is simply an abbreviatory term for some concatenation of
ordinary syntactic categories. However, there is to my knowledge no evidence that
either complements or specifiers function as constituents—they do not move or delete
as units, and unlike normal constituents, no part can be designated as a head.
Consequently, I will use the terms specifier and complement for expository conveni¬
ence only, with no theoretical significance implied.4 In (3.9), the terms Clf . . . , C} will
be referred to as X" specifiers, and Cj+1, . . . , Cfc as X" complements.
This terminology is suitable for discussion of English, though not perhaps for other
languages, since it calls the direct object in an SOV language part of a specifier, not of a
complement. In fact, we will see even in English that there are a few cases where a
particular grammatical relation is defined as part of an X’ specifier for some categories
and part of an X‘ complement for others. It must be understood, therefore, that the
distinction between specifier and complement is to be regarded here as of no theoretical
significance, but only as a convenience.
The next two sections work out two major aspects of English syntax, the subject
and the auxiliary, within the Uniform Three-Level Hypothesis. Besides exemplifying
the hypothesis in the most immediate way possible, they set the stage for the detailed
analyses of complements, specifiers, and relative and degree clauses in subsequent
chapters. After the discussion of the subject and the auxiliary, section 3.6 discusses
certain phrase structure expansions not covered by the Uniform Three-Level Hypothe¬
sis.
The grammatical relation “subject-of (an S)” is defined by Chomsky (1965, 71) as
[NP,S]; that is, in a string exhaustively dominated by S, the subject is a substring
exhaustively dominated by NP, and that occurrence of NP is directly dominated by the
S. We would like to define the relation “subject of an NP” for derived nominals in such
a way that it generalizes structurally with the relation “subject of an S”, so that the
various rules involving subjects need not be stated separately for NPs and Ss.
This is the first of many analyses to be presented here in which structural
parallelism across categories is a crucial consideration. The general principle entailed
by the X Convention is that if parallel grammatical relations exist in two different
categories, the categories must be syntactically parallel with respect to that grammati-
4 Hornstein (1975) claims that calling the specifier not a category but a concatenation of nodes is
unjustified, and that a treatment in which specifier is a category makes a stronger claim. The latter point is
correct, in that my view of the specifier ascribes less highly differentiated structure to sentences. However,
since neither he nor anyone else has given arguments that this extra structure is necessary (e.g. in terms of
movement or deletion), the weaker position seems to be the more supportable.
38 RAY JACKENDOFF
cal relation. In this way rules involving that grammatical relation can be stated so as to
apply to both categories, by appropriate use of syntactic distinctive features.5
Let us compare Chomsky’s proposed deep structures for John has proved the theorem
and several of John's proofs of the theorem.
(3.11) a. S
b. N"
John
5 Note that the X Convention says nothing about what to do with nonparallel structures. Hornstein
(1975) objects to the generalization of the subject relation on the grounds that many other aspects of Ss and
3: A THEORY OF PHRASE STRUCTURE
(The details of SpecN are from Chomsky’s (30), p. 37/200.) We would like to say that
John is subject in both cases. Observe, however, that the subjects are not in
structurally parallel positions. In particular, the subject of the S (3.11a) is dominated by
the third node above the head verb, but the subject of the NP (3.11b) is dominated by
the second node above the head noun. Furthermore, the subject of the NP is embedded
in the Specifier node, whereas the subject of the S is directly dominated by S.
Let us first see what can be done about the structure of the derived nominals. We
can make some improvement by taking several of not as part of the determiner of this
NP but as part of a higher NP, thus:
(3.12) N"
N'
N'
John
Chomsky suggests this possibility in note 26, p. 38/note 27, p. 219. Essentially this
structure for quantifier constructions is argued for in Jackendoff (1968); we will develop
it further in chapter 5.
Next, observe that the possessive affix's occurs with all NPs in subject position in
nominals, including those that arrive in that position through a transformation (e.g. the
NPs are not parallel—for example, Ss have auxiliaries and complementizers, and NPs have determiners. But
these differences are irrelevant: the X Convention says simply that when parallelisms exist, they must be
expressed.
40 RAY JACKENDOFF
city’s destruction by the enemy). This suggests that a relatively late obligatory
transformation adds the affix to whatever NP occupies surface subject position. (In
section 3.6 gerundive nominals will be analyzed in such a way that they too receive
their ’5 by this transformation.) With such a rule, we can eliminate the node Poss in
(3.12) and place the NP dominating John directly under SpecN, yielding (3.13) instead of
the lower N" in (3.12).
(3.13) N"
John
Next suppose we take the view of specifiers proposed in the last section: there is
no category Specifier; rather “specifier” is simply an abbreviation for a concatenation
of nodes. Then SpecN and Specv can be eliminated in (3.1 la) and (3.13), and John is in
nearly parallel positions in the two structures. It remains to modify one or the other of
the structures so that John is attached the same number of nodes up from the head. By
dropping a node in the S (3.11a) we get a parallelism of the form (3.14); by adding an
extra node in the NP (3.13) we get the pair (3.15).
3: A THEORY OF PHRASE STRUCTURE
b. N"
b. N'"
Here are some arguments which favor the uniform-level theories (3.14) or (3.15) over
Chomsky’s mixed-level theory. The arguments all show that where rules generalize
across categories, it is invariably Ss, not VPs, that behave like NPs, APs, and PPs.
Hence, by the assumptions of the X Convention, Ss must have the same number of
primes as the other major phrasal categories.
First, observe that the rule of Topicalization preposes the major syntactic
categories NP, AP, and PP:
(3.17) a. That you were comingC) no one ever expected Bill to find out.
b. *Coming tomorrow(,) no one ever expected Bill to find out that you
were.
The anaphoric expression for VP is do so or do it, which does not generalize directly
with any other category. Hence the generalization calls for S rather than VP being
similar to NP and AP.
Similarly, NPs, APs, PPs, and Ss serve as antecedents of appositives.
possible only if all are of the same level. (See sections 4.1 and 7.2-7.3 for further
discussion of appositives.)
Certain predicates permit a PP to replace the subject it.
This PP movement appears to generalize with the movement of S into subject, called
variously Subject Replacement or Intraposition.6
Again there is no generalization of this sort between PP and VP. Thus the phrase
structure rule schema demands that PP and S be of the same level if this generalization
is to be captured.
PP and S generalize again in a rule which extraposes certain PPs from within the
subject and also extraposes relative clauses.7
The rule of Gapping applies in conjoined Ss and NPs, with precisely parallel
conditions (see Jackendoff (1971) for details):
There is no corresponding generalization with VP. Again, the way in which the rule
generalizes argues that S, not VP, should be of the same level as NP.
Thus there are numerous reasons to consider S structurally parallel to the other
major syntactic categories, arguing for the uniform-level theory.
One difference between Ss and NPs, pointed out by Emonds (1976), is that
subjects are obligatory in Ss but not in NPs. This fact is an apparent counterexample to
the claim in section 3.3 that only heads are obligatory constituents. Since Emonds too
accepts this claim, he concludes that the subjects of Ss are outside of the constraints of
6 I assume here the account of Emonds (1970), not that of Emonds (1976). Some discussion appears in
sections 4.3 and 4.8.
7 Or, if there is not an extraposition transformation, the generalization is in the projection rule. See
Akmajian (1975) for evidence that this rule applies also in NP, a further generalization of NP and S'.
44 RAY JACKENDOFF
the X Convention, being generated by Chomsky’s special phrase structure rule S —> N"
- V".
However, let us examine the constraints on S subjects a little more closely. It is
generally considered to be the case that those sentences which lack surface subjects,
namely /o-infinitives, do so because the underlying subject has been deleted or is the
phonologically null form PRO. However, at least two recent articles, Lasnik and
Fiengo (1974) and Brame (1975), have argued that certain of these constructions contain
surface infinitives with no deep subject. Their claim is that these infinitives are VP
complements in deep structure, not Ss. Such a solution violates the phrase structure
rule schema (3.9), since it involves generating a V" to the right of a head. But an
alternative is to generate them as subjectless Ss, i.e. as Ss which do not even contain a
PRO in subject position. This alternative is equally consistent with Lasnik and Fiengo’s
and Brame’s arguments. Then the phrase structure rules could generate an optional
subject in S, its obligatoriness under most conditions being due to conditions extrinsic
to the phrase structure rules. Whether such conditions can be independently motivated
must be left for future research; but such a solution simultaneously meets Emonds’s
objection and provides a way of generating so-called VP complements within the
present framework. Should this solution not prove viable, we can, of course, always
accept a weakening of the optionality condition to allow obligatory subjects.
A second difference between S subjects and NP subjects pointed out by Emonds
concerns a general constraint in English against embedding sentences in specifiers,
including subjects of NPs.
Note that the bracketed phrases in (3.24) are grammatical in other contexts, and
that if the subordinate Ss are eliminated from the bracketed phrases, (3.24a-g) become
grammatical. Apparently the only exception in English to this restriction is subject
position in Ss, which obviously may contain a wide range of embedded sentences.
Emonds argues therefore that this embedding constraint would be more general if the
subject were not considered a part of the specifier of V, but were rather outside V", as
in Chomsky’s system. In a uniform level theory, on the other hand, the sentence must
be an exception to the embedding constraint.
3: A THEORY OF PHRASE STRUCTURE
However, there are languages in which even subjects may not contain sentential
complements. One is Walbiri, as pointed out by Hale (1975). Interestingly enough,
another is Old English, as pointed out by O’Neil (1976), who studies how relative
clauses in subjects came to be possible in English through a structural reanalysis of
certain topicalized constructions. In order to maintain Emonds’s generalization, one
would have to make the rather dubious claim that the historical change in English
involved altering phrase structure in a radical way, moving the subject from V" into
V'". A much more plausible change is that the embedding constraint itself was relaxed
to permit the observed exceptions; this accords with O’Neil’s analysis. Thus, accepting
the nongenerality of Emonds’s embedding constraint does not seem at all to be a
consequence damaging to the uniform-level theory.
One problem that remains in justifying the Uniform-Level Hypothesis is the treatment
of the complementizer. The question is where the complementizer is attached to the
sentence. All the movement rules cited in section 3.4.2 move the sentence with its
complementizer, and no rule ever leaves the complementizer behind. Thus, the
simplest hypothesis is that the complementizer is a left sister of the subject, attached to
V" in (3.14a) or to V'" in (3.15a). Unfortunately, this hypothesis appears to run afoul of
the analysis of Gapping, for Gapping applies only if the complementizer is absent from
the second clause:
(3.25) a. It is hard to believe that Jack hates swimming and (*that) Fred
fishing.
b. For Jack to hate swimming and (*for) Fred fishing would be a
tremendous surprise.
One explanation of this is that the difference in structures is as shown in (3.26a) versus
(3.26b).
(3.26) a. S
b. S
If Gapping were defined over S conjunction but not over S conjunction, the difference
would be immediately accounted for.
Other evidence against the theory that the complementizer is the sister of the
subject is provided by Bresnan (1974), who shows that Right Node Raising applies only
to single constituents, but that it can apply to sentences with or without their
complementizers:
(3.27) a. Mike wouldn’t tell us, but Randy readily volunteered, that Jenny was
drinking again.
b. I’ve been wondering whether, but wouldn’t positively want to state
that, your theory is correct.
Again this argues that the complementizer is a sister of the entire S, not of the subject.
This raises a difficulty for the generality of movement rules under the X
Convention: since S is one level higher than NP, i.e. apparently V4, it cannot be moved
by the same rule that moves an N'", a P"\ or an A'". There are at least two possible ways
to deal with this problem. One would be to claim that S is indeed V4 (assuming a three-
level S and NP), but that the A-over-A Convention actually generalizes over category
levels: the structural description of a rule would be met by the highest-level constituent
of the category mentioned by the rule. For instance, by mentioning V in a rule, the
structural description would be met by V4; by mentioning N, the structural description
would be met by N'". In order to pick out the lexical category verb, a rule would have
to mention specifically V°. This is a not unattractive solution, although some care
would be necessary in checking its feasibility. It has the interesting property of vitiating
all the above arguments for parallelism based on movement rules, since a rule
mentioning, say, +Subj will apply to either N'" or V4.8
8 Emonds has pointed out (personal communication) that this solution is in some sense a return to one
attractive aspect of Harrisian transformations: one would state, say, an NP movement rule as ‘‘move N”; all
the modifiers would be moved along automatically. This emphasizes the centrality of the notion “head” to
syntax in a highly suggestive fashion.
3: A THEORY OF PHRASE STRUCTURE
A less drastic solution is to claim that both S and S are instances of V'":
(3.28)
Comp
Under this proposal, the A-over-A Convention guarantees that any movement rule
moves S rather than S, while preserving the generalization with the movement of other
X'". Such a structure represents a weakening of the theory of phrase structure, since it
contains the forbidden configuration of a category dominating itself. Section 3.6 and
chapter 9 will show, however, that this exception can be accommodated within a highly
constrained class of rules, generated by a second phrase structure rule schema of
considerable interest.
Which of these two solutions for the complementizer turns out to be correct,
however, does not seem to be of major importance to the main hypothesis. For
simplicity, we will assume the second solution, in which the complementizer is
Chomsky-adjoined to S. This problem disposed of, the generalization of subjects
provides considerable evidence for the Uniform-Level Hypothesis.
The problem of this section is where the various parts of the auxiliary are attached and
whether they provide evidence for the general structure of the S. The position adopted
here is a modification of the analysis in Jackendoff (1972, section 3.8), based in part on
1966 lectures by Klima and on Emonds (1970); other revisions have been proposed by
Akmajian and Wasow (1975). In turn, these are all variants of the original analysis of
Chomsky (1957).
The verb and its complements apparently form a constituent that does not include
the modal, since there is a rule (pointed out by Ross (1967)) which fronts a VP after
modals under certain conditions:
(3.29) They said she may attempt to leave, and attempt to leave she will.
The rule of VP-Deletion provides evidence that the verb and its complements form a
constituent which does not include the aspectual verbs have and be, since a VP may
delete, leaving aspect intact.9
9 Hornstein (1975) presents this as evidence against the two-level theory of S, in which aspect must be a
daughter of V'. As will be seen, a three-level theory accommodates it.
48 RAY JACKENDOFF
Jackendoff (1972) gives evidence that the first auxiliary is a daughter of S, but that
subsequent auxiliaries are not daughters of S. The evidence is that sentence adverbs
such as frankly, probably, and evidently occur in all possible positions as daughter of
S—initial, final with comma intonation, and before the auxiliary. They also occur after
the first auxiliary, but not after subsequent ones.
frankly
I have amused
(3.31) a. George will probably ' the children by the time
[be amusing
evidently
we get there.
frankly
have .amused
b. ?*George will ' probably them when we get there.
be amusing
evidently
A simple way to account for this difference is to claim that the first auxiliary is a
daughter of S but that the second auxiliary forms a constituent with the verb and its
complements; then an adverb following the first auxiliary can be a daughter of S, but an
adverb following the second auxiliary cannot be.
These three arguments for constituency lead to a three-level theory of the S:
(3.32) V'"
One of the problems in Chomsky’s original analysis of the English auxiliary is the
M
recurrence of the configuration have in transformational rules as the first verbal
be
element, including all uses of be and the aspectual use (and in some dialects the
possessional use) of have. The solution of Klima, Emonds (1970), and Jackendoff
(1972) is an obligatory transformation Have-Be Raising which moves the appropriate
uses of have and be to a position under Aux, just in case there is no modal present.
3: A THEORY OF PHRASE STRUCTURE
After this transformation, any transformation can refer to the first auxiliary by
mentioning the node Aux in its structural description, and the need for the repetitive
braces is eliminated.
McCawley (1975) objects to the rule of Have-Be Raising on the grounds that it is
entirely arbitrary to treat have and be as unlike other verbs in undergoing this rule; and
that if modals are a category separate from verbs, there is no reason for modals, have,
and be to behave alike. The theory of categories of sections 3.2 and 3.3, however,
makes possible a refinement of the analysis which meets McCawley’s objections in a
rather interesting way.
Recall that the syntactic distinctive features of Modal are [+Subj, +Obj, —Comp],
contrasting with verbs only in the last feature. Now suppose that the lexical entries for
the appropriate uses of have and be are assigned the syntactic features [+Subj, +Obj],
being unmarked for Comp. Then, by the usual conventions for applying rules, any rule
which applies to the feature complex for modals, [+Subj, +Obj, -Comp] will also
apply to have and be but to no other verbs. In other words, [ + Subj, +Obj] corresponds
to Choms cy’s (1957) category v; [ + Subj, +Obj, -1-Comp] to V; [ + Subj, +Obj, -Comp]
M
to have The exceptionality of have and be and their falling together with modals is
be
expressed naturally in terms of such a lexical feature analysis; and the description of
the auxiliary thus looks considerably less arbitrary than before.10
For a further refinement, observe that the Uniform Three-Level Hypothesis claims
that Modal is the progenitor of a series of larger categories M', M", and M'". The
category Aux, which so far has not played any role in the X Convention, can therefore
be analyzed as M"\ Although there is no empirical evidence that as many as three
levels are necessary, the superfluous structure is harmless and makes a more highly
constrained theory possible; furthermore, the extra rules are part of universal grammar,
and thus add no cost to particular grammars. We will arbitrarily attach Tense as a
daughter of M'" rather than at any of the lower levels, yielding the following phrase
structure rules:11
10 Wendy Wilkins has pointed out (personal communication) that this analysis brings to mind the
arguments in Chomsky and Halle (1968, 382-385) which show a possible illegitimacy in the use of partially
specified phonological feature matrices. However, examination of those arguments reveals that they depend
on the fact that phonological rules may alter distinctive features. Since, under the assumptions of the
Lexicalist Hypothesis, syntactic category features are never changed by transformations, the use of partially
specified syntactic features is not susceptible to Chomsky and Halle’s criticisms.
11 Akmajian (personal communication) has objected to this approach to the auxiliary on the grounds that
it is a distortion of the actual facts of English, and that there are auxiliary constituents in other languages
which show no evidence of a modal. The approach taken here, however, is forced on me by the highly
restrictive phrase structure rule schema (3.9). The quotation from Chomsky in the introduction to this
chapter is relevant here: either this analysis will prove that the theory must be weakened in a highly specific
way, or else it will provide a new analysis of what is otherwise an unprincipled exception to the general
theory.
50 RAY JACKENDOFF
Now since the phrase structure rule schema also predicts that a head is obligatory
in M'", the only way a modal can be omitted from a tensed sentence is to generate the
empty node A under M. Thus both Have-Be Raising and Do Support can be stated in
such a way as to fill this empty node with a category nondistinct from M, making one
further peculiarity of the English auxiliary somewhat less peculiar.12
One further minor point: the phrase structure rule schema (3.9) claims that all
complements and specifiers are optional, at least from the point of view of phrase
structure rules. This is certainly true of M'" in rule (3.33a), since in infinitive clauses
there is no evidence that there has ever been a tense or modal present. We can thus feel
justified in regarding the obligatoriness of Aux in main clauses as a semantic condition,
or at least as a condition extrinsic to the phrase structure rules proper.
It is a little more difficult to justify making Tense optional in (3.33b), since the
accepted syntactic treatment has always required it. Its obligatory presence seems in
any event to be a minor exception to the overall generalization that only heads are
obligatory in phrase structure expansions.13
One obvious exception to the theory of phrase structure presented so far is coordina¬
tion. It is generally agreed that the node dominating conjoined Ss is an S, that the node
dominating conjoined NPs is an NP, and so forth.
12 If there are no auxiliaries, and Affix Hopping attaches Tense to the main verb, presumably the entire
M'" is deleted, so as to meet Emonds’s (1976) condition that no empty nodes appear in surface structure.
13 Emonds (1976) suggests that only phrasal categories are always optional; grammatical formatives are
optional or obligatory. This weakening accounts for at least the present cases properly.
3: A THEORY OF PHRASE STRUCTURE
(3.34) a. S
mushrooms toadstools
This violates even the hierarchic arrangement of categories predicted by (3.1), the
primitive version of the phrase structure rule schema. Furthermore, no constituent of a
coordinate construction can be identified as its head. Clearly a separate phrase
structure rule schema is necessary. (3.35) is one possible form.
(3.36) N'"
Schachter (1976) and Horn (1975) independently suggest an alternative structure, in our
terms represented as (3.37).
52 RAY JACKENDOFF
(3.37) N'"
N'" N"
(3.37) has at least three immediate advantages over (3.36). First, it enables the rule
inserting ’s (POSS) on the subject of an N'" to be generalized to the subject of gerunds
without further ado. Second, it makes possible a certain limited class of gerunds such
as (3.38) (quoted from Schachter).
b. N" -* ing - V
Either of these rules provides a way to use a verb phrase as a noun phrase, i.e. not
as a verb phrase with respect to the context in which it is embedded. I believe that
these rules are not an unprincipled exception to the phrase structure rule schema of
section 3.3, but are part of a class of “deverbalizing” phrase structure rules described
by the schema (3.40).
3: A THEORY OF PHRASE STRUCTURE
(3.40) X! —> af - V*
That is, a deverbalizing rule must expand a category as the verbal supercategory of the
same level, and there may be no complements or specifiers other than an affix or other
grammatical formative indicating the change of category.
This class of rules has up to now not been distinguished, perhaps because there has
been no theory of phrase structure sufficiently constrained that rules such as (3.39)
emerged as exceptional. Chapter 9 presents a number of candidates for deverbalizing
rules, some of which are not at all obscure.
One particularly relevant example should be mentioned now. Recall that in section
3.4.3 we proposed that the complementizer be generated with rule (3.41).
The X Convention as formulated in section 3.1 makes three claims: the class of possible
lexical categories is determined by a set of distinctive features; the class of syntactic
categories is determined by elaborating the lexical categories in terms of the prime
notation; rules of grammar are to be stated in terms of these features and primes. We
have argued that this theory can be strengthened into the Uniform Three-Level
Hypothesis: for every lexical category X, there are syntactic categories X', X", and
X'", and no more, and the major phrase structure rules elaborating these categories are
of the form given by rule schema (3.9), repeated here:
(3.9) Xn -* (Cj). . .(Cj) - X”-1 - (Cj+1). . .(Cfe), where 1 < n < 3, and for all C„
either C, = Y'" for some lexical category Y, or C, is a specified
grammatical formative.
One argument has appeared which bears on the need for three levels: the
constituent structure of the auxiliary requires a level for aspectual verbs between the S
level and the V' level (section 3.5). Further arguments appear in subsequent chapters.
Rule schema (3.9) provides the bulk of the phrase structure rules of the language,
but there are at least two other schemata which generate possible configurations ruled
out by (3.9): the schemata for coordination (3.35) and for deverbalizing rules (3.40).
These provide places in universal grammar for the parts of speech which do not
54 RAY JACKENDOFF
The main generalization worth noting at this point is that between (3.42a) and
(3.42b), involving the position of the subject. These two rules can be generalized via
feature notation into (3.45).
nt
(3.45) X + Subj
+ Subj -Obj
- <(M"')> - X"
<+Obj> ( + Comp)
_+Comp _ +Det
chosen on the left, +Comp must be and M"' may be chosen on the right; if —Obj is
chosen on the left. Comp is unspecified on the right, so either N'" or Art"' may be
chosen, and M"' may not be chosen.
Unfortunately, the clumsiness of this notation will turn out to be proportional to
the complexity of the rules being collapsed. (3.45) is a relatively simple case, and far
more unsightly cases will appear in the next chapter. We will eventually conclude that
the traditional abbreviatory conventions are not sufficient to express the linguistically
significant generalizations of English phrase structure. Insofar as the choice of
abbreviatory conventions is directly linked to the evaluation measure, this conclusion
raises deeper questions for linguistic theory than we will be able to answer here.
On the following page is a glossary of lexical and syntactic categories.
56 RAY JACKENDOFF
+ Subj
V + Obj VP PredP S, S
+ Subj _ + Comp_
L+Obj J + Subj
M + Obj Aux
. — Comp.
+ Subj
N -Obj Nom Nom NP
_ + Comp_
~+Subj
+ Subj -Obj
Art Det
L-Obj J -Comp
_+Det
+ Subj
-Obj
Q QP
-Comp
-Det
-Subj
p +Obj PP
-Subj _+Comp_
L+Obj J -Subj
Prt + Obj
_-Comp_
-Subj
A -Obj AP
_ + Comp_
-Subj
— Subj -Obj
Deg Det (?)
.-Obj . -Comp
_+ Det
"-Subj
-Obj
Adv AdvP
-Comp
— Det
4: Complements
If we classify complements on semantic grounds, we find that there are three distinct
ways in which a complement may be integrated into a semantic interpretation: as a
functional argument, as a restrictive modifier, and as a nonrestrictive modifier. We will
attempt to identify these respectively with X', X", and X'" complements. Thus we will
be able to claim that there is a strong correlation between syntax and semantics in the
complement system.1
Let us begin with functional arguments. Those lexical items which strictly
subcategorize phrases in their environment can be thought of as semantic functions
which take as their arguments the interpretations of the strictly subcategorized phrases.
For example, the verb give strictly subcategorizes a subject, an object, and an indirect
object, and can be thought of as a semantic function f(x,y,z) which maps ordered triples
of terms into propositions. Such an approach is developed in Katz (1966; 1972) and
Jackendoff (1972; 1976), among other works, and has been used implicitly in almost
every approach to semantic interpretation in generative grammar. Similarly, the noun
part (of) strictly subcategorizes an NP, and can be treated semantically as a function
g(x) which maps terms into terms; the adjective proud (of) maps terms into properties
1 Williams (1975) comes to similar conclusions about the complement system, on the basis of somewhat
different evidence. Readers familiar with his work will notice that I have only three complement categories
instead of his four. His types I, II, and IV correspond to X', X", and X'" complements, respectively. His type
III, the most problematic for him, appears to divide into X" and X"' complements in a way not fully clear to
me.
58 RAY JACKENDOFF
(or predicates, or whatever kind of object the intension of adjectives is); the preposition
to maps terms into (intensions which pick out) directions. Aside from subjects, all
functional arguments in English come immediately after the head, typically preceding
all other modifiers, and many grammarians have had the intuition that they are the most
“tightly bound” to the head of all the complements. This intuition can be expressed by
assigning all and only the strictly subcategorized phrases to the X' complement in deep
structure.
In many cases a phrase can be identified as strictly subcategorized by the fact that
it cannot be omitted from the sentence without incurring ungrammaticality. For
example, the PP in Joe put the book on the table must be strictly subcategorized, since
*Joe put the book is ungrammatical. However, this is only a sufficient condition for
strict subcategorization, not a necessary condition, since many words optionally
subcategorize phrases. For example, tell occurs in the contexts John told Bill a lie and
John told Bill, but a lie appears to operate semantically as a functional argument in the
former sentence and hence must be strictly subcategorized. Similarly, when adjectives
and nouns strictly subcategorize, their arguments are usually optional.
A second criterion for X' complements in Ss and NPs is provided by certain
anaphoric processes. The phrase do so appears to be a pro-V', and may be followed
only by material which is outside the V' complement (usually part of the V"
complement).2 Consider (4.1).
The ability of on Tuesday to follow did so indicates that it is a V" or V"' complement in
this sentence; the inability of on the chair to follow do so indicates that it must be
inside of V' in the antecedent sentence.
A parallel anaphoric process in NPs is the use of the pro-N' one. Lakoff (1970a)
observes that there is a contrast in the applicability of one which depends on the nature
of the complement.
(4.2) a. Jack met the king from England, and I met the one from France,
b. *Jack met the king of England, and I met the one of France.
Notice that the PPs of England and from England play different semantic roles, since
there may be a king of England from France. Of England appears to be a functional
argument, since it specifies part of the function of the king; from England, on the other
hand, specifies a somewhat inessential part of kinghood. Notice further that the two
PPs may appear together only in one of the two possible orders: we cannot get *the
king from France of England. These facts can be described simultaneously if we
suppose that of England is an N' complement and from England is an N" complement.
2 This distinction in the use of do so was first observed by Lakofif and Ross (1966). 1 assume that the
interpretation of do so will be carried out by focus-dependent anaphoric processes such as Akmajian (1973)
describes.
4: COMPLEMENTS
Based on this reasoning, the proper formulation of the difference between (4.2a,b)
appears to be this: the pronoun one cannot be followed by the phrase of NP within the
N' complement. It can, however, be followed by other N' complements, as will be seen
shortly.3 Thus, the inability of a particular of NP phrase to follow the pronoun one
(where the antecedent is count) is a sufficient test for its being an N' complement. (4.4)
is the forbidden configuration.
(4.4) N'"
This test requires some care. At first glance it would appear that of water must be
an N" complement in (4.5), since it follows one.
However, we notice that one can be pluralized in (4.2a) but not in (4.5).
31 am grateful to Noam Chomsky for pointing this out to me. See footnote 4.
60 RAY JACKENDOFF
The difference is explicated if we realize that there are two morphemes one that can
function anaphorically in an NP, a numeral (a kind of Q, alternating with two, three,
etc.—see section 5.5) and a pronoun. Only the latter, which takes the place of the head
of the NP, can be pluralized to ones. (4.6b) shows that the one in (4.5) is the numeral
oneQ, and that in fact the pro-N' oneN cannot be followed by the complement of water
here. We conclude therefore that of water is indeed an N' complement in (4.5), and so,
by parallelism, is of wine.
There are certain cases that appear to be ambiguous between N' and N"
complements without appreciable difference in meaning. For example, because of the
parallel between the direct object in (4.7a) and the PP in (4.7b), one would want to
claim that the PP is an N' complement.
Yet the pictures of Fred and the ones of Harry is also acceptable, arguing that of Fred
must be an N" complement, similar to the pictures which are of Fred. The simplest
solution is to accept both sources for such a case.
Another test to distinguish N' from N" complements is based on a distinction
noticed by Lakoff (1970b):
Notice that of few children is, by our previous criterion, an N' complement, since
*ones of few children is ungrammatical. But with few children can be paraphrased by
the relative clause who have few children, suggesting that it is an N" complement. This
conjecture correctly predicts that only one order of the two complements is possible,
i.e. fathers of few sons with many daughters but not *fathers with many daughters of
few sons. To describe the difference in grammaticality between (4.8a,b), then, one is
led to the hypothesis that a quantifier may extend its scope out of an NP dominating it
if it is in the N' complement but not if it is in the N" complement. Thus any could be in
the scope of few in (4.8a) but not in (4.8b); since any must be within the scope of
negation, (4.8b) is unacceptable. I can offer no reason why this distinction should exist,
but it is confirmed by considering the possibility of vr/7-questions and relatives:
according to the previous criterion, only N' complements can be wh-ed.
(4.10b) is all right if it is read arguments against which premises, parallel to arguments
with which people, but impossible if read arguments employing which premises. It is
plausible to assume that a person or premise against which an argument is directed is
strictly subcategorized, i.e. an N' complement, but that the instrument of argument, the
premise employed, is an N" complement like other instrument phrases. This is
supported by the grammaticality of arguments with Bill with few premises and the
ungrammaticality of *arguments with few premises with Bill. Thus, this distinction
between complement types supports the proposed interaction between level of comple¬
ment and scope of quantifiers and wh; we see that inability of quantifiers or wh to
extend scope outside the NP seems to be a sufficient condition for N" rather than N'
complements.
Since there has been little study of a possible distinction among different kinds of
NP complements, I am not familiar with enough strong cases to completely verify the
validity of the two tests described above, much less to explain why these particular
distinctions should obtain. However, the proposed distinction between N' and N"
complements does seem to be fairly well borne out by the few examples cited here.4
The distinction between X" complements and X'" complements is somewhat more
straightforward. In sentences, the V" complements are the expressions of manner,
means, accompaniment, instrument, purpose, and other so-called VP adverbials.
Semantically, they map predicates into predicates of the same number of arguments,
and they contribute to the main assertion of the sentence. As such, they can be
focused, clefted, and affected by sentence negation:
Because they add extra truth conditions to the assertion of the sentence, restricting the
extension of the sentence, V" complements can be called restrictive modifiers.
V'" complements, by contrast, add no conditions to the assertion of the sentence,
4 Notice, by the way, that (4.10) provides evidence that the ones test is valid only for of-NP
complements, not for other PPs in N', since both of the following examples are acceptable.
(i) Arguments with Bill are less fruitful than ones with Harry. (N' complement)
(ii) Arguments with many premises are less impressive than ones with few premises. (N" complement)
5 Just because they are in V" and focused does not mean they are automatically affected by negation.
Much depends on intonation. Cf. Jackendoff (1972, sections 6.7 and 8.6).
62 RAY JACKENDOFF
but rather add some sort of auxiliary assertion (one of whose arguments is usually the
main assertion). They include sentence adverbials of all sorts, sentential appositives,
parentheticals (e.g. John is a fink, i think), and various other subordinate clauses.
When they occur at the end of a sentence, they are set off by comma intonation. Since
they are not part of the main assertion of the sentence, they cannot be focused, clefted,
or affected by sentence negation:
The geometry of the sentence predicts that V'" complements must follow V"
complements, and this prediction is obviously borne out (except for some stylistic
inversions).
Similar contrasts can be found in NPs. The clearest example is the difference
between restrictive and nonrestrictive (appositive) relative clauses; the former follow
previous modifiers without a break, may contain foci, and may be affected by sentence
negation; the latter are separated by comma intonation, may not contain foci, and may
not be affected by sentence negation.
We would like to claim, therefore, that restrictive relative clauses are deep structure N"
complements and that appositives are deep structure N'" complements. But since many
proposals exist for deriving relative clauses from other sources, we will delay the
defense of this claim to chapter 7, where more space can be devoted to it.
s The insertion of parentheticals into V" may be what Banfield (1973) calls a stylistic transformation, one
of a class of rules with somewhat different properties from ordinary transformations. Alternatively, Emonds
(1976) gives an extensive treatment of parentheticals in which the complement is moved to the right of the
parenthetical.
4: COMPLEMENTS
However, other constructions besides relative clauses can appear in N" and N'"
complements. For example, the PPs in the king from England, the weather at 6:00, the
man with a big nose, etc., are not in the N' complement, follow the head without a
break, and can be focused and affected by sentence negation.
They are thus candidates for N" complements. Descriptive adjectives, despite their
prenominal position, have similar semantic properties to these PPs, arguing that they
are attached to N" (see section 7.4 for more discussion). This of course makes them
parallel in structure and function to preverbal VP adverbs, which are in V", consistent
with the predictions of the X Convention.
By contrast, there are certain nonsentential appositives which can be characterized
as N"' complements:
(4.16) a. I will sell you these bagpipes, the finest in all Poland, for only 4000
zloty.
b. Perhaps you have heard of my brother, known the world over as a
notorious womanizer.
c. She presented Picasso, then in his blue period, with a blueberry pie.
One might want to argue that all these are reduced nonrestrictive relative clauses, and
this is certainly conceivable. In any event, they have the characteristic comma
intonation and inability to be affected by sentence negation which we identify with an
X'" complement.
Again, the N'" complements must follow the N" complements, as predicted by the
geometry of the NP:
(4.17) the man that brought the strawberries, who was dangerous
*the man, who was dangerous, that brought the strawberries
We see therefore that there are three distinct kinds of complements in both
sentences and NPs, corresponding to the three levels predicted by the phrase structure
rule schema. Furthermore, the parallels in structure and function across the two
categories S and NP are clear and confirm the view of grammatical parallelism
advocated in chapter 3.
In the remaining major categories, AP and PP, the complements are less produc-
64 RAY JACKENDOFF
tive, but the three types are distinguishable. Both adjectives and prepositions strictly
subcategorize arguments, as was pointed out above. Most of the modifiers at the A"
and P" level are part of the degree system, which precedes the head; they will be
treated separately in chapter 8. At the A'" and P " level, there are appositive clauses
again, as usual marked by comma intonation.
As usual, the order of the appositive and the strictly subcategorized arguments cannot
be reversed:
Thus the facts of APs and PPs are consistent with the Three-Level Hypothesis,
although they do not push it to its limits as do Ss and NPs.
This section has shown that the three levels of complements made possible by the
theory of phrase structure of chapter 3 can be rather clearly distinguished, both
syntactically, through ordering and anaphoric behavior, and semantically, through the
means of integration into the interpretation. We have seen that there is strong evidence
for all three levels in S and NP, and for at least two levels in AP and PP.
The rest of this chapter will be devoted to the details of X' and X" complements,
concerning itself in particular with what cross-category generalizations appear in their
phrase structure. We begin with V' and N', the most elaborate and best-known
structures, then move to V" and N" complements in section 4.3. Section 4.4 deals with
the complements of A, Adv, and P, and section 4.5 discusses how the rules generalize.
The appendices deal with further problems of NP and S complements.
This section will be concerned with showing how a number of important syntactic
differences between Ss and NPs stem from differences in the phrase structure rules for
V' and N'.
The verb and direct object are obvious. Skipping the next three positions for a moment.
4: COMPLEMENTS
the two PPs are to provide for the strictly subcategorized arguments in constructions
like (4.21a). To see the need for both PPs, note that neither PP can follow do so
(4.2lb,c), although (4.2Id) shows that the verb talk can in other cases serve as the
antecedent of do so. We conclude that both PPs in (4.21a) are in V'.
[AdvP]
Return now to the position. It is needed for the strictly subcategorized
LQP J
phrases in constructions like these:
. . ., , . I handsomely I
(4.22) a. Thejobpa,d(us)|too|.n|e j.
Note that the AdvPs and QP in (4.22) cannot be omitted without incurring ungrammati¬
cally, and that, unlike most such phrases (cf. (4.23c)), they cannot follow do so.
Thus the adverbs in (4.22) pass the tests for V' complements.
The S in (4.20) represents the subordinate clause position in sentences like No one
would claim that Ss are NPs. There is no doubt that such an S is strictly subcatego¬
rized; it cannot be omitted, and it cannot follow do so. Furthermore, it is little
remarked that although the S must follow all other strictly subcategorized phrases, it
may be followed by manner and degree phrases.
We will suppose, therefore, that these Ss are daughters of V' in deep structure, and
66 RAY JACKENDOFF
that their more usual surface position at the end of V" is the result of an extraposition
transformation, perhaps the same one that extraposes heavy NP direct objects.7
Returning to the two positions after the direct object, we first find Particle.8 Emonds
(1972a) argues that in verb-particle constructions such as (4.25), the separated form
(4.25a) is underlying and the form with the particle next to the verb (4.25b) is derived.
The principal objection to Emonds’s position is that look up is a lexical item and thus,
in the standard theory, must be inserted under a single constituent in deep structure;
(4.25b) but not (4.25a) can fulfill this requirement on deep structure. However, Emonds
points out idioms like take NP to task and take NP for a ride ('dupe NP’), which (like
(4.25a)) are discontinuous and which therefore provide independent evidence against
monocategorial lexical insertion. This vitiates the argument against (4.25a) being an
underlying form.
Following the particle is what Emonds has called the Predicate position. This
constituent appears in sentences like (4.26).
It has been argued from time to time that these sentences have a deleted complement to
be. However, such sources as *We named the baby to be Hortensia, *You make me
(to) be sick, the alleged deep structures, are implausible, since elect, name, and paint
(at least) never appear at all with an infinitive complement. Thus, given the choice of
adding a transformation or extending the base to account for these verbs, the latter
solution appears better motivated. In fact, we can then claim that the copula involves
the Predicate position in John is a boy and John is sick, capturing the semantic
generalization between (4.26) and the copula directly in the formulation of the base and
7 As ever so slightly suggestive evidence for this extraposition, notice that no intonation breaks are
possible in (4.24), but pauses may be inserted in the extraposed forms:
(i) He claimed(,) in a loud strident voice(,) that Ss are NPs.
(ii) He believes(,) quite fervently(,) that Ss are NPs.
And even when the pauses are not inserted, the intonation in these forms is somewhat more “leisurely”.
Since such pauses are characteristic of deletion sites or traces (ct. Selkirk (1972)), (i)-(ii) must be the derived
forms, not (4.24).
8 According to the phrase structure rule schema, this should probably be Prt'", with no complements or
specifiers in the expansion of Prt'" down to Prt. Since universal grammar provides this much of the phrase
structure rules, using Prt'" instead of Prt in (4.20) incurs no extra cost to the particular grammar of English.
We will make the revision explicit in section 4.4.
4: COMPLEMENTS
projection rules, without recourse to a transformation. This makes be a verb like any
other in its predicative uses, avoiding the extra phrase structure rule of Syntactic
Structures and Aspects.
Emonds (1970) claims that the Predicate position is dominated by a node Pred
which is in turn dominated by VP. Such an analysis would violate our theory of
categories, of course. I believe, however, that postulating this node is as much a
mistake as postulating a node Agent. Rather, Predicate, like Agent, is a semantic
relation which may be assigned by the projection rule for the NP/AP position in the
VP.9 As syntactic evidence that the second NP node is not distinguished by a
dominating node Pred, we offer a few expressions which use the second NP position
for other than a predicate, and which do not appear to be derived by any sort of dative
shift (these examples were suggested to me by Bowers (1971)):
b. I envy
{you your beauty
*your beauty to you
c. He called
(her names
names to her (wrong meaning)
We need some justification for placing the Prt position before the Predicate. One
piece of evidence comes from sentences like (4.28), with a predicate AP.
Clearly the best position for up is between the direct object and the predicate. I know
of no cases which contain a direct object, a particle, and an NP predicate; but John
grew up a Catholic is suggestive, in that the particle cannot be interchanged with the
predicate as it can with direct objects.10
9Jackendoff (1976), following Gruber (1965), characterizes this semantic relation as Identificational
Location, showing how it fits into a general account of functional semantic interpretation.
10 In Jackendoff (1974a) it was claimed that a Catholic is not strictly subcategorized in this example.
However, it cannot follow do so as can the similar construction to which it was compared there:
I
(i) *John grew up a Catholic and Bill did so a Moslem.
drunk
a saint
having forgotten who he was J
f, but Bill did
sober
so ja sinner
[in full knowledge of the hazards
Though the NP case in (ii) is somewhat worse than the others, it seems better than (i). The evidence that Prt
precedes predicate NP is admittedly inconclusive; but there is no evidence at all against it. This is a case
where we can let the theory decide.
68 RAY JACKENDOFF
Another bit of evidence for the base configuration V-NP-Prt-NP is that it permits
a simple structure-preserving analysis of Particle Movement and Dative Shift. Notice
first that if there is a single object, it can be generated in either NP position, i.e. on
either side of the particle. Hence the difference between (4.25a) and (4.25b) need not be
the application of a transformation at all: it may be due to the deep structure option as
to object position.
Next, observe that when there is a direct object, a particle, and a PP in the
complement, the object may as usual be on either side of the particle, as in (4.29). But,
as Emonds (1972a) points out, in a double-object complement, the particle must fall
between the two objects, as seen in (4.30).
This constraint can be accounted for if Dative Shift is a structure-preserving rule which
moves the object of to into the NP position directly after the verb (lower case denotes
an empty node):
According to this statement of Dative Shift, the structural description of the rule
can be met only if the first NP position in V' is empty, that is, if the direct object is
generated in the position following Prt. Thus, the configuration (4.30a) is the only
possible ordering of the two objects and the particle under this analysis. In turn, the
analysis justifies the base configuration V - NP - Prt - NP in (4.20).
Finally, there are some possible improvements in the statement of (4.20), making
11 In this rule Va denotes the class of verbs that govern Dative Shift.
Emonds (1972a) describes these facts with a Dative Shift that actually interchanges the two objects. The
present analysis is better in that it does not violate the generalization that transformations move only one
constituent at a time.
(4.31) is however sufficient to describe only Emonds’s Dialect B, which happens to be mine. For the
other two more complex dialects he describes, a movement of Prt to postverbal position will still be
necessary. Since Prt is not a phrase node, this rule does not fall under the structure-preserving constraint, but
rather under Emonds’s notion of a “local transformation”.
An interesting feature, perhaps an advantage, of this analysis is that it permits Dative Shift to be
governed by deep structure strict subcategorization rather than the traditional rule feature. Verbs which
permit Dative Shift strictly subcategorize an optional empty second NP, permitting the structural description
of (4.31) to be met. Verbs which do not allow Dative Shift will omit the Predicate NP in their strict
subcategorization. Thus, in fact, Vacan be replaced simply by V in (4.31), if it is the case that all verbs which
allow Dative Shift also allow Particle Movement in the presence of a particle.
4: COMPLEMENTS
(AdvP
Qp
[
| position can be rewritten as [-Obj, -Comp, -Det]'". Since it
appears that strictly subcategorized adverbs and QPs as in The job paid handsomely/
enough never cooccur with the predicate position, these two positions can be collapsed
into one (syntactically, though not semantically) with the feature complex [-Obj,
-Det]'". Similarly, there never seems to be a need for two PPs and an S all at once. If
this is the case, the last two terms in (4.20) can be collapsed into [ + Obj, +Comp]"'.
Thus, the final form of the base rule, using feature notation where necessary, is (4.32).
-Obj + Obj
(4.32) V' V - (NP) - (Prt) - ( ) - (PP) - ( )
-Det +Comp
This subsection will show that certain differences between the phrase structure of V'
and N' account for a number of further divergences in their syntax. The expansion of
N' is at most (4.33).
Again skipping the Prt and NP for just a moment, we see that two PP positions are
necessary for phrases like our talk about the war with Bill, paralleling the two PPs in
the V' of We talked about the war with Bill. The S in (4.33) appears in such
constructions as Frank’s claim [to Tony) that Sim was the culprit, where the S
corresponds to the one in the parallel V' of Frank claimed (to Tony) that Sim was the
culprit.12 There never seems to be a need for two PPs and an S at once, so I have
collapsed the second PP with the S; in feature notation the combination would be
identical to the one at the end of (4.32). Returning to the Prt, in the rare cases when
there is a particle in N', its position is always immediately after the head, e.g. Bill’s
looking up of the information, not *Bill’s looking of the information up.
There remains the NP position. In surface structure, of course, N cannot be
followed by an NP object or predicate. An N' corresponding to a V' with a direct
object generally contains of (although there are exceptions such as doubt).
12 Note, by the way, that the S at the end of N' confirms Emonds’s (1970) hypothesis that that and for-
to complements may be generated directly under V', rather than being extraposed there from under NP, as
proposed by Rosenbaum (1967). The Lexicalist Hypothesis claims that Frank claimed that Sim was the
culprit and Frank’s claim that Sim was the culprit must have parallel structures. In Rosenbaum’s
complement system, the deep structures are [sFrank past [VPclaim [NPit [gthat Sim was the culprit]]]] and
[NPFrank claim [gthat Sim was the culprit]], clearly nonparallel structures. To solve this difficulty, one must
assume either that both Ss are in [NPit S] constructions or that they are both attached to X'. As Emonds’s
arguments motivate the latter configuration, it is the structure of V', not of N', that must be changed from
Rosenbaum’s system.
70 RAY JACKENDOFF
Verbs which take PP complements generally preserve the preposition in the derived
nominal:
Since in these cases the form of the complement is preserved in derived nominals,
Chomsky (1970a) suggests (p. 41-42/204 and note 27, p. 42/note 28, p. 219) that the
same is true of direct objects in underlying form: they occur in both V' and N' without
the preposition, and an obligatory transformation inserts of to produce the surface
form.13 Note that this of cannot be the preposition of, since there is no PP structure into
which it is inserted. Rather, it must be a specified grammatical formative like poss.
Another instance of the specified grammatical formative of will appear in section 5.4.
The transformation inserting of is therefore to be stated as (4.36).
(4.36) 0/-Insertion
[n.N - (Prt) -NP-X] ^>l-2-o/ + 3- 4
OBLIGATORY
13 Note that of is the only formative inserted. In his doubts about the proposal, about will be present in
deep structure, and about the proposal is a PP. The fact that the verb and noun do not agree in their
complement structure must be entered in the lexicon as a complication of their normal verb-noun pairing.
Chomsky’s footnote cited above defends the o/-insertion as opposed to an o/-deletion in VP. A further
argument appears in the discussion of the passive, section 4.7.
Jackendoff (1969c) argues that possessive constructions like a picture of John's are a kind of partitive,
analogous to one of John's pictures. If this is the case, the of is inserted by (4.36). Since 0/-Insertion applies
only to the NP immediately after the head noun, the genitive phrase must be the leftmost element. This
explains Chomsky’s judgment (p. 46/206) of the “quite clumsy phrase the portrait of Aristotle by Rembrandt
of the Metropolitan Museum's”, since there is no way for the grammar to generate it. If by Rembrandt is
omitted, the ungrammaticality is even clearer. Also, note the contrast of a portrait of the Met's by
Rembrandt and ?*« portrait by Rembrandt of the Met's.
Jackendoff (1974a) takes a somewhat different view of the inserted of: it is not a specified grammatical
formative, but a real preposition which fills an empty head of a PP complement. This analysis was necessary
in part because the notion of specified grammatical formative was not included in the theory of phrase
structure assumed there. The present analysis considerably simplifies the generalization of the grammatical
relation object, and is much closer to Chomsky’s account.
To distinguish between these two theories, one would ideally like to use extraction to test the
constituency of o/-NP. But the results are inconsistent. W/i-Preposing cannot carry the of along: *Of which
boys did you meet the fathers? But, as will be seen in section 5.3.1, Extraposition from NP does move o/-NP.
arguing against the present view. On the other hand, Akmajian and Lehrer (1976) show that certain apparent
extractions of o/-NP must be base-generated rather than extracted, so perhaps Extraposition from NP is such
a case too. I leave the question open.
4: COMPLEMENTS
Thus, the underlying form of the nominals in (4.34) will be destruction the city,
explanation the result, establishment the claim.
The absence of an NP position preceding Prt in (4.33) explains why Dative Shift
and Particle Movement do not occur in N'. We have analyzed Particle Movement in V'
as the choice between two underlying phrase structure positions for the direct object,
one on either side of the particle. Since N' contains only the NP position after the
particle, the other ordering is impossible and the object can only follow Prt. Similarly,
Dative Shift in V' requires a structure-preserving movement of the indirect object into
the NP position before Prt. Since N' contains no such position, Dative Shift cannot
apply.
Furthermore, the absence of Predicate complements like *the election of John (of)
President follows from the fact that there is only one bare NP position in N': a
construction parallel to the V' elect John President would require two NP positions in
N' and there is only one. The closest parallel uses a PP with as: the election of John as
President. Thus, our analysis of the phrase structure of N' gives a principled
explanation of three apparently unrelated differences between V' and N' on the basis of
the N' lacking the first NP position.
Having justified base rule (4.33), we now address ourselves to how it generalizes
with V'.
-Obj +Obj
(4.32) V' V - (NP) - (Prt) - (
+ Comp_ )
(PP) - (
_ —DetJ ’
+ Obj
(4.33) N' N - (Prt) - (NP) (PP) - ( )
-l-Comp
All positions are parallel, except for two: N' lacks the first NP position, and allows
only an NP where V' allows NP, AP, QP, or AdvP. Using subscripted angle brackets,
the two rules can be collapsed as (4.37), which takes only three features more than the
rule for V' alone.14
X
+ Subj \
4-Subj
4-Comp/2 +Obj
(4.37) (+Obj )x X- ((NP))1-(Prt"')- - (PP) - ( )
-Obj 4-Comp _
(-Obj >2
-Det
_ + Comp_
14 We change Prt to Prt'" as suggested in note 8, page 66; this will be relevant in section 4.4.
72 RAY JACKENDOFF
X
(4.38) In , the object is the N'" immediately dominated by X' in
_+Subj
the configuration [VX - (Prt) - N'" - Y].
Note that this definition allows an NP on either side of Prt to be designated as object,
as long as there is no nonempty NP between it and the head.
For completeness, we should also define the grammatical relation Predicate.
(4.39) In V', the Predicate is the italicized phrase immediately dominated by V'
As pointed out in section 4.1, the X" complement contains restrictive modifiers. In V"
these are AdvPs and PPs of manner, means, time, instrument, and so forth, as well as
various adverbial clauses. In N", the complements include PPs of time, place,
accompaniment, and so forth, relative clauses, and (for semantic reasons, their prehead
4: COMPLEMENTS
position notwithstanding) APs. The phrase structure of X" is on the whole rather
simple; this section will discuss three intricacies involved in the proper formulation of
X" phrase structure rules and in the use of X" positions.
First let us deal with the differences between AdvPs and PPs in V". The most
salient difference is that AdvPs may appear preverbally as well as postverbally,
whereas PPs may only be postverbal.
Furthermore, although PPs can appear one after another, there is a stylistic
preference for avoiding two consecutive adverbs, wherever they are attached (see
Jackendoff (1972, chapter 3) for discussion of adverb attachment to V" vs. V'")-
Note that if the adverbs are separated the examples in (4.41) become acceptable.
Since this condition can involve any type of adverb, and in particular can cross
constituent boundaries, it cannot be stated as part of a set of context-free phrase
structure rules, but must rather be stated as some sort of an output string condition.
Thus the phrase structure component may be left free to generate an indefinite number
of adverbs as well as an indefinite number of PPs, and the number actually possible will
be constrained by the output condition and the necessity that they all receive
interpretations which can be integrated into the interpretation of the sentence.
These considerations suggest, for a first approximation, the phrase structure rule
(4.43), where X* denotes a concatenation of an indefinite number of Xs.
74 RAY JACKENDOFF
[PP
(4.43) V"—> (have - en) - (be - ing) - (AdvP) - V' - (
[AdvP
There are two things wrong with (4.43). First, the final term, expressed in distinctive
features, is [-Subj, aObj, aComp], not a wonderfully natural combination. Further¬
more, in the N" complement there is no AdvP, and no juggling of features will be able
to combine [aObj, aComp] with [ + Obj, -1-Comp] in a particularly felicitous fashion.
[PP
This suggests that the combination i A1 _ is incorrect, though it appears necessary
lAdvP
because PPs and AdvPs may be freely interspersed in V".
The second difficulty is the repetition of AdvP in two positions in V", though it has
essentially the same semantic functions in either position. Within the usual framework
of phrase structure grammar, this cannot be avoided, since the V' gets in the way of
collapsing the two occurrences. Moreover, the manner adverb may occur before an
aspect that has not been moved up into the Aux, as in John will completely have
finished. Thus in fact three AdvP positions appear necessary.
The usual solution to this problem is to generate the desired constituent in a single
position in underlying structure, then to move it to its various surface positions by
means of a transformation. Here, however, the nature of the generalization lends itself
to a different treatment: the generalization is that AdvP may appear in any position
where it is a daughter of V". To express this generalization, we will follow the
suggestion of Keyser (1968) and allow the phrase structure component to designate
certain constituents as "transportable”: a transportable node will have free word order
with respect to all of its sisters.15 We will indicate transportability with a feature
[+Trans], and assume that [—Trans] is unmarked, that is, that if the feature is not
mentioned in a rule, [-Trans] is intended (or filled in by marking conventions). We can
now revise (4.43) to (4.44).
Adv
(4.44) V" —> (have - en) - (be - ing) )* - V' - (PP)* - (S)
-<[ .+Trans
The position of the Adv in (4.44) allows collapsing with the rule for N", (4.45), to
the rule (4.46).
15 Jackendoff (1972, chapter 3) argues that free adverb order in VP is impossible because strictly
subcategorized adverbs must be postverbal. In the present treatment, the placement of strictly subcatego¬
rized adverbs in V' separates them syntactically from other adverbs, which can now be free to wander
throughout V".
4: COMPLEMENTS
Note that the feature +Trans appears only in V", so that adjectives will appear only
prenominally and will not be free within N".
A second problem in the structure of V" is the ability of a strictly subcategorized
PP (i.e. a PP in V') to appear after a manner adverb, as in John gave the beans quickly
to Bill. To account for this, either the adverb must be lowered into the AdvP position in
V', or the PP must be raised into the first PP position in V". Both operations are
structure preserving and in that respect harmless.
One piece of evidence that might be thought relevant to the choice is that a strictly
subcategorized PP cannot follow do so even when it follows a manner adverb:
(4.47) *John gave the beans quickly to Bill, and Fred did so (slowly) to Susan.
This apparently argues that the adverb is moved down into V'. But in fact such an
argument is incorrect; it is also the case that strictly subcategorized clauses cannot
follow do so even when extraposed around various PPs in V".
in a loud voice
(4.48) *John said suddenly that smoking was fun, but Susan
at 6:00
Thus there must be an independent mechanism to rule out (4.48), possibly having to do
with the trace left behind in V' by the extraposed constituent; whatever this mechanism
is, it will also rule out (4.47) under the theory that the PP to Susan has been
extraposed. Thus there are no strong considerations to decide which constituent moves
in inverting a manner adverb with a strictly subcategorized PP. The spirit of the
Extended Standard Theory does include a general uneasiness with lowering rules, so
perhaps the raising of the PP is a better solution.
Finally, there is an S at the end of both V" and'N". This is the position of
restrictive relative clauses in N"and of comparative clauses in both V"and N"; chapters
7 and 8 will discuss these uses at some length. It is also the position for various
restrictive subordinate clauses in V" such as in It only hurts when I laugh, and the
position to which extraposed V' sentential complements move by a structure-preserv¬
ing rule, for example in (4.48). A more controversial use is as the position for sentence-
final subject complements, as in It bothers Bill that she came.
The controversy arises over whether the V" complement is only a derived position
for these complements, as claimed by Rosenbaum (1967), or whether it is an underlying
position, as claimed by Emonds (1970). Emonds shows that there are a vast number of
differences between that and for-to complements on the one hand and gerundive (poss-
ing) complements on the other, concluding that only gerundives behave like NPs.
Rosenbaum's theory, in which all three types of complements are dominated by NP in
76 RAY JACKENDOFF
underlying structure, is at a loss to explain this fact without a great number of unusual
conditions on the Extraposition transformation, which derives (4.49b) from (4.49a).
Emonds, on the other hand, shows that the syntactic distribution of complement types
is much better accounted for if (4.49b) is underlying and (4.49c), which occurs only in a
rather restricted class of contexts, is derived from it by a simply stated root
transformation of Subject Replacement (or Intraposition).
On general syntactic grounds, the Intraposition theory has much to recommend it.
However, a number of difficulties in this theory are pointed out in Higgins (1973), and
in response to them Emonds (1976) abandons the Intraposition theory in favor of a
variant of Rosenbaum’s analysis. The advantageous properties of Emonds’s earlier
analysis are accounted for in the new analysis, but only in a somewhat artificial and
suspect fashion. The issue is not crucial to the present theory, but Emonds’s original
view of these complements does fit attractively into our analysis here. Section 4.8
presents some new considerations which favor the Intraposition analysis, and which
show that V" is the correct position for subject complements to be attached.
The V" position of subject complements has a parallel in N". Chomsky (1970a)
points out (p. 44/203) that sentences such as (4.50a) are paralleled by nominals like
(4.50b).
4.4.1. Adjectives
The syntax of A' is similar to that of N', but more limited. Adjectives can take PP
complements, as in good at chess, yellow with age, eager for help. They can also take
S complements, as in afraid that Bill is fierce, eager to please, proud to be a frog, and
happy that he won. There are a few examples such as dependent on Bill for help which
4: COMPLEMENTS
have two PPs, and dependent on Bill to help him with PP followed by S. Thus the
\ PP]
phrase structure rule for A' includes at least (PP) - n =, f).
+Obj
(4.51) A'-> A - (NP) - (PP) - ( )
+Comp_
(4.53)
78 RAY JACKENDOFF
4.4.2. Adverbs
This difference may be ascribed to attachment of the two PPs in different places,16
arguing that for our hero in (4.54a) and in the parallel AdvP is an X" complement.
As for S complements, we will take the position that subordinating conjunctions
are prepositions rather than adverbs, so “adverbial clauses” introduced by because,
although, etc. are not Adv' complements. There appear to be no -ly adverbs with S
complements. Thus, the phrase structure rule for Adv' is the minimal (4.55), and we
can maintain the claim that [—Comp] categories take no X' complements.
4.4.3. Prepositions
16 Williams (1975) gives this sort of difference as criterial for preposing in Ss.
4: COMPLEMENTS
If both optional constituents are omitted, we get a class of "adverbs” without -ly
such as here, there, outside, downstairs, beforehand, and afterward. If NP is present,
we get the familiar forms in the house, down the hatch, after the ball, etc. If only PP is
present, we get phrases like out of the barn, down into the darkness, from inside the
barrel, and up to your bedroom. If both NP and PP are present, we get phrases like
across the street from Bill’s house, down the street toward Bill, and to Bill in New
York. The expansion P - PP may use an intransitive preposition in the lower PP, as in
over here, down there, and from within. It can also be expanded with a P - PP in the
lower PP, as in from out of the darkness, down from above the altar, etc. Thus (4.57)
seems to be perfectly productive and free of restrictions; I find it surprising that it has
gone unnoticed for so long. Note that the definition of the grammatical relation
"object”, (4.38), generalizes directly to P', so (4.38) can now be left altogether free of
category features on X.
Klima (1965) claims that subordinating conjunctions can also be analyzed as
prepositions which take an S complement (from which the complementizer that is
deleted obligatorily in modern English, optionally in Middle English). This provides the
simplest description of the relation between the prepositions in j the ball and
the "conjunctions” in
(before I
the ball is over: before and after, like many verbs,
after J
nouns, and adjectives, allow either an object or a subordinate clause. The S does not
cooccur in the complement with either NP or PP, to my knowledge, so it must be an
alternative to them, yielding this phrase structure rule:
PPs also permit the degree Adv"', as in not completely up the tree, quite utterly out of
his mind, partly into the room, etc. As will be seen in chapter 8, they also allow an S
for degree clauses. Thus the base rule for P" is in part (4.59):
4.4.4. Particles
These rules do not account for the specifiers all and right in (4.60a); the rules for PP
will have to be emended in parallel fashion.
Taking this view allows a more unified theory of case marking in languages such as
German and Russian, which have a mixture of prepositional phrases and cases:
prepositions, like every other lexical category, determine the cases of their comple¬
ments. Thus the lexical and transformational machinery necessary to enforce case
marking is very general, applying to the complements of all lexical categories. By
contrast, in the prepositions-as-case-markers view, the case markings due to preposi¬
tions must be accounted for separately from those due to other categories. Thus,
treating prepositions as a genuine lexical category, besides accounting for obvious facts
about their distribution, permits a more unified treatment of case marking.
+ Obj
(4.32) V' V-(NP)-(Prt"')-([J^J ) - (PP) - ( +Comp )
+Obj
(4.33) N' N - (Prt'") - (NP) -(PP)-( )
+ Comp
+Obj
(4.51) A' A (NP) - (PP) - ( _+Comp_ )
(NP) -(PP)j
(4.58) P' P
(S)
Certain generalizations about word order are apparent from inspection of these
rules. In X', the head precedes all other phrases. There is a strong recurrence of N'",
P'", and S in the rules for X', and their relative order is the same in all cases where they
cooccur: N"' always precedes P'" and S, and P"' always precedes S. If these are
linguistically significant generalizations (as I believe they are), the evaluation measure
for the theory of grammar ought to be constructed in such a way as to favor this
grammar over one (for example) in which V' expands as V - N'" - P'" but P' expands
as P - P'" - N'". The usual way to construct an evaluation measure is to develop
notational conventions in terms of which the more general case takes fewer symbols to
state. This is the purpose of parentheses, braces, and distinctive feature notations (cf.
Chomsky and Halle (1968, chapter 8)).
In this case, though, a number of instances arise in which the ordinary abbrevia-
tory conventions are of no avail. First, suppose that we try to collapse just the V' and
P' rules so that the appropriate parts of the complement coincide. In simplified form,
showing only the relevant parts, we get rule (4.63) as an abbreviation for (4.62a,b).
82 RAY JACKENDOFF
(4.63) X
(+Subj)!
(N "') - (P'")
<-Subj)2 -* X - - <(S)>
. <(S))*
+Obj
+ Comp _
Capturing the generalization about the relative order of N'" and P'" misses the
generalization about the presence of S in the complement. In (4.63) the two occur¬
rences of S have nothing to do with each other, and either could be changed to
something else without affecting the complexity of (4.63). This is clearly an incorrect
result. The only way to improve on this is with a complex Boolean condition such as in
(4.64), simulating the effect of the braces in (4.62b) within the constraint of a linear
string.
(4.64) X
+ Obj -*X-(N,w)1-(P'")2-(S)3
_+Comp_
Condition: If X= P, not (3 and (1 or 2)).
Apparently, then, all the X' rules can be collapsed, while preserving the generali¬
zations about order, only by means of negative Boolean conditions. The problem
becomes worse if we try to refine all the X' rules to prevent nonexistent expansions.
For example. A' appears not to allow NP to cooccur with any of the other
complements; thus, a more refined rule would contain a set of braces different from
that in P', requiring more Boolean conditions still.
It is hard to judge how good such a solution is. The introduction of more or less
arbitrary Boolean conditions increases the power of a phrase structure grammar
considerably, and one would have to ask very seriously how they can be constrained.
Furthermore, it is not clear how one would define an evaluation measure over such a
set of conditions, so as to know exactly what is a generalization and what is not.
Nonetheless, the numerous partial generalizations of English seem to require some
such rather complex solution. I can offer no interesting suggestions for alternatives to
Boolean conditions. A serious study of the phrase structure rules of a number of other
languages would no doubt elucidate what problems must be solved by an explanatory
theory of cross-category phrase structure rules.
To make it clear that this is not a pseudoproblem, let us just mention a few other
cases in the phrase structure component of English where the traditional abbreviatory
conventions are inadequate. First reconsider the rules for X'" developed in chapter 3:
4: COMPLEMENTS
h n -n"
Anticipating the results of chapter 6, we will also mention these rules for the A'" and
Adv"' specifier for comparison.
There is a clear relationship between the Degree phrase in A'" and Adv'" and the
Article in Nand in fact a few words such as this and that appear in both categories.
What Deg'" and Art'" have in common is that they are the [—Comp, +Det] counterparts
of their respective heads. Now observe that M'" is likewise the [-Comp] counterpart of
its head, a generalization of possible interest, not pointed out in chapter 3. Again,
however, we are faced with a dilemma: Deg'" in (4.65) can be collapsed either with M'"
in (3.42a) or with Art'" in (3.42b), but not with both; there is no way to collapse all four
rules at all without a Boolean condition that restates the braces.
aSubj
X
/30bj
(4.66) aSubj (N'"), - ( )2 - X"
-Comp
L/30bj
+ Det
Conditions: If X = [—Subj], not 1
If X = N, not (1 and 2)
This case is thus formally parallel to the V' - P' generalization pointed out before.
A different sort of problem arises in trying to collapse the rules for X".
Adv
(4.44) V" (have - en) - (be - ing) - ( )* - V' - (P'T - (S)
+ Trans_
(4.45) N" (A'")* - N' _ (p'")* -(S)
(4.52)' A" (Adv'") - A' - (P"') -(S)
(4.56) Adv" (Adv'") - Adv' _(P"') -(S)
(4.59) (Adv'") - P' -(S)
(4.61b) (Adv "j - Prt'
The problem lies in capturing the generalization that all categories have a prehead
modifier that is [-Subj, —Obj]. N" and V" have special wrinkles in the modifier: both
allow more than one; in V" it is transportable; and in N", unlike all other categories, it is
[+Comp], Expressing these deviances with any of the usual notations would be
extremely clumsy, if at all possible. Of course, unconstrained Boolean conditions can
84 RAY JACKENDOFF
-Subj
(4.67) X" —* (have - en), - (be - ing)j - ( Obj )2 - X' - .
Det J
Conditions: If X 4- V, not 1
If X = [+Subj], 2 is *
If X = V, 2 is [+Trans]
If X = N, 2 is [ + Comp]
If X =* N, 2 is [-Comp]
These conditions are still less constrained than those in (4.66), since they specify actual
features and not just presence or absence of constituents. The problem is to find some
device less constrained than traditional abbreviatory conventions but more constrained
than such arbitrary conditions.
Perhaps the most striking case is one that will be motivated in more detail in
chapter 6. All [ + Comp] categories except verbs permit a measure phrase (a kind of NP)
or a quantifier phrase in the X" specifier:
In the verb phrase there is also an alternation of a measure phrase and a quantifier
phrase in V", but this time it is to the right of the head:
To make matters worse, the measure phrase and (under certain conditions) the QP can
follow the head in PP:
+ Subj w
+ Subj
-Obj )l-. . - X' - . • -( -Obj
_-Det . Det _
Conditions: If X = V, not I
If X = [—Obj], not 2
But this does not capture the generalization that the same expression appears on both
sides of X'; the grammar would be no more complex (in terms of number of symbols) if
the subscripted terms in (4.71) were of entirely different categories but used the same
number of features. One might suggest instead'that all these modifiers are generated
before the head, and a transformation permutes them obligatorily around a V' and
optionally around a P'. Such a rule, however, would be non-structure preserving, and
besides would be supported by no independent syntactic evidence, at least in the case
of V'.
Though this is the only such case I know of in English, there is reason to believe
that it is not such a rare situation. It would arise in its most drastic form in a language
with a verb-final V' but a noun-initial N'; German, if it is SOV, as has been rather
persuasively argued by many people, is such a language. Languages in which adjectives
follow nouns but adverbs precede adjectives would be another such case; so would
SOV languages with prepositions rather than postpositions. Such languages are cited by
Greenberg (1963).
Greenberg points out, however, that such languages are relatively rare; his
“universals” say that the opposite correlations occur with “overwhelmingly greater
than chance frequency”. This suggests that the evaluation measure for phrase structure
rules counts parallelisms of word order across X" as generalizations, as we have come
to expect in the present study. But it appears also that parallel grammatical relations
with differing orders across X" also count as generalizations, though not as strongly;
present notations have no way at all to express this. Again, detailed study of the phrase
structure of languages other than English is necessary, in order to establish exactly
what generalizations there are and how the formalism for phrase structure rules should
be constrained so as to explain why these and no other generalizations appear.
4.6. Summary
Section 4.1 showed that complements divide semantically into three types: functional
arguments, restrictive modifiers, and nonrestrictive modifiers, and that these can be
identified with X', X", and X'" complements, respectively. Thus, there is a systematic
relationship between the syntax and the semantics of the complement system.
The rest of the chapter developed phrase structure rules for the X' and X"
complements of most syntactic categories in English, showing the extent of cross¬
category generalization in the complement system. A close examination of the existing
86 RAY JACKENDOFF
generalizations revealed, however, that many of them cannot be expressed with the
traditional notational conventions.
There follow two appendices. The first deals with the generalization of the Passive
to NPs; the second presents some new arguments about Intraposition. Besides the
phrase structure rules listed in the previous section, this chapter and its appendices
discuss a substantial number of transformations; they are listed after the appendices.
This appendix describes how Chomsky (1970a) generalizes the Passive transformation
to NPs, and proposes a number of improvements on his analysis.
4.7.1. NP-Preposing
Consider the relationship between the phrases in (4.72a) and the synonymous readings
of the phrases in (4.72b).
If we relate them with a transformation, taking (4.72a) as the underlying order, we can
explain two things: the syntactic relationship of these phrases to the sentences X
destroyed the city, X murdered John, etc., in which the NP is a deep object; and the
ambiguity of (4.72b), where the genitive phrase can originate either as deep subject or
deep object. Chomsky calls the transformation deriving (4.72b) from (4.72a) NP-
Preposing (p. 41/203).
This rule applies only to NPs with a direct object (in the sense of section 4.2): there
are no NPs *war's prayer, *my despair’s escape, *power's lust corresponding to the
prayer for war, the escape from my despair, the lust for power. This provides some
interesting evidence that the of of direct objects in NPs is not present at some earlier
stage of derivation. The argument goes as follows: the sense of approve which takes a
direct object, e.g. approve the plan, has the nominals the approval of the plan and the
plan's approval. But the sense of approve which takes an o/-phrase, e.g. approve of
your behavior, has only the nominal the approval of your behavior and not *your
behavior's approval. We can explain this difference by claiming that the former sense
of approval, like the verb, lacks of in deep structure, but the latter sense requires it.
Then only the former sense will permit NP-Preposing, although the deep structure
difference of the two senses is obscured by the later rule C/-Insertion.
4: COMPLEMENTS
In (4.73) John may not be understood as having subject function in the nominal, but in
(4.74) he may. Wasow claims that the difference is due to the ability of the nominals in
(4.74) to take an empty (or PRO) NP specifier and the inability of the nominals in (4.73)
to do so; the difference is justified independently in Wasow and Roeper (1972). Wasow
then points out that this difference correctly predicts that only the nominals in (4.74)
undergo NP-Preposing, since only they allow an empty NP into which the object can
prepose.
Wasow’s analysis does not exhaust all the situations where NP-Preposing fails, but it
accounts for an interesting class of cases.
For the moment we will assume that the is inserted to fill an empty NP in the
specifier position; we will return to this in section 4.7.4. Under this assumption, NP-
Preposing can be stated as (4.77), where lower case denotes an empty node.
(4.77) NP-Preposing
np-Y-N-NP-Z
=>4-2-3-</>-5
OPTIONAL
88 RAY JACKENDOFF
4.7.2. Agent-Postposing
(4.79) Agent-Postposing
[NP NP - X - N - Y - [PP by np] - Z]
=> np - 2- 3 -4 - 5-1-7
OPTIONAL
The underlying structures of (4.78b) will be the enemy destruction the city by np, etc.;
(4.78a) will simply lack the final PP.
If we order NP-Preposing after Agent-Postposing, we can apply them in sequence:
(4.79)
But the final form in (4.80) is precisely what we observed in chapter 2 as a parallel to
the passive sentence The city was destroyed by the enemy.
The Passive transformation, although traditionally written as a single rule which
moves two NPs, can clearly be broken into two components, the movement of the
subject into the /ry-phrase and the movement of the object into subject position. The
first component obviously generalizes with Agent-Postposing: the bv-phrase is used in
precisely the same way, and the NP is moved out of parallel positions. The rules
combine as (4.81).
4: COMPLEMENTS
The second component of the sentential Passive does not immediately generalize
so clearly with NP-Preposing. This part of the Passive, unlike NP-Preposing, is
obligatory, since there are no sentential forms like (4.78b) (e.g. *it destroyed the city by
the enemy). Furthermore, under certain conditions the passive construction permits a
preposition to intervene between the NP to be preposed and the verb, as in This bed
has been slept in by an infamous violinist. The second half of the Passive must also
introduce the auxiliary be-en.
(4.82) could clearly be combined with (4.77) by the usual notations. However, the
difference between optional and obligatory use casts a certain amount of suspicion on
the generalization. Fiengo’s (1974) analysis of the passive provides some hope of
improvement. In Fiengo’s theory, be-en is present in the underlying structure of
passives, and semantic conditions on traces guarantee that only passive surface
structures are associated with it. Furthermore, he generalizes the rule so as to use it to
derive examples like These books sell easily from sources like np sell these books
easily; thus be-en plays a role neither in the structural description nor in the structural
change of the Passive. Because of the presence of various independent conditions,
then, Fiengo’s analysis eliminates the be-en and the obligatory application from (4.82).
The only remaining discrepancy is then the optional preposition, so the generalization
of NP-Preposing looks somewhat more plausible than in the analysis of “Remarks”,
interpreted strictly. Conceivably the generalized form could be at least as clean as
(4.83) .
Besides the generalization with NP passives and the elimination of a rule moving
two constituents simultaneously, there is some independent reason to separate the two
components of the Passive. As pointed out by Emonds (1970), sentences like (4.84)
90 RAY JACKENDOFF
could be generated as deep structures with empty subjects and derived by the use of
(4.83) alone, avoiding the suspicious rule of By-Someone Deletion used traditionally in
deriving examples of this sort.
4.7.3. NP-Postposing
However, Bowers (1968a) argues against the by —> of rule, proposing instead that there
is an independent rule of NP-Postposing with the following effect (stated in the present
framework):
(4.85) NP-Postposing
NP-Y-N-np-Z
=> np - 2 - 3 - 1-5
OPTIONAL
relations. For these nouns and the adjectives high, long, etc., would be related in the
simplest way if they took syntactically parallel arguments. Since there are no forms
*high of the building, *long of the railroad, etc., but rather only The building is high,
The railroad is long, etc., the optimal relation between nouns and adjectives under the
X Convention favors (4.86a) as the underlying order. This of course requires the
existence of a rule like (4.85).
Similar considerations apply in the following nominals.
17 Our account differs from Chomsky’s analysis not only in the treatment of NP-Postposing in NPs,
discussed above, but also in the treatment of the determiner. Instead of an empty NP determiner in the
destruction of the city by the enemy, filled transformationally by the, Chomsky proposes that the determiner
position have the syntactic feature analysis [+NP, +Def], with the latter feature spelled out as the if NP is
moved away. Since at the time of “Remarks” empty nodes were not an available device, Chomsky’s solution
appeared much more necessary. Jackendoff (1974a) makes an attempt to work it out in some detail; the
complexity of the resulting theory has been criticized with some justification by Homstein (1975). Besides its
complexity, there are two arguments against Chomsky’s stratagem of combining category and definiteness
features at a single node in the base. First, no one has found another use for this device since 1967, the time
at which “Remarks” first appeared. Second, this device strikes me as utterly inadequate to cope with
examples like (4.88). I thus feel little regret in abandoning it.
4: COMPLEMENTS
was respected by John (these cases pointed out by Hornstein (1975)). Note that these
all are cases in which the by-phrase does not designate an agent, so they could not be
interpreted correctly by the proposed projection rule. They are thus good candidates
for a transformational derivation of the by-phrase from subjects. On the other hand,
this solution might make it more difficult to give a semantic account of those verbs
which do not undergo Passive at all, e.g. resemble, weigh, etc. I leave the issue open.
Note that even if Agent-Postposing is abandoned altogether, we are not led to
something like case grammar, in which all agents are generated as by-phrases in
underlying structure. Rather, NPs can be interpreted as agents either in subject position
or in by-NP complements, and there is no transformational relationship between the
two positions at all.18
Since NP-Preposing requires an empty NP in subject position, and since the
determiner of an NP may be either an article or an empty NP, the following derivations
may take place.
(every
that
(4.89) that ' proof of famous theorem
(the
(no transformations apply)
( NP-Preposing
r famous theorem —>
I that
the proof of famous theorem
We are thus positing two sources of the proof of that theorem, one in which the is base¬
generated as an article, and one in which the is inserted transformationally to fill an
empty NP.
One might think that the arguments for eliminating Agent-Postposing would apply
equally against (4.85), NP-Postposing. But interestingly enough, a substantial number
of the nominals which we proposed to derive by NP-Postposing appear only with the
article the, so counterexamples like (4.88) do not arise.
18 Emonds (1976) proposes that all agents originate in a by-phrase and are optionally preposed to subject
position. Though I have not investigated the consequences of this alternative thoroughly, it strikes me that it
will require at least equal complexity in the grammar, and that it is somewhat less plausible than the present
theory, which simply invests an optional phrase structure position with its own interpretation, similar to
instrument and accompaniment phrases.
94 RAY JACKENDOFF
the
Within the proposed analysis, a very simple account of this determiner restriction
emerges: these nouns subcategorize only NPs in the determiner, not articles and
quantifiers. Thus a configuration like (4.89a) cannot come about. The is introduced
transformationally when the NP position either is empty in the base or becomes empty
through the application of NP-Postposing.
A similar case is the action reading of destruction, which subcategorizes an object.
Though some destruction is all right, *some destruction of the city is not. We can
describe this difference by claiming that the action reading subcategorizes not only an
object but also an NP subject. Thus, the only article that can appear with it is the,
inserted if the subject happens to be empty at the surface.
Suppose we find nominals which are like (4.90) in that what is the subject of the
corresponding sentence follows the head of the nominal, but which differ from (4.90) in
that other determiners besides the are possible. Our solution must be to generate these
forms in the base instead as a result of NP-Postposing. In this case the verb and its
nominalization will not be optimally related, since the verb’s subject will correspond to
the noun’s object instead of to the noun's subject as in (4.90).
Some speakers may consider growth and growling relevant examples of this sort,
finding grammatical all growth of corn (related nonoptimally to Corn grows) and all
growling of lions (related to Lions growl). I find these examples marginal at best, and
all is the only determiner other than the that sounds even faintly plausible. In any
event, the existence of such nonoptimal examples is not a problem: I would guess that
this is an area where individuals’ lexicons will differ, since the evidence to decide
between a subject which postposes and an object which preposes is relatively scanty in
the corpus available to a language learner, and the choice has no serious consequences
elsewhere in the grammar.
This completes our account of rules moving NPs in and out of the complement.
There are still no doubt quite a number of anomalies and irregularities which we have
not attempted to deal with, but we have added much precision and detail to Chomsky's
4: COMPLEMENTS
rather sketchy analysis, making several improvements along the way. In the end, we
claim that NP-Preposing takes place in Ss and NPs, that Agent-Postposing may take
place in Ss, and that NP-Postposing takes place in NPs. An empty subject position in
an NP is filled by the\ if articles and quantifiers other than the appear in an NP, there
can have been no NP position in the determiner, and hence no NP to the right of the
head can have been an underlying subject.
Section 4.3 claimed that the S at the end of V" and N" is an underlying position for so-
called “subject complements” such as the S in It is obvious that Bill left, It disturbs me
that Bill left, and the necessity for Bill to leave. This is the view of Emonds (1970)
(though not Emonds (1976)), who claims that the synonymous sentences That Bill left is
obvious and That Bill left disturbs me are derived by moving the complement leftward
from underlying final position by a root transformation called Intraposition. Emonds
opposes the traditional Rosenbaum (1967) Extraposition theory, which claims the
derivations go in the opposite direction.
This section presents a number of arguments for the Intraposition theory, to meet
certain objections that have arisen since the publication of Emonds’s work. I do not,
however, claim to meet all extant arguments against Intraposition. I presuppose the
reader’s familiarity with Emonds’s arguments.
The most objectionable feature of the Intraposition theory was its account of
double complement verbs such as prove. Since Emonds provided only one S position at
the end of the sentence, he was forced to claim that (4.91a), for example, was derived
from an underlying structure (4.91b) in which both complements occupied the same
position, a situation Emonds called a “doubly filled node”, which would have to be
eliminated before surface structure.
Since the device of doubly filled nodes seems morally repugnant to most linguists,
Emonds’s account of (4.91a) seemed rather implausible, and many considered this
96 RAY JACKENDOFF
alone sufficient to dismiss the entire Intraposition analysis, or even the whole structure¬
preserving hypothesis.
However, the problem of prove turns out to have a better solution within the
Intraposition theory. There are some speakers who find (4.92a) and particularly
(4.92b,c) acceptable.
For these speakers, the doubly filled node solution is impossible, since the surface
structure is not at all ill-formed. These examples show that Emonds was wrong in
positing only one final S position.
In the present theory there are two distinct positions available for sentential
complements, one in V' and one in V". It is possible to claim that the first clause in
(4.92) is the V' complement and the second the V" complement, interpreted as a
subject:
(4.93)
N'" M
This would explain in particular why the clauses in (4.92) cannot be interchanged
without changing meaning, a fact unexplained by a structure like (4.91b). It then
remains to explain why (4.92) is unacceptable to many speakers (and notice that this
problem exists in the Extraposition hypothesis as well). The most plausible solution in
view of the dialect split is an output constraint similar to that for consecutive adverbs
(see section 4.3), ruling out consecutive Ss within V". Thus, doubly filled nodes can be
eliminated in favor of a theoretical device of an independently necessary sort, itself
4: COMPLEMENTS
required in the competing theory. Once we realize this, the problem of prove simply
ceases to decide between the two theories.
Another objection to the Intraposition theory concerns the relationship of the
clause to the subject it. How does the grammar determine that the clause has all the
semantic properties of a subject, without the defining syntactic property of preceding
the auxiliary? Rosenbaum’s solution is to claim that in the underlying form the clause is
in subject position, accounting for its interpretation in the simplest possible way.
Emonds, on the other hand, is forced to claim that there is an anaphoric relationship
between it and the clause, established by an independent projection rule.19 Thus, the
Intraposition theory needs a rule not needed by the Extraposition theory.
However, there is some evidence that the interpretation rule is independently
necessary. First, there are languages (e.g. French and Hungarian) in which a clause can
never occupy subject position. There is thus no syntactic evidence for generating it
there, and for these languages the anaphoric theory looks rather more attractive. Thus
the rule of anaphora must be a possibility in linguistic theory. Second, there are
sentences in English which cannot in good conscience be generated by an Extraposition
theory but which have final clauses with the function of subject: compare (4.94a) and
(4.94b).
The (b) sentences have the preposed form (4.95a), not (4.95b).
But since the clauses in (4.94b) have the same semantic function as those in (4.94a),
their interpretation requires precisely the rule of anaphora proposed by Emonds to
account for (4.94a), and in this case the alternative of an Extraposition analysis is far
less plausible.
19 Emonds says the relevant anaphoric relationship is coreference, which strikes me as somewhat
incoherent, in that it and the complement S do not constitute two separate references to the same event. A
more appropriate kind of anaphora might be termed “grammatical anaphora”, by contrast with the usual
“referential anaphora”. When a relationship of grammatical anaphora is established, any projection rules
which would normally apply to the proform by virtue of its syntactic position are directed to apply instead to
the antecedent. Thus the antecedent receives an “indirect” grammatical relation. This kind of anaphora is
probably the type involved in interpretation of traces, and perhaps with reflexives of certain types; it has an
effect rather like a bound variable in logical notation.
98 RAY JACKENDOFF
Section 3.4.2 pointed out a different class of cases, in which a PP appears to take
subject function:
These PPs can be moved into the subject in the same environments as that and for-to
complements can, and there is no problem in generating them under V". To interpret
(4.96), however, a rule of anaphora is needed which specifies the PP as what is being
measured, i.e. the subject. This is the same rule needed for the Intraposition analysis.
Thus, since the anaphora rule is needed anyway, it does not weigh against Emonds’s
theory, as originally thought.
The fact that the necessary rule of anaphora may be a surface structure rule of
interpretation provides the key to a problem of derived structure raised by Higgins
(1973). Higgins is concerned with the derivation of sentences like (4.97) (his (58)):
In the Extraposition theory, these are both derived from (4.98a), by Passive and
Raising in the main clause. It is not clear on which cycle Extraposition takes place in
(4.97a), so its derived structure could be either of the options in (4.98b). The fact that
Extraposition is generally obligatory in subordinate clauses argues that the complement
is probably attached to the lower VP.
(4.98) a. S
think S
4: COMPLEMENTS
b. S
\
\
be unwise
Higgins assumes that in the Intraposition theory the deep structure of (4.97) is (4.99).
Hence (4.97a) is derived by Passive and Raising of it; the derived structure is (4.98b),
with the complement attached to the lower VP.
(4.99)
PRO Pres V
think
To produce (4.97b), Intraposition now has to raise the complement over a clause
boundary, and Higgins correctly shows that this raising produces difficulties.
100 RAY JACKENDOFF
In the Extraposition theory this fact is difficult to account for, since the obligatoriness
of Extraposition in subordinate clauses would force the that-clause to attach to the
lower VP, on the earlier cycle. However, if the rule of anaphora in the Intraposition
theory is like all other anaphora rules in applying to surface structures, it can be
restricted to the environment (4.101), making only the higher attachment of the
complement possible as a derived structure.
(4.101)
(4.102)
4: COMPLEMENTS
This seems to violate our linguist’s intuitions about grammatical relations, since the
clause that you didn’t try harder is entirely outside of the clause it to be unwise of
which it is understood as the subject. This apparent anomaly disappears when we
realize that this subject relation is not a normal grammatical relation, but is established
only indirectly by a surface structure rule of anaphora. If it were not raised into the
upper clause, the anaphora rule could not apply, and that you didn't try harder would
be left without a semantic function.
This section has dealt with three objections to the Intraposition theory. First, we
showed that the double-complement verbs such as prove should not be accounted for
with doubly filled nodes, as Emonds suggested; rather, one complement is in V' and
one in V". The constraint against concatenated complements of the same type applies
equally in the Intraposition and Extraposition theories. Second, the rule of anaphora
relating it to the V" complement was shown to have independent motivation in both
English and other languages. Third, the fact that the anaphora rule applies to surface
structure enables the theory to meet Higgins’s objection about derived structure.
-l-Subj
(4.83) np - Y - <+Obj> - <(P)> -NP-Zz>5-2-3-4-0-6
. +Comp
OPTIONAL
[X'
X - (Prt) -NP-Y] =>l-2-o/+ 3- 4
L-obj J
OBLIGATORY
Output Constraints
5.1. Introduction
Chapter 3 dealt with aspects of the specifier that are central to the syntax of sentences:
the subject and the auxiliary. This chapter and the next extend the study of specifiers to
systems which play a minor role in sentences but which are important in NPs, APs, and
PPs: determiners and expressions of quantification and degree.
There are problems in studying specifier systems that do not arise in studying
complements. First, specifier systems involve very small numbers of lexical items and
are riddled with idiosyncrasies. Thus general phrase structure rules must be supported
on the basis of impoverished and skewed surface distributions. Only with judicious use
of lexically determined transformations and interpretation rules can any order at all be
brought into the system. One way to bring more data to bear on the problems is to
exploit the hypothesis of cross-category generalization, and that will be a major source
of evidence here.
A second problem with specifier systems, at least in English, is that it appears
much less possible to correlate semantic regularities with syntactic positions, as we did
in complements. This may of course be a function of our ignorance about the semantics
of deixis, quantification, and measuring; but it may also be a brute fact about English or
about specifiers in general. In the absence of a coherent semantic theory of specifiers, I
will make the latter assumption, hoping that a better understanding of the syntax may
make possible a more disciplined approach to the semantics.
This chapter deals with the syntax of the NP specifier. Chapter 6 extends this
account to AP and PP, proposing a reanalysis of Bresnan’s (1973) theory of the degree
system.
There are three different semantic roles that the elements of the NP specifier can play.
Without being able to define these semantic roles precisely, we can still make the
distinction among the three roles intuitively, at least for the clearest cases, and let the
104 RAY JACKENDOFF
We will show that (5.1) interacts with the syntactic constraints on specifiers to produce
some of the otherwise curious aspects of specifier distribution.
Genitive NPs and demonstratives may be followed by certain quantifiers, but not
by others:
'some '
Fred’s' Fred’s
each many
the the
(5.3) a. < , > - all dwarf(s) few dwarfs
those those
no several
\
which /
which
any /
There is no apparent semantic reason for this division: why should *Fred’s some apples
mean anything less innocuous than Fred's many apples? In order to account for this
distribution, then, we will resort to dividing the class of semantic quantifiers into two
syntactic categories. Those in (5.3a) will be assigned the category Art, which is
generated only by phrase structure rule (5.2). Their inability to cooccur with demon¬
stratives is a syntactic constraint, namely the availability of only one Art position in the
5: NP SPECIFIERS
We choose the N" specifier rather than the N'" specifier for this QP for reasons to be
discussed in section 6.3. Our analysis thus assigns (5.3b) this structure:
(5.5) N"'
Since some quantifiers are now Arts and some are Qs, the phrase structure
component will generate structures in which two quantifiers appear, one in each
position, e.g. *no many men, *all several men, *any much wine. These are ruled out
semantically, however, by the Specifier Constraint (5.1), which forbids two (semantic)
quantifiers in the same NP specifier. If the second quantifier is replaced by a numeral,
so that the Specifier Constraint is satisfied, acceptable constructions like no three men,
all six men, and any dozen men are generated.
For convenience in exposition, we will when necessary refer to the quantifiers in
(5.3a) as N"' quantifiers and those in (5.3b) as N" quantifiers. The former have the
syntactic features [+Subj, -Obj, -Comp, +Det] and the latter [ + Subj, -Obj, -Comp,
-Det].1
1 A few doubtful cases should be mentioned here. Little behaves like an N" quantifier but its positive
much behaves like an N'" quantifier:
John’s [ little j
(0 the Ji*muchJ
food
In most other relevant contexts, though, much patterns with many, few, and little, as will be seen throughout
this chapter. It is difficult to know whether to call much an Art in this use or to call it a Q and introduce
arbitrary restrictions that prohibit it from occurring with Art. More research into the semantics of English
quantifiers may help decide which is the appropriate solution. I assume it is a Q.
A second apparent anomaly is that every (but not its near relative each) can occur after genitives but not
after articles in the semi-idiomatic construction (ii).
,... , I Fred’s! fmovel
(..) Wehungon(*the j every {word j.
It is not clear whether every is functioning in its normal syntactic position here (indicating that it is actually a
Q) or whether this use as a Q is simply an idiomatic exception. I assume the latter.
106 RAY JACKENDOFF
We should mention two little-remarked articles in English which provide evidence for
the correctness of the feature system. Postal (1966) is concerned with the connection
between the normal pronominal use of we and you and their use in expressions like we
three guys, you five rascals. Plural but not singular vow has this use productively: I saw
you guys/*guy yesterday; though (inexplicably) singular you can be used this way in
vocatives: Come here, you bastard! There are dialects in which them guys is also
acceptable.
Postal effectively discounts the theory that the pronoun is head of this construction
and the other elements are appositives (we, who are three guys). Thus the pronoun is
apparently in article position. Since Postal has no syntactic features to work with, he
resorts to describing the relationship between the two uses by deriving the pronominal
use from an underlying we ones, so that we is always an underlying article. He
proposes this mechanism as a general derivation for all definite pronouns.
In the present framework there is an alternative. We and youplut can be assigned
the feature matrix [ + Subj, — Obj, + Det], leaving the feature Comp unspecified. Then
they can be placed in either N or Art positions. No transformations or abstract
underlying forms are necessary. Note that we and you can then be substituted into the
determiner of an NP in two distinct ways:
(5.6) a. N
Jwe 1
lyouj
(5.6b) will remain in its underlying form, giving we/you three guys. (5.6a), however, will
undergo Povs-Insertion to become ourlyour three guys.
To recapitulate, this section has shown that there are three semantic sorts of NP
specifiers: demonstratives, quantifiers, and numerals. They are divided up among two
NP specifier positions, in N'" and in N", and are of the categories Art and Q,
respectively. Demonstratives always appear in N'", but some quantifiers appear in N"'
and some in N". In order to rule out structures with two quantifiers, one in each
position, we appealed to the Specifier Constraint, a semantic condition. Finally, the use
of we and you as articles has provided evidence for the feature system.
All of the quantifiers mentioned in the previous section occur not only with a simple
noun (e.g. many men, all horses) but also with a definite of-phrase called a partitive
5: NP SPECIFIERS
(e.g. many of the men, all of the horses). Some of them alter their form when used with
a partitive (e.g. every man, no men but every one of the men, none of the men)', this
alternation will be discussed in section 5.3.4. All numerals allow a partitive phrase; so
do the demonstratives those and which, and the adjective (?) other and superlatives
(e.g. the oldest of the men). This section will deal with the structure of the partitive
construction.2
Many sources (including Chomsky in “Remarks”) account for the partitive construc¬
tion by treating the noun following of as the head of the construction, and the quantifiei
preceding of as a “prearticle”. Of is obligatorily inserted if the article is definite:
several the
Selkirk (1977) further points out that these of the N" can undergo Extraposition
from NP, which regularly moves N' complements.
2 As will be seen, the arguments of Selkirk (1977) have contributed greatly to this section and the next.
These are in turn based on Akmajian and Lehrer (1976). I understand that Jean-Claude Milner’s recent
doctoral dissertation reaches similar conclusions, but I have not had the opportunity to see it.
108 RAY JACKENDOFF
Thus a structure (5.11), which treats of the N" as a constituent in the N' complement, is
most plausible for group nouns.3
(5.11)
The partitive o/-phrase serves the same function after quantifiers as after group
nouns: it designates a set out of which certain individuals (or a certain subset) is
selected. For count nouns, the o/-phrase must be plural.4
Quantifiers and group nouns with partitives share three other properties. First,
both prohibit quantification in the o/-phrase (see section 5.3.3 for more discussion):
3 Our account of NP objects says that of N'" is not a PP, since of is a transformationally inserted
grammatical formative. The fact that Extraposition moves it like any other PP may be evidence for the
position of Jackendoff (1974a), in which NP objects are really PPs with an empty P filled by of. I abandoned
this theory in chapter 4 because the evidence for it was not as strong as I previously thought, in light of other
revisions, and the complication of the object relation was so severe. Henceforth I present trees in a way
neutral between the two theories. See also chapter 4, note 13, p. 70.
4 This needs appropriate modification to deal with the semantics of mass nouns, but in any event the
semantic parallelism holds: compare a gallon of the wine and much of the wine.
5: NP SPECIFIERS
* ( a group
(5.12) <
a number
> of
{some men
many all men
few
Second, if an N" quantifier or a group noun is preceded by the and has a definite of-
phrase, a restrictive relative clause is necessary.
This relative clause is not attached to the o/-phrase, as can be seen if the o/-phrase is
preposed:
the group
b. Of the men, that you met aren’t here anymore.
(the many
Third, as Selkirk (1977) points out, the partitives after quantifiers, like those after group
nouns, can undergo Extraposition from NP. The examples in (5.15) are comparable in
acceptability to those in (5.10).
Each of these arguments raises vexing problems for a theory like (5.7). First, if
quantifiers can take relative clauses of their own, “prearticle” must really be some sort
of NP. But (5.7) then incorrectly predicts that a relative clause on the prearticle should
precede the head noun rather than follow it. Second, the preposing rule which derives
(5.14b) would have to extract the head of the phrase, leaving the modifiers behind, an
otherwise unprecedented operation. Third, Extraposition from NP would have to move
a nonconstituent including the head, instead of a PP in the N' complement.
By contrast, the theory that the partitive is an N' complement explains the
syntactic parallelisms exhibited in these examples. Parallel (though still unknown)
constraints can be stated for the restrictions demonstrated in (5.12)-(5.13). Since
relative clauses always follow N' complements, the proper order of partitive and
relative clause in (5.13) follows from the base rules. And since transformations can in
general remove complements from their heads, the preposing in (5.14) is within the
5 I have no explanation for why the group of them is all right but *the many of them is not.
1 10 RAY JACKENDOFF
This theory of the partitive leads to two possible deep structures for many of the men.
(5.16a) is essentially the structure which Jackendoff (1968) chose, in part because
empty nodes were not an available device at the time. Ideally it requires some
explanation of why quantifiers cannot be preceded by adjectives; (5.16b) requires no
such explanation. We will develop solution (5.16b) here; section 5.5 will show that
numerals (but not quantifiers) do occur in construction (5.16a) and in fact can be
preceded by adjectives in that use.
To develop the description of (5.16b), we need a projection rule to assign the
empty head PRO an interpretation something like "unit(s)” when it is immediately to
the right of any of the articles, quantifiers, or adjectives that govern partitives. All of
these words must be lexically marked to govern the projection rule.6
6 The interpretation here given roughly as UNIT must be general enough to apply to mass phrases too,
e.g. some of the wine, where perhaps the interpretation AMOUNT is more appropriate than UNIT. Wald
(1976) develops a theory of "slices”, based on nonatomic mereologies, in which mass nouns and plurals can
be dealt with homogeneously. The term common to UNIT for count nouns and AMOUNT for mass nouns is
Wald’s term SLICE.
5: NP SPECIFIERS
But in (5.19a) PRO cannot be interpreted because it is not adjacent to the partitive word
many, and in (5.19b) PRO cannot be interpreted because there is no partitive word to
govern the rule. Since an uninterpreted PRO produces anomaly, *many old of the men
and *your of the men are eliminated.
(5.19) a. N"
N"
you(r)
This subsection has shown, then, that (5.16b) is a practicable structure for many of
the men, given the Partitive Projection Rule. Since we are saving (5.16a) for another
construction, (5.16b) is our only choice.7
Given structure (5.16b) for many of the men, we must now ask what happens to this
structure if the lower NP is indefinite, as in for example (5.20).
7 It is worth mentioning that the ungrammatically of *many old of the men is a stumbling block to many
earlier analyses of the partitive. After trying many alternatives, I have convinced myself that there is no
purely syntactic solution. See further discussion in section 5.3.4, particularly footnote 8.
1 12 RAY JACKEN DOFF
(5.20) N
Jackendoff (1968) claims that the of in this construction obligatorily deletes, giving
another possible source for many men besides the simple one of section 5.2, (5.21).
(5.21) N'"
N"
However, Selkirk (1977) argues that this claim makes a number of incorrect
predictions. Hence only (5.21) is a correct structure, and some way must be found to
eliminate the underlying form (5.20).
One of Selkirk’s arguments involves cases where a definite N' complement is
acceptable, but the indefinite without of predicted by (5.20) is impossible:
Conversely, (5.20) predicts that one single thing has the impossible definite counterpart
*one of the single things. (5.21), on the other hand, deals with these facts easily, as part
of the rule enforcing selectional restrictions and number agreement between quantifiers
and their heads.
Second, Jackendoff (1968) claims that Of-Deletion corrects the number of the head
noun when the quantifier is singular, so that underlying each of men becomes each man
rather than *each men. But Selkirk points out that it could not correct the number
within a relative clause on the head noun. Thus we would incorrectly expect paradigms
like (5.24).
(5.24) each of the politicians who were elected/*each politician who were
elected
(5.21) would avoid this problem, since politician in each politician would be underly-
ingly singular, not plural.
Third, Extraposition from NP would be expected to apply in (5.20) just as it does
in (5.16b), yielding sentences like (5.25), parallel to (5.15).
On the other hand, (5.21) claims that turkey, questions, and books are heads of the
subject NPs in (5.25), and that Extraposition from NP should move constituents that
are one level further down into the NP. This prediction is correct:
Thus the simpler (5.21), generated by the rules of section 5.2, is the only possible
underlying structure for many men. Selkirk proposes therefore to filter out (5.20) by a
generalization of the constraint illustrated in (5.12) that rules out *many of some men,
*many of all men, and various other specifiers in the partitive NP.
The nature of the constrainst that filters out (5.12) and (5.20) is not (to my
knowledge) yet understood. In observational terms, it seems to be approximately
(5.27).
Thus, (5.27) rules out *many of all men, *many of some men, and *few of many men,
as well as *many of men, since they all lack demonstrative or genitive specifiers; but it
permits many of the men, few of the many men, and many of his friends.
Notice that (5.27) refers to semantic functions in the specifier, not to syntactic
positions. This argues that (5.27) is part of the semantic component, and further
1 14 RAY JACKENDOFF
5.3.4. Substantivization
To complete the account of the partitive, we need to deal with the phonological change
that takes place in certain articles that govern partitives:
It is often thought that the change in form is a phonological coalescence of the article
with one, so that none of the men, for instance, is underlyingly no one of the men. But
since none can refer to mass nouns, as in none of the rice, this view is mistaken. We
will treat the alternation rather as the addition of an inflectional marker subst, added by
the following rule:
(5.29) Substantivization I
Art - [nPRO] 1 +subst - 2
OBLIGATORY
No+subst will be spelled as none; every +subst will be spelled as every one, etc. For
those articles such as all which do not change form, the addition of subst will not
change phonological shape.
This sort of rule changing phonological shape might be thought to violate the
Lexicalist Hypothesis. However, since it occurs with only a limited number of
morphemes and, unlike nominalization, has no semantic effect, we can regard it as a
rule of inflectional morphology, like Poss-Insertion and Affix Hopping, which the
Lexicalist Hypothesis permits.
In order to show that (5.29) is independently necessary, we will show that its
application is not restricted to partitive constructions; other rules which interpret a
PRO also create the appropriate configuration. One such rule can be called N'-
Anaphora (N-Deletion in Jackendoff (1971)). This is the rule operative in the final NPof
(5.30).
5: NP SPECIFIERS
/ Morris’s
yours
these
one
every one
succeeded in reading S none
some
many
three
one
*ones
The anaphoric element in this construction is a pro-N' PRO which occupies the head of
the NP, referring in (5.30) to the N' dossiers of famous linguists. Any specifier is
possible before PRO (as long as there is no intervening adjective); the impossibility of
ones shows that one in this construction is the specifier one, not the noun.
In the N'-Anaphora construction, the same quantifiers change phonological shape
as did in the partitive construction above. In addition, the possessive pronouns change
shape: my, your, etc. are replaced by mine, yours, etc. Again, the change cannot be a
result of coalescence with an underlying head one(s), since N'-Anaphora, unlike one(s)-
Anaphora, can apply to antecedents with mass nouns as heads.8
yours
I and
(5.31) John wanted some rice. ■ none ' was moldy.
[but
some
(5.32) Substantivization II
f Art )
NP
+Poss - [nPRO] => 1 +subst - 2
.Pro.
OBLIGATORY
8 The fact that this construction appears with mass nouns goes unnoticed by Postal (1966), who claims
to derive all pronouns from underlying articles plus one. The solutions of Perlmutter (1970) and Jackendoff
(1969c) suffer from the same objection, though they at least note it, with some puzzlement. Sommerstein
(1972) has the opposite problem: he inserts one if the head has been deleted and there is no determiner or
noun left for the number morpheme to attach to. Thus his rules incorrectly derive *My coarse sand is next to
your fine one. In the present theory these objections (raised also by Siegel (1974)) do not arise, because one
and PRO are distinct anaphoric elements in NPs, not transformationally related.
116 RAY JACKENDOFF
Substantivization also applies when PRO is interpreted by the rule of N'- Gapping
(see Jackendoff (1971)). This rule differs from N'-Anaphora in that it can apply only if
the antecedent is directly conjoined to the gapped NP, and in that it permits N'
complements to be left behind.
(5.33) a. *Joe’s destruction of the pictures was appalling, and Fred’s of the
records was even worse.
(N'-Anaphora: NPs not conjoined, but N' complement left behind)
b. Joe’s destruction of the pictures and Fred’s of the records appalled
everyone.
(N'-Gapping: NPs conjoined)
Again, the missing head can be regarded as a PRO; so, for example, the subject of
(5.33b) is Joe’s destruction of the pictures and Fred's PRO of the records.
N'-Gapping is of considerable interest to the theory of syntactic distinctive
features, for it has properties very similar to the well-known rule of Gapping in
sentences. These similarities are discussed in Jackendoff (1971), and 1 have nothing to
add here. What is germane to the present discussion is that Substantivization applies in
N'-gapped constructions as well, yielding yours and none in (5.34), for example.
The source for this construction proposed in Jackendoff (1969c), which I see no reason
to abandon, comes out in the present framework as (5.36).
9 (5.35b) shows that this o/-phrase cannot be produced by extraposing the genitive phrase from the
determiner to the end of the NP, as some have claimed, including Siegel (1974).
5: NP SPECIFIERS
(5.36) N'
\
\ \
>
PRO
(5.37) Demonstratives
b. N'"
tree(s)
(5.38) N'" Quantifiers
a.
r
each >V each
all . • all
[some C.
some J
tree(s)
(5.39) N" Quantifiers
a. N b. N'"
Each of these specifiers is marked [+Partitive] and thus in the (b) cases PRO can be
interpreted by the Partitive Projection Rule; the proper specifiers in the subordinate
NPs are guaranteed by the Partitive Constraint. Certain of the N'" specifiers in this
construction change their form in case (b) through the application of the rule
Substantivization.
This completes our discussion of the partitive. We now turn to two other
constructions in the NP specifier, pseudopartitives and numerals.
5.4. Pseudopartitives
Section 5.3.1 argued that group nouns followed by partitives such as a bunch of the
men have the following structure:
(5.40) N'"
ackendoff (1968) assumes that when the lower NP is indefinite, as in a bunch of men,
the structure is essentially the same:
(5.41)
120 RAY JACKENDOFF
However, Selkirk (1977) argues that a bunch of men has a different structure, (5.42),
which she calls the pseudopartitive. Her arguments are quite similar to those for
quantifiers given in section 5.3.3.
(5.42) N'"
In this structure, the grammatical formative of is a daughter of N". Other than the of
(5.42) is parallel to the structure of many men, (5.21).
(5.21) N"'
N"
Before presenting three of Selkirk’s arguments, let us see how a solution like (5.42)
affects the grammar. Ignoring the of for a moment, we see that there is now a choice
between Q'" and N'" in the N" specifier. In a theory without syntactic distinctive
features, this would involve a complication of the grammar. But since, in the present
theory, Q'" and N'" differ only in the feature Comp, the choice between Q"' and N'"
makes the grammar actually one feature simpler than if Q'" alone were possible. The
rule for N" thus reads like this:
5: NP SPECIFIERS
+ Subj
(5.43) N" ( -Obj ) - (A'")* - N' - . . .
Det
Since one of the things the Partitive Constraint rules out is a null determiner in the
partitive phrase, it rules out (5.41), unless (5.41) is an otherwise unmotivated exception.
On the other hand, (5.42) contains no NP interpreted as a partitive, so the Partitive
Constraint does not apply to it.10
Selkirk’s second argument concerns the application of Extraposition from NP.
Recall that in a definite partitive with structure (5.40), Extraposition from NP moves
the o/-phrase:
10 Note that there is another construction, exemplified by a group of three men, in which the o/-phrase
does not contain a demonstrative specifier. But this o/-phrase does not have a partitive interpretation, since a
group of three men means not a group taken out of three men, but a group consisting of three men. Hence the
Partitive Constraint does not apply, and a nondemonstrative specifier is permitted. The possibility of such a
“consistive” interpretation is what makes a pot of steel ambiguous: one reading is like a group of men, has
structure (5.42), and denotes a certain amount of steel; the other reading is like a group of three men, has
structure (5.41), and denotes a pot made out of steel. In fact, the consistive reading seems to require a
nondemonstrative specifier and thus is mutually exclusive with the partitive reading. The Partitive Constraint
may thus in fact be part of the structural description of the semantic rule that differentiates between these
two readings of the N' complement.
122 RAY JACKENDOFF
If (5.41) were the structure for a bunch of men, we would expect Extraposition from
NP to apply in precisely parallel fashion. However, it does not:
On the other hand, structure (5.42) correctly predicts the unacceptability of (5.46),
since it claims that, for example, leftover turkey is not an N'" in the N' complement.
Furthermore, structure (5.42) predicts a different result. It requires a rather large
number of books by famous authors to have this structure:
(5.47) N'"
N"
Hence (5.42), not (5.41), predicts the correct application of Extraposition from NP.
A third argument concerns the interpretation of relative clauses. Selkirk points out
that in (5.49a) the relative clause can apply to either a number or those daffodils, and
the interpretations are quite distinct, whereas in (5.49b) there is only one interpretation.
(5.49) a. She bought him a number of those daffodils, only two of which were
faded.
b. She bought him a number of daffodils, only two of which were faded.
5: NP SPECIFIERS
This difference is easily accounted for if a number of those daffodils includes two N'"
to which relative clauses can be attached, but a number of daffodils contains only one
such N'". This is predicted by the theory that a number of those daffodils has structure
(5.40) and a number of daffodils has structure (5.42). It is not predicted by the theory
that a number of daffodils has structure (5.41).
We conclude therefore that a group of men is not a true partitive. The syntax
does in fact generate the string ambiguously, with partitive structure (5.41) and
pseudopartitive structure (5.42), but (5.41) is ruled out semantically by the Partitive
Constraint.
Since pseudopartitives contain an N'" in the N" specifier, there is nothing to stop this
subordinate Nfrom having its own specifiers. Thus the phrase structure rules cannot
be prevented from generating such ungrammatical combinations of specifiers as these:
* John’s John’s
all a
(5.50) i > < > group of trees
some every
_ N'" those Nw _N"' that -N' -
' y ^ y -
It is clear that only one Nspecifier may be allowed to precede the group noun. The
problem is whether the permitted specifier is always that of the matrix N'", always that
of the subordinate N'", or sometimes one and sometimes the other.
First consider articles. It appears that when a group noun in a pseudopartitive is
preceded by an Art'", it is always the Art'" of the group noun, not that of the matrix
N'". To see this, notice that when the group noun is singular and the head noun is
plural, an Art before the group noun must be one that permits a singular head.
a
every
(5.51) aJ > group of people
this
that
all
we
you group of people
these
those
On the other hand, if the group noun is plural, the Art is one that permits a plural head.
all
we
(5.52) i you groups of people
these
Those
124 RAY JACKENDOFF
Since the number of the group noun controls the choice of articles, the assumption that
the article is a specifier of the group noun keeps the statement of these particular
agreement constraints the simplest.
Similarly, where quantifiers precede a group noun in a pseudopartitive, they
appear to quantify over instances of the group noun rather than over the head noun.
For example, all groups of people quantifies over instances of groups, not of people.
Compare this to all three people, which quantifies over instances of people (of whom
there are three), and which we have assumed has all in the N'" specifier of people. If
this semantic difference is to be represented syntactically, all in all groups of people
must be a specifier of groups.
Thus the evidence seems to indicate that an NP containing a group noun in its N"
specifier cannot have an Art'" in its N'" specifier. Can it have an N'" instead? The
answer seems to be yes. To see this, consider group nouns such as number and
quantity which place restrictions on their specifiers. When used in genuine partitives,
they cannot be used with a possessive:
This difference can be explained if number and quantity exclude possessives from their
N"' specifiers: in the true partitives (5.53), the possessives will have to appear
improperly in the specifiers of number and quantity, as in (5.55a), but in (5.54) they
may be in the specifier of objections and wine, as in (5.55b).
(5.55) a.
ft
N"' N
John’s
11 I have no explanation for the fact that (5.54) becomes much worse if the adjectives are omitted. Note
that this paradigm is parallel to the one with N" quantifiers, since we have *John's many of those objections,
like (5.53), but John's many objections, like (5.54).
5: NP SPECIFIERS
b. N'"
Thus it is possible to have a genitive in the matrix N'" specifier. There also seem to
be cases where a genitive in the group-noun specifier is necessary. The noun blame
does not permit a genitive specifier, since, for example, only the blame for the accident
is possible, not *Fred’s blame for the accident. Nonetheless, the pseudopartitive
construction with blame does permit a genitive, since Fred’s share of blame for the
accident is possible. The simplest account of this seems to be to attribute the genitive
to the specifier of the group noun:
(5.56) N'"
Fred’s N'
share
Thus, in pseudopartitives, only the group noun may contain an Art"', but
apparently either the group noun or the head may contain a genitive. Other than one
126 RAY JACKENDOFF
combination which will be useful in the next section, I leave precise statements of these
curious restrictions for future research.
The one combination of interest is this: just in case the matrix NP has an N'"
specifier, as in (5.54), the group noun is left with no surface specifier at all. This is a
little strange, since singular count nouns like number and quantity normally require
some specifier. One way to make the situation a little better is to claim that the
underlying form of (5.54) is John's [N»a rather large number] of objections, Fred's [N™«
substantial quantity] of wine, with the indefinite article as the specifier of the group
noun. Then the following local transformation would delete the indefinite article:
(5.57) Pseudopartitive a-Deletion
[n„,N"' - a - X] =>1-0-3
OBLIGATORY
Thus we can claim that the group nouns in (5.54) do have the specifier they require in
underlying form, but it is deleted.
This solution does not deal with the possibility of other specifiers of the group
noun cooccurring with a matrix genitive. I assume they are eliminated by a surface
constraint which is necessary anyway to eliminate such combinations as two genitives
in a row. Thus a is the only group noun specifier cooccurring in underlying structure
with matrix genitives, because it is deleted and hence the surface constraint is satisfied.
To sum up our discussion of pseudopartitives, we have argued that the N" specifier
contains an N'" position which alternates with the Q'" position there. It may be
occupied by group nouns which are followed by the specified grammatical formative of.
Furthermore, the group noun may contain its own specifiers, and there is a complicated
set of constraints coordinating the group-noun specifier and the matrix N'" specifier.
5.5. Numerals
We have so far neglected the syntax of numerals. They come in two varieties, which I
will call cardinals and seminumerals. Cardinals are words like three and seven, which
need not be preceded by an article; seminumerals are words like dozen and hundred,
which require an article before them.
The grammar of numerals is somewhere in between that of quantifiers and that of
group nouns. This section will argue that numerals are nouns which trigger certain local
transformations whose effect is to make the superficial behavior of numerals closer to
that of quantifiers.
This subsection will demonstrate the similarities of numerals to group nouns and
quantifiers, by showing how numerals behave in paradigms discussed in previous
sections.
First, like quantifiers and pseudopartitive group nouns, numerals appear between
N"' specifiers and A'"s.
5: NP SPECIFIERS
John’s
|three
(5.58) these old trees
[dozen
all
This suggests that they are in the N" specifier, either as quantifiers (5.59a) or as group
nouns (5.59b).
Numerals also occur in partitive constructions, with structure (5.60a) if they are
quantifiers and (5.60b) if they are nouns.
(5.61)
(three I ^ I some men
a dozen J [all men
as does the constraint on quantifiers and group-noun partitives that if they are definite,
a relative clause is required.
128 RAY JACKENDOFF
three
(5.62) a. *the of the trees
(dozen
I three
b. the of the trees that weren’t cut down
[dozen
(5.63)
[Three j
weren’t cut down of the trees you planted.
[a dozen J
By contrast, in the simple construction (5.58) (as in the parallel cases with quantifiers
and group nouns), Extraposition from NP applies not to the noun being counted, as in
the partitive, but to a PP one level further down:
f Three 1
(5.64) a. * A ^ r weren’t cut down (of) trees you planted.
Three j
b. pieces were eaten of the leftover turkey.
A dozen J
Thus numerals parallel the behavior of quantifiers and group nouns. If they are
quantifiers, the simple use (5.59a) is straightforward, and in the partitive use (5.60a)
they govern the Partitive Projection Rule. If they are nouns, the partitive use (5.60b) is
straightforward, and the simple use (5.59b) is like a pseudopartitive (though lacking the
of). The next question we will address is which of these analyses is correct. So far the
analysis of numerals as quantifiers looks more promising, since the paradigm (5.59a)-
(5.60a) is just like that of many and few. However, the next subsection will argue that
they are nouns.
The argument that numerals are nouns is based on their specifier system. The
quantifiers many, few, and little can be preceded by degree words such as so, as, too,
and how, chapter 6 will explore this specifier system in considerable detail. Numerals,
of course, can never be preceded by degree words:
as I three
(5.65)
too [dozen
how
On the other hand, nouns such as group, bunch, and number are preceded by typical
nominal modifiers such as articles and adjectives; if numerals are nouns, they should
have such specifiers too.
The fact that the seminumerals dozen and hundred always require a nominal
specifier (most often a) is a hint that we should look for other evidence that numerals
are nouns. The clearest evidence comes from a construction little noted in the literature
5: NP SPECIFIERS
The phrases are close syntactic parallels to group noun pseudopartitives like (5.67).
(5.69) a. N'"
weeks
121 have no explanation for why good examples of partitives like (5.68a) are so hard to find.
130 RAY JACKENDOFF
If numerals are indeed nouns, the next step in the analysis must be to account for
the differences between numerals and group nouns; the next subsection will deal with
this problem. But first it is worth mentioning two special cases, one and few.
We have repeatedly alluded to two different morphemes spelled one, a numeral
and a pronoun. The numeral can govern a partitive and cannot be pluralized; the
pronoun allows no N' o/-complement and has the plural ones. If the numeral one were
like other numerals in being a noun, we would expect, parallel to (5.66), the
ungrammatical *a beautiful one day. In order to prevent this, we will claim that the
numeral one, unlike two, three, etc., is a quantifier and hence takes no nominal
specifiers. The pronoun one, on the other hand, takes nominal specifiers but is not a
group noun, so it cannot occur in structures (5.69a,b) either.
Section 5.2.1 hinted at the distinction between few and a few. We can now spell
it out more clearly. A few, because of its indefinite article, can be identified as a
seminumeral, hence a noun. As confirmation, we note that it can be preceded by
adjectives, e.g. a miserable few people showed up. On the other hand, few is a
quantifier, as can be seen by its ability to take degree words {so few, as few, etc.) and
adverbs (e.g. We met miserably few people). Similarly, a little is a seminumeral and
little is a quantifier.13
We now turn to the adjustment rules which distinguish cardinals, seminumerals,
and group nouns from each other.
First let us distinguish cardinals from seminumerals. Seminumerals are like group
nouns in that they are always preceded by a specifier, in the unmarked case the article
a. (5.66) and (5.68) showed that cardinals also can occur with articles if an adjective
intervenes. To assimilate cardinals to the normal case of group nouns and seminumer¬
als, we will assume that cardinals too always have an article in underlying structure,
but that it is deleted by the following lexically governed local transformation:
a
-r n
_+Card
2
OBLIGATORY
According to this analysis, the underlying structure of two weeks is a two weeks,
parallel to a dozen weeks and a bunch of weeks (we deal with of in a moment); (5.70)
deletes a before two. A beautiful two weeks, parallel to a beautiful dozen weeks and a
beautiful bunch of weeks, will not be affected by (5.70) because of the adjective
intervening between the article and the numeral. Similarly, in That two weeks we spent
in Bermuda was awful, the singularity of the article shows it is a specifier of the
13 One exception to this clear distinction is a very few, which has both a nominal and a quantifier
specifier. 1 have no explanation.
5: NP SPECIFIERS
numeral; since the article is not a, it does not delete. (5.70) thus accounts for the
difference between cardinals and seminumerals.
Next we must distinguish numerals from group nouns. As pointed out in section
5.5.1, they differ in that group nouns but not numerals are followed by of when they
occupy the N" specifier. Under the assumption that the pseudopartitive of is an
underlying specified grammatical formative (a point that will be somewhat justified in
section 6.2), the simplest way to account for this difference is to posit another local
transformation, governed by all numerals:14
six
a bunch hundred
(5.72) of weeks a +Num of weeks
of weeks . +Num
_+Card.
apply 1 1 I
(5.71) n.a. a hundred weeks a six weeks
apply 1 1 i
(5.70) n.a. n.a. six weeks
There are a few lexical doublets that show how superficial the presence of the
feature [+Num] is. In my dialect, the word couple alternates freely between group
noun and seminumeral status, since I can say either a couple of trees or a couple
trees.15 In the majority dialect, dozen is a seminumeral; but there are dialects which
allow a dozen of eggs, where dozen is a group noun. Though hundred, thousand, and
million are seminumerals, hundreds, thousands, and millions are group nouns; thus we
have the contrast of six million men (numeral) vs. the somewhat fusty six millions of
men (group noun). In the present analysis, these differences follow solely from the
presence or absence of the exception feature [ + Num],
These two local transformations, Cardinal o-Deletion and Numeral o/-Deletion, are
sufficient to account for the differences between numerals and group nouns observed in
14 Under any other assumptions about how the pseudopartitive of is introduced, the exception feature
[ + Num] will play a similar role in preventing of from occurring in the surface. I assume there is a separate
idiosyncratic 0/-Deletion rule applying in the partitives all of the men, both of the men to form all the men,
both the men.
15 There is also the intermediate reduction a coupla trees, occurring with other group nouns as well, e.g.
a buncha trees. This reduction depends on of being a daughter of N". Notice that in the following example,
the reduction is impossible:
The reason is that the final NP in this S is the result of N'-Gapping and hence has the structure [N4N»a couple
[n,PRO of Bill]]). The presence of PRO inhibits the reduction of of.
132 RAY JACKENDOFF
section 5.5.1. To complete our account of the local rules applying to numerals, we
observe that Pseudopartitive a-Deletion, developed in section 5.4.2, applies also to
seminumerals, since the derivations of John’s bunch of cows and John’s dozen cows
are precisely parallel in this respect:
(5.73) N'"
Art'" N" N
a N' cows
{bunch
dozen
(a—Deletion)
I bunch of]
> John’s
(o/-Deletion)
1!dozen ofj
bunch of I
cows
However, there is a difference between group nouns and numerals. Section 5.4.2
showed that when a group noun in a pseudopartitive is preceded by an article, the article
must be in the specifier of the group noun, not in the matrix NP. For example, that
group of men has singular that in the specifier of group; but *those group of men, with
plural those in the specifier of men, is ungrammatical. Numerals apparently lack this
constraint, since we have both that three weeks, with that in the specifier of three, and
those three weeks, with those in the specifier of weeks.16
10 As further justification for these structures, notice the distinction between every three weeks and all
three weeks. Every requires a singular head, and thus cannot be in the specifier of weeks. Moreover, the
meaning of every three weeks is not ‘every one of three weeks’ but something like ‘every group of three
weeks’: every quantifies instances of three. Thus the evidence points to every being in the specifier of three,
like that in (5.74a). By contrast, all requires a plural head, and in all three weeks it quantifies instances of
weeks. This means that it must be in the specifier of weeks, and hence that all three weeks has structure
(5.74b).
5: NP SPECIFIERS
N N
three three
Now since seminumerals normally appear at the surface with their own determiners,
we would expect, parallel to (5.74b), *those a dozen weeks. Instead, we find those
dozen weeks. Apparently Pseudopartitive a-Deletion must be extended so as to delete a
in this configuration as well.
In the revision of Pseudopartitive a-Deletion, the feature system collapses Nand
Art'" in the first term of the rule:
(5.75) Pseudopartitive ^-Deletion, revised
+ Subj
-Obj -a - X =>1-0-3
_ + Det _ -
LN'"
OBLIGATORY
Thus the underlying structure of those dozen weeks is those a dozen of weeks, which
undergoes Pseudopartitive a-Deletion and Numeral o/-Deletion to reach the surface
form.
This completes our discussion of numerals. We have shown that, despite their
similarities to quantifiers, their specifier structure proves them to be nouns very much
like group nouns. Most differences between them and group nouns, and between
cardinals and seminumerals, are accounted for by simple local transformations.
5.6, Summary
Articles
those which
Semantically demonstratives: a, the, this, that, these, what, we.
+ Part. ’ .+ Part _
yOUpiur
Semantically quantifiers (all [ + Partitive]): any, all, no, some each, every, either,
neither, both
Quantifiers (all [ + Partitive]): many, much, several, few (negative), little (negative), one
(nonpluralizable)
Nouns
Group nouns: group, gallon, bunch, number, lot, score, hundreds, thousands, millions,
couple (some dialects), dozen (some dialects)
Seminumerals ([ + Num, -Card]): hundred, thousand, million, few (nonnegative), little
(nonnegative), couple (some dialects), dozen (most dialects)
Cardinals ([ + Num, + Card]): two, three, etc.
Pronouns: we, you, one (pluralizable), PRO
The following two phrase structure rules are responsible for the NP specifier
system:
+ Subj
(5.76) N'" —M -Obj ) - N"
_+Det
+ Subj
(5.77) N" ( -Obj ) _ (A'")* - N' - . . .
Det _
In (5.76), the feature matrix can be expanded as N"', which becomes a genitive, or as
Art'", which is a demonstrative or N"' quantifier. In (5.77), the feature matrix can be
expanded as N'", which is a group noun or numeral in a pseudopartitive construction,
or as Q"', an N" quantifier.
(5.78) N'"
Certain configurations generated by applying these phrase structure rules freely are
filtered out by constraints we have mentioned. The combination of an N'" quantifier
and an N" quantifier is ruled out by the Specifier Constraint. The combination of an
Art'" in N'" and a group noun in N" is ruled out by a constraint justified in section 5.4.2;
5: NP SPECIFIERS
but an Art'" in N'" followed by a numeral in N" is not ruled out, as we showed in
section 5.5.3. Further constraints are necessary and have been left for future research.
Partitive constructions take one of these three forms:
PRO PRO
The Partitive Constraint restricts the possible specifiers of the complement N'" to those
which contain a genitive or demonstrative. In (5.79a,b), PRO is interpreted by the
Partitive Projection Rule, which gives PRO the reading UNIT or AMOUNT.
We have developed four local transformations. The first, Substantivization,
changes the form of various articles and possessive pronouns when they are directly
followed by PRO. For example, no becomes none, every becomes every one, and my
becomes mine. This rule applies regardless of the interpretation of PRO; thus the
changes can be observed not only in partitive constructions such as (5.79a), but also in
instances of N'-Anaphora, N'-Gapping, and postposed genitives.
The second local transformation. Pseudopartitive a-Deletion, deals with situations
in which a group noun or seminumeral occupies an N" specifier, and the N'" specifier is
136 RAY JACKEN DOFF
nonempty. This rule deletes the indefinite article of the group noun or seminumeral,
giving derivations such as these:
hundred days
The other two local transformations deal with the differences between group
nouns, seminumerals, and cardinals. We assume all three of these have underlying
articles, and that all are followed by of when in the N" specifier (pseudopartitive
position). Numeral o/'-Deletion eliminates the of after all numerals, giving contrasts
like a pair of men (group noun) vs. a few men (seminumeral). Cardinal a-Deletion
deletes a immediately before cardinals, giving contrasts like a couple weeks (seminu¬
meral) vs. two weeks (cardinal); the rule does not apply if there is an intervening
adjective, so that the article appears both with seminumerals (a beautiful few weeks)
and cardinals {an ugly two weeks).
The relevance of all this to the general theory of phrase structure presented here is
this: first, the NP specifier system has long been an area subject to wild unsystematic
proposals about phrase structure. We have shown that it can be analyzed quite
consistently within the highly restrictive theory of phrase structure presented in chapter
3, and that its irregularities are accounted for either by local transformations or by
nonsyntactic means.
Second, we have seen that the specifier system is much less consistent in its
semantics than the complement system. Furthermore, a reader of this chapter who is
concerned with semantics will have seen how little is really known about the semantics
of specifiers, despite nearly a century of study of the formal logic of quantifiers and
demonstratives. It is hoped that our systematic survey of the syntactic possibilities of
specifiers will serve as a source of irritation to those in linguistic semantics who think
they know something about quantification.
6: Specifiers of X
6.1. Introduction
This chapter presents an analysis of the specifiers of degree, developing two major
cross-categorial phrase structure rules for specifier systems which introduce measure
phrases and quantifier phrases in X" and degree phrases in X'". We will show that the
minor lexical categories Q and Deg enter into cross-category generalizations with major
categories, justifying the use of Q'" and Deg'" predicted by the Uniform Three-Level
Hypothesis of phrase structure.
We draw heavily on Bresnan’s (1973) lengthy and insightful discussion of specifier
systems, which makes copious use of cross-category generalization and the X Conven¬
tion. We will propose a number of improvements on Bresnan’s analysis which in turn
confirm the general theory of phrase structure adopted here.
The general procedure behind the analysis is the same one we followed in chapter
4. A particular construction found in one category is checked for its occurrence in other
categories. An analysis is then developed consistent with the facts in all categories and
with the phrase structure schema. In particular, several crucial hypotheses are verified
here by reference to PP and its specifiers, to which Bresnan does not appeal, and which
turn out to be a surprisingly rich source of evidence.
We will build up the specifier system piecemeal, starting with measure phrases,
then going on to quantifier phrases. There follows a detailed discussion of the
occurrence of degree phrases and how recursion takes place in their specifiers.
(6.2) a. The pool is sixty feet long and thirty feet wide.
b. Six feet tall you’ll never be!
c. His house is to the left and six miles down the road.
d. Four hundred yards up the street they encountered a panther.
(6.3) a. This mixture is two parts alcohol and three parts water.
b. The book was (one) half garbage.
Expressions like two parts alcohol could be little but NPs, since they are made up of a
numeral and two nouns; but they may be used only in predicate positions, e.g. after be,
seem, stay, and so forth. Thus there are two uses of measure phrases in the specifier of
NP, with different interpretations. If the of in the use first mentioned is base-generated,
we can provide a deep structure cue for the difference in interpretation.
Where should the measure phrase be attached within NP? It precedes adjectives,
which are in N": two gallons of ordinary water, three parts clear alcohol, not *ordinary
two gallons of water, *clear three parts alcohol. Thus it is either in N" or N"\ If we
attach it to N", we automatically account for the fact that measure phrases do not
receive the Poss invariably attached to N'" in the N'" specifier. Thus we arrive at the
structures (6.4a,b).
6: SPECIFIERS OF X
Given (6.4) as the structure for measure phrases in NP, we are constrained by the
X Convention to propose structures (6.5a,b) for (6.1a,b). We then have the partial
phrase structure rules (6.6) and a grammatical relation (6.7).
(6.5) a. A'"
A"
N'" A'
two feet A
long
The projection rules for measure phrase will require that it consist of a quantified count
noun, with further selectional restrictions on the noun depending on the nature of the
X' to which it is a sister. The difference between the two interpretations of measure
phrase in NP will depend on the presence or absence of the specified grammatical
formative of.
The major category which is still missing in the paradigm above is V. There is
nothing preceding the verb which looks like the constructions in (6.1)—(6.4), but
measure phrases do appear postverbally.
Note that the measure phrase appears after a PP in both of these examples, a new
position for NP. In fact the measure phrase must appear after all strictly subcategorized
phrases (barring Heavy NP Shift), as shown by (6.9).
1 The measure phrase in (6.8b) is intuitively quite close to the PP for three years, and it is quite natural
to assume that it is the result of a For-Deletion. On the other hand, since three times does not appear to be
the result of a reduction, there is a base position available for three years, and we might as well make use of
it, sparing ourselves a rather idiosyncratic transformation.
140 RAY JACKENDOFF
The position of the measure phrase is thus the same as that of postverbal manner
adverb, so it must be a daughter of V", generated by the partial phrase structure rule
(6.11).
(6.10)
N'"
Charlie 1ST"
three times
The posthead position of the measure phrase appears also as a second option in
PP, as in down the road three miles, up the river a way. By the X Convention, this
must be attached to P", parallel to V", and there must be a partial phrase structure rule
(6.12).
(6.13) A measure phrase is an N'" immediately dominated by X". (In the Aspects
notation, [N"\ X"].)
This section has shown, then, that NPs with the function of measure phrases occur
in the X" of all major lexical categories, providing another significant cross-category
generalization. We next show that these NPs alternate with QPs in the same position.
Section 5.2 claimed that the quantifiers many, few, much, little, and several are
daughters of N". They differ from the articles all, each, every, some, etc., which are
daughters of N'" and cannot be preceded by genitives or other articles. The operative
phrase structure rule was (6.14).
Since the N" quantifiers do not cooccur with measure phrases, we can immediately
generalize (6.14) with (6.6a) to form (6.15).
+ Subj
-Obj
(6.15) N"->( - ((of))) - (A'")* - N' - . . .
— Det
( + Comp)
In (6.15), the realization of the constituent broken down into features has two
possibilities. If it is [+Comp], the features spell out N'" and the of is possible. If it is
[-Comp], the features spell out Q'" and the of cannot occur.
This generalization of measure phrase and Q'" carries over to the other major
categories. In sentences, we find QPs postverbally, as in You talk too much. Since
these QPs are not strictly subcategorized, they belong in V", so we can generalize (6.11)
to (6.16).
+ Subj
(6.16) V" V'-( -Obj )
Det
In PPs there are quantifiers far and long (as in far down the road and long after the
accident) which alternate with measure phrases, and which, as we will see, take the
same modifiers as many and much. This justifies generalizing (6.6c) to (6.17).
+ Subj
(6.17) P" ( -Obj ) - P'
Det
The QP, like the measure phrase, may follow the P' as well, as in down the road too
far. This use is subject to the restriction that the QP must contain a specifier of its own,
142 RAY JACKENDOFF
since *down the road far is out. I have no explanation for this restriction; for present
purposes I will conveniently ignore it, generalizing (6.12) to (6.18).
+ Subj
(6.18) P" P' - ( -Obj )
Det
Having dealt with the generalization of measure phrase and QP in N", V", and P",
we turn to A". The situation in A" is more complex than the other three cases, and
presents a difficult choice. Bresnan (1973) chooses one possibility, which, as we will
see, has repercussions for her entire analysis of the degree system. We will show that a
different choice provides a more satisfactory analysis throughout. The rest of this
section will outline the problem, and its thread will be repeatedly picked up in the rest
of the chapter.
The problem is this: on the whole adjectives do not take simple quantifiers; for
example, *much long and *little long are ungrammatical. However, in the comparatives
more beautiful and less beautiful, more and less appear to be the comparative forms of
the quantifiers much and little. The question is how to account for this curious
alternation.
Bresnan describes it by claiming that much deletes obligatorily before adjectives,
leaving only its modifiers. For example, underlying as much beautiful becomes as
beautiful', underlying much beautiful becomes beautiful. In more beautiful, much
becomes more by affixation of the comparative -er\ since it is no longer the
phonological much, it does not delete.
One immediate difficulty with Bresnan’s account is the nonexistence of *little
beautiful, since a deletion of the quantifier little here could hardly result in anything
with the expected meaning. Yet since less beautiful is to be derived from underlying -er
little beautiful, some way must be found to rule out the noncomparative form.
There seem to be two ways out of this difficulty. First, Bresnan could be right, and
all adjectives could take quantifiers, but various deletions and semantic constraints
could prevent all forms but the proper comparative forms. Under this theory, (6.6b)
would immediately generalize to (6.19), parallel to (6.15) and (6.17).
+ Subj
(6.19) A ( -Obj A'
.-Det
much
(6.20) a. You and I are little 'different from Bill.
far
much
b. You and I are alike.
little
In Bresnan’s theory, these adjectives must be lexically marked to take optional rather
than obligatory A/wc/i-Deletion. However, this fails to explain why little and far can
occur with these adjectives and not with others.
Under the alternative view I have proposed, there is another account: these
adjectives are exceptions to the generalization that adjectives do not take quantifiers.
Such exceptionality is not too surprising, in view of the fact that adjectives that permit
measure phrases must also be lexically marked and are almost equally rare (high, wide,
long, tall, old, thick, broad, and perhaps a few others). In order for these exceptional
adjectives with quantifiers to exist, the phrase structure rules must allow a quantifier,
and thus the generalization of measure phrase and QP in (6.19) is motivated.
This view of QP in AP eliminates all need for Bresnan's Mwc/i-Deletion transfor¬
mation, along with the difficulties it poses for the occurrence of little and far. Rather,
the few cases where there is surface evidence for a QP are taken as exceptions in
underlying structure, provided for by general phrase structure rules. One would hope
that the exceptionality ultimately proves to be semantically motivated, leaving the
syntax and lexicon maximally simple.
Whether we take Bresnan’s view or the present one on the presence of quantifiers
in A", this section has shown the need for phrase structure rules (6.15), (6.16), (6.17),
(6.18), and (6.19). We thus see that the base rules for measure phrases and quantifiers
are fully general among the major lexical categories, aside from the problem of order in
V" and P". The parallelism of the X" specifiers justifies us in considering all these rules
to be special cases of a single phrase structure rule. Furthermore, the recurring
parallelism of NP and QP in the X" specifier is evidence for the feature analysis that
relates the lexical categories N and Q.
There are a number of words we will classify as Deg which, among other things,
precede the quantifiers of the preceding section. (6.21a) illustrates them in QP within
N"\ (6.21b) in QP within P"\ and (6.21c) in QP within V"'.
144 RAY JACKENDOFF
how
this
that I far along the road I
b.<
so [ long after the accidentj
too
as
(this 1
that I much I
so [little J
c. He talks too
as {iT.tlei as Bi"
\
This section will develop the phrase structure rule generating them, showing that it
too is subject to an interesting cross-category generalization.
There is little strong evidence for where the degree words in (6.21) are attached to Q"\
However, the X Convention suggests the hypothesis that they are Q'" specifiers. This
accounts for the parallelism in the use of this and that, which as specifiers in N'" are of
the category Art but with similar demonstrative meaning. Under this theory, the N'"
specifier will then contain a [ + Subj, — Obj, -Comp, +Det]'" and the Q"' specifier will
contain a [-Subj, -Obj, -Comp, +Det]"'. The lexical items this and that will either
occur in both categories, differing by the feature Subj, or, preferably, they will simply
be unmarked for this feature. This parallelism between Deg and Art is the reason for
assigning Deg the feature analysis proposed in chapter 3. A further bit of evidence for
the parallelism arises in section 6.4.4.
Bresnan suggests a different attachment of degree words, to what in our terminol¬
ogy would be Q". Section 6.5 will call into question the validity of her argument for this
structure. Anticipating the refutation of her argument, I will adopt the solution
suggested by the X Convention and generate degree phrases in QP with the partial
phrase structure rule (6.22). Its relationship to the partial phrase structure rule (6.23)
for N'" is obvious.
N'"
the road
many far
much
146 RAY JACKENDOFF
There are three questions that must be dealt with in implementing this structure.
The most crucial is how the degree phrase is applied to A'" and Adv'". The next
subsection deals with this. There are also consequences for the treatment of the words
more, less, and enough, which we turn to in sections 6.4.3 and 6.4.4.
that
how
The X Convention suggests that the degree phrase be generated in the A'" and
Adv'" specifier, as in (6.26), so that the rules generalize with (6.22) and to a lesser
extent with (6.23).
(6.26) a. A'" -> (Deg'") - A"
b. Adv'" (Deg'") - Adv"
This analysis thus claims that adjectives appear with the same degree words as much
because of the similarity of the phrase structure rules (6.22) and (6.26).
Bresnan expresses this generalization a different way: by quantifying adjectives
with a much which is modified by a Deg'". This much is deleted transformationally,
leaving only the degree phrase at the surface. Section 6.3 pointed out the difficulty in
this view: it requires all adjectives to allow a QP, thus leaving unexplained the
distribution of the quantifier little. Thus Bresnan’s means of expressing the generaliza¬
tion is questionable, whereas in the present analysis, such a distribution is an automatic
consequence of the proposed base structure
A complication Bresnan points out turns out to follow immediately from the
present theory. She observes (note 4, p. 278) that those adjectives such as alike and
different that occur at the surface with much have two possible forms with degree
words, for instance as different and as much different. She uses this as evidence that
Much-Deletion is optional with these adjectives. The present theory permits a more
direct account, using phrase structure rules alone: the two base forms in (6.27).r
2 A third possibility would attach Deg'" to A"' and Q"' to A". I have no arguments against such a
structure, except for the possibility that combinations of quantifiers and degree words as independent
specifiers may be excluded by some generalization of the Specifier Constraint of section 5.2. I know of no
cases where both are needed; but I understand the semantics of degree phrases and quantifiers sufficiently
poorly that I am loath to make a strong commitment ds to how the unnecessary combinations are to be
eliminated.
6: SPECIFIERS OF X
(6.27) a. A'" b. A
much
' little ’
v
far -/
Hence the alternation follows not from a transformational exception, but from the
unusual subcategorization possibilities of these adjectives. This provides further
evidence that Deg'" is generated in A"' and Adv'" as well as in Q'", yielding a further
cross-category generalization in the specifier system.
Section 6.3 pointed out that the hypothesis that most adjectives do not take quantifiers
has as a consequence that more in more beautiful cannot be the comparative of the
quantifier much. Let us suppose instead that it is a Deg in the A'" specifier, parallel to
as, too, so, and how in (6.27a).
If more is a Deg, though, we expect it also to occur in the Deg"' within a Q"\
giving the incorrect forms *more many, *more much, etc. We correct this problem by
providing the idiosyncratic spelling rules (6.28) to yield the existing forms.3
3 Note that these rules are not a disadvantage with respect to Bresnan’s theory, since she needs similar
rules to attach her comparative morpheme -er to quantifiers:
-er - much => more
-er - many more
etc.
148 RAY JACKENDOFF
The theory that more is a Deg thus produces structures such as these
(6.29) a. A'"
beautiful
b N'"
many
v-
more
6: SPECIFIERS OF X
P"
far
-
farther
As a consequence of the theory that more in more beautiful is not the comparative
of much, less in less beautiful cannot be the comparative of the quantifier little either.
Rather, less must also be able to be a Deg, and less beautiful is then identical in
structure to (6.29a).
We further predict that less should be applicable to quantifiers as well as to
adjectives, substituting for more in constructions like (6.29b,c). One such possibility
exists—less far, as in less far down the road; but there are no *less much, *less many,
*less little, or *less few. Thus, while Bresnan’s theory must find a way to generate less
far and eliminate *little beautiful, the present theory must find a way to rule out the
other four less-Q combinations. The four bad forms, if they existed, would presumably
be synonymous with the existing forms more little > less, more few > fewer, more
much > more, and more many > more, respectively, which somehow seem “less
negative”. This fact may be useful someday in giving an interesting account for what
must remain for now an unprincipled filter.
Now consider the rule that forms comparative adjectives such as taller, presum¬
ably from the source more tall. As is well known, this rule applies only to a particular
class of lexically marked adjectives, mostly monosyllabic. In the present theory it
applies to this structure:
150 RAY JACKENDOFF
(6.30) A
tall
taller
Observe that this configuration is precisely parallel to that in the QPs of (6.29b,c). Thus
the rule forming comparative adjectives generalizes via the X Convention with the rule
forming comparative quantifiers.
In Bresnan’s system, there is of course also a rule that turns more - A into A-er.
However, in her theory the more - A rule attaches the quantifier more to the head and
thus has little to do with the rule forming comparative quantifiers, which attaches the
Deg -er to the head (see footnote 3, p. 277). Thus the present theory expresses a further
generalization missed by Bresnan’s analysis, justifying the assignment of more to the
category Deg, and further justifying the absence of QP in most APs.
As a final note, observe that all the arguments given here for the comparative Deg
more apply equally to the superlative Deg most, which also occurs in the specifiers of
A'", Adv'", and Q'". Like more, most undergoes spelling rules that attach it to
quantifiers and lexically designated adjectives; the environment is exactly the same as
with more, so the rules generalize. The word least, like less, occurs in the specifier of
A'" and Adv'" and is therefore also a Deg. Since least does not appear comfortably with
any quantifiers at all (even *least far is unacceptable), we can assume either a
generalization of the filter for less or some sort of semantic constraint. Thus the
superlatives behave almost precisely like the comparatives.
6.4.4. Enough
We now turn to the most syntactically exceptional word in the specifier system.
enough. It provides a great deal of evidence for the analysis of preceding sections.
6: SPECIFIERS OF X
r ~\
so
(6.31) a. beautiful enough / ' as ' beautiful
too
so
b. quickly enough / 'jas ' quickly
too
so
I far
as
C' {fewJen0Ugh/ 1 Ifew
too
Bresnan proposes a local transformation to account for this, moving enough around its
head. In present terms, this rule can be stated as (6.32), where X° denotes a lexical
category.
(6.32) Enough-Shift
enough - X° => <t> - 2 + I
OBLIGATORY if X° = A, Adv, Q
OPTIONAL if X° = N
I much
Bresnan treats enough as a Q, on the strength of similarities like
[enough
pudding and )Ugh J men• Small enough is taken to have the underlying structure
enough small, undergoing Enough-Shift; the parallel *much small becomes small by
obligatory Much-Deletion.
Consider, however, the alternative view that enough is a Deg. This would explain
its cooccurrence with quantifiers in expressions like far enough down the road, little
enough pudding, and few enough men, which would then have the following struc¬
tures.
152 RAY JACKENDOFF
P" N"
enough Q'
Q Q
far little]
few j
Since Bresnan points out (note 3, p. 277) that her analysis is unable to generate these
forms, we have already found a significant advantage to our analysis.
An apparent difficulty for our view that enough is a Deg is that it predicts the
existence of the forms *many enough men and *much enough pudding, parallel to
(6.33b). To correct this discrepancy, we introduce a rule to delete many and much after
enough ,4 changing these incorrect forms to enough men and enough pudding. This rule
is much more restricted than Bresnan’s very general rule of Much-Deletion, since it
localizes the irregularity in a particular combination of lexical items.
OBLIGATORY
Thus enough men and enough pudding have the structure (6.35).
4 This assumes the rule precedes Enough-Shitl. Alternatively, it could delete many and much before
enough, if it were ordered after £>iowg/!-Shift. The choice does not seem crucial.
6: SPECIFIERS OF X
(6.35) N'
N"
| many] ^
[ muchj
If the Q in (6.35) is changed to few or little, we get the forms few enough men and little
enough pudding by Enough-Shift. If the Deg is changed to as, we get as many men and
as much pudding, since neither Many-Much Deletion nor Enough-Shift applies.
Bresnan’s arguments that enough is a Q are based on a large number of parallels
between it and more, which, as we mentioned, she takes also to be a Q, the
comparative of much and many. However, our analysis also accounts for these
parallels. There are two relevant cases. In the specifiers of adjectives and adverbs,
enough and more are both Deg'" attached to X'", accounting for one set of parallelisms.
In the specifiers of nouns, prepositions, and verbs, however, surface enough and more
sufficiently long for us to use it); so governing a degree clause is not a sufficient
condition for something to be a Deg. But in fact, there are no other quantifiers that
govern complements, so enough would be in any event an unusual quantifier, though
not an atypical Deg. In our analysis, then, the fact that enough governs a complement
is not unexpected.
The present analysis also has a much more striking advantage. If enough is a Deg,
it can be made to function also as an Art by eliminating the feature [ — Subj] from its
lexical representation. Under this assumption, it would resemble the bicategorial this
and that. It could then enter into two possible structures within NP, (6.35) and (6.36).
(6.36) N'"
pudding
Now suppose that Enough-Shift, like other transformations, is subject to the condition
of the strict cycle, i.e. it can apply only on the lowest possible X'" domain in which
enough appears. Then it will be unable to move enough around pudding in (6.35),
because of the Q'" dominating enough. On the other hand, in (6.36) the rule can apply
to yield pudding enough. Hence the two possible NP structures, stemming from the
categorial ambiguity of enough, interact with a well-motivated constraint on transfor¬
mations to produce the variation between enough pudding and pudding enough. Thus
Enough-Shift can be made obligatory in all contexts, an improvement on Bresnan’s rule
(6.32), which singles out NP for special treatment.
Now consider what happens in A'". Under the present analysis of A'", tall enough
has the underlying structure (6.37).
(6.37) A'"
tall
6: SPECIFIERS OF X
In (6.37), enough obligatorily permutes around tall, just as in pudding enough and far
enough.
But observe how enough behaves with the exceptional adjectives alike and
different. Superficially it looks as if enough permutes around them optionally rather
than obligatorily, since we have both alike enough and enough alike. But there is a
more interesting account: since alike and different, unlike most adjectives, allow a Q"\
they can occur in two distinct underlying forms with enough:
A"
enough A'
alike I | alike j
different J [different/
far
• little ’
much
In (6.38a), enough permutes around the head to give alike enough and different enough,
parallel to (6.37). In (6.38b), enough permutes around the Qs far and little to give little
j
alike f alike 1
enough [different }' fQr enoug^ [different J muc^ *n (6.38b), Many-Much
Deletion takes place, and because of the strict cycle, enough does not permute around
the adjective; the forms derived are thus enough alike and enough different. This is just
like what happens in the nominal paradigm (6.35)—(6.36). What at the surface looks like
an exceptional optionality in Enough-Shift with these adjectives turns out to be a
consequence of the fact that they allow a QP in their specifier.
Thus the analysis of enough as a Deg confirms many details of the specifier system
argued for in previous sections. It makes possible the use of enough with the
quantifiers little, few, and far4, it makes possible the generalization that degree words,
156 RAY JACKENDOFF
but not quantifiers, govern complements; and, most striking, it makes the apparent
optionality of Enough-Shift in certain cases emerge as an automatic result of the base
rules and the principle of the strict cycle. Furthermore, the exceptionality of alike and
different, rather than being spread out over two transformations, Mwc/i-Deletion and
Enough-Shift, is localized in a single phrase structure difference: unlike other adjec¬
tives, they allow a QP.
To recapitulate the argument of this section, we have claimed that the category Deg
includes the words as, so, too, how, more, less, most, and least, plus the bicategorials
this, that, and enough, which function also as articles.
The constituent Deg'" is generated as an X'" specifier in A'", Adv'", and Q'",
partially generalizing with the Art'" position in the N'" specifier.
/ >
as many some
1 so
> <
the V
how which f
\ this ,this .
The relevant phrase structure rules are these:
in
( + Subj)
+ Subj \
-Obj
(6.40) ,+ Comp/ ) - X''
-Comp
-Obj
_ +Det
-Det
The primary argument for the feature relationship of Deg and Art and for the X'"
attachment of Deg'" has been the categorial ambiguity of this, that, and enough. This is
admittedly not overwhelming evidence, but in the absence of any other basis for the
decisions, we have followed the preferences dictated by the X Convention.
The irregularities of the degree system have been accounted for with various local
rules. More and most trigger spelling rules which attach them to quantifiers and certain
adjectives. Combinations of less and least with quantifiers (other than the combination
less far) are ungrammatical and must be ruled out by some as yet unprincipled filter.
Enough triggers two local rules, £>zowg/i-Shift and Many-Much Deletion.
Having determined the occurrence of Deg'", we will now explore its internal
structure.
Quantifier phrases with certain Degs can be further modified by measure phrases and
quantifier phrases.
many
(6.41) a. five times much
far j
(many enough
much => five times 'enough
[far far enough
five times
many
five miles
little
many
c. too much
much
few
far
far
a bit
The presence of the measure phrases and quantifier phrases depends on the
presence of the degree words, as can be seen by eliminating the degree words in (6.41).
Furthermore, which measure phrases and quantifier phrases are possible depends on
the choice of degree word, as can be seen by substituting other degree words into
(6.41). This suggests that the measure phrases and quantifier phrases are in the
specifiers of the degree words. Bowers (1968b) and Selkirk (1970) therefore argue that
this recursion is through the Deg'", as in (6.42).
(6.42) Q'"
many
In (6.42) I have attached the new specifier to the Deg" rather than to Deg' or Deg'".
This choice enables us to generate these specifiers with the phrase structure rule (6.43),
which generalizes with rules (6.15)—(6.19) of section 6.3. Furthermore, the definition
(6.13) of the grammatical relation measure phrase can apply with complete generality.
These generalizations argue that Deg is a full-fledged participant in the X system, as
predicted by the Uniform Three-Level Hypothesis of phrase structure.
+ Subj '»
(6.43) Deg" -Obj ) - Deg'
Det _
Deg' Q
Deg many
as
160 RAY JACKENDOFF
five N
times
These structures are generated by the recursion of the two basic phrase structure rules
developed in this chapter, the one generating a Deg'" in various X'" specifiers, and the
other generating NP or QP in a wide range of X" specifiers.
6: SPECIFIERS OF X
Bresnan’s analysis of these constructions is different. She claims that the recursion
of measure phrase and QP goes (in our terms) through the Q"\ Thus, instead of (6.42)
she proposes the structure (6.45).
(6.45) Q'
many
However, if we adopt the structure of A'" argued for in sections 6.3 and 6.4.2, the
X Convention compels us to adopt (6.42): since most adjectives do not subcategorize a
quantifier, the only structure available for much too tall is (6.46), parallel to (6.42). No
structure can be constructed for much too tall that parallels (6.45).
(6.46) A'
tall
not. She then claims that there is a transformation that moves too many, hence it is a
constituent, and (6.45) must be correct. The paradigm in question is (6.47).
(6.47a,c) contain recursive degree phrases like (6.41), i.e. either structure (6.42) or
(6.45). Bresnan claims that (6.47b,d) are derived from (6.47a,c) by an optional
reordering transformation which moves too many to the right of marbles, a rule called
QP-Shift. Since only under hypothesis (6.45) is too many a constituent in (6.47a,c), the
requirement that only constituents be moved favors (6.45) over (6.42).
I claim, however, that there is no such transformation as QP-Shift, but rather that
(6.47b,d) are base-generated forms with the following structure:
(6.48) N'"
N"
Q'" N'
Deg Q PRO
marbles
6: SPECIFIERS OF X'"
J six I
According to this claim, i f marbles in (6.47b,d) is functioning as a measure
las many J
phrase in Deg"', just as Jive times is in (6.42). By contrast, the QP-Shift theory of
(6.47b,d) claims that marbles is the head and does not form a constituent with
fas many J
].
To differentiate between the theories, observe first that the string six marbles too
many is paralleled by five days too long (after the accident), five miles too far (down
the road), and five pounds too much, in which measure phrases precede degree
modifiers. The only question in establishing the parallelism is the unusual use of
marbles as a unit of measure. But it is not implausible to say of a tray for marbles or an
array of marbles that it is six marbles wide; marbles is a reasonable unit of measure if
one happens to be counting marbles. Such is the case in (6.48): presumably the
interpretation of PRO must be fixed as marbles. The PRO posited as head in (6.48) can
furthermore be replaced by a lexical item in certain (albeit limited) cases where the unit
of measure differs from what is being measured, as in (6.49).
Thus the relevant features of the structure (6.48) are independently motivated, and the
appropriate interpretation is assigned to it by regular rules. Hence QP-Shift appears
unnecessary in order to generate the paradigm (6.47).
Furthermore, the data which motivate constraints on QP-Shift can be explained
immediately by structure (6.48). Bresnan claims that QP-Shift must follow 0/-Insertion
in order to prevent (6.50) (Bresnan’s (136)—(137)).
The real constraint seems to be that the noun cannot be definite. But this is a general
constraint on measure phrases:
’six 1
Hence the constraint follows from the claim that marbles is a measure
as many J
phrase in (6.47b,d).
Second, Bresnan claims that QP-Shift must be restricted to ‘‘count” Qs:
But this restriction too is not quite accurate, as much appears after sugar in (6.5Id).
Again, the real restriction is a general one on measure phrases, namely that they must
be count. Since by the present hypothesis much bread would have to be a measure
phrase in (6.53b), it is ungrammatical. But in (6.5 Id) five pounds of sugar is count, so
the construction is acceptable.
Thus the QP-Shift theory of (6.47b,d) requires special constraints on an extra
transformation. The base theory uses an independently motivated base construction to
describe (6.47b,d); the constraints on the paradigm follow automatically from independ¬
ent properties of measure phrases.
Returning to the issue of (6.42) vs. (6.45), we see that Bresnan has not made a case
for the constituency of j many in (6.41); hence we are free to choose (6.42) rather
than (6.45) as the proper structure. (6.46) is therefore a viable structure for much too
tall, precisely parallel to (6.42), and it is in turn possible for us to continue to maintain
that most adjectives cannot take quantifiers.
Having defended the recursion of measure phrase and QP through the Deg", we
have completed our analysis of the degree system. Though we have not dealt with the
complete range of data discussed by Bresnan, in particular the relation of so and such,
our analysis here has been successful in dealing with the major points of Bresnan’s
theory, eliminating such irregularities as the nonoccurrence of less before most
adjectives, the optional application of A/wc/z-Deletion and Enough-Shift under certain
conditions, and the suspect transformation QP-Shift. The reader will have noticed that
at each step of the analysis, the issue of whether most adjectives take quantifier
phrases has played a crucial role, for it led to differences in the treatment of the
exceptional adjectives alike and different, the position of degree phrases in A'", the
categorial status of more, less, and enough, and finally the recursion of degree
phrases.5
5 Lest the part of Bresnan’s theory we have not dealt with should be thought unassailable, it is worth
mentioning that there are problems with her treatment of the such/so alternation, for which I have been
unable to find a more satisfactory solution. The crux of the issue is her transformation AP-Shift, which
transforms taller a man (parallel in the base to as tall a man) into a taller man, and so tall a man into such a
tall man. She claims (p. 308) that AP-Shift is obligatory when the NP lacks a determiner, so that underlying
*so tall 4> men becomes such tall men. However, this alternation is not general, since none of the other
constructions of the form Deg A a N permit a plural alternant:
a man
(i) men
6: SPECIFIERS OF X
The analysis here has been carried out within the strong hypothesis of phrase
structure proposed in chapter 3. We have shown that the entire range of specifier
possibilities we have discussed is the product of two major phrase structure expansions
which apply to a large number of categories. In arriving at these rules, we made the
fullest possible use of the X Convention to make up for the scarcity of data within any
single category, under the assumption that cross-category generalization is an impor¬
tant part of the evaluation measure in syntactic theory.
As a final note, we should briefly mention a sizable group of specifiers we have not
touched on, which have a wide range of cross-category application, such as not,
almost, just, quite, nearly, practically, hardly, scarcely, and others. Some of these are
patently adverbs, and in NPs are replaced by their adjectival parallels; but the rest are
of some as yet undetermined category, perhaps Deg. Almost is a good example of
cross-category applicability—it can modify even the articles every and all, though up to
now we have had no need for internal structure in Art'".
(6.54) a. John almost fell. (in V'")
b. The pudding has become almost glue. (in N'")
c. The men look almost alike. (in A'")
d. We pushed John almost out the window. (in P'")
e. Almost every groundhog saw his shadow. (in Art'")
f. John left almost happily. (in Adv'")
g. You talk almost too much. (in Deg'")
Only the categories Q'", M'", and Prt'" are missing. Such behavior is not atypical
among this class of modifiers. Study of this class within the lexicalist framework would
no doubt yield a rich source of evidence for feature analyses and phrase structure rules.
X'" rules
This suggests that such tall men may not be a transform of underlying [so tall] men, but has some other
source, parallel to what tall men.
On the other hand, enough, like the comparative, does not share this restriction to singular heads, since
tall enough men is grammatical. Thus AP-Shift must be obligatory for comparatives (*taller a man) and
optional for A enough a man, and there must be a constraint ruling out the plural cases in (i). In all, the
system is highly idiosyncratic.
There is one semantic factor worth mentioning. Many of the constructions which require this indefinite
article—(i) plus more of a man, so much of a man, and perhaps others—can be used only predicatively. A
similar restriction appeared on NPs with measure phrase specifiers such as two parts water. One might
speculate whether the analysis of (i) involves some bizarre expansion of the measure phrase in the N"
specifier. On the other hand, hardly a man and scarcely a man can be used as subjects. Since they count
men, and the phrases in (i) do not, they are semantically more like QP in the N" specifier.
166 RAY JACKENDOFF
The categories expanded here are the class [-Obj, — Det]"'. The generalization is
evident. As in the rules collected in section 4.5, NP is different by one feature. In that
case many categories expanded X" with a prehead Adv'", but N'" used an A'", a
difference in the feature Comp. Here the difference between Deg and Art is in the
feature Subj. The same problems of felicitous collapsing arise here as did in that case.
X" rules
+ Subj
(6.16) V" -> V' - ( -Obj
Det
+ Subj
-Obj
(6.15) N" -> ( - ((of))) - (A'")* - N' - . .
— Det
_( + Comp)_
+ Subj
(6.19) A" —> ( -Obj )-A'
-Det
+ Subj
(6.17) P"-> ( -Obj )- P'
-Det
+ Subj
(6.18) P" -> P' - ( -Obj )
-Det
+ Subj
(6.43) Deg" —» ( -Obj ) - Deg'
_—Det
To this we can add (6.55), since much differently is an Adv'", and none of the adjectives
that allow measure phrases have corresponding adverbs.
Thus the rule creating this X" specifier expansion applies to all [ + Comp] categories and
all [-Subj, -Obj] categories.
One serious problem in collapsing these rules, of course, is that of the order of the
N"7Q " term in V" and P". We discussed this in section 4.5, and I have nothing to add
here.6
6 We have not, however, touched on the problem of combining these X" rules with the rules of chapter
4. The major difficulty is with the prehead Adv’" in A", P", and Adv". Consider A", for example. (6.19) must
6: SPECIFIERS OF X
— tzh'
OBLIGATORY
- f
The results of this chapter should be by now obvious enough that I need make no
further polemic about how the specifier system demonstrates the virtues of the X
Convention and the proposed set of syntactic distinctive features. It is worth pointing
out again, though, that the inclusion of Q'" and Deg'' in the generalized phrase structure
rules is a strong argument that the X Convention and the Three-Level Hypothesis apply
not only to the major categories N, V, A, and P, but to minor categories as well.
Finally, for convenience, here is a summary of the major differences between our
theory and Bresnan’s.
Bresnan X Syntax
1. Adjectives subcategorize quantifiers Most adjectives do not subcategorize
quantifiers
2. A few adjectives such as alike and A few adjectives such as alike and
different govern optional rather than different subcategorize quantifiers
obligatory Much-Deletion
3. Enough is a Q Enough is a Deg and an Art
4. Much deletes before adjectives Much and many delete after enough
the feature configuration that picks out these three terms is the improbable [-Obj, —Det, }]■
Consideration of the N" rule suggests that this is the wrong way to collapse (6.19) and (4.86) anyhow.
For in N" the parallel N"7Q"' and A'" terms are separate for good syntactic reasons, for example, many tail
men. This suggests that for the sake of generality the A" rule must be (i), in principle allowing the N"7Q"' and
the Adv'" to cooccur.
+ Subj
(i) A" —» ( -Obj ) - (Adv"') - A'
-Det
But since both modifiers have the semantic effect of designating a degree, there are semantic reasons for their
noncooccurrence. We may thus invoke a principle similar to the Specifier Constraint of section 5.2 in
justifying (i). Similar arguments apply in dealing with the rules for P" and Adv".
168 RAY JACKENDOFF
5. -er is the comparative Deg More and less are comparative Degs
6. Less far cannot be generated Less much, etc., must be ruled out
7. Adjectives are compared by means of Adjectives are compared by means of
a Q'" a Deg'"
8. Degree phrase recursion is through Q'" Degree phrase recursion is through
Deg'"
I would like to emphasize again here my debt to Bresnan’s work. Despite the
substantive difference between our analyses, the spirit of the work is in total
agreement. Though I feel the analysis here is superior in several respects, it could not
have been begun were it not for Bresnan’s insight in organizing the mass of data in
what I feel is so close to final form.
7: Relative Clauses
7.1. Introduction
This chapter is concerned with where relative clauses are attached in deep structure. It
is concerned only secondarily with the internal structure of relative clauses and with
the processes of relative pronoun formation and wh-preposing, since these have little
bearing on the form of the base.
Chapter 4 claimed that restrictive relative clauses are daughters of N" and that
appositives are daughters of N'", as in (7.1).
restrictive apposidve
Though this claim fit in nicely with the general framework proposed in chapter 4, we
postponed defending it in any detail until the present chapter, where we will show how
it accounts for many well-known facts about relatives and compare it with other widely
accepted analyses of relative clauses. We will show that our theory, the “NP-
complement” theory, accommodates the data used as evidence for other theories at
least as well as the other theories themselves do.
The major theories with which the NP-complement theory is in contention are the
“determiner” theory, illustrated in (7.2), and the “Chomsky-adjoined” theory, illus¬
trated in (7.3).
170 RAY JACKENDOFF
The determiner theory, advocated by Smith (1964) and Stockwell, Schachter, and
Partee (1973), among others, is consistent with the constraints on phrase structure rules
adopted in chapter 3. But since this theory requires an obligatory extraposition, the
NP-complement theory is prima facie less complex. Hence any evidence that the NP-
complement theory is equally adequate must be taken very seriously.
The popular Chomsky-adjoined theory, advocated by Ross (1967), is appealing
because of the simplicity of stating the antecedent of the relative pronoun: it is just the
entire lower NP. However, (7.3) is not a possible structure within the phrase structure
rule schema of chapter 3, which requires every X'to immediately dominate an X1-1. If
at all possible, we would like not to have to weaken the schema for the sake of relative
clauses.
We will not consider the hypothesis in which restrictive relative clauses originate
as sentences conjoined to some clause dominating the NP, as advocated by Thompson
(1971), for example. Section 7.9 will advance arguments against Ross’s (1967) well-
known theory that appositives are so derived, and all these arguments apply with equal
or greater force to restrictive relatives.
The next two sections deal with the relatively simple problems presented by
7: RELATIVE CLAUSES
appositives, after which five sections are devoted to the much more complex issues
involved in restrictives. In many cases, we will show that the theory of complements
laid out in chapter 4 uncovers otherwise unexpected generalizations which competing
theories cannot express—in other words, that what the X Convention leads us to
believe is the simplest solution is in fact simplest.
7.2. Appositives
To start with, an adequate theory of appositives must account for numerous differences
between restrictive relatives and appositives. Appositives must appear to the right of
restrictives:
(7.4) a. The man that came to dinner, who was drunk, fainted,
b. *The man, who was drunk(,) that came to dinner fainted.
An NP may contain two or more concatenated (“stacked”) restrictive relatives; but an
NP may take multiple appositives only if they are conjoined:
(7.5) a. the man who came to dinner who hated lox
b. The man, who came to dinner, who hated lox
c. the man, who came to dinner and who hated lox
X"'
As pointed out in chapter 4, appositives can occur in any whereas
_+Comp_ ’
restrictives can occur only in NP:
which I
(7.8) a. Relative clause formation is obligatory in NPs,
That J aC'
counts for the difference in surface shape, (in V"')
not
b. That Sheila was beautiful,
That i she was, was
1 (7.8b) is taken from Ross (1969), where it is used as an argument that APs must be dominated by NP.
The X Convention, of course, allows a different account, namely cross-category generalization.
172 RAY JACKENDOFF
of course'
probably
(7.9) This ice cream tastes like spaghetti.
I bet
no doubt
Since the two kinds of relatives arise from different base positions, the difference
illustrated in (7.5) can be a consequence of the base rules. We will argue in section 7.6
that (7.5a) is generated by the phrase structure rule (7.10), which permits multiple
clauses.
Note, by the way, that appositives do not concatenate in S, AP, or PP either; again
only conjunction is possible:
(7.12) a. Relative clause formation is obligatory in NPs, which accounts for the
f *</> ^
difference in surface shape, which only your theory predicts.
_ and _
*<t>
b. That Sheila was beautiful, which she was. which
and
she hadn’t been two years previously, was not realized until later,
c. Solving this problem will take from now until doomsday, which is longer
r*4> i
than most problems take. which is more time than we’ve got.
and
mentioned in section 4.1: restrictives, like other X" complements, can be focused and
negated, but appositives, like other X'" complements, cannot:
(7.13) a. We didn’t talk to the man who married SUSAN, (we talked to the man
who married JAYNE)
b. *We didn’t talk to the man, who married SUSAN.
The second kind of projection rule deals with the relationship between the relative
pronoun and the head. The rules for interpreting the relative pronoun in appositives
(section 7.3) and restrictives (section 7.8) will account for the differences observed in
(7.7) and (7.8), as well as a number of other differences to be pointed out in later
sections. Section 7.9 will show that a competing hypothesis of appositives, deriving
them from underlying conjunction, encounters serious difficulties.
Ai
man
(wh)(the)
N from N V"
Siberia
man wh- . . .
174 RAY JACKENDOFF
(7.15) a. who
N'"
wh [PERSON] ([PLURAL])
b. which
N
wh [INANIMATE] ([PLURAL])
c. where
p"
P N""
wh [PLACE] ([PLURAL])
7: RELATIVE CLAUSES
The bracketed expressions in capitals represent semantic markers of the usual sort. The
wh is a placeholder which triggers rules of semantic interpretation.
The simplest possible rule of anaphora is one which marks the relative pronoun an¬
aphoric to the entire dominating NP. The infinite regress of (7.14) does not arise, since
such a semantic rule does not provide the relative pronoun with internal syntactic
structure. Rather, two NPs are simply designated as coreferential, and one happens to
dominate the other. Such a rule seems appropriate for appositive relative pronouns,
though not for restrictive relative pronouns, for which we will propose a different sort
of rule in section 7.8.
The appositive rule can be stated like this:
(7.16) Appositive Wh-Interpretation
X
is anaphoric to Y"', in the configuration
_ + wh_
[Y.Y"[g[comp. • - X'" . . . ] S]]
(7.16) has been stated in its most general form, so as to account for appositives in V'",
A"', and P'" as well as in N'". The proper notion of anaphora for N'" appositives is
obviously coreference; but in other Y'" the proper interpretation of the notion
“anaphoric” is less clear. Whatever that notion may be, it is in any event independ¬
ently motivated, since the demonstratives it and/or that exhibit exactly the same kind
of anaphora as which. Compare the following pairs:
Ross (1967) cites this construction as evidence for the conjunction source, since the
putative conjoined source of (7.18b) is also ungrammatical.
Any
| man is insane, and he drives a Cadillac.
Every
The present theory provides an explanation as well. Observe that everyone and no
one can be coreferential with a pronoun which is in a restrictive relative clause or in
other subordinate clauses within the scope of the quantifier (understood anaphora
indicated by italics):
(7.20) a. Everyone there had a wife who loved him.
b. No one wanted Sue to waken him.
c. Everyone got from Boston to a place he had been to before.
But such anaphora is impossible if the him is within an appositive.
(7.21) a. *Everyone there had a wife, who loved him.
b. *No one wanted Sue to leave, which suited him.
c. *Everyone got from Boston to Cincinnati, which seemed to him like a
long way.
The ungrammaticality of (7.21a,b,c) follows from the more general fact that appositives
are immune to the scope of quantifiers and negation. (7.22) confirms this fact by
demonstrating that a negation outside an appositive cannot condition some-any type
alternations within the appositive, though any can appear in restrictive modifiers in
parallel positions.
(7.22) a. I didn’t see a man who had had any drinks.
some
I didn’t see Bill, who had had drinks.
"any
b. That Sheila isn’t nervous to any appreciable extent surprises me.
sometimes
That Sheila isn’t nervous, which she is, surprises me.
"ever
c. We couldn’t get from Boston to a place any of us had been.
some
We couldn’t get from Boston to Cincinnati, which seemed to
'any
of us like a short way.
Since two NPs can be coreferential only if they are subject to the same logical
operators, and since only the pronouns in (7.20) and not those in (7.21) can be
quantified by the subject, coreference is possible only in (7.20).2 But the anaphora
established between an appositive relative pronoun and its head by (7.16) is claimed to
be of the same nature as anaphora with ordinary pronouns. Therefore, in (7.18b), the
relative pronoun, which is within the appositive and hence obligatorily unquantified, is
marked anaphoric to a quantified head. Such a situation is semantically anomalous, just
as in (7.21); hence (7.18b) is ungrammatical.
2 Cf. the Modal Coreference Condition of Jackendoff (1972, chapter 7), for elaboration of this point.
7: RELATIVE CLAUSES
These two sections have shown that the NP-complement theory of relatives
provides a principled distinction between restrictive and appositive relatives, and that
an interpretive theory of the appositive relative pronoun is not only statable but also
revealing. We now turn in some detail to restrictive relatives.
This section and the next three will deal with four different kinds of data which provide
apparent counterevidence to the claim that restrictive relatives are attached to N" in
underlying structure.
The first case concerns interaction between restrictive relative clauses and the
determiners of their heads. These interactions have often been taken as evidence that
the underlying position of restrictives is in the determiner (see (7.2)), since in such a
theory the necessary constraints can be stated as constraints on the form of a single
constituent; in the NP-complement theory, on the other hand, they must be stated in
some sort of discontinuous fashion. This allegedly favors the determiner theory. We
will show, however, that the determiner theory must be stretched to absurdity and the
discontinuous environments are independently necessary.3
Let us start with some cases in which the - X - Rel can appear but the - X alone is
ungrammatical.
(7.23) a. *the Paris
the Paris that I love
b. *the manner/way
the manner/way in which he did it
c. *the four of the boys
the four of the boys that came to dinner
There are also some cases (pointed out by Perlmutter (1970)) in which the proper article
is dependent on the content of the relative clause:
31 am particularly indebted in this section to the arguments of Reed (1974) and Vergnaud (1974).
178 RAY JACKENDOFF
(7.25) N N'"
4>
The deep structure in (7.25) of course entails adding an otherwise unmotivated option
in the base rule for the determiner. The NP-complement theory, which countenances a
discontinuous constraint anyway, can generate the PP in its surface position and simply
generalize the constraint to a further class of modifiers.
Exploring further, we discover phrases like the old Paris and the unknown Paris,
with prenominal adjectives satisfying the determiner constraint. Since the former of
these means ‘the part of Paris which is old’ or ‘the Paris of the old days’, neither of its
readings can be derived simply from the phrase the Paris that j old by relative
old
7: RELATIVE CLAUSES
There is of course no need for an extraposition rule this time, but the proposed
structure looks even more suspicious than the one in (7.25).
Actually, a different generalization emerges from these examples: a certain class of
restrictive modifiers (of any syntactic category) permits the use of the definite article
with proper nouns. APs, PPs, and restrictive relative clauses can all be interpreted as
restrictive modifiers, by whatever projection rules happen to apply to them. It follows
then that there is no privilege in being attached to the determiner; hence (7.23a) is not
evidence for the determiner theory of restrictive relatives.
Either of the theories, of course, can provide projection rules that interpret APs,
PPs, and restrictive relatives all as restrictive modifiers. However, only the NP-
complement theory permits the generalization, pointed out in chapter 4, that all
restrictive modifiers are daughters of N". This permits a single projection rule with a
single generalized structural description to apply to all three categories. Such a
generalization is impossible within the determiner analysis, since relative clauses
originate in an entirely different position than prenominal adjectives or postnominal PPs
do.4
As further evidence that relative clauses and PPs can perform the same semantic
function, consider pairs such as these, pointed out by Vergnaud (1974):
of his arrival
c. the manner
in which he arrived
Since in each case the PP and the relative are nearly synonymous, a grammar in which
they are related to the NP by the same projection rule is to be preferred to one in which
they are not.5
Summing up this argument, the supposed advantage of the determiner theory can
be maintained only at the expense of positing deep structures like (7.25) and (7.26). The
NP-complement theory permits a different sort of generalization, one that positions
restrictive relatives so that the projection rule for restrictive modification can be of the
simplest form.
A parallel argument can be constructed for (7.23b).
4 The generalization is likewise impossible in Ross’s theory of restrictives, though Ross allows such a
variety of Chomsky-adjoined constituents and unrestricted transformations that it is difficult to tell what
claims he could make here.
5 Does the fact that these phrases cannot be one-pronominalized mean that both the PP and the relative
are under N' here?
of his departure of her arrival
(i) *the time and the one
at which he left at which she arrived }
180 RAY JACKENDOFF
Here the head nouns cannot occur at all without the relative clause. The alleged
advantage of the determiner analysis is that the constraint on the occurrence of these
nouns is confined to a single constituent, the determiner. But again, the existence of
PPs and adjectives that suit the same purpose calls into question the generality of such
a solution:
The same argument applies: the generalization concerns the need for a restrictive
modifier, of whatever category (or a possessive this time, as in his manner). Thus the
only possible syntactic consideration is the generality of the configurations that can be
interpreted as restrictive modifiers, and the advantage of the determiner analysis
vanishes.
Similarly with (7.23c), where the is made acceptable not only by a relative clause,
but by an appropriate PP or AP, as in (7.29).
usual warmth
b. He greeted me with
unusual warmth
7: RELATIVE CLAUSES
Notice that at least some of these examples cannot be derived by reduction of relative
clauses: * the four of the boys that was/were usual; *the four of the boys that were with
green eyes; *the warmth that was of an old friend. So again the constraints concern all
restrictive modifiers, not just relative clauses.
Next consider the inability of proper nouns to take restrictive relative clauses:
The determiner analysis claims that this constraint is related to the inability of proper
nouns to take determiners. But in fact, no restrictive modifiers at all are possible with
proper nouns, so the generalization of the NP-complement theory accounts for this
constraint adequately.
One might claim that a phrase like poor John is a counterexample to the
generalization of the NP-complement theory. But poor here is not a restrictive
modifier; it does not constrain the choice of possible referents of the phrase. Rather,
something more akin to the projection rule for appositives must integrate poor into the
interpretation of poor John. I leave open whether this entails a different structural
position for such an adjective.
There remains one sort of argument for the determiner analysis, the inability of
restrictive relatives to cooccur with a genitive NP in the determiner:
Here other restrictive modifiers are possible (e.g. his old book, his book with a faded
cover), so the previous argument for the NP-complement theory does not apply. The
determiner hypothesis predicts this ungrammaticality by claiming that when the
determiner is filled by NP the expansion Art - S is unavailable.6 Section 7.8 will show
how this constraint can be explained as a consequence of the relative pronoun’s
interpretation, making the determiner theory’s explanation unnecessary.
This section has thus shown that the motivations for the determiner analysis,
namely the dependencies between the relative clause, the article, and the head, are
(with one exception) consequences of more general constraints on all restrictive
modifiers. Since restrictive relatives must be interpreted as restrictive modifiers no
matter what syntactic analysis is assigned to them, there is in fact little ground for
6 Many proponents of the determiner analysis, however, simply require the extraposition of the
possessive to the form the book of his that you stole, a much more mechanical explanation, and one available
to all three analyses of relatives. But a syntactic relation between the determiner genitive and this postposed
genitive is very difficult to support, as argued in Jackendoff (1969b). For a different analysis of the postposed
genitive, see section 5.3.4.
182 RAY JACKENDOFF
Another problem in the analysis of restrictive relatives (to my knowledge, first pointed
out by Dean (1966)) concerns the interpretation of relative clauses in NPs with
quantifiers.
In (7.33a) the antecedent of the relative pronoun appears to be men or many men; in
(7.33b) the men; in (7.33c), the many. The problem is to specify quantifier and relative
clause structures which interact properly to create these interpretations.
I shall not be concerned here with describing various attempts to account for these
facts within other frameworks, for most writers seem to bog down hopelessly on this
issue, mainly in accounting for the partitive structures. Rather I will be content to show
that the analysis of partitives in section 5.3 interacts with the NP-complement theory to
provide an adequate account of (7.33).
Here are the possible structures. When there is no partitive N' complement, as in
(7.34), the relative clause can be attached only in one position; when there is a
partitive, as in (7.35), the relative can be attached to either the higher or the lower NP.
men
7: RELATIVE CLAUSES
N"
In (7.34) the relative clause modifies many men-, there is no way to separate its
modification into that of many vs. that of men. Such a prediction seems correct. Many
of the men who like knitting is, however, ambiguous. Most typically it seems to have
the structure (7.35a), with the relative attached to the men. But a phrase such as many
of them who like knitting, in which the relative clause cannot be attached to the
complement (*them who like knitting), shows that structure (7.35b) is possible too.
Further, if evidence from number agreement is at all reliable, (7.36a) must have
structure (7.35a), and (7.36b) must have structure (7.35b).
I one I
(7.36) a. of the men who like knitting
[each J
I one I
b. of the men who likes knitting
[each J
Finally, of the pair (7.35c,d), only the latter is a correct structure for the many of
the men who like knitting, since, as was shown in section 5.3, there is a constraint
requiring a relative clause attached to the upper NP in this construction. This can be
shown either by preposing, as in (7.37), or by number agreement in the relative clause,
as in (7.38).
(7.37) a. Of the men, the many who like knitting are the happiest,
b. *Of the men who like knitting, the many are the happiest.
likes
(7.38) the (only) one of the men who knitting
*like
Of course, if a second relative clause is added, it may modify the lower NP. Compare
the preposing and number agreement in (7.39) to (7.37)—(7.38) to see that one restrictive
in (7.39) is attached to the upper NP, and one to the lower.
7: RELATIVE CLAUSES
(7.39) a. Of the men who like bowling, the many who like knitting are the
happiest.
b. the (only) one of the men who like bowling who likes knitting
It is commonly claimed that the difference is due to the fact that the second relative
clause modifies or restricts both the head and the first relative clause. Thus the
antecedent of the first relative pronoun in (7.40a) is the men\ of the second, the men
who hated lox. The theory par excellence for expressing this difference is Ross’s
Chomsky-adjoined theory, since (7.40a), for example, can receive the deep structure
(7.41).
Within both the circled and boxed NPs, the subordinate clause can relativize, so each
NP becomes the men who hated lox. Then the boxed NP is identical with the circled
NP and can itself become a relative pronoun to yield the desired surface form (7.40a).
Thus the intuitions about (7.40) are accounted for by “stacking” the relative clauses.
In the NP-complement theory, however, it appears difficult to account for the
186 RAY JACKENDOFF
“stacked” interpretation, since the most plausible structure with two restrictive
relative clauses simply concatenates them:
men
The fact that the relative clauses are not stacked in (7.42) appears to be a serious
drawback. We will show, however, that this is not a drawback but a virtue of the NP-
complement theory.
Consider the kind of context in which the “stacked” interpretation emerges most
clearly:
(7.43) a. The men who hated lox who came to dinner left early, but that wasn’t
I the men who hated lox ! . . ~
true ot |tjlose j who arrived alter 8:00.
b. ??The men who came to dinner who hated lox left early, but that wasn’t
true of the men who arrived after 8:00 who hated lox.
(7.43a) sets up a situation in which the men who hated lox who came to dinner
contrasts with the men who hated lox who arrived after 8:00. According to the rules of
focus and presupposition of Chomsky (1970b), Akmajian (1973), and Jackendoff (1972,
chapter 6), the contrasting constituents must be foci and thus must receive prominent
stress. The contrasting parts of the two subject NPs in (7.43a) are the final relative
clauses, where highest stress normally appears. In (7.43b), though, the contrasting
relative clauses are medial in the NP, and the contrast can be obtained felicitously only
by placing a strong stress on the contrasting relatives and effacing stress on the final
relatives:
(7.44) The men who came to DINNER who hated lox left early, but that wasn't
true of the men who arrived after eight O’CLOCK who hated lox.
This suggests that the phenomenon of “stacking” is not to be accounted for in the
syntax, but rather in the system of presupposition and focus. Two questions are raised
by such a conjecture: first, are the rules of presupposition and focus capable of deriving
interpretations which account for the intuitions of stacking? Second, if stacking has
7: RELATIVE CLAUSES
nothing in particular to do with how relative clauses are attached, can other things
produce stacked interpretations as well?
The rules for focus and presupposition derive (7.45a) as the interpretation of the
first clause of (7.43a), (7.45b) for (7.43b), with normal stress, and (7.45c) for (7.44), with
special stress.7
In (7.43a) and (7.44) the presuppositions can be matched in the two clauses, and the
foci contrast properly. But in (7.43b) the presupposition of the second clause involves
men who arrived after 8:00 and the foci do not contrast, hence the observed infelicity.
Notice also that in (7.45a) the focused relative clause specifies members of the set
of men who hated lox; in (7.45b) the focused relative clause specifies members of the
set of men who came to dinner. These are exactly the properties required to satisfy the
intuition of a “stacked” interpretation, and they are achieved without any syntactic
counterpart of the semantic stacking.
“Stacked” interpretations can also be produced with pairs of PPs in the NP
complement. As with relatives, the stacking can be reversed by changing stress.
(7.46a) is felicitous only with stress on 1875 and 1936\ (7.46b) is felicitous only with
stress on Kansas and Australia. It is clear that the paradigm is parallel to (7.43)—(7.44).
Furthermore, the same pattern appears in the case of a PP followed by a restrictive
relative:
Again, (7.47a) must be read with stress on dinner and lunch, and (7.47b) with stress on
scar and eyes.
7 I am oversimplifying here, assuming only one of a number of possible foci linked to the given stress
pattern, namely the relative clause. The other possibilities are irrelevant to the context of (7.43) and (7.44).
See the references cited above for details of the treatment of focus and presupposition assumed here.
188 RAY JACKENDOFF
Since the syntactic subordination has now been shown unnecessary, stacking is no
longer a ground for preferring the Chomsky-adjoined analysis of restrictive relatives
over the NP-complement theory. The latter assigns the relevant NPs in (7.43)—(7.47)
the following structures:
(7.48) a. N'"
men
b. N'"
This analysis treats (7.43)-(7.47) as altogether comparable, since both PPs and relatives
can be attached to N".
Only one simple change must be made in the phrase structure rule for N" to
account for multiple relatives: the S at the end must be replaced with S*.
(7.50) N'"
(7.47a) will not be a problem, but the reversed stacking in (7.47b) requires the deep
structure (7.51), which appears at the surface only with great awkwardness, unless the
S is extraposed.
81 must emphasize again that PPs like with a scar cannot in general be derived by reducing relative
clauses.
190 RAY JACKENDOFF
Furthermore, the rules dealing with the interpretation of focus and presupposition on
the basis of surface stress are independently necessary, and make it possible to produce
“stacked” interpretations with normal stress patterns and no syntactic subordination of
the “inside” relative clause.
The syntactic stacking of the Chomsky-adjoined theory is thus not only superflu¬
ous but even disadvantageous. Hence the phenomenon of stacking is not only not
counterevidence to the NP-complement theory, but also provides important evidence in
its favor. The generalization underlying this result is the same as in section 7.4:
restrictive relatives work exactly like all other restrictive modifiers.
The fourth and final class of apparent counterexamples to the NP-complement theory
involves data first pointed out (to my knowledge) by Ross; the argument in the form
given here is due to Vergnaud (1974). Consider the NPs in (7.52).
These NPs cannot be derived by conjunction reduction, since the sources would have
to be the ungrammatical *the car that collided and the truck that collided, *the boy who
met in Vienna and the girl who met in Vienna. The verbs of these relative clauses
require plural or conjoined subjects; hence the relative pronoun’s antecedent cannot be
either of the nouns alone.9
Ross’s analysis deals with such examples straightforwardly:10
(7.53)
that collided
9 Dougherty’s (1970-1971) discussion of conjunction is, I believe, sufficient to eliminate such distant
sources as the car that collided with the truck and the truck that collided with the car. A source once
suggested by Ross (in lectures at MIT), the car wh- [car and truck] collided and the truck wh- [car and truck]
collided, is semantically incomprehensible.
10 Vergnaud is actually proposing a slightly more sophisticated theory than Ross’s; however, with
respect to this particular argument the two theories are equivalent. I will therefore couch the argument in
terms of Ross’s theory, which will probably be more familiar to the reader.
7: RELATIVE CLAUSES
In this structure, the relative clause is sister of a constituent that dominates both NPs,
and the antecedent of the relative pronoun can be the conjoined NP. On the other hand,
the NP-complement theory provides no convincing structure, since the Art'" is superior
to the relative clause:
that collided
Since only a theory like Ross’s can come even remotely close to a suitable structure for
(7.52), and it does so with great ease, Vergnaud considers this construction overwhelm¬
ing evidence for such a theory.
However, if Vergnaud’s argument is correct, then PP restrictive modifiers must be
outside the NP too, since they occur in the same sorts of constructions:
These examples can be derived neither by Relative Clause Reduction nor by Conjunc¬
tion Reduction, since both (7.56a) and (7.56b) are ungrammatical.
(7.56) a. *the boy and the girl who were with the same birthday (Relative Clause
Reduction source)
b. *the boy with the same birthday and the girl with the same birthday
(Conjunction Reduction source)
If these PPs are in the NP complement, (7.55) raises the same problem in any theory as
(7.52) does in the NP-complement theory of relatives: where is the PP attached?
192 RAY JACKENDOFF
Lest one should immediately yield to the temptation to attach these PPs outside
the NP, we observe that N' complements create this problem too.
(7.58) three students and two teachers of the same language
three members and two vice-chairmen of interlocking committees
Vergnaud’s argument forces us to conclude that the proper bracketing is [[three
students] of French] and [[three members] of the committee], not the otherwise highly
motivated [three [students of French]] and [three [members of the committee]]. Things
are beginning to look fishy.
To make life even worse, the X Convention as usual leads us to discover similar
examples in sentences:
similar issues
(7.59) a. John avoided and Bill ignored the same man
men with the same birthday
None of these sentences can arise from Conjunction Reduction or Right Node Raising,
since the presumed sources would either have the wrong reading or be ungrammatical:
similar issues
(7.60) a. John avoided • the same man • and Bill ignored
men with the same birthday
similar issues
the same man
men with the same birthday
7: RELATIVE CLAUSES
By Vergnaud’s argument, then, the phrases in braces must be right sisters of conjoined
subject-verb pairs:
(7.61)
But this flies in the face of the presumably well-established fact that the phrases in
braces are direct objects or other V' and V" complements.
It is really unfair to saddle Vergnaud with this argument, since such arguments
based on conjunction have been considered strong evidence for structure throughout
the history of transformational grammar. We have seen, however, that this type of
argument leads ultimately to conclusions which contradict the most fundamental and
best accepted notions of phrase structure.
The question is, which set of arguments should be given up? If we wish to retain
the arguments based on conjunction, we give up all distinctions between various types
of complements, since they must all reduce to the Chomsky-adjoined variety. All the
arguments of chapter 4 distinguishing complement types must go by the wayside and all
evidence for internal structure of NPs and Ss must be accounted for in some other way.
I prefer to believe that the error lies rather in the theory of conjunction, always a
troubled area in transformational grammar. There are to my knowledge no remotely
coherent formalizations of Right Node Raising; Gapping is ever problematical; and
even Dougherty’s (1970-1971) highly restricted version of Conjunction Reduction
comes to grief in examples like these:
matical here:
(7.63) *The same man got drunk and the cops arrested the same man.
*It is rarely easy to please the same man and the same man is rarely eager
to please.
*The same man praised you and it seemed [(for) the same man to hate
you].
If I am correct, and the anomaly lies in the theory of conjunction, we can retain all
the usual arguments for phrase structure; only the arguments from conjunction have to
be rejected. I have no positive proposals to offer on the analysis of (7.52)—(7.62). But
this choice defuses Vergnaud’s otherwise powerful argument for the Chomsky-adjoined
theory of relatives; thus the NP-complement theory of relatives, like the theory that
there are direct objects in sentences, can be maintained. What is really needed is a
thoroughly revised theory of conjunction.
Having dealt with potential difficulties for the NP-complement theory of restrictives in
the areas of determiner restrictions, quantification, stacking, and conjunction, we now
turn to the problem of interpreting the restrictive relative pronoun. As in the case of
appositive relative pronouns, we assume an interpretive theory, in which the relative
pronoun is inserted into underlying structure and \vh-preposed; a semantic rule applies
to the surface structure to establish the interpretation of the pronoun. The problem is to
find out exactly what this interpretive rule does. If it is properly defined, it can account
for otherwise puzzling facts about relative clauses.
Section 7.3 showed that the appropriate interpretive rule for the appositive relative
pronoun was one that established anaphora between the relative pronoun and the head.
A similar rule of anaphora could be developed for restrictive relative pronouns, but
there is reason to consider a different interpretation. To see this, let us look a bit more
closely at the semantics of NPs.
The semantic functions of various parts of an NP down to the N" level are of at
least four types: demonstrative or deictic, classificatory, measuring, and commenting.
The demonstrative function is taken by articles; they show whether the individual in
question is old or new in the discourse, unique, pragmatically designated, generic, and
so forth. Elements with the classificatory function place conditions on what sort of
object the individual in question is; the N' and the restrictive modifiers have this
function. Measuring and commenting functions are taken by numerals (and perhaps N”
quantifiers) and by appositives, respectively.
Traditional logic has not had much to say about the latter two functions, but it
makes a sharp distinction between the former two. All the classificatory functions have
been represented as predicates; those demonstrative functions that have been treated at
all emerge as operators such as quantifiers or the definite description operator. Thus,
7: RELATIVE CLAUSES
for example, (7.64) might be represented as (7.65) in one version of logical notation,
restricted quantification. For illustrative purposes, I will use the rather free translation
characteristic of traditional logic.
In these translations, the article has become an operator; each classificatory constituent
has been supplied with an “x is” and placed in the restriction on the variable controlled
by the operator.
Restrictive relative clauses require a slightly different treatment. In this notation,
the most appropriate translation of The man who Bill saw died is (7.66).
Here the relative clause has become a restriction on the variable. The translation rules
have not added the fixed material “x is”, as in (7.64); rather, the relative pronoun has
itself been replaced by the bound variable x. In other words, ordinary classificatory
elements can be made to restrict the operator by adding fixed external material, but
restrictive relative clauses must be tampered with internally, in the position corre¬
sponding to the relative pronoun. The tampering involves replacing the relative
pronoun by the bound variable controlled by the operator.
I suggest that something parallel to these logical translations is going on in the
natural language semantics of restrictive relative clauses. The relative pronoun is the
syntactic marker of the position which must be bound by the demonstrative operator of
the head. Since the syntactic correlate of the demonstrative is the Article position, the
rule for interpreting the relative pronoun must establish a “binding” relationship
between the relative pronoun and the article:
The exact characterization of the “binding” relationship depends on the ultimate form
of semantic notation chosen; in the rough logical translation rules above, “is bound to”
would mean “is replaced by the variable of’.
The crucial mention of the Art'" in (7.67) may be used to account for one of the
differences between appositives and restrictives. Since N'" is the only category which
dominates Art'", restrictives can be related only to NPs. But since appositive relative
196 RAY JACKENDOFF
pronouns are interpreted as anaphoric to the entire head, the internal structure of the
head is not crucial, and appositives may thus be generalized to all major syntactic
categories.
The dependence on Art'" in (7.67) plays a role in accounting for certain ungram-
maticalities as well. Section 7.4 mentioned one such example:
John's is not an Art'", so the relative pronoun cannot be bound; since the example does
not receive a complete interpretation, it is anomalous. Next consider the ungrammati¬
cally of (7.68a), and compare it to (7.68b,c).
Recall the discussion of section 4.7.4. There it was argued that when the is the only
specifier possible in a derived nominal, it arises not through expansion of Art'", but
through insertion into an empty N'". The destruction of the city is such a case, and the
destruction is not, since we have, for example, *much destruction of the city but much
destruction. Thus in (7.68a), the is dominated by N'" and the structural description of
(7.67) cannot be met, hence the phrase is ungrammatical; in (7.68b), the is dominated
by Art'", so the phrase is grammatical. Since the interpretation of appositive relative
pronouns does not depend on the presence of an Art'", (7.68c) too is grammatical,
though its determiner is dominated by N'".
This analysis of the relative pronoun eliminates the usual puzzles (cf. for example
Kuroda (1969)) about whether relative pronouns are definite or indefinite: they are
neither. In fact, the issue hardly makes sense in terms of the interpretation given by
(7.67) , since the bound variable is not referential at all, but simply has the effect of
converting the proposition represented by the relative clause into a property. So, for
example, a man who you know denotes an individual not completely identifiable from
previous discourse (i.e. indefinite), who is male, adult, and known by the hearer. The
man who came to dinner denotes an individual identifiable by this description plus
previous discourse or general knowledge (i.e. definite), who is male and adult and who
came to dinner. A relative clause may be “definitizing”, i.e. render an otherwise
indefinite NP definite, just in case it provides grounds for unique identification which
were not present in the NP without the relative. Thus there is no transformation of the
form that Perlmutter (1970) and others adopt, changing the article of an NP to the in the
presence of a restrictive relative clause.
In cases where the demonstrative element is interpreted as picking out a sort or
degree, it does so in the relative clause as well. For example, the object of We drank
the wine we always drink denotes not a particular physical body of wine (barring
7: RELATIVE CLAUSES
In the present analysis such a strategem is unnecessary. The generic character of the
head NP is determined by the indefinite article a, and this binds both the head N beaver
and the relative pronoun. Thus the conditional character of the relative clause in (7.69a)
is a result not of some unusual underlying form, but of the binding by (7.67) of the
relative pronoun to a generic determiner.
Similarly for NPs with quantifiers in the Art'" such as every man who owns a
Cadillac. There is no need to resort to an underlying form such as if a man owns a
Cadillac, he . . . in order to capture the semantic connection between the relative
clause and the main clause. Rather, binding takes place in the usual fashion, as shown
in (7.65a,c), and the quantifier automatically affects the relative clause.
A more subtle analysis than we have given here is required to account for the fact
that any can show up in quantified relative constructions such as every man who has
any sense. However, the failure of the present analysis to predict this is not serious,
since these peculiar anys turn up not only in relative clauses but also in restrictive PPs,
for example in every man with any sense. Thus the possibility of any here is not to be
explained in the semantics of restrictive relative clauses, but in a more general account
of restrictive modification, which we have not attempted to provide.
This completes our account of restrictive clauses. We have shown that the NP-
complement theory of restrictive relatives is capable of dealing with a wide range of
apparent counterevidence, and that in fact it permits a more general account of
restrictive modification than either the determiner source or the Chomsky-adjoined
source. Moreover, it requires neither an extraposition transformation, like the determi¬
ner theory, nor a weakening of the fundamental base rule schema, like the Chomsky-
adjoined theory. This last consideration is most important to the general goals of this
study, and it is the reason we have devoted so much space to the problems of relative
clauses.
(7.70) NP
Thus there can be no surface structural significance associated with the intonation
break characteristic of appositives, already a serious problem.
To provide a deep structure difference between restrictives and appositives, Ross
(1967) proposes deriving appositives from underlying conjoined clauses. For example,
(7.71a) is to be derived from something more or less like (7.71b).
Thus appositives have a different underlying source from restrictives, and many of the
differences between them can be captured in terms of the underlying forms or the
derivations of the two kinds of clauses.
Ross himself (section 6.2.4.1) points out a serious difficulty with the conjunction
source: appositives can occur within imperatives and questions, though the sentences
(7.73a,b), putatively more like the underlying form, are ungrammatical.
reservations about lowering appositives into such strong islands as conjoined NPs
(7.74a) and complex NPs (7.74b).
(7.74) a. Haj, who likes beer, and Sandy, who hates it, couldn’t agree on one
restaurant.
b. Paul’s book about the transgalactic lexicon, which is the hottest topic of
the decade, is eagerly awaited.
In defense of the conjunction source, Ross points out a sentence type which he
claims is an intermediate step in the derivation from normal conjunction to appositives.
(7.75) Bill, and I’ve known him for many years, isn’t as smart as he makes
himself out to be.
Here a conjoined sentence has been inserted after the subject, supposedly halfway to
becoming the appositive who I've known for many years. Unfortunately for this
argument, though, the presumed intermediate step ranges from dubiously grammatical
to impossible when substituted into examples such as (7.72) and (7.74):
The transformation producing (7.75), in other words, has all the constraints expected of
a rule combining two conjoined sentences; the rule producing appositives does not.
Hence the construction in (7.75) cannot be an intermediate step on the way to
appositives; in fact its constraints constitute evidence against Ross’s theory of
appositives.
We see then that Ross’s theory still lacks a viable underlying form for appositives.
Of course, one way to avoid the problems of the conjunction source is to generate
appositives in their surface position, as part of NP. But since restrictives already
occupy the highest possible position within NP, there is no room in NP for a higher
base position for appositives. Thus the disposition of appositives in Ross’s theory is
left enigmatic.
8: Degree Clauses
8.1. Introduction
Section 2.2 mentioned a parallelism, pointed out by Bowers (1968b), between NPs with
restrictive relatives and APs with degree clauses associated with the degree words so,
too, enough, as, more, and less, such as those in (8.1).
Bowers was the first (to my knowledge) to express this parallelism in terms of the X
Convention. Selkirk (1970), Milner (1973), and Reed (1974) develop the parallelisms
further. Postal (1974) has an extensive discussion of their semantic similarities, giving a
nonlexicalist formalization.
If this parallelism is genuine, and if restrictive relatives are attached to N" in deep
and surface structure, as argued in chapter 7, the X Convention compels us to adopt a
theory of degree clauses in which they too are attached to X" in deep and surface
structure. The present chapter compares such a view to the prevalent view that degree
clauses are extraposed from degree phrases. It will be shown that the X"-complement
theory is in fact quite plausible, though not yet entirely compelling. Thus the
framework proposed here is at least consistent with the facts of degree clauses, even if
it cannot yet be shown that it predicts them.
8.2. The Extraposition Theory and its Evidence for the X Convention
Degree clauses can appear at the surface in any of the major syntactic categories A'",
N'", V'", and P"\ Bowers points out that they always follow strictly subcategorized
complements in surface structure. Furthermore, since they always precede appositives,
202 RAY JACKENDOFF
the most likely surface position for them is in the X" complement. No rearrangement of
the complements in (8.2)-(8.5) is possible.1
(8.2) In A'":
a. he’s [so [[afraid to lose A«] that he’s freaking out, A»] which you’ll never
be Am]
b. he’s [too [[proud that he’s won A.] to admit it wasn’t such a big
deal A"]A'»]
he’s [too [[sickA,] to admit it wasn’t such a big deal,A„] which you’ll never
beA», ]
fof Bill
c. he’s [[[afraid enough to freak out,A»] which you'll
[that he’ll lose
never beA-»]
d. that she was [ j [[eager to winA,] than Fred was,A„] which she
It is rather obvious that there is a connection between the degree words and the
1 The status of appositives in some of these examples is rather curious. The appositives in (8.3c) and
(8.5c) seem to comment on the quantifier rather than the head, and no other type of appositive seems
possible in N"' or P'" of this sort. I have no explanation.
8: DEGREE CLAUSES
associated degree clauses. If the degree words are omitted in (8.2)—(8.5), the sentences
become ungrammatical unless the clauses in X" are omitted too. Furthermore, each
degree word governs characteristic complementizers:2
(8.6) so . . . that S
too . . . for-to S
as ... as S
This strongly suggests that degree clauses are strictly subcategorized complements of
Deg, despite their surface position. Bowers (1968b) and Selkirk (1970) take this view,
generating deep structures like (8.7) and extraposing the degree clause to its surface
position.
(8.7) a. A
b. A
2 For the complementizer status of as and than, see Bresnan (1970; 1972; 1974).
204 RAY JACKENDOFF
N'"
N"
d. N
(8.8) N'"
Art"' N"
As Bowers and Selkirk point out, the extraposition operations in (8.7a,b) and (8.8) are
structurally parallel; only by use of the X Convention can this parallelism be expressed.
Furthermore, the deep structures are parallel, in that Deg and Art are both X'"
specifiers; in the present theory of syntactic features, they differ by the same feature,
±Subj, as their respective heads, A and N.
(8.7c,d) are more complex, but essentially the same as (8.7a,b), particularly if one
assumes the S to be extraposed in two stages, first to the end of Q"\ then to the end of
N'". Under this assumption, the extraposition transformation used in (8.7a,b) and (8.8)
needs only to be generalized to Q'" via the X Convention; the double extraposition will
be carried out cyclically, and no additional rules are necessary.
Similar double extrapositions take place in (8.9), where the categories dominating
Deg'" are different.
A"
Adv'" A'
as / S Adv' lively
206 RAY JACKENDOFF
D"
N"
8: DEGREE CLAUSES
It is clear that the rule accounting for surface position of degree clauses will have to
make liberal use of the X Convention, whether or not it generalizes with the rules for
restrictive relatives.
In chapter 7 we rejected the view that restrictive relatives are generated under Art'", in
favor of generating them under N". Does this mean that they are in fact not parallel in
deep structure to degree clauses? Or should we seek arguments similar to those of
section 7.4, to show that degree clauses too should be generated in their surface
position under X", restoring the parallelism?
Reed (1974) gives arguments relevant to this question. First, she points out that the
expression the same governs a degree clause with the complementizer as, in which the
permissible deletions are comparable to those in other comparatives.
you
(8.11) a. He looks the same as you do • . (in A'")
I think he used to (look)
you
b. He’s reading the same book as you are (reading) (in N'")
I think you used to
you
c. He spoke the same as ‘ you do (in Adv'")
I think you used to
But in N'" it also occurs with the complementizer that and the deletion pattern
characteristic of restrictive relative clauses:
you’re reading
*you (are)
(8.12) He’s reading the same book that
I think you used to read
?I think you used to
Thus the same must be allowed to govern either a degree clause or a relative clause;
since the latter is permissible only in N'", it will of course not crop up in other
categories. This option is stated most simply if degree clauses and restrictive relatives
are derived from the same underlying position, either in the specifier or in the X"
complement.3
One of the arguments adduced against the determiner theory of restrictives was
that there are PPs that serve the same semantic function as restrictives. Reed observes
that there are similar examples for some degree clauses;
J One possible drawback of this argument is that which cannot be substituted for that in (8.12), as would
be expected in an ordinary relative clause. This may indicate that the same does not govern a genuine
relative clause. On the other hand, it may indicate that we have been mistaken in regarding relative clauses
with that and those with which as deriving from precisely the same source. If there were an underlying
difference, the same could be constrained to govern only the former type.
208 RAY JACKENDOFF
I to wear shorts I
(8.13) a. He’s too fat
[for shorts J
I as you are
c. He’s as sly
las a fox
d. He’s taller
{than you are
than six feet
(Notice that in the latter two, the PP case is unlikely to be a reduction from the
questionable las sly as a fox is, *taller than six feet is.4 Rather, as and than, like for,
must double as prepositions and complementizers.) Thus if degree clauses are underly¬
ing complements of Deg, extraposed to their surface position, these PPs must be too.
Reed next points out that the degree clauses and PPs of (8.13a,b) occur with a
number of adverbial specifiers too, with similar meaning:
I to be a President I
(8.14) a. He’s quite young
[for a President J'
to be in school
b. He’s awfully young
for school
Since at least these particular degree clauses and PPs seem to have an existence
independent of the degree words, Reed argues that they must be generated in deep
structure to the right of the adjective, parallel to restrictive relatives, and that their
4 Bresnan (1973) says the underlying clause in taller than six feet is than six feet = x much, invoking an
underlying verb “ = ” which never appears on the surface and which has a reading suspiciously similar to the
copula.
8: DEGREE CLAUSES
interaction with the degree words is purely semantic. The base rule for A", therefore,
must be of the form (8.16).
(8.16)
We first observe that the X"-complement theory of degree clauses restores the
parallelism between degree clauses and restrictive relatives pointed out by Bowers,
Selkirk, and others. Not only is there a structural parallel, since both types of clauses
are generated as X" complements, but there is a parallelism in their interpretation rules
as well. Recall that the rule for interpreting restrictive relative pronouns developed in
section 7.8 involves establishing a binding relation between the Article and an element
of a clause in the following configuration:
(8.17) N'"
210 RAY JACKENDOFF
Similarly, the simplest cases of degree clauses, such as (8.7a,b), are interpreted by
establishing a semantic relationship between an X'" specifier and part or all of a clause
in the X” complement:
More complex cases have a less direct syntactic relation between the Deg'" and the S;
it is interesting, however, that at least the simplest cases so closely parallel restrictive
relatives.
As some indication that this parallelism is not just an accident, recall the
discussion in section 4.8 of sentences with “subject complements”, for example It
bothers me that Tom isn't here. It was argued that such a complement clause is a V"
complement, interpreted by establishing an anaphoric relation between it and the
dummy subject it. The configuration is precisely parallel to (8.17) and (8.18):
Thus it seems possible to claim that there is a general rule of anaphora between X'"
specifiers and X" complement clauses, with three special cases. If the X'" specifier is an
NP, the X" complement is interpreted as a subject clause; if the X'" specifier is an Art'",
the X" complement is interpreted as a restrictive relative clause; if the X'" specifier is a
Deg'", the X" complement is interpreted as a degree clause. Notice that this generaliza¬
tion is more or less exhaustive, since N'", Art'", and Deg'" are the only major syntactic
categories other than M'" that occur in any X'" specifier.5 Furthermore, both constitu-
5 Since sentence adverbs, parentheticals, etc., are interpreted appositively, they are grouped semanti¬
cally with complements, not with specifiers, even when they precede the head.
8: DEGREE CLAUSES
ents that take N'" in the X'" specifier, namely V'" and N"\ take subject clauses; the
only constituent with an Art'", namely N"', is the only one with restrictive relatives; all
constituents that take Deg'", namely A'", Adv'", and Q'", also allow degree clauses.
Thus an interpretive rule for degree clauses generated in their surface position appears
to fit into a general cross-categorial class of anaphoric rules for which there is some
independent motivation. Furthermore, a grammar incorporating such a rule would
express the observed similarities between restrictive relatives and degree clauses, while
maintaining the NP-complement theory of relatives argued for in chapter 7.
The configuration (8.18) is, however, only the simplest in which degree clauses can be
interpreted. This section will deal with more complex situations.
Besides cases like (8.18) in which the degree clause is bound to the X"' specifier of
the immediately dominating phrase, it can be bound to an X" specifier:
A"
This configuration is also the one that appears with same and different:
212 RAY JACKENDOFF
Also, as seen in section 8.2, the relationship can pass through a (probably
indefinite) number of embedded X'" and X" specifiers:
(8.22) a. too many stories about Bill for us to bear (too is Q'" specifier of N"
specifier)
b. as extraordinarily lively as Bill is (as is Adv'" specifier of A" specifier)
c. sufficiently many men for us to stop counting (sufficiently is Q" specifier
of N" specifier)
d. as much too tall (for the job) as Bill is (as is Q'" specifier of Deg"
specifier of A'" specifier)
e. as much too much bread as I could stand (as is Q'" specifier of Deg"
specifier of Q'" specifier of N" specifier)
The most general condition I have been able to discover for the position of the
specifier is this:
(8.23) The specifier binding a degree clause must be the head of a phrase on the
leftmost extremity of that X'" which most directly dominates the degree
clause.
This condition certainly holds in all the examples given so far, except for those in V'"
(8.4).
(8.4) a. He gave so little to the fund that he looked stingy, which surprised no
one.
b. He didn’t say enough to us about how he did it to justify himself, which
was a shame.
As evidence of the essential correctness of (8.23), consider (8.24). The cannot cooccur
with a degree clause. (8.23) predicts this: since the is the leftmost specifier, the degree
clause is bound to the instead of to the degree word, producing semantic anomaly.
In fact, this statement of the condition permits two different degree words to bind
clauses in the same phrase:
(8.26) too much more bigoted than Vera is for us to like him
A"'
too much
Since the lower Deg'", of which too is the head, and the upper Deg'", of which more is
the head, are both at the leftmost extremity of the A'", both too and more can bind
degree clauses. The fact that the clauses are in the order they are in will be accounted
for in section 8.7.
It thus appears that (8.23) is reasonably close to the correct relationship between
degree clauses and their binding specifiers, though the cases in V'" still remain to be
accounted for.
214 RAY JACKEN DOFF
Once the degree clause is related properly to its associated specifier, the interpretations
of these two elements must be related to the rest of the sentence. In the cases of the
clauses governed by enough, so, and too, this presents no further syntactic problem,
just one of understanding the semantics. Since the semantic problems must be
surmounted no matter where the degree clause is attached in underlying structure, their
solution does not affect the decision between the extraposition and the X"-complement
theories.
In comparative clauses, though, there is a missing element whose reading bears
some relation to the phrase dominating the clause. For example, in John read more
books than Fred read, the object of the comparative clause is missing. Defining the
interpretation of this missing element involves syntactic problems as well as semantic
ones, as it turns out.
Selkirk (1970) and Bresnan (1973) argue that this deleted (or PRO) string begins
with a w/z-like dummy degree word x and is otherwise identical to (or anaphoric to) the
string between the governing degree word and the clause.6 In (8.27) the bracketed string
represents the material missing from the surface.
(8.28) a. A'"
x tall (f>
(or underlying PRO)
In Lees (1961), x is identified with the Deg that, just as wh at the time was identified with the Art the.
8: DEGREE CLAUSES
b. 1ST
N"' S
Under this analysis of the surface structure, the deletion or anaphora can be based on a
relationship between full constituents. In this it is structurally similar to the Chomsky-
adjoined theory of relative clauses, in which the relative pronoun’s antecedent is taken
to be the entire head NP.
But this analysis is also parallel to the Chomsky-adjoined theory in another way: it
violates the theory of phrase structure of chapter 3, having an X'" whose head is an X'"
instead of an X". Besides threatening the general theory of phrase structure, it
contradicts the conclusion of section 8.2 that degree clauses are attached to X". That
conclusion, in turn, was based on the theory that appositives are attached to X'", one of
the strong points of the present theory of phrase structure.
In the X"-complement theory of degree clauses, though, the constituent relation¬
ship of (8.28) is impossible. The upper and lower X'"s cannot be identical, since one
contains an S and the other does not. The most one can say is that a substring of the
upper X'" is identical with the whole lower X'" aside from the binding Deg'".
(8.29) a. A
tall as Bill is A
216 RAY JACKENDOFF
b. N"
N"
Thus the X"-complement theory of degree clauses has the same apparent difficulty of
interpretation as the NP-complement theory of relatives: nonconstituent identity.
The problem is exacerbated by stacked degree clauses such as the ones in
Bresnan’s example (8.30).
(8.30) Mary swam as many more laps than Joan swam as Linda swam.
The degree words and clauses are related from the inside out: the inner degree clause
goes with the second degree word and the outer clause goes with the first degree word.
The outer degree clause, furthermore, measures not simply a number of laps, but a
number of laps more than Joan swam. In Bresnan's outside-X'" theory, a stacked
structure can be constructed which accounts for this, preserving constituent identity in
the anaphoric process:
8: DEGREE CLAUSES
N"
218 RAY JACKENDOFF
In a deletion theory of comparatives this structure would be untenable, since (at least
under the usual assumptions) constituent identity, not string identity, is necessary for
deletion.
Thus, if we are to adopt the X"-complement theory of comparative clauses, in
keeping with the strong version of the X Convention, we are forced to develop an
interpretive theory for the missing element of comparatives, a theory that is based on
identity of strings, not structure. The next section proposes such a theory.
There are three steps involved in interpreting such structures. The first step is to
locate the degree word which binds it, i.e. which satisfies condition (8.23). This part of
the process is common to the deletion and interpretive theories. For concreteness,
suppose that the degree word is more, that the complementizer is than, and that the
clause is dominated by N'", as in (8.33b).
The next step of interpretation is to give content to PRO within the clause. In the
cases we have been considering, PRO must be of the same category as the X'" most
immediately dominating the clause.7 Assuming Bresnan’s account of the semantics,
PRO should be interpreted in the following way. Designate the string between more and
than as Z. Replace the Deg'" dominating more by a Deg'" dominating *, and assign PRO
the interpretation of the string* - Z. This string is equivalent to Bresnan’s underlying
structure.
It must be shown that * - Z has a coherent interpretation. Z contains some string
of specifiers (for example many in (8.33b)), followed by the N' and possibly some
daughters of N" to the right of the head. Since more is the head of a Deg'" at the
leftmost extremity of N'", replacing this Deg'" by * yields a complete specifier for* -
7 We are not considering any cases of what Bresnan (1976a) calls Sub-Deletion, for example (i) (elements
missing in the surface are bracketed).
(i) She is less beautiful than she is [x] funny.
We bought more books than we sold [x many] cars.
He has as many too many books as we have [x many] too few pencils.
Here the deleted elements are identical only to part or all of the specifier of the phrase dominating the degree
clause. These cases do not constitute a potential threat to the X"-complement theory even with deletion,
since the parallel strings are full single constituents, not partial constituents as in (8.29) and (8.32). 1 leave for
future research how Sub-Deletion should be stated interpretively, though 1 assume it will retain many of the
features of Bresnan’s elegant deletion analysis.
8: DEGREE CLAUSES
Z. The N' of course has a coherent interpretation. Any N" complement within Z is
either an ordinary restrictive modifier, which can be integrated into the interpretation
independent of the rest of the contents of the N" complement, or else it is of the type
which must be bound by a specifier. If it is of the latter type, the binding specifier must
also be contained within x - Z for the interpretation of .v - Z to be coherent. But this is
precisely what is found in the case of stacked degree clauses: any degree clause
between a degree-word-degree-clause pair is matched by its own corresponding degree
word. For example, in (8.33c), the first PRO is in a clause governed by more, so x - Z
is x many laps. The second PRO is in a clause governed by as, so x - Z is x many more
many laps than Joan swam PRO, exactly the desired interpretation.
The final step in interpreting the degree clause is to assign an interpretation to x
itself. Presumably this involves logically binding it in some way to the governing degree
words, by a process similar to that for restrictive relative pronouns. But since an
appropriate logical formalization of degree words is not yet extant, this step must be
left for future research. Again, this problem is common to the deletion and interpretive
theories.
Thus, of the three steps involved in an interpretive theory of degree clauses, the
first and last are independently necessary in the deletion theory. The middle step, the
interpretation of PRO, has been shown to provide coherent interpretations of the
desired sort; this step can be carried out with the comparative clause generated as a
daughter of X" instead of as a daughter of a higher X'", despite the nonmatching of
constituent structure. Thus, like relative clauses, comparative clauses need not be
exceptions to the general phrase structure rule schema.
In turn, appositives must be attached higher than degree clauses, as shown in
(8.2)—(8.5). If we had needed to generate comparative clauses at a higher node than X",
this would have forced appositives into a position which was also an exception to the
phrase structure rule schema. Generating comparative clauses in X", however, permits
appositives to remain in X'", in conformity with the arguments of sections 4.1 and 7.2.
8.8. Recapitulation
The argument has gone as follows. First, whatever the specific details of one’s analysis
of degree clauses, a theory of grammar is necessary in which cross-category generaliza¬
tions can be made, since degree clauses occur in such a wide variety of positions.
Second, the theory that degree clauses are generated in their surface position has
certain syntactic advantages over the theory that they are extraposed from Deg'". But
this requires the more complex semantic rule (8.23) to establish the binding of the
degree clause to the Deg'". Finally, in order to preserve the X"-complement theory of
comparative clauses, thus avoiding a weakening of the general theory of phrase
structure, we had to face the problem of string rather than constituent identity. An
interpretive rule was developed which was shown to provide coherent interpretations
even to the complex stacked comparatives.
220 RAY JACKENDOFF
Along the way, we have observed somewhat distant but still striking generaliza¬
tions among degree clauses, restrictive relatives, and possibly even subject comple¬
ments. It has been particularly instructive to see how the determiner, Chomsky-
adjoined, and NP-complement theories of relatives discussed in chapter 7 are paralleled
by corresponding theories of degree clauses, how much alike the arguments for and
against the three theories are in the two domains, and how in both cases an interpretive
theory of the missing constituent in the clause is necessary to save the X"-complement
theory. If nothing else, these similarities constitute an argument on the metalevel in
favor of a close, though abstract, relationship between the two constructions.
9: Deverbalizing Rules
Section 3.6 pointed out that gerundive nominals are an exception to the normal
hierarchical arrangement of syntactic categories predicted by the X Convention: they
are dominated by NP but contain no head N. It was suggested that the phrase structure
rule that describes them is one of a class of systematic exceptions to the Uniform
Three-Level Hypothesis, characterized by the following phrase structure rule schema:
The structures grow internally less X-like and more sentencelike, as i goes from 1 to 3
and as the complements and specifiers are determined by more supercategories of V
RAY JACKENDOFF
and fewer supercategories of X. Externally, however, they are all still X"'s and occur in
positions characteristic of X'", not S.
Note that (9.1) strongly restricts the possible exceptions to the major rule schema
in several important respects. First, the only category that can appear on the right is V:
this seems to be true of all category-switching rules I have encountered. Second, even
when the category of the head of a phrase is switched to V by the application of one of
these rules, the hierarchy of primes is undisturbed: there is no instance, say, of A'
expanded as an affix plus V" or V'". Third, at the level where the category is switched
by a deverbalizing rule, there are no further complements or specifiers other than the
grammatical formative that marks the change of category. In each of these three
respects, it is clear what kind of evidence would entail weakening the restrictions; for
the moment, though, (9.1) seems like a plausible initial hypothesis.
Gerundive nominals are the most NP-like of the three complement types studied by
Rosenbaum (1967). As Emonds (1970) shows, they differ from that and for-to
complements in that they occur in all NP positions, including subject position in
questions, relative clauses, and for-to complements.
John’s leaving
(9.3) a. What would ‘ *that John left ■ reveal about him?
*for John to leave
John’s leaving
b. a man who ' *that John left ' would irritate
*for John to leave
{John’s leaving
*that John left r to bother us.
*for John to leave J
Thus gerundive nominals are apparently NPs, though they lack a noun head. To find
the level at which they change from N' to V\ we investigate what complements and
specifiers they take.
The level which explains the most about the structure of gerundive nominals is the
X" level. To recapitulate the discussion of section 2.1, gerundive nominals permit
within them all the transformations such as Dative Shift and Particle Movement
characteristic of V' but not N'. They are preceded by preverbal adverbs, characteristic
of V", not by adjectives, characteristic of N". They may contain perfect and progressive
aspect, which we assigned to V" in section 3.5. They may contain because and result
clauses characteristic of V" and not of N".
On the other hand, gerundive nominals do not contain tense and modal, character-
9: DEVERBALIZING RULES
istic of V'" and absent in N'". They contain a possessive NP in the subject,
characteristic of N'", not of V'". As Schachter (1976) points out, they occur with a
limited range of articles, as in There is no enjoying this world without thee, thus
behaving like N'", not V'". Like N'" and unlike V'", they do not permit sentence
adverbial PPs; (9.4) compares gerundive nominals to t/iat-clauses to show that the
impossibility of these particular sentence adverbials is not a restriction of all subordi¬
nate clauses:
to our delight
(9.4) *John’s ' in his haste ' leaving so early didn’t distress Sue.
for some reason
to our delight
That John(,) ' in his haste (,) left so early didn't distress Sue.
for some reason
Further arguments of this sort appear in Schachter (1976) and Horn (1975), and,
though advocating a different theory, in Williams (1975). The point is clear. Gerundives
have the constituent structure of sentences up to the X" level and that of NPs above
that. Thus the appropriate skeletal structure is (9.2b), and the phrase structure rule is
(9.5).
Ing, like other affixes, undergoes Affix Hopping to attach to the first verbal form. Thus
the underlying form of John's having left is (9.6), exactly its surface form except for the
insertion of poss and the attachment of verbal affixes.
(9.6) N
leave
224 RAY JACKENDOFF
Let us turn now to tensed subordinate clauses. As Bresnan (1970) shows, these include
not only r/n/t-clauses but also comparative clauses with the complementizers as and
than, and indirect questions with the complementizer wh. All of these are indubitably
V"'s, since they contain, in addition to all the V" structure, tenses, modals, and
nongenitive subjects; what sentence adverbials they lack can plausibly be excluded for
semantic reasons. The discussion of chapter 3 showed that that-clauses, because their
movement generalizes with that of other X"', must be X'" of some sort. Furthermore, it
was shown that the complementizer is not within the Vthat dominates the subject and
the auxiliary. Thus a structure like (9.2c) appears most appropriate for tensed
complement clauses. The only question is what category the dominating X'" should be.
Chapter 4 alluded to the dispute over whether t/?at-clauses are dominated by NP in
underlying structure. We have taken the position that they are not, that their position at
the end of X' and X"is base-generated, and that their appearance in the subject position
of certain clauses results from the application of the root transformation Intraposition.
Thus we claim that the X'" in (9.2c) is V'" for tensed complement clauses, and that
there is no separate category S as Bresnan (1970) and Chomsky (1973) assume. Under
this theory, the phrase structure rule generating tensed complements is (9.7), and
typical structures are given in (9.8).1
Of course, if it were to turn out that that-clauses are dominated by NP, a possible
realization of (9.1) to describe this would be (9.9).
1 In principle this rule could incorrectly apply to its own output to yield sequences of complementizers.
I assume, perhaps unjustly, that either semantic considerations or a surface constraint prevent such an
eventuality.
9: DEVERBALIZING RU LES
But there is another kind of tensed complement for which a dominating N"' looks a
great deal more plausible: free relatives.
Free relatives occur in all NP positions, including those in (9.3) from which that
and for-to complements are excluded:
(9.10) a. What would what the FBI found out reveal about John?
b. a man to whom what you found out would be a nuisance
c. It would be disgraceful for what you found out to be revealed.
Furthermore, free relatives, like gerundive nominals and unlike that and for-to
complements, do not have an alternate extraposed position:
Thus free relatives behave externally much more like NPs than that and for-to
complements do.
It is generally assumed that a free relative is derived from a noun phrase with a
relative clause and some sort of dummy head, such as the thing which John saw or
PRO which John saw. However, as Jespersen (1909-1949, Vol. Ill) points out, such a
derivation would not provide an immediately plausible source for free relatives with
wh-ever, as in Whatever John saw interested me, We were interviewed by whoever we
met. Still worse are free relatives of the form what money I have, what help you can
offer him, etc., in which the wh-word is followed by a noun also fronted from the
relative clause. Underlying sources like the thing [s/ have wh-money] or PRO [syo//
can offer him wh-help] do not make sense as relative constructions—particularly if we
adopt the interpretive view of relative pronouns argued for in chapter 7.
Jespersen argues that the clause itself should be considered as the entire NP. In the
present theory, this can be expressed as the claim that free relatives are generated with
phrase structure rule (9.12), another deverbalizing rule; then what money / have is
derived from (9.13).
(9.13) N'"
There are some -ing complements which, unlike most gerundives, alternate with PPs
rather than NPs:
(9.14)
b. Bill remained
2 The latter of these examples thus has a V'" in surface subject position. 1 believe a stylistic inversion
has taken place—notice that the highest stress in this sentence is on table, in the middle of the sentence, not
on issue, the expected position. By contrast, in the free relative sentence, the highest stress is at the end, as
expected.
9: DEVERBALIZING RULES
This class of gerunds allows certain V" complements (e.g. Moe went on doggedly
leaving for work at 6 a.m.), though others are excluded, probably for semantic reasons.
Thus if they are generated by a deverbalizing rule, they must be of structure (9.2b) or
(9.2c). If they are dominated by PP, choosing (9.2b) would automatically explain the
absence of subject, tense, and modals, since P'" contains none of these. The phrase
structure rule would be (9.15), and the structure of (9.14a) would be (9.16).
P"
run
Emonds (1970) suggests that the progressive be belongs to the class of verbs that
takes this sort of -ing complement. The main attraction of Chomsky’s (1957) analysis of
the progressive auxiliary was that it explained the inability of progressive to be
followed by modals, perfect, or another progressive. But Emonds shows that this may
be part of a more general set of semantic restrictions, since the verbs of (9.14) cannot
be followed by perfect or progressive either.
b. *John remained
{having worked
being working
I having sung
c. *Moe went on
[being singing
Notice also that be takes a PP semantically parallel to the progressive: compare John is
working to John is at work. Hence, however these restrictions are accounted for, they
can be generalized to progressive be if John is singing is assigned the structure (9.18).
228 RAY JACKENDOFF
(9.18)
N M V"
A V
be
ing V"
Asing
As further support for this analysis, Jackendoff (1976) shows that the verb be fills a gap
in the semantic paradigm given by the verbs in (9.14), and that the interpretation of the
progressive predicted by this semantic analysis is not implausible, at least in these
cases. Hence we have some reason to consider progressive be as a main verb which
takes a gerundive PP complement.
Other than the absence of subjects and the alternation with PPs, 1 have given little
evidence that these gerunds are dominated by PP. One further argument concerns the
transformation which inverts a simple verb with the subject in the presence of a
preposed locational PP (see Emonds (1976, 37-40) for a formulation of this rule):
(9.20) Buried here lies the producer of Gone with the Wind.
Bouncing out of the room came the garbagemen.
Screaming down the hall ran two famous linguists.
Another use for rule (9.15) may be to account for the gerundive constructions in
(9.21), discussed by Akmajian (1977).
Akmajian argues that the sun rising in the east is not a clause, but rather that the sun is
the head of the construction. He proposes the following structure:
(9.22) NP
Unfortunately, this structure contains two configurations ruled out by our theory of
phrase structure: NP over NP, and VP dominated by some category other than S or
af-X".
In order to settle the first problem, we observe that these complements occur after
restrictive relatives (9.23a), appear with proper nouns (9.23b), and exclude appositives
(9.23c,d).
(9.23) a. The man you met falling on his face was a sad sight.
b. Bill falling on his face was a sad sight.
c. ?*Bill, who we all love, falling on his face(,) was a sad sight.
d. *Bill falling on his face, who we all love, was a sad sight.
This suggests that the gerundive may be treated as a new type of N'" complement, in
alternation with appositives.3
;l This proposal is substantively different from Akmajian’s analysis in two ways. First, in object
position, these gerundive constructions permit appositives: We saw Bill, who ire all loved, driving away.
Akmajian shows that in object position there are two distinct structures for the same string: one with the
object dominating the gerund, yielding passives like Bill driving away could not be seen by anyone, and one
with the main VP dominating the gerund, yielding passives like Bill was seen driving away. Akmajian claims
the latter is derived from the former by an extraposition rule. However, this would not predict the difference
in the possibility of appositives in subject vs. object position. As an alternative, we may claim that if the
gerundive is generated in the NP, it always excludes an appositive, but that these gerundives can also be
base-generated in VP, in which case the object is permitted an appositive. We thus reject Akmajian’s
extraposition rule in favor of use of the base. In fact, if these gerunds are PPs, as I have claimed, the
necessary phrase structure rule already exists, since it generates Bill was seen on TV, with a simple PP,
parallel to Bill was seen driving away, with a gerundive PP.
A more serious problem for the present analysis is the existence of passive gerunds of this sort, for
instance Bill being struck by lightning was a sad sight. In Akmajian’s structure the Passive can take place
under the domination of the upper NP; but in the present analysis, there is no complete NP to the left of the
gerund that could be moved there by the Passive transformation. In view of current doubts about the
existence of the Passive transformation (cf, Freidin (1975), Schachter (1976), Bresnan (1976b). for example),
this argument does not seem anywhere near as damning as it would have a few years ago. One can at least
leave the issue in abeyance instead of giving up altogether.
230 RAY JACKENDOFF
As for the second problem with (9.22), the bare VP, we observe that these
gerundives alternate with PPs of approximately the same semantic import and with the
same syntactic distribution:
This suggests that the lack of subject, tense, and modal is explained not, as Akmajian
suggests, by generating a bare VP, but rather by generating V" under P", again using
phrase structure rule (9.15). Thus our alternative to (9.22) is (9.25).4
(9.25) N'"
Gerundive relatives are often thought to be derived from ordinary relatives by deletion
4 A more traditional transformational analysis, of course, would probably suggest the reverse: that the
PP cases are derived from the gerundives by a being deletion. Though I suspect there is such a
transformation involved in cases like John angry is a scary sight, it is not so plausible for John with a scar
like that is a sad sight, since with in this use does not occur after be.
9: DEVERB ALIZING RULES
of the relative pronoun plus be, so-called Whiz-Deletion. (9.26a) shows that this is
impossible, since *a man who is owning a Cadillac is ungrammatical. Williams (1975)
gives a number of other arguments against Whiz-Deletion, which I will not reproduce
here.
(9.26b) shows that gerundive relatives follow N' complements; (9.26c) shows that
they precede other restrictive and nonrestrictive relatives. Hence they are in the
position of PPs under N". Since they are without subjects, tenses, and modals, and
since there is no NP position under N" after the head, it seems most plausible on all
grounds to invoke (9.15) again.
This section has shown that P" quite plausibly expands as V'' in at least three
different circumstances in the grammar. In each case the verb appears as a present
participle; in each case a subject is impossible; in each case PPs appear in identical
positions with parallel interpretations. We thus have evidence for a heretofore
unsuspected phrase structure rule, another instance of phrase structure schema (9.1).
I have said nothing about infinitival complements so far in this chapter, and
perhaps a word is in order at this point. The problems posed by infinitival complements
are in many ways like those of gerundives; they share the obligatory absence of tense
and modal, and the sometimes optional, sometimes obligatory absence of a subject. But
infinitival complements are far more widespread and varied than gerundives, and thus
less easy to categorize.
The standard approach to infinitival complements, of course, is that they are all Ss
with for-to or (following Bresnan (1972) and Chomsky (1973)) <$>-to complementizers. It
has occasionally been suggested that the “obligatory Equi” complements, such as
those with try, are generated with bare VP complements. The approach to complemen¬
tation proposed in this chapter suggests that a more enlightened use of phrase structure
rules might help sort out some of the complications of infinitivals. For example, the for-
to clauses which alternate with r/zat-clauses, such as the complements of prefer, wish,
etc., are probably best analyzed as V'", as usual. But the many infinitivals which
alternate with PPs of parallel interpretation and not with f/iaf-clauses, for example
those in (9.27), may well be expansions of PP similar to gerundive PPs.5
(9.27)
5 Faraci (1974) presents a wide range of examples of this type, suggesting a phrase structure rule PP
P - S
232 RAY JACKENDOFF
It is not clear to me, however, what sort of phrase structure rule should be used to
generate infinitivals, particularly for-to infinitivals. The fact that parallels in other
languages to full for-to complements are relatively rare, by comparison with parallels to
that-clauses, simple to-complements, and gerundives, may indicate that they are some
particularly exotic variation on the phrase structure rule schema for which there is no
other evidence in English. In any event, I leave the issue for future research.
A final tentative example of a deverbalizing rule comes from Fiengo (1974), who
revives a proposal of Chomsky (1955), in which the passive verb phrase is an AP. The
passive auxiliary is present in deep structure, and the necessity of preposing the direct
object is guaranteed by semantic constraints (which, however, Fiengo does not spell
out in much detail). The deep structure of a passive is (9.28a) or (9.28b); its surface
structure is (9.28c) in the present framework.
(9.28) a.
love Evan
9: DEVERBALIZING RULES
b.
be A"
en V"
be A"
love t
234 RAY JACKENDOFF
The t in direct object position in (9.28c) is the “trace” left behind by movement rules in
Fiengo’s theory, a pronoun which is anaphorically bound to the NP originally
occupying that position. We will not go into Fiengo’s motivation for this theory, as it
goes far beyond the scope of the present study. However, we observe that the analysis
requires a deverbalizing phrase structure rule, here represented as (9.29), in that there
is an AP without an adjectival head.
(9.29) Passive VP
A" -> en - V"
One piece of evidence for identifying the passive verb phrase as AP is that passive
verb phrases occur, with the correct meaning, after certain verbs other than be, for
example (9.30).
Since these environments also permit APs (Fred got sick, Mary got Kathy sick, George
had Jerry angrier than a swarm of bees), an A'' structure like the one in (9.28) would
make the generation of these otherwise troublesome examples rather simple. Such a
theory would, of course, have to develop a semantic explanation of why passive VPs
do not occur in all adjectival environments, a problem which 1 must leave for future
research.
We have shown that there is evidence for at least Five instances of rule schema (9.1) in
English; we repeat these here.
Though not all of these rules have been equally well motivated, their similarity justifies
the claim that they constitute a principled class of exceptions to the Uniform Three-
Level Hypothesis.
It is interesting to notice what happens if we let / = 0 in schema (9.1), a possibility
we have not considered so far. The resulting rules are of the form X —> af - V and are
congruent with a subset of the word formation rules in the lexicon, for example those
relating derived nominals to verbs. And in fact, the historical roots of (9.5), (9.15), and
(9.29) are undoubtedly in word formation rules which still exist in the lexicon of
English. Corresponding to gerundive nominals are derived nominals of Germanic verbs,
such as building and writing. It is not difficult to imagine a historical restructuring in
which the affixing is taken to be at a higher level, thus enabling the verb transparently
contained in the noun to take some of its verbal complements. The reanalysis is from
(9.31a) to (9.31b).
build
Similarly, the past participles used in passive sentences are paralleled by past
participles used as passive adjectives such as amused, disturbed, shut, and so forth.
Again, one can imagine a reanalysis of the affix at a higher level, so that alongside the
lexical process (9.32a) arose the syntactic process (9.32b). Wasow (1977) has an
interesting discussion of syntactic tests that distinguish the two constructions from one
another.
236 RAY JACKENDOFF
Gerundive PPs are historically derived from the deverbal noun preceded by the
preposition on or at, a possible relic of which is the proform at it used to refer to these
constructions. The preposition degenerated to the prefix a-, which in turn dropped off
altogether. The transition is represented as (9.33); the structure of the intermediate step
is conjectural.
(9.33) a.
work
9: DEVERBALIZING RULES
Now consider the implications of these changes for universal grammar. There are
word formation rules for deriving almost every lexical category from almost every
other lexical category. Is it a coincidence that the only types that have developed into
phrase structure rules are those which involve deverbal constructions? Rule schema
(9.1) claims that it is not, since it restricts the class of category-switching expansions in
syntax to those in which V is on the righthand side. For example, (9.1) predicts that we
would never find a historical change like the one from (9.34a) to (9.34b).
If there were a syntactic structure (9.34b), we would expect it to have N" modifiers
such as adjectives and quantifiers, e.g. *great fearful or *much beautiful. No such
category-switching constructions have come to my attention in which the head is other
than V. By contrast, changes in the opposite direction, changing syntactic structure
into lexical structure, do occur; for example, the adjective full developed into an affix
that converts nouns into adjectives. Thus rule schema (9.1) explains why certain
historical changes could have taken place and others could not have.
This completes our brief survey of deverbalizing rules. I consider them an
extremely interesting class of rules, putting on a principled basis a sizable number of
what previously had to be treated as altogether arbitrary phrase structure expansions.
In particular, they seem to me to provide a framework for a richer syntax of
complementation than has previously been considered, one that can stimulate a fresh
approach to the crucial problem of constraints on extraction. Moreover, as we have
just mentioned, they provide a previously unsuspected locus for historical change in
syntax.
238 RAY J ACKEN DOFF
6 After this book went to press, it came to my attention that there are also free relatives dominated by
PP, for example the subordinate clauses in She put it where l wanted it and I’ll go whenever you do. This
implies a further deverbalizing rule:
(i) P'" wh - V'"
10: More General Implications
We further claimed that there are only two kinds of exceptions to (3.9), coordinate
conjunction and deverbalized constructions, each constrained by a characteristic
phrase structure rule schema.
We claim, then, that all possible phrase structure rules of natural language are
described by these three schemata.
Starting in section 3.4 we embarked on a tour of English, concerned with showing
that its phrase structure is described by this theory. We took particular care to show
that many previous analyses of phrase structure which are not instances of the three
240 RAY JAC'KENDOFF
of section 4.5, the generalizations among phrase structure rules are characteristically
only partial, giving rise to several sorts of situations in which the notational conven¬
tions traditionally incorporated in the evaluation measure fail to appropriately reduce
the number of features in the rule. Since the evaluation measure as a simple feature¬
counting device depends on the existence of appropriate abbreviatory conventions,
there are only two solutions to the problem: either develop new abbreviatory
conventions, or develop an evaluation measure that does something more than just
count features. We have shown that one of these alternatives is necessary in the theory
of the phrase structure component.
It may not be out of place here to point out that Jackendoff (1975a), discussing the
nature of lexical generalizations, shows that they too resist analysis in terms of a
feature-counting evaluation measure. A different sort of evaluation measure is pro¬
posed there, better adapted for the partial generalization endemic to the lexicon:
instead of attempting to collapse related lexical entries into a single more general entry,
it directly measures the redundancy between fully specified entries. A parallel approach
in the phrase structure component would use the feature and prime system not to
collapse rules but to enable the evaluation measure to judge, given two or more rules
with all categories fully specified, how much alike they are, i.e. how easy it is to learn
one given the other. Whether such an approach would be feasible I leave for future
research: it has been shown, in any event, that the usual approach is inadequate.
Of course, the evaluation measure (at least as presently conceived of) would value
most highly a grammar in which all categories of the same level had exactly the same
phrase structure expansion, aside from the head. Such a grammar seems hardly likely,
and there are probably many substantive universals which prevent this ideal from being
achieved. For example, we have good reason to expect M'" only under V'" and Art'"
only under N'", and these very specific restrictions are undoubtedly part of universal
grammar. For a more controversial case, observe that the grammar of English would be
simplified if nouns and adjectives came to be followed directly by NP, as verbs and
prepositions are. Yet this change seems highly unlikely. One possible explanation is
that universal grammar places such general restrictions on phrase structure expansions
as prohibiting surface objects in N' and A', prohibiting subjects in A'" and P'", and
generally excluding all X' modifiers from the minor lexical categories. (This would
amount to the claim that my choice of feature names has theoretical significance.)
Whether or not these specific restrictions are correct, they are indicative of the way in
which substantive universals could in principle prevent the phrase structure component
from achieving maximal generality.
Having discussed the implications of this study for the syntactic component, we
will conclude by turning to its impact on semantics.
assumptions as “sweeping difficulties under the rug”, my sense of the situation is that,
given an apparently arbitrary restriction on syntax, the account of it that puts the least
burden on the language learner is that it follows from independently motivated rules
and restrictions on semantic interpretation; it can then be claimed that knowing the
meaning of the construction in question automatically results in the application of the
observed constraint. If instead the constraint has to be added to the syntax, either
through universal grammar, or worse yet, in the language-particular syntax, no real
explanation has been provided. It seems to me that the success of this methodology has
been amply proven in Jackendoff (1972; 1974b; 1975c; 1976) and need not be further
defended here.
In the present study, however, I have been somewhat lax about actually proposing
semantic solutions to syntactic constraints. In a few cases, such as the binding of
restrictive relative clauses to their articles and the interpretation of relative pronouns
and the PRO in degree clauses, I have given explicit solutions. But the cases left open,
such as the Partitive Constraint, pseudopartitive determiner constraints, the binding of
degree clauses to their degree words, and the nonrecursion of the complementizer
node, have been far more numerous. Thus the present study leaves unsolved a great
number of particular problems, some of which are demonstrably semantic, some of
which have been called semantic admittedly on the basis of little more than hope.
More generally, we have explored a wide range of syntactic constructions whose
semantics have never been seriously studied. And though for the most part we have
mentioned the semantics of these many constructions only peripherally, it is clear that
they provide a rich field of inquiry for natural logic. For example, consider the range of
modifiers within the NP. At the N' level there are functional arguments, which are well
understood by logic. At the N" level there are restrictive modifiers (generally treated in
logic as conjoined predications or restrictions on quantifiers), the quantifiers many and
few (not among the quantifiers traditionally treated by logic), and numerals, measure
phrases, and pseudopartitive group nouns (outside the province of traditional logic). In
N'" there are subjects (again functional arguments), genitive NPs of intrinsic connection
(not treated by logic), demonstratives and N'" quantifiers (only partially within
traditional logic), appositives (again treated as conjoined predication), and gerundive
PPs (semantics unknown). The PP, which has in the past been treated with equal
disdain by syntax and semantics, has proved to have an extremely rich syntax,
including a characteristic system of quantification and degree modification; this syntax
implies the existence of a corresponding semantics, up to now totally unexplored. In
order for the semantic component of language to achieve even observational adequacy,
it is clear that there is a vast range of phenomena yet to account for.
But of course the syntactic investigation here was not directed toward merely the
complexity of phrase structure but toward uncovering the richness of cross-categorial
generalization in phrase structure and in grammatical relations. Since grammatical
relations, as the environments over which selectional restrictions are defined, are in
fact the structural descriptions of rules of semantic interpretation, cross-categorial
10: MORE GENERAL IMPLICATIONS
What one might guess not to be universal about the correspondence is exactly how
the phrases expressing internal constituents of the major semantic categories are
distributed syntactically within the major phrasal categories. For instance, we might
expect to find modifiers of the same semantic type occurring within X" in one case and
within X'" in another. We have found at least two such examples in English alone: the
occurrence of semantic quantifiers at both the N'" and N" levels, and the alternation of
nonrestrictive relative clauses in N'" with nonrestrictive adjective phrases in N". It
would not be surprising to find much greater variability across languages in this
respect.
On the other hand, one might expect the correspondences within each X'" not to
be totally arbitrary. For example, nonrestrictive modifiers probably cannot be more
deeply embedded in X"' than restrictive modifiers, and functional arguments other than
the subject quite possibly must occur in X'. Among the tasks of cross-linguistic
investigation would be to discover exactly how constrained these relationships are, and
to specify how extensive cross-categorial generalizations are both in universal grammar
and, where universal grammar leaves the question open, within particular languages.
Such a program of lexicalist semantics, then, would conceive of the hierarchical
aspect of rules of phrase structure as being to a significant extent dictated by semantic
considerations. However great or small the relationship actually turns out to be, the
result is of interest. For to whatever extent the form of the syntax is not determined by
semantics, this is evidence for Chomsky’s (1975) contention that the language faculty is
an autonomous mental organ whose structure is as different from other mental organs
as the heart is different from the liver. On the other hand, the closer that syntactic form
is related to semantic form in particular or universal grammar, the more it is possible to
use purely syntactic arguments as a basis for hypotheses about semantic structure.
As may be gathered from the preceding remarks, I believe that the relationship
between syntax and semantics is a strong one, and that it can and should be exploited
in order to counteract the irresponsibility toward syntactic form that is characteristic of
most present-day work in logic. Nor should this relationship be used, as many linguists
have used it, to bring evidence from a mathematically conceived formal semantics to
bear on syntax. Rather, the syntax itself provides precise and highly differentiated data;
a lexicalist semantics would use this evidence to shed light on the much murkier
question of how natural logic is structured. I consider Jackendoff (1975c; 1976) as
examples of such an approach, using syntactic paradigms to motivate novel semantic
analyses of some generality.
Since semantic representation is presumably closely related to cognitive structure,
such a program of research has a further consequence. To the extent that a lexicalist
semantics proves successful, the study of language provides an unparalleled tool for
investigating not just a single organ of the human mind, but perhaps some of the most
fundamental principles of mental organization. The potential implications for psychol¬
ogy are vast. It is hoped that the present study provides some impetus toward their
discovery.
References
Greenberg, J. (1963) “Some Universals of Grammar with Particular Reference to the Order of
Meaningful Elements,” in J. Greenberg, ed. (1963).
Greenberg, J., ed (1963) Universals of Language, MIT Press, Cambridge, Massachusetts.
Gruber, J. S. (1965) Studies in Lexical Relations, Doctoral dissertation, MIT, Cambridge,
Massachusetts; published by Indiana University Linguistics Club, Bloomington, Indiana.
Hale, K. (1975) “The Adjoined Relative Clause in Australia,” in R. M. W. Dixon, ed. (1975).
Halle, M. (1973) “Prolegomena to a Theory of Word Formation,” Linguistic Inquiry 4.1, 3-16.
Harris, Z. (1946) “From Morpheme to Utterance,” Language 22, 161-183.
Harris, Z. (1951) Methods in Structural Linguistics, University of Chicago Press, Chicago,
Illinois.
Higgins, F. R. (1973) “On J. Emonds’s Analysis of Extraposition,” in J. Kimball, ed. (1973).
Horn. G. (1975) “On the Nonsentential Nature of the Poss-Ing Construction,” Linguistic
Analysis 1.4, 333-388.
Hornstein, N. (1975) “S and the X Convention.” Montreal Working Papers in Linguistics, Vol.
4, McGill University Department of Linguistics, Montreal.
Horvath, J. (1976) “Focus in Hungarian and the X Notation,” Linguistic Analysis 2.2, 175-197.
Jackendoff, R. (1968) “Quantifiers in English,” Foundations of Language 4.4, 422-442.
Jackendoff. R. (1969a) “An Interpretive Theory of Negation,” Foundations of Language 5.2,
218-241.
Jackendoff, R. (1969b) “Les constructions possessives en anglais,” Langages 14, 7-27.
Jackendoff, R. (1971) “Gapping and Related Rules,” Linguistic Inquiry 2.1, 21-36.
Jackendoff, R. (1972) Semantic Interpretation in Generative Grammar, MIT Press, Cambridge,
Massachusetts.
Jackendoff, R. (1973) “The Base Rules for Prepositional Phrases,” in S. Anderson and P.
Kiparsky, eds.(1973), 345-356.
Jackendoff, R. (1974a) “Introduction to the X Convention,” published by Indiana University
Linguistics Club, Bloomington, Indiana.
Jackendoff, R. (1974b) “A Deep Structure Projection Rule,” Linguistic Inquiry 5.4, 481-506.
Jackendoff, R. (1975a) “Morphological and Semantic Regularities in the Lexicon,” Language
51.3, 639-671.
Jackendoff, R. (1975b) “Tough and the Trace Theory of Movement Rules,” Linguistic Inquiry
6.3, 437-446.
Jackendoff, R. (1975c) “On Belief-Contexts,” Linguistic Inquiry 6.1, 53-93.
Jackendoff, R. (1976) “Toward an Explanatory Semantic Representation,” Linguistic Inquiry
7.1, 89-150.
Jackendoff, R. and P. Culicover (1971) “A Reanalysis of Dative Movements,” Foundations of
Language 7.3, 397-412.
Jacobs, R. and P. Rosenbaum, eds. (1970) Readings in English Transformational Grammar,
Ginn, Waltham, Massachusetts.
Jespersen, O. (1909-1949) A Modern English Grammar, George Allen & Unwin, London.
Katz, J. J. (1966) The Philosophy of Language, Harper & Row, New York.
Katz, J. J. (1972) Semantic Theory, Harper & Row, New York.
Keyser, S. J. (1968) Review of S. Jacobson, Adverbial Positions in English. Language 44.2, 357—
373.
Kimball, J., ed. (1973) Syntax and Semantics, Vol. 2, Academic Press, New York.
Kimball, J., ed. (1975) Syntax and Semantics, Vol. 4, Academic Press, New York.
Klima, E. S. (1964) “Negation in English,” in J. A. Fodor and J. J. Katz, eds. (1964), 246-323.
Klima, E. S. (1965) Studies in Diachronic Syntax, unpublished Doctoral dissertation, Harvard
University, Cambridge, Massachusetts.
248 RAY JACKEN DOFF
Kuroda, S.-Y. (1969) “English Relativization and Certain Related Problems,” in D. Reibel and S.
Schane, eds. (1969), 264-287.
Lakoff, G. (1970a) "Global Rules,” Language 46.3, 627-639.
Lakoff, G. (1970b) “Repartee, or a Reply to ‘Negation, Conjunction, and Quantifiers’,”
Foundations of Language 6.3, 389-422.
Lakoff, G. (1971a) “On Generative Semantics,” in D. Steinberg and L. Jakobovits, eds. (1971),
232-296.
Lakoff, G. (1971b) On Syntactic Irregularity, Holt, Rinehart and Winston, New York.
Lakoff, G. and J. R. Ross (1966) “A Criterion for Verb Phrase Constituency,” in Mathematical
Linguistics and Automatic Translation, Report NSF-17, Harvard Computation Labora¬
tory, Harvard University, Cambridge, Massachusetts.
Lasnik, H. and R. Fiengo (1974) “Complement Object Deletion,” Linguistic Inquiry 5.4, 535—
572.
Lees, R. B. (1961) "Grammatical Analysis of the English Comparative Construction,” Word 17,
171-185; in D. Reibel and S. Schane, eds. (1969).
Lightfoot, D. (1974) “The Diachronic Analysis of English Modals,” Montreal Working Papers in
Linguistics Vol. 3, McGill University Department of Linguistics, Montreal, 115-145.
McCawley, J. D. (1971) “Where do Noun Phrases Come From?” in D. Steinberg and L.
Jakobovits (1971), 217-231.
McCawley, J. D. (1975) “The Category Status of English Modals,” Foundations of Language 12,
597-601.
Milner, J.-C. (1973) “Comparatives et relatives,” in J.-C. Milner, ed. (1973).
Milner, J.-C., ed. (1973) Arguments Linguistiques, Maison Marne, Paris.
O’Neil, W. (1976) “Clause Adjunction in Old English,” ms., to appear in General Linguistics.
Perlmutter, D. (1970) “On the Article in English,” in M. Bierwisch and K. E. Heidolph, eds.
(1970).
Peters, P. S., ed. (1972) Goals of Linguistic Theory, Prentice-Hall, Englewood Cliffs, New
Jersey.
Postal, P. M. (1966) “On So-Called ‘Pronouns’ in English,” in F. Dinneen, ed. (1966); in D.
Reibel and S. Schane, eds. (1969), 201-224.
Postal, P. M. (1971) Crossover Phenomena, Holt, Rinehart and Winston, New York.
Postal, P. M. (1974) “On Certain Ambiguities,” Linguistic Inquiry 5.3, 367-424.
Rardin, R. (1975) Studies in Derivational Morphology, unpublished Doctoral dissertation, MIT,
Cambridge, Massachusetts.
Reed, A. (1974) The Structure of English Relative Clauses, unpublished dissertation, Brandeis
University, Waltham, Massachusetts.
Reibel, D. and S. Schane, eds. (1969) Modern Studies in English, Prentice-Hall, Englewood
Cliffs, New Jersey.
Rosenbaum, P. (1967) The Grammar of English Predicate Complement Constructions, MIT
Press, Cambridge, Massachusetts.
Ross, J. R. (1967) Constraints on Variables in Syntax, Doctoral dissertation, MIT, Cambridge,
Massachusetts; published by Indiana University Linguistics Club, Bloomington, Indiana.
Ross, J. R. (1969) “Adjectives as Noun Phrases,” in D. Reibel and S. Schane, eds. (1969), 352—
360.
Schachter, P. (1976) “A Nontransformational Account of Gerundive Nominals in English,”
Linguistic Inquiry 7.2, 205-241.
Selkirk, E. O. (1970) “On the Determiner Systems of Noun Phrase and Adjective Phrase,”
unpublished paper, MIT, Cambridge, Massachusetts.
REFERENCES
Selkirk, E. O. (1972) The Phrase Phonology of English and French, unpublished Doctoral
dissertation, MIT, Cambridge, Massachusetts.
Selkirk, E. O. (1974) “French Liaison and the X Convention,” Linguistic Inquiry 5.4, 573-590.
Selkirk, E. O. (1977) “Some Remarks on Noun Phrase Structure,” in A. Akmajian, P. Culicover,
and T. Wasow, eds. (1977).
Siegel, D. (1974) Topics in English Morphology, unpublished Doctoral dissertation, MIT,
Cambridge, Massachusetts.
Smith, C. (1964) “Determiners and Relative Clauses in a Generative Grammar of English,”
Language 40.1, 37-52; in D. Reibel and S. Schane, eds. (1969), 247-263.
Sommerstein, A. (1972) “On the So-Called Definite Article in English,” Linguistic Inquiry 3.2,
197-210.
Steinberg, D. and L. Jakobovits, eds. (1971) Semantics: An Interdisciplinary Reader, Cambridge
University Press, New York.
Stockwell, R., P. Schachter, and B. Partee (1973) The Major Syntactic Structures of English,
Holt, Rinehart, and Winston, New York.
Thompson, S. A. (1971) “The Deep Structure of Relative Clauses,” in C. Fillmore and D. T.
Langendoen, eds. (1971), 76-96.
Van Riemsdijk, H. (1973) “The Dutch P-Soup,” unpublished mimeograph, University of
Amsterdam.
Van Riemsdijk, H. (1976) “Extraction from Prepositional Phrases and the Head Constraint,”
unpublished mimeograph. University of Amsterdam.
Vergnaud, J.-R. (1974) French Relative Clauses, unpublished Doctoral dissertation, MIT,
Cambridge, Massachusetts.
Wald, J. D. (1976) Stuff and Words: A Semantic and Linguistic Analysis of Non-Singular
Reference, unpublished Doctoral dissertation, Brandeis University, Waltham, Massachu¬
setts.
Wasow, T. (1971) Anaphoric Relations in English, unpublished Doctoral dissertation, MIT,
Cambridge, Massachusetts.
Wasow, T. (1975) “A Note on Empty Nodes,” Foundations of Language 12, 603-607.
Wasow, T. (1976) Anaphora in Generative Grammar, Story-Scientia, Ghent.
Wasow, T. (1977) “Transformations and the Lexicon,” in A. Akmajian, P. Culicover, and T.
Wasow, eds. (1977).
Wasow, T. and T. Roeper “On the Subject of Gerunds,” Foundations of Language 8.1,44-61.
Williams, E. (1975) “Small Clauses in English,” in J. Kimball, ed. (1975), 249-273.
1*jU? iXV* -
-br¬
other recent titles in the series
Linguistic Inquiry Monographs
-->**•* . _ v V
» ‘ £»*~ •» * »■* >“♦. -w ♦v ^
v*