Professional Documents
Culture Documents
COMPUTATIONAL MODELS
ArriJicial
Volurne II:: Exper1
Artificial Inlelligence:
Intelligence: An AllT
AlIT Persprctii~e.
Perspective, Volume
Expert Problem
Problem Solving
Solving,
Narural
Urrdersrandif~g,Inrelligerlt
Coffrpuler Coaches,
Coaches. Rc[~resenlalioft
Natural Ianguage
Language Understanding,
Intelligent Computer
Representation
cditcd by Patrick Hcnry
Hcnry Brown,
IIrown, 1979
1979
and Learfrrng
Gnd
Learning edited
Henry Winston and Richard Henry
Artificial/n/cliigence:
An /MIT
Perspective. Volume
Arlr$cial
l~rleliigencr: Afr
I T Perspcclivc
Volume 1/:
11: UlIderstGllding
Urrdersrarrding Vision,
Visiorz
Manipulation, Computer
Design, Sj,rnbo/
Symbol blarripula~ion
Manipulation cditcd
edited by I'atrick
Patrick Henry
Maniyulariun,
(-ompuler Drsign~,
Winston and Richard Henry
Hcnry Brown, 1979
1979
System for
for Repre$en/ing
Representing afrd
alld Using
Real- World Knowledge
Knowledge by Scott
NETL: A Sysrern
Usirrg RmlFahlman, 1979
1979
The Irrlerprera~iort
f Visual
i'isual b!o/io~r
1979
The
Interpretation o
of
Motion by Shimon Ullman, 1979
A Theory
Syntactic Recogrzition
Recognition for
for Natural I.anguage
l.anguage by Mitchcll
Mitchell P. Marcus.
Marcus,
Theory of
of Syr~ractic
1980
1980
Turtle
Afedium for
for Exploring Atarhernarics
Afothemotics by
Turrle Geometry:
Gromerfy: The
The Computer
Cor~rpureras a Medium
Harold Abelson
Abclson and Andrea diSessa,
diScssa, 1981
1981
From Images
Con~puroriar~ol
Study of
of !he
Visual
Images to Surjace:
Surfaces: AA Computational
the Human Early Visual
Sj'srem
1981
System by William Eric Lcifur
Leifur Grimson, 1981
Computational
Models of
Michael Brady and Robert C.
Co~npurationalModels
of Discourse Edited by Michacl
Ucrwick,
1983
Berwick, 1983
Robor
Conrrol by Michael
Michacl Brady, John Hollcrbach,
Rabat Moriorr:
Motion: Planning and Control
Hollerbach, Tomas
Lozano-Perez, Matthew
Matthcw Mason, and Timothy Johnson, 1983
1983
Contributors:
James Allen
Robe rt C. Berwick
Be rwick
Robert
Je rrold Kaplan
Jerrold
David McDonald
L. Sidner
Candace L.
Bonnie Lynn Webber
The
The MIT
MIT Press
Press
Cambridge.
Cambridge,Massachusetts
Massachusetts
London.
Idndon, England
England
PUBLISHER'S NOTE
PUBLISHER'S
This format
fortT!at is intended to rcducc
reduce the cost of
of publishing certain works in book
'lhis
fonn and to shorten the gap bctwccn
between editorial preparation
preparation and final publication.
form
expense [IF
of dctailcd
detailed editing and composition in print
print have been
The time and cxpcnsc
the
of this
tllis book dircctly
directly from the author's
author's
avoided by photographing L
he text of
computer
printout.
cotnputcr
printing, 1989
1989
Fourth printing,
1983
1983 by
Copyright O
The Massachusetts Institute ofTcchnology
ofTechnology
'lhe
reserved. No part of
of this book may bc
be rcproduccd
reproduced in any form,
fonn, or by
All rights rcscrvcd.
means, electronic
electronic or mcchanical.
mechanical. including photocopying.
photocopying, recording, or by
any mcans,
infonnation storage and rctricval
retrieval system, without pcnnission in writing from
any information
publisher,
the publisher.
United States of America,
America.
Printed in the Unitcd
Library of Congress
Congress Cataloging
Catalopil~gin Publication
Publication Data
Main entry under tltle:
title:
Computational Models
Models of Discourse
(The MIT Press
Press series
scries in artificial
artificial intelligence)
intelligence)
Bibliography: p,
p.
Includes
Includes index.
index.
I.
1. Artificial
Artificial Intelligence.
Intclligence. 2.
2. Linguistics--Data
Linguistics--Data
3.
Speech
processing
processing.
processing. Speech prwcssing systems.
systems. I.Brady,
I.Brady,
Miehacl,
C. Ill.
1945- 11.
11. Berwick,
Bcrwick, Robert
RobertC.
Ill. Allen,
Allen.
Michacl, 1945James.
1V. Series.
Series.
James. IV.
001.53'5
Q335.C56
Q335.CS6 ln3
1983
M11.53'5 ~2-20402
82-20402
ISBN
ISBN 0-262-02183-8
0-262-02183-8
ISBN
ISBN: 0-262-52391-4
0-262-52391-4 (paperback)
(Paperback)
CONTENTS
CONTENTS
Forcword
Foreword
Rlichacl Brady
Michael Brady
xiii
Preface
PreCacellavid Isracl
Dalid Israel
xlii
27
27
37
37
42
46
50
59
63
66
68
70
74
77
84
87
107
107
110
HO
HI
112
113
114
llS
xiii
xvii
Contents
Contents
Bclicf, knowledge, and wants
nelief,Actions
knowledge,
and wants
and plans
Actions
and
plans
I h c plan infcrcnce rules
Thc plan
inference
rules
Rating
hcuristics
Rating?11c
heuristics
control of plan inferencing
111e control
of plan
inferencing
Obstacle
detection
Obstacle detection
Examplcs of hcfpful responses
Examples 'fie
ofhclpful
responses
train domain
'The train
domain
I h c spccch act definitions
111e speech
act definitions
Examplc
1: providing more information than requcstcd
Example
I:
providing
more information
than requested
Examplc 11: a ycs/no
question answered
nn
Example 11: a yes/no question answered n"
Indircct spccch acts
Indirect speech
acts
acts and mutual belief
Spccch
Speech acts and mutual belief
Surface linguistic acts
Surface13xtcndcd
linguisticplan
acts infcrcncing
Extended plan inferencing
Examplcs of indircct acts
Examples
of indirect
acts
Using
gcncraI knowledge
Using general
knowledge
l>iscussion
Discussion
Analyzing sentcncc fragments
Analyzing An
sentence
fragments
example
of a scntcnce fragment
An example of a sentence fragment
Conclusions
Conclusions
vi
vi
116
117
120
126
129
132
134
134
135
137
142
134
134
135
137
142
144
148
151
152
155
159
160
161
162
164
164
167
170
172
174
175
176
177
178
180
183
187
189
199
200
201
167
vii
201
205
267
267
270
Con tents
Contents
The focusing
approach
to anaphora
The focusing
approach
to anaphora
of focus
Thc definition
The definition
of focus
A skctch of the proccss model of focusing
A sketch
of the process model of focusing
'Ihc rcprcscntation
of focus
The representation
offocus
the
discourse
Finding
Finding the discourse focus focus
Kcjecring
thc cxpcctcd
Rejecting
the expected
focus focus
I
n
fcrring
and
focusing
In ferri ng and focusing
An algorithm for focusing
An algorithm for focusing
rnovcment
Focus Focus
movement
Rackwards
focus movcment
Rack wards focus movement
b e focusing algorithm for movcment
Using Using
the focusing
algorithm for movement
fur pronoun
interpretation
Focus Focus
for pronoun
interpretation
focus
for
pronoun
intcrprctation rules
Using
Using focus for pronoun interpretation rules
Focus
and
knowlcdgc
rcprcscntation
Focus and knowledge representation
rcstrictions
on co-specification
on co-specification
Focus Focus
restrictions
Pronouns
which
have
no co-specifiers
Pronouns which have no co-specifiers
'Ihc
problcm
of
parallelism
The problem ofparaJlclism
viii
273
278
278
280
283
287
289
291
295
299
302
304
305
310
314
316
318
Thc intcrprctation
and that
of this of
and[his
that
The interpretation
Co-prcscnt
foci
in anaphor disambiguation
Co-present foci in anaphor disambiguation
of co-prcscnt
[hisand lhal
lntcrpretation
Interpretation
of co-present
this and thai
This and [hat in focus movement
This and that in focus movement
themovement
focus movcmcnt
algorithm
Using Using
the focus
algorithm
320
320
321
323
327
Conclusions
328
Socan
what
talk now?
about now?
So what
wecan
talkwe
about
Bonnie
L.
Wcbber
Bonnie L. Webber
331
Introduction
Introduction
Fundamental
assumptions
Fundamental
assumptions
331
in forming
discourse-dependent
descriptions
in fanning
discourse-dependent
descriptions
FactorsFactors
The
dcfinitc/indcfinite
distinction
The definite/indefinite distinction
Quantifier
Quantifier
scopingscoping
Membcr/set
information
Member/set
information
of
I'hrce
uses
Three uses of pluralsplurals
An appropriate formalism for computing descriptions
An appropriate
fonnalism for computing descriptions
Noun phrases
in general
Noun phrases
in general
Singular
noun
Singular noun phrasesphrases
noun phrases
Plural Plural
noun phrases
Deriving
discoursc
entity IDS
Deriving
discourse
entity IDs
IDS
for
specific
discourse entities
IDs for specific discourse entities
339
340
334
342
345
345
347
347
349
350
353
353
ix
One anaphora
Conclusion
Conclusion
Bihliography
361
364
370
Bihliography
373
Index
391
THE
THE AUTHORS
James
James /\lIen
Allen
Assistant
Assistant Professor
Profcssor
Department
Science
13cpartrncntof Computer
Co~npi~tcr
University
Rochester
University of
oTRochcster
Rochester,
Ilochesler. NY 14627
14627
Robert
C. Berwick
IlobenC.
Bcrwick
Assistant
Assistant Professor
Professor
Ilcpartmcnt of Eke.
Elcc. Eng.
Eng. and Computer
Cornputcr Science
Department
Mti
nciallntelligence
Artificial
lntclligcncc l.ahoratory
laboratory
Massachusetts
hlassathusctrs Institute
lnstitutc ofTeehnology
of Technology
Carnbridgc M/\
MA 02139
02139
Cambr.idge
Michael Brady
J.J. Michael
Senior Research
Rcscarch Scientist
Scientist
Senior
Artiticial lntclligcncc Laboratory
Mtinciallntelligence
Massachusctls Institute ofTeehnology
of Technology
Massachusetts
Cambridge. MA 02139
02139
Cambridge,
David Israel
David
Rcscarch Scientist
Scientist
Research
Bolt, Beranek,
Bcranek, and Newman Inc.
Bolt,
50 Moulton Street
50
02139
Cambridge, MA 02139
S. Jerrold
Jcrrold Kaplan
S.
Vice
President. Business
Business Development
Vice President,
Tcknowledge Inc
Teknowledge
525 University
University Avenue
525
Palo
Alto, CA 94301
94301
Palo /\Ito,
Authors
David Mcllonald
McDonald
Ilavid
Assistant
Assist.int Professor
k'rofcssor
Computcr and Information
Information Science
Science
Computer
Massachusetts
University of Mass.achusetts
Amherst, MA 01003
Amhcrst
01003
Candace L.
I,. Sidner
I<cscarch
Scicntist
Research Scientist
Bolt, ncranck,
Beranek. and Ncwman
Newman Inc.
Ilolt.
50 Moulton Street
Cambridge, MA 02139
MI39
Cambridge.
Bonnie Lynn
Ijonnic
l.ynn Webber
Wcbbcr
Associate Professor
l'rofcssor
Department of Computer
Information Science
Computcr and lnformation
1)cpartmcnt
Moorc
School of Elcctrical
Moore School
Electrical Engineering
Univcrsity
Universily of Pennsylvania
I'hiladelphia.
19104
Philadelphia. PA 19104
xii
FOREWORD
FOREWORD
Michael Brady
Michael Brady
Brady
xiv
conventional linguists.
linguists. The
I h e work of Schaok,
Schank, Winograd,
Winograd, Wilks,
Wilks,
limited number of conventional
Woods, and their associates,
associatcs, suggested that it was
was possible
possible to build a (huge)
(huge)
Woods,
program capable
capable of interesting
intcrcsting linguistic
linguistic behavior.
computer program
cmphascs of different research
research efforts
efforts were
wcrc in different aspects
aspects
Although the
the emphases
Although
language, they
thcy shared
sharcd the
the characteristic
charactcristic that they
thcy resulted
rcsultcd in entire
entirc (mostly)
(n~ostly)
of language,
working systems.
systcms. All the
thc systems
systcms were
wcrc required
rcquircd to
to deal
dcal with a wide
wide range
range of
working
challengcs. 'Iney
'lhcy included a parser, a lexicon,
Icxicon, and emhodied
cmbodicd proposals
proposals
linguistic challenges.
linguistic
rcprcscntation and role
rolc of semantic
scmantic and pragmatic
pragmatic information
information in
about the representation
scntcnce. 'Ine
'lhe detailed
dctailcd interaction
intcraction
undcrstanding and responding to an English sentence.
understanding
bctwcen the
thc various subsystems
subsystc~nswas au central
ccntral concern.
conccrn. In Winograd's
Winograd's system,
system, fOi
fo~
between
example, the
thc fact
bct that ti,e
rhe sequence
scqucnce of
uf processing
pr(rcssing could not be
bc specified in
example,
advancc was claimed as a major
majur feature.
feature. The
'l'he program
pn~gramSIIRDLU
SI1ItDI.U contributed to
advance
the theory that sophisticated process
prtress interaction
intcraction is central
ccntral to modeling
modcling the
thought. perception,
pcrccption. and language
language understanding.
undcrstanding. In
flexibility of human thought,
reuospcct however,
howcvcr, these
thcsc approaches
approachcs to
tu computational
computatit~nallinguistics
linguistics seem
scc~nto be
retrospect,
rncchanism.
overly concerned with mechanism.
achieve a total, working.
working, natural
natural language
language
Incvitably, in order to achieve
Inevitably,
undcrstanding program,
program. early systems
systcms were
wcrc forced
forccd to ignore,
ignore, gloss over,
over, or
understanding
otherwise compromise
compromisc on. many aspects
aspects of language understanding.
undcrstanding. The
otJ1crwisc
treatment of time,
tirnc. mood.
mood, purpose. theme,
thcmc, the
thc determination
dctcrmination of focus
focus and
reference, as well as many othcr
other issues.
issues, were
wcrc accorded only a preliminary
rcference,
lreatmcnt. Some
Somc of the
thc main insights which informed
ink~rmedthc
Uic construction of these
treatmenl.
systems wcrc
were computational, and some of the main lcssons
lessons which thc
the authors
claimed could bc
lcarncd
from
their
research,
were
about
computational
issues.
be learned
research,
issue~.
the architecture of process interactions
interactions and the dcsign
design of rcprcscntations
representations for
scmantic
semantic processing. For this reason.
reason, thc
the dctailcd
detailed operation
opeT:.ltion of
of thc
the early
early natural
language
language undcrstanding
understanding systems and the issues Ulcy
they addressed wcre
were largely
accessible to the gcncral
general artificial intclligcncc
intelligence community.
community. At thc
the i~~tcrdisciplinary
interdisciplinary
workshop on Theoretical issues if,
in norural
nalural larigunge
language proccssb~g
processing [Nash-Webber and
19751, six of the eight sessions
Schank 1975),
sessions conccrncd
cuncemed mcmory
memory and knowledge
representation. Must
of
the
papers
in
thosc
sessions
could
have justas
Most of
those
just.as easily been
presented at a workshop on vision or reasoning (many
(many wcrc).
were). Interestingly, only
prcscnted
one of
of the forty papcrs
papers prcscntcd
presented at that workshop
workshop is rcfcrcnced
referenced in this volume?
volume. l
What are
arc tllc
tile symptoms ofchange
of change in computational
computational linguistics that
tllat II referred
to earlier? First
First, likc
like vision,
visiun, robotics.
robotics, search,
search, and filrnmal
formal reasoning,
reasoning, it has become
1.
N l . A P - I were refined
L To be
be fair, some ol
of !be
the syslelns
systems dcarihed
described a1
at nllNLAP]
relined and re-presented at
TINLAP-2. Several relcreneoare
referencc~ are made LOTINLAP-2.
lo TINLAP2.
xv
Brady
Brady
xvi
xvi
PREFACE
PREFACE
David Israel
Israel
In the
researchers talked confidently about
thc early
carly days
days of Artificial Intelligence.
lntclligcncc. rcscarchcrs
building
simulate (or emulate) any
building fully
fully intelligent
iotclligcnt entities,
entilics. "robots",
"robots". which could simulatc
natural
achievements, of
of course, is
ct~gnitivchuman achievement.
achic\,cmcnt. Among such achicvcmcnts,
natural cognitive
the
language. IThe
semi-official
thc (learning,)
(Icarning,) use and understanding
undcntanding of natural languagc.
h e early, semi-c~fficial
view
be duplicated by
view seems
sccms to
to have
havc been
bccn that the human linguistic
linguistic capacity could bc
the
thc application
application of perfectly
pcrfcctly general,
gcncral. non-domain specific operations to the
adtnittcdly special,
spccial, but largely
largcly then uncharted,
unchartcd, domain of
language. This was.
admittedly
oflanguagc.
was, at
timc, a perfectly
pcrfcctly reasonable
rcasunablc position. It has ccased
the time.
the
ceased to be so and the essays in
this volume
volume bespeak the recognition of that fact by the vast majority of
dlis
of natural
language
renouncing
lntclligcncc. They do so precisely by rcnouncing
language researchers
rcsearchcrs in Artificial Intelligence.
cvcn a residual
rcsidual obligation to speak
spcak to the issue. Whcrcof
even
Whereof we should no longer
haw to
to speak.
spcak, we should
should pass over
ovcr in silence.
silence.
have
longcr OK to "work
"nork in natural languagc"
Thus. it isis no longer
language" completely innocent
innocent of
of
rcsearch in syntactic
syntactic theory. It should,
should. by the way,
current research
way. be less than perfectly
alright for
filr people in syntactic theory to be as innocent of work in computational
alright
linguistics, especially
cspccially in the theory of parsing, as many still are. The same point
linguistics,
could
be
made
about the inexcusability of ignorance of contcmporary
could
contemporary work in
scmantics: but nowhere near so glibly. II shall rcturn
semantics;
return to this difference later.
scmantics "covered", can pragmatics be far behind? Indeed
With syntax
syntax and semantics
With
not, but here the situation
situation is very
vcry different.
diffcrcnt. Certainly
Ccrtainly the source of
no~
of the concepts
A1 researchers
rcscarchcrs is to be found in the works of philosophers (especially
deployed by AI
deployed
Strawson. Grice
Gricc and Searle).
Searlc). I1 think it can plausibly be maintained,
Austin and Strawson,
those concepts would remain,
remain ,"blind"
however, that those
however,
"blind" without having to
to
accommodate
the
"intuitions"
that
comc
from
attempts
to
design
computational
accommodate
come
artifactscapable
capablc of becoming
bccorning language-users.
languagc-users. (My apologies
artifacts
apologies to
to lmmanucl
Immanuel Kant)
wantcd to
to build a thing which could reasonably
Imagine, for
for instance,
instancc, that one wanted
Imagine,
be said
said to
to understand what you were talking aboutabout - at least when you were talking
about some
some previously
previously delimited domain; and to
about
to act accordingly, again within the
Israel
lsracl
xviii
xviii
leix
1. Notc that here the insurpasablc bamcr i<not the sentence,bul Lhc boundaries of the black box.
1. Note that
heresornc
the insurpassable
i~ not thesubject.
sentence,
the boundaries
of "Questions
the blaek box.
2. For
rclicf on thisbarrier
o n lorlurcd
seebutNoam
Chomsky's
or Form and
2. For some
relief on (in
this[Chomsky
oil tOrlured
subject.. see Noam OlOmsly's "Queslions of Form and
Interprclation"
19771).
Interpretation" (in [Chomsky 1977).
Israel
Israel
xx
xx
xxi
(a) All (both) programs for fonnulatillg fonnal scmantic accounts for
(a) 1\.11 (both) programs for fonnulating fonnal semantic accounts for
significant fragmcnts of natural languagcs (fragmcnts containing "intcrcsting"
significant fragments of natural languages (fragments containing "interesting"
scmantic constructions) cxploit formal languagcs quite diffcrcnt Lllan thc language
semantic constructions) exploit fonnallanguages quite different than the language
of first-ordcr logic. We havc in mind Montaguc-stylc scmantics and thc Situation
of first-order logic. We have in mind Montague-style semantics and the Situation
Scmantics of Barwisc and Pcrry [1982].
Semantics ofDarwise and Perry (1982].
(b) In other words: to our knowlcdgc, no significant fragment of any natural
(b) In other words: to our knowledge, no significant fragment of any natural
language has cvcr bccn scmantically analyzcd by way o f a (systematic) translation
language has ever been semantically analyzed by way of a (systematic) translation
into a shndard first-ordcr languagc. Indccd, again lo our knowlcdge, no one has
into a standard first-order language. Indeed, again to our knowledge, no one has
ever even seriously attcmptcd it.
ever even seriously attempted it.
(c) I h c foregoing facts don't sccm to bc accidcntq of history and thcre do seem
(c) The foregoing facts don't seem to be accident~ of history and there do seem
to bc good methodological rcasons for thc history. 'fhc main considcration is the
to be good methodological reasons for the history. The nwin consideration is the
ad hoc and unsystematic charactcr of attcmpts at scmantic analysis of articular
ad hoc and unsystematic character of attempt~ at semantic analysis of particular
scntcnccs of English by way of paraphrasc into shndard first-order languagcs.
sentences of English by way of paraphrase into slJ.ndard first-order languages.
Crucial herc is thc sourcc of h i s ad hm: charactcr. Any account of the scmantics
Crucial here is the sourcc of this ad hoc character. l\.ny account of the semantics
of natural languagcs which cxploits a formal languagc has a choicc about whcrc to
of natural languages which exploits a fonnallanguage has a choice about where to
be risk-taking. (Such choices go with thc territory of dccp and gcncral unsolved
be risk-taking. (Such choices go with the territory of deep and general unsolved
problems.) It can hc novcl and daring in thc specification of thc formal language
problems.) It can be novel and daring in the specification of the formal language
and its semantics. 'lhis is Lhc route takcn by Montaguc and Barwisc-Perry,
and its semantics. 'Ibis is the route taken by Montague and Barwisc-Perry.
Israel
Israel
xxii
xxii
xxiii
(psychologically?) impossibtc to translaic directly and syslcrnatically from a
(psychologically?) impossible to translate directly and systematically from a
nati~rallanguagc into thc languagc of first-order logic.
nalurallanguage into the language of first-order logic.
So much by way of argument; nccdlcss to say, lnorc nccds to be said. In the
So much by way of argument; needless to say, more needs to be said. 1n the
mcantimc. it's a plcasurc to wclcomc thc rcndcr to his volume, to taslc sorne of
meantime. it's a pleasure to welcome the reader to this volume, to taste some of
thc first fruits of a happy blending of thc scicnccs of language.
the first fruits of a happy blending afthe sciences of language.
CHAPTER 1
Introduction:
Introduction: Computational Aspects of Discourse
Robert C. Berwick
Talk,
'Talk, as everyone
cvcryo~lcknows, is cheap.
clicap. In stark contrast,
contrast. an scientific
scicnlific account of talk is
cxpcnsivc ..
-- we
wc have no good theory,
thcory, let
ict alone
alr~nca computational
computation;~lone, of this
expensive
behavior. This
mismatch bctwccn
between superficial
perhaps most facile
pcrhaps
facilc human bchavior.
l'his blatant n~islnatch
supcrficial
thcorctical mechanical intractability agrees
agrccs with the general
human case and theoretical
experience
twenty years of rcscarch
research in anilicial
artificial intelligence.
intclligcncc. Whereas
cxpcriencc of the past twcnty
"cxpcrl" problcm
duplicatcd -- thcrc
"experl"
problem solving bchavior
bebavior can bc
be roulincly
routinely duplicated
there are
progratns
that do
d o as wcll
inlcrprcting mass spectrograms
well or bcttcr
bettcr than pcoplc
people in inlerpreting
programs d,at
or calculus .--- the
mundane of human abilities rcmain
remain beyond the
die most munda~lc
UIC reach of
current computer
computcr programs. The
l h c following five
fivc chapters
chaprcrs descrihe
dcscrihc initial
computational forays
the difficult
territory of thc
the cognitivcly effortless
forays into thc
diff~culttcrritory
cffortlcss ~.
discourse.
Thc study of discourse bcgins
tc~lkis for.
The
begins with what wlk
for. In this crucial rcspcct.
rcspect, all
five
researchers share
largely unargued.
fivc rcscarchers
sharc a largcly
unargt~cd,but commonly accepted
acccptcd viewpoint: that
communicative behavior.! Packed
linguistic bchavior
behavior is basically abollt
abnttt c~mmunicativc
Packcd into these
fact is thc
gluc dJat
illat binds logcthcr
fivc rcscarch
cffons. The
two words, in fact,
the glue
together all five
research efforlS.
word "communication"
the I.lntin
.'.ttill cuiieitu~lilar,
communitas, to
reflecting
"communication" is derived
dcrivcd from thc
t oslwrc,
m , rcflccting
sulnc vague yet
an intuition that the touchstanc
touchstone for co~nmunication
communication involves, in some
-- bc
knowledge, or feelings.
feelings.
compelling sense. a notion of sharine
sharing -be it thoughts, knowledge.
Ijut what is even
cvcn morc
more remarkable about discourse is how this sharing comes
But
about. The
mere &
external forms
ronns -_. their uucranccs
utterances -Thc participants exchange mcrc
-- and
arrivc at an internal
yet manage to arrive
internal correspondence, presumably one of partially
shared
wi states.
sharcd men
mcntal
hcarcr is to somehow
view then, the job of a hearer
On the communicative vicw
the speaker's
spcakcr's internal
intcrnal life -- namely, those the speaker
reconstruct a portion of d,e
wishes to convey (call this the speaker's
message. or (;ommunicative
spcakcr's message,
sommunicative intent,
intent or
--
1.
without its
its own
problems; it
a~sumcs bar
that "communication"
whatever
I. This
This view
vicw is not without
own problems:
il (tacitly)
(lacilly)ssumes
"communication" - whatever
that is - mun
must have been functionally
This
may
funclionally supreme
supreme in the cvolutionary
cvolulionary "design"
"design" of
or language. n
is may
very wcll
well be true,
true, bul
but one must
must give some argument
to Olis
this effect;
not apodictic.
argumcnt lo
erect: it11is not
apadiclic.
28
Berwick
wh3teicr) -- and the job of tlic spcakcr is to somchow fncilitatc this rcconstructinn.
whatever) -- and the job of the speaker is to somehow facilirate this reconstruction.
is for communication, all
'Ihus, simply by adopting thc usual ~ i c wthat Ii~ngt~agc
'lllUS, simply by adopting the usunl view that language is for communication, all
usc:
fivc rcscnrchcrs have cmt)mced thc following embryonic model of l;~~iguage
five n:searchers have embraced the following embryunic model of language usc:
SPEAKER
intcrior form 1
interior form
spcakr wants (intcnbs) to am)
(what speater wants (intends) to sh~)
uaerance
(cxtcrior form of mcssagc)
(exterior fOffil of message)
intctior fomr
. interior form
of speaker's in tended
HEARER
HEARER
29
nIles
Shared knowlcdgc
knowledge niakcs
makes
nllcs to
to infer
infcr inner
inncr form
form from
from exterior
cxtcrior utterance. Shi~rcd
further
Ule mundane.
further sharing
sharing possible, trDnsforming
transfor~ningUle
dlc magical
1~1agic;ll
into Uic
From this
vantage
point,
Ule
study
of
discourse
reduces
tl,e study of
of these
thc
discoursc rcduccs to the
this vant;lgc
regularities.
he
is the chicf
chief contribution of
of the
rcgularitics. It should
shr~l~ld
be no surprise,
surprisc. then,
thcn, tllat
that this isthc
research
of thc
the systematic
Ulis book.:
book: to prmiidc
probidc a partial account of
rcscarch presented
prcscntcd in this
conventions
ctllic of
of thc
the artilicial
artificial intclligcncc,
intelligence.
conventions of discourse.
discoursc. According
Accurding to the
thc work ctl~ic
the
discourse rcgularitics
regularities they
tllCy havc
have
Uic authors
authors have
havc for
fur the
thc most part exploited
cxploitcd the
tllc discoursc
uncovered
uncovcrcd by embedding
cnibcdding thelll
thcm into working computer
computcr programs that actually aim
to
participants. A snapshot of
of ~which
Uic behavior
bch;lvior of human discourse
disco~~rsc
,l~ich
to mirror the
regularities
upnn looks solncthing
something likc
like the
rcgularitics each
c:sh c\mlrHmlor
cantributor has decided
dccidcd to focus
ftcus upon
following,
communicative intcnt
intent (as
k~llowing,where
uhcrc the
Uic reconstruction
rccunstruction of the
thc speaker's
spcakcr's co~n~nunicativc
represented
been brokcn
bmkcn down as follows:
rcprcscntcd internally
internally by tlle
the hearer)
hcarcr) has now bccn
SPEAKER
lca
communicative intent planning ~rules.
rulcsof inference
inferetlCe
rulerof
interior fann
McDonald
(what
speaker wants (intends)
(intends) ttoo share)
shale)
(whatspxkcr
rulcs
fonn
mlcs of logical form
paning mles.
parsing
rules,
I
'----:-t-.....
I
I1
parsing
rules,
parain8 N
~ S
N~U
of kl&d form
rules
oflogical
fonn
Webber,
Webber,
Sidner
Sidner
Kaplan,
Kaplan,
Allen
Allen
l i W meaning
wurlag
literal
rulcs of inference
rules
planning rules
speatds intended
in&
mmtng
speaker's
meanIDg
(-tion
of
speaker's m
)(recoDIlruction
ofspeater's
mesuae)
I
HEARER
Berwick
30
own
own rolc
role ta
to play
play in
in aa full
full account
account of
of discourse
discourse behavior.
behavior. I.ct
Let us
us rcvicw
review thcse
these
contributions, one by onc.
one.
contrihutio~~s,
of all, it is worlhwliilc
worthwhile to point
point out that Uic
tl,e discounc
discourse flowchart skctched
sKetched
First of
abovc
above splirs
SpliLS into two distinct parts. Onc
Olle is UIC
the tr;~nslation
translation from surfacc
surface form
fonn to
fonn: thc
the input to aa computcr
computer progrttm
program is just
just son~c
sOllie string of
of tokcns,
tokens, e.g.,
internal form:
"Discourse is likc
like magic",
magic", and our first task is u1
to translate, or associate,
associ;Hc. thcse
these
"1)iscoursc
those of
of ;lo
all intcmnl
internal Iango;~gc.
language, typically tokcns
tokens of
of somc
some ling~~islic
linguistic type.
tokcns with diosc
of thc
the input scrvcs
serves as tlic
the 1:1\1nching
launching pad for
'Il,C internally-formaltcd
internallY'fonnalted counterpart of
'llic
A
fi~rthcr
further processing.
prncessing. What should this
tllis internal language
languagc look likc?
like?
/I trivial
simply add lexical catcgory
category labcls
labels to cacli
each of thc
the tokens of
translntion would bc
be to 5in1ply
translation
the input string: typically, huwcvcr,
however, aa morc
more sophisticated translation is assumed.
assumed,
Uic
labeled br;tckcting
bracketing rcprcscnting
representing aa parsc
parse of
of thc
the input string.
string. Figuring out
such as aa labclcd
Olis itllcrnal
intcrnal language should bc
be likc
like is Wcbbcr
Webber and Sidncr's
Sidner's domain of
of
what Uiis
research. As
/Is wc
we shall see,
sec, Sidncr
Sidner and Wehbcr
Webber adopt so~ncthing
sometlling in the spirit of
of
rcscarcli.
~nodcrn
modern gcncrative
generative grammar, ifif not its cxilcf
CX<Jct detail.
detaiL Whatcvcr
Whatever thc
the process, it is
crllcially a formal
fonnal one.
onc. l'hc
The input is aa scqucncc
sequence of tokcns,
tokens, and the output is a
crucially
string of
of symbols
symbols in somc
some internal formal
formal language. Crucially,
Crucially, the relation
between surface
surface and intcrnal
internal languagc
language is not onc
one of idcntity
identity -- somc
some real work is
bctwccn
d(~ne in tlic
the conversion to an inlcrnal
intcrnal1anguage
the ctrmputation
computation that rnust
must
dqnc
languagc and so thc
likewise not ncccssarily
necessarily trivial. (We shall scc
see prcciscly
precisely how nun-trivial
non-trivial
be done is likcwise
below.)
we arc
are on familiar,
familiar, if not altogcthcr
altogether steady.
steady, ground. After
lifter all.
all,
So far wc
philosophers
pliilosophcrs since
since Frege
Fregc have claimed
claimcd that sentences
sentences of naturallanguages
natural 1;tnguagcs have
"underlying" representations
rcprcscnfations that are non-isomorphic to their surface fOnTIs.
forms. And
they have used distinctions in Ihe
ihc form of these
thcsc representations
reprcscntations to account for
seman
tical distinctions.
Allcn, though,
though, want to go this
this project
projcct one
scrnantical
distinctions. Kaplan and /llIen,
better.
bcltcr. If differences
differcnccs in underlying fonn
form can be
bc used
uscd to capture scmantical
scmantical
differences
-the
Frege-Russell
program
-then
why
not
expand
the
notion of
Frcgc-l<usscll
cxpand
diffcrcnccs
"undcrlying form"? Kaplan and /llIen
Allcn do just tha~
that, and attempt to recover
recover a
"underlying
partial representation
rcprcscntation of the
thc speaker's
spcakcr's beliefs
bclicfs and intentions.
intentions. Now, aa system
system that
knows
knows when
whcn and how
how to
to fix
fix beliefs
bclicfs isis aa different
diffcrcnt order of beast O,an
than one
one that
knows simply
simply how to
to map
map tokens
tokcns to
to linguistic
linguistic types.
typcs. Indeed,
Indccd, talk about fixing
fixing
knows
beliefs
bclicfs isis part and parcel
parcel of what
what it means
means to
to talk about
about higher
higher cognitive processes
processes
generally.
the province
province of the
thc language
language processor,
proccssor, at least
least not
gcncrally. It isis not necessarily
ncccssarily ille
in the
thc conception
conccpfion of the
the language
language processor
proccssor as
as ordinarily entertained
entenaincd in modern
linguistic
course, to
to ti,e
the extent
cxtcnt that the
thc mapping of tokens to
to internal
internal
linguistic theory.
theory. Of course,
ncr's theories
fonn
thc groundwork
groundwork for
for O,e
the fixation
fixation of
of belief,
bclicf, Webber
Webber and
and Sid
Sidner's
theories
fonn lays
lays the
provide the
thc cornerstone
cornerstone for
for Kaplan
Kaplan and
and /llIen's
Allcn's systems
systems as
as well.
well. (Entirely
(Entirely
provide
analogous,
sccm, to
to the
the way
way that
that early
early visual
visual processing
processing provides the
the
analogous, itit would
would seem,
chair.)
representational
rcprcscntational foundation
foundation for
for calling
calling aa chair
chair aachair.)
31
Returning now to our Cook's tour, Wcbbcr iind Sidncr's aim is to dcvclop thc
Returning now to our Cook's tour, Webber and Sidner's aim is to develop the
right internal format in which to formally rcnrcscllt uttcranccs, and how (in
right internal format in which to formally represent utlerances, and how (in
Sidncr's casc) to comoutc with thal rcprcscntation in ordcr to kccp track of thc
Sidner's case) to compute with thal representation in order [0 keep track of the
things talkcd about in a discoursc. Thc purpose of ihis forn~atis to dcfinc the
things talked about in a discourse. The purpose of this fom1at is to define the
doniains of individuals or cntitics U~at arc to bc madc availa1)lc Tor further
domains of individuals or entities that arc to be made available for further
computational ni;~nipulationhy thc hci~rcr. For instilncc, whcn 1 say, "1 saw a
computational manipulation hy the hearer. For instance, when I say, "I saw a
Pcrsiaii cat," [hcn Wcbbcr's systcln is to nlakc it clcar thal I am talking about a
Persian cat," [hen Webber's system is to make it clear thal I am talking about a
ccrtain domain of individuals, nanlcly, Pcrsian cats. Wcbbcr's systcm pairs
certain domain of individuals, namely, Persian cats. Webber's system pairs
cxprcssions in a fonnal languagc -- thc Ii~nguagcinto which strings likc "I saw a
expressions in a fonnal language .. the language inlO which strings like "I saw a
Pcrsian car"
gct mappcd -- with an intcmal (presumably mentalistic)
Persian cat"
get mapped -- with an internal (presumably mentalistic)
rcprcscntation of sets of individuals
representation of seLS of individuals
For thc most part Wcbbcr is conccrncd only with a scmanlics for Noun
For the most part Webber is concerned only with a semantics for Noun
Phrascs. For cxamplc. Wcbbcr claiins that thc string (and Noun Plirasc) IJlrce
Phrases. For example, Webber claims that the string (and Noun Phrase) three
bovs [hat atc a
"crcntcs" two possiblc discoursc c~ititicsrcflccting h e
boys that ate i! pizza "crea[es" two possible discourse entities renecting the
individuativc and collcctivc rcrtdings of tllis scntcncc. In onc rcading, a sct of boys
individualive and collective readings of this sentence. In one reading, a set of boys
togcther cat a singlc p i r ~ a ;in thc othcr, thrcc boys individually cach cat a pizza.
together cat a single piua; in the other, three boys individually each cat a pizza.
Crucially, Wcbbcr ciipturcs die diffcrcncc in meaning bctwccn thcsc two rcadings
Crucially, Webber captures the difference in meaning between these two readings
via a diffcrcncc in thc fr,rm ofpossiblc internal rcprcscntations. A syntactic dcvice
via a difference in the fOlm of possible internal representations. 1\ syntaclic device
-- the scoping of a SKI' operator -- is dcploycd to mirror [hc diffcrcnce in
.- the scoping of a SET operator -- is deployed to mirror the difference in
mcaning:
meaning:
(the collective
there exists
one set of boys that
serves asreading:
thc argumcnt
to "ate")
serves as the argument to "ate")
Berwick
32
33
undcr thc prcdicatc i~nplicdby thc objcct; for Kaplan, tllc cxprcssloll "three boys
under the predicate implied by the object; for Kaplan. the expreSSHlJI "three boys
that atc a pizza" (or rather its Ingicnl tr;~nslation)mcans a list of namcs of boys
that ate a pizza" (or rather its logical translation) means a list of names of boys
who cilhcr togcthcr o r scparatcly ate a pi?:/a. In contrast, Allcn's modcl ccntcrs on
who either together or separately ate a pizza. In contrast, Allen's model centers on
thc causal connection bctwccn humiun actions and dlc world. It involvcs in an
the causal connection between human actions and the world. It involves in an
csscntial way wlii~tthe spcakcr or hcarcr w:ints or inlcnds. On diffcrcnt occasions,
essential way what the speaker or hearer wants or intends. On different occasions,
dcpcnding on thc bclicfs, dcsircs, and party allcgiancc o f thc agcnts involved, one
depending on the bclicf.s, desires. and party allegiance of the agents involved, one
and thc samc sct o f formal objccts (hence objects with the simc propcrtics, hcnce
and the same set of formal objects (hence objects with the S<lme properties. hence
thc sarlic cxtcnsions) could llavc dirfcrcnt intcnsions. Kaplan is awarc of this
the same eXlensions) could have different intensions. Kaplan is aware of this
distinction; as hc obscrvcs. thc "propcr"answer to thc ~ucstion,
distinction; as he observes. the "propcr"answer to the ~uestion.
Which employees profit share?
Which employees profit share?
could, on onc occasion. bc simply a list of cmployccs -- the "cxtcnsional"
could, on one occasion, be simply a list of employees .- the "extensional"
mcaning. Rut dcpcnding on what thc spcakcr or hcarcr wants to & with the
meaning. But depending on what the speaker or hearer wants to do with the
answcr. it might hc sornc dcscription, such as, "thc Vicc I'rcsidents".
answer, it might he some description, such as, "the Vice Presidents".
In part this diffcrcncc in approach is Ulc rcsult of thc diffcrcnccs bctwccn the
In part this difference in approach is the result of the differences between the
objccts cach has chosca to study. Kaplan opts for a ~.clatibclyrich world of objccts
objects each has chosen to study. Kaplan opts for a relatively rich world of objects
and propcrtics, but an impovcrishcd thcory of bclicf and action: by fixing as his
and properties, but an impoverished theory of belief and action: by fixing as his
"rcal world" a relational dalabasc ([)I]) of objccts with ccrtain propcrtics and
"real world" a relational dawbase (DB) of objects with certain properties and
rcducing all human purposcs to onc, nan~cly.finding out thc answer to queries
reducing all human purposes to one, namely, finding out the answer to queries
about thc datahasc. Knplan can assume that cxtcnsions, now si~nplylists of
about the database, Kaplan can assumc that extensions. now simply lists of
databasc objccts, fix mcaning. With this thcory about how forn~alobjccts canncct
database objects, fix meaning. With Ulis theory about how formal objects connect
to the dawhssc world. Ki~plancan now build a systcln that constructs thc proper
to tllC datahase world. Kaplan can now build a system that constructs thc proper
rcsponsc to qucstions that arc poindcss to answcr whcn ccrtain logical conditions
responsc to questions that are pointless to answer when certain logical conditions
obtain- (such as Lhc qucstion " H o w many I3la1dy Marys did John drink at the
obtain' (such as the question "How many Bloody Marys did John drink at the
party?" if no liquor was served at the party). Kaplan's program is ablc to dcduce
party?," ifno liquor was served at the party). Kaplan's program is able to deduce
whcn such qucstions arc pointless to answcr bccausc in his simplc cxtcnsional
when such questions arc pointless to answer because in his simple extensional
world mcaninglcss questions corrcspond to occurrcnccs of clnpty sets in the
world meaningless questions correspond to occurrences of empty sets in the
middlc of a scarch through a databasc to locarc thc objccts that mcct the
middle of a search through a database to locate the objccts that mcet the
dcscription dcmandcd by the qucstion. Kaplan's system can detcct these gaps,
description demandcd by the question. Kaplan's systcm can detect these gaps,
and then cxploit a close corrcspondcncc bctwccn the surface syntactic form of the
and then exploit a close correspondence between the surface syntactic form of the
qucstion and the database search it dcmands to gcncratc an appropriate corrcctive
question and the database ~earch it demands to generate an appropriate corrective
or suggcstivc rcsponsc (such as, Nonc, bccausc therc was no liauor scrvcd.).
or suggestive response (such as, None. because there was!J..Q liquor served,).
I n contrast, Allcn assumcs an impovcrishcd univcrsc of propcrtics of objccrs.
In contrast, Allen assumes an impoverished universe of properties of objects.
but a rich thcory of human action. Allcn's basic idca is to conjoin a thcory of
but a rich theory of human action. Allen's basic idea is to conjoin a theory of
linguistic bchavior -- thc thcury o f spccch acts as devclopcd by Austin and Searle
linguistic behavior -- the theory of speech act~ as developed by Austin and Searle
-- with a largcly Al-bascd approach -- a thcory of planning complex tasks. Speech
-- with a largely AI-based approach -- a theory of planning complex tasks. Spcech
act thcory providcs a framework in which linguistic activity is "just anothcr" kind
act theory provides a framework in which linguistic activity is "just another" kind
of human action. But as a kind of action, a spccch act must be planned, and it is
of human action. But as a kind of action, a speech act must be planned, and it is
thcn susccptible to explanation via a theory of planning. What Allcn docs is to
then susceptible to explanation via a theory of planning. What Allen docs is to
Berwick
34
--
35
quite broad
broad inin scope.
scope. The
'l'hc regularities
rcgul;~riIiesthat
tliat have
havc been
hccn probed
prohcd range
rangc from
from patterns
patterns
quite
in the
tlic usc
usc of
of "dbbrc\'iatory"
"dhbrcviatory" Noun
Noun Phrase
I'hrssc expressions
cxprcssions and
and pronouns
pronouns (Sidncr,
(Sidncr,
in
Wcbbcr), to
to helpful
hcll)ful responses
rcsponscs to
to questions
questions (Allen,
(Allen, Kaplan),
Kaplon), to
to the
thc production
production of
of
Webber),
utterances,
uttcranccs, given
givcn an
an intended
intcndcd communicative
co~nmunicalivc"message"
"mcssagc" (McDonald).1
( ~ c l ) o n a l d ) . l 'n,e
llie
constraints exploited
cxploitcd by
by the
thc contributors
contributors also
also run
run d,e
thc gamut
gamut of
of linguistic
linguistic and
and
constraints
rcgul;~rilics. They
non-linguistic regularities.
include dlC
dic purely
purcly syntactic
synt:tctic (such
(such as
as the
the
'l'hcy include
non'linguistie
knowlcdgc that
that inin d,e
Ulc sentence
scntcncc Bill
llill wants
w:~nt?him
l1i1tiill
111 leave,
lcavc. Bill
I3il1 and
and him cannot
cannot be
be
knowledge
co-rcfcrcntial): semamic
scrnatltic (such
(such as
:IS d,e
thc knowledge
knowlcdgc that
that the
thc verb
vcrb giy
& h a has
s aa kind
kind of
of
eo-referemial);
"argumcnt stnlcturc"
stn~cturc"that
that requires
rcquircs aa giver,
givcr, aa givec.
givcc, and
and object
objcct to
to be
b e given);
givcn):
"argument
pragtnatic (knowledge
(knonlcdgc that
tliat has
h;~sto
to do
d o with
with the
thc context
contcxt of
of the
thc utterance,
uttcrancc. such
such as
as the
the
pragmatic
knowlcdgc
that
whcn
I
ask
thc
qucstion,
"Wlicn
d
m
s
the
train
to
New
York
knowledge that when I ask the question, "When docs the train to New York
prcsumc the
tlic existence
cxistcncc of
of aa train
train to
to New
New York,
York, because
bccausc otherwise
othcrwisc II would
would
lcavc?" II presume
leave?"
alrcady know
know d,e
dic answer
ans\\,cr to
to my
my question);
qucstion): and
and inferential
infcrcntial (such
(such as
as dlC
Ute simple
simple
already
inrcrcncc that
that ifif II kll(lw
kl~owth~lt
di:~tA
A wants
wants some
somc action
ilction K
X to111occur,
occur, then
then 1\A wants
wan& the
the
inference
preconditions of
of that
that act
act to
to obtJin.
obtain. or
o r more
tnorc complex
ct~mplcxinferences
itifcrcnccs grounded
groundcd upon
upon
preconditions
morc highly
Iiiphly structured
structurcd bundles
bundles of
of information,
informolion, like
like aa causally
causally structured
structured ;lelian
action
more
scqtlcncc plan for
for boarding aa train),
train).
sequence
clcar by now
now that this
d ~ i svicw
vicw of
of discourse
d i r o u n c as
as communicative
communicative
should be
bc clear
ItIt should
bchavior calls
calls for
for aa theory
thcory compounded
compounded of
o f two
two parts: d,e
thc first
first aa dlCory
Uicory of
o f what
what isis
behavior
sharcd ..-- messages
lncssagcs or
o r menwi
mcntal stuff;
stoff; the
thc second
sccond aa theory
thcory of
o f how
& that stuff
stuff comes
comes to
to
shared
sharcd. A study
study of
of whal
nhar the
thc regularities
rcgularitics of
of discourse
d i r o u r s c are
arc is,
is, of
o f course,
coursc, part of
of
bc shared.
be
thc domoin
domain of
of many
many other
other fields"
ficlds -- psychology,
psychology, sociology,
sociology. among
among others.
others. What
the
distingt~islicsthe contributions oFthis
of this volume
volumc From
from these
thcsc more traditional ways
ways of
dislinguishes
vicwing d,e
thc subject
subjcct (since
(sincc 011
all these fields
ficlds also,
also, by and large,
largc. adopt something
something like a
viewing
bchavior view
vicw of discourse). isis that they
thcy study the problem of
communicative behavior
diroursc from
from the stmdpoint
sondpoint of comptlt.1tion.
comwt~l?tion. Part of this commitment
commiment to
discourse
co~nputationlies
lics of
o f course in the very fact
fact that the authors all
all
explanation via computation
from
actually carry out the mapping from
construct working computer programs that actually
uttcrancc to
to internal
intcrnal fortn
form or
o r its
its reverse. More broadly, dlOugh,
though, the contributors
utterance
1. All the
!he contributors
contributors do
doadd
impohlnt qualification
qualificationto
lo the "naturalness"
"naturalness" of the
Lhe input that
Ulat isir; the
Ule
1.
add one important
a%~umcdstarting
slaning point
poinl for
far analysis:
analysis: the
!hc actual
actual input to
l o the
Ulc programs
programs is in orthographic
onhagraphie form.
form,and
and thus
lhus
a<;sumed
omis such
smch possibly important
imponant characteristics
characlcrirlia of
of the
thc speech
spccch stream
aream as
as StrN,
Srm, intonational
intonational contour,
mntour.
omits
pauses, and the like,
like. as
as well
wcll as
as those
those obviously "non-verbal"
"non-verbal" concomitanL~
concomibnu of communication
communication like
pauses,
geslurcs. This
Ihis is the
!he usual qualification
qualification made in computational
compula1ional research of
ol this
!his kind, and itif deserves
dnerves
gestures.
abslraclingaway
away from
from these
these well
well known
known properties
propcniesof
olcommunicative
behavior, itit
rome comment
commcnL In abstracting
some
communicative behavior.
imDoMnl to
lo keep
kcen in mind
mind whether
ahcthcr potential
~olmtialgcneralil.3tions
rcneraliwlions will be lost
losL Some
Some of
o fthese
h w features
fcalures of
isis important
speech _.--such
as fundamental
findamcnlal frequency
lrcqucn;y contour
contour(Ule
"pilch level"
lcvel" of
o l the
h c voice)
voice) and contrastive
conwive
speech
such as
(the basic "pilch
strm -- could
a u l d be
be accommodated into
inlo the
thc research
mcarch presented
presented in this
Ulis book.
book. In panicular.
panicular, itit will be
be
slrcss
how contrastive
contrastive stress
stress might
mighl be
be factored
bctorcd into
into the
the approach
approach that
h t Sidner
Sidncr takes
lake; 10
lo the
Ule resolution
resolution of
shown how
delinile anaphors.
anaphorr.
definite
Berwick
36
to this book arc conccrncd with & it is Ular what thc spcakcr has in mind comcs
to this book arc concerned with how it is that what the speaker has in mind comes
to hc rcconstructcd by thc hcarcr, in short, with t l ~ csrt~dyof (mcntal) prtrcsscs.
to he reconstructed by the hearer, in shan, with the study of (mental) processes.
Implicit in this idcntificiition of computation with rncntal p r t ~ c s sis a key
Implicit in this identification of compUli.ltion with mental process is a key
assumption of A1 rcscarch with rcspcct to thc study of discoursc bcl~avior,an
assumption of AI research with respect to the study of discourse behavior, an
assumplion that thc livc autllors all ti~kcfor grantcd, and onc that in fact providcs
assumption that the five authors all take for granted, and one that in fact provides
thc conlcrstonc for tlicir rcscarch and almost all cogniiivcly-oricntcd Al work
the cornerstone for their research and almost all cognitivcly-oriellted AI work
con,t>utational, whcrc
gcncrally. 'I'his is thc assumption that mcnt:~lprwcsscs
generally. This is the assumption that mental processes are computational, where
by co~npolation;~l
onc may takc Ibdor's dcliniticrn of co~npu~rticln:
o~crations
by computational one may take Fodor's definition of compu!<ltion: operations
dcfincd o \ c r (mcnti~l)rc~~rcscntations..
[Fodor 19751 Co~nputationalproccsscs are
defined-o\cr (ment,il) representations.. [Fodor 1975) Computational processes are
to bc distinguished fi~rthcrin that tlicir intcractions with rcprcscntations arc
to be distinguished further in that their interactions with representations arc
purcly firrrnal. '1'h;lt is, it is thc strrlclurc of a rcprcscntatiou tll;~tdictatcs the
purely funnal. That is, it is the stnlcllire of a representation that dictates the
coursc of a computation, and not its contents; a rcprcscnlation must wear its
course of a computation, and nut its contents; a representation must wear its
computational propcrtics 'on its slceve.'
computational properties 'on its sleeve.'
A onc-line surnmilry of what thc co~~tl-ibutors
to this volumc arc after cnds up
A one-line summary of what the contributors to this volume arc after ends up
sounding very mucli likc the goal of rnodcrn gcnerativc gcirnmar: rules and
sounding very much like the goal of modern generative grammar: rules and
rcprcscntations. Howcvcr, thcrc is a crucial diffcrcncc. As rncntioncd, since the
representations. However, there is a crucial difference. As mentioned, since lhe
con~ributorsto his vdlumc subscribe to tlic A1 work cthic, thcy aim to do more
contributors to this volume subscribe to the AI work ethic, they aim to do more
than si~nplycharactcrizc thc mapping bctwccil rcprcscntational lcvcls as some sort
than simply characterize lhe mapping between representational levels as some sort
of fi~nction,they aim to charactcrizc it as a c o m ~ u t l h l cfi~nctioq,prcfcrably, an
o~ function, they aim to characterize it as a computable function, preferably, an
cfficicntly computable function. 'lhis addcd cornputaticmal dcrnand is the real
efficiently computable function. This addcd computational demand is lhe real
spicc of A l rcscarch. and is what distiuguishcs tlic rcscarch described in this
spice of AI research, and is what distinguishes the research described in this
volumc from more standard approaches.
volume from more standard approaches.
From a pl~ilosophical point of vicw, the incorporation of a computational
From a philosophical point of view, the incorporation of a computational
vicwpoint into thc study of discoursc appcars to havc had two substantial bcncfits.
viewpoint into the study of discourse appears to have had two substantial benefits.
1-irst. ibr whatc\.cr reasons, thinking hard about computational issucs has Icd some
First. for whate\'er reasons, thinking hard about computational issues has led some
of the A1 rcscarch in this volurnc to actually anticipate certain dcvclop~ncntsin
of the 1\1 research in this volume to actually anticipate certain developments in
linguistic thcory and philosophy. Sccond, thc co~nputi~tional
vicw as cspoused in
linguistic theory and philosophy. Second, the comput.ltional view as espoused in
this volume leads dircctly to a morc nodular explanation of linguistic bchavior.
this volume leads directly to a more modular explanation of linguistic behavior.
lly "modular" I rncan that part of obscrvcd surfacc bchavior -- in this case,
By "modular" I mean that part of observed surface behavior -- in this case,
discoursc bchavior -- is to be accouiltcd for by thc intcraction of two componcnts.
discourse behavior -- is to be accounted for by the interaction of two components,
one bascd on the pure form of internal, mcntal rcprcscntations, and thc other
one based on the pure form of internal, mental representations, and thc other
bascd on cornpittations dcfincd ovcr thosc rcprcscntations. Of coursc, as any
based on computations defined over those representations. Of course, as any
compi~tcrscientist would point out. rcprcscntationnl and computational dcmands
computer scientist would point out. representational and computational demands,
data stnrcturcs and i~lgorithmiccornplcxity, arc intirnatcly rclatcd. 'Ihc prcsurned
data structures and algorithmic complexity, arc intimately related. The presumed
advantage o f a modular account is that it allows onc to factor out and state
advantage of a modular account is that it allows one to factor out and state
constraints for cach module scparatcly, with h e intcraction bctwcen ComponenD
constraints for each module separately, with lhe interaction between components
giving risc t o thc superficially cornplcx surface bchavior. In short, at lcast for the
giving rise tf} the superficially complex surface behavior. In short, at least for the
studics prcscntcd in this volume, thc introduction o f thc additional dcrnand of
studies presented in this volume. the introduction of the additional demand of
computability providcs an additional sourcc of cxplnnatory constraint that can be
computability provides an additional source of explanatory constraint that can be
mined.
mined.
37
Speaker
2: Yes.
tlley Act.
were in disagreement over extending
thc Voting
I<ights
the Voting Rights Act.
m:
(2a) I gavc llill fivc dollars...It was morc than I gavc Dill
(2a) I gave
fiveI3ill
dollars
1t was...
more
Ulan
I gave
(2b) Bill
1 gavc
fivc dollars
Onc of
them
wasDiU
torn.
(2b) I gave Bill five dollars One of them was torn.
Ilicre arc actually two scparatc puulcs to solvc hcrc. Onc is simply to figure
'J11ere arc actually two separate puzzles to solve here. One is simply to figure
out that fivc dollars can havc two diffcrcnl "mcanings", cithcr as a collcctivc entity
out that five dollars can have two different "meanings", eitller as a collective entity
or as a set of singlctons. Ihc othcr is to dctcr~ninch a t it can bc linked to the
or as a set of singletons. The other is to determine that i! can be linked to the
collcctivc cnlity, and them to thc singlctons, and not vicc-vcrsa. 'I'he solution to
collective entity, and them to the singletons. and not vice-versa. 'Inc solution to
thc first problcm is rcprescntational in character. ' n c aim is to dcvclop a
the first problem is representational in character. The aim is to develop a
languagc that can cxpress the spacc of possihlc logical rcprcscntations for the
language that can express tile space of possible logical representations for the
discoursc cntitics implicit in a givcn litcral tcxt string. In Ihc five dollar$ example,
discourse entities implicit in a given literal text string. In the five dollars example,
thcrc would bc two possiblc logical translations of thc tcxt: a collective rcading
tIlere would be two possible logical translations of tile text: a collective reading
and an individuativc rcading, corresponding to fivc dollars considcrcd as a single
and an individuative reading, corresponding to five dollars considered as a single
38
Berwick
cntity and f i x dollars considcrcd as a collcc~ionof fivc singlc bills. Onc must also
entity and fhe dollars considered as a collection of five single bills. One must also
supply a set of & h a t say cxactly how litcral strings arc to bc translatcd into this
supply a set of rules that say exactly how literal strings are to be translated into this
language. 'Ihis is the problc~nthat Wcbhcr scts out to solvc, and it is csscntially a
language. 'Illis is the problem that Webber sets out to solve, and it is essentially a
qucstion allout thc propcr rcprcscntation of tllc syntax of logical form.
question about the proper representation of the syntax of logical form.
Thc sccc~ndpu~.,.dcis lnorc compu~ationalin character: with the possiblc array
The second puzzle is more computational in character: with the possible array
o f discoursc cntitics in hand, what arc tllc rulcs that tcll us which onc (or morc) of
of discourse entities in hand. what arc tile rules that tell us which olle (or more) of
thcsc possibilities is ac1u;rlly linkcd lo su1)scqucnt pronouns and Noun Phrascs
these possibilities is actually linked to subsequent pronouns and Noun Phrases
that appear latcr in thc tcxt or discoursc? 'Ihc solurion tc) Ihis problcm is Sidncr's
that iJppear later in the text or discourse? 'Ille solution to tilis problem is Sidner's
work.
work.
Sctting out to solve thcsc problctns is extraordinarily an~bitious, in that
Setting out to solve these problems is extraordinarily ambitious, in that
Wcbbcr and Sidncr aim to do for discou~scwhat rcccnl ling~iisticthcory has
Webber and Sidner aim to do for discourse what recent linguistic theory ha~
accomp~ishcdonly for singlc scntcnccs. For singlc scntcnccs thcrc arc various
accomplished only for single sentences, For single sentences therc arc various
analogues of cxamplcs likc (1) abouc. For cxamplc, thc reciprocal anaphor
analogues of examples like (1) abovc, For cxample, the reciprocal anaphor each
other
can bc uscd as an cxprcssion that dcsignatcs a Noun Phrasc that has
other can be used as an expression that designates a Noun Phrase that has
prcvio~tsly appcarcd:
previously appeared:
(3)'Thc lncn sccm to likc cach o!her. (cncli othcr = the men)
(3) The mcn scem to like each o~her. (each other;; the men)
This kind of relationship is capturcd in scvcral currcnl linguistic thcorics by
This kind of relationship is captured in several current linguistic theories by
tlrc notational dcvice of ro-indexing.' In tllc cxamplc abovc. onc could rcprescnt
tire notational device of en-indexing.] In tile example above, one could represent
co-dcsignation by thc simplc dcvice of subscripling the NP thc mcn and the
co-designation by tile simple device of subscripting the NP tile men and the
rcciprt~aleach othcr with thc samc index:
reciprocal each other with the same index:
(4) [nlc rn~n],,,~sccm to likc [each otherIi.
(4) [rhe men]NP. seem to like [each other]..
I
Onc may disagree about whcthcr co-indexing is thc right sort of notational
One may disagree about whether co-indexing is the right sort of notational
machincry with which to cxprcss the indisputable fact that a tcxt "hangs together."
machinery with which to express the indispulablc fact that a text "hangs together."
n u t whatcvcr machinery one chooscs, what is crucial from a linguistic and a
nut whatever machinery one chooses, what is crucial from a linguistic and a
computational point of view is that "binding rclationships" arc apparcntly subject
computational point of view is that "binding rclationships" are apparently subject
to svnlactic constraints:
to syntactic constraints:
1. Sce, for cxample, thc Govcrnmcnt-Bindinglheory [Chomsky 19811: Lhc lrxical-I:unclional theory
1. Sec, [Kaplan
for example.
the Government-Binding
theory [Chomsky 1981]: the l.cxical-Funclional theory
and Rranan
19811.
(Kaplan and Bresnan 1981].
39
1-or instance, whcthcr cach orhcr can dcsignatc onc and thc srimc entity as ~ h men
c
For instance, whcther each othcr can designate one and the same elltity as ~ ~
dcpcnds in part on thc puccl! configuratinnal aspects of scntcnccs. [;or cxarnplc, a
depends in part Oil the purc1) configurational aspects of sentences. For example. a
lcxical Subjcct of a11cmbcddcd clausc sccms ttr block binding bctwccn cach c~thcr
lexical Subject of an embedded clause seems tu block binding between each other
and a possiblc Noun l'hrasc nntcccdcnt:
and a pussible Noun Phrase antecedent:
(6) 'lhc rncn want Jolln to likc cach other.
(6) *The men want John to like each other.
-- thc so-callcd Spccificd Subjcct constraint
(7) 'l'hc wolnrln ovclithcrc who likcs [cvcry manji in thc room
(7) *Thc wom<ln oyeretherc who likes [eyery m,m]i in the room
thinks [tliat [hcl] is too rich]. (every man =he)
thinks [that [he,] is too rich]. (every man =he)
S
~
NP
,. . . vP-..........
thinks
~
V"
S
iCWOilliiri
I:::-:----....
S thinks
hc 1s too rlch
Swho Iikcs cvcrv man
he is too rich
who likes everv man
Contrary to first imprcssiotls Uicn. it is nor just tllc mcaning of a sclltcncc that
Contrary to first impressions then, it is not just the meaning of a sentence that
dctcnnirlcs what can hang togcthcr ~ i t hwhat ((hough his is ccrtninly yct anothcr
dc!ennines what can hang together \lith whal (though this is certainly yet another
dimcn%ionof thc problcm). Further, whatcvcr the intcrprcratiun of this array of
dimension of the problem). Further, whatever the interpretation of this array of
facts -- and thcrc is substantial disputc about thc fi~l-mulationof principlcs like
facts .. and there is substantial dispute about the fOl1nulalion of principles like
constituent command -- one point is clcar: to thc cxtcnt that one can make
constituent comm<lnd .. one point is clear: lo the extent that one can make
binding conslraints like Ihc Spccificd Subjcct and c-comrndnd prccisc. one can
binding constraints like the Specified Subject and c-command precise, one can
rcplacc vaguc intuitions about "tcxt cohesion" with justifiablc principlcs, a me
replace vague intuitions about "text cohesion" with justifiable principles, a true
thcory of binding2
theory ofbinding. 2
A constrained thcory of binding also offcrs substatltial colnputational bonuses.
A constrained theory of binding also offers substantial computational bonuses.
For instance, as obscrvcd in [Bcrwick and Wexlcr 19821, the c-command
For instance, as observed in [Berwick and Wexler 19&2], the c'command
!. For our purposes hcrc, a phrase a conslirucnt commands anolher phrase P iT he first branching
1. For our
purposes
here. a phrase
a con~ti!Uent
P if the
first branching
category
that dorninaws
a dominala
,8. Scecommands
[Kcinhari another
19761 lorphrase
Ihe original
rormulation.
~tegory 2.
thatComparc,
dominates
domjnatc.~1 lalliday's
fl. See (Reinhart
1976] accou~ir
Cor the original
TorQ examplc,
unconsrraincd
(cilcd byformulation.
Wcbbcr) of how cohcsivenm in
2. Compare,
account substitu~inn
(cited by Webber)
of how cohesiveness
a textforis example,
oblaincdllalliday's
lhrough unconstrained
"rvord substitution":
is "specilicd
through the inuse of a
a tcxt isgrarnrndl~cal
obtained Ihrough
"word ~ubsLitution":
is "sllccified
through
Ihe use
signal indicating
lhat i r i s lo bcsubstitulion
rccovcrcd from
what has gone
beforc."
Ihisof
justa mlates
grammatical
signal indicating
it i~ toand
be recovered
fromphrases
what has
before." for
lhis
just rcslates
the observation
lhat that
pronouns
abbrevialory
aregone
subslitutcd
previously
mentioned
the obscrvation
that Itpronouns
and abbreviator)'
are substituted for prcviously mentioned
expressions.
is a dcscrintion
of what is lophrases
beexplained
expressions. It is a description of what is to be explained.
Bcrwick
Berwick
40
40
m,
1. To see this, notc that in the casc of a full blnary tree with n leaves and with a pronoun on Ihe
1. To sec
this, notc
that ina bthe
of a nodcz
full binary
trce
!l leaves
with a pronoun
on ccornmand
the
righvmost
branch,
wg
~case
intcrior
have to
bewith
examined
for and
co-indcxing
without the
rightmost branch, about!l interior nodes have to be examined for co-indexing without the c-command
condition, b u ~
only about log n nodes w i ~ hthe c-command condition.
condition, but only about log n nodes with the c-command condition.
41
What cvidcnce is thcrc that lhcrc arc rulcs ihat opcratc on logical rather Khan
What evidence is there that there are rules that operate on logical rather than
syntactic forms? Considcr an cxamplc that [Willianls 19771offers:
syntactic forms? Consider an example that [Williams 1977] offers:
(8) Spcakcr 1: Bob lcfl.
Spcakcr 2: ]%illwill, too.
Speaker 2: Bill will, too.
I h c second scntcncc has a missing Vcrb Pllrasc that is. intuitively, supplicd by
The second sentence has a missing Verb Phrase that is, intuitively, supplied by
h c first scntcncc:
the first sentence:
(9) 51: Bob
[lcftIVp
(9) 51: nob
[left]\IP
S2: I3ill will [cmptylyptoo
S2: Bill will [empty]vp too
As Williains obscrvcs howcvcr, it cannot bc that thc V1) of the sccund scntence
As Williams observes however, it cannot be that the VP of tile second sentence
is rcconstructcd by mpying (or co-indexing) thc svntactic VP of the first scntcnce,
is reconstructed by copying (or co-indexing) the syntactic VP of the first sentence,
since thc rcsult is ill-formcd:
since the resuIL is ill-formed:
(10)51: Bob
[left]\IP.'
I
1. In more derail. Williams proposes lhatm!g& has roughly Ihe foltowing LFstrucrurc:
1. In more detail, Williams proposes thai BilIlefl has roughly the following LF struClure:
(11) b'ast (Bill)Np [ h ( x hve)lVpIS
(11) P'asl (Bill)NI' [;l.x(x lcave}]yp]S
While one may conceivably mpy (or co-index) any pan of lhis formula to Ihc "cmpty" position in &&
While one may conceivably copy (or co-index) any part of this formula 10 the -empty" position in ~
will &g, only copying h e VP in squarc brackets rcsults in a well-formed LF.
will !QQ. only copying the VP in square brackets results in a well-formed LF.
Berwick
42
S1: [Bob
[Hob
(12) Sl:
[ I ~ f t ] (level
(lcvcl
~ ~ ] of
ofsyntactic
[Ieftlvpl
syntactic rcprcscntation)
representation)
(---Rule
tense interpretation
<---Ruleof tcnsc
[Past [Bob
[Ilob [leavelvp
[ I c a v ~(level
(lcvcl
] ~ ~ of logical
form)
[past
logical form)
SZ:
S2:
(---Rulc
<---RuleofVP
of VP co-indexing
co-indexing
[Futllrc [Bill
[Ili:l [emptylvl'.
[c~npty]~,,;
too]
(ICVCI of logical
logical fonn)
[Future
tool (level
Ihis example
cxample also points oot
out thc
similarity of this approach to
111is
the striking similarity
Webber's and Sidner's
tense is rcprcscntcd
represented at the
Wcbber's
Sidncr's work: figuring
liguring out how how tcnsc
level of LF
ti,e kind of Cjtlestion
Webbersecks
while the rule
1.F is just Uic
qlcstion that Wcbbcr
sccks to answer, nhilc
lcvcl
(or copying)
copying) is just the sort of rule
rulc that
th;~t Sid
Sidncr
of VP.
VP, co-indexing (or
ncr wants to
formulate. In the
thc rcmaindcr
section, wc
scc in morc
dctail just how
fonnulate.
remainder of this section,
we shall sec
more detail
close tl~csc
ti,ese approaches arc. in
particular, one can show tllat
ti1at Webber's
closc
In particular,
Wcbber's
rcprcscnlationallanguJge
with recent
rcccnt proposals for the structure of
rcprcscntational language is compatible wilh
1.LF,
1:. and Ulat
Wcbbcr's account of Q!lj;
g~ "naphora
nnaphora is simply another
anothcr Discourse
ti1al Webber's
Grammar "rule
construal", just like
VI' copying rulc.
rule, importantly,
"rulc of constru;~l",
likc the
thc VI'
Importantly, one key
finding that
Ulat Williams' approach
;~ppro;shdocs a
subsunic is Sidncr's observation that the
not subsume
co-indexing
rules for pronoun intcrprctation
interpretation appear
crrindcxing rulcs
appcar to follow push-down stack
diaiplinc: as we will sec,
scc, this is thc
allaloguc of the Specified
Spccificd
discipline:
the 1)iaoursc
Discourse Grammar analogue
Subject
Subjcct condition. If Sidncr's theory
U~coryis correct.
corrcct. then the
tlie rules
mlcs of Discourse
Discourse
constrsincd in a;I computationally
camputarionally advantageous fashion,
Grammar arc constrained
fashion, just as the
rules of Sentence
rulcs
Scntcncc Grammar seem
sccm to be.
1.2.2
1.2.2 Creating
Cre~tingdiscourse
discoursc entities:
cntitics: Webber
\Vebbcr
Wcbber's
go;~l.thcn,
logical precondition
prccondition to Sidncr's
Sidncis work. Discourse
Discourse
Webber's research goal,
then, is a logical
co-indexing involves
co-indexing at the
lcvcl of LF;
1.F: therefore.
co-indexing
involves co-indexing
ti,e rcprcscntational
rcpresentationalleve!
therefore,
befure
discourse entities
cntitics arc linked.
linked, onc
before onc
one can know which discourse
one must know what the
single sentences, one set of
space of candidate indcxable
indexable entities is.
is. In the case of single
corrcsponds to phrases that can be displaced according
according
possible index.ble
indcxablc objects corresponds
to a transformational
transformational movement rule:
(13) The
l h c officer arrested John(13)
John~
[ J ~ h n ] ~wasarrcstcd
, , ~arrested [empty
[cmpty Ii], by the officer,
oficer.
. [John]NI'.
was
For Webher,
Wcbber, thc
analogue of this task is thc
formulation of an LF
the analogue
the proper formulation
syntax for Noun Phrases,
Phrases, or, as shc
she pull
puts it, th?
th~ "combinatoric
"combinatoric representation" of
43
dcnotatum 1
dcnotalmn 2
--
Berwick
44
Verbs
Relative
llclative clauses
clauscs
Adjectives
Adjcctircs
Nouns
Prepositions
Prcpositions
Articles,
Dctcnnincrs.
1)ctermincrs. etc.
Number quantifiers
quantifies
plurals
Predicates
Predicates
lambda-abstracted predicate
predicate
Restriction
Ilcstriction clause
clausc in quantification
quantification
llcstrict domain
domain of
~~fquantification
Restrict
quantification
Predicates
I'rcdicatcs (e.g.,
(c.g., On(x))
Quantifiers as described
dexribcd in text
Counting restriction
rcstriction on
domain ofquantification
El'aluating the
fi7aluaring
rhe Webber
IVebber theory
rheury
How can Webber's
I.F be
Wcbbcr's proposals
propo%ls for l.F
bc evaluated?
cvoluatcd? TIle
The question
qucstion to keep
kccp in mind
here is why Wehher
Webbcr has adopted this p;~nicl~lar
logical languagc in which to map
particular logicall"nguage
surface
LF expressions
cxprcssions rather
rathcr than some
some other.
othcr. At first glance,
glance,
surfacc phrases
phrascs to 1.F
uncovering
uncovcring evidence
evidcncc !.hat
that would choose
choose among alternative
altcrnatibc proposals
proposals would seem
secm
to be problematic: what we arc talking about, at least for people, arc differences
diffcrcnces in
fonn
fonn that arc not visible at
a1 the
Ule surface".
"surface". At best.
bcst. we can observe only indirectly
the
lhc effects
cffccts of different
diffcrcnl choices
choiccs for a logical
logical fonn,
form, and so there
thcre can only be indirect
evidence
cvidcncc that bears
bcars on our choice.
Pursuing
I'ursuing this line
linc of discussion,
discussion, why should one an adopt a logical fonn that is
different
hypotllesis, after
all? The
'The null hypothcsis,
aftcr all,
all, would be
diffcrcnt from surface phrasal form at all?
assumc that LF
1.F looks just like asurfaee
a surface parse of the input string:
to simply assume
(14)
LF)
(surface parse = the LF)
(14) 1saw
I saw Jnhn
John --)
-->(I(I (saw
(saw (John)
(John))) (surface
instead
predicate calculus
instcad of the predicate
calculus fonnul.:
formula:
(15)
(IS) Saw (I,
(I. John)
It would certainly
ccnainly be
bc computationally
computationally simpler to just adopt the fonner
former sort of
representation
rcprcsentation as our LF; then no translation at all would be required. In short,
shon,
one
prcnex
onc must furnish
furnish some
somc sort of an argument for why LF
LF should look like a prenex
normal form rather
rathcr than just its surface form.
nonnat
As it turns out, howcvcr,
however, there
thcre ~
& evidence
cvidcncc for adopting an LF in a prenex
fonn,
at
least
for
sentences
containing
such
quantifiers as everv
every and m.
some. This is
form. lcast
the wen-known
well-known evidence
cvidcnce for an optional rule that moves quantifiers from the
45
placc whcrc thcy trcur in surfacc form and adjoins thc~nto the front of the
place where they occur in surface form and adjoins them to the front of the
ncarcst "S" (scntcncc) nodc. ('l'hc mlc of "quantificr raising" -- "QR" -- proposcd
nearest "S" (sentence) node. (The rule of "qualltificr raising" -- "OR" -. proposed
in [May 19771).
in [May 1977]).
'Ihis modcl prcdicts that scntcnccs in which thcrc arc two quantifiers and only
'Jllis model predicts that sentences in which there arc two quantifiers and only
clnc S-boundary, such as, "F.vcryonc lovcs somconc." should hc ambiguous. The
one S-boundary, such as, "Everyone loves someone." should be ambiguous. The
rcason: sincc thc rulc Q R can apply in any ordcr to first rnovc "everyone" and
reason: since the rule QR can apply in any order to first move "everyone" and
thcn "somconc". or vicc-vcrs, wc can dcrivc two structurcs. onc in which
then "someone". or vice-versa. we can derive two structures. one in which
"cvcryonc" titkcs widc scopc over "sornconc", and onc in which it docs not. And
"everyone" lakes wide scope over "someone", and one in which it does not. And
in fact such scntcliccs arc ambiguous in just this way. It is in his scnsc that QR
in fact such sentences arc ambiguous in just this way. It is in 01is sense Olat QR
accounts For thc apparent ambiguity of such scntcnccs. Silnilarly, the Q R lnodcl
accounts for Ule apparent ambiguity of such sentences. Similarly, the OR model
prcdicts that scntcnccs in which thcrc arc two quantificrs but onc is cmbcddcd in
predicts that sentences in which there arc two quantifiers but one is embedded in
such a way Illat thc rulc QK must cross two houndarics will nof bc ambiguous.
such a way that the rule OR must cross two houndaries will not be ambiguous.
'Ihis prediction too is bornc out, accounting for thc apparcnt lack of ambiguity in
'1l1is prediction too is borne Ollt, accounting for the apparent lack of ambiguity in
John lovcs someone":'
such scntcnccs as."F.vcryonc hclicvcs
such sentences as:'Everyone believes t11at John loves someone,,:l
In short, by assuming QR to bc a movcmcnt nilc, wc can usc thc samc
In short, by assuming OR to be a movement mle, we can use the same
constraints on movc~ncntknown to hold in other cascs to account for the observed
constraints on movement known to hold in other cases to account for the observed
surfacc distrilnltion of ambiguous quantifier rcadings.
surface distrihution of ambiguous quantifier readings.
Siinilarly, judg~ncntsabout group vs. individual rcadings of scntcnccs such as,
Similarly, judgments about group vs. individual readings of sentences such as,
a pjua scctn to bc conditioncd by a rulc of Q R . In the reading
Ihrcc
Three ~ ~ a Dina seem to be conditioned by a rule of OR. In the reading
~ h c r ckrcc boys scpariltcly cat a piz;r;l, wc may assumc that the rulc QR has
where three boys separately eat a piZ7.a. we may assume that the rule OR has
applicd to front thc quantificr tcrm Ihrcc bovs:
applied to front Ole quantifier term three~:
(16) r h r c c xi, x i a boy. [xi Ate a pizza]]
Whcn Qli docs not apply, we gct a collcctivc intcrprctation whcrc ?hc three boys
WhetT QR does not apply. we get a collective interpretation where me three boys
as a group cat a pizza:
as a group cat a pizza:
Webbcr's usc of a SEr opcralor in this casc parallcls the work donc by QR; SET
Webber's usc of a SET operator in this case parallels the work done by QR: SEf
may bc ordcrcd ciihcr inside or outside thc argument to the vcrb. Schematically,
may be ordered either inside or outside the argument to the verb. Schematically,
are much
I. As it turns out, judgtncnts when h e quasi-quantifiers & or a arc subslitutcd Tor
1. As i! turns out, judgments when the qua~;-quantifiers the or g are substituted for SYm are much
Icss precise. Hut recall bat *is mag be because thcse itcms are no1 " m e " quanlifien at all, hcnce are
[~precise. But rccalllhatthis may be because these items are not "true" quantifiers at all, hence are
not subject to QK.
Berwick
46
to indicate that X has bccn drawn from thc domain of individuals that arc boys, is
to indicate that X has been drawn from the domain of individuals that arc boys, is
also fairly standard. For cxiimplc, it follows h c nolarion chosen b y [Chumsky
also fairly standard. For example, it follows the notation chosen by [Chomsky
19751:
1975):
(20) Who did John kiss -->
>
(20) WhoWh-x,
did John
kiss
x a person.John
kisscd x
In sum, to thc cxtcnt that Wcbbcr uscs 1.F diffcrcnccs, particularly scoping
In sum, to the extent that Webber uses l.F differences, particularly scoping
diffcrcnccs, to account for diffcrcnt scntcncc rcadings, the rcprcscntational format
differences, to account for different sentence readings, the representational format
adopted by Wcbbcr stands squrircly within twcnticth century philosophical and
adopted by Webber stands squarely within twentieth century philosophical and
current linguistic tradition.
current linguistic tradition.
1.2.3 Conipating thc Wcbbcr I,F
We arc lcft then with thc qucstion of computation. Givcn an input scntencc, or a
We arc left then with the question of computation. Given an input sentence, or a
prc-parsed rcprcscntation of a sentcncc, how hard is it to actually construct
pre'parsed representation of a sentence, how hard is it to actually construct
Wcbber's I.F? Hcrc uc must be careful to distinguish bctwccn two aspccts of the
Webber's LF? Here we must be careful to distinguish between two aspects of the
complcxity of asystcm such as Wcbbcr's. First, thcrc is the complcxity of simply
complexity of a "System such as Webber's. First, there is the complexity of simply
translating from surfiicc strings to a prcncx form. Sccond, thcrc is fhc complexity
translating from surface strings to a prenex fonn. Second, there is the complexity
of actually computing thc prcdicatcs irnplicd by thc rcprcscntations so constructed
of actually computing the predicates implied by the representations so constructed
-- c.g., "Ate Bill, x". 'Ihcsc arc two diffcrcnt, rhough obviously conncctcd tasks.
-- e.g., "Ate Bill, x". These arc two different, though obviously connected tasks.
Thc second dcpcnds on the prcdicatcs involvcd, how domains of individuals are
The second depends on the predicates involved, how domains of individuals are
actirally dcfincd, and so forth; it is thus diflicult to make any claims about the
actually defined, and so forth; it is thus difficult to make any claims about the
complcxity of this problcm without a model. As we shall scc, part of what Kaplan
complexity of this problem without a model. As we shall sec, part ofwhal Kaplan
and Allcn aim to do is to in fact proposc simplc world modcls for rcstrictcd
and Allen aim to do is to in fact propose simple world models for restricted
domains so as to actually provide an intcrprctation for a Webbcr-type LF. We
domains so as to actually provide an interpretation for a Webber-type LF. We
will rcstrict oursclvcs hcrc to the complcxity of just thc translation of surface
will restrict ourselves here to the complexity of just the translation of surface
strings to a prencx form.
strings to a prenex fonn.
Even so restricted, interesting questions of computational complexity arise.
Even so restricted, interesting questions of computational complexity arise.
Suppose that we adopt Wcbbcr's prcncx form. It is rclarivcly casy to show that if
Suppose that we adopt Webber's prenex form. It is relatively easy to show that if
thc input and output languages for thc uanslation can be generated by a
the input and output languages for the translation can be generated by a
47
particularly rcstrictcd contcxt-frcc grammar, tllcn thc translation will lakc only
particularly restricted context-free grammar, then the translation will take only
ti~nclincar in thc Icngtli or lhc input strings, using push-down stack machincry.
time linear in the length of the input strings, using push-down stack machinery,
Flowcvcr. if wc sssumc iinything likc a "rcal-lifc" pl~rascslr~~clurc
(with NI' and
However, if we assume anything like a "reaHife" phrase structure (with NP and
VI' nodcs), thcn it can bc shown that Ihc translalion will tukc morc than simplc
VP nodes), then it can be shown that the translation will take more than simple
push-down machincry.' In short, translating to a prcncx fonn quickly is nor
push-down machinery.] In short, translating to a prenex f0I111 quickly is not
ncccssarily a trivial task.
necessarily a trivial task.
'1'0 gain soinc iipp~.cciation of thc issucs itlvolvcd hcrc, wc will considcr a
To gain some appreciation of the issues involved here, we will consider a
sirnplificd prcllcx translation and thctl show that cvcn this nlinimal proccdurc will
simplified prenex translation and then show that even this minimal procedure will
dclnand at Icast push-down storc prt~cssing.
demand at least push-down store processing.
Supposc that the input knguagc consists of Flnglish surfacc strings already
Suppose that the input language consists of English surface strings already
translated into solnc kind of I;il~clcdhe~ckcling-- a parsc trcc, likc that provided
translated into some kind of labeled bracketing -- a parse tree, like that provided
by, c.g., thc Marcusparscr. FurLl~crassunic f i a t thcrc is but onc parsc trcc output
by, e.g., the Marcus parser. Further assume that there is but one parse tree output
for cuch scpsrntc input string -- an obvioudy fatsc simplifying assumption, givcn
for each separate input string _. an obviously false simplifying assumption, given
thc cxistcncc of stntcturally atnbiguous scntcnccs. ? h c otltput Ianguagc, thc l.F, is
the existence of stnlcturally ambiguous sentences. 11le output language, the J.P, is
also ;ssumcd to bc uniqircly rcadablc (unambiguous). A simplc grammar for the
also assumed to be uniquely readable (unambiguous). A simple grammar for the
input langungc, ignoring the cxistcncc of VI-' structure, might look like this:
input language, ignoring the existence ofVP structure, mighllook like this:
NP! V NP2; NP:::;. NP (S)
N I ' 3 13c~crtnincr(Adjectivc)Noun or Name
NP= \)etenniner(i\djective) Noun or Name
N o u n a b o y , girl: Ilctcrmincr*a, evcry;
Noun~boy, girl: Detenniner:::::>a, every;
Adjcctivc-big. young ...
i\djectivc~ big. young....
V*lovcs; Nanic* John, Mary, ...
V=>lovcs; Name=>John, Mary,...
S~
Givcn thc input string. "John lovcs a girl", the dcsircd prcnex output is,
Given the input string. "John loves a girl", the desired prcncx output is,
(3x:girl) lovcs John, x
(3x:girl) loves John, x
Evcn givcn hcse imrncnsc simplifications howcvcr thc translation is still not
Even given these immense simplifications however the translation is still not
computationally trivial. Why is this? Intui~ivcly,thc rcason is that the ordcr of the
computationally trivial. Why is this? Intuitively, the reason is that the order of the
NP arguments can bc rcvcrscd in thc translation -- Jolln prcccdcs &Iin surface
NP arguments can be reversed in the translation -- John precedes il gil!. in surface
syntax, but thc ardcr is rcvcrscd in the 1,F. This suggcsts that onc must cithcr (1)
syntax, but the order is reversed in the LF. This suggests that one must either (1)
abandon thc prcncx form; or (2) move to a powcrful sort of computational
abandon the prencx fonn; or (2) move to a powerful sort of computational
machinery.
machinery.
More formally, supposc that thc mapping is cffccted by a syntax-directed
More fonnally, suppose that the mapping is effected by a syntax-directed
translation. 'That is, far cvcry contcxt-frcc rule of thc gnmmar for the input
translation. That is, for every context-free rule of the grammar for the input
language, we associate a corresponding translation rc-writc rule. F ~ c htime that a
language, we associate a corresponding translation re-write rule. Each time that a
re-write rule is uscd to build a portion of thc input Ianguagc parse uec, the
re-write rule is used to build a portion of the input language parse tree, the
48
Berwick
corrcsponding rule is triggcrcd to build Lhc trcc for the output I,F string. (Thus
corresponding rule is triggered to build the tree for the output LF string. (Thus
syn~ax-dircctcdtranslation maps trccs to trccs.)
syntax-directed translation maps trees to trees.)
Givcn thc input rulc, S+NP1 V NP2 we thcrcforc havc a corrcsponding
Given the input rule, S:::;. NPI V NP2 we therefore have a corresponding
output rulc that looks sorncrl~inglike,
output rule that looks somcthing like,
add or delete
directed
a graph,
connecting
thatnew
it currcntly
create
ncw nodcs
andedges
Iabcltothem.
Because
it a nnodes
conncct
nodcs topoints
cithcrto;theit can
left also
or the
righl of old
new nodes
and such
label athem.
Because
it can conncct
newuce
nodes
either itthe
left or the
of old
node,
machine
un pennutc
the output
(thetograph)
construcb
withright
rcspxt
to the tree
nodes, such
a machine
permute
11Ie output
tree iL$
(theability
graph)toit handtc
construclS
wiLh respect
to the tree Such a
grammar
hcncc
non-simple
mnslructions.
specified
by caninput
sped liedmachine
by 11Ie can,
inputhowever
grammar
ability operating
to handleinnon-simple
still- behence
quiteil~cffieicnl.
time linearconstructions.
in Lhc length orSuch
inputastrings if
machine the
can.underlying
however still
be quiteforemdent,
operating
time orlinear
in the
leng11l
of inputofstrings
if
grammar
the Iranslation
is in
LK(k)
Iuk).
(Sce
the section
this chapter
on
the underlying
grammar
for for
the atranslation
lhe section
of 11Iisgrammar
chapler is,
on roughly,
McDonald's
system
definition is
of LR(k)
I.R(k) or
andI.l~k).
LIik) (See
grammars.
An LR(k)
McDonald's
for a delinition
of LR(k)
andbeL1~k}
An LR(k)
grammar is,
roughly,
one system
that gcncratcs
a language
that can
pamdgrammars.
dclcrminidcally.
IcCt-lo-right
with
rhc parse tree
one that being
generates
a
language
that
caR
be
parsed
detcnninisLicaJly,
left-to-right.
with
the
parse
tree of his
conklnrctcd from batom to top.) Ihe Marcus parser is basically a push-down processor
being con1;tnlcted
from with
bottom
totended
top_} The
parser is basically a push-down processor of 11Iis
SOT(. operaling
an ex
LR(kMarcus
)-grammar.
sort. operating with an extended LR{krgrammar_
49
it is simple.
it is simple.
Cor~sidcrnprion (2). Supposs that \ec adopt Willi3nis' xhctnc in part, %I that
Cunsider option (2). Suppose that II'C adopt Willi<lllls' scheme in part, so that
LC t~;mslatios ot' NPI i'NI', - is NPI NI', V* X1 X2. X I and X2 being
Ihe tran~latioll of NP 1 V NP:! is NP I NP2 V'" Xl X2' Xl and X2 being
plnceholdcr clclncnts for the ;Ir,gumcnts NPI and NP2. ].'or cz:n~nplc,b l 1 1 1Ivvq
placeholder clements for tl1e ,lrgumcnts NP t and NP2' For example, .fu!lu loves
*-Alarv will now gcl ~nnppcdtt! tho hnn, "Jollnl hlriry2 lovcs XI Xi'. Obsc~vc
}.1MV will now gel mapped to tlle fOIlTl, "Johnl MarY2 loves Xl X2". Observe
IRst this now 3 silnplc synti~x-dircctcd~n~isInlic,o
-- tllc ordcr of non-lcnninals
IhJt this now .h a simple ~yntax-directed trall~[;)Linn .- the order llf nnn-lenninals
in iopitt ; I I K . ~ot~lptrln~lcsis thc s;lmc. Wc illustrittc an c~i111iplc
tri~n~lation
ofJ&~
in input and output n11c~ is the same. We i1Iustl'(j[C an example translation of 191m
loves
cvcrv
Iwy:
loves evcry !..2m:
Slack:
stack:
NPI
laM
y
NP2
NP2
Step
Step No.
No,
1
1
2
y.
8
9
(Rcc:iIl t l ~ a la cyl~t~lr-dircctcd
tr:~n.;l;~tion rcquircs that Lhc
~Ylltaxdirt'ct('d tr,Il1"lalilll1 rcqllirc~ that the
sy~iibol<
c j f ttic outpi~t[:.in:,Iit(1011 !?r:~~~~rlli~r
bc in~cnpcr!icdwith
sYlllbol~ of the Dulput (~i1nsla(i()ll !~r:J111Hlar be inlc~rerscd with
host of chc i ~ ~ p gl-rnilll?ar.
i~t
'I'liis is tllc ~c;~.;unfor thc
!hllS( of the illput gr'rllllnar. This is* the lcaStln for the
appcilw!iCC or Lhc dihtincr V and V syln!~ols.)
appeill';lflCC or the dhtinct V lInd V'" symbols.)
mccall that a
Berwick
50
onc dcmands lincar time opcraticrn, it may ~ ~ cbcl l h a t thc undcrlying grammar
one dem:mds linear time nperdtion, it may well be that the underlying grammar
for thc input langt~agc~nusibc I.ll(k) or I,l<(k)-likc.
for the input language must be I.R(k) or LR(k)-likc.
(21) There'I'hcy
werewcrc
sometasty,
strawberries
in the
and didn't
last fridge.
long.
w.
1. To we this. consider the case where thcrc is some leading paticrn o f Noun Phrases. NP1.
1. To see this, consider the case where there is some leading patlcm of Noun Phrases, NP1,
NI'2....,KP,, followed by a pattern of pronouns to which it must be linked. Pronoun l,....Pronoun..
NI'2....,NPn followed by a pauern of pronouns to which it must be linked, Pronounl .....Pronounn.
'Ihcre are an cxponcnlial number or p m i b l c pallcrns of binding rclaiionships between antecedent
There areNI"s
an exponential
number
possible
patterns
bindinginvrsligalcd.
relationshipscxponcntial
between antecedent
and pronouns.
IT allofof
these must
be of
cxplici~ly
rmurccs will be
NP's andconsumed
pronouns. If all of these must be explicitly investigated exponential resources will be
consumed.
51
"plausibility" and "n~caningfulncss"rcmain unclaborarcd. I .ct us call this sccond
"plausibility" and "meaningfulness" remain unc1aborated. I.et us call this second
diffic~lltytfic CO-indcxiiigconfirm:~tionprobfcm.
difficulty the co-indexing confirmation problem.
Sidncis rcscarch aims to solvc just thc first of thcsc problcrns, thc co-indcxing
Sidner's research aims to solve just the first of these problems, the co-indexing
proposal problctn. Sidncr3s solulion, not surprisingly, is to posit an additional
proposal problem. Sidner's solulion, not surprisingly, is 10 posit an additional
constraint that cuts down thc spacc of possiblc anlcccdcnt-ana]>llorpairs that must
constraint that cuts down the space of possible anlecedent'anJphor pJirs that must
bc cxamincd. Out ofall tlic NPs porcntially availablc Tor co-indcxing in a tcxt at a
be examined. Qut of al1 the NPs potentially availJble for co-indexing in a text at a
givcn point, only onc, dcsignatcd thc currcnt f t ~ u sis actunllv availablc as thc
given point, only one, designated the current focus is actu;JlIy avaibble as the
"first choicc" Tor eel-indcxing. .l'hc kcy idca is that what is currently bcing talkcd
"first choice" fill' co-indexing. The key idea is th;Jt what is currently being talked
about, dlc so-callcd currcnt ftrus cstablislics a local contcxl or "first rcsort" for
about, the so-called current focus establishes a local contexl of "first resort" for
resolving i~naphorarclationsl~ips. In tlrc scntcnccs abovc, sincc "stmwbcrrics" is
resolving anaphora relationships. In tJle sentences above, since "strawberries" is
t l ~ ccurrcnr f t r t ~ sat Ihc timc thcy is cncounrcrcd. by dcfault thcv is co-indexed
tJle current foclls <ll the time tJley is encountered. by default tlley is co-indexed
with strawbcrrics. If for somc reason thc local binding fails, tllcn and only thcn
with stri)wberries. If for some reason the local binding fails, then and only then
will altcrnativcs bc tricd. I:or instance, if thc sccond scntcncc was, "tJc ....",thcn
will alternatives be tried. For instance. if the second sentence was, "He....", then
bccausc thc dcfrlult antcccdcnt str:iwbcrrics is pluri~ltlic default binding would be
because the default antecedent strawberries is plural the default binding would be
rejected.
rejected.
flow is focus cliangcd? Simply by proposing and confirming n ncw potential
How is focus changed? Simply by proposing and confinning a new potential
focus. just as the vcry first firus of a discourse is computed. Notc Ilowcvcr that
focus. just as the very first focus of a discourse is computed. Note however that
thc currcnt f i ~ u scrvcs
s
as Lhc d c f a ~ ~ctr-indcxcr
lt
until it is dislodgcd by explicit
the current fiJeus serves as the default co' indexer until it is dislodged by explicit
disconfirmation. 'I'hus thc patrcrn for computing a ncw focus whcn a current
disconfirmation. Thus the pattern for computing a new focus when a current
focus alrcady cxisls must bc lo (I) propnsc a new ftcus; (2) disconfirm dlc current
focus already exists must be to 0) propose a new focus: (2) disconfirm the current
fmus; and (3) confirm the ncw focus. (Actually, (1) and (2) could be
focus: and (3) confirm the new f(Jeus. (Actually, (1) and (2) could be
intcrchangcd.) What happcns to thc old focus? Sidncr clnims that prcvious foci
interchanged.) What happens to We old focus? Sidner claims tJlat previous foci
arc availablc for later usc. lndccd, thcrc sccms to be snmc cvidcncc -- suggestive
arc available for later usc. Indeed, tJlere seems to be some evidence -- suggestive
but not ovcrwliclming -- that old foci arc maintained in a last-in, first-out ordcr.'
but not overwhelming -- t1lat old foci are maintained in a last-in, first-out order. l
As we shall scc, this stack consmint, if truc, has the cffcct c~flimiting thc numbcr
As we shall sec, this stack conslfaint, if true, has the effect of limiting the number
of an~cccdcnt-anapllrrrbindings that milst be invcstigatcd, and iil ccrtain cascs
of antecedellt-,JIlaphor bindings that must be investigated, and iil certain cases
cntircly climindtcs tllc combinatorial scarch for antcccdcnls that markcd carlicr
entirely eliminates the combinatorial search for antecedents tJlat marked earlier
computatiuna1 syslcrns dcaling with anaphora resolution. It is a kind of
computational systems dealing with anaphora resolution. It is a kind of
multi-scntcncc analogue of ccrtain single scntcncc constraints such as the
multi-sentence analogue of certain single sentence constraints such as the
Spccificd Subjcct Constraint IChomsky 19751 or Sul)jaccncy [Chonlsky 19771.
Specified Subject Constraint IChomsky 1975] or Subjacency [Chomsky 1977].
Such a finding is of i~nmcnscintcrcsr to h e computalionalist, bccausc such
Such a finding is of immense interest to the computationalist, because such
constraints play a valuable rolc in restricting thc amount of work that a language
constraints playa valuable role in restricting tile amount of work that a language
processor must do. For cxamplc, [Marcus 19801 has suggcstcd that St~bjacency
processor must do. For example, [Marcus 1980] has suggested that Subjacency
and thc Spccificd Subject Constraint arc intirnatcly connectcd to thc dcsign of a
and We Specified Subject Constraint arc intimately connected to the design of a
machine h a t can parsc English fairly cficicntly on-linc without backup. If
machine that can parse English fairly efficiently on-line without backup. If
1. Sidncr says nothing about whcther a "stack" of foci can be of arbilrary deplh. Presumably. as is
J. Sidner
says nothing
about
whether
of foci
be of arbitrary
deplh.
as isin such a
typically
assumcd,
there
is somca "stack"
finite limit
lo hcan
e amount
of malcrial
that Presumably.
can be rclalned
typicallymanner.
assumed, there is some finite limit to the amount of material that can be retained in such a
manner.
nerwick
52
There
1. Bclow we shall see lhal by graning an addiljonal laycr o f structure onto Sidner's basic lhmry a
1. Below
we shall
see thai by gralling
an gruclure
addiLional1ayer
or structure
Sidner's
thoory
- a can be
rhcory
or task-oricnted
d~xoursc
proposed
by IGroszonlo
19781
-- the basic
Sidncr
approach
theory or
task-orieoted
discourse
structure
proposed
by
[Grosz
1978]
-the
Sidncr
approach
can
be
made to handle discourns h a t go beyond simple narraliva.
made to handle discourws that go beyond simple narratives.
S3
l!!ik!:
Ule potc~itinl
potential focus is first considcrcd
considered to
&k]r for checking
chccking potential
potential foci.
firi. That
'l'li;it is.
is. Uic
be
is c110t
not
be copular, h~lCc htheme
thc Subject
Subjcct uf
o raabeh copula
copula sentence;
scnlcncc: if
i f the
Uic sentence:
s c ~ i t c ~ is
~
c &copulsr,
cmc
bc the
of the
sentence
is
considered
to
be
the
most
promising
potential
focus;
if
the
potcntial krus;
thc scntcncc is c~msidcrcd be thc nlost
theme
focus.
thematic mlcs
roles arc clicckcd.
checked. and so
thcmc cannut
cannot be
be confirmed
confirmed as
S
I;
h ~ u s ,other
otlicr tllcmatic
forth.
formulation ooffoclls,
Lirtli. If
I f we
wc translate
transli~tcthis
Uiis ranking
ranking into the
tlic "predication"
"prcdicr~tion"formulalion
f fmocus, we
sec
Ulat
ulis
order
simply
makes
explicit
a
set
of
heuristics
for
determining
scc chat U~is
makcs cxplicit
licuristics Tor dctcrmining what
the
relevant in tliat
that somc
some indcpcndcnt
independent
tlic predication
prcdica~ionof
o f the
tlic sentence
scntcncc is. This
'I'llis fllct
k t is rclc\-;~nt
work
the predication 11f
of a scntcncc
sentence is
has been
bccn done
donc in
in classifying
cl;~ssifyingthe
thc way in
ill which
wliicli Uic
work has
determined,
ncr's potcnti;d
potential kfoci
heuristics.
dctcrmi~icd.and
and the
the results
rcsults apparently
npparcnlly fe-state
rc-statc Sid
Sidncr's
r i Iic~~ristics.
'Illis
some
evidence that thc
the rc-dclinition
fe-definition ooff
coincidcncc thus
t l l ~ ~provides
providcs
s
solnc additional cvidc~icc
'Illis coincidence
focus
extends Sidncr's
Sidoer's list
lis~ in
krusSas
I; "the
"tlic x such
SLIC~I that I'(x)"
I'(x)" is
is on the right
riglit track. It also cxtcnds
that
cases arc handled
heuristics.
that some
s~~mccascs
liandlcd that
tliat arc
;ire not
nut covered
co\,crcd by her
hcr original ssetc ~ooff hcuristics.
I.et
tinguistic %(irk
work in morc
more dcuil:
detail: the
I.ct us
us sketch
skctcli out some
same of
o f the results
rcsults of this linguihtic
theory
us say that
Ulat thcrc
there is a rulc
rule that builds
t h a ~ r yis
is Ulat
Uiat of
o f (Williams
[Willi.lms 1980J.
19801. I.et
I.ct us
predicate
hy means ofco-iodcxing.
of co-indexing. thcrcby
thereby
prcdicalc structure
struculrc out of constituent
cc~nstitucntstructure
strocturc by
asn~ci;~tingDrcdicotcs
with arguments.
nrgumcnts. The
'lhc general
gcncral co-indexing
associating
predicates with
co-indexing sclicma
schema is simply,
(24) Co-index
Co-indcx NP
NP and
and X
whcrc X =
= ao
an Adjective
Adjcctivc Phrase
I'hrasc (AP),
(AP), a PP,
where
PI',
an NP,
Nl', a VI',
VP, or an
an S
an
'Thcrc is
is certainly
certainly not
nut much
much in
i n the
thc way of constraint
co~istrai~lt
in this prc~posal.
There
proposal. What is
missing
be co-indcxcd,
co-indexed, a
thc machinery
machincry telling
t c l l i ~ ~LIS
us
g which NP's and X's arc to bc
missing is
is the
rnattcr which
which we
wc will take
takc lip
up shortly. Assuming for now that Lhc
matter
the rules cxist
exist to
propcl.ly co'index
co-indcx items,
itcms, their
thcir effect
cffcct will bc
properly
be to !nap
map a constituent suucturc
structure like (i)
(i)
bclow into
i!lto structure
structure Oi):
(ii):
below
J,[ohn]
is (AI'
iAP
sick]]
(25) (i) ([~pJohol
is
sick))
Predication co-indexing
Predication
(ii) I[NI, JohnIi is FAp sickli]
l l e NP
NP John
& and
and the
thc AI'
AP sick
sic): wind up
up with idcnticnl
The
identical indiccs,
indices. corresponding --perhaps
-to
intuitions
that
&
is
bcing
prcdicatcd
ofJohn.
Co-indcxing
thus
has
perhaps -- to intuitions
ill.!>. is being predicated
Co-indexing
thc effect
cffcct of telling
tclling us
us which
which thing x is being
bcing prcdicatcd
the
predicated about --- that is,
is, ooff
11s "the
"thc Xsuch
X such Ulat
Ulat P(X)".
P(X)". Given proper co-indcxing,
pointing out to liS
pointing
co-indexing, a trivial rule
focus interpretation
intcrprctaiiun can
can then
then supply the
thc right focus:
of focus
of
54
Berwick
(26)
(26) II~I'
I[,, John]
John] is [AI'
I,,, sick]]
sick]]
Predication co-indexing
I'rcdication
11,1'
[I,, John];
J o l ~ nis] ~[AI'
[,, sick])
sickli]
xX
PIX)
I'W)
Focus
interpretation
Focns intcrprctation
In this sense
rule of intcrprctation,
interpretation. relating aa
scnsc the
thc focus
flrus rule
rulc is simply
silnply a rulc
rcprc~cntatjQn
structure)
another
lcvcl (thar
(that at
rcprcscntatiun at one
onc le\'el
lcvcl (that
(tl~atof COllstilucnl
C O I I S ~ ~ I U CS
I I~~ ~ I I C I U
toTanothcr
C)
lcscl
which other
operate). llcsidcs
Besides thcse
these cascs
casc~ of
of
othcr "semantic
"scniantic interpretive"
intcrprctivc" mles
n ~ l c sopcratc).
prcdiciJtion
there are
arc cxamplcs
examples of
of
prcdicetion that arc grammatically
gratnmatic;~lly(structurally) based, thcrc
thematic predication
observe, in such cascs
eases
prcdication control.
colitrol. /Is
As both Sidner
Sidncr and Williams ohscrvc.
the
U,e prcdication
predication can bc
be propcrly
properly bcatcd
located ifif it
tllc NP that is the
tllc focus
RICUS or
ur Subject of UIC
isis identified
where lhcmc
theme ct~rrcsponds
corresponds rt~ughly
roughly to
idclirificd with the
thc thcme ofU,e
oflhc sentence,
scntcncc. whcrc
the
thc notion
norion of "affected
";~ffccrcdobject."
objcct." '1l1is
'lliis approach is only as good as onc's ability to
identify
dillieulties of
lif its
idcntiry the U,eme
U~cmcof a sentence,
scntcncc, a problem that is fraught with diflicultics
own.
"case framc"
frame" rcprcscnlations
representations of
of scntenccs:
sentences;
(Scc, for
fur example,
cxarnple, U,e
the studies of
of"c5se
own. (Sec,
[Jackcndoff 1972].)
19721.) In any case, at worst (Williams notes) each
[Filllnorc 1968]:
19681: (Jackendoff
[Fillmore
verb
indicate which
which NP is the Uicme.
U,eme.
vcrb can
can bc individually
individually markcd
mnrkcd so that it will indicatc
cx;~mplc.the verb
vcrb struck probably requires
rcqujrcs cxccplio~~al
For example,
exceptional marking to handle
han<lle aa
scntcncc such
such as.
as.
sentence
55
'I'hcrc arc pcc~plc111 Ihc room.
There arc people III the room.
,,[ t l i c r ~ ][bc]
~ [N PIi [in thc room]
I..I.... p there].1 [be] [NPllin
the room]
I
As a rcsult, ~ h c s cscntcnccs fall togctlicr with schc~naI ofgrammatical prcdication
As a result. these sentences fall together with schema 1 of grammatical predication
control, N I ' k AP, with AP= rhc rcmnining postvcrbal coniplcmcnt; tlic NI'is thc
control, NP be AP, with AP = the remaining postverbal complement; the Nt> is the
Subjcct of thc prcdication, lic~iccthc fixlus. just as dictatcd by Sidncr's algorithm.
Subject of the predication, hence the focus, just as dictated by Sidner's 'llgorithm.
(ii) &-copuli~s 611 directly undcr schcma 1 of grammatical control: thc NI' subject
(ii) Jk-copulas filII directly under schema 1 of grammatical control: the NP subject
is t l ~ cSubjcct of tlic Predication.
is the Subject of the Predication.
2. l'hcmc as focus: All cascs ~ h c r cfocus is Lhc thctnc of a scntcncc fi~tlundcr the
2. Theme as foclls: All cases where focus is tl1e theme of a sentence fall under the
hcading of thcmatic control of prcdication co-indcxing. Oiicc again. thc St~bjcct
heading of thematic control of predication co-indexing. Once again, the Subject
of the prcdication (as dctcnnincd by t l ~ cco-indcxing rules) corrcsponds prcciscly
of the predication (as detennined by tlle co-indexing rules) corresponds precisely
co thc initial focus.
to the initial focus.
3. Otlicr tllclnatic rolcs: other rolcs markcd by thc vcrb (such as Instn~mcnt)arc
3. Other tl1ematic roles: other roles marked by the verb (such as JnstnuTIcnt) are
somctimcs optional: that is why ihcy arc in gcncral lowcr in ilic hierarchy of
sometimes optional: mat is why they arc in general lower in the hierarchy of
potcntial foci. 'I'hcrc arc two subcascs: thc additional rolcs can bc either
potential foci. There arc two subcases: the additional roles can be either
obligatory -- rcquircd by thc vcrb -- or optional.
obligatory -- required by the verb -- or optional.
(i) Obligatory thclnatic rolcs: thcsc arc, c.g., chc rccipicnt of an action, as in double
thematic roles: these arc, e.g., the recipient Df an action, as in double
objcct constnlctions such as. JJ in 1 gavc thc strrtwbcrrics
object constnlctions such as, llill in 1gave the strawberries ill llill.
(i) Obligatory
w.
(ii) 'l'n~lyoptional thcmatic roles: thcmatic clcrncnts that nccd not bc prescnt for
(ii) Tmly optional ll1ematic roles: thematic clements that need not be present for
thc scntcncc to mrlkc scnsc, c.g., optional I'rcpositional Yhrascs, as in. 1 kisscd Sue
the seRtence to make sense, e.g.. optional Prepositional Phrases, as in. 1kissed ~
at sundown.
1!! sundown.
Note, as might be cxpcctcd if focus is linkcd to thc prcdication of a sentence,
Note, as might be expected if focus is linked to the predication of a sentence,
that such optional clc~ncntsarc lcss highly valued as potcrilial f~ri;
this is bccause
mat such optional clements arc less highly valued as potential fod: this is because
tlicsc thc~naticclcnicnts do not play an csscntial role in the prcdication structure
these mematic clements do not play an essential role in the predication structure
of thc vcrb. 13y identifying focus with prcdication structure wc thus obtain a
of the verh. By identifying focus with predication structure we thus obtain a
simplc explanation of tllc ordcring of polcntial focus prcfcrcnces that is obscrvcd
simple explanation of the ordering of potential focus preferences iliat is observed
but not accounted for by Sidncr.
but not accounted for by Sidner.
Having calculated thc likely candidate foci, the sccond slcp in thc focus
Having calculated the likely candidate foci. me second stcp in the focus
co-indcxing computation is co~~firmation.
A potcntial focus is conlinncd wlicn it
co-indexing computation is confinnation. A potential focus is confinned when it
is known that a following pronoun or dcfinitc NP
bc co-indcxcd to the
is known iliat a following pronoun or definite NP illl be co-indexed to ilie
potcniial focus. A potcntial focus is disconfinncd if it is dctcrmincd h a t the
potential focus. 1\ potential focus is disconfinncd if it is detennined mat ilie
individual or sct of individual clcmcnts pickcd out by tlic focus cannot bc the
individual or set of individual clements picked out by the focus cannot be the
same sct of individuals pickcd out by a following pronoun or definite NP. Once a
same set of individuah picked out by a following pronoun or definite NP. Once a
focus has bccn confirmed, it is the Furrcnt focus and scrvcs as thc first choice for
focus has been confinned. it is the current focus and serves as me first choice for
any latcr co-indexing.
any later co-indexing.
fkrwick.
56
Hei
was hurt badly.]
BCi INPi was hurt badly.]
Tnkcn togcthcr, rhc currcnt focus along with its a)-indcxcd pronoun detcrmines a
Taken together, the current focus along with its coindexed pronoun determines a
sct of propositions, wit11 each proposition saying something aboul Che focused
set of propositions, with each proposition saying something about the focused
entity:
entity:
1,ct us call Lhis bundlc of propositions about a particular focus a focus context.
57
(31)
(31) Wilbur
Wilhur is as fine
finc scientist
rcicntist and a delightful guy.
He
buok a wbile
gaic me
mc a;I balk
whilc back wbich
which II really enjoyed.
cnjoycd.
Hc gave
It was on relativity
rcl;~tivitytheory
thcory and talked mostly about quarks.
'I'hey arc h<lrd
hard to imagine.
ilnaginc. because
bccausc they
Uicy indicate
indicate
They
nccd for elemenwry
clc~ncnbryfield
ficld theories.
the need
Thcsc theories
tlicorics arc tremendously
trclncndously complicated.
These
'lhc use
anaplioric it forces
'Ibe
usc of an anaphoric
rctricvc the (fonnally
(formally
k~rccsone
onc to "pop back" to retrieve
rcprescnted) description
dcxription oCtile
of am.
hlol-c importantly,
importantly. Sidncr observes
obscrvcs that this
represented)
book. More
bchavior is sl.lck-likc.
intcrvcning foci
fwi between
bctwccn blx,lt
ficld theories
thcories
behavior
stlck-likc. in that the intervening
book and field
-- quarks -- arc apparently "lost" when
whcn one
onc accesses
acccsscs the bol,k.
A picture of the
-book. /\
changc in accessible
acccssiblc Noun
drscriptions might luok
look like this:
change
Nuun Phrase descriptions
Step 2
Step 1
1
(before "Anyway, 1...")
I...")
field theories
accessible)
(most accessible)
quarks
U
the book
t1ie
Wilbur
the book
Wilbur
, (least accessible)
Figure 4 Accessible
Accessible Noun Phrases
I'hrases in a stack model.
interesting to ponder
Although intuitions arc not completely clear here, it is interesting
force of the stack claim comes to. Consider a weaker, alternative
what the force
rcprcsentcd by an
hypothesis: that the "things talked about" in a discourse can be represented
hypothesis:
unordered Jist
To
be
concrete,
let
us
say
that
items
enter
memory
as they are
'
T
o
bc
lct
mcmory
list
Berwick
58
mcntioncd, and thcn arc availnblc for anaphoric acccss strbjcct to a dccay
mentioned, and then are availablc for anaphoric access subject to a dccay
limitation dcpctidi~igon tlic "timc" since thc itcm was last ~ncntioncd.(I.ct us
limitation depending on the "time" since the item was last mentioned. (Let us
ignorc o b v i u ~ objections
~s
about how "timc" is to bc mc~rsurcd.intcrucning effects,
ignore obvious objections about how "timc" is to be mcasured. intervening effecL~,
and thc likc: this "list" hypotllcsis is Incant to be illustrarivc. not a serious
and the likc: this "list" hypothesis is meant to be illustrative, not a serious
suggcstion about actu;tl discoursc bchavior.) Nolc Lhar witi~a dccay factor addcd,
suggestion about aClll;tl discoursc behavior.) Note that with a decay factor ildded,
unordcrcd, sincc itclns arc ordcrcd in an acccssibility liicrarchy
thc list is not
Ule list is not re;llIy unordered, since items arc ordered in an accessibility hierarchy
dcpcnding upon rcccncy of mcntion. On this nltcrnativc vicw, Wilbtrr, tllc book.
depending upon recency of mention. On this alternativc view, Wilbur, the book,
and quarks arc all still acccssiblc aftcr is uscd to pick out t i ~ cbook in the
and Quarks are all still accessible after it is used to pick out ~ book in the
"Anyway" scntcncc, but now 11ic books is morc acccssiblc (sincc it was last
"Anyway" scntence, but now the books is more accessible (sincc it was last
"rcfrcshcd" by being poinkd at by thc i~napliord). 'I'liis list Iiypolhcsis thus
"rcfrcshed" by being pointed at by the anaphor il). This list hypothcsis thus
contrasts with Sidncr's:
contrasts with Sidner's:
Stcp 2
stcp 1
Step 2
Step 1
(Wilbur, thc book, quarks, ficld thcorics) (Wilbur, Q. F1',thc book)
(Wilbur, the book, quarks, field theories) (Wilbur, Q, FT, thc bOOK)
Increasing acccssibility -->
Increasing accessibility --)
Figure 5 Acccssiblc Noun Phrsscs in a list dccay model.
Figure 5 Accessible Noun Phrases in a list decay model.
Which struclurc morc adcquatcly rcflccts hunii~ndiscoursc bchavior? If the
Which structurc more adcquntcly renects human discourse behavior? If the
theories
lisi dccay nlodcl wcrc correct. thcn wc would cxpcct eithcr auarks or
list decay model wcrc correct, then wc would expcct either quarks or field theories
to be (rclativcly) acccssiblc for anaptioric contact aftcr the "Any*ayw scntence
to be (relatively) accessible for annphoric contact aftcr the "Anyway" sentence
(213, just as thc bouk was. Howcvcr, this sccms not to bc thc casc, for scntcnces
(20), just as the book was. Howevcr. this seems not to be the case, for sentences
whcre one attempts to point hack to cithcr of these old, prcsumably vcccssible foci
where one attempts to point back to either of these old, presumably accessible foci
sccm difficult to comprchcnd:
seem difficult to comprchend:
novel.
O n thc other hand. the old focus Wilbur -- a n itcm that should have "decayed"
On the other hand, the old focus Wilbur'- an item that should have "decayed"
59
--
asmuch
if not morc than ou;lrks
acccssiblc:
as
much ifnot
quarks is easily accessible:
1.2.5 Evidence
b:\idcnce for
lor Sidne,s
Sidner's focus
locus theory
1.2.5
Intuitions about dialogues such as thc
onc above,
above, though
thougl~suggestive,
soggcstivc, arc shaky
Intuitions
the one
cnough that one
onc ougiit
othcr kinds of confirming evidence
evidence for
enough
ought to cast about for other
modcl. Hcrc
Sidncr's model.
Here thcrc
there arc
aTC some
made. First of all,
nlmc genera]
gcneral points to be
bc madc.
all,
rcprcscnt;ainn of focus
fcrus mimics tllc
rulc of QR, u,ith
iti Qx-]l(x)
Qx--l'(x) fonn.
form.
Sidncr's representation
the rule
with its
Sccond. Sidner's
Sidncr's representation
rcprcscntalion is compatible
comp;~tiblcwith strcss
strcss phcnomcna
Second.
phenomena that are
uith focus. This is a particularly
propcrty, sinn:
known to intcract
interact with
particularly interesting property.
since it
h i ~ ~att s how onc would begin
bcgin to intcgratc
propcrtics of spoken language into a
hints
integrate properties
model
Finally, thc
tbe stack-like
behavior of foci
~nodclof discourse.
discoursc. Finally,
stack-likc bchavior
foci is analogous
analogous to the
more
syntactic opacity.
opacity. Let
murc familiar case
casc of
ofsyntactic
I.et us review
rcvicw this evidence
c\idcnce in turn.
Syntactically,
there are a variety of mechanisms
rncchanisms that create
crcatc structures
structurcs of the
Syntactically. thcrc
foom,
p(x)":
s.t. P(x)":
form, "the xx S.t.
Topicalization: Strawberries,
Strawbcrries. II never
ncvcr liked [e]
[el
Topicalization:
In eacb
the focus
cach case,
casc, the
thc fronled
fronted item
itcm is intuitively thc
focus of the sentence.
scntcnce. Thus, if the
"x S.t.
p(x)" fonm
rigbt one for focus,
s.t. fix)"
form is in fact the right
focus, then, as expected,
cxpcctcd, syntactie
syntactic
60
Berwick
Assuming now that this scntcncc has an abstract conslitlicnt structure with an
Assuming now that this sentence has ,111 abstract constituent structure with an
"cmpty" NP acting as a placeholdcr fur the normal position in which
"empty" NP acting as a placeholder for the nonnal position in which
"strr~whcrrics"would be found:
"strawberries" would be found:
(7) IStrawberries]NP [S I [ypnever liked [NpcmptyJ]]
61
A.
s-structure
intcrprctiition rulcs
~.f'XUS<--focus
i'terp"Wdu, ru1"
strcss
stress nltes--)
nlles--)
PI:
I,F
LF of gcncrntivc grammar.
FigurePF
6 A decomposition
Figure 6 1\ decomposition of generative grammar.
It is important to obrcrvc 1113t thc stress rulcs do not intcract with 1,F cxccpt in
It is important 10 (lb~erve that the stress rules do not interact with LF except in
Ole scnsc that h r c is a co~nnionintcrmcdiary rcprcscnt;rtion, s-structure, that
the sense that there is a common jl1termediar~' representation, s'structure, that
scrvcs as the bast dara stroctor-c for both tlic rirlcs that h n n rcprcscntations of
serves as the base daw strucllIre for both the rules that fonn representations of
c s fi)nn'rcprcscntations of ftwus. 'I'hus thc hcnrcr has an
stress and tllc r ~ ~ l that
stress Jnd the rules that fonnrepresenwtions of focus. Thus the hearer has an
indircct rourc that can bc used to dcducc whar fwws must hc, given stress, cven
indirect roUle that can be used to deduce whal focus must he, given stress, even
though strcss docs not, it] itsclf, dcienninc f t ~ u s . Namcly. if the strcss rulcs are
though stress docs not, in itself, detennine focus. Namely, if the stress rules are
invcrtih~c.~
thcn s-structnrc can bc rccovcrcd -- this is just part of paning. Givcn
invertible,l then S'structurc can be recovered -- this isjust part of parsing. Given
s-structure, thc f u c ~ rulcs
~ s may bc applicd to rccovcr Ihc focus.
S'structure, the focus rules may be applied to recover the focus.
T o scc h o w focus interacts with stress. considcr thc cascs discussed by
To sec how focus interacts with stress, consider the cases discussed by
[Chomsk y 19761. I'hc scntcnce.
[Chomsky 1976]. The sentence,
(39) 7hc woman he lovcd bctraycd someone.
'Ilic rcason for this follvws from a morc gcncl'al rcstrictinn on co-indexing,
'nle reason for this follows from a more general reslnctlon un co' indexing,
namely, that a quantificr cannot bind a pronoun to its Icft. Compare, for
namely, that a quantifier cannot bind a pronoun to its left. Compare, for
example,2
example,2
(41) (i) *'lhathci was draftcd shouldn't botllcr cvcry soldieri.
(41) (i) *1hal hCi was drafted shouldn't bother every soldierj'
(ii)l'hat hci was draftcd shouldn't bother nil$.
(ii) That hCj was drafted shouldn't bother Ilill j.
I n (39) thc quantificr somconc cannot bind hc for just this rcason.
In (39) the quantifier someone cannol bind he for just this reason.
Now considcr the sucsscd counterparts to thcsc scnlcnccs. (41i) bclow, with
Now consider the stressed counterparts to these sentences. (4li) below, with
1. An additional asumption that must bc justified: it is presumably not a necessary property of lhese
1. An additional
that mustSJbcjuslilied:
it is cxists.
presumably not a necessary property oflhese
mlcs thata~umption
lhcy are onc-lo-onc,
that an invcne
rulcs lhallhey
are onc'lo'one,
lhal an
inverse exists.
2. Thcsc
sentcnccs aresofrom
[llornstcin
19811.
2. These sentences are from [Ilornstein 1981].
Rcrwick
Rerwick
62
62
(43):
(43)
'Ihcn the focuscd NI' John is likc a quantificd NY, hcncc cannot bind k,just as in
'1l1en the focused NP John is like a quantified NP, hence cannot hind he, just as in
cast (39). SilniIa1.1y. if John is i~nstrcsscd,as in (42ii), tllc~iwc havc Ihc analoguc
case (39). Similarly, if John is unstressed, as in (42ii), then we have the analogue
of (40): John is not frontcd. hcncc is not quantificd, llcncc can now bind & as in
of(40): John is not fronted, hence is not quantified, hence can now bind ~ as in
(40). (40).
Summi~rizing,thcrc arc two scts of stn~cturalfacts that support a "Qx-l'(x)" or
Summarizing, there arc two sets of stmctural facts that support a "Qx'P(x)" or
operator--bound variahlc rcprcscntation for focus. Onc is thc varicty of syntactic
opcrator--bound variable representation for focus. One is the variety of syntactic
mcchi~nisn~s
tlilit, blindly iis it wcrc, crcatc a Qx--P(x) funn that is intcrprctcd
mechanbms that, blindly ,IS it were, create a Qx-'P(x) fonn that is interpreted
unifol.rnly as altcring Uic focus of a scntcncc: Lhc othcr is thc apparent fact that a
uniformly as altering the focus of a sentence: the other is the apparent fact that a
strcsscd NP th;~tacts as if it wcrc in Qx--l'(x) fimn is likcwisc intcrprctcd as
stressed NP that acts as if it were in Qx--P(x) ronn is likewise interpreted as
indicating thc filcus oTa scnlcnce.
indicating the focus or a sentence.
If disco~lrscstructurc rcally rcprcscnts thc propositional structurc of multiple
If discourse structure really represQnts the propositional structure of multiple
scntcnccs, thcn thcrc should bc no rcason why it could not rcflcct somc larger,
sentences, then there should be no reason why it could not reflect some larger,
ovcr-all 1~1.opositional
strucrurc of a discoursc. Significantly, there is some cvidcnce
overall propositional structure ofa discourse. Significantly, there is some evidence
that this is so.
that th is is so.
Notc first that Sidncr's observations about thc push-down stack bchavior of
Note lirst that Sidne(s observations abollt the push-down stack behavior of
discoursc ftci arc drawn mostly from cxarnplcs of narrativcs -- that is, slorics that
discourse foci arc drawn mostly from examples of narratives -- that is, stories that
follow chronological causal scqucnccs. h s a rcsult, thc propositional structure of
follow chronological causal sequences. As a result, the propositional structure of
niirrativcs tcnds to follow a scqucncilig dctcrlnincd by timc: first PI happened,
narratives tends to follow a sequencing determined by time: first PI happened,
thcn Pa, and st) forth. 'lhis timc-dctcrmincd structurc is rcflcctcd in the basic
then P2- ;md so forth. This time-determined structure is reflected in the basic
suck-likc behavior of foci.
stack-like behavior of foci.
nut one can also imaginc a more gcncral situation where propositional
Out one can also imagine a more general situation where propositional
structurc is not rcstrictcd lo simplc linear ordcr. Indced, [Grosz 19771 has
structure is not restricted to simple linear order. Indeed, [Grosz 1977] has
invcstigatcd just such a domain: a diwoursc about nsscmbling a mnchinc (a pump)
investigated just such a domain: a discourse about assembling a machine (a pump)
from its parts. Grosz discoilcrcd that ia h i s situation co-indcxing dcfaults sccm to
from its parts. Grosz discovered that in this situation co-indexing defaults seem to
follow rhc dccornpositional stn~cturcof thc task itsclf. For cxamplc, if the pump
follow the dccompositional stmcLure of the task itself. For example, if the pump
could bc asscmblcd by first bolting on a sub-asscmbly. 1). thcn a sub-assembly C,
could be assembled by first bolting on a sub'assembly, n, then a sub-assembly C,
and 8 in turn dcmandcd bolting on a brackcl turning scrcws, and so forth, then
and B in turn demanded bolting on a bracket, turning screws, and so forth, then
hcarcrs sccrn to follow this trcc-likc task structurc when figuring out what to
hearers seem to follow this tree-like task structure when figuring out what to
63
co-indcx with
with what.
what. Grosz'
Grosr' observations
obscrvatiuns support
support the
thc contention
contention that
that itit isis the
the
co-index
propositional structure
structure of
of aa discourse
discoursc that
that shapes
shapes its
its syntactic
syntactic form.
form. Sidncr's
Sidncr's linear
lincar
propositional
ordering, then,
then, becomes
bccomcsjust
justaa special
spccial sub-case
sub-cascof
of task
task structuring
structt~ringwhere
whcrc the
thc task
task isis
ordering,
lincarly ordered
ordcrcd by time.
timc. Observe
Obscrvc tilal
that Grosz'
Grosz' more
more general
gcncr;~l tree
tree structure
structure
.Iinearly
tree may
may wel1
wcll be
bc linear
linear in
in appearance,
appcarance.
subsumes Sidncr's
Sidncr's in
in that aa sub-section
sub-scction of
of aa tree
subsumes
and hence
hcncc could
could act
act like
likc one
onc ofSidner's
ofSidncr's focus
fixus stacks.
stacks.
and
It
is
intcrcsting
to
obscrvc that
that the
fit computational
~(~mpotational
advantage accruing
accruing from
from the
the
It is interesting to ohscrrc
advantage
modcl also
also ohtains
obtains inin ti,e
rhc more
rnorc genem!
gcncrill case
casc of
of tree-structured
trcc-strtrcturcd
Sidncr stack
stack focos
focus model
Sidner
x i Suppose.
Suppose. for
for example.
cxamplc, that a task imposes
imposes a natural tree-structured
ucc-structurcd
foci.
on a discourse.
discoursc. Adopting
Adopting Sidncr's
Sidncr's focus
ftxus accessibility constraint,
constraint.
dccolnposition on
decomposition
only antecedents
antcccdcnts at certain sub-tasks
sub-tasks would
would be
bc available
available for
for co-indexing.
co-indexing.
only
Ilepcnding on the
thc branching
branching slructllre
structure of the ulsk.
task, many co'
co-indcxing
Depending
indexing possibilities
(Thc exact
cxact savings
savings would
would depend
dcpcrid upon the task
would thus be ruled out. (The
decomposition.) Now observe
obscrvc t11,1[
that a linear
lincar narrative
narrati\'c task structure
slructurc ---- that assumed
assumcd
decomposition.)
Sidncr
actually obtains the
thc WOfst
worst possible
pussiblc savings,
savings, since
sir~ccunder
undcr a strict
strict linear
lincar
by Sid
ncr --- actually
;dl preceding
prcccding antecedent
antcccdcnt foci
foci are
arc potential
potcrltial candidates,
candidatcs. We see
scc then
thcn that
order all
Sidncr's theory
Uicory obtains only a lower
lowcr bovnd
bound on the
thc computational
colnputational savings
savings to be
Sidner's
adoptingaa focus
focus stack model.
lnodcl. /\
A morc
lnorc complex
complex theory
lheory of task stmcture,
structure.
gained by adopting
corrcct, would probably do better.
if corrcct,
1.3 Kaplan:
Kaplan: the
tllc world
norld as database
1.3
So far,
"connect" thc
far, we bavc
have not yct
yet providcd
provided any way to "connect"
the rcprcscntations
representations of
discourse entities or foci provided
providcd by Wcbbcr
Webber and Sidncr to thc
the world. One way
creatc a sirnplc
simple artificial
artificial world. Kaplan's approach
to attack this problem is to crcatc
docs
does just
just that: in Kaplan's systcm,
system, the world of
of actions and intentions is boiled
down to just
one,
that
of
asking
qucstions:
the
extcrnal
just one.
questions;
external wo~ld
world of objccts
objects is reduced
to that ofobjccts
of objccts and thcir
their properties in a databasc
database of rclational
relational attributes.
Formally,
Formally. we could define such a database as a triple <X,
<X. Die,,
DiEI U>. X
X is a
finitc
finite sct
set of
of database &&&,
objects. likc
like "John".
"John", "Mary"
"Mary" "Computer
"Complller ricncc
science 101". 1 is a
finitc
finite set of allributcs
attributes uscd to indcx
index thc
the domain D such as "scx",
"sex". "Grade",
"Grade", or
"year". U is a function dcfined
defined ovcr
over attributcs
attributes and thcir ~alucs
values that retrieves the
g
ill of
of objccts
objects that have a particular value of
of a specified attributc,
attribute. c.g.,
e.g., U(scx,male)
U(sex,male)
will rcturn
rcturn the subset of
of X
X that bas
has thc attribute "scx"
"sex" with thc
the value "malcs"
"males" -John, etc. The crucial point is that a databasc
database is defined so that it has a
quintcsscntially
quintessentially extensional scrnantics
scmantics ---- what U
U returns is a ill of
of objects. AA
qucstion askcd
asked of
of a databasc,
database. a databasc
database w,
ID!ill. is then simply somc
some scqucnce
sequence of
of
calls to thc
the interrogating function U.
Thc
The goal of
of a datibasc
database user in making a query is assumcd
assumed to be simply to find
--
Bcrwick
Berwick
64
64
out information about the attributes of objccts. Considcr a databasc that has
out information about the attributes of objects. Consider a database that has
infarmation about studcnls, thc courses thcy havc taken, thcir grades, and so forth.
infonnatiol1 about students, the courses they have taken, their grades, and so forth.
A typical qucry is, "Which students got A's in co1nputcr scicncc 101?" 'Ihc point
A typical query is, "Which students got A's in computer science 101?'' The point
of thc qucry is to "find out about X", whcrc X is thc act of objccts possessing the
of the query is to "find out about X", where X is the set of objects possessing the
value "A", for atlributc="gradc in computcr science 101". So wc havc finally
value ";\", for attribute ="grade in computer science 101". So we have finally
forgcd thc last link in the discourse chain:
forged the last link in the discourse chain:
1
2
3
4
1
2
3
4
surface
--->surface
--->logic;ll--->
(1.F. Scts of individuals>--->])atabase
surfacestring
--->surface
>logiC<ll--->
fonn <LF. Sets ofindividuals>--->Databasc
struclurc
string
structure fonn
(query)(query)Sidncr. Wcbbcr
Kaplan
Kaplan
Sidncr, Webber
Actually, what Kaplan docs is to tclcscopc thc proccss modularized by Wcbber
Actually. what Kaplan docs is to telescope the process modularized by Webber
and Sidncr a bit collapsing intcnncdiatc stcps of recovering stirface structure and
and Sidncr a bit. collapsing intennediate steps of recovering surface structure and
constnrcting 1.F into a singlc computational step:
constructing I.F into a single computational step:
surfacc ---> hybrid logical form ---> LFintcrpreted
surfacestring
---) hybrid
logical stnlcturc
f\lrm ._-> LF
interpreted
sy~ltaclic
with
rcspcct to a database
string
syntactic stnlcture with respect to a database
Kaptan's hybrid synlaclic/logical form is dubbcd MQ1. for "mcta query
Kaplan's hybrid syntactic/logical fonn is dubbed MQI. for "meta query
language". MQI. has all ~ h cpropcrtics of a hybrid: it cncodcs rhc surface
language". MQL has all the properties of a hybrid: it encodes the surface
constirucllt structure of a qttcry as well ns a sl>ccific:ttion of which polcntiaI scts of
constituent structure of a query as well as a specification of which potential sets of
objccts in thc databm thcsc surfi~ccplirascs coiild dcnotc. But how do we find
objects in the database these surface phrases could denote. But how do we find
out what objccrs thcsc imjllicd scts do pick out of thc database? ?his is not
oul what objects the~e implied sets do pick out of the database? 111is is not
difficult: givcn an hiQI, cxprcssion, with a11 its implied scts of objccts and thcir
difficult: given an MQL expression, with all its implied sets of objects and their
attrib;tcs, wc simply construct a qucry that docs thc actual job of "looking up the
allrib~tes, we simply construct a query that docs the actual job of "looking up the
answcr". returning thc sct of objccts that mccts thc stipulations dcmandcd by the
answer", returning the set of objects that meets the stipulations demanded by the
MQI, cxprcssion. Aftcr ;ill, h i s is just what ~llcdalabasc was dcsigncd to do in the
MQL expression. After all, lhis is just what the datahase was designed to do in the
first placc. Ihus, to intcrprct tllc question. "Who did John kiss?", Kaplan'ssystem
first place. 11lus, to interpret the question, "Who did John kiss?", Kaplan's system
would first build a combined syntactic/l-F rcprcscntation,
would first build a combined syntactic/LF representation,
65
Connectives and quantifiers (and. cvcrv. some. .I -- arc intcrprctcd as requests for
Connectives and quantifiers (m every. some..,) -- arc interpreted as requests for
set operations, as onc would cxpecl frt!m an cxrcnsional semantics. For example,
set operations, as one would expect fro.m an extensional semantics, For example,
"Who got A's in Cornputcr Scicncc 101 and A's in Coinputcr Scicncc 102?"
"Who gO[ A's in Computer Science 101 and I\'s in Computer Science 1027"
simply in~crscctsthc two scts rctumcd by thc two halvcs of thc full query.
simply i~ler~ccts the two set~ retumed by the two halves of the full query.
S~milarly,qi~antiticrsarc intcrprctcd simply as a scrics of qucrics, c.g., evcrv x Pb)
Similarly, quantifiers arc interpreted simply as a series of queries, e.g. every !fu)
is translatcd as x l P(xl)&x21'(x2) ...ctc.. itcri~tingover cach Nt' in the sct rctunied
is translated as Xl P(xl)&x2P(x2)...etc., iterating over each NP in the set retumed
by the non-quantified query, 1
1. As Kaplan obscrvcs, thcrc are some inlcrcs~ingqucslions to resolve hcre involving the scope of the
I. As Kaplan
observes,asumes
there are
some interesting
10 resolve here involving the scope of the
&. Kaplan
il always
taka widcQuestions
scopc: M(Peoplc
who got an A in coum 1,people who
~. Kaplan
it always
got ana~umes
A in mum
2). takes wide scope: And(People who gOI an A in course I. people who
gOI an A in course 2).
66
Berwick
Advise:
Advisor;--)Daulbase Field {AdviseeS/ProjeclS}-')ProjeclS
or~
wl~ich
databssc fields
liclds corresponding to thc
following
which would list Uic
tl,e rclcvant
relevant datJbase
the following
dat;~bascrepresentation:
rcprcscntation:
database
Addso"
Smith. J.
Advisces (---Advisee
<---Advisccsubfic1d
subficld
Jones, A.
Joncs,
Charles,
Ch;lrlcs, M.
Projects
(---Project
<---Projcct subfield
sublicld
Co~npilcrllcsign
Compiler
Design
The
advise would UIIIS
thus infi11.111
infhrm UIC
the system
that to find
find out.
'l'he entry
cntry for
Tor ndvirc
systcnl tliat
out whether
wlicthcr X
&&
thc advisor subfictd
subficld of lllc
database, then
d ~ c ntracc
frum
ill.yiscs Y. look up the
the database.
tnlcc through from
point to subtlclds
[Idviscc or and return thc
the objccts
there.
that p(1i11t
subficlds 3dViSCC
ohjccts listed thcrc.
1-1. t .1\ lQL ;,md the interaction or s)'ntax and
sl'm~Hltic~
As mcntioJ)ctl.
mcnti~>~~cd,
hlQl. is iJa hybrid LF/surfacc
i.l:/surfncc syntax
sp~i(:lxlangu:lgc.
I:~nguagc. It is instn~ctivc
MQL
inSlmctivc to
colilpnrc
1.1:. Consider
Coasidcr lhe
lhc ",Howing
Ibllowill[: example.
cx.~mplc. Snppose
Supposc the
compare it with Wchbcr's
Werber's 1.1'.
query
to Kaplan's
qurry rrcscnlt'd
prc?.cl~lrdtci
Kaplnn's sy51.('m
systrrn is:
is: "Which user')
users h,:lVC
linvc accounts on projects
projccts
sponsored by NSI,?'
NSF?" The
reflecting the
'l'lic MQI.
MQI. structure built
b~liltfrom this query,
qtlcry, rcflccti~lg
surface syntactic
sy111:sticstrLlcture
structure ofLhis
o f U~isquestion.
question, looks like this:
I1
Uscrs
llavc
("Ilsvc" is just a paU~
uscn to projccts)
path from users
projects)
Hove ("llavc"
Amounts
AccOllnlS
1 an
I1
Oil
Projects
Projects
NSI:
NSF
(rtnolhcr rcslriction)
Sponsored by
hy (another
restriction)
67
'Illat is, thc structurc of Lhc qucry is that (Uscm havc (Accounts on (I'rojccts
'Illat is. the structure of the query is th;]t (Users have (Accounts on (Projects
sponsorcd by (NSF)))). 'I'his structurc is nearly a mirror i~nagcof a Wcbbcr-style
sponsored by (NSF))), This structure is nearly a mirror image of a Webber'style
rcprcscntation for thc salnc scntcncc:
representation for the same sentence:
(wh-x:uscr)
(wh-x:user)
3y : ~ ( y:Account)[31.: h(u:Projcct)
3y:A(y: ACCOll nt)[]z: h(u: Project)
[Sponsor NSF, u] .
[Sponsor NSF. ul .
[On y, z]] . IJave x, y
lOn y, z]] , Have x, Y
'I'lic bri~ckctingrcflccts thc rcstriction scts enforced by thc MQI. verbs and thc
The bracketing renects the restriction 1sets enforced by the MQL verbs and the
prcpositiuni~lrcstriction chains.
prepositional restriction chains. 1
Finillly, wc obscrvc that Kaplan docs M advance a rulc likc QR that "raiscs"
Finally, we ()bser\'e that Kaplan docs not advance a rule like QR that "raises"
quantifiers fl-om their surfire positions and adjoins tlic~n to the front of a
quantifiers from their surface positions and adjoins thcm to the front of a
scntcncc. hlQI, simply iIssumcs that whcrcvcr n quanlificr is found in thc surface
sentence. f\lQL simply assumes that wherever a quantifier is found in the surface
conaitucnt structtlrc o i a phrasc dctcrrnincs its scoping relationship to thc rcst of
constituent structure of a phrase determines its scoping relationship to the rest of
thc scntcncc. Ilur. as wc h a w sccn, this cannot always givc thc right answer,
the sentence. BUl as we have seen, this cannot always give the right answer,
bccausc it docs nut allow for ~ h scoping
c
ambiguitics of, Tor cxa~nplc.Evcrvone
because it docs not allow for the scoping ambiguities of, for example, Everyone
bc modificd to incorporate a
loves
somconc.
In
thcsc
cascs,
MQI.
could
--In these cases, MQL could simplysimply
loves someone.
be modified to incorporate a
rule likc QR.
mle like QR.
In sum, to gct fin answer to a qucry in Kitplan's systcm wc take thc following
In s,um, to get an answer to a query in Kaplan's system we take the following
stcps:
steps:
1. hl;~pLhc surfi~cestring to thc MQI, rcprcscntation
(roughly, Wchbcis I..[' without qitantificr adjunction)
(roughly, Wehber's LF, without quantifier adjunction)
1. t\1ap the surface string to the MQL representation
68
Berwick
the query
3. Pcrfonn thc
qilcry (or series
scrics of
o f queries)
qucrics) as indicated
indicated
by thc
Q l . stmcture.
stmcturc.
the M
MQL
4. Return
Rcturn thc
sct of
o f objccu
qucry.
the set
objects satisfying the query.
Notc
sct of
o f objects
objccts retllrned
r c t ~ ~ r n bby
c yd Step
Stcp 4 could be
bc empty.
cmply.
Note that thc
the set
1.3.2 The
'l'hc intcrOlction
intcrartion or
of s)'ntax
syntax and semantics
sc~nantics
1.3.2
have seen.
Kaplan's logical form,
As we liavc
SCCII.
the syntax
s y ~ i of
~ xK;~plan's
filnn. MQL,
MQI.. in
in part determines
dctcrmincs the
meaning
q u c s t i ~ nin Uic
SCIISC Uiat
Uic order
ordcr of
o f constituents
co11stili1cntsdctcnnincs tile
die
meaning of a question
the sense
that the
databdsc path
o f thc
qucry, licncc
sct returned
rcturncd by the
lllc query.
qucry, hence
licncc the
Uie
databJse
path of
the query,
hence ihc
I..hc set
meaning of
the query
qucry given
givcn an
;in extensional
cxlcnsional semantics.
sc~nantics. So syntax call
can influence
rnc;!nlng
o f thc
semantics. For example,
the two
cxamplc, differcnt
diffcrcnr syntactic
synt;~ctic(MOl.)
(MQI.) structures
structurcs arc built for thc
questions.
professors taught studCIlL'i
qucstions. "Which
"Which profcssors
studcn~sin
i n computer
computcr science" and "Which
studcnts were
wcrc taught by profcssors":
students
professors":
[Wh'professors
i n computer
computcr science)]
rcicncc]]
[Wli-professors[[taught
wught students
stodcn~sin
Whrr::
suuclure:
cormpunding MQI.
corresponding
MQI. structure:
wh-~rofe~r~
taughtt
. 1
I
,in
students
ID
in computer
campuler SCIence
science
1 taught by
I
by profby presson in
t
computer science
skna
computer
Since Ute
the restriction
MQI. structures
rcstriction sets specified
spccificd by the two MQI.
structurcs arc
are different
69
in unc, wc ask about sludcnts in computcr xicncc, in rhc othcr, wc ask about
in one, we ask about students in computer science, in the other, we ask about
pmkssois in cotnpuicr scicncc -- ~ h c1nc:lning of il~csctwo qucrics is dislinct. I n
professors in computer sciC'nce the meaning of the-se two Qlleries is distinct. In
addi~ion.ofcour!;~,thc main quciicd sct is dircrcot in ac two scntcnccs.
addition. ofCOllrsc. the main qucdcd set is different in the two sentences.
If synrax influc~~ccs
sc~n:~nticsin K;~plan'ssyslcal, what i~boutLTIC other way
If syntax influences semantics in Kaplan's system. what aboulthc other way
around? C;ln di~l;tt>;is~
S C I ~ ~ I I ;I!ICT
~~~S
CY
S I I ~ ~'I'hc
I X ?~ I I S W isC ~YCS. If the di~bhasc
around? Can database semantics alter syntax? The answer is yes. If the database
p;~lhas cxtrclctcd A,an~a panicular parsc docsn'l milkc scnsc. wlrcrc "docsn't makc
path a~ extw<;tcd fr~1T1 a partil.:ular parse doesn't make sense. where "doesn't make
scnsc" nicnns U~attllc qucry pat11 sclcctcd was longer lhnn thc qucry palh
sense" means lIlat the Query path selected was longer than the query path
aeoci;itcd with anorhcr pa~xcfi)r dlc qucrjj, Ihcn lhc pnlsc with d ~ cxccssivcly
c
associ;lted with nnother parse for tile query, (hen the parse with tile excessively
l o n g qiicry p;~Ehis tli~cardcd. 1-or cxnmplc, considcr Ihc qucsi.iun. " ~ l l i c huscrs
long query path is di~cardcd. For example, consider the qllesliun, "Which users
work o n projccrs in al.c;i thrcc 1113t 81.c in divisicei 35011?" 'Iliis qucry has a t lcast
work on projects in area three that arc in divisioll 350m" 'Illis query has ~t least
two parscs: onc whcrc thc scntcn~inlco~nplcrncnt"Lhat arc in division 3500"
two p;mes: one where the sentential complement "tIlat arc in divisioll J500"
rnudilics uscn. and onc whcrc il modilics "projects." l'hc altcrr~alivcdatrlbase
modifies users, and one where ;.1 modilies "proj('cts." The alternative dat.1ba:,c
paths for thc:;~~ N S C Sijrc as fullows:
paths for the~;e parses me as follows:
Parsc I
Parse I
NP
/ \ AS,
Corresponding 11i1tnb;lscpath
Corresponding Databilse patll
division number
program tncr
program mer~ s c r s
Su3crs t!ut arc in
'ii?crs tl~ arc
in
division
3500 users uscr~i;imc
t1ivision 3500
username
N
Parse 2
NP
NP
division nurnber
A
NI S,
$GjZ d13t arc users
in
p;:@Cct:~ t1l<lt arc in
~"
USCIS
L I S C 11unibcrs
~
projcct litlcs
division 3500 llser numbers
division 3500
project titles
70
Berwick
Givcn Kaplan's account of rncaning, wc can now rcturn to our original problem of
Gi\'cn Kaplan's account of meaning, we can now return to our original problem of
answering qucstions. What docs somconc intcnd whcn thcy ask a qucstion?
answering questions. What docs someone intend when they ask a question?
Kaplan assumcs just what onc would cxpcct from a databasc scrnantics: a
Kaplan assumes just what one would expect from a database semantics: a
qucstioncr want to find out &hat collccrion of objects. if any, mcet thc
questioner want to find out \'0 hat collection of objects, if any, meet the
spccification laid out in thcir qucstion -- an If; of thc form, "Wh-x, such that
specification 1<lid out in their question -- an l.F of the form, "Wh-x, such that
I'(x)", wllcrc I'(x)= snmc spccification of x via a combination of attributcs and
P(x)", where P(x):::: some specification of x via a combination of attributes and
valucs for x. 'Ihc spccificalion l'(x) in turn dctcrtnincs what thc qucry fiinction
values for x. '111e specification P(x) in turn determines what the query function
U(attriburc. value) must look likc. Everything works finc if U rcturns a list of
U(attrihure, value) must look like. Everything works fine if U returns a list of
datlbasc ol>jcctssatisfying 1'. I3ut whar should wc do if U rctlrrns tllc cmpty set, if
dawbase objects satisfying P. But what should we do if U returns the empty set, if
no objcct satisfics I? An anafysis of ll~issccond possibility is what occupics thc
ill! object satisfies P? An anarysis of Ulis second possibility is what occupies the
rcrnaindcr of Ki~plan'sstudy.
remainder of Kaplan's study,
All cmpty scts arc crcittcd cqual. hut solnc arc less cqual than othcrs. An
All empty set,> arc created equal, hut some arc less equal than others. An
analysis of
U Ilappcns tt) rcrurn an clnpty sct is thc lynchpin of Kaplan's
analysis of why U happens to return an empty set is me Iynchpin of Kaplan's
analysis of "crnpty" qucstions such as "Who got A's in compurcr scicncc 101". In
analysis of "empty" questions such as "Who got A's in computer science 101". In
this rcspcct, t l ~ ctcrm "cmpty qucstion" is wclf choscn.
mis respect, the term "empty question" is well chosen.
'I'o bc concrctc, considcr Kaplan's cxa~nplcqucry. "llid Sandy pass the bar
To be concrete, consider Kaplan's example query, "Did Sandy pass me bar
cxam?" Ily assumption, t l ~ cgoal of thc qucry is know whcthcr Sandy passcd the
exam?" By assumption, the goal of the query is know whether Sandy passed the
bar cxam. (As wc ail1 scc, this corresponds to a KNOWIF; goal in hllcn's more
bar exam. (As we \~i1I sec, this corresponds to a KNOWIF goal in Allen's more
gcncral sgstan.) '1'0 p r r ~ c c dfurthcr, wc rnust know what it rncans to "pass" a bar
general system.) To proceed further, we must know what il me<lns to "pass" a bar
cxam. A s usual. thc spstcm's knowlcdgc of "pass" is built inlo a dictionary entry
exam. As usual, the system's knowledge of "pass" is built into a dictionary entry
that specifies its mcaning with rcspcct to l11c particular dambase at hand. Lxt us
that specifics ils meaning with respect to the particular database at hand. Let us
an X just in casc X is an cxam that Y took; that Y's grade was
say that Y
say that Y passes an X just in case X is an exam that Y took; that Y's grade was
grcatci than 60; and that Y is a studcnt 'ihus
dcfincs a sct of database
greater Ulan 60; and that Y is a student Thus pass defines a scI of database
attributcs and \lalucs that must bc satisfied if Y may bc said to havc passcd an
attributes and valucs mat must be satisfied if Y may be said to have passed an
exam.
exam.
Evidently wc havc that Sandy passcs the cxam if Sandy takcs thc cxam, and if
Evidently we have that Sandy passes the exam if Sandy takes the exam, and if
Sandy gcts a score of grcarcr than 60, plus somc other conditions that might be
Sandy gets a score of greatcr than 60, plus some other conditions that might be
rclcvant. Symbolically,
relevant. Symbolically,
Ili
71
pl~dicatcthat has an ~ ~ ~ l SC!
p i yas ILScxtcnsioii rcst~lbin a failure of the.
predicate that has an cmply se~ as it~ extension result" in a failure of the.
prcsuppo:;itior:s bchind a qucry. If A dcnotcs ~ l i csct rcturncd by some qucry
presuppo:;itions behind a query. If A denotes the s(;! returned by some query
function U tlchncd so ;ts tu spccily Sandy's p;~ssiogt l ~ cexnrn, and if A, the sct of
function U dcfined so as to specify Sandy's passing the exmn, and if A, the set of
cxaln pnsscd by Sandy, is c~npty.thcn Sandy did not pass thc cxatn, llut &
exam passed by Sandy, is cmpty, then Sanuy did not pass the cxam. But why
could A bc cmpty? h is thc ciripty sct if cidlcr I$ or 112 arc cnlply if Sandy
could 1\ be empty? A is the cmpty set if ciUler HI or 11 2 arc empty -- if Sandy
cithcr didn't take thc cxam or didn't gcl n g1.rldc ahovc 60 -- ~hcnSandy didn't pass
either didn't l4lke the exam ur didn't gel a gml1e ahme 60 -- then Sandy didn't pass
tl~ccx3nl. Ilut thcn wc now llaw access to whv A is crnpty. Sincc A implics
the exam. But t11en we now have access to why A is empty. Since A implies
Dj&117. n01(131&112)9~10t-A. 'I'lit~sif cilhcr lil ell. I$ is cnlpty, tllc~lA is crnpty.
B!&B2. not(B I &11 2);:> not-A. Thlls if eiLhcr HI or 112 is empty. tIlen A is empty.
L.ct 11s ~ ' 1 1 1thcsc UIC possihlc rcasons for A's cmptincss. lir;~phicnlly.A is a subset
Let LIS call t11cseUle possible reasons for I\'s emptiness. Graphically. A is a subset
of thc 11's. and i f cilhcr of Lhcin cinpty. tllcn A must be:
of Lhe Irs.md if eit1ler oflhem arc empty, then A must be:
--
.Figure1:igurc
7 Subsct rclstio~~sliips
rcflcct thc implicational
7 Subset relationships reflect the implicatianal
struclurc of a qucry.
structure of a query.
Berwick
72
the chain. sincc it is thc rcasnn for the emptiness of the qucry. For cxamplc,
the chain, sincc it is the !illQ.ll for the emptiness of the query. For example,
considcr Ihc qucry. "How many studcnts got A's in Co~nputcr~cicnccJOI?"
consider the qucry, "How many students got I\'s in Computer Science JOI?"
Assumc tlint this qucry has the following ilnplicational structure:
Assume that this query has the following implicational structure:
studcnts ---> sludcnts
---> course
students --- > students
._-) course
tak i~lgcourse
given
gctting A's
getting A's
tlk ing course
given
and thc corresponding database subsct stnlcture:
and the corresponding database subset stnlcture:
C o u r s eGiven
Course Given
Computer S c i e n c e 101
Computer
Science
Name Grade101
Name Grade
John
A
John Sandy
A
B
Sandy
Fine Arts
FineArts
'1'0 find thc studcnts who got A's i n Cotnputcr Scicncc 101. Kaplan's system
To find the students who got I\'s in Computer Science 101, Kaplan's system
builds a qucry that rurls down thc list of courses till it hits computcr scicncc 101,
b~i1ds a query that rlillS down the list of courses till it hits computer science WI,
tllcn extracts Ole "gradc" subficld. and finally, asscmb!cs into onc set all those
tllen extracts tile "grade" sub field. and finally, assembles into one set all those
narncs associated with a grade of A. Givcn this procedure. it should be apparent
names associated with a grade of 1\. Given this procedure, it should be apparent
that if the "Coursc Given" databasc object docsn't havc thc attribute value
that if the "Course Given" database object doesn't have the attribute value
"Computer Scicncc 10lW,the qucry procedure can immcdiatcly return an empty
"Computer Science 101", the query procedure can immediately return an empty
value. Sincc this was not the last stcp in thc query prtscdurc, wc simply report
value. Since this was not the last step in the query procedure, we simply report
this, by conjoining an cmpty sct answcr with the reason for the set's cmptincss:
this. by conjoining an empty set answer with the reason for the set's emptiness:
None, because thc course was not given.
None, because the course was not given.
So far then, we have sccn that the B(s are necessary conditions for A to be
So far then, we have seen that the Bi's arc necessary conditions for A to be
non-empty. Plainly they arc not sufficient conditions howcver, and this
non-empty. Plainly they arc not sufficient conditions however, and this
distinction is closely related to thc possibility of appropriatc "no" answers. To see
distinction is closely related to the possibility of appropriate "no" answers. To see
this. considcr thc cnsc where not-A (a "no" answer) can be a valid rcsponsc to a
this, consider the case where not-A (a "no" answer) can be a valid response to a
qucstion. By the above analysis, any of thc ni's bcing cmpty can make A empty.
question. By the above analysis, any of the Bi's being empty can make A empty.
So it needn't bc true that if A is empty, thcn a particular Bi is cmpty. (It is not
So it needn't be true that if A is empty, then a particular Hi is empty. (It is not
ncccssarily true that nor-Aanot-Hi.) 'lhis is the situation in which Kaplan dubs
necessarily true that not-I\~not-Bi') Ibis is the situation in which Kaplan dubs
Bi a prcsom~tion;Bi is simply a necessary condition for A. Now suppose that a
Hi a presumption; Dj is simply a necessary condition for A. Now supposc that a
"no" answer demands that Bi bc empty, i.c., that not-/\ *not-Bin But then we
"no" answer demands that Dj be empty, Le., that not-I\ ~not-Bi' But then we
havc that not-ni*not-A and not-h *not-Hi, LC., A iff Bi. Then whenevcr A is
have that not-Hj"::::.not-A and not-A =::::-not-B j, Le., A iff Bj Then whenevcr A is
true, Bi is true; when A is false, Ili is false. l h i s is the case in which Kaplan calls
true, Dj js true; when A is false, Bj is false. "Jbis is the case in which Kaplan calls
73
74
Rerwick
would fill certain gaps in his syntactic analysis of English. As mcntioncd, Knplan's
would fill certain gaps in his syntactic analysis of English. As mentioned, Kaplan's
syntactic analysis has no analoguc of a QR rulc. 'I'hcrcforc. it cannot capturc
syntactic analysis has no analogue of a OR rule. Therefore, it cannot capture
ambiguous quantifier scopings, nor dctcrminc whcn such altcrnativc scopings are
ambiguous quantifier scopings, nor determine when such alternative scopings are
pn~hibitcd. It would bc a simple matter to add sucll a rulc to thc MQI. parsing
prohibited. It would be a simplc matter to add such a rule to the MOl. parsing
system, tailorcd dircctly aftcr thc QIZ n ~ l cof modcrn gcncrativc hcurics. Sccond,
system, tailored directly afler the QR rule of modern generative theories. Second,
Kaplan makcs no usc of rhc co-indcxing work of Sidncr discussed abovc; lhcre is
Kaplan makes no use of the co-indexing work of Sidner discussed above; there is
no way for thc MQI. systcln to kccp track of thc introduction of discourse cntitics
no way for the MOL system to keep track of the introduction of discourse entities
and tl~circo-designators. Oncc again, this machincry could be liftcd whoIcsalc
and their co-designators. Once again, tllis machinery could be lifted wholesale
fro111Sidncr's work; one nccd only add a co-indexing notalion, and thc samc rulcs
fmm Sidner's work; one need only add a co-indexing notation, and the same rules
would carry ovcr Tor Lhc MQl. structurc. I n fact, i l wouId probably bc bcst to split
would carryover for tl1e MOL structure. In fact, it would probably be best to split
thc ho~nogenatcdMQI. lnnguagc back inlo thc two rcprcsc~llationsfrom which it
the hOlTIogenated MOl. language back into the two representations from which it
was distillcd -- surbcc syntax, and logical form. Finally, Sidncr's fixus rules, as
was distilled -. surface syntax. and logical form. Finally, $idner's focus rules, as
rc-cast in thc gi~isco f predication indicatc "what is bcing talked about" in a
re-cast in the guise of predication , indicate "what is being talked about" in a
discounc. hcncc, in a qucstion. I3y incorporaring thc f t ~ u salgorith~ninto the
discourse, hence, in a question. By incorporating the focus algorithm into the
MQL. stn~cturc via a rcprcscntation rcflccting a "x sat.P(x)" form -- Kaplan's
MQL structure via a representation reflecting a "x S.t. P(x)" form . Kaplan's
system could cxploit Sidncr's rulcs directly fur dctcrmining what thc topic of the
system could exploit Sidner's rulcs directly for determining what the topic of the
qucstion is,
question is.
--
75
thc stxion agent may assutne lhat pcoplc appronching him with questions only
the s[,.1tion agent may assume that people approaching him with questions only
want to find out ahout boarding trains or mccting arriving trains -- and hc may
want to find out about bOJrding trains or meeting arriving trains -- and he may
assulilc that thc pcoplc approaching him also know that. Il~us,agcnt and patrons
assume that the people approaching him also know that. lllUs, agent and patrons
sharc a sm;~ll~ ~ i ~ ~ of
n bmutual
cr
bclick; thcrc arc just two possiblc "top-lcvcl"
share a simil number of mutual beliefs; there arc just two possible "top-level"
c
goals. Fach of thcsc goals Icads to a characteristic planning scqucncc: if o ~ l wants
goals. r'.ach of these goals leads (() a characteristic planning sequence: if one wants
to board a train, thcn onc must buy a tirkct, find o ~ i that from gate and at what
to board a train, then one must buy a ticket, find out what from gate and at what
timc thc train Icnvcs. go to that gntc. and so forth. 'I'hc kcy point is that diffcrcnt
time the train leaves, go to that gate, and so forth. The key point is that different
goals can cxpantl into diffcrcnt planning scqucnccs, hcncc c~lcntuallytcrminatc as
goals can expand into difTerent plJnning sequences, hence eventually terminate as
diffcrcnt spccch acts. 'Ihus Ihc top-lcvcl plan of a qucstioncr could be idcntificd if
diffelent speech aelS. Thus the top-level plan of a questioner could be identified if
the spccch acts it Icads w match what thc hcarcr actually observes:
the speech acts it leads to match what the hearer actually observes:
GoaIs
Goals
1. b a r d train
1. Board train
G c t tickct
Get ticket
Spccch act
Find out about dcstination
Speech act
Find alit about destination
'Ihc train to
Find out gate
The train to
Find out gate
Windsor?
Find out dcparturc time
Windsor?
Find out departure time
2. Mcct train
2. Meet train
Find out gate
Find out gate
Find out arrival time
Find out arrival time
Berwick
76
AHen's morc
more gcncral
general ;~ppro;lch
approach to
to intentional
intentional bchauior
behavior pcrmits
permits his
his systcrn
system to
to
Allcn's
make largcr
larger lcaps
leaps of
of infcrcncc
inference than Kaplan's. In particul;lr,
particular, it can leap at least
some of
of Uic
the infcrcnti;~l
inferential g;~ps
gaps corresponding to so-callcd
socallcd "indircct
"indirect spcech
speech acts" --sonic
of UIC
the spcakcr
speaker Uiat
that arc dircctly
directly rcflcctcd
renected iinn surface syntactic form.
fonn. An
intentions of
example is Ulc
the qucstio~l
question "Can you tcll
tell me what time Ulc
tJ1C train to Windax
Windsor leavcs?"
leaves?"
cxamplc
the spcakcr
speaker presumably illtcl~ds
intends by this st;itclncnt
statement is to find out whcn
when the
What UIC
leJves. hcncc
hence to get the station ;lgcnt
agent 10
to tell him dcpnrturc
departure time.
train to Windsor Icavcs.
Thus thc
the actllal
actual mcaning
meaning of
of this qucry
query is "'l'cll
"Tell me tllc
the dcp;lrturc
departure tirnc
time of
of thc
the train
'llius
cyen though thc
the Li
fonn
the
query is aa ycs-no
yes-no qucstion.
question. and thus
mn iof~ thc
f
qilcry
to Windsor", cvcn
be literally answcrcd
answered will1
with a simple "Yes."
could bc
reconstruct aa qucstioncr's
questioner's bclicfs
belicfs and intcntions
intentions from surface
This ability to rcconttruct
that do not explicitly
exrlidtly in2icatc
injicatc them
thelll is quitc
quite rcrnarkablc.
remarkable. It
h is purchased at
forms tllat
the pricc
price of
ofse\'erely
restricting thc
the planning
pbnning altcrnati\fcs
allernativcs that can bc
be considered. If
If
thc
sc\,crcly rcstrictillg
ll1eettrains,
slight, and a
one can only board or ~ncct
1r;lins. then the opportunity to go astray is slight
comhinatoric
combinatoric explosion ill
in infcrcncc
inference unlikely.
unlikely, It
h is not clear whether the same
marc realistic sctting where athere
dozens or
approach would work in a more
c r e were dolens
hundreds of ;~ltcrnatives.
alternatives.
Ilundrcds
'l'hcre
There arc two classic ways out of the dilemma of cxccssivc
excessive infcrcnce.
inference. One
r~ad to salvation
salvation iniglil
might be callcd
called dcus
deus cx
ex machina: invoke some computational
road
~n;lchincry
machinery that can carry out Uic
the dcsircd
desired infcrcncc
inference quickly.
quickly. Ilut
But for many A
AIl
researchers, the cscapc
escope has bccn
been to adopt the working strategy of "divide and
rcscarchcrs,
conquer" by simply assuming that knowledge ahout thousands of things can be
organized into more
morc modular units.
units. Now,
Now. olle
one can package either inference
infcrence about
plans and intentions,
intcntions, or inference
infcrence about properties, or both,
both. As for inference
about
abont intentions,
intcntions, it seems
sccms hnrdly
llardly likely
likely that every
cvcry time one hears. "Can you pass
the salt?" one runs through in toto a long choin
chain of deductions
dcductions that ends with the
conclusion that what was really meant was that someone
someone wants you to pass the salt
"Ille
obvious alternative is to squirrel away commonly occurring
(xcurring deductions
deductions as
'lhc obvious
lillie
triggcr on an observed
observed speech
speech acts directly.
dircctly. Of course, this
little "lemmas" that trigger
approach
research questions
questions ahout
about ti,e
the nature and organi7.ation
organization of
approach begs important research
these
for inference
infcrencc about properties, the
thc AI
Al literature of the past
these lemmas.
lemmas. As for
decade
dccadc hos
has heen
hccn replete
rcplctc with seveml
scvcral ottempts
attempts at quasi-naturalistic
quasi-naturalistic classifications
classifications
human intentions) into
into
knowlcdgc (including knowledge
knowlcdgc about humon
of human knowledge
orgonized
packets.
Perhaps
naturally
enough,
for
tile
most
part
ti,ese
schemes
organizcd
I'crhaps naturally
for thc
these Schemes
hinge upon carving
carving the
thc world up in a way that isis organized
organized around human
activities
activities with intuitively
intuitively discernible
discernible boundaries ---- taking
laking an airplane
airplane fligh~
flight,
ordering
ordcring tickets
tickcts in
in a train
train station,
slation, or
or ordering
ordcring hamburgers in
in restaurants.
reslaurants. Whether
these
proper joints, I leave
leave for
for the
the connoisseurs
connoisseurs
thcsc efforts
efforts have
have carved
carved Nature
Nature at
at her pruper
of fast-food
fast-food restaurants
restaurants to
to judge,
judge.
77
1.4.1 Allcn's
Allen's system: the details
ulterance is rccovcrcd.
recovered. By
meant simply
(2) The
'lhc surface
st~rfaccintention
intcntion of the uttcrancc
It). this is mcant
whcthcr
("l'cll me..."),
mc..."), a question
qucstion ("'When".),
("Whcn...), or the
whether the utterdnce
utterance is a command ("'Tell
all O,ese
revealed by
hy explicit syn(,1clic
Uicsc sentence
scntcncc types arc rcvcalcd
synklclic signals, this
like. Since ;dl
be done easily,
roles
can bc
easily. TogeO,er
'l'ogcllicr with
wit11 the O,emalic
thcn~;~tic
rolcs recovered
recovcrcd in step (I), a
(;::: what the
could want, givcn
given surface indications)
sct oof
s-f requests (=
thc speaker
spcakcr aiuld
possible set
constructcd. For example.
examplc. the uttcranl'C
uttcrancc "Docs
"1)ocs the train to
tt] Windsor leave
lcave at 4?"
4?"
is constructed.
Icads
to
the
systcm
BI
concludc
that
UIC
spci~kcr
has
madc
arcqucst
to
bc
leads
system to conclude
the speaker
made a request be informed
if
the departure time for
lrain is at 4.
fur the Windsor train
- thc
(3) The system
systcln starts
sorts expanding inferences
infcrcnccs forward from its two top level goals
(boarding and leaving
lca\.ing trains), along with a third, dummy, emply
clnpry goal. 'lhe fringe
of sub-goals currently expanded
cxpandcd from higher-level goals arc called the
exnectations. At the same
time, the system
cx~cctations.
wmc time.
systc~nstarts expanding from the initial
utterance backwards. So for instance, given the initial request
rcqucst that the speaker
(train], 4)).
systcm flcshcs
thc possible
wants to -f(dcpanurc-time
know if(depanure-lime (trainl.
4)), the syslem
neshes out the
the speaker
leave at
spei~kcrwants the train to leavc
a1 4; or the speaker
spcakcr doesn't
doesn't
goals that cilher
either thc
4: or the speaker
spcakcr simply wants to know when the train
want the train to lcavc
leave at 4;
leaves. The fringe
fringe of goals expanded
cxpandcd backwards in this manner arc
are dubbed the
judged to mesh
mcsh with an expectation
cxpcctation according to a
alternatives. An alternative is judged
set of weightcd
weighted crileria
criteria that arc simply a way of expressing
cxprcssing hthee likelihood of their
compatibility. For example,
top-level goal expands into subgoals whose
canpatibility.
examplc. if a top-lcvcl
necessary prccondidons
preconditions arc incompatible with thc
the prcconditions
preconditions of the alternative,
then
unlikely: similarly, if the expectations
being expanded forwards
thcn a mesh is unlikely:
cxpcctations bcing
contains descriplions
dcscriptions (like "boarding
"boarding a train") thaI
that is compalible
compatible with the
being cxpandcd
expanded backwards, thcn
then the two arc
alternative bcing
are more likely to mesh.
Berwick
Berwick
78
78
When tile
the score
score of
of some
some alternative-expectation
altcrnativccxpcctation combination
combination exceeds
exceeds tllat
that of
of the
the
When
dummy alternative
alternative by
by aa certain
certain amount,
amount, the
the pair
pair is
is accepted
accepted as
as the
the most
most likely
likely
dllmmy
plan the
the speaker
speoker had
had in
in mind.
mind.
plan
(4) Given
Given aa likely
likely plan
plan for
for tile
tile speaker
speaker (including
(including aa single
single "visible"
"visible" portion,
portion. the
the
(4)
observed speech
speech act).
act), the
the system
system traces
traces out
out tile
Ole steps
steps of
of the
the plan
plan to
to see
sec ifif there
there are
are
observed
any
gaps
in
preconditions
such
that
the
speaker
might
not
be
able
to
carry
out
the
any gaps in preconditions stIch that the speakermight not be able to carry out the
plan.
For
example.
the
plan
of
boarding
a
train
demands
Olat
ti,e
speaker
be
at
plan. For example. the plan of boarding a train demandstllat tile speakerbe at
the departure
departure gate,
g<ttc. implying
implying in
in turn
turn that
that the
the departure
departure gate
gate was
was known.
known. (Note
(Note
tllC
that
this
means
that
Allen's
system
must
have
a
set
of
rules
for
reasoning
about
tllat this means that Allcn 's system must have a set of rules for reasoningabout
actions,
e.g..
inference
rules
of
the
form,
"To
be
at
a
place
X
implies
that
one
actions. c.g., inference rules of the form. "To be at a place X implies that one
knows
the
volue
of
place
X,")
knows the value of place X.')
(5)
Assuming aa model
model of
of co-operative
co-operative behavior,
behavior, the
the system
system responds
respondsso
as to
to fill
fill in
in
(5) Assuming
so as
ti,e
gaps
in
the
speaker's
knowledge
that
it
has
concluded
are
necessary
to
fulfill
tile gaps in the speaker's knowledge that it has concluded are necessaryto fu]fill
the
plan. Here
Here itit can
plan itit has
has already
already inferred
inferred to
to construct
construct
the speaker's
speaker's p]an.
can exploit
exploit the
the plan
using tllC
O,e same
same thematic
thematic role
mechanism of
of Step
Step (1),
(I), but
but in
in reverse
reverse,.
aa response.
response, using
role mechanism
Note
that
the
system
says
nothing
ahout
how
this
linguistic
realization
would
Notc that tile system says nothing about how tl1is linguistic realization would
actually
be
carried
out.
McDonald's
system,
in
fact,
has
been
designed
to
handle
actually be carried out. McDonald's system, in fact, has been designedto handle
exactly
tile prob]cm
problem of
of mapping
mapping from
plan description
description to
to an
an actual
attual utterance
utterance,.
exactly tl1C
from aa plan
(Sec
4.)
(SeeChopter
Chapter 4.)
As
this procedure
not deal
deal with
with indirect
indirect speech
speech acts
acts,. Bcfore
Before
As outlined,
outlined, this
procedure docs
docs not
wckling
this
more
difficult
case
tilOugh,
it
might
be
best
to
illustrate
the
tickling this more difficult case tll0ugh, it might be best to i]]ustrate the Allen
A]]en
system
by comporing
would analyze
systemby
comparing the
the way
way itit and
and Kaplan's
Kaplan's database
dat1basesystem
systemwould
analyzethe
the
same
train to Windsor leave
sameutterance,
utterance. "Docs
" Docs tile train
leaveat 47"
4?"
In
intention:: aa Request
In Kaplan's
Kaplan's system,
system, there
tl1cre can
can be
be only
only onc
one surface
surface intention
Bcguest to
to
Inform
that Kaplan
Inform if
if P(x)
P(x) is
is true.
true. This
'This is
is an
an appropriate
appropriate choice
choice given
given ti,e
tile goals
goals that
Kaplan
had
had for
for his
his research,
research, namely,
namely, to
to develop
develop aa cooperative
cooperative database
databasesystem
systemthat
that was
was
also
transportable.
Kaplan's
system
assumes
that
there
is
only
one
goal
of
also transportable. Kaplan's system assumesthat there is on]y one goal of aa
database
databaseuser:
user: to
to know
know the
the value
va]ue of
of some
some database
databaseobject.
object. In
In addition
addition to
to this
this
immediate
immediate goal,
goal, Allen's
Allen's system
systemcontains
contains several
severalinference
inference rules
rules that
that immediately
immediately
conclude
conclude that
that if
if someone
someonewants
wants to
to Know
know the
the properties
properties of
of something,
something, then
then one
one
wants
one
of
the
possible
values
of
that
property
to
obtain;
further,
one
often
wants one of the possib]e values of tl1at property to obtain; further, one often
wants
wantsto
to know
know the
the value
value of
of an
an object
object in
in order
order to
to identify
identify itit in
in the
the world.
world. Thus
Thus
Allen's
AI]en'ssystem
systemconstruct,;;,
constructsfour
four surface
surfacealternatives
alternativeswhere
where Kaplan
Kaplan has
hasone.
one.
What
What about
about top-level
top-level expectations?
expectations? Here
Here too,
too, Kaplan
Kaplan has
hasbut
but one:
one: to
to find
find out
out
about
aboutthe
the properties
propertiesof
of some
somedata
database
baseobjcc~
object, X.
X . In
In contrast,
contrast, Allen
A]]cn has
hasthree:
three: the
the
board
boardtrain
train plan,
p]an, the
themeet
meettrain_plan,
train.plan, or
or the
the empty
emptyplan.
plan.
79
Inform if
Surfacc Inform if
Surface Inform if
Inform if
rcqucst (I)cparturc.timc (I)cparturc.timc
request (Departurc.timc (Dcpartu re. time
train-1 is 4)
train-l is 4)
train_l is 4)
train_l is 4)
Default
Knuw_value
Goals
Goals
Dcpsrturc.limce 1~cpnrturc.timc
DepJrtUfe.time. Departure.time
train-l
train-1
train_l
train_1
2 & 1:'l'hc following infcrcncc rulc expands (1).
2. &.l: The following inference rule expands (1).
lfspcnkcr wants to know some value,
Ifspeaker wants to know some value,
thcn spcaker wants either a
then speaker wants either a
positivc valuc (wants train to leave
positive value (wants train to leave
at 4) or a ncgativc value
at 4) or a negative value
(wants train not to lcave at 4)
(wants train not to leave at 4)
4. lnfcrcilcc rulc cxpands (I):
1. Inference rule expands (I):
If spcakcr wants to know somc value
Ifspeaker wants to know some value
of somc cntity, then spcakcr wants to
of some entity, then speaker wants to
identify the cntity
identify the entity
Kaolan
Allen
Kaplan
Allen
Goal
Berwick
80
--
81
Obscrvcd
Observed
Request: lnfonn
I<cqucst:
I n A ~ r nif
dcparturc.timc
departure.
time
Request: Inform
Ilcqucst:
InA~nnif
departure.time
dcyarlure.time
Know_if
Know_value
Goal
Know_value
Know-valuc
departure
time
dcp~rturctimc
Know_reference (speaker,
lime)
Know-rcfcrcncc
(spcakcr, time)
whcrc
ti~nc:=dcp;~rtorc
where time:= departure
time of train
limc
vain
Huard train:
know-rcfcrcnce
Board
tmin: know_reference
(departure
time, gate)
(dcpc~rturcti~nc,
gatc)
1':i~llre
Summary uf
of Kaplan and Allen
Allcn example
cxamplc comparison.
compa~.ison.
Figur~ 9 Summary
spcnkcr's plan
pl;ln uther
utlicr Ill;~n
Notc til:lt
t11:lt Kaplan"s
K;lplan's SystClll
systcln has no
!no model
~nodclof thc
Note
the spcakcr'$
tlwn UIC
the
simple
the speaker
wants to
the v:lluc
value of
departure time.
si~tiplcone
onc that llic
spr;~kcrw;lt~ts
PI know UIC
ofdcp;lrturc
What ncxt'
K;lp1nn3ssystem,
sgslcrn, 111c
stnlctllrc (the p:~rsc)
n~;lppcdinto
nexO! In Kaplan's
the MQl.
MQL stnleture
parse) is mapped
:I database
dnt~bascquery
qilcry -- a sequence
scquc~lccof queries
qucrics O,at
Uiat aim to rind
lind out Ihc
11
the value of
dcparturc
Suppcsc tllat
!ll;lt the
t l ~ cdatahase
det~bascstructure is such that
th;lt ihc
lbllowing
d~parturc tilnc.
time.- Suppuse
the lbllowing
sitllation obtains:
obt;~ins:
sitllJtion
'l'oronto tnin
xhcddc
Toronto
train schedule
Arrivals
3 PM
3PM
I)cpart~~rcs
DcpartllfcS
1
PM
IPM
Windsor train schcdulc
schl..dulc
Arrivals
1AM
lAM
IPM
'I'hcn the
thc subset
subsct stnlet"'e
stnlcturc for tltc
qucry looks like
likc this:
Then
the query
tllis:
82
Berwick
I h2rc ~ r ntr
c Wi~rdulrrtmin dcpllrtr~rcs. As usuol. K i ~ p l i ~systcnr
t ~ ' ~ reports back
Th..::rc arc no Windsor train dep;Jrtllrc:-. I\s usual, Kilp1<llI'S systcnl reports back
tllc Inrgcst ernply suhscl ill dris ellain as the rcasol: for lllc cmply sct qucry
the brgest empty subset in this chain ,1S the rCaSlJH
response:
response:
f~Jr
In :,unr, wc see. h i l t K3pIan and Allcn's systc'111wi!l pcrforrv in bout thi! sunc
In ~;um, we ~e(' that Kilplan and Allen's system will pCrfl1fll1 in about the same
way ii1 rcsponsc lo tint;~l>a.sc-likc
"cmply" qtlcstions.
way iii response 10 dill;lbi\~c'like "cmpty" questions.
Of ~~)IIs:;c,
Allc~~'s
systc'm can do s o ~ ~ihirlgs
t c t l ~ a lKap:an's c:!ntlc;t, !)ccausc it
Of course, ,\Ileu's system can do SOl11 I: things IllJ! Kap:an's G!llllet, !)C'causc it
ha:; (I morc complctc 1!1c,cIcl of if~cs]?clkcr's pl:~n. Fol' cx3ii1plc.il call si~pplyt h e
ha:, a more COln;yklC modd of the spc:lker's pbn. For eXJnlplc, il can supply the
g;~Lcn~r~nbcl.
c~~c:i
iT ths usi!r did not c ~ ~ l i c i ddcn!i~nd
y
it -- 5i11ccit bo is 3
gate number e\\::1 if the user did not expliciLly demand it ~i nce it tuo is 3
~~~~~~~~~y prccc~nditic~li
hr tllc su:ccss of t f ~ cin t'crrcll spcakcr's plan.
llCccs~,lry preconditioIl fl;r the SllCecss of UlC in fcneu speaker's plan.
Jil a casc lili: "lloc\ tllc Wir~dsortr;~inIcavc at 4?",tlrc surr:icc fi.11-111 of Blc
Ja a case like "I)oe\ the Windsor train leave at 4?", the $urfacc furm of the
urtcrancc n~;tu:lics thi' ii:lc~ttionto find out \V:ILYI ! h trair,
~ IC,I\CY.
'I'his is n clircct
ultera:lec ll1;lU:llC~ tit;,:: iJ~'Clltioll to lind (lut Wl1l'1l the train kayes. This is a tlircet
spccc!~crcl. li' : ; L I T ~ ~ I U folm
C
dtrs not rfircclly r ~ * r l c cdl(!
t :;pcakc~-':, ifitctition, t1lc11
spcC'ch ,1Cl. Ii' ~;ur[lce form docs nol directly rl'ricct the :;pcJkcr'~; intention, then
c!nc II;IS ,\:I i[\a.i;rh!cljpcccli act. ?'hits, 10 crhilin thc "LI,L:c" inlent of a spcclkci, no
lme has <In 1Jt.h,'q spec'dl 'ICt. Thus, 10 oDllin the "lrue" inlC'nt of :l spc"kcr, no
nddi~iot::~!1nfi.1-cncc' is ncccss~ryin t l c;lx
~ c ~ af Ilirccf S[)CCC~I act: CIIC ~ C ' I ' S S I ~
,1uJith,1I::d inkrcnc.. . b necessary in tbe C<lse of a direct speech il\:t; the PCl'S011
n;car,s what hc litcrsll;; is s ~ t i n g ,fn col1tr;lst. i ~ ;!a
i it\dil.cct spcccl~iict, i s ~ c r a l
n;~'ans ",'hat he litcr,;lly is sJ}ing. In contrast, in ,!r! indirect specch 'A't, ',c\,efal
83
laycrs of infcrcncc might have to bc intcrposcd bcforc tl~e"tmc" rcqucst is
layers of inference might have to be interposed before the "true" request is
rccovcrcd:
recovered:
direct
indircct
surfacc
Ilocs the Windsor
Ilo you know whcn
surface
Docs the Windsor
Do you know when
form
train lcavc at 4?
the Windsor train leaves?
form
train leavc at 4?
the Windsor train leaves?
surface
Inform-if
Inform-if (Agcnt knows
surface
Inform_if
Inform_if (Agent knows
(Ilcparturc tirnc is 4) dcparturc tirnc)
rcqucst
request
(Departure time is 4) departure time)
'
I n fonn-if (Ilcparture
undcrlying lnfonn-if
underlying Infonn_if
Infonn_if(Departure
rcqucst
(Dcparturc tirnc is 4) timc is 4)
request
(Depalture timc is 4) time is 4)
Figurc I1 Comparison of direct and indircct rcqucsts.
Figure 11 Comparison of direct and indirect requests.
Oncc thc undcrlying rcqucst is rccovcrcd, the systcin can procccd as before.
Once the underlying request is recovered, the system can proceed as before,
but how can one gct from S U ~ ~ ~ Ircqucst
CC
to undcrlying rcqucst? Allcn's answer is
but how can one get from surface request to underlying request? i\llen's answer is
thnt rccovcring an indircct undcr1ying rcqucst involvcs rcconctructing a portion of
that recovering an indirect underlying request involves reconstructing a portion of
a spcikcr's bclicfs on the basis of obscrving an cxtcrnal uttcrancc. Hcncc,
a speaker's beliefs on the basis of observing an external utterance. Hence,
rccoucring an indircct spccch specch act is thc srlnlc sart of proccss as rccovcring a
recovering an indirect speech speech act is the same sort of process as recovering a
spcakcr's ultimatc plan -- only QIC infcrcncc chain ciin bc longer, hcncc more
speaker's ultimate plan -- only the inference chain can be longer, hence more
difici~ltand crror-pronc. Thc cxlra stcp is simply to takc thc literal mcaning of
difficult and error-prone. The extra step is simply to take the literal meaning of
thc observed uttcrancc. obtaining a surface spccch act as bcforc, and thcn cxpand
the observed utterance, obtaining a surface speech act as before, and then expand
this alternative backwards by a scrics of jnfcrcnce rutcs dcaling now with the
this alternative backwards by a series of inference rules dealing now with the
spcakcr's dcsircs with respect to what thc spcakcr and thc hcarcr (the station
speaker's desires with respect to what the speaker and the hearer (the station
agcnt) mutually bclicvc. In h i s casc, what they n~utua!lybelicvc includcs the
agent) mutually believe. In this case, what they mutually believe includes the
information that thc spcakcr wants thc agent lo tcll the spcakcr whcthcr the agcnt
information that the speaker wants the agent to tell the speaker whether the agent
knows whcn the Windsor train Icavcs. It also includcs Ihc fact that knowing when
knows when the Windsor train leaves. It also includes the fact that knowing when
thc train lcavcs is the sole relevant precondition for telling the spcakcr when the
the train leaves is the sole relevant precondition for telling the speaker whcn the
train leavcs, and furthcr that the & possiblc cffcct of thc agcnt's knowledge of
train leaves, and further that the only possible effect of the agent's knowledgc of
whcn the train lcavcs on the spcakcr is for fhc spcakcr to come to know when the
when the train leaves on the speaker is for the speaker to come to know when the
train Icaves -- this sincc both spcakcr and station agcnt alrcady mutually believe
train leaves -- this since both speaker and station agent already mutually bclicve
that thc sution agcnt knows when thc train lcavcs. 'Ihus the only possible plan
that the station agent knows when the train leaves. Thus the only possible plan
that is advanccd by asking the qucstion is a plan whcrc thc speaker finds out what
that is advanced by asking the question is a plan where the speaker finds out what
thc departure time of the train is -- hcncc thc inform-if undcrlying rcqucst.
the depanure time of the train is -- hence the infonn-if underlying request
If the context is constrained enough. thc computational problcms that could
If the context is constrained enough, the computational problems that could
arisc hcrc do not sccm to be iusurmountablc: if onc can only mect or board trains,
arise here do not seem to be insurmountable: if one can only meet or board trains,
thcn thc cxpression. "thc train to Windsor" uttcrcd without any surfacc indication
thl;n the expression, "the train to Windsor" uttered without any surface indication
of
intcnt (as a rcqucst, say) can still be analyzed. Howcvcr, this is simply because
of intent (as a request, say) can still be analyzed. However, this is simply because
Berwick
84
thcrc cannot bc many "dccp" intcntions. Indccd, given thc tightly circurnscribcd
there cannot be many "deep" intentions. Indeed, given the tightly circumscribed
train srarion world, a spcakcr would nccd but to appn~nchthc station agcnt and
train skltion world, a speaker would need but to approach the sLltion' agent and
say, Windsor". 'l'hc agcnl could thcn supply a rcsponsc that mct cithcr the
say, Windsor". The agent could then supply a response that met either the
dctnands of rnccting or hoarding a train: a gatc and a time. 'The powcr of Allcn's
demands of meeting or hoarding a train: a gate and a time. The power of Allen's
system thcn is that in a constricted domain onc can cncodc a sumcicnt set of
system then is that in a constricted domain one can encode a sufficient set of
axioms and infcrcncc n~lcsto deal with a cornplctc, but circurnscribcd, range of
axioms and inference niles to deal with a complete, but circumscribed, range of
human actions and intcntions. I'o cxtcnd this approach to thc world at large
human actions and intentions. To extend this approach to the world at large
would plainly rcquirc a way to rcstrict infcrcnce. Sincc pcoplc do solnchow
would plainly require a way to restrict inference. Since people do somehow
managc to gct around in thc world in gcncral -- or at lcast thcy sccrn to one is
manage to get around in the world in general .- or at least they seem to" one is
inc1uct;tbly Icd to the scarch for tliosc "natural" constraints on infcrcncc that
ineluctably led to the search for those "natural" constraints on inference that
pcoplc must prcsuinably usc. 13ut thcrcin lics thc story of thc Future of cognitive
people must presumably usc. But therein lies the story of the future of cognitive
scicncc, a story that must wait for another book to tell.
science, a story that must wait for another book to tell.
--
--
8S
intriguing cvidcncc for thc intcnlal constn~ctionof thc languagc production
intngumg evidence for the internal constnlction of the language production
mechanism itself.
mechanism itself.
McDonald's rcscarch into languagc gcncration lics squarcly within this
McOonald's research into language generation lies squarely within this
methodological approach. His basic goal is to dcvclup a working computer
methodological approach. His basic goal is to develop a working computer
program tliat can iictually producc orthographic output from an input mcssagc
program that can actually produce orthographic omput from an input message
language; onc can llicrcforc cval~latchis work with rcspcct to thc (wcak) milestone
language: one can Lllerefore evaluate his work with respect to the (weak) milestone
of Input/Output reproducibility. I-lc~wcvcr,not only is thc Mcllanald procedure
of Input/Output reproducibility. However. not only is the McDonald procedure
dcsigncd actually to output orthographic tcxt from tncssagc input it is dcsigncd to
designed actually to output orthographic text from message input, it is designed to
that
o ~ i t p ~tcxt
~ t o~iickly. 111 slioi-t. llic major cognitive fidclity rcqi~ircmc~~t
output text!ll!i1h. In short. the major cognitive fidelity requirement that
McI3onnld aims to ~ n c c is
t tl1;tt of flucncy, that is. thc ability to producc language
McDonald aims to mect is that of fluency. that is, the ability to produce language
at a boundcd ratc without arbitrary pauscs. 'I'his is a quintcssclitially
at a bounded rate without arbitrary pauses. This is a quintessentially
compuwtional dcmund, serving as a kind of "forcing function" to drivc thc dcsign
compuL1tional demand, serving as a kind of "forcing function" to drive the design
of thc gcncration prwcdurc. '1'0 mcct the flucncy rcquircmcnt, k1cf)onald
of the gcncration procedure. To mcet the fluency requirement, McDonald
advanccs a spccific sct oTconstraints on his input mcssagc languagc and mapping
advances a specific set of constraints on his input message bnguage and mapping
function. I'or instance. one of die mcssagc langilagc constraints lias to do with the
function. For instance, one oftlle message language constraints has to do with the
order in which rncssagc sub-clemcnts can appear in a conlpletc message: it is
order in which message sub-elements can appear in a complete message: it is
forbidden to allow a sub-clcmcnt that dcpcnds on somc highcr, morc "abstract"
forbidden to allow a sub-clement that depends on some higher. morc "abstract"
rncssagc clement to bc cxpandcd bcfi}rc that morc abstract clcmcnt. 7hcn too, the
meSSJge clement to be expanded before that more abstract clement. Then too, the
gcncrqrion proccdurc ilsclf is rcstrictcd to d o only a boundcd amount of "advance
gener~tion procedure itself is restricted to do only a bounded amount of "advance
planning" bcforc it must dccidc what to say at any givcn stcp. (Thcsc vague tcrms
planning" before it must decide what to say at any given step. (These vague terms
Iikc "boundcd" and "adiancc planning" will bc sharpcncd shortly.)
like "bounded" and "advance planning" will be sharpened shortly.)
'fogcthcr, Lhc constraints on rncssagc languagc and mapping atgorithm are
Together, the constraints on message language and mapping algorithm are
claimcd to bc suficicnt (not ncccssary) to obtain thc dcsircd result of flucnt
claimed to be sufficiellt (not necessary) to obtain the desired restlll of nuent
output. As in Lhc case of othcr systclns constraincd so as to opcratc with limited
output. As in the case of other systems constrained so as to operate with limited
rcsou;ccs, McDonald suggests that lhcy conspire so as to prcvcnt thc gcncration
resour~es, McDonald suggests that they conspire so as to prevent the generation
proccdurc from corrcctly handling certain input mcssagcs propcrly. Although he
procedure from correctly handling certain input messages properly. Although he
docs not providc a firll characterization of which sorts of constructions can or
docs not provide a full characterization of which sorts of constructions can or
cannot bc handled, it will b e shown below that a fonnal characterization of most if
cannot be handled. it will be shown below that a fonnal characteriz.ation of most if
not all of thc error-producing constructions can bc givcn. In a nutshell, the
not all of the error-producing constructions can be given. In a nutshell, the
flucncy constraint is subsumed by a formal property of grammars known as the
fluency constraint is subsumed by a formal property of grammars known as the
l.(k) condition [Lewis and Stcarns 19681. As we shall see, the LL(k)
LUll I.
condition [Lc\';is and Stearns 1968]. As we shall sec, the LL(k)
charactcrizalion providcs a uscful umbrclla framcwork in which lo unify the
characterization provides a useful umbrella framework in which to unify the
various constraints that McI3onald proposes. One such framework is sketched
various constraints tllat McDonald proposes. One such framework is sketched
below: it recasts Lbc gcncration procedure as transduction guidcd by an
below: it recasts thc generation procedure as transduction guided by an
undcrlying 1-L(k) grammar (and is conscqucntly a varicty of svntax-dirccted
underlying LL(k) grammar (and is consequently a variety of syntax-directed
translation, a formalism morc familiar from thc rcalln of compilcr design.) This
translation, a formalism more familiar from the realm of compiler design.) 'Ibis
rc-intcrprctation of thc McDonald modcl will allow us to invcstigate more
re-interpretation of the McDonald model will allow us to investigate more
carchlly thc claims of McI3onald's rcscarch, and considerably sharpcn the
carefully the claims of McDonald's research, and considerably sharpen the
statcmcnt of his constraints.
statement of his constraints.
Docs thc McDonald modcl mimic human specch production? Thcrc is some
Docs the McDonald model mimic human speech production? There is some
Rcrwick
Berwick
86
86
stlggcstivc cvidcncc tliat it docs. It turns out that thcrc arc systcmatically produccd
suggestive evidence that it docs. It turns out that there arc systematically produced
"crrors" in human spccch -- ungrammatical scntcnces that arc quitc common in
"errors" in human speech .- ungrammatical sentences that arc quite common in
the spokcn languagc -- that can bc clcgantly accounted for if onc adopts
the spoken language -- that can be elegantly accounted for if one adopts
Mcllonald's assumption of boundcd planning duri~lgspccch production. 'Thcsc
McDonald's assumption of bounded planning during speech production. These
ilrc ungrarnmi~ticalscntcnccs (as notcd by [Kroch 19811) with a)-callcd rcsum~tive
me ungrammatical sentences (as noted by [Krach 1981]) with so-called resumptive
pronouns:
pronouns:
'Ihc guy who I don't know whcthcr hc will cornc or not...
The guy who I don't know whether he wi1l come or not...
Such scntcnccs arc assumcd to havc a constirucnt analysis roughly like the
Such sentences arc assumed to have a constituent analysis roughly like the
following:
following:
[N;ll~c guy [S who I don't know [Swhclhcr
will cornc or
[NP'1l1e guy [s who I don't know [swhether he will come or
not...
not..
&
1. Typically only when reading Lhcrn, however. In normal speech, intonational cfficcls a
I. Typically only when rCilding them, however. In normal speech, intonational effects seem to
eliminate most garden palhs by providing su ficient cucs lo d i r d G-IC lislencr's parse.
eliminate most garden paths by providing su ffieienl cues to direct the listener's parse.
m to
87
turns out, forward planning of parsing in Marcus' modcl is limitcd to a lookahcad
turns out, forward planning of parsing in Marcus' model is limited to a lookahead
of just onc scntcntial clausc -- just as in Lhc modcl for tlic production o f scntcnccs
of just one sentential clause -- just as in the model for the production of sentences
-- and it is this restriction that forccs thc ~nisanalysisof gardcn path scntcnccs. It is
-- and it is this restriction that forces the misanalysis of garden path sentences. It is
not too much of an cxaggcratioii to say that thc Mc1)c)nald gcncration procedure
not too much of an exaggeration to say that the McDonald generation procedure
is thc production dual oithc Marcus parser.
is the production dual of the Marcus parser.
1.5.1 l'hc h1cl)onald 111odc1
1.5.1 The McDonald model
Ikrwick
Berwick
88
1. Clcarly. h e McDonald procedure does not fall inlo this degcncrate category. The gcncration
I. Clearly.
lIle McDonald
docs notmapping,
fall intosince,
this degenerate
category.
generation
fcaturcThe
niusl
he added to the
procedure
cannol beprocedure
a simple idcnlity
~rivially.a tcnx
procedure cannot be a simple identity mapping. since. trivially, a tense feature musl be added to the
ourput string. Still. the point about lhe irliportancc or Ihe inpul languagc remains Lhc same: a precise
output string. Still. the point about the importance of the input language remains the same: a precise
charactcrizalion of Ihe input message languagc is a logical prcrcquisi~c to thc rormulat~onof the
characterization
ofprocedure.
the input mc,sage
is a logical prerequisite to the formulation of the
gcncralion
70~ h cxtent
c language
lhal Ihe mcssagc langungc remains unspecified, claims about the
generation
procedure. cificicncy
To the extent
that
the message
language
unspecified.
claims
about the
compulatjonal
of lhe
gcncralion
proccdurc
arc remains
corrcspondinglq
vague
and imprecise.
computational
efficiency
of
the
generation
procedure
are
correspondingly
vague
and
imprecise.
2. Morc prccisely. Lhe numbering traces out a win-calledIcfl-most dcrivation. Civcn a grammarG Lhal
2. Moregcncralcs
precisely,a the
numbering
out aof
so-called
left-most
derivalion. Given a grammar G that
language
I, a traces
dcrivation
a scntcncc
w or L is called Icfl-most ii, a1 cach step in the
generatesdcri\ation
a language
l~ a derivation of a sentence Ii' of L is called left-most if. at each step in the
of L,
Ihc lch-mosl non-terminal of h c grammar is cxpnndcd For cxamplc. considcr the
derhation
of L,grammar
lIle left-most
non-terminal
lIle NP-->JohnlMary,
grammar is expanded.
ror example,
consider
the of the
timplc
S-->NP
VI', VP--)VofNP.
V-->kissed
A lea-most
derivation
simple grammar S->NP VI', Vp>V NP. Np)JohnIMary, V--)kissed A left-most derivation of the
scnlcncc "John k i w d Mary" would be: S-->NI' VI'-->john V1'-->John V N1'-->John kisted NP-->
sentence "John kissed Mary" would be: S--)NI' VP--)John VI'--)John V NP..)John kissed NP")
John kissed Mary. Note Lhat this expansion ordcr is cxaclly that o f h c top-down, left-toright traversal
john kissed
Mary.byNote
that this automaton.
expansion order is exactly lhat of the lopdown. Icft-to-right traversal
spccificd
McDonald's
specified by McDonald's automaton.
89
'lhc tables
wblcs of infnnnation
information arc like
likc the
thc transition tables
whlcsofa
finitc automaton --- they
tlicy
"I11e
ofa finite
thc next
ncxt state
swtc of the
thc machine
machinc shall bc.
givcn its current
currcnt state
swte and the
dictatc what the
dictate
be, given
symbol attached
attachcd to the
thc node
nodc nfthe
of thc tree
trcc currently bcing
scanncd.
symbo!
being scanned.
In particular,
thc
gcncration
transducer
has
two
finitc
tablcs that
th;~tdetennine
dctcrminc its
particu!ar, the generation
finite tables
ncxt state and the
thc symbo!(s)
symbol(s) that it should output: a dictionary
dictio11;lryand a er;lmmar.
next
grammar.
'lhc dictionary
dictionary specifics
spccifics an initial mapping from
from the
thc non-linguistic
mcswge
The
non~lingllistic message
structoral linguistic counterpart.
counterpart. 'Ihat
'lhat is,
is. the dictionary
dictionary maps input
language to a structural1inguistic
lncssagcs ..
-- such as "(lady-macbeth
"(lady-niacbcth (persuade
(pcrsoadc (macbeth
(macbc~h(action)))(macbeth
(action))))(macbcth
messages
(murdcr(duncan)))" --- into
illto the
thc familiar
fmiliar structural constituents
constitucnts of linguistic
linguistic theory
(murder(duncan))"
-- clauses,
clauscs, subjects,
sul~jccts,and objects. By
Ily connecting
connccting propositions to
lo linguistic
linguistic objects,
objccts, it
-UI;I~ tells
tclls us
11s which tokens arc to serve
scrvc as the
Uic predicates
prcdicatcs in the
is the dictionary that
input message
mcssagc language,
languagc, and which
ullich as
3s arguments
argumcnts to
u1 the
thc predicates.
prcdicatcs. '111('
'lhc dictionary
UIC repository of information about what is called
callcd in linguistic
linguistic theory
thcory
is also the
grammatical relations
relations -gr;~mrn;ltical
-- for example,
example. the
Lhc information that. in English at least,
lcast, the
Agori
Lady-macbctli. and typically
typicitlly occupies
occupics a certain
ccrtain structural
Agent of ocrsu;lde
pcrsuJdc is Lady-macbeth,
position in a phrase
phrasc structure
structurc tree
trcc of the linguistic
linguistic output --- the
Ihc first Noun Phrase
undcr a Sentence node
nodc (or the
thc Subject, in familiar tenninology);
terminology); the dictionary
under
furthcr
spccifics
hl;sbcth
as
UIC
Q
&
of
thc
scntcncc,
first Noun Phrase
further specifics Macbcth
U,e Obicct
the sentence, the first
Prcdicatc or Verh
Vcrh Phrase.
Phrasc. In short, the
thc dictionary provides a
dominated by the Predicate
nay of associating gra~nmalical
objccts (constituent
(constitucnt structure
structurc trees)
trccs) with
consistent way
grammatical objects
objccls, mapping thematic
thcmatic roles
rolcs such as the
thc "doer" of an action (the
(the
the mc~raac
message objects.
Agcnt) or the
thc recipient
rccipicnt of an action (the
(thc Patient
I'aticnt or Goal) into
inu, positions in a phrase
Agent)
dctails ofthis
of'this process, and whatever
wh;itcvcr onc's particular
structurc tree.
trcc. Whatever the details
structure
tastcs
tastcs in grammatical
prammatical theory,
thcory, onc
one thing
U~ingis clear: a mapping from
from thematic
thcmatic roles
likc Agent
Agcnt or Patient)
Paticnt) to wh;:lt
whi~tarc called
callcd grammatical
grammatical relations (notions
(notions like
like
Objcct) is something
stimcthing that almost every
evcry current
currcnt linguistic
linguistic theory
thcory
likc Subject and Object)
advocates.
McDonald's
grammar has U,e
Mcllonald's gmmm
thc job carrying out such specifically linguistic
linguistic work
dclcting equivalent
cquivalcnt Noun Phrases
Phrascs in embedded
embcddcd constructions and actually
as deleting
modifying
ihc structure
structurc of the
thc constituent structure
structurc tree
Ucc to produce a grammatical
modifying the
string
Macbeth case,
string of orthographic tokens. For example,
cxamplc, in the &&&
case, the grammar
specifics
thc following
following series
scrics of a!terations:
alterations: (I)
(1) addition of a tense marker
specifics the
("Lady-macbeth
persuaded macbeth..:');
("lady-mclcbcth pcrsuadcd
macbcth..."); (2) deletion
dclction of identical Noun Phrases
(Equi-NP
persuade Macbeth [Macbeth
(Equi-NI' deletion)
dclction) ("[Lady-macbeth
("[lady-macbcth pcrsuade
[Macbcth murd.er
muraer
Duncan]]=<>"[Lady-macbelh
persuade [Macbeth
I)uncan]]="[Lady-macbcth pcrsuadc
[Macbcth murder
murdcr Duncan]]");
l)uncan]]"); and (3)
interpoiat!on
persuaded Macbeth
interpolation of function words such as ill (Lady-macbeth
(Indy-macbcth pcrsuadcd
murder
"Lady-macbeth pcrsuadcd
persuaded Macbeth to murder
I)uncan*"Lady-macbcth
murdcr Duncan"),
Iluncan"). "I11e
The
murdcr Duncan=<>
end product is a finished
persuaded Macbeth
finishcd sentence,
scntcnce, e,g.,
c.g., "Lady-macbeth
"Lady-macbcth pcrsuadcd
Macbcth to
murder Duncan",
Duncan". 'Illus
'Ihus the reconstruction of the surface
surface string
string from
from message is
broken down into several
several stages.
-.
Berwick
90
It isis important
decomposition of
of
important to point out that holis
i s modular dccomposition
representational
parcel of
of
rcprcscntalional levels
lc\cls from message
mcssagc to phonological form is part and parccl
modern
the work of Chr~msky
Chomsky [Chomsky 1955J;
1955);
modcrn generative
gcncrativc linguistic
linguistic theory,
thcory, e.g.,
c.g.. thc
[Chomsky
example, in thc
the rcccnt
recent Cholnsky
Chomsky
[Cllomsky 1965J;
19651; and many others.
uthcrs. So for cxamplc,
theory,
thematic and prcdici~tc
predicate argument
lcvcl of representation
rcprcscntadon at which thc~natic
thcory, a level
structllre
linguistic levcl
level hUlat
describes a
gcts mapped to a lingl~istic
a t dcscribcs
structure isis expressed
cxprcsscd first gets
constituent
then, via grammatical rulcs
rllles (including
constitocnt structllre
structurc tree
trcc CD-structure"),
("D-structure"), thcn,
"movement
structure ("s-structure"), and
"rnovcmcnt rules"),
rulcs"). to a level
lcvcl of annotated
annotatcd surface
surf:~ccstructurc
finally,
'Illis is indccd
indeed roughly
rollghly the
ule block
finally.to a phonolngical
phonological representation
rcprcscntatiol~("P!"'),
("PF"). 'lliis
diagram
ulis logical dccomposition
decomposition docs a
llQl
Mcl)onald's approach,
;ippronch. However,
Huwcvcr, this
diagram of McDonald's
specify a way of actually computing
phonological form,
fonn, given an input
c~~rnouting
the phonoLrgicsl
message;
mcssagc: this is
is the
thc job of the
thc gcncmtion
gcncralion model itself.
Furthcnnorc,
procedure docs not compute
I;urthcrmorc. McDonald's
Mcl)onnld's actual generation
gcncration praccdure
tf1CSC
Rather, the annotatcd
annotated constituent
acsc stages
su~gcsseparately,
scparatcly. one after
aRcr the
!he other. Rathcr,
stmcture
right, in ordcr
order that thc
the fringc
fringe clcmcnts
clements
stnlcturc tree
trcc isis bllilt
built up piecemeal,
piccc~ncal.left
Icft to right,
of the
tokens may bc
be output as ssoon
thc tree
trcc corresponding to actual
;~ctualorthographic tokcns
w n as
possible.
entire trcc
tree for .!YIill
llill, the
possible. Thus,
Ihus, instead of building the
thc cntirc
Who ~
kisscd 13il1,
generation
the fronted who portion. IThis
gct~cratiunprocedure
pr~rcdurcactually
actually first constructs
constructs only thc
his
incrcmcntal output
ourput pcnnits
pcrmits the gcncration
nodc-by-nodc incrementa]
nodc-bynodc
generation procedure to actually
thc tnken
tokcn "who" before
bcforc the
thc remainder
rcmaindcr ofthc
"say" the
of the tree is complctcly
completely built:'
built: 1
/s________
COIIP
I x , xx a person
+wh x.
person
+wh
output
output
string:
string:
I .
[
"who..
"who
...
12 Producing a ~-question,
Figure 12
ll'h-question,
Turning now from
from questions
qucstions of representation
rcprcscntation to questions of
Turning
of cognitive
fidelity, we find
find that McDonald aims to account for a number of
fidelity,
of the obvious
ofhuman speech:
characteristics ofhuman
characteristics
also permits
permits an efficient interleaving
intcrlolving of
olprwcssing:
i n g produced,
1.1. It11 also
processing: while
while the
the output
output lakens
lokens are
are kbeing
produced,
the system
syacm can
can be constructing
eonsvucting the
thc next
nexl portion ofthem.
the
of the tree.
91
(1) Human
fluman speech
spccch production is nllcnt
&:
(I)
fluent and rmlli/:
wholc clauses
clauses arc produccd
without excessively
cxccssivcly iong
long
whole
produced withuut
pauses; output procecds
proceeds at approximately
bounded
nppruximatcly a boundcd
paoscs:
rate.
(2)
production is typically error-free
(2) Human speech
spccch pmduction
wcll-fom~cdaccording to Ule
Ulc rules
rulcs of modem
and well-formed
generative
gencrativc grammars,
grammars, but
hut with consistent exceptions.
cxccptions.
(3)
produced sequentially
(3) Speech
Spcccl~ is produccd
scqucntially (an obvious
('lhis docs
ducs nut exelude
excludc the
thc ability
physical constraint). (This
rcstsrt the
tlic output of an entire
cntirc phrase a
to backup and restart
common occurrence
occurrcacc in everyday
cvcryday speech.)
spccch.)
(4)
planned over
(4) Speech
Spccch is D1:lnncd
ovcr representational
rcprcscntational units that
arc greater
grcatcr than one
onc word in length.
ate
'l11c evidence
cvidcncc for this last claim comes from
from well-estabiished
well-cstahlishcd psycholinguistic
'll,e
psycholinguistic
pointed out U,at
phenomena. For instance,
phcnomcna.
instance. [I.ashley
[I.ashlcy 1951J
19511 pointcd
that anticipatory errors
crron soch
such
as,
pick), Jim
nutnbcr one
onc draft &).
as. "'Il,e
"'llic Patriots'
I'atriots' nomber
nurnhcr one
onc draft o1illJ..k -- number
I'lurrkctt",
logically impossible
impossible unlcsc
spccch is planned over expanses
cxpanscs of
Plunkett", arc logically
unless speech
greater
~ord.
u r d .[Shattuck-Hufnagel
[Shatusk-Huf~iagcl1979]
19791 Similar arguments also
grcatcr than a single ~
n
92
take another stcp along the mcssagc trcc structurc and output a token.' This
take another step along the message tree structure and output a token.] This
computational coastraint, in turn. is uscd lo motivate a scrics of constraints on (i)
computational constraint, in turn, is used to motivate a series of constraints on (i)
thc input mcssagc languagc (the "well-fonncdncss cnnsrraint"); (ii) thc mapping
the input message language (the "well-formedness constraint"); (ii) the mapping
proccdurc (it is dclcrministic. in that at any decision point a uniquc choicc of next
procedure (it is deterministic, in that at any decision point a unique choice of next
movc can he madc: it uscs only lirnitcd look-ahcad to dctcrminc its next move):
move can he made: it uses only limited look-ahead to determine its next move);
and (iii) what thc p r t ~ c d u r ccan and cannot successfully produce (namcly, it can
and (iii) what the procedure can and cannot successfully produce (namely, it can
succcssfi~llyproducc wh;~tpcoplc d o and lnakcs thc samc mistakcs that ycople
successfully produce what people do and makes the same mistakes that people
do). '!'his argulncnt is clcarly run along thc samc lines as that of [Marcus 19801:
do). This argument is clearly run along the same lines as that of [Marcus 1980J:
first, to assumc d ~ a at suong cornputatinn;~l(alias cognitive) fidclity assumption
first, to assume that a strong compul<Jtional (alias cognitive) fidelity assumption
must bc met -- namcly, lhat normal flucnt languagc is prod~lccdin rcal timc
ID.1!it be met .. namely, that normal fluent language is produced in real time
without backup -- and sccond, to go on to dcducc what machinc propcrtics arc
without backup .. and second, to go on to deduce what machine properties arc
suficient to acl~icvcthis computational constraint, wliilc maintaining descriptive
sufficient to achieve this computational constraint, while maintaining descriptive
adequacy, all the whilc dcvcloping predictions about failure pattcrns of the
adequacy, all the while developing predictions about failure patterns of the
gcncrator that comport with human bchavior.
generator that comport with human behavior.
How can thcse claims of cflicicncy be cvaluatcd? The rcmaindcr of this
How can these claims of efficiency be evaluated? The remainder of this
scction sketchcs an initial attempt to formalize thc Mcllonald gcncration program
section sketches an initial attempt to formalile the McDonald generation program
as an cxamplc of a more thcorcrically familiar ~nodcl-- an 1-l,(k)-parscr driven
as an example of a more theoretically familiar model -- an LL(k)-parscr driven
transduction, a lop-down. dctciministic, prcdictive parscr h a t parscs thc input
transduction, a top-down, detelministic, predictive parser that parses the input
mcssagc string using a I o ~ k - i ~ l ~of~ akd lokcns and at at thc same timc
message string using a look-ahead of 1. tokens and at at the same time
incrcmcntally produces languagc output as dcsired.
incrementally produces language output as desired.
O f course, this approach is not claimed to bc thc final word about how to
Of course, this approach is not claimed to be tile final word abollt how to
hr~nallyrcconstitutc thc h4cllonald approach. What h claimed is that thc I,I.(k)
formally reconstitute the McDonald approach. What i1i claimed is that the LL(k)
formalization hclps to cxposc thc capnbilitics and limils of thc McDonald
formalization helps to expose the capabilities and limits of the McDonald
gcncration proccdurc and at the sarnc timc admirs a disciplined and rigorous
generation procedure and at the same time admits a disciplined and rigorous
evaluation of thc constraints that hc advances. Informal statements of the
evaluation of the constraints that he ad\'ances. Informal statements of the
sufficicncy of various constraints can bc rcplaccd by prccisc thcorcms. 1 h c I-l4k)
sufficiency of various constraints can be replaced by precise theorems. lllc LL(k)
modcl, it is claimcd, can capture all of the following propcrtics o f thc McDonald
model, it is claimed, can capture all of the following properties of the McDonald
model:
model:
I. Thcrc is a dighi dificully using this dclinition straighlforwardly, in that an outpul loken an be a
1. There is a slight difficulty using this definition straightforwardly, in that an output token can be a
phonctic null clemenr. i.e.,silence. This is neccsary for the gcncration of sentcnccs whcre constituents
phonetic null clement. i.e., silence. Th is is necessary for !he generation of sentences where con~tituenlS
havc been displaced kern their canonical poshions c.g.. so-caltcd "movcmcnl" casn, as Who did
have been displaced from !heir canonical positions - e.g., so'called "movement" cases, as ~ ill!!
Macbcrh kill?, which is prcrumcd to havc thc underlying form Wh-x. x a ncrsctn, Machcth killed I . A
Macbeth kill', which is presumed to have the underlying form Wh-x,! ~ person, Macbeth killed!. A
qucsrion h e n arises as lo whether silences "count" in the time it takcs la produce langua~coulpuL
question then arises as to whether silences "count" in !he lime it la~e.' LO produce language output
Clcarly Lhcy do somctinlcs: wc havc already noted that !here arc slight pauscs at major constituent
Clearly they do sometimes: we have already nOled that there arc slight pauses at major constituent
brcaks. I'lnitc phonclic adjuslmcnts will no1 ancd any claim of linear timc operalion unlcs$ here can
breaks. Finite phonetic adjustment~ will nol affect any claim of linear lime operation unless. !here can
be an arbitrary numbcr of "pauscs" that can pile up.
be an arbitrary number of "pauses" that can pile up.
--
93
Incrcmcntal nodc-by-nodc constitucnt
Incremental node-by-node constituent tree
constniction and output.
cons{n1ctioll and output.
tree
- Deterministic operation.
- Traversal ordcr of Uic constitucnt structure trce.
- Traversal order of tJle constituent structure tree.
I . Aclually, the parser described hcre can be conslruclcd only if h e grammar G has the property of
I. Actually.
parser
here C3 n will
be conslrucled
only if the grammar G has the property of
bcinpthe
wonk
I .I described
.lk). lhis propcny
he defined shonly.
being ~tron
property
will bea1defined
2. g!JJ..tl.
Only one111is
symbol
is scanncd
a lime inshonly.
lhc nelwork modcl prcscnted above. We shall scc how l h i s
2. Only may
one symbol
is scanned
at a lime oft
in thetokens.
network model presented above. We shall see how this
be cxlendcd
Lo a lookahead
may be extended to a lookahead of! tokens.
Berwick
94
(1) If t11c
the stack aand
empty (as notcd
noted by
by
n d input string arc clnpty
presence of
of some prc-dctcrmincd
prcdctcrmincd cnd of
of input
input
the prcscncc
marker as thc
the top symbol on thc
the srack).
stack). thcn
ti,en halt
bait aand
markcr
nd
announce successful termination of
of the
lhe parse.
parse.
announcc
(Acceptance is by clnpty
empty stack and inpul)
input)
(Acccptancc
nonterminal
then lookup the
tbe entry
nontcr~ninal sy"mbol,
sjlnbul, thcn
corresponding to indcx
index ('l'.w)
(T.w) in tlic
ti,e paning
parsing table. TThe
he
entry is either
(the string is not gcncratcd
generated by the
cithcr error
errur(thc
underlying grammar G).
G). in which c;ec
case the
lhc machine
halts
and
announces
rejection
of
the
string:
halls
rcjcction of thc string:' l oorr else aa
.itring
of
nontcmlinals
that corresponds to the
string
nontcmiin;~ls XYZ Ulnt
right-band
side
of
a
grammar
rule .I'=XY%
T= XY/, in which
right-hand sidc
case
the
symbols
are
placed
onto
ti10 stack with the
casc thc symhols arc placcd
dlc
left-most
nonterminal
top-most
on
the stack
st.1ck (is.,
(i.e.. the
Icft-most non~crmin;il
o n thc
topmost symbol on hIhee stack bccomcs
becomes X.
X, fi~llowed
fi,lIowed by
Y.
Z. corresponding to aa left-most
Y, followed
followcd by %.
expansion). Notc
Note that in this case wc
we have "predicted
"predicted"
expdllsion XYZ. Productions inay
the expansion
may also be of
of the
foml T=0.
T s 0 , in which case
casc the
thc top of stack symbol is
form
"erased."
"erased."
'lhc initial configuration
configuration of the machine
rnachinc is1s with its rcad
The
read hcad
head ssanning
scanning the
lcn-most symbol
sv~nbolof d,e
thc in
i npot
~ ustring
t
suck containing just
just S
left-most
and its Slack
S as the top-most
top-most
thc end-of-input symbol beneath:
beneath:
symbol with the
symbol
Or course,
course, one
onc may
may do
do more
more than
than simply
yrnply reject
rcjccl the
the input
inpul string:
1.1. Of
string: since
since an error is caurd
caused by Ule
the
railure Lo10 find
find aa predicted
prcdiclcd rule
rule of
of the
the form
f a n T==>
T a XXYZ,
Y Z , one
one at least
least knows
knows what
what kind of
failure
of phrase
phrase (left
hand side
side of
of aa rule)
mle) was
war being
bcine.
soueht
a1 lhe
the time
lime the
thc error occurred.
acurred. and
and which
which sub-pieces
sub-pieccr of Lhe
hand
the
.sought
. at
righl-hand side
side had
hadalready
been constructed.
wnslrucled. This
Ihis anticipatory
anlicipalory context
contexl may
may in fact
fact w
k b u l e ltoa an
righi-hand
already been
contribute
cxplanalionof
of planning
planning errors
errorsof
or the
the Plunkett--pick
PlunkelI-pick tYPe.
1 s .
explanation
95
input st'ring
input string
4
I
read head
read head
stack:
stack.:
output s t r i n g
output string
Ktop
o f stack
Control t a b l e
Control table
S*NP VI'=Macbcth
V I ' a klacbcth V NI'3Macbcth
S=> NP VP==> Macbeth VP= Macbeth V Nil=> Macbeth
murdcl*N P a M acbcrl~murdcr Duncan
murder NP=>Macbeth murder Duncan
Supposc that u ~ i cwrotc dcwll s t i ~ ~ p s h of
v ~thc
s stock as i r i~ppcarcdduring the
Suppose that one wrote (\(!wn snapshots of the wck as it appeared dllfing the
1,1.(1) parsc clfthis scntcncc. Onc would ul)scrvc thc following scqucncc ~I'stack
LL(l) parse or this sentence. One would observe tJ1e following sequence or stack
c:onfiguri~tions( # is tllc C I I of
~ srack symbol):
con fig.urations (# is the elld of stack symbol):
96
Berwick
Step# 1 2
Stack
Stack
Top->
I~II::I
l!Jl!J
~,'acbe th
VP
fI
m
p
#
#
"I
L!J
NP
#
fI
~:s
\/\
VP
6---murder
Macbeth
Macbeth
MUrder
1~ 8
Duncan
Duncan
'Ihc I,L(k) p:uscr uscs its stack sit~iplyto kccp ~rilckof which non-tcrtninnl
'111C LL(k.) parser lIses its stack ~il1lply [0 k.ecp Irad of which non-terminal
nodcs arc to bc visitcd ontl tllc ordcr in wllich they :~rcto bc visitcd. .
noues arc to be visited and the order in which they arc to be Visited.. But Jhis 1:i fl&
JS
input the current non-lel1llina1, input s)'1Ob )1, and (liX<'d) finite
97
parsing
symbol. thcn
then stack
slJlck and
parsing lJIble,
tablc, and produces as output a new
ncw non-terminal symbol,
tree
structures
arc
extensiunally
equivaleot;
given
identical
inputs.
they
trcc structures arc cxtcnsiunally cquivalcnt;
inputs, thcy both
return the
thc same
ramc new
ncw non-terminal
non-tcrminal nodes
nodcs for further
h ~ n h ccomputation.
r
Since
determined by the top-of-stack
top-of-slJlck
Sincc an
an LL(I)
1.1.(1) parser's
pancr's move
n~oveis completely
complctcly dctcrmincd
symbol
and
input
symbol
scanned,
the
equivalence
between
left-most
symbol
symbol suanncd,
cquiv;~lcncc bctwccn Icft-most tree
traversal
top-down, Icft-most
left-most dcrivation
derivation
travcrwl symbols
symbols and top-of-stack
top-of-stack symbols in a top-down.
means
linite tablc
lJIble and using only
mcans that the
thc McDonald
Mcllonald controller when driven
drivcn by a linitc
one
be simulated by an
onc symbol
symbol of luokahead
lookahcad (the
(tlic input symbol scanned)
scanncd) can bc
LL(I)
I.I.(l) parser,
parscr, and conversely.!
conversely?
Let
of tlic
the clarification this
tllis formal
I r t us
us consider
considcr just one
onc simple example
exa~nplc of
approach
constraints. At first glance,
approach affords
affords before
bcforc reviewing
rcvicwing all of McDonald's
Mcllonald's constraints.
onc
the proper cxpansion
expansion of
of some
onc might think that in order
ordcr to determine
dctcrminc thc
nonterminal
expansion above --- one would
nontcrminal ---- say,
say, the VP in our familiar leftmost
lcftmost cxpansion
have
tl,e cntire
entire prefix w
~ to
kccp track of the
thc entire
entire string parsed
parscd so far (that is, thc
havc to
to keep
the
even, worse yct,
yet, the cntire
entire
left of the
the current
currcnt input symbol being scanned)
scanncd) or cvcn,
the left
parse
McDonald sharcs.
shares. He
parsc tree
trcc already
already built. In fact,
fact. this is a worry that Mcllonald
obscrvcs that tllis
this extended
cxtcndcd bookkeeping could lcad
observes
lead to compuDtional
compulJltional difficulties.
difficulties,
sincc tl,e
ihc sheer
shccr number
nu~nbcrof nodes
nodcs in the parsc
sioee
parse trce
tree to rlic
tl,e lcfi
left of
of some current
expansion
linear funclion
function of thc
the input suing
string
cxpansion point can clearly
clc;~rlybe
bc greater
grcatcr than
Ulan a lincar
2 Since
parscd so
so far
far.2
Sincc this eompulJltional
co~nputationalburden might intcrfcrc
parsed
interfere with the
tl,e aim of
of
flocncy. McDonald
Mcllonald proposes
proposcs a sollltion:
solution: the
thc cntirc
fluency,
entire discourse history (input
message scanned plus any relevant
rclcvant structure of thc
message
the trcc
tree built so far) $
.i> to be stored,
acccss list.
list so that items
itcms can hc
but in a random access
be rctricvcd
retrieved indcpcndcntly
independently of
of their
positioli in tl,e
rllc list.
list. In addition,
addition. McDonald
Mcllonald notes
notcs that
position
tllat in some cascs
cases information
bc "passed
"passcd along"
along" from
from node
nodc to node
nodc in the trcc
must be
tree in ordcr
order to aid with some
particular expansion
cxpansion decision
decision later on.
on. However,
Howcvcr, it is not clcar
particular
clear just
just what class of
of
c~nstmctionswould cause
causc such problems:
prohlcms: nor is it cvidcnt
constructions
evident what or how much
"passcd along".
information must be "passed
information
thc LL(k)
Il.(k) apparatus in haod,
hand. it is easy to make thcsc
With the
these intuitions precise.
Considcr first
first the
thc puzzle
puzzlc ofha,ing
of haiing to store all the words sccn
Consider
seen so far. This worry is
onc. but it has already
already been
bccn ruled
rulcd out by the l.l.(k)
a valid one,
LI.(k) constraint. If
If the
Ll..(k), then we havc
underlying grammar is LI..(k),
underlying
have already sccn
seen that one need not
More generally.
generally. aaprammar
is called
called strong!1J!l
slrona l . l i k ) ififlhc
1.1. More
rammar is
the oon-terminal
non-tcnninal symbol
symbol on the lop
top orthestack
of the stack
lhc next!
neil tokens
Lnkcni of
or the
l h c input uniquel}'
uniquely determine
dclcnine lhc
the
the next
nelt marc
move of lhc
the parser. Ihus,
lbus, ifif aa
grammar isis strong
Srong L1~k).
1.Iik). th{'n
lhen the simple
sinlplc parser
parscr designed
dcrigncd sketched
grammar
sketched above
abo'o'c can be
be slraightforvardly
straightforwardly
cxundcd to
lo accommooale
accommodnle k
k tokens
lokcns oflookahcad.
ollwkahead.
extended
In the
the worst
worst case,
cax, if
ii the
the length
lcnglh of
or the
lhc longest
lon~cslside
side of
or aa rewrite
2.2 In
re-write rule
rule is 1w,
so thal
that lhc
the "branching
"branching
cn interior
laaor" of the
the tree
uce isis ata1 most
most 1.!. there
there could
cauld be
bc IlCn
invnor nodes
factor"
nodes in
in LC
the derivation
derivation ucc
tree lor
for aa ming
string of
of
lcnglh D.n.
length
plus
plu~
Rcrwick
Berwick
98
rcmcmbcr all of the input sccn so far in ordcr to dctcrminc what to do ncxt. More
remember all of the input seen so far in order to determine what to do next. More
prcciscly, if a grammar is strong I.l.(k) thcn (currcnt non-terminal. lookahead)
precisely, if a grammar is strong LL(k) then (current non-tenminal, lookahead)
pairs uniqucly dctcrminc the ncxt movc of the automaton. In short, if the
pairs uniquely determine the next move of the automaton. In short, jf the
underlying grammar is I,l-(k), thcn Mc1)onald's worries arc groundless; there is
underlying grammar is LL(k), then McDonald's worries are groundless; there is
no nccd for a random acccss list of previous discoursc history in thisl case.'
no need for a random access list of previous discourse history in this case.
Morcovcr, thc constn~ctionof complcx catcgory sy~nbolsfor l.l.(k) grammars that
Moreover, the constnlction of complex category symbols for LI.(k) grammars that
are not strong providcs a crisp charactcrination of what Mctlonald mcans when he
are not strong provides a crisp characterization of what McDonald means when he
says that information must be "passcd along": it is exactly the complcx catcgory
says that information must be "passed along": it is exactly the complex category
symbols (rcflccting nun-tcnninal-follow sct combinations) that nccd to bc savcd
symbols (reflecting non-tenninal-follow set combinations) that need to be saved
on thc stack for latcr rcf~rcnce.~
on th e stac k for la ter re ference. 2
Let us now revicw the list of computational and cognitive claints that
Let us now review the list of comput.ltional and cognitive claims that
McDonald advances and scc how thc ].Ilk) propcny suhsumcs each of them.
McDonald advances and see how the 1.L(k) property subsumes each of them.
].inear rime production
Linear lime production
It is a thcorcm that I,I.(k) parsing (with output) exccutcs in time Iincarly
It is a theorem that LL(k) parsing (with output) executes in time linearly
proportional to d>clcngth of thc input (~ncssagc)3suing3
A ncccssary condition for I,I.(k)-ncss is that top-down parsing can proceed using
A necessary condition for LJ.(k)-ness is that top-down parsing can proceed using
just boundcd forward lookahcad into thc input string. I:urthcrmore, by the
ju'st bounded forward look ahead into the input string. Furthermore, by the
propcrtics of Icft-most derivations, thc string of non-tenninnls to thc
of some
properties of left-most derivations, the string of non-tenninals to the r..i.!lt of some
symbol A that is duc Tor expansion is always morc "abstract" than thc material
symbol A that is due for expansion is always more "abstract" than the material
into which A will bc cxpandcd, in the scnsc that terminals will be dcrivcd from A
into which A will be expanded, in the sense that terminals will be derived from A
bcforc one evcr procccds to cxpand the non-terminals complctcly to the right of
before one ever proceeds to expand Lhe non-terminals completely to the right of
A.
A.
Non-ambiguity
Non-ambiguity
McDonald's input rncssclge language is fully disambigualcd: the bracketing
McDonald's input message languagc is fully disambiguatcd: the brackcting
structure indicates cxactly the scoping relations to bc obscrvcd. As a rcsult, there
structure indicatcs exactly the scoping relations to bc observed. I\s a result, there
is only onc derivation tree for an input mcssagc suing. This is also a necessary
is only one derivation tree for an input message string. This is also a necessary
condition for LUk)-ness.
condition for LL(k)-ness.
1. Except in the scnse that the parsing table providcs "random access" lo a finite list or lransilion
1. Except in the sense that the parsing table provides "random access" to a finite list of transition
tu les
rules. 2. If a grammar k U.4k) but not amng LL(k), then thc complen symbol approach constructs a new
2. If a grammar l~ U.~k) but not strong LL(k). then the compleK symbol approach constructs a new
finile parsing table such that (complcx symbol. lookahcad) pairs also uniqucly determine the moves of
finite parsing
table such that (complex symbol. lookahead) pairs also uniquely determine the moves of
thc parsing machine.
the parsing machine.
3. Reason: Sincc ihe grammar must be non-left recursive. any derivation A--> a must take only a
3. Reason; Since the grammar must be non-left recursive, any derivation A--) a must take only a
linmr number of stcps. ('Ibisrequires some proor.) Thcrcfore, for a valid string in Lhc language, here
linear numher of steps. ('Ibis requires some proof.) Therefore. for a valid ~tring in the language, there
can be at most a lincar number of steps before the stack is poppcd and an input symbol consumed;
can be at most a linear number of steps before the slack i~ popped and an input symbol consumed;
hence a number ofsteps linearly proporlional 10 the input string in all.
hence a number ofsteps linearly proportional 10 the input string in all.
99
,'Von-Iefl recursion
Non-feft recursion
If unaltcrcd, McIIonald's proccdurc must of ncccssity "pause" an arbitrarily long
If unaltered, McDonald's procedure must of necessity "pause" an arbitrarily long
timc bcfore producing output for such rnessiilgcs as,
time before producing output for such mcss..1gcs as,
(((Mort's father's) cousin's) brother)
(Mort's father's) cousin's) brother)
?his is bccausc if the linitc state controllcr must traversc the mcssagc structure
This is because if the finite state controller must traverse the message structure
from node to nodc, thcn given a mcssagc structure such as t l ~ conc above:
from node [Q node, then given a message structure such as the one above:
NP
~~
~sin's
NP
NP
~.
NP
\
Mort'sMort's
Figure 16 Le Tt-rccursivc message structure,
Figure 16 Left-recursive message structure,
an arbitrary number of nodcs must be uaverscd before the first output token,
an arbitrary number of nodes must be traversed before the first output token,
ever produced. violating fluency. But Icft-recursion also violates the
M.!m:.s, is everisproduced,
violating fluency. But left-recursion also violates the
I,L(k) condition. Hence, the LI-(k) condition subsumes thc left-recursion
LL(k) condition. Hence, the LL(k) condition subsumes the left-recursion
limitation.'
limitation. 1
m,
The L l ~ k ) p r o p e r and
! ~ ~ !he mcssage constraint
The LL(k) property and the message constraint
McDonald imposes the following constraint on thc input message in order to
McDonald imposes the following constraint on the input message in order to
guarantee that his production procedure mccts the requirement of fluency:
guarantee that his production procedure meets the requirement of fluency:
1. I f his is so. hen onc puzzle remains: just why are such constructions even producible in English?
1. If this
is so,have
thenbeen
one at
puzzle
just whyexplanations
are such constructions
producible
in English?
There
least remains:
several diflcrcnl
for his in heven
e linguistics
literature,
but all seem
There ha~e been atleasl several different ex planations for this in the linguistics literature, but all seem
to amount lo much h c mme thing: it is assumed that the mcwge input is somehow altered so that it is
to amount
to
much
the
same
thing:
it
is
assu
mcd
that
the
message
input
is
somehow
altered
so
that
it [Krauwer
is
nd lelt-recursive. ([Chasky 19651 and [Liherman 19671: [Chamsky and Ilalle 19681: mid
nol left-recursive.
([Chomsky 1%5) and [Liberman 1%7): (Chomsky and Halle 1968]: and [Krauwer
and des Tombes 13801) The proposals dilfcr only with regard Lo how e x a d y !he recursion is
and deseliminated
Tombes 1980]) "The prop=Js differ only with regard to how exactly the recursion is
eliminated.
Bcrwick
Berwick
100
100
'111~M c s a ~ ~Well-fo~m~ds
c
Con~tmint( W I T ) 'rhc ordcr in
J.1l Mcssag~
Wc1Jfn!llWdD.~ ~()nstr:U.u1 (WFC) The order in
which Incswgc
c1c11lcn1.swill t)c rcalizcd must bc such that any
reported
lrn"
the Shah
101
102
Berwick
The guy who they don't know whcther hc will come or not..
The guy who they don't know whether he will come or not..
In the examplc abovc, he is a rcsumptive pronoun, presumably co-designative
In the example above, he is a resumptive pronoun, presumably co-designative
with the my.
with~gyy.
Importantly, such sentcnces are gcncrally judged ill-fomcd, even immediately
Importantly. such sentences are generally judged ill'formed, even immediately
afterwards by the speakers who have produccd thcm; yet they arc widcsprcad in
afterwards by the speakers who have produced them; yet they arc widespread in
spccch. The reason for the ungrammaticality of such scntenccs has been generally
speech. The reason for the ungrammaticality of such sentences has been generally
attributed to a violation of an "island constraint" [Ross 19671: loosely spcaking,
attributed to a violation of an "island constraint" [Ross 1967]: loosely spealcing,
the
points to a pronoun that is "too far" away. Specifically, any binding
the &!!1
points to a pronoun that is "too far" away. Specifically, any binding
relationship bctwcen the ~ u and
v &crosses [wo wh-S boundaries:
relationship between the IDlY and he crosses ~ wh-S boundaries:
[NP thc guyis
who.....[S whcther he
Clause
Oause
fronted phrase
matrix phrase
fronted phrase
matrix phrase
(the guy)
Continuing, the gcneration procedure expands the fronted phrase into a full
Continuing, the generation procedure expands the fronted phrase into a full
orthographic reprcscntation, so as to maintain the requirement of producing
orthographic reprcsentation, so as to maintain the requirement of producing
103
outpul as soon as possible. Note thitt one could no( wait to do this until tltc cntire
output as soon as possible. Note that onc could nol wait to do this until the entire
rndtrix phrasc wcrc rcalizcd in dctnil, sincc thc rilatrix phsasc ctuId be arbitrarily
matrix phrase were realiLCd in dct.1il, since the matrix phrase could be arbitrarily
long. 'I'llus thc m y is produced without h::ving dccidcd upon thc inlcrnal details
long. Thus the lillY is produced without having decidcd upon the internal details
of thc rnatrix phrase:
of the matrix phrase:
Oause
---.
rronicd pl~rasc
matrix phrase
~h
fronled
phrasercnlizcd)
matm p rasc
(~lrci~dy
( I (know (the guy)))
(ill ready r~i11izcd)
(l (know (the guy)
I 'n1c guy...
Outpni:
,'---.
Cl.. . usc
~matrix phnsc
~(+m
frol~lcdphnsc
fronted
phrase
(+(the guy)))
(alrc;~clyrc:ilizcd]
( I (know
(already realized)
(I (know (the guy)
Output: 'Ihc guy who 1 know
Output: The guy who I know....
....
]04
Berwick
matrix phrase
matrix phrase
~now...
Ncxt, the embedded mcssagc clcrncnt (thc guv (will cornc)) is proccsscd.
Ncxt, the cmbedded message clement, lllli; gyy (will come)} is processed.
Supposc that this clcmcnt is also rcalizcd as a wh-phrase, and a choice of
Suppose that this element is also reali/.cd as a wh-phrasc, and a choice of
"whether" as the Icad-in wh-lexical item is scIcctcd:
"whether" as the lead-in wh-Jcxical itcm is selected:
...
What is to happcn next? 'me wh-flag is still on, so the standard move would
What is to happcn next? 'The wh-tlag is still on, so the standard move would
bc to crasc the mcssagc clcmcnt "the guy", just as in a typical wh-phrase. But this
be to crase the message element "the guy", just as in a typical wh-phrase. But this
would bc an error:
would be an error:
Note further that the iIl-cffcct of "erasing as usual" can be locally checked;
Note further that the ill-effect of "crasing as usual" can be locally checked;
only thc immcdiatcly surrounding context is required in order to tell that this
only the immediately surrounding context is required in order to tell that this
structure is hopclcssly bad1
Stl:ucture is hopelessly bad. 1
All secms lost. Supposc though, as Kroch suggests. that discourse NPs
All seems lost. Suppose though. as Krach suggests, that discourse NPs -pronouns or anaphoric NPs -- arc available to the generation procedure. Since the
pronouns or anaphoric NPs -- ill available to the generation procedure. Since the
discourse machinery is already known to operate across utterances, then insertion
discourse machinery is already known to operate across utterances, then insertion
of discoursc itcms is clcarly not subject to wh-island reslrictions. If so, then one
of discourse items is clearly not subject to wh-island restrictions. If so, then one
should be able to insert such an item, salvaging interpretability of the sentence at
should bc able to insert such an item, salvaging interpretability of the sentence at
the cost of a syntactic violation. 'I'his is exactly what onc finds:
the cost of a syntactic violation. This is exactly what one finds:
--
He's the guy who I don't know whcther & wiIl come....
That thcsc inscrted items are indced discourse entities is substantiated by the
That these inserted items are indeed discourse entities is substantiated by the
fact that rcsumptive anaphoric NPs also occur in wh-island violation positions, as
fact that rcsumptive anaphoric NPs also occur in wh-island violation positions, as
Kroch points out:
Krach points out:
1. A sentence without a lexical Subject such as this one violatcs what [Chornsky 19811, [Kayne 19811,
1. A sentence
without a lexical Subject such as this one violates what [Chomsky 1981J. [Kayne 1981J,
and others
have dubbed the Empty Catcgory Principle (ECP): if a sentence has a non-lexical or
and others have dubbed the Empty Category Principle (Eep): if a sentence has a non-lexical or
"phonclicaliy "ernply" calcgories - such as in Who did you kiss [Npemply]? - thcn these calegories
"phonelically 'emply" calegories - such as in Who did you kiss [Npemptyj? - then these calegories
in a "local"domain. usually the nearest sentcntial clause. In the case at hand. an anpty
must be
must be 1lm!JJ.ll. in a "local" domain, usually the nearest sentential clause. In the case at hand, an empty
category in the position aRer "whether"could not be Iocally bound
lOS
In sum, we concludc that a compu~~ionally
bounded macl~incof the sort
In sum, we conclude that a comput..'ltionally bounded machine of the sort
Mcllot~ald proposcs can actually prcdict ccmin aspects of human language
McDonald proposes can actually predict certain aspects of human language
processing -- thc bcst onc could hopc for from any thcory, compiiuiional or not.
processing -- the best one could hope for from any theory. comput..1tional or not.
I3y now I hopc that I havc convinccd you that it is prcciscly this cmphasis on
By now J hope that I have convinced you that it is precisely this emphasis on
prcxcss that givcs tllcsc thcorics thcir special personality and flair. Wcbbcr aims
process that gives these theories their special personality and flair. Webber aims
to tcll us how ro ;nan from a logical form rcprcscntation to discoursc entity
to tell us how to map from a logical form reprcscnt..'ltion to discourse entity
descriptions, Sidncr how focus mclvcs. Kaplnn and Allen ~nodclhow a hcarcr's
descriptions, Sidner how focus moves. Kaplan and Allen model how a hearer's
bclicfs about a spcakcr arc mnstn~ctcd,Mcllonald, how languagc is produced. In
beliefs about a speaker arc constmcted, McDonald. how language is produced. In
cvcry case the ftrus is on how somcthing gcts donc -- as bcfils Ll~c tcrm
every case the focus is on how something gets done _. as befits tlle term
rornnutational modcl. Having survcycd thc tcrritory and lcli bchind solne
cornput:ltional motlel. Having surveyed the territory and left behind some
tourguidc tips. 1 will lcavc it to thc rcadcr to judgc how wcll lhcy succccd in tllcir
tourguide tips, I will leave it to the reader to judge how well they succeed in their
rcspcctivc tl~coricsof how.
respective tlleories of how.
CHAPTER 2
From Natural
Natural Language
Recognizing Intentions From
Utterances
James Allen
2.1 Introduction
Intrnduetion
In order
we necd
some of
ordcr to design
dcsign good question
qucstion answering systems
systcms wc
nccd to build in so~nc
11f human
lil~manconversation.
ca~~vcrwtion.
shr~uldbc
tlic charactcrirtics
the
characteristics of
In p;lriicular.
panicular, thcy
they should
be able to
providc responses
rcsponscs that specify
spcciry more
n111rcinformation
infur~ilatianthan strictly required
rcquircd by the
provide
question.
however, pro\'idc
inform.1tion or provide
qucstion. They
Tlicy should
slioold nol,
no4 houcvcr,
provide ttoo~ much
o
inbrtnntion
providc
inf<~rn~;lti~~n
tthat
h t is of
no Lise
usc 1to11 ihc
qucry (ef.
(cf. [Gricc
[Gricc 1975]).
19751).
infoml<ltion
ufno
tl,C pcrson
person who rnadc
made thc
the query
consider thc
following exchange
cxchangc at
aL an information booth in a train
For cx;~mplc,
example. consider
the following
station.
station.
(1.1)
OJ)
(1.2)
patron: Whcn
Montrcal train leave?
lcavc?
When docs UIC
tl,e Montreal
clcrk: 3:15 at gate 7.
clerk:
108
Allen
I r t us define
define&#&jobstacles to bc
situations which inhibit the
the goal achieving
achieving
Let
be situations
of an agenlln
agent. In lbis
lhis paper, all obstacles
obstacles wil\
will be in lbe
the fonn
form ofsubgoa1s
of subgoalsthat
procesc
process ofan
that
the agent cannot achieve without assistance.
anistanee.
lbe
7hc major claim
daim of lbis
this paper is lbat
that many instances
instances of helpful behavior arise
ariae
lbe
bbecause
u r the observing
ubserving agent recognizes
mognizes an obstacle in the other agent's plan.
plan, and
acts
particular, we are most interested
xts to remove lbe
the obstacle.
obslaclc. In panicular,
intcmtcd in the
the obstacles
obslsles
lbat
be removed
that can bc
remmcd by lbe
thc use
u r eflanguage.
ef language.
To model thiS.
behavior.
this we view lbe
the use
u r of language
language itself
iwlf a~
a goaloriented
goal-oriented bchavior.
Uueranccs are produced by actions lm!:l:!;h iKW
& lbat
that are executed
executcd in order to
to
Utterances
have some effcct
effect on lbe
the hearer.
hearer. 'Ibis
'lhis effcct typically involves
involves modifying the
hearer's
beliefs or gnals.
hearer's beliik
goah. A speech
spccch act,
xt, like any other action,
d o n , may be observed by
lbe
the hearer and may allow the hearer to
to infer
infer whatlbe
what the speaker's plan is. Often
0th a
sptcch
act
may
explicitly
describe
an
obstacle
(i.e"
a
subgoal)
to
lbe
hearer,
speah act
explicitly describe obwaclc (i.e.. subgoal) lo h e hearer. For
(1.1) conveys
conveys to
to lbe
the hearer
hcarer lbat
that the speaker
spcakcr needs to know the
example, utterance
uUerance (1.1)
dcpanure
uain. But there may be other obstacles
departure time of lbe
the vain.
obnaclcs in the plan that were
wm
Ihe speaker may also need to know the departure
deparmrc
not explicitly conveyed (e.g. the
location). Tbc helpful response
rnponsc will allempt
aUempt to
lo overcome
o v e r e m these
Ihese obstacles as well as
a6
the explicitly
expliitly mentioned
mentioned ones.
ones
lbe
explaining some aspects
a s p a s of helpful
Our viewpoint provides
provides the basis
basis for explaining
behavior in dialogue,
bchavior
dialogue. including:
including:
the generation
_
generation of responses
mpom that provide more
informatian than required
q u i d (as in the above
above
infonnation
example)
empk)
the generation
gcnemtion of respoD!ICS
rrsponss to sentence
mtmce
fragmenlS
Fragmenm
the analysis
speech
analysis of indirect
indimr q
x e h QC/3,
acts.
Let
propoeed
La us
US consider
oonsider each
a h of these
thcsc aspects
aspec$ in turn,
turn.and see
see how the model proposed
above applies
applia to each,
each.
It is fairly
fairly simple
simple to see how the model could
m l d explain the
the providing of more
information than explicitly
expliitly requested.
requested. In the train domain, the clerk ellpCCll
e m that
109
lhc patron
has goals such
such as boarding
niccting trains. II
A query
qucry (e.g.
(c.g. (1.1))
(1.1)) about
the
patron has
boarding or mecting
timc, as opposed
opposcd to one
onc about a train arrival. indicates that
diat it is
a train departure time.
likcly that
lhat the
lhc patron's plan
Uie train,
train. In
I n addition, assuming
assuming that the
likely
plan is to board
buard U,e
clcrk bclicvcs
lhal U,e
dic piitron
docs not already
alrcady know the
thc departure
dcparturc loeation,
location, he
clerk
believes that
patron dues
bclicvcs that
dint not knowing
knowing the
thc location
location is
is also
a l x ~an obstacle
obstaclc in
in the
thc plan. 'Inus
'lhus he
believes
gcncratcs a rcsponsc
rcsponsc that
tliitt overcomes
orcrcolncs both
ohstaclcs (i.e.
(i.c., (1.2)).
gcncrates
both obstacles
can explain
cxplain how sentence
scnlcncc fragments
fr;tgmcnts can be
bc understood
undcrsto~idwhen
whcn
s m c model
~nodclcan
This same
is sufficiently defined.
dcfincd. For instance.
insr;tncc. the following
killawing exchange
cxchange occurred
l~currcd
tllc context
conlcxt is
the
lllc train
trait1 smtion:
sotion:
at the
(2.1)
(2.2)
patron: The
'Ihc 3:15
3:lS train to Windsor?
Windsor?
clerk:
Gate 10
clcrk:
10
tlic query
qocry nor the
die meaning
meaning of
o f its words indicate
Ncitlicr the
thc syntactic
synt;~ctic form
k ~ r mof
o f the
Neither
what the
Uic response
rcsponsc should be.
bc. However,
Howcvcr, given
givcn our vic\,,'point
uic\+point above.
i~bovc. it is quite
concci\ablc
Illat the
tlic in
information
in the
Uic fragment
fraglncnt is
is sufficient
suflicicnt to allow C
U
I hearer to
concch
able that
formation in
t..he
infcr what the
tlic speaker's
spcakcr's plan is.
is. Hence
Hcncc he
hc can
can produce
produce a reasonable
rcasonablc response
rcsponse
infer
thc obstacles
obsu~clcsin
in the
thc plan
arc. In
I n U,e
Uie above
abovc example.
cxamplc. (2.1).
(2.1). is
b:lscd
based on what the
plan are.
si~fficicntto
111 identify
idcntify the
tlic speaker's
spcakcr's goal to
111 board
tllc 3:15
3:IS train to Windsor. An
si,fficient
board U,e
ohstaclc in
in U,is
Uiis pl;in
dcparturc location,
locati~m.hence
hcnce U,e
tlic response
rcsponsc (2.2).
(2.2).
obstacle
plan is knowing the departure
OUicr sentences
scntcnccs in
in the dialogues
dialagt~csarc not treated
trcatcd at face
facc value.
v;lluc. For instance,
instance.
Other
patron: Do
D o you know when
whcn the Windsor train
(3.1)
leaves?
leaves?
ycs/no question
qucstion about the
thc hearer's
hcarcr's knowledge.
knowlcdgc. However,
However, an
Syntactically, this is a yes/no
answer
be quite inappropriate.
answcr of
o f "yes"
"ycs" in
in U,e
thc given
givcn setting
sctting would bc
inappropri;~tc. In
I n other
surrciundings, however,
howcvcr, it could bc
intcndcd literally.
lilcrally. For instance,
insoncc, a parcnt
sccing
surroundings,
be intended
parent seeing
thc station
station and wanting to make
make sure
surc that everything
cvcrything is arranged
arranged
a child off at the
intending to receive
receive a yes/no
ycs/no answer.
answcr. Sentences
Scntcnccs such
such as this that
might say (3.1) intending
appcar to mean
mcan one
onc thing
Uiing yet arc treated
trcatcd as
;is though
ihougli they
tlicy mean
Incan something
solncthing else are
appear
tcrmcd indirect
indircct ~
sncccli i1illISearle
acts IScarle 1975].
19751. These
'llicsc forms can
can also bc
cxplaincd using
using
termed
be explained
tlic viewpoint <Ihove.
;~hovc. Simply st.:lted.
swtcd. the
thc solution lies
lics in
in the
tlic realization
rcali7.ation that the
the
spcakcl. knows
kttows that the
ihc hearer
licarcr will perform
pcrfi~rnisuch
such helpful
liclpfi~lbehavior,
bcliavior, and
and hence
hcnce may
speaker
samctliing intending
i~itcndingthat the
tlic hearer
hcarcr infer
infcr the
ihc indirect
indircct goal.
goal.
say something
I'liis papcr
dcssribcs L'lc
Lkc plan
infcrcncc and obstaele
obstaclc detection
dctcction proccsscs
This
paper describes
plan inference
proeesses and
shows how they
thcy can
can be applied
applicd to explain
cxplain helpful
hclpful responses
rcsponscs and the
o f sentence
scntcncc fragments.
fragments. The
'l'he model
modcl is then
thcn extended
cxtcndcd to provide a
understanding of
understanding
thcory of
o f indirect
indircct speech
spccch acts. Section
Scction 2 provides
ovcrvicw of
o f the
gcncral theory
general
provides an overview
Allen
110
cntirc approach, and thcn Scction 3 providcs details of thc plan infcrcncc and
entire approach, and then Section 3 provides details of the plan inference and
obstaclc dctcction proccsscs. Scction 4 applics lhcsc tcchniqircs to language
obstacle detection processes. Section 4 applies these techniques to language
analysis. It considers two cxamplcs of rcsponscs that providc Inore information
analysis. It considers two examples of responses that provide more information
than explicitly askcd for. Scction 5 re-cxamincs Ilic definition of spccch ac~5and
than explicitly asked for. Section 5 re-examines the definition of speech act~ and
introduces thc notion of shared knowlcdgc bctwccn thc spcakcr and the hcarer.
introduces the notion of shared knowledge between the speaker and the hearer.
'I'hc plan infcrcncc rncch;inis~nis thcn cxtcndcd to handlc indirect spccch acts.
The plan inference mechanism is then extended to handle indirect speech acts.
Using thc cxtcndcd systcm, Scction 6 dcscribcs an analysis tcchniqnc for scntcnce
Using the extended system, Sectioll 6 describes an analysis technique for sentence
fragments.
fragments.
A syslcrn bascd on lhc work in this pnpcr has hccn i~nplcrncntcdand tcstcd in
A system based Oil the work in this pilper has been implemented and tested in
tlic train domain dcscril)cd ahtrrc [hllcn 19791. Whilc h c di;rlogucs in this domain
the train domain descrihed above [Allen 1979J. While the dialogues in this domain
arc somcwhat rcsrriclcd in subjcct matrcr, thcy providc a widc rangc of linguistic
arc somewhat restricted in subject maller, they provide a wide range 0f linguistic
bchavior thilt has not prcviously bccn cxplaincd.
behavior that has not previously been explained.
III
m,
Cohen
Cohcn IC,lhen
[Cohcn 1978]
19781 demonstrated
dcmonstratcd that speecb
spccch acts
acts sucb
such as requesting
rcqucsting and
llan Construction
2.2.2 Plan
statc, two major tasks need
nccd to be
bc done to produce
producc a plan to achieve
Given a goal state,
that goal.
tliat will accomplisb
accomplish tbe
the
goal. One is to find a sequence
scqucncc of actions ti,at
transformation from
from lhc
thc initial
initial world state to the
thc goal st.ate,
statc. The
'The other concerns
conccrns
transformation
specifying tbe
parameters of the actions in the
the bindings for
for the paramctcrs
thc constructed
constructcd plan.
A typical method
mcthod ofconstructing a plan is backwards
II
backwards chaining: given a goal G,
find an action II
preconditions
A that has G as one of its effects.
cffccts. Then
'l'hcn evaluate
cvaluate the
thc prcconditions
somc of tllcse
Ulcsc conditions
conditions arc not satisfied
satisficd in the
thc initial
initial state..
statc. they become
of A. If some
subgoals
subgoals and the plan construction
construction process
proccss repeats.
repcats.
IInother
Another dimension of plan construction involves
involvcs planning at different
diffcrcnt !levels
c&
of
abstraction
(sec
[Sacerdnti
1973]).
For
example.
in
a
domain
where
a
robot
bas
;~hstractit~n
(scc
[Saccrdoti
19731).
I.'
o
r
cxalnplc.
doni:iin
whcrc
has
routc tbrougb
through many rooms.
moms. the
thc plan would first
first bc
dcvclopcd in tenns
tcms of
to plan a route
be developed
"opcn door y."
y." Only after
aftcr sucb
such a plan was constructed
constructcd would
"go to room x" and "open
one consider planning actions sucb
such as "rotate n degrees,"
dcgrccs." "propel
"propcl forwards,"
forwards."
arm," etc.
ctc. To incorporate tbis.
this, many actions must havc
thc capability of
"twist ann,"
bave tbe
being "broken down" into sequences
bcing
scqucnccs of more
morc specific
specific aClions.
actions.
We arc, for tbe
the most part, interested
intcrestcd in reasoning
rcasoning about the planning behavior
112
Allen
of oihcr agcnts. In ordcr to facilitalc this rcasoning about thc planning proccss, we
of other agents. In order to facilit<lte this reasoning about the planning process, we
charactcrizc it as a scl of planning rulcs and a control stntcgy. Sincc this papcr is
characterize it as a set of planning rules and a control strategy. Since this paper is
mainly conccrncd with plan infcrcncc. wc will not consider control stratcgics for
mainly concerned with plan inference. we will not consider control strategies for
planning cxplicitly. Ilowcvcr, lnany of thc control issucs for plan infcrcncc are
planning explicitly, However. many of the control issues fbr plan inference are
dircctly applicable to pla~iningas wcll. 'l'hc crucial point hcrc is that these
directly applicable tv planning as well. The crucial point here is that these
planning rulcs will bc used to dcrivc a similar dcscriptioli of thc plan inference
planning rules will be used to derive a similar description of the plan inference
prtrcss in tlic ncxt section.
process in the next section.
'I'hc planning rulcs arc all of the form "If agcnt A wants to achievc X, thcn he
The planning rules arc all of the form "If agent A wants to achieve X, then he
rnay want to acllicve Y." A silnplc rule is:
may WJnt to achieve Y," A simple rule is:
(C.1) lj'oii agnrl ,vatits 10 nchievr a goal 1: arid ACT Ir
(C.l) Ifall agent wallis to achiel'e a goal alld ACT is
cur ncfiurr llint hcls I.' as at1 eflcrf. IIIEII [he agent
1111 actiull that has F as all effiet. ,hen the agent
ttiny w i l ~ ~IV/ rxccure AC'T (i.e.. achieve the
may wallt to execute ACT (i.e" achiel'e the
cxecrrtioir of ACT).
executioll of ACT).
1::
(C.2) Ifan agel/I wanls to achieve l' al1d dues 110t know
whrthcr P is true. t/teit that agent niuy watu to
wh('/her P is Irue. then Ihal ageM may wallI to
113
obscrvcd. Givcn tl~csctools, wc Lhcn hope to prcciscly dcfiric and cxplain thc
observed. Given these tools, we tilen hope to precisely define and explain tile
morc complcx and spccializcd mechanisms by dcriving lhcm 6.om thc siinplc s c t
more complex and specialized mechanisms by deriving tilem from the simple set
As with CIic plan construction proccss, CIIC plan infcrcncc prc~cssis spccificd as
I\s with the plan construction process, the plan inference process is sped tied as
a sct of inrcrcncc rulcs and acontrol strategy, liulcs arc all of thc form "If agcnt S
a set of in fcrence rules and a control strategy. Rules arc all of the form "If agent S
bclicvcs agcnt A has a goal X, tlicn agcnt S may infcr that agcnt A has a goal Y."
believes agent A has a goal X, then agent S may in fef that agent 1\ has a goal Y."
Examplcs ofsuch rulcs, corresponding to tlic planning n ~ l c (C.l)
s
and (C.2) are:
Examples ofsuch rules, corresponding to the planning rules (C.l) and (C.2) are:
(11.1) If
Of coursc, givcn thc conditions in (11.2). S might altcrnatcly infcr chat A has a goal
Of collfse. given the conditions in (D.2). S might alternately infer that A has a goal
of'achicving nor 1': this is trcatcd as a scparatc rulc. Which of thcsc rulcs applics in
ofachie"ing not P; this is treated as a separate rule. Which of these rules applies in
:I givcn sctting is dctcm~incdby control hcuristics, as follows.
a given setting is detcnllined by control heuristics, as follows.
l'hc plan infcrcncc prtxrcss can bc vicwcd as a scarck throi~gha sct of partial
The plan inference process can be viewed as a searcr. through a set of partial
131~11
partial plan consists of two pans: one pan is constructed using the
plans. Each p,lftial plan consists of two parts: one part is constructed using the
11l;ln infcrcncc rulcs from tlic obscrvcd nction (and called an ;~l[crnativc),and the
plan inference rules from the observed :lction (and called an allernative). and the
~ cxpcctcd goal (and
other is conslructcd using thc plan construction rules o ; an
other is constructed using the plan construction rules Oil an expected goal (and
callcd an cx~cctation).Whcn mutually cxclusivc rulcs can bc applicd to one of
called an expecl<ltion). When mutually exclusive rules can be applied to onc of
thcsc partial plans, thc plan is copicd and onc rulc is applicd in cncl~copy. Each of
these partial plans, the plan is copied and onc rule is applied in each copy. Each of
Blcsc partial plans is rhcn ntcd as to how probablc it is to bc h c correct plan. ?he
these partial plans is then rated as to how probable it is to be the correct plan. The
highcst ratcd partial plan is always sclcctcd for furthcr cxpansion using the
highest rated partial plan is always selected for further expansion using the
infcrcncc rulcs. 'Ihis rating is dctcnnincd using a sct of hcuristics h a t fall into two
inference rules. This rating is detennined using a set of heuristics that fall into two
classcs: thosc that cvaluatc how well-fomcd thc plan is in thc givcn context and
classes: tilose that evaluate how well-fonned the plan is in the given context and
thosc that evaluate how well thc plan fits lllc expectations. An cxalnple of a
those tilat evaluate how well the plan fits the expectations. An example of a
hcuristic is:
heuristic is:
u.
(H 1)
Allen
114
hcarcr makcs whcn hc or shc infcrs thc spcakcr's plan. For example, to infer the
hearer makes when he or she infers the speaker's plan. For example, to infer the
plan of thc spcakcr from thc utterance
plan of the speaker from the utterance
Whcn docs thc Windsnr train leave?
(4.1) (4.1)When docs the Windsor train leave?
thc hcarer must infcr that rhc spcakcr has thc goal of knowing whcn thc train
the hearer must in fer that the speaker has the goal of knowing when the train
lcavcs. Sincc thc spcakcr docs not know tllis information, t h ~ sis an cxplicit
leaves. Since the speaker docs mIt know this information. thIs is an explicit
obstacle.
obstacle.
'Ihc hcarcr cannot. howcvcr. basc thc rcsponsc solcly on thcsc cxplicit
'111C hearer cannot. however. base the response solely on these explicit
obstaclcs. For instancc, if A. carrying an clnpty gas can. comcs up to S on thc
obstacles. for instance. if A. carrying an empty gas can. comes lip to S un the
strcct and asks
street and asks
Whcrc is tf~cncarcsr gas station?
(5.1) (5.1)Where is the nearest gas station?
and S answers
and S answers
(5.2)
(5.2)
On the next corncr
On the next comer
knowing full wcll that thc sution is closcd, Ihcn S has not bccn helpful. Rut S's
knowing full well that the station is closed, then S has not been helpful. flut S's
rcsponsc did addrcss ths explicitly mcntioncd obstaclc, namcly knowing whcrc the
response did address the explicitly mentioned obstacle, namely knowing where the
ncarcst gas station is. S may want tn notify A if thcrc arc othcr obstaclcs to A's
nearest gas station is, S may want to notify A if there arc other obstacles to I\'s
plan, cspccially oncs that A is not awarc or. 'Illis behavior is cxpcctcd: cvcn when
plan, especially ones that A is not aware of. '1l1is behavior is expected: even when
A and S arc strangers, if A bclicvcs that S kncw all along that tI~cgas station was
A and S arc strangers, if A believes that S knew all along that tl1C gas station was
closcd, thcn A has justification for bcing angry at S, for S has violatcd some basic
closed. then 1\ has justification for being angry at S. for S has violated some basic
assumptions about human cooperation.
assumptions about human cooperation.
In thc dialogues we haw studied. all obstacles arc caused by a lack of some
In the dialogues we have studied, all obstacles arc caused by a lack of some
infonnation required in ordcr to bc ablc to cxccutc thc plan. This is not the case
infonnation required in order to be able to cxecute the plan. This is not me case
in gcncral, as we saw in thc cxamplc whcrc thc clcrk opens thc door for thc patron
in general, as we saw in the example where the clerk opens the door for the patron
carrying the groccries. Ihcrc thc clcrk rcspondcd to an obstacle which arose from
carrying the groceries, There the clerk responded to an obstacle which arose from
tllc pawn's inability to open the door,
tlle patron's inability to open the door,
2.2.5 Hclatcd Work
2.2.5 Related Work
Although thcrc has bccn sornc prcvious work on recognizing plans and gcncrating
Although there has been some previous work on recognizing plans and generating
helpful rcsponscs, to our knowlcdgc, no onc elsc has attcmptcd to combinc the
helpful responses, to our knowledge, no one else has attempted to combine the
two tcchniqucs. [Ilrucc 19801 outlincs a gcncral modcl of story cornprchcnsion
two techniques. [Bruce 1980) outlines a general model of story comprehension
based on rccognizing thc intentions of thc characters in the story as wcll as the
based on recognizing the intentions of the characters in the story as well as the
intentions of thc author. Although a slightly diffcrcnt application, our work here
intentions of the author. I\lthough a slightly diITercnt application, our work here
agrccs with his vicw. Bruce does not, howcvcr, dcscribc any algorithm for actually
agrees with his view. Ilruce docs not, however, describe any algorithm for actually
115
recognizing
rccogniring the
thc intentions in his stories.
stories.
Schmidt et
19791 discuss a plan recognitIOn
rccognition algorithm where
ct al. [Schmidt
[Schmidt 1979]
thcir task is to discover what the agent is doing.
physical actions arc
are observed
obscrvcd and their
But
before committing
Ilut they allow an arbitrary number
numbcr of acts to be observed
obscrvcd bcforc
thcmsclvcs to a particular plan. This
Ihis technique
tcchniquc is appropriate for analyzing
analyzing
themselves
sequences
csscnti;~lthat we identify
identify at least
lcast
sequences of actions. In our work, however. it is essential
part of the
thc speaker's
spcakcr's plan from
from a single
singlc observed
obscrvcd action (i.e..
(i.c.. tl,e
thc initial utterance).
utrcrance).
'l1lUS.
thc dialogue continues.
continucs. t.hi~
this plan becomes
bccomcs further specified.
spccificd.
'll~us,as the
Wilensky's
Wilcnsky's system,
systcm. PAM
PAM [Wilensky
[~ilensky1978],
19781, analy,es
analyxs stories
storicS by constructing a
plan for the
thc participants
p;trticip;~ntsJnd
and then
thcn answers
;Inswcn questions
qucstions about
a b o ~ the
tlic
~ t story
story using the
pl;~n.However.
Howcvcr. it docs not aucmpt
nttclnpt to recognize
rcc~~gnizc
thc plan of the
thc agent
agcnt asking
asking the
plan.
the
questions
qt~cstionsor w
to do any plan
pl;tn based
b ~ s c dreasoning abollt
about lallgu<.Jgc.
language. PAM
I'AM answers
answers
thc fornl
Torn] of the
thc quest;un
qucstion asked
askcd (sec
(scc [I[I ehnert
chncrt 1977a]).
1977al).
qucstions solely
solcly on the
questions
Kaplan (this
(this volume)
volulnc) discusses
discusscs helpful
Ihclpfi~lresponses
rcspanscs to questions which arc
are based
on violated
viol;~tcdpresuppositions
prcsuppositiol~sconveyed
convcycd by the
thc question. 'Illis work fits
tits well
wcll with the
approach
i~ppro;~chdescribed
described here.
hcrc, as violated
violatcd presuppositions should introduce
\hat the system
systcm helieves
bclicvcs and what
wh;lt the system
systctn bclicvcs
discrcpancics bctwccn
discrepancies
between what
believes the
spcakcr bclicvcs.
'lhcse discrepancies
discrcpancics could U,en
then bc
rccognizcd as obstacles
obshclcs in the
speaker
believes. These
be recognized
speaker's
paper, by dealing
llcncc innuence
influcncc U,e
ihc response.
rcsponsc. Thus
'lhus our papcr,
dcaling with
spcakcr's plan, and hence
helpful
behavior in a more general
pro,ides a general
hclpfi~lbchavior
gcncral fonn,
fonn, prokidcs
gcncral framework in which it
is useful
uscful to view Kaplan's work,
work.
I'l;in Inference
Inference and Ohstacle
Ol~st;~clc
I~ctcction
2.3 Plan
J)etection
Some
be considered
pI,," inference
Somc representation
reprcscntation issues
issucs must bc
considcrcd before
bcforc the plan
infcrcnce process
can bc
be described
dctail. Section
Scction 2.3.1
2.3.1 discusses
discusscs the
thc representaUon
rcprcscntation of belief,
dcscribcd in detail.
knowledge
knowlcdgc and want,
want. and Section
Scction 2.3.2 considers
considcrs actions
actions and plans. The
description
of
plan
in
ference
is
Ulen
broken
into
three
parts:
we consider U,e
dcscription
infcrcllce il~cnbrokcn
thrcc
the plan
inference
infcrcnce rulcs
rules in Section
Scction 2.3.3,
2.3.3, the rating heuristics
hcuristics in Section
Scction 2.3.4,
2.3.4, and the
control of the
process in Section
thc proccss
Scction 2.3.5.
2.3.5.
The
'Ihc final
final section
scction considers
co~lsidcrshow obstacles
obshclcs arc detected
dctcctcd in the plans that are
inferred.
infcrrcd.
Allen
116
Ihus, onc may infcr that A infcrrcd solnc proposition Q if it is bclicvcd that A
Thus. one may infer that A inferred some proposition Q if it is believed that A
bclici-cs thal Lhcrc is sufticicnt c~idcncclo infcr Q. Wc also nccd an axiom that
believes that there is sufficient evidence 10 in fer Q. We also need an axiom that
slates that conjunction can "pass through" chc bclicf opcrator. 'I'hus, if S bclicvcs
states that conjunction can "pass through" the belief operator. Thus, if S believes
lliat A bclicvcs P is truc and h a t A bclicvcs Q is truc. thcn S also bclicvcs that A
that A believes P is true and that 1\ believes Q is true. then S also believes that A
hclicvcs P and Q arc truc, and visa vcrsa. Writtcn marc formally:
believes P and Q arc true, and visa versa. Written more formally:
DEUEVE(I\,P) /\ BELl EYF.(A,Q) == BELl EVE(A.P /\ Q)
BEIJEV~.:(I\.P))
As an abbreviation. wc dcfinc
I\s an abbreviation. we definc
A KNOW P = P A 13EIzIEVE(A,P).
A KNOW P = P /\ DELI EVf."',(A,P).
In other words, if BELII:VF!S, A KNOW P), then S bclicvcs that S and A agree
In other words, if IlEUEVF..(S, A KNOW P), then S believes that S and A agree
H7
that I' is true. Iliis, ofcuursc, has no implication as to whcthcr I' is "actually" truc.
thJt P is true. 111is, of course, has no implication as to whether P is "Jctually" true.
'Ihc ncxt structure involves uscs of "know" as in "John knows whcthcr P is
The next structure involves uses of "know" as in "John knows whether P is
truc." 'Ihis is Ihc typc of bclicf S would havc to havc if S bclicvcd that John was
true:' This is the type of belief S would have to have if S believed that John was
able to answer a qucstion such as "Is I' uuc?". It is rcprcscntcd as thc disjunction
able to answer a question slich as "Is P true?". It is represented as the disjunction
A KNOWIF P
= (P /\ BELl EVF{A,P
(....... p /\ BEUEVE(A.-.P
'Ihc final i~scof know is in thc scnsc dcmonstmtcd by thc scntcncc "John knows
The final use afknow is in the sense demonstrated by the sentence "John knows
whcrc the box is." 'I'llis cnsc js rcprcscntcd by quantifying over' thc Ill!l.lEVE
where the box is:' This case is represented by quantifying over the BEl.IEVE
opcrator:
operator:
to. KNOWREF D "" (3y)(D=y) /\ BEUEVE(A,D=y)
Allen
118
Wc will say that an action is intentional if whenever the action was performed, the
We will say that an action is intentional if whenever the action was performed, the
agcnt wantcd it to occur at that timc. 'Ihus, if
is an intcntional act, 4 any
agent wanted it to occur at that time. Thus, if ACT is an intentional act, !l any
agcnr.,f any time, then
agenl, I any time. then
ll9
unlcss
"planner KNOWREF thc location of n."
"planner KNOWREF the location ofn."
'1.0 pcrmit plans to bc rcprcscntcd at varying lcvcls of dctail. a plan structure itsclf
To permit plans to be represented at varying levels of detail, a plan structure itself
can bc a nodc in a plan. 'l'hcsc "plan" nodcs rcprcscnt thc bodics of actions. ' h e
can be a node in a plan. These "plan" nodes represent the bodies of actions. The
bodv arc links an action to a plan nodc that contains its body.
body arc links an action to a plan node that contains its body.
Allen
120
indicates that if S bclicvcs A ha6 a goal of X, thcn S 1n;iy infcr that ,4 has a goal of
indicates that ifS belicvcs A has a goal of X, tllen S may infer that /\ has a goal of
Y. 'l'hc "i" in h e rulc indicates that it is a plan itlfcrcncc rulc, as apposcd to a plan
Y. The "i" in the rule indicates that it is a plan inference rule, as opposed to a plan
construction rulc, wllich will hc indicated by using a "c". Note that
construction rule, which will he indicated by using a "c". Note that
"SIlAW(X)"
Note that all goals are propositions. If the proposition is associatcd with an action,
Note that all goals arc propositions. If the proposition is associated with an action,
thcn we will describe having such a goal by saying that A wants thc action to
then we will describe having sllch a goal by saying that A wants the action to
occur. Othcrwisc. we will describc having a goal by saying that A wants to achieve
occur. Otherwisc, we will describe having a goal by S<lying that A wants to achieve
the ctindition signified by the proposition.
the c(jndition signified by the proposition.
Vie possible n~lcscan bc divided into three broad categories: those that
TIle possible niles can be divided into three broad categorles: those that
concern actions, those that concern knowlcdgc, and thosc that conccrti planning
concern actions, those that concern knowledge, and those that concern planning
by othcrs. A11 thcsc rulcs are summarized in 'Ihblcs 1 and 2 at thc end of this
by others. All these rules are summarized in Tables 1 and 2 at the end of this
scction.
section.
121
action ACT cnablcd by P to occur.
action ACr enabled by P to occur.
nod>-Actio~lRule
Rule
Sl3AW(R)
SnAW(ACT) -- if B is part of the
SBAWeB) =>j SBA W(ACT) .. if B is part of the
body of ACT.
body of ACT.
'I'hus. if A wants an action 11 to occur, that is
Thus. if A wants an action n to occur, that is
part of dlc cxccution of anothcr action ACT, A
part of tJle execution of another action ACr, A
may want AC1' to occur.
may want ACr to occur.
Bod>~Actioll
Acfiorr-1;ffect Rule
Action-FJJect Rule
SIlAW(hCI') xi S11AW(E) - if E is an cffcct of
SBA W(ACT) =>j SBAW(E) - if E is an effect of
ACF.
ACr.
Sinlply. this says that if A wants an action A C r
Simply, this says that if A wants an action Acr
to trcur, tllcri h ]nay want thc cffccts of ACT to
to occur, then A may want the effects of ACT to
bc achieved.
be achieved.
Know-negative Rule
SBA W(A KNOWIF P) :J SBAW(-,P)
i
Allen
122
K i ~ o wvalue Rule
Know-value Rule
SljAW(A KNOWIF P(a)) 3i
SBA W(A KNOWIF Pta~ ::>i
SllAW(A KNOWREF the x : P(x)).
SBA W(I\. KNOWREF the x : P(x.
Of coursc, in any plan iiltcrnativc, at most one o f thc first two rulcs can bc
Of COlirse, in any plan alternative, at most one of the first two rules can be
corrcct. 7'hc decision as to which of thcse is corrcct, or that none OF bcsc is
correct The decision as to which of these is correct, or that none of these is
corrccl is rllc responsibility of thc hcuristic evaluation of thc plans produccd by
correct. is the responsibility of the heuristic evaluation of the plans produced by
applying thc rulcs.
applying the rules.
Onc spccial crtsc is of intcrcst hcrc. If A indicatcs that hc or shc has a goal
One special case is of interest here. I fA indicates that he or she has a goal
A KNOWn: P
and it is known bctwccn S and A that A alrcady knows whcthcr 1'. thcn lhc path
and it is known hetween S and I\. that A already knows whether P, then the path
h a t agrccs uith A'S knowlcdgc is bkcn. For cxamplc, if S is playing with a dime
that agrees with A's knowledge is taken. For example. if S is playing with a dime
in front o f A, and A asks
in front of A, and A asks
''110you havc 3 dime,"
"Do you have a dime,"
nccause of thc vagueness in thc resulting goal, this rulc does not produce
Because of the vagueness in the resulting goal. this rule does not produce
reasonable plans unless a spccific goal or action of form P(I)) alrcady exists in the
reasonable plans unless a specific goal or action of form P(J) already exists in the
cxpcctations.
expectations.
The Rules runcerning I'lar~nitrgby Oihers
The Rules Concerning Planning by Ofhers
'I'hc plan construction proccss can bc dcscribcd in thc samc manncr as thc plan
The plan construction process can be described in the same manner as the plan
infcrcncc proccss; as a sst of rulcs that dcscribc possiblc constructions, and a set of
inference process; as a set of rules I.hal. describe possible constructions, and a set of
heuristics to evaluatc the resulting plans. 'Ihc plan construction rulcs arc simply
heuriSl.ics to evaluate the resulting plans. The plan construcl.ion rules arc simply
thc invcrscs of thc plan infcrcnce rutcs. Somc cxamplcs arc givcn bclow. X is the
the inverses of the plan inference rules. Some examples arc given below. X is the
name of thc agcnl doing thc plan construction, and "XW(Y)" is a n abbreviation
name of the agent doing the plan construction, and "XW(Y)" is an abbreviation
for "BELlEVE(X,WANT(X,Y))." For thcsc rules we usc thc notation "3; to
for "BELlEVE(X,WANT(X,Y." For these rules we use the notation "::le" to
123
indicate aaplan
ptan construction
construction rulc.
rule.
indicate
Action-Precondition Rule
Rule
Aclio~t-Frecondi/iun
XW(ACr) 3
:J,c XW(P) ifif P is aa precondition
precondition of
of
XW(ACl')
ACT,
ACT.
X wants to cxccutc
execute ACT, X.
X may want to cnsurc
ensure that
tilat its preconditions arc
are
IThus
l u s ifif X
true.
m
e.
Action-Body Rule
Action-Body
XW(ACr)
3,c XW(0)
the body of
XW(ACf):J
XW(B) if BB iiss piut
part of
oftbe
ACf,
Am.
Effect-Actioll Rule
Rulel?fic/-Acliorr
XW(E):J
/lCr.
XW(E)
3,c XW(ACT) ififE
E is an cffcct of ACT.
Of coursc.
course, if E can he achicvcd
achieved by many actions,
actions. cach
each onc
one of them could be
Of
introduced by Uiis
this rulc.
rule. As with plan infcrcncc,
inference. Ulc
the plans canstructcd
constructed using this
introduccd
rule would thcn
then havc
have to bc
be cvaluatcd
evaluated by a sct
set of cvrluatio~l
evaluation heuristics.
rulc
Kllow-Rule
XW(P):J c XW(X KNOWIF P)
Thus,
Thus, if X wants to achieve P but doesn't
docsn't know whether
whcthcr P is true,
tlue, X must find
find out
whctiler
P
is
true,
true.
whedicr
When
Whcn X constructs
construcls a plan involving
involving the cooperation
coopcration of another agent Y, X
may depend
dcpcnd on Y to do some
some plan construction
construction as well.
wcll. 'l11u5,
'lhus. X might get
gct Y to
perform
pcrfoml some action ACr
ACI' by getting
gctting Y to have the
thc goal of achieving
achieving Acrs
ACTS
effects.
For
example,
assume
that
X
wants
to
have
a
surprise
birthday
pany
for his
cxamplc, assume
X
surprise binhday
effccts.
roommate
roummatc Y and needs
nccds to get
gct Y out of the house,
house. X
X says
says
"We need
nccd some
some beer"
expecting
cxpccting Y to assume
assumc the
UICgoal
goal of getting
gctting beer,
bccr. and
and then
thcn construct a plan to
to get
some.
samc. This
'lhis involves
involvcs leaving
lcaving the
thc house,
housc, the
thc goal
goal X had all
all along,
along. Thus X has
reasoned
proccss. Crudely,
Crudcly, this new planning inference
inference rule
rcasoncd about Y's planning process,
can be
bc described
dcrribcd as
as
XW(WANT(Y,"leave
XW(WAN'I'(Y,"lcave house")):J
house")) 3c
XW(WANT(Y,"gct beer"
beer"))
c XW(WANT(Y,"get
since X believes
124
Allcn
l'his rulc schclnata is of interest when .it is assurncd that thcrc is no deceit
This rule schemata is of interest when it is assumed that thcre is no deceit
bctwccn the agcnts, and both rcalizc that thc planning by thc hcarcr was intcndcd.
betwcen the agents, and both reali1.C that the planning by the hearer was intended.
'Ihus, a king tnight say
Thus, a king might say'
"It's cold in here"
"It's cold in here"
to a scrvant, cxpccting the scrvant to plan to mskc thc room wa.rmcr.
to a servant, expecting the servant to plan to make the room warmer.
XW(AC?')3,XW(D)
Action-llody
Action-Body
XW(ACT)::Ie XW(Il)
Effect-Action
Effect-Action
Know-Rule
Know-Rule
uispartof
II is part of
body of ACT
body of ACT
XW(3) IcXW(AC1')
E an cffect
XW(3) ::Jc XW(ACT)
Ean effect
of ACT
of ACT
xw(p) 3,
XW(P) ::Jc
XW(X KNOWIF P)
XW(X KNOWIF P)
Ncstcd Planning
XW(WAN'T (Y,P)) 3, if X bclicvcs
Nested Planning
XW(W ANT (Y,P ::>c if X believes
WANrI'(Y,Q) 3,
XW(WAN'I'(Y,Q)
XW(WANT(Y,Q)
WANHY,Q) ::J c
WANrI'(Y,P)
WANT(Y,P)
'Ihblc 1 'I'hc plan construction rules.
Tablc 1The plan construction rules.
Ilut for thc scrvant to understand thc king's intcntion in thc above cxample, he
But for the servant to understand the king's intention in thc above example, he
must rccogniec that thc king's plan included planning by thc scrvant. Wc can
must recognize that the king's plan includcd planning by the servant. We can
charactcrizc infcrcnccs that construct thcsc ncw plans as k)Hows (rcvcrting back ta
characterize inferences that construct these new plans as follows (reverting back to
125
S as
as recognizer.
rccognizcr, I\.
A as
as the
thc observed
obscrvcd agent):
agcnt):
The
The Recogllizil1g
R ~ c o g ~ ~ i z iNested-Planning
Nesred-l'larrrrir~g
ng
Rule
Sill\.W(WI\.NT(S,P
SI%AW(WAN'f(S.I'))=:Ii
>i Sill\.W(WI\.NT(S,Q))
SHAW(WANT(S,Q))
if Sill\.
Tl(WI\. NT(S'p) >c
=:Ie WAWl'(S.Q))
WI\.NT(S,Q))
SI3AB(WAN'I'(S.P)
This
aI'SS wanting to achicvc
achieve P,
This rule can be
bc paraphrased
pamphrascd as follows:
k~llows:If I\.
A has a goal of
then
givel) that A
A bciic\,cs
believes SS
tlicn /\A may also
also have
h;lvc a goal of S wanting to achieve
achicvc Q. givcn
would
plan
to
achieve
Q
in
order
to
achieve
P.
uould
to achicvc
ordcr
One
agent A
/\ can gi.1
gel S
S to want some
Onc of the
thc mmlt
most common
comlnon ways in
it1 which
wllicli an agcnt
condition
then dcpcnd
depend on S's
5's
cundilion isis to
to get
gct SS to believe
bclic\.c A wants the
thc condition and thcn
cooperation.
the goal. 'This
This introduccs
introduces the
cooperation. "1l1U5.,
'l1111$ 1\
A is gelling
gctting S to
ti, decide
dccidc to accept
acccpt thc
last
last inference:
infcrcnce:
Decide rllference
Sill\. W(SIlI\. W(P)) =:Ii SJl;\W(WI\.NT(S,P))
Unfustunatcly. the
thc conditions
conditions onder
under which an agent
agcnt dccidcs
Unfortonately,
decides to acccpt
accept a goal are
cxtrclncly difJicutt
diflicult to describe.
dcscribc. In this paper,
papcr, we
wc L~kc
extremely
take the naivc
naive vicw
view that an agent
dccidc to
to adopt a goal unkss
unless he
he explicitly l?
is known
o t ttoo want t\>c
will dcc~dc
will
~n\)wn n
'11\)\.
\l.c conditions
tondit)Qns
implics.'lhiscundition
cnfurccd by the
thc fating
rating heuristics in the next scction.
itit implies.
'Ibis condition is enforced
section.
126
Allen
-
Precondition-Action
SBAW(P) IiSflAW(ACT)
Precondi tion-Action SIlAW(P) ::li SBA W(ACT)
P a precondition
P a precondition
of ACT
ofAef
Body-Action
Body-Action
SIlAW(D) 3iSDAW(ACT)
SBAW(ll) ::Ii SBAW(ACf)
I3 a part of
Il a part of
thc body of ACT
the body of ACT
Action-Effcct
Action-Effect
SBAW(ACl')
SI3AW(E)
SIlAW(ACI') ::lj SBA WeE)
1' is an cffcct
E is an effect
of
ACT
of ACT
Want-Action
Want-Action
SIMW(WANrF(n,ACT)) xi
SBAW(WANT(n,ACT Ji
SIMW(hCI')
SBAW(ACI')
Know-Positive
Know-Positive
n is the
n is the
agcnt of ACT
:lgent of ACl'
SUAW(A KNOWIF P)
SUAW(A KNOWIF P)
S13AW(P)
::Ii SBAW(P)
Know-Negative
SIjAW(A KNOWlF P)
Know-Negative
SBAW(A KNOWIF P)
SIIAW(not P)
::Ii SBA W(not P)
Know-Value
Know-Value
Know-'rcnn
Know-Tenn
SBAW (A KNOWREF D)
SBAW (A KNOWREF D)
s n A w (P (D))
::li SBAW (P (D
Rccognition of
SnAW(WANT(S,P)) 3i
SBAW(W ANT(S.P:J j
Recognition of
Nesting I'lanning Rulc SIMW(WhNT(S.Q))
Nesting Planning Rule SBA W(WANT(S,Q
if SllAB(WANT(S,P)
if SllA new ANT(S,P)
3, WAN'r(S,Q)))
::Jc WANT(S,Q)
Dccide Inference
S13AW(SRAW(P))
Decide Inference
SBA W(SBAW(P::J j
SllAW(WANT(S,P))
SBAW(WANT(S.P
'i'ablc 2 Thc plan infcrcnce rules.
Table 2 The plan inference rules.
127
partial plans may contain many cxpcctations sharing onc common alternative.)
partial plans may contain many expectations sharing one common alternative.)
F ~ c hpartial plan is assigncd a rating. which is dctcrmincd using hcuristics
Each partial plan is assigned a rating, which is determined using heuristics
dcscribcd in this section, that rcflccts how IikeIy it is to bc part of thc "correct"
described in this section, that renects how likely it is to be part of the "correct"
plan. 'fhcsc hcuristics arc based solcly on domain-indcpcndcnt rclations bctwccn
plan. These heuristics arc based solely on domain-independent relations between
actions. thcir bodics, prcconditions and cffccts. 'l'hc initial partial plans arc givcn a
actions. their bodies, preconditions and effects. The initial partial plans arc given a
rating of 1. 'lhc hcuristics arc cxprcsscd hcrc only in tcms of increasing and
rating of I. "111e heuristics arc expressed here only in terms of increasing and
dccrcasing thc ratings. 'lhc ~ t u i 1formulas
1
arc vcry simple and givcn in 'Table 3 at
decreasing lhe ratings. The actual formulas arc very simple and givcn in Table 3 at
Lhc cnd of thc sccrion. 'I'his is organi/.cd in this way lo cmphasizc thc fact that
the end of the section. This is organi/.ed in this way to emphasize the fact that
whilc raring changcs in thc indicalcd direction arc csscnrial to our modcl, wc fccl
while rating changes in the indicated direction arc essential to our model, we feel
drat some variation is possiblc in the actual figurcs.
that some variation is possible in the actual figures.
Finally, bcforc wc givc thc hcuristics, wc must rnakc thc distinction bctwccn
Finally, before we give the heuristics, we must make the distinction between
actions that arc cu1.rcn1Iy in C X C C U ~ I O I ~UIOSC
.
itwiiiling cxcci~Iion(pcndinp,), and
aClions that arc currently in execution, tllOse awaiting execution (pending), and
tjlosc that hate bccn cxccu~cd.In prlrlicular, thc obscr\.cd aclion is considcrcd to
tllllse that have been execuled. [n particular, the observed action is considered to
bc currcntly in cxccution. and any action which contains an action currcntly in
be currently in execution, and any action which contains an action currently in
cxccution in its body is also considcrcd to bc currcntly in cxccution.
execution in its body is also considered to be currently in execution.
ANiun Based Heuristics
Gcncrally, onc cxpccts agents to construct plans that thcy bclicvc thcy arc able to
Generally, one expects agents to construct plans that mey believe they arc able to
cxccutc: and thcy cxccutc lhcm only to achicvc goals that arc not prcscntly true.
execute: and mey execute mem only to achieve goals that arc not presently true.
?his givcs us two rules:
This gives us two rules:
011) Decrease the ruling of a parlial plait ifit confains
CHI) Decrease the rating oj a partial plall if it contains
on aclion whose precondirions arc false at the
an actioll whose preconditions are false at the
liltl~rhe aclion slarls execufing.
till1e the action starts executillg.
Allen
128
Expectation-Based Heuristics
111is
heuristic favors
Iliis hcuristic
favors those
thosc partial plans whose alternatives
altcrnativcs seem
sccrn most likely to
merge with thcir
their expectation.
expectation.
rncrgc
Increase the
par/ial plan if it;I contains
(H3)
(H3) Increase
rhr rating
rarbrg of
of a parrial
conlains
descriptions
drs~~ripriunsof
of objects
objerrs and relations
rrlalions i1l
in its
allrmotil'f
alrn-~rarisr thaI
rhar are unijiable
utliJiable wilh
with objects and
relalions
rclariorls ill
h its
irs expectation.
expeclarion.
The
unifiable is uscd
used here
the sense
the unification
algorithm
'lhc term
tcrm unili;~blc
hcrc in thc
scnsc of thc
unificatii~~i
algorithm found
found
rcrolutio~ithcorcrn
(scc [Nilsson
[Nilsson 1971]).
19711). Thus.
'lhus. ifan
if an alternative
altcrnativc involves
involves
in re'olution
theorem provers (sec
a train description,
train
description. those expectations
crpcctations that
Ui;~tinvolve
invulvc ao (compalihlc)
(o~rnp;~[ihlc)
[rain wilt
will be
hvorcd. Similarly,
Similarly. if an cxpcclation
cxpcchtion involves
involvcs a relation
rcl;~tionsuch as arrival time, its
favored.
alternative
jfitit also involves
altcrnative seems
scelns morc
rnorc favorable if
itivolvcs an etrrival
arrival time
timc relation.
relation.
The
'lhc final
final heuristic
hcuristic favors
favors alternatives
altcrnativcs that have
havc produced
produccd inferences
infercnccs that arc well
rated enough
bc applied.
ratcd
cnough to be
(H6)
Increase the
partial plan each time
Increase
rhe ralillg
rarirlg ofa
of a parrial
rime an
inference
i~ferencerule is applied
Finally, we end
cnd this section
scction with a few implemeotation
implcmcntatiun details. Each
Fach partial
Finally,
Heuristic H3 adds a fixed
plan has a ~ that is uscd
used to calculate its rating. Heuristic
129
factor of 5 for cach similarity found. (Thc actual valuc 5 has no effect on the
factor of 5 for each similarity found. (The actual value 5 has no effect on the
scarch except possibly for roundoff considcrations in thc rating calculation.) All
search except possibly for roundoff considerations in the rating calculation.) All
othcr hcuristics affect thc wcight by a rnultiplicativc constant.
other heuristics affect the weight by a multiplicative constant.
n l c tom1 wcight of all thc panial plans is uscd in calculating each plan's rating,
The total weight of all the partial plans is used in calculating each plan's rating.
Thc rating of a plan I' is simply the pcrccntagc ot'U~ctotal wcight rh;~tplan P has.
The rating of a plan P is simply the percentage of the total weight that plan P has.
'Ihc actual valucs of thc tnultiplicativc factors arc provided in 'Sable. 3.
The actual values of the multiplicative factors arc provided in Tabl~ 3.
-- -- .....
..- . . . -
Heuristic
Heuristic
HI
H1
H4
HS
H6
Preconditions false
Preconditions false
Effects true
H2
H4
l )cscriptioq
Ikscription
. .
Factor
Factor
.5
.5
.5
H4
liefcrcnt Idcntificd
Referent Identified
I.S
H4
Rcfercnt impossible
Referent impossible
.2
H5
In tcrscction found
Intersection found
I.S
H6
1.25
1.5
.2
1.5
1.25
Allen
Allen
130
130
containing only thc obscrvcd action and one of the original cxpcctations. 'To allow
containing only the observed action and one of the original expectations. To allow
for thc possibility of an uttcrancc that docs not fit an cxpcctation, a partial plan is
for the possibility of an utterance that docs not fit an expectation, a partial plan is
also constructed with a null cxpcctation.
also constructed with a null expectation.
'Thc actual tiisks tliat pcrfonn thc plan infcrcncing can be dividcd into three
The aClUal tasks that perfonn the plan inferencing can be divided into three
classcs: those U~ntspccify thc structure of thc plans, thosc diat identify objccts on
classes: those U1at specify the structure of the plans, those that identify objects on
thc plans, and thosc that control thc scarch.
the plans, and those that control the search.
D1
Most of thc control rncchanisms arc built into Lhc riiting schcmc and thc plan
Most of the control mechanisms arc built into the rating scheme and the plan
infcrcncing monitor. For instancc, cvcry tilnc an addition Is madc to an
infcrencing monitor. For instance, every time an addition Is made to an
altcrnativc, thc cxpcctations arc cxamincd for ncw sin~ilariticscauscd by the
alternative, the expectations arc examined for new similarities caused by the
addition. 'Ibis may causc a changc in thc ratings according to he
addition. This may cause a change in the ratings according to the
cxpcctation-bascd rating heuristics.
expectation-based rating heuristics.
Somc rncchanism must tcrminatc plan infcrcncing. 'lhis is donc by the task
Some mechanism must terminate plan infercncing. This is done by the task
Acccut. which is suggested by thc monitor whcncvcr an intcrscction of alternative
Accept, which is suggested by the monitor whenever an intersection of alternative
and an cxpcctation secms possiblc bccausc they contain unifiable specifications of
and an expectation seems possible because they contain unifiable specifications of
a stcp (i.e., an action or goal) in thc plan, or when thc plan with the null
a step (i.e., an action or goal) in the plan, or when the plan with the null
cxpcctation has twice as high a rating as any otllcr partial plan. Accc~tmust decide
expectation has twice as high a rating as any other partial plan. Accept must decide
whcthcr to tcrminatc thc plan infcrcncing or not. At prcscnt. thc tcrminalion
whether to terminate the plan inferencing or not. At present, the termination
condition is fairly simplc: if thc plan under consideration has a rating twice as high
condition is fairly simple: if the plan under consideration h<Js a rating twice as high
as any othcr partial plan. it is acccptcd. 'l'his is irnplc~nct~tcd
by suggesting a
as any other partial plan, it is accepted. This is implemented by suggesting a
will sit on thc pcnding list
dummy task at half the prcscnt [ask's rating. l'his ~ s k
dummy task at half the present (ask's rating. This task will sit Oil the pending list
until all bcttcr ratcd bsks havc cxccutcd. Whcn it comcs tu tllc top, if no other
until all better rated tasks have executed. When it comes tll the top, if no other
Acccnts havc bccn cxccutcd, the original altcrnativc is idcntificd as thc spcakcr's
Mcepts have been executed, the original alternative is identified as the speaker's
plan and plan infcrcncc stops. If another Accc~thas cxccutcd, thcre is an
plan and plan inference stops. If another Accept has executed, there is an
ambiguity, which can bc uscd to gcncratc cldrification subdialogues (scc [Allen
ambiguity, which can be used to generate clarification subdialogues (sec [Allen
and Pcrrault 19801).
and Perrault 1980)).
Allen
Allen
Task
Infer
132
Task
Katinx Formula
Rating Formula
Infer
.75 * R
.75 * R
Expand
Expand
.75 * K * f(n)
.75 * R * f(n)
whcrc Nn) = 1.25/n, whcrc n is thc numbcr of
where fln) = 1.25/n, where n is the number of
ncw pards1 plans to bc crcatcd by tllc Expand
new partial plans to be created by the Expand
Idcnlify
Accept
132
133
As a consequence,
conscqucncc, although not having a tickct
that the patrons
palrons know this. lis
ticket may be
train, the clerk docs not expect thc
an obstacle in a plan to board a train.
the patron to ask
(bccausc he can't providc
him for a tickct
ticket (because
provide one).
The
believes that both S and II
'l'hc above
abovc arc useful
uscful strategies
stratcgics if S bclicvcs
A agree
agrcc on what
thc obstacles
obstaclcs in the plan afC.
arc. Huwcvcr,
disagrcc on some
somc issue,
issuc, special
special
the
However, if S and A
A disagree
obstacles
addrcsscd. For example,
cxamplc, if II
A thinks that state X
obstaclcs occur that must be addressed.
already
plao, but S bclicvcs
believes X docs not hold,
already holds and is depending
dcpcnding on X in his plan,
thea
mention this
tbis fact to II.
Otherwise, A's
II's plan will fail
thcn S is obliged
obligcd to mcntion
A. Othcrwisc,
fail and S will
bc considered
considcrcd as uncooperative.
unc~ropcrativc.In thc
rcvcrsc case,
case. if A
true.
be
the reverse
A thinks state X
X is not true,
tell A, for A may not execute
but S believes
hclicvcs it in fact already holds.
holds, tJ,en
then S shoold
should tcll
cxccute
wiil not succced,
his
(valid) plan bccausc
because hc
he thinks it will
his(v;~lid)
succecd.
,
There
class of obstacle
tJ,at is truly difficult
onc cl:lss
obstaclc that
dilficult to detect
dctcct but should
shouh be
'I'hcrc is one
considcrcd. IIf there
thcrc arc two goals
goal5 in a plan and one
onc is just a~ slep
stcp towards achieving
achicving
considered.
the
tJ,en Ulc
tJ,e hct~ristics
heuristics abovc
above will indicate
the first is the
indicatc that thc'first
thc only obstacle,
obstacle.
thc second,
sccond, tllcn
Huwever.
need to ever
Iiuwcvcr, in Some
solnc cases,
cascs, achieving
achicving the second
sccond eliminates
climinatcs the nccd
cvcr (E\'cn
(hen
achiccc thc
cxa~nplc.if A and S arc in a lockcd
temporarily) achieve
the lirst. Rlr
For example,
locked room and A
asks S wllere
key to the
tlle door is, S might dcducc
deduce the
whcrc tJ,e
UIC kcy
Ule following
filllowi~iggoals:
whcrc key
kcy is" in order
ordcr to "Gel
"Gct thc
open."
"A know where
the door open."
If S opens Ulc
U,e door himsclf,
himself, say by some
key, thcn
then
somc means
mcans other
othcr than using the kcy,
knowing thc
kcy's location becomes
bccomcs irrelevant.
irrelevant. Howcvcr,
dctccting
the goal of knowing
the key's
However. detecting
such situations is quitc
quite dilficult
difficult and beyond tJ,e
the prcscnt
present work, for it
thc scope of thc
may involvc
involve considering the
the future
thc speaker's
spcakcr's plan to an arbitrary distance
distance into Lhc
with no wcll'dclincd
welldefined tennination
tcnnination condition.
l'hc algorithm used in thc
system involves
involvcs tcsting
statcmcnt in the
Tbe
the system
testing every goal statement
preferences:
plan. Obstacles arc selected
sclcctcd using the
thc following
following prcfcrences:
Allen
l34
lhc algorithm rcturns thc sct of obstaclcs in the highest prcfcrcncc class that is not
The algorithm returns the set of obstacles in the highest preference class that is not
crnpty.
empty.
nOARD(agcnt,train.slation)
applicability
L)13conditions:
'Al<r.I.OC(tSOURCr-:(train,station),
rain,loc),I~~:PAR'l'.'fIMF~train,timc)
DEPART.I.oC(train,loc). DEPART.T1MF.(train,time)
precondition: Al'(agcnf loc, time)
prccond ition: AT(agent, loc, time)
cffcct: ON l3OARIl(agcnt,uain)
cffcct: ON BOA RD(agent,train)
135
MEI-Y(agcnt. train, station)
train, station)
applicability condirions: IlESl'(train,station),
applicability
conditions: DEST(train,station).
hl~1ZIVF,.I,OC(train,l~~),
hllRIVE.'I'lM Eitrain,time)
ARRIVE.LOC(train,loc), ARRIVE.TlM E(train,time)
MEl-~nagent,
Allen
136
INFORM(spcaker,hearer,P) :J
WANT(speaker,INFORM(speaker,hearer,P))
Notc that this action cannot succccd without thc cooperation of the hcarcr, for
Notc that this action cannot succeed without the cooperation of the hearer, for
only thc hcarcr can changc his own bclicfs.
only the hearer can change his own beliefs.
In many cascs. agcnts rcnson about inform acts to bc pcrfor~ncd(by others or
In many cases. agents reason about infonn acts to be performed (by others or
thcrnsclvcs) whcrc thc information for thc propositional contcnt is not known at
themselves) where the in fonnation for the propositional content is not known at
thc lime of planning. For cxamplc, A may plan for S to info~mA whcthcr P is
the time of planning. For example. A may plan for S to infolm A whether P is
truc: A cannot plan for S to pcrfonn INFOlIM(S.A,I') sincc this asstlmcs that P is
true: 1\ cannot plan for S to perfonn IN~ORM(S.I\Y) since this assllmes that P is
tnlc. 'l'hus we nccd t w o other "vicws" of thc INFORM act: INFORMIF and
trlle. Thus we need two other "views" of the INFORM act: INFORMIF and
1 NFOIIM 1ZF.F. (I-row now on. t l ~ c"want-precondition" axioms will bc omitted):
INFORM REF. (From now on, the "want-precondition" axioms will be omitted):
and
INFOIIM R F.F(spcakcr, hcarcr, dcscription)
INFORM REF(speaker, hearer, description)
137
For the time bcing, wc will assume that all spccch acts are rcali7.cd in thcir literal
For the time being, we will assume that all speech aCIS arc realized in their literal
fonn, and thus can bc easily idcntificd from thc input utterance.
fonn. and tims can be easily identified from the input utterance.
This is a vcry simplc cxan~plclo give an idca of tllc plan infcrcncc pnxcss in
1l1is is a very simple example to give an idea of the plan inference process in
operation. It has bccn inodificd from t l ~ cway tllc acti~alsystem runs so it can bc
operation. It has been modified from the way tile aCllIal system TUns so it can be
dcscribcd in rcnns of I l l c simplc vicw of partial plans as onc cxpccration and one
described in tenns of the simple view of partial plans as one expectation and one
alternative.
alternative.
1-ct rhc obscrvcd action be:
Let the observed action be:
REQUEST(A,S,INFORMREF(S.A,timcl))
where
whcre
TIMf--{timcl) 1\ DEPART.TIME(trainl,timel)
TRAIN(trainl) 1\ DEST(trainl,WINDSOR).
Allen
Allen
138
138
ralcd similarly, but most would probably have little in common with the
rated similarly, but most would probably have little in common with the
uttcrancc, and so would start o f f poorly ratcd. 'Jhc null cxpcctation plan surts
utterance, and so would start off poorly rated. The null expectation plan starts
with its tokcn wcight of 5.
with its token weight of S.
'I'hc initial ratings arc calculated as follows. Thcrc is a total weight of 40
The initial ratings arc calculated as follows. There is a total weight of 40
assigncd to thc Lhrcc partial plans. 'l'hc board plan, with a wcight of 25, rcccivcs a
assigned to the three partial plans. The board plan, with a weight of 25, receives a
rating of 62; the mcct plan, with 10, rcccivcs 24: and tllc null plan rcccivcs 12.
rating of 62; the meet plan, with 10, receives 24; and the null plan receives 12.
Aftcr this initial prtxcssing, thc partial plans arc as in Figures la and lb.
After this initial processing, the partial plans arc as in Figures la and lb.
'I'hc initial tasks suggcstcd are:
The initial tasks suggested are:
1) ldcntify rhc train in thc IIOARI) plan, ratcd 62.
I) Identify the train in the BOARD plan, rated 62.
ldcntifying thc train in rhc 110AR13 plan succeeds, the assumption bcing made
Identifying the train in the BOARD plan succeeds, the assumption being made
that the ncxt train leaving is thc onc intended unlcss thc spcakcr says otherwise.
tl1at the next train leaving is me onc intended unless the speaker says otherwise.
I h u s train1 is idcntificd with an cntry in t t ~ cschcdulc, say EN'I'RY-TRAIN17.
Thus trainl is identified with an entry in the schedule, say ENTRYTRAIN17.
'['his providcs further cvidcncc [hat thc HOAR11 plan is the correct onc, increasing
This provides further evidence that the BOARD plan is the correct one, increasing
Ulc 110AR11 plan's rating to71 (wcight 37) at thc cxpcnsc of thc othkr partial plan
the BOA RD plan's rating to 71 (weight 37) at the expense of the other partial plan
ratings.
ratings.
139
'lllC
Tile cxpcctation:
111C expectation:
~AkU(A,trainl.'l'ORON'rO)
I..
DOAkD(A,trainl.l'ORONTO)
cnabk
ble
AT(A,1ocI,timeI}
'Ibe alternative:
(I) REQUI~r(A,S.JNFORMREF(S,A.timcl):
I~
whcre
where
'I'RiZlN(tr;~inl)A SOU I~Cll(traiiiI,'~OKONNI'O)
A
TRAIN(trainl) /\l?l~Sl'(tr;li~il,WINI)SOK)
SOURCE(trainl,TORONTO) /\
I)J:ST(tmin[:(triiin
I,WINDSOR)
-- I )i~l'~~l<'l'.'I'l~.l
1,titncl)
-I)n''\RT.TiMI~lrainl,timcl)
-- l)lSIJA f~1'.L.OC(lrainI,loc~)
-- DEI'ART.LOC(lrainl,loc1)
--
'Ihc&r t;ak 011 tl~cUOi\l<I) plan (oow 13tcd 53) is cxcctitcd. Inferring from (2)
'1l1C Infer task on the BOARD plan (now rOlLed 53) i~ exeClited. Infcrring from (2)
in thc I10ARI3 plan, tllc cffcct of tl~c1<11QUk:S;'l' finds only Ihc w;~ilt-actionrule
in the BOARD plan, the effc~l of thi: REQU EST finds only the wilnt"[Iction rule
i~pplicablc. i\n F x I~ l ~
s kis ~
suggcstcd (r;lrcd 53). which ilnmcdiatcly cxccutes,
applicable. An Exp;Jnd L;\sk is suggesled (raled 53). whichimm('diatcly executes,
sincc it is thc bcsr ratc11 task, adding tlic action
since it is the best raled task, adding the action
(3) INFOlu'1REF(S,I\,timel)
140
Allen
'I'hc cxpcctation:
The expectation:
MIiEl'(A.tnin2;TOHONTO)
MEHl'(A.train2.TORONTO)
1aub~
AT(A.Ioc2,time2)
'Ihc altcmtive:
The alternative:
(I) KlQUI:SI'(A,S,l NFOKMHEt:(S.A,limcJ)
(I) REQUPSI'(A.S,INFORMREF(S.A.lime3):
!-,
[cflmt
'I'hc null tjlnn (cltcd 12) contains only thc altcrnstivc as dcscribcd abovc in the
The ill!!! plnn (rated 12) contains only the alternative as described above in the
MEt<I' plan,
MEET plan.
141
nOARD(A,tminI.TORONTO)
1,n,h1,
AT(A,Ioc1,timel)
1know
KNOWlZ F.F(A.timc1)
KNO\VRF.F(A.limcl)
I1
eff"ct
c ffcct
INFOIZM I{l:t:(S,A,tirncl)
INFORM REF(S,A,timcl)
want-cnable
1wanl-enable
WA~~I'(S.INI:OI~~~IZI-:F(S.A,~~I~~~))
\V ANT(S.lNI ;ORMREF(S.A,timcI
f effeCI
cffcct
---
An illtcl.csling p~~oblcln
rcrn~insas to how S coulti have p1;111ncdlo ;tchicuc
An intereslin3 probklll rcmJi ns as to how S ('(1uld have pJanncd 10 achieve
both goiils with a singfc r~ttcrdnccsuch as
Allen
142
" 1600 at Gate 7"
"1600 at Gate 7"
How docs onc construct a plan to achicvc multiple goals simultancously? It may
How does one construct a plan to achieve multiple goals simultaneously? It may
bc possiblc that thc specifications would bc simply Scnl to a gcncration phase (sce
be possible that the specifications would be simply sent to a generation phase (see
Chaptcr 4 of this volumc) which could dctcct thc similaritics and collapsc thc two
Chapler 4 of this volume) which could detect the similarities and collapse the two
utteranccs into one.
ullerances into one.
In this scction wc considcr nnswcring a qucstion such as "13ocs thc Windsor train
In this section we consider answering a question such as "Docs the Windsor train
lcavc at 4?". Assumc that this uttcrnncc idcnrifics thc action
leave al 41", Assume lh,lt this utterance identifies the action
REQUEST(A.S,INFORM IF(S,A,DEPART.TIMI:{train1 ,1600)
where
TR1\IN(train1) A CITY-REI.ATJON(trainl,Windsor)
CIIY-IZEI.A'rION is a paltcrn that will match thc prcdicatc names DEST and
CITYRELATION is a pattern that will match the predicate names DEST and
SOUI<CE. I h c infcrc~lccsfrom this action will cvcntually producc the goal (by
SOURCE. The inferences from this action will eventually produce the goal (by
applying thc action-cffcct mlc, thc want action rutc, and thcn the action-effect
applying the action-effect rule, the want action rule, and then the action-effect
rule).
rule).
h KNOWIF I.EAVE(trainl,1600).
1\ KNOWIF I.EAVJ--:(tra in 1,1600).
143
'Ihc first thrcc of thcsc possibititics do not lcad to rensonahlc plans (i.~.,they do
The Ill'st three of these possibilities do not lead to reasonahle plans (Le., they do
not allow infcrcnccs that Icad to a mcrgc with an cxpcclation). For cxamplc, none
not allow inferences that lead to a merge with an expectation). For example, none
of UIC cxpcctations involvcs a subgoal of knowing what train lcavcs at 4. l'hc latter
of the expect.ations involves a subgoal of knowing what train leaves at. 4. The latter
goal lcads to a rcasonablc plan directly: applying d ~ cknow-tcrrn rulc from it
goal leads to a reasonable plan directly: applying the know-term rule from it
prtduccs a conncction to thc tl~irdargumcnt of thc prccondition to tllc IlOARD
produces a connection t.o the t.hird argument of the precondition to the BOARD
action, nalncly
action, namely
AT(A,locl.timcl)
where
whcre
DEPA RT.I.OC(train UncI)
DEPARTTIMl-:(trainl,timel).
whcre DEPAR'IS.'flMlXtrain1,timel).
where DEPART.TIME(trainl,timel).
If the answcr to the original qucry were "ycs," thcn both thcse goals would be
If the answer to the original query were "yes," then both these goals would be
accornplishcd by answering thc query as a ycs/no question. Rut if the answer is
accomplished by answering the query as a yes/no question. But if the answer is
"no," only the first obstaclc is achieved by tl~cycs/no answcr. The second
"no," only the first obstacle is achieved by t.he yes/no answer. The second
obstaclc accounts for thc cxtra information.
obstacle account.s for the extra information.
This example rcflccts a gcncral point. When a person asks about the mth of
This example renects a general point. When a person asks abollt the truth of
somc proposilion that happens to bc false, s/hc often is intcrcstcd in a rclatcd, m e
some proposition that happens to be false, s/he often is interested in a related, true
proposition. 'I'hc main problcm is dctcnnining how to modify thc original
proposit.ion. 'Inc main problem is det.ennining how to modify the original
proposition to makc it truc. Our fcding is that with rcspcct to a givcn sct ofgoals.
proposit.ion to make it true. Our feeling is that, wit.h respect. to a given set of goals,
Lhc objects refcrrcd to by Lhc tcrrns in a proposition can usually be ordcrcd by
the objects referred to by the t.erms in a proposition can usually be ordered by
somc criteria rcflccting thcir importancc in the plan. 'I'hc morc important the
some crit.eria reflecting their importance in the plan. 'Inc more important the
term, the less likely it is that it is what is wrong in thc proposition. We suggest that
term, the less likely it is that it is what is wrong in the proposit.ion. We suggest that
a major indicator of a lcrm's importancc is how high up in the hierarchy of
a major indieat.or of a term's importance is how high up in the hierarchy of
subgoals Lhe term is first introduced.
subgoals the term is first introduced.
In the exarnplc above, the train description was introduced in the top lcvcl
In the example above, the train description was introduced in the top level
goal of boarding the train. while the departure time was introduccd (in terms of
goal of boarding the train. while the departure time was introduced (in terms of
Allen
Allen
144
144
thc train) in thc subgoal created by thc precondition of thc boarding action. 'lhis
the train) in the subgoal created by the precondition of the boarding action. 'Ibis
approach sccrns to bc quitc gcncral, As an cxarnplc, considcr a co-operative
approach seems to be quite general. I\.s an example, consider a cooperative
rcsponsc citcd by Kaplan (Chapter 3 of this volurnc).
response cited by Kaplan (Chapter 3 of this volume).
( l a ) Is John a scnior?
(la) Is (Ib)
JohnNo,
a senior?
hc's a junior
(lb) No, he's ajunior
[t rnakcs littlc scnsc to considcr an answcr out of contcxt. For instance, if 1 am a
It makes little sense to consider an answer out of context. For instance, if I am a
profcssor nccding a scnior to do somc projcct for mc, a morc appropriatc rcsponse
professor needing a senior to do some project for me, a more appropriate response
to my qucry (la) would be
to my query (1a) would be
Thc indircct spccch act probicm is bcst introduced by an example. Consider the
The indirect speech act problem is best introduced by an example. Consider the
plan that must bc infcrrcd in order to answcr (1)with (2):
plan that must be inferred in order to answer (1) with (2):
Do you know when the Windsor train leaves?
Yes, at 3: 15.
(2)
S:
(2) S:
Yes, at 3: 15.
145
rule)
to achieve
acliicvc the goal (actioneffect
(action-cffcct rulc)
(6) A
A KNOWI<I'.P"dcparturc
KNOWREF "departure time"
tlcncc.
( I ) is ollen
oftcn rcfcrrcd
snccch act [Searle
[Scarlc 1975].
19751.
Hence. (I)
referred to ;IS
as an indircct
indirect speech
Al[hougli
mcchanis~nsalreody
;~lrcadydescribed
dcscribcd arc capi~blc
answering such
Although Uic
tl,e mechanisms
capable of answering
correctly. lhcy
following cases:
cascs:
indirccl acts correctly,
indirect
they cannot distinguish bctwcen
between thc
the two following
(a)
mercly expecting
cxpccting a yes/no
ycs/no answer.
answer, but S
(a) A soid
said (I) mcrcly
answtrcd with the extra
cxtra infi~r~nation
bc helpful;
helpful;
answucd
information in order to be
(b) ,\
A said (I)
(1) intending
intcnding that S deduce
dcducc his plan and reolize
realize that
A
dcparturc time.
/\ really wants to know thc
the departure
Allen
Allen
146
146
rccogni7.c that thc spcakcr intcndcd to pcrform that act, as in [SCarlc 19691.
recognize that tile speaker intended to perform that act, as in (Searle 1969].
'Ibis rccognition of intention condition can only bc accomplished by
'Ibis recognition of intention condition can only be accomplished by
introducing lhc notion of sharcd knowlcdgc, o r mutual bclicf. bctwccn Uie
introducing tile notion of shared knowledge, or mutual belief, between the
spcakcr and hcarcr. Mutiial bclicf is thc common sct of bclicfs bctwccn the
speaker and hearer. Mutual belief is the common set of beliefs between the
spcakcr and hcarcr h a t cach knows thc othcr knows. l'hus. cach agclit is aware of
speaker and hearer tilat e,lCh knows tile other knows. Thus. each agent is aware of
what infcrcnccs the othcr could lnakc using his common knowlcdgc. Since both
what inferences the other could make using this common knowledge. Since both
agcnts arc awarc of thc possiblc infcrcnccs that can be madc from an uttcrancc in
agents arc aware of the possible infercnces that can be made from an utterance in
a givcn contcxt, thc spcakcr should sclcct his uttcranccs so Ulat only Lhc dcsircd
a given context, the speaker should select his utterances so that only the desired
infcrcnccs arc madc. 'l'hc hcarcr, knowing this, may dccidc that Lhc spcaker
inferences arc made. The hearer, knowing this. may decide tilat the speaker
intcndcd him to makc tlrc i~ifcrcnccshc ~nadc. In fact, at prcscnt ;III infcrcnccs
intended him to make the inferences he made. In fact, at present, all inferences
made in lhis contcxt arc considcrcd to bc intcndcd by thc spcakcr. As a rcsult,
made in this context arc considered to be intended by the speaker. As a result,
some inconscqucntial conclusions may bc assigncd unwarmntcd signilicancc. This
some inconsequential conclusions may be assigned unwarranted significance. This
has not Icd to uouhlc in rhc cxarnplcs wc havc studied so far. To rcmovc this
has not led to trouble in tlle examples we have studied so far. To remove this
problcm will rcquirc considcrablc work on thc nature of intcntions.
problem will require considerable work on tile nature of intentions.
lictt~rningLO Ihc cxamplc ahovc, wc can now cxprcss lhc differcncc hctween
Returning Lo tlle example above, we can now express tlle difference hetwecn
the two intcrprcrations of "110you know whcn thc Windsor train Icavcs?" with the
the two interpretations of" Do you know when tllc Windsor train leaves?" with the
aid of thc following diagram (Figure 3):
aid oflhe following diagram (Figure 3):
147
A
W B ~ I S to
10
A wants
know the
lhe answer
know
Ie
YIN
la V
R I quesllon
qualh
(4)
(51
(5)
181
(5)
...
A
W B ~
A wants
~ S
\0 know the
loknowlne
departure
deparlure lime
lime
S's private
S'r
privale
beliefs
belieis about
about A
(al
(81
fb)
fb)
Mulually known
known
Mutually
belween S &
8A
between
SURFACE
SURFACE
YESlNO
YESINO
QUESTION
QUESTION
(4)
(4)
(5)
151
(6)
(8)
----------+
----------....
Awanls
A
wants
lo know lhe
10 know the
deparlure lime
departure
time
I:igurc J
3 The
'I'hc two itltcrprctJtions
it~lcrprctotionsof
Figure
"Jo
when (lie
Win&wr train
" ~ l uyou know !~IICII
lJlc Wiridzor
Uain kaves?"
Icavcs?"
Notcs: '111e
' l l l c hodi'Olllallincs
Ilo!-iro~ll;~ll i ~ l c sreprescllt
rc;)rcicnt phn
~I;III
inkrcncil~g,the
dic numbers
:lu:nhcrs referring
rcfcrrillg to
Notes:
illlcrcncing.
rhc sicps
i n the
tllc iniLi;tl
iniii;il di:c~lr.;ion
of illis cXilmp!c.
cx;irnplc. 'file
' l h c vcrrrc:ll
;Irron,s rrpr~~:cnt
rcprcucnt the
the
st('p~ in
tli~;ctls';;on oftlli.,;;
vCflical'lrrows
illrcrcncinp lIsing
using shared
dl;lrcd knowledge,
knu\?lcdpi., and Ilcncc
I~cnccdetermine
dctcrlninc me
ihc speech act
infcrcncing
in:crprctJtion.
i~~:c~rcratio~i.
I:II~
of
(~lld of
ligurc. there
thcrc arc
itrc ta,o
palhs from the
tllc sh;lrcd
s11;lrcd knowledge
knowlcdgc that A
askcd a
IInn this figure.
two paths
A Js1<cd
yes!no questioo
II want.s
ycs/llr~
qucslic~n(bolla",
boll on^ left
lcrt of
o f Figure
Mjillre 3),
3). to
ro S's
S's bbelief
c l i c f dwc
t l : ~ rA
w.?nr.s 10 know che
the
dqlarlurc timc
lrup right of
u f 1:iglirc
idc111ifii.d a:;
a:; p:ilhs
(a) and (b),
(17).
d"pnrlure
time (tup
FigLirc 3). Hot11
Both pdlhs,
paths, idcntirtcu
paths (a)
involve
the inference:;
throllgl, sleps
ril~1 (a),
tlses
involvc rlic
inl'crcncc:~ihrol~gli
slcps (4),
(4). (5). anu
and (0),
( 6 ) . IIlowever,
louevcr, in
i n pilili
(a). S uscs
his
private Knowledge
about
A and hence
ill((~ndcd only a yes/no
liis pri\;~!c
kn~~\v!cdgc
hoot ,t
Ilcncc bclic\cs thar i\
.A intcndcd
ycs/ilo
qucstilJll.
the same:
knowlcdf~c
1~~~1.t h(11),
(I)). S made
n1:tdc ilic
sarnc illrl'rcmTs
it~rcrcnrcsusing
usi~lgslwrcd
sl16rctl irt~~\vIcd::c
q ~ ~ c s ~IIninv path
and lirltcc
cc>~icludcdl!Jat
tint A
A ii1t:nilcii
S If}
LIJ rCf:llgni/.C'
~cccgni/.cgoal
g11;11 (b).
(IJ).
bcrwcc~lthc:il,
bctw('cil
them, dnd
h~I1CC concluded
ii1t~nd('d S
Thus,
rl;prc:\cnt~; t11c
the literal
intl~rprclat[on, :11111
the
'l'lius, poth
pat11 (a)
(3) represent:
1ilrr;ll int'~i-prel.~tion.
:111d path
path (1)
(I))represents
rcprcscnts Ule
148
Allen
indirect. Accausc of S's disposition towards hclpful hchavior, his rcsponsc may be
indirect. Ikcause of S's disposition towards helpful behavior, his response may be
the samc in both of dlcsc cascs. Howcvcr, as mcntioncd abovc, if S did know thc
the same in both of these cases. However, as mentioned above, if S did know the
dcparturc time but was not allowcd to tcll A, h c n S's response would bc different
departure time but was not allowed to tell A, then S's response would be different
in C ~ C I I of thc intcrprctations.
in each of the interpretations.
l'hcrc is a fundamental assumption that dctcrmincs which of thcsc paths
There is a fundamental assumption that determines which of these paths
will bc takcn as thc corrcct intcrprctation. 'Ihis is that
will he taken as the correct interpretation. This is that
In othcr word$. Llic licarcr will continuc to usc shared knowlcdgc to make the
In other words, Ole hearer will continue to usc shared knowledge to make the
infcrcnccs for as long as possible.
inferences for as Ipng as possible.
All of thcsc issucs ail1 bc disci~sscd in dctail in tlic following sections. In
All of these issues will be discussed in detail in the following sections. In
partict~lar.Scction 1.5.1 introduces thc new spcccl~act dcfinitions and outlincs the
pitrticular. Section 2.5.1 introduces the new speech act definitions and outlines the
rolc of inutual bclicf: Scclion 2.5.2 dcfincs thc s11rfi1~~
linguistic acts; Scction 2.5.3
role of mutual belief: Section 2.5.2 defines the surface linguistic acts: Section 2.5.3
dcwribcs thc cxrcndcd plan i:ifcrcncc systc~nthat can ~ ~ the
s c ncw spccch act
describes the extended plan in ference system that can lise the new speech act
dcfinitions; Scction 2.5.4 prcscnts soinc examplcs: and Scction 2.5.5 discusscs
definitions; Section 2.5.4 presents some examples: and Section 2.5.5 discusses
sbmc remaining issucs.
some remaining issues.
2.5.1 Spccch Acts and hlutualI3elicf
2.5.1 Speech Acts and l\lutual Belief
149
150
Al1en
151
a.
2.5.2
2.5.2 Surface
Surhce Linguistic
1.inguistic Acts
We
reason about languagc
language itself.
itsclf. lhcse
These
thc actions
actions that
thnt allow us to rcason
Wc now turn to the
will
fonn of
of uttcranccs
utterances and the
will make
make the
thc connection
conncctil~nbetween
bctwccn the
rhc linguistic form
intentions
Obviously, thcrc
there arc many
intcntions of the
the speaker
spcakcr producing these
thcsc utterances.
uttcranccs. Obviously,
ways
perceived as bcing
being the
ways in
in which l1le
Uic fi.mn
fbnn of an utterance
uttcrancc influences what is pcrccivcd
speaker's
the most obvious in this
havc selected
sclccted only a few of UIC
spcakcr's intentions.
intcntions. We have
analysis. Surprisingly
Surprisingly enough,
cnough, the range
rangc of behavior
bchavior analyzablc
analysis.
analyzable with tlicm
them is quite
rich.
rich.
~l'hcreisis a basic
basic assumption
assumption underlying the
thc fi~llowing
There
following analysis: spcakcrs
speakers always
mcan what they
thcy literally say. and it is only by Uiis
mean
this litcral
literal mcaning
meaning that any indirect
meaning can he
bc inferred.
inferred. This
'l'his is the
thc position dcfcnded
meaning
defended by [Scarlc
[Searle 19751.
1975]. Given
d ~ i sassumption,
assumption. the
thc recognition
rccognition of the
thc literal mcaning
this
meaning providcs
provides us with aa set of
of
intcntions from
from which plan inferencing
infcrcncing rnny
may proceed.
intentions
proceed,
Thcsc initial
initial intentions
intcntions arc summari7.ed
summarized as surfacc
These
surface linguistic acts. 'There
There are
two of these
thcsc acts
acts of great
grcat relevance
rclcvance to our study which corrcspond
two
correspond to literal
litera!
INFORMS and
and REQUESTs,
I<F.UUk:S:.STs,and arc defined
dcfincd as follows:
INFORMs
follows:
SURFACE-INFORM(speaker,hearer,prop)
MB(hcsrcr, speaker,
cffcct: MR(hearer,
effect:
spcakcr WANT
speaker
hcarcr KNOW prop)
hearer
SURFACE-REQUES'!'(speaker,hearer,action)
cffcct: MB(hearer,
Mll(hcarcr, speaker,
spcakcr.
effect:
spcakcr WANT
speaker
UO action)
action)
hcarcr DO
hearer
Ihcsc actions
actions arc indicated
indicated by a number
numbcr of linguistic dcviccs,
These
devices, most
importantly the
thc mood
mwd of the
thc sentence and thc
importantly
the use
usc of ccmin
cenain "clue"
"clue" words.
words, For
Allen
152
With the ncw action definitions givcn in thc last two sections, it is not obvious that
With the new action definitions given in the last two sections, it is not obvious that
t l ~ cplan infcrcncc rules can still apply. However, with thc addition of onc new
the plan inference rules can still apply. However, with the addition of one new
infcrcnce rulc, and two new control heuristics, the rcst of thc plan inference
inference rule, and two new control heuristics, the rest of the plan inference
proccss can bc used as is.
process can be used as is.
Bcforc we introduce the ncw itcms, considcr what the cxtcndcd plan inference
Defore we introduce the new items, consider what the extended plan inference
process should look like. Thc spcaker speaks, thcrcby cxccuting a surface
process should look like. The speaker speaks, thereby executing a surface
linguistic act which is idcntifiablc from thc syntactic form of thc uttcrancc. Ihe
linguistic act which is identifiable from the syntactic form of the utterance. The
hcarcr bclicvcs the action was dclibcratc, so in fact the spcakcr intcnded to
hearer believes the action was deliberate, so in fact the speaker intended to
perform thc surfacc act, and thus intcndcd to achicvc its cffccts (action-cffcct
perform the surface act, and thus intended to achieve its effects (action-effect
infcrcnce). l'hus we have a bclief of the form
inference). Thus we have a beliefafthe fonn
153
hcarcr BEI.1 EVE
hearer BELIEVE
spcnkcr WANT
speaker WANT
M Mhcarcr, spcakel,
MIl(hearer,
speaker,
WANT X)
spcakcr
speaker WANT X)
where X depends on thc surfacc act. From this point, wc want thc hcarcr to bc
where X depends on the surface act. From this point, we want the hearer to be
ablc to perform plan infcrcnccs bascd on what hc bclicvcs is ~nirtuallybclicved.
able to perform plan inferences based on what he believes is mutually believed.
Wc can dcfinc this modc of reasoning in tcnns of the alrcady cxisting plan
We can define this mode of reasoning in tenns of the already existing plan
infercncc rulcs as follows:
inference rules as follows:
The Afurual Relief Rule
The Mutual ReliefRule
If
If
hcarcr I3EI,IEVE
hearer BELIEVE
spcakcr WANT
speaker WANT
M IYhcnrer,speaker,
MB(henrer,speaker,
WA Nl'(spcakcr,x))
WI\NT(speakcr,x
and
and
hearer BEI.IEVE
hearer BELIEVE
speakcr IIE1,IEVE
speaker BELIEVE
M ll(hcarer.spcaker,
MB(hearcr,speaker,
SIIAW(x) IiSBAW(y))
SBI\W(x) ::J i SOA W(y
then
hearer BELIEVE
hearer BELIEVE
spcakcr WANT
speaker WANT
M I3(hcarcr,spcaker.
MI3(hcarcr,speaker,
WANT(spcakcr,y))
WI\NT(speaker,y
In other words, if the hcarcr bclicvcs the speaker wants it to bc shared knowledge
In other words. if the hearer believes the speaker wants it to be shared knowledge
that the speaker intends X, and if the hcarcr bclicvcs the spcakcr bdicvcs it is
that the spenker intends X, and if the hearer believes the speaker believes it is
sharcd knowlcdgc that intending X infers intending Y, then thc hcarcr may
shared knowledge that intending X infers intending Y, then the hearer may
concludc the spcakcr wants it to bc sharcd knowlcdgc that thc spcakcr intends Y.
conclude the speaker want~ it to be shared knowledge that the speaker intends Y.
'Ihus thc hcarcr may infcr various further intcntions of the spcakcr using what
Thus the hearer may infer various further intentions of the speaker using what
hc thinks thc spcakcr bclicvcs is mutually bclicvcd bctwccn thcm. At some time
he thinks the speaker believes is mutually believed between them. At some time
(lo hc discusscd bclow), the hcarcr ceases to infer ncw intentions and idc~~tifies
a
(to be discussed below), the hearer ceases to infer new intentions and identifies a
spccch act via its body. 'lhc cffcct of the speech act pmduccs an intention that the
speech act via its body. The effect of the speech act produces an intention that the
hcarcr privatcly bclicves thc spcakcr has. From this, the hcarcr may infcr other
hearer privately believes the speaker has. From this, the hearer may infer other
intentions of the speakcr bascd on his private knowlcdgc of the speaker.
intentions of the speaker based on his private knowledge of the speaker.
Once this terminates, the plan can bc inspected for obstacles as usual.
Once this terminates, the plan can be inspected for obstacles as usual.
Allen
Allen
154
154
llowcvcr, thc hcarcr is obfigcd tu address thc obstaclcs that arc mutually bclicved
Howcver, thc hearer is obliged to address thc obstacles that are mutually believed
bctwccn thc spcakcr and hcarcr. l ' l ~ cobstaclcs hascd on Ihc. hcarcr's private
between the speaker and hearer. Thc or-stacles hascd on thc hearer's private
bclicfs may or may not bc rlddrcacd, dcpcnding on how hclpful thc hcarcr feels.
beliefs mayor may not bc addrcssed, depending on how helpful the hearer feels.
'I'he qucstion rcmains as to how long tlic hcarcr contiliucs using mutual
The question remains as to how long the hearer continues using mutual
bclicfs. Wc havc alrcady mcntioncd h a t iilfcrcncing within mutual bclicf will
beliefs. We have already mentioned lhat infcrencing within mutual belief will
continuc as long as possible, LC., that intcntion bill bc attributed if it can bc. ' h i s
continue as long as possible, i.e., that intention will be attributed if it can be. This
is capturcd by Lhc following raling hcuristics,
is captured by Ule following rating heuristics.
The Level ()flnfercncing Rules
1. Giam a cltoice bclwecn an itfererice rhar cut~!it~rres
1. Gillen a choice be/ween an inference that continues
usitlg nlulual b~lief:n t ~ dan itference rha~does
using mutual belief, and an inference Ihai does
11o1,fo\lor /hc one using rnurual belie$
1101,
The sccond rulc is important and is justified as follows: the infcrcnccs taken
The second rule is important and is justified as follows: the inferences tak.en
using mutual bclicf arc bkcn to havc bccn intcndcd by thc spcakcr. If, givcn the
u~ing mutual belief are taken to have been intended by the speaker. If, given the
mutual knowlcdgc, thcre arc multiple paths that thc hcarcr could takc, thcn the
mutual knowledge, there are multiple paths that the hearer could take, then the
hcarcr can no Iongcr attributc intcntion to thc spcakcr as hc has no rncans of
hearer can no longer attribute intention to the speaker as he has no means of
choosing which path thc spcakcr intcndcd. I h u s it is rhc spcakcr's duty to ensure
choosing which path the speaker intended. 111US it is the speak.er's duty to cnsure
that the infcrcnce path to his intcnded goal is casily found by the hearer;
that the inference path to his intended goal is easily found by the hearer;
othcrwisc the hearcr will simply takc thc literal, or some intermediate
otherwise the hearer will simply take the literal, or some intcnnediate
intcrprctation.
interpretation.
'Ihc above tnight sound quitc complicatcd to implcmcnt but, in fact, is not, if
'Ibe above might sound quite complicated to implement but, in fact. is not, if
one vicws t l ~ cplan infercnccs as bascd on tllc spcakcr's goals. In this schcme,
one views the plan inferences as based on Ule speaker's goals. In this scheme,
thcrc arc two parameters to lhc infcrcncc process; the context in which goals are
there are two parameters to the inference process; the context in which goals are
bcing inferred, and thc contcxt in wliich the infcrcnce rulcs arc bcing evaluated,
being inferred. and the context in which the inference rules are being evaluated.
'nus, for basic plan recognition, Lllc goal contcxt is what the system bclicvcs the
Thus, for basic plan recognition, the goal context is what the system believes the
spcakcr wants (i.c.. "SIMW(-)"), and the cvaluation contcxt is what the system
speaker wanL<; (Le., "SIlAW(-n, and the evaluation context is what the system
bclicvcs thc spcakcr bclicvcs hc., "Sl)Al)(-)"). In the cxtcndcd modc infcrcncing,
believes the speaker believes (i.e., "SBI\B(-)"). In the extended mode inferencing,
thc goal contcxt is what thc systcm bclicvcs thc spcakcr wants to bc mutually
the goal context is what the system believes the speaker wants to be mutually
bclicvcd that thc spcakcr wants (i.c.. "SIiAW(Mll(S.A.AW(-)))"), and h e
believed that the speaker wants (i.e., "SHI\ W(MB(S,A,AW(-)"), and the
cvaluation conlcxt is what thc systcrn bclicvcs thc spcakcr bclicvcs is n~utually
evaluation context is what the system believes the speaker believes is, mutually
bclicved (it., "SIIAIYMI%(S.A.-))"). Given Cohcn's rcprcscntation of bclicf and
believed (i.c., "SHI\B(MB(S,A,-)"). Given Cohen's representation of belicf and
mutual bclicf, thcsc contcxts arc casily constructcd and manipulated.
mutual belief, these contexts are easily constructed and manipulated.
155
2.5.4 Exa~nplesof lndircct Acts
2.5.4 Examples of Indirect Acts
"
cffcct: HAVE(bcncficiary,objcct)
HAVE(beneficiarY,object)
REACH(agcnt,object)
prccondilion: NEhR(agcntobjcc t)
precondition: NEAR{agent,objcct)
effect:
cffcct: HAVE(agcnt,objcct)
HAVE(agent.object)
"SBAW(MI3(S.A,A W (Pass(S.A,Salt."
Allen
156
(2.1)
Ihis cxamplc involvcs plan inferences using mutual bclicf bctwccn stcps (2.2) and
This example involves plan inferences using mutual belief between steps (2.2) and
(2.5).
(2.5).
Thc litcral intcrprctation of this utlcrance would arise from identifying a
The literal interpretation of this utterance would arise from identifying a
spccch act from stcp (2.2). Thc lcvcl of infcrcncing heuristic favors Lhc indirect
speech act from step (2.2). The level of inferencing heuristic favors the indirect
interpretation, however. Notc that, cvcn without the lcvel of infcrcncing heuristic,
interpretation, however. Note that, even without the level of inferencing heuristic,
the litcral intcrprctation would bc disfavorcd, for it invoIvcs A asking a question
the literal interpretation would be disfavored, for it involves A asking a question
to which both S and A bclicvc A already knows thc answcr. Thc know-negative
to which both S and A believe A already knows the answer. The know-negative
inference is possiblc from step (2.3) (producing SflARF.D(AW(not
inference is possible from step (2.3) (producing SHARED(AW(not
tlavc(S,Salt)))). This path is disfavarcd sincc it is mutually bclicvcd that A knows
Have(S,Salt)))). This path is disfavored since it is mutually believed that A knows
that S has the salt Note that it is still possiblc to dcscribc this utterance as a
that S has the salt Note that it is still possible to describe this utterance as a
ycs/no question as well as a rcqucst to pass thc salt. The urterancc can in fact be
yes/no question as well as a request to pass the salt The utterance can in fact be
both spccch acts simultancously. In gcncral, however, thc indirect intcrprctation is
both speech acts simultaneously. In general, however, the indirect interpretation is
the more uscful description of h e spcakcfs intentions.
the more useful description of the speaker's intentions.
Anothcr example, "Arc you ncar thc salt?". can bc handled in a similar
Aoother example, "Arc you ncar the salt?", can be handled in a similar
manner. Using shared knowlcdge, S would infcr that being near the salt enables
manner. Using shared knowledge, S would infer that being ncar the salt enables
him to rcach it which causes him to have the salt, which is a prercquisitc to passing
him to rcach it which causcs him to have the salt, which is a prerequisite to passing
it, which is an expcctcd goal for A in this context.
it, which is an expectcd goal for A in this context.
Example 111: "I Warrt lo Have the Salt'
Example III: "I Want 10 Have the Salt'
Ihis cxamplc rcquircs S to recognize that A intcndcd him to plan bascd on a goal
1bi5 example requires S to recognize that A intended him to plan based on a goal
inferred from A's uttcrancc. l h u s Lhc example will usc thc ncstcd planning rule in
inferred from A's utterancc. 'Ibus the example will use the nested planning rule in
Section 3.3.
Section 3.3.
(3.1) SIlAW(SURFACE-INI'OIIM(A,S,AW (Havc(A,Salt))))
(3,2)SHAREIYAW(S KNOW(AW (Havc(A.Salt)))))
(3.2)SHARED(A W(S KNOW(AW (Have(A,Salt))))
Iaction-c ffcct]
(3.3) SHAREI)(AW(SW (Havc(A,Salt)))) Iaction-e[fect]
(3.3) SHARED(AW(SW (Havc(A,Salt))))
[mutually bclicvcd dccide in fcrencc]
[mutually believed decide inference]
157
(3.4) SI4AI<EI)(AW(SW (Pass(S,A,Salt))))
(3.4) SHARED(AW(SW (Pass(S,A,Salt
[mutualiy bclicvcd.ncstcd planning using cfkct-action rule]
[mutually believed,nested planning using effect-action rule]
(3.5) SHAI<EI)(AW (Pass(S.A,Salt))}
(3.5) SHARED(AW (Pass(S.i\.Salt}
[mutually bclicvcd want-action]
[mutually believed want-action]
(3.6) SIIAW~I~EQU13S'l'(A,S.Pass(SSA,Salt)))
(3.6) SBA W(REQUEST(A,S.Pass(S.i\.Salt)
[body-action]
[body-action]
In this cxamplc. S rccognizcs that A intcnds S to acccpt thc goal of A having the
In this example, S recognizes that /\ intends S to accept the goal of /\ having the
salt (infercncc from 3.2 to 3.3), and then A intends S to plan to achicvc thc goal by
salt (inference from 3.2 to 3.3), and then A intends S to plan to achieve the goal by
passing Lhc salt (3.41.
passing the salt (3.4).
Examplc 1V: 'I'wo Intc~prctationsof "Do You Know the Secrefl"
Example IV: Two Interpretations of "Do You Know the Secrefl"
'fo crnpliasizc thc distinction betwccn thc levcls of plan infcrcncing. Iet us
To emph:lsize the distinction between the levels of plan inferencing, let us
considcr two intcrprcta~ionsof l l ~ cqucstion "110 you know thc sccrct?'ln one. it
consider two interpretations of the question "1)0 you know the secret?" In one. it
is assumcd to bc intcndcd ;it facc valuc--1inwcvcr thc hcarcr recognizes a
is assumed to be intended at face value--howeveT the hearer recognizes a
rnutivating goal that thc spcakcr rcally wants to know &hcsccrct, but docs not want
motivating gaul thnt the speaker really wants to know the secret, but docs not want
to ask. In thc sccond, ~ h hcarcr
c
intcrprcts it as an indirect rcqucst that hc tell the
to ask. In the second, the hearer interprets it as an indirect request that he tell the
sccrct, As wc shall scc, both plans involvc the sanlc infcrcnccs, only thc Icvcl.at
secret. As we shall sec, both plans involve the s:lnle inferences, only the leveLat
which tl~cyarc madc differs. 'l'hus this example has the samc strucalrc as dcpicted
which they arc made differs. Thus this example has the same structure as depicted
in E'igurc 3 abovc. Iktails on rcprcscnting a concept such as "the sccrct" are
in Figure 3 above. Detuils on representing a concept such as "the secret" are
ignorcd as thcy are not rclcvant to thc example.
ignored as they are not relevant to the example.
The Literal Interprefation
(4.1) SBAW(SURFACE-I~F.QUF,Sl1(A,S,INF0RMIF.
(4.1) Slll\W(SURFACF.-REQUEST(/\,S,INfORMIF.
(S.A,S KNOWIZEF "thc sccrct")))
(S,I\,S
KNOWREF "the secret")))
(4.2)
SliAl<l~l~(AW(lNFORMlF
(4.2) SHARED(I\W(INFORMIF
(S.A.S KNOWltEF "thc sccrct")))
[action-effect]
KNOWREF "the secret")
[action-effect]
(S.I\.S
(4.3)
SIIAW(I<EQU~S'I'(A,S.INFOIiMIF
(4.3) SBAW(REQUEST(
/\,S.lNFORMIF
(S.A,S KNOWIZEF
"thc sccrct")))
[body-action]
(5.1\,5 KNOWREF "the secret")
[body-action]
(4.4) SIlAW(A KNOWIF (S KNOWREF "thc sccrcl"))
(4.4) SBA W(I\ KNOWn: (S KNOWREF "the secret"
[action-cffect]
[action-effect]
(4.5) SIIAW(S KNOWIZEF "the sccrct")
[know-positive]
(4.5) SBAW(S KNOWREF "the secret")
[know-positive]
(1.6) SI)AW(INI;ORMlZEF(S,A,"thc sccret"))
(4.6) SBAW(lNFORMREF(S,I\. "the secret" [precondition-action]
[precondition-action]
(4.7) SIIA W(A K NOWREF "thc sccret")
[action-cffcct]
(4.7) SBA W(/\ KNOWREF "the secret")
[action-effect]
158
Allen
Note that thc samc chain of infcrcnccs arc donc cxccpt for the context. Stcp (4.4)
Note that the same chain of inferences are done except for the context. Step (4.4)
cbrrcsponds to (5.3), (4.5) to (5.4), (4.6) to (5.5). In stcp (5.6) thc indircct rcqucst is
corresponds to (5.3), (4.5) to (5.4), (4.6) to (5.5). In step (5.6) the indirect request is
idcntificd, ~ h c r c a sthc litcral intcrprctation was idcntificd in (4.3). Stcps (4.7) and
identified. 'Nhereas the literal interpretation was identified in E4.3). Steps (4.7) and
(5.7) arc idcnlical. Hcncc S rcachcs thc slime conclusion in both cascs. However,
(5.7) (Ire identical. Hence S reaches the same conclusion in both cases. However,
if S couId not tcll A thc sccrct, in interprctation I hc could simply answcr "yes,"
if S could not tell A the secret, in interpretation I he could simply answer "yes,"
while in intcrprctation I I he would havc to supply an excuse for not complying
while in interpretation II he would have to supply an excuse for not complying
with A's rcquest.
with A's request.
Thcre arc a few factors that affcct which intcrprctation would be choscn by the
There arc a few factors that affecl which interpretation would be chosen by the
system. For instance, if S believes that A bclicvcs that S knows thc sccrct. thcn the
system. For instance, if S believes that A believes that S knows the secret, then the
lircral intcrprctation is unlikcly for it cnails that A has a goal (4.5) tllat A bdieves
literal interpretation is unlikely for it entails that A has a goal (4.5) that A believes
already holds (scc heuristic HZ). This fact would also favor the indircct
already holds (see heuristic H2). This fact would alsu favor the indirect
intcrprctation by eliminating an infcrcncc from step (5.3) to
interprctation by eliminating an inference from step (5.3) to
159
snAWry)
SnAW(Y) :)i
3isnAW(X);
SIMW(X): and
SnAW(,X):)i SnAW('Y).
'l11cTe
'Ihcrc is also general
gcncral knowledge about the motivations of other
othcr agents.
agents. This
infcr new goals of agents.
agcnts. IInn particular,
knowlcdgc can be uscd
knowledge
used to infer
particular. if an agent
bclicvcd
X holds,
believed condition X
holds. and that condition X
X is undesirable, then the agent
notX, Thus wc
havc a corresponding
corrcsponding plan infcrcncc
may start to w.mt
want notX.
m:lY
we have
inference stating that
if an agent
agcnt to be aware of some
solnc undesirable
undcsirablc condition, then
agcnt wants another agent
the agent probably wants thc
the agent
agcnt to eliminate that condition. IThis
h i s is
summarized as:
UndeSirability
Undesirability Condition
SIlAW(Sn(X)):)i
s n A w ( s n ( x ) ) >i SnAW(SW(,X))
snnw(sw(7x))
where.S
where
S' belieles
beliel'es A
A beliel'es
beliel'es X is undesirable.
These
plans, and,
'Ihcsc new rules dramatically increase
increasc the search space
spacc of possible plans,
an4
as yet, have not been
bccn implemented
implcmcntcd in a general
gcncral manner. However, from a
theoretical
view, they allow onc
investigate the adequacy
dicorctical point of view.
one to invcstigatc
adcquacy of the
proposed tha)ry
theory of speech
them, a wide rangc
range of complex
proposcd
spccch aCts.
acts. Using thcm,
complcx language
usc
In particular,
particular. consider the classic
use can be
bc explained. In
classic example
(1)
(1) "It's cold in here"
issued
the king is cold
issucd by a king (I\)
(A) to his servant
scrvant (S). From (I),
(1). S may infer
infcr that thc
and, since bcing
being cold is undcsirable,
undesirable, tliat
tl,at thc
the king docs
be cold. Since,
does not want to bc
Since.
Allen
160
in this sctting, thc coldncss is bcing causcd by an open window, the scrvant may
in this selting. the coldness is being caused by an open window, the servant may
close the window. If thc king intcndcd this Iinc of infcrcnce to occur, and intended
close the window. If the king intended this linc ofinfercnce to occur, and intcnded
thc scrvant to rccognizc this intention, uttcrancc (1) counts as a rcqucst to close
the scrvant to recognize this intention. utterance (1) counts as a request to close
thc window. 'I'hus, assuming thc following gcncral world knowledge:
the window. Thus. assuming the following general world knowledge:
( I ) Cold(room) A 1N(agcntroom) 3 Cold(agent) .
,
(2) Undcsirablc (Cold(agcnt))
(2) Undesirable
(Cold(agent
(3) Opcn(window)
A Part-Of(window,room) 3 Cold(room)
(3) Opcn(window) 1\ Part-Oflwindow,room)::J Cold(room)
2.5.6 Discussion
2.5.6 Discussion
Thc above thcory of spccch acts is intcrcsting in that it m a t s the direct and
The above theory of speech acts is interesting in that it treats the direct and
indircct forms unifornily and allows an utterancc to bc morc than onc speech act
indirect forms unifomlly and allows an ulterance to be more than one speech act
simultaneously. In addition. thc rangc of indircct spccch acts accounted for is
simultaneously. In addition. the range of indirect speech acts accounted for is
considcrahly grcatcr than with nlhcr approachcs. such as [Gordon and lakoff
considerably greater than with other approaches, such as [Gordon and Lakoff
19751 and [Orawn 1980)). In cases whcrc this work and h e othcrs overlap, this
1975] and [Bmwn 1980]). In cases where this work and the others overlap, this
approach is Inore sclcctivc as it heavily uscs thc context in which thc utterance is
approach is more selective as it heavily uses the context in which the ulteranee is
spoken, and dc-cmphasizcs Lhc actual form of rhc uttcrancc. 'lhis, of course, is not
spoken, and dc-emphasizes the actual form of the utterance. 'lhis, of course, is not
donc without somc loss. 1:or instance, thc currcnt thcory cannot distinguish the
done without some loss. For instance, the current theory cannot distinguish the
sub~lcdiffcrcnccs bctwccn
SubLIc differences between
"Can you opcn thc door?" and
"Can you open the door?" and
"Arc you ablc to opcn thc door?".
"Arc you able to open the door?".
161
former sentence
sentence appears
appcars much more
morc likely to be a request to open the
The fonner
door than the latter.
lattcr. Disregarding
Ilisrcgarding such subtleties,
subtlcties, a large range
rangc of indirect
indircct uses can
be explained.
cxplaincd. For example,
examplc. in thc
smtion domain,
domain, the following
following forms
forms have
the train sk1tion
bccn rccognircd
intcrprctations:
been
recognized in their indircct
indirect interpretations:
..."
;~ddition,more
morc unusual analyses
annlyscs of sentences
scntcnccs have bccn
1%
In addition,
been analyzed by hand. For
instance, thc
hcrc cxplain
instance,
the mechanisms here
explain how Uic
the sentence
reported;
reported:
(2)
(2) an inform that John wants you to leave;
lcave: and
(3) a rcqucst
request that you lcavc
leave (giving
request).
(giving a reason for the request).
[Allcn 1979]
19791and [Perrault
[Pcrrault and Allcn
19801).
This analysis
analysis can bc
be found in [Allen
Allen 1980]).
Allen
Allen
162
162
163
(5.1) SURFACE,liF.QUES'I'(A,S,INF;ORMREF(S,A,x)
(5.1) SURFACE.REQUEST(A,S,INFORMREF(S,A,x)
whcrc PliOPElilY(train1.x) A TO-PROPER'I'Y(train1, Windsor)
where PROPERTY(trainl,x) A TO-PROPERTY(trainl. Windsor)
(5.2) SURFACE.REQUEST(A,S,INFORMIF
(5.2) SURFACE.REQUEST(A,S,INFORMIF
(S,A,Pl<OPII'SYinvolving lminl))
(S,A,PROPERTY
involving train I
whcrc '1'0-I'liOI'ER'I'Y(train1,Windsor)
where '1'0- PROPFR'I'Y(train I,Windsor)
whcrc PKOPERTY(train1,x)
where PROPERTY(trainl,x)
'I'o identify thc actt~alprcdicatc indicatcd by thc prcdicatc pattern PROPERTY,
To identify the actual predicate indicated by the predicate pattern I'ROPERTY,
thc 1%0A12Dcxpcctarion is inspcctcd for matches.
the BOA R0 expectation is inspected for matches.
Thcre arc two relevant properrics of uains, thc IIEPAR'T.1'IME and the
There arc two relevant properlies of trains, the DEPART.TIME and the
I)EPAR'l'.LOC. Assuming that S bclicvcs that A knows ncither of the values for
DEPART.LOC. Assuming that S believes that A knows neither of the values for
thcsc rclations, both can bc considcrcd obstacles and bc used to form a rcsponse
these rclations, both can be considered obstacles and be used to fonn a response
corrcsponding to
corresponding to
"It lcavcs at 3:15 from gate 7."
"It leaves at 3:15 from gate 7."
In another setting, S's rcspansc to thc sarnc fragmcnt might bc quite diffcrcnt.
In another setting. S's response to the same fragment might be quite different.
If thc train station had only nnc platform, hc would only rcspond with the
If the train station had only one platfonn. hc would only respond with the
dcparturc timc bccausc hc would bclicvc that A knows thc location already. To be
departure time because he would believe that A knows the location already. To be
complcdy diffcrcnt, if S wcrc thc tickct agcnt hc would intcrprct rhc fragmcnt as
completely different, if S were the ticket agent he would interpret the fragment as
a rcqucst for a tickct (sincc this is what S cxpccts, i.c. what S bclievcs that A
a request for a ticket (since this is what S expects, I.e. what S believes that A
Allen
164
Ihis approacli covcrs a quitc diffcrcnt rangc of scntcncc fragmcnls than any
This approach covers a quite different range of sentence fragments than any
othcr mcthod dcscribcd in thc litcraturc. 'I'hc most common mctliod, which could
other method described in the literature. The most common method, which could
bc callcd thc "scmantic appmacl~." accepts frag~ncntsin thc form of fill1 syntactic
be called the "semantic approach," accept~ fragments in the form of full syntactic
units, such as noun phriacs, and uscs thc fragrncnts to build a partial "scrnantic"
units. such as noun phrases, and uses the fragments to build a pilrtial "semantic"
rcprcscntation that is then tnatclicd into thc rcprcscntation of thc prcvious
representation that is then matched into the representation of the previous
uttcrancc [Cirosz 19771; Ilturton 19761; [Hcndrix 19771. If this m;itch is successful,
utterance [Grosz 1977]: [Burton 1976]: [Hendrix 1977]. If this match is successful,
h e rcprcscntation of thc utlcrancc is constructcd our of thc prcvious utterance's
the representation of the utterance is constructed out of the previous utterance's
structure with tlic newly spccificd parts replacing thc parts di,rt tllcy matched.
structure with the newly specified parts replacing the parts that they matched.
'Ihis mctl~odis lirnitcd tu thusc frag~ncntsLhat dcpcnd on thc structure of the
'Ibis method is limited to those fragments that depend on the structure of the
prcvious uttcrancc for tlicir intcrprctation. As shown in thc train dialogues, there
previous utterance for their interpretation. As shown in the train dialogues, there
are many fragmcnts uscd whcrc this is not thc case.
are many fragments used where this is not the case.
Our approach is suitcd for cascs whcrc thc mcrc mcntion of a conccpt or
Our approach is suited for cases where the mere mention of a concept or
phrasc is suggcstivc cnough to convcy a tllought/wish. 'Ihcsc inshnccs typically
phrase is suggestive enough to convey a thought/wish. 'Ibese instances typically
I~avclittle syntactic rcl~ltionto prcv~ousuttcnnccs, and in fact can wcur when
have little syntactic relation to previous utterances, and in fact can occur when
thcre is no prcvious uttcrancc. I n many ways. the matching tccllniqiics arc similar
there is no previous utterance. I n many ways. the matching techniques arc similar
t i thc "scmantic approach," but thc goats arc very diffcrcnt. "lhc goal of thc
to the "semantic il pp roach," but the goals are very different. 'Tne goal of the
semantic approach is to find a aructural similarity with thc prcvious uttcrance.
semantic approach is to find a structuntl similarity with the previous utterance.
I h c goal of this work is to identify thc plan and goals of the spcakcr. A
lbe goal of this work is to idelltify the plan and goals of the speaker. A
syntactically complctc irttcrancc is ncvcr considcrcd or constn~ctcdfor it has no
syntactically complete utterance is never considered or constnlcted for it has no
cffcct on thc understanding of h e uttcrance.
effect on the understanding of the utterance.
2.7 Conclusions
2.7 Conclusions
165
166
Allen
1
\
far, for our cmphasis has bcen on demonstrating thc uscfulncss
of such notions in
far, for our emphasis has been on demonstrating the usefulness of such notions in
a model of language. Now that wc havc a bertcr idea of how thcsc tools can be
a model of language. Now that we have a better idea of how these tools can be
used, it is time to rcturn to them and attempt a bcttcr formulation.
used, it is time to rcturn to thcm and attempt a bcttcr formulation.
Acknowlcdgments
Ihis paper is the result of multiplc papcrs and drafts writtcn over thc last two
This paper is the result of multiple papers and drafts written ovcr the last two
years. It rcprescrits my final attempt to clean up and prcscnt thc work from my
years. It represents my final altcmpt to clean up and present the work from my
thcsis. Much of thc original work was donc in conjunction with Ray Pcrrault and
thesis. Much of the original work was done in conjunction with Ray Perrault and
Philip Cohcn whilc at tl~cU ~ i ~ c r s iof
t y 'Toronto. 'l'hcy also havc providcd helpful
Philip Cohen white at the University of Toronto. They also have provided helpful
commctlls on many drafts in the last fcw years. I would also like to thank Jcrry
comments on many drafts in the last few years. I would also like to thank Jerry
Kaplan, 1'3avid McDonald, Candy Sidncr. and I3onnic Wcbbcr for their detailed
Kaplan, David McDonald. Candy Sidncr, and Bonnie Webbcr for their detailed
reading and comlncnting on thc final draft. All thcsc people have greatly
reading and commenting on the final draft. All these people havc greatly
irnprovcd Lhc quality of this dwurnent.
improvcd the quality of this document
CHAPTER 3
Cooperative Responses From a Portable Natural
Language Database Query System
S. Jerrold Kaplan
J.l Introduction
3.1
cssential aspect
aspcct of
of human conversation. Speakers cooperate for
Coopcration is an essential
Cooperation
variety of
of purposes: to perform tasks, solve problems, communicate needed
a varicty
ctc. Indeed, for a convcrsation
conversation to occur at all implies some minimal
information, etc.
of cooperation. One way in which human convcrsants
conversants cooperate is by
level of
observing
postulates, etc.,
observing a variety of rules, conventions,
conventions, postulates,
etc., that allow the
communication of
intentions and beliefs in addition to the literal meaning of their
ofintentions
promot~s smooth and effective communication. Consequently,
utterances. This promotcs
syntax and semantics
semantics alone are not adequate to characterize a cooperative
knowledge of the conventions of use
usc and expectations of a native
conversation:
conversation: a knowlcdgc
speaker is required. The
falls under the rubric
I h c study of these conversational
conversational effects falls
of
nra9:rnarjcs: those aspects oflinguistic
ofpraematics:
of linguistic communication that arise from the fact of
utterance or the context or
of use.
Considering non-human conversational
(NL)
conversational partners, Natural Language (NL)
nata
Dam Base (DB)
(DD) query systems
systems sometimes
sometimes engage
engage in behavior that would be
regarded
pamer. Although this
regarded as uncooperative or inappropriate in a human partner.
behavior is sometimes
sometimes due to a lack of world knowledge,
knowledge, much of it can be
attributed to a failure
failure to deal in a systematic
systematic way with pragmatic issues.
issues. Consider
the following
following exchange
exchange with a hypothetical NL DB query system:
User:
Which students
students got a grade
grade of Fin
F in CS105
CS105 in
User: Which
Spring 19807
1980?
System:
System: Nil. [the
[Lhc empty set]
set]
User:
CSlO5 in Spring
Spring 19807
19801
anyone fail
fail CS105
User: Did anyone
System:
System: No.
User:
How many
many people passed
passed CS105
CSlOS in Spring
Spring 19807
1980?
User: How
System:
System: Zero.
Zero.
User:
Was CSI05
CSlO5 given
given in
in Spring
Spring 19807
1980?
Uscr: Was
System:
Systcm: No.
No.
Kaplan
Kaplan
168
168
A cooperativc systcm should bc ablc to dctcct that the initial qucry in the
A cooperative system should be able to detect that the initial query in the
dialog incorrectly prcsumcd that CSlO5 was offcrcd in Spring 1980. and respond
dialog incorrectly presumed that CSI05 was offered in Spring 1980, and respond
appropriately. ?his ability is csscntial to a NL systcm that will function in a
appropriately. This ability is essential to a NL system that will function in a
practical cnvjronmcnt, bccause thc fact that NI. is uscd in thc intcraction will
practical environment, because the fact that Nt is llsed in the interaction wilt
imply to the uscrs Lhat thc nonnal coopcrativc convcntions follawcd in a human
imply to the users that the nonnal cooperative conventions followed in a human
dialog will bc obscrvcd by thc machinc. While each rcsponsc in thc dialog is a
dialog will be observed by the machine. While each response in the dialog is a
corrcct, dircct answcr to thc corresponding qucstion, the overall cffcct is
correct, direct answer to the corresponding question, the overall effect is
uncoopcrativc - thc system appears to "stoncwa~l".~
uncooperative - the system appears to "stonewall".1
A similar conversation with a coopcrative human rcspondcnt might have gone
A similar conversation with a cooperative human respondent might have gone
as follows:
as follows:
Q:
R:
K:
A coopcrative human spcakcr can infcr from the (pragmatic) fact that the question
A cooperative human speaker can infer from the (pragmatic) fact that the question
was poscd (and an assumption h a t the qucstioncr believes the qucstion to be
was posed (and an assumption mat the questioner believes the question to be
appropriate in the context) that the qucstioncr docs not know that CS105 was not
appropriate in the context) that the questioner docs not know that CSI05 was not
given in Spring 1980, and so s/he informs the qucstioncr accordingly rather than
given in Spring 1980, and so s/he infonns the questioner accordingly rather than
answering the qucstion directly. Noticc that thc appropriate response h not a
answering the question directly. Notice that the appropriate response is not a
direct answer to thc question at all, but rather is an indircct response correcting
direct answer to the question at all, but rather is an indirect response correcting
the questioner's rnisimpression about the domain of discourse.
the questioner's misimpression about the domain of discourse.
NL DB qucry systems that arc capable only of dircct answers to questions will
NL DB query systems that arc capable only of direct answers to questions will
necessarily give inappropriate or meaningless responses in this and similar
necessarily give inappropriate or meaningless responses in this and similar
contexts. Surprisingly, the conditions under which such indirect responses are
contexts. Surprisingly, the conditions under which such indirect responses are
considcred inorc appropriate than dircct answers can bc formally defincd, and a
considered more appropriate than direct answers can be formally defined, and a
limitcd but non-trivial class of appropriate indircct responscs can be incorporated
limited but nowtrivial class of appropriale indirect responses can be incorporated
into a computational question answering system using only a lexicon and h e DB
into a computational question answering system using only a lexicon and the DB
itself as sourccs of domain specific knowledge. This is possible in pan because
itself as sources of domain specific knowledge. This is possible in part because
speakers naturally encode directly in the lexical and syntactic structure of their
speak.ers naturally encode directly in the lexical and syntactic structure of their
uttcrances a great dcaI of information about their intentions and beliefs beyond
utterances a great deal of information about their Intentions and beliefs beyond
thc literal content of their utteranc~s.~
the literal content of their utterances.2
1. Stoncwalling is a term used for unux)pcrative (and oken misleading) yet technically mrreet
1. Stonewalling
term used 11for
(and often
misleading)
yet technically
correct
responsesistoa quc<tions.
waruncooperative
popularized during
the Senate
Watergate
IIcarings to
describe the
It was
popularized
during the
Watergate
to describe
the Senator
responsesbehavior
to quc.~tions.
of scveral
While
llouse witnesses
F:orSenate
an example.
see Hearings
the interchange
between
of several
!louse on
witnes!iCs
anYork
example.
the interchange between Senator
behaviorErvin
and I I.While
R. I laldcman
P.586 ofFor
[New
Timessee
19731.
on P.586
of [New
York
Times 19731_
Ervin and2.I1.R.
Haldeman
No daim
is made
hcrc that
all such
inrormation
is encoded in this way, but nlher that substantive
2. No claim
is made here
such information
is encoded in this way. but rather that substantive
conversational
cumthat
are all
recoverable
at this level.
conversational cues are recoverable at this level.
169
This chaptcr explorcs scvcral aspccts of thc pragmatics of coopcrative ql~cstion
This chaptcr cxplorcs scveral aspccts of the pragmatics of cooperative question
answcring, in thc computational contcxt of a NI- DB qucry system. It is drawn
answering, in the computational context of a NL OIl query system. It is drawn
from research originally reported in [Kaplan 19791. l'hc major conuib;ltion of this
from research originally reported in [Kaplan 1979]. The major contribJtion of this
work is to dcmonstratc that:
work is to demonstrate that:
1) Coopcrativc direct and indirect rcsponscs to a habitable class
1) Cooperative
direct and
indirect responses
a habitable
of NI, qucstions
conccrncd
with data to
retricval
can bcclass
produced
ofNL questions
concerned
with
data
retrieval
can
be
produced
in a practical fashion riom a reasonably portablc
NL Dl3 qucry
in a practical
system.fashion from a reasonably portable NL DB query
system.
2) The domain spccific knowlcdge nccdcd to interpret and
2) Therespond
domaincoopcrativcly
specific knowledge
to N1,
interpret
and can be
to a classneeded
of simple
qucstions
respondderived
cooperatively
to
a
class
of
simple
NL
questions
can
be
if
from thc information alrcady present in a DB systcm,
derivedaugmcntcd
from the information
already
present
in a DB
if used
by a suitably
encodcd
lexicon.
'[besystem,
infcrcnccs
augmented
by a suitably
encoded
lexicon. can
'[bebe
inferences
to produce
coopcrative
responscs
drivcn byused
the form
to produce
cooperative
responses
can
be
driven
by qucstion.
the fonn This
chosen by the user to cxpress his or her
chosen decomposition
by the user of
to knowlcdge
express hisresults
or her
in aquestion.
portable (toThis
new DBs)
decomposition
of knowledge results in a portable (to new DBs)
qucry system.
query system.
--
Kaplan
Kaplan
170
170
Casual users of NL computer systems are typically inexpert not only with regard
Casual users of NL computer systems are typically inexpert not only with regard
to thc technical details of the underlying programs, but often with regard to the
to tile technical details of the underlying programs, but often with regard to the
structure and/or content of the domain of discourse. Consequently, NL systems
structure and/or content of tile domain of discourse. Consequently, NL systems
must be dcsigned to rcspond appropriately when they can dctcct a misconception
must be designed to respond appropriately when tIley can detect a misconception
on the part of the uscr. Scvcral conventions exist in cooperative conversation that
on the part of tile user. Several conventions exist in cooperative conversation that
allow speakers to encodc their intentions and beliefs about the domain into their
allow speakers to encode their intentions and beliefs about the domain into tIleir
utterances. Qucstions 1A-IC below illustrate the encoding of goals and intentions
utterances. Questions 1A-IC below illustrate the encoding of goals and intentions
by the different responses that they will reasonably admit.
by the different responses that they will reasonably admit.
1A: Did John borrow my coffee cup?
IA: Did
cup? my coffcc cup?
1B;John
Wasborrow
it Johnmy
thatcoffee
borrowcd
Ill; Was
Johnitthat
IC:it Was
my borrowed
coffcc cupmy
h acoffee
t Johncup?
borrowed?
IC: Was it my coffee cup that John borrowed?
171
I D: No, it was Bill.
lD: No,
was Dill.
I E:it No,
it was your sugar.
]E: No, it was your sugar.
Supcrficially, all three questions appcar to convey thc same request for
Superficially, all three questions appear to convey the same request for
information. A closcr cxarnination revcals that although l D and 1E are both
infonnation. A closer examination reveals that although ID and IE are both
appropriate rcsponscs to 112, 113 favors 1D while 1C favors 1E. 113 indicates hat
appropriate responses to lA, lB favors 10 while Ie favors IE. 113 indicates that
the questioner is intercstcd in who borrowed the coffee cup, while 1C indicates
the questioner is interested in who borrowed the coffee cup, while Ie indicates
that thc qucstioncr is intcrcstcd in what John borr0wed.l
that the questioner is interested in what John borrowed.1
The form of the qucsiions (how information is requested) conveys useFu1
The fonn of the questions (how information is rcquested) conveys useful
information. as well as contcnt (what information is bcing requested). Such
infonnation. as wcll as content (what information is being requested). Such
pragmatic cues can bc rccovcred during thc parsing and translation process to
pragmatic cues can be rccovered during the parsing and translation process to
significantly enhance thc pcrformancc of a NL systcm.
significantly enhance the performance of a NL system.
While a thcory of syntax and semantics may bc sufficient for the analysis of
While a theory of syntax and semantics may be sufficient for the analysis of
sentences in the abstract, the role of scntenccs in discourse (where they are
sentences in the abstract, the role of scntences in discourse (where they are
realized as utterances) requires furrhcr knowledge. ( ' h e study of speech acts in
realized as uttcrances) rcquires furthcr knowledge. (The study of speech acts in
Philosophy of Ianguagc deals with h i s distinction. For instance, scc IScarle 19693
Philosophy of Language deals with this distinction. For instance, see [Searle 1969}
and [Cole and Morgan 19751.) To appropriately process NL utterances it is
and [Cole and Morgan 1975].) To appropriately process NL utterances it is
necessary to consider what Austin called thcir pcrlocutionary force [Austin 19621,
necessary to consider what Austin called their perlocutionary force [Austin 1962],
or the effect on the hearcr of an ut~erance.~
or the effect on the hearer of an utterance. 2
Though this tcrm is often used to refer to thc emotional reactions of the
Though this tenn is often used to refer to the emotional reactions of the
hearer, it could be construed to include the inferences drawn by the hearer
hearer, it could be construed to include the inferences drawn by the hearer
beyond the literal propositional content of thc utterance, particularly regarding
beyond the literal propositional content of the utterance, particularly regarding
the state of mind of the speaker. Producing cooperative rcsponscs from a NL
the state of mind of the speaker. Producing cooperative responses from a NL
query systcm requires at least a partial thcory of language use. Aspects of such a
query system requires at least a partial theory of language use. Aspects of such a
thcory will be presented here: specifically, the gencration of indirect responses to
theory will be presented here: specifically, the generation of indirect responses to
"loadcd" questions. The study of Ianguagc use in a scning where computational
"loaded" questions. The study of language use in a setting where computational
effcctivcness is important could be called com~utationalragm ma tic^.
effectiveness is important could be called computational pragmatics,
NL questions allow (in fact, often require) a cooperative respondent to address
NL questions allow (in facl, often require) a cooperative respondent to address
a questioner's intentions and beliefs beyond a literal, direct response. To be
a questioner's intentions and beliefs beyond a literal, direct response. To be
effective, NL computer systems must do the same. One problem, then, is to
effective, NL computer systems must do the same, One problem, then, is to
1. Sentences 1B and 1C are it-clefts. They are used in discourse to focus certain aspeas of fie domain
1. Sentences
IB and Ie are it-clefts. They are used in discourse to focus certain aspects of the domain
via fronting.
via fronting.
2 Austin's concept of perlocutionary force deals with the intentional or unintentional effects of an
2 Austin's
concept
deals with theofintentional
unintentional
effects
of an from
utterance
onof
theperlocutionary
hearer, that is.force
the conscquences
hearing anorurterance.
This is
distinguished
utteranceh on
the hearer,
thaI
is, locutionary
the wnscquences
of hearing
anofutterance.
This is distinguished
e rorce
of saying
(the
force) and
the force
doing somchinp
in saying (thefrom
illoculionary
the forceCorcc).
of saying
locutionary
force of doing
something
in saying
(the iIloculionary
For(the
cxample,
in the force)
mntexland
of athe
submerged
submarine
attempting
to evade
an encmy destroyer
force). For
in thea woteH
ofa submerged
submarine
attempting
to evade
enemy
by example,
being sileng
crew member
might yell
"If we don't
gel out
of herean we'll
alldestroyer
be killcd!" This
by beingexclamalion
silent, a crew
might
yell with
"If we
get outforce.
of here
all be killed!"
maymember
break the
silence
iu don't
locutinnary
issuewe'll
a warning
with iu This
illoculionary
ex clamalion
the silence
force,
issue a warning
its illoculionary
force.may
andbreak
frigh~en
the restwith
of ~itsh clocutionary
ercw with iu
pcrloculionary
force.with
Thesc
concepts correspond
force, and
frighten
rest the
of the
crew of
with
perlocutionary
force. These onconcepts
correspond
roughly
lo h the
e event.
intention
theits
speaker,
and the cansequences
the hearer,
mpcdvely.
roughly to the event, the intention of the speaker. and the consequences on the hearer. respectively.
Kaplan
Kaplan
172
172
provide practical computational tools which will dctcrminc both when an indircc~
provide practical computational tools which will detennine both when an indirec~
rcsponsc is rcquircd, and what that rcsponsc should be, without rcquiring that
response is required, and what that response should be, without requiring that
significant amounts of domain dcpendcnt world knowlcdgc be cncodcd in special
significant amounts of domain dependent world knowledge be encoded in special
formalisms. This work will takc thc position that distinguishing l a n ~ u mdriven
fonnalisms. This work will take the position that distinguishing langu;Jgc driven
infcrcnccs from domain drivcn infcrcnccs providcs a framework for a solution to
inferences from dom<lin driven inferences provides a framework for a solution to
this problcm in the TI13 qucry domain.'
this problem in the DB query domain. 1
3.2.1 What is a loadcd question?
A loadcd qucstion is onc that indicatcs that the qucstioncr prcsumcs something to
A loaded question is one that indicates that the questioner presumes something to
bc true about Ihc domain of discoursc that is actually false. Question 2A presumes
be true about the domain of discourse that is actually false. Question 2A presumes
213. A coopcrativc speaker must find 2 B assumable (i-e. not bclicve it to be
false2
2B. A cooperativc speaker must find 213 assumable (i.e. not believe it to be false 2
in ordcr to appropriatcly utter 212 in a cooperative conversation, intcnd it litcrally.
in ordcr to appropriately uttcr 2A in a cooperative conversation, intend it literally,
and expect a correct, direct response.
and expect a correct, direct response.
2A: What day docs John go to his weekly piano lesson?
3A: How many Bloody Marys did Bill down at the banquet?
3A: How
Marys did Dill down at the banquet?
3B: many
Hard Bloody
liquor was
available at the banquet
3B: Hard
3C: liquor
Zero. was available at the banquet
3C: Zero.
That is, from the questioner's viewpoint upon asking a question, more than one
That is, from the questioner's viewpoint upon asking a question, more than one
dircct answer must be possible.
direct answer must be possible.
1. 11 will not bc argued that this distinction is fundamental or even very clear cut, hut rather that it is a
1. It willfruitful
not be argued that this distinction is fundamental or even very clear cut, but rather that it is a
way of viewing this problem.
fruitful way
of viewing
problem.
2. Scc
[Prince this
19781
for a di.wussion of thc dirrercnl uses of "assumed" and "mumable" knowlcdgc.
2. Sec [Prince 197111 ror a discussion of the different uses of "assumed" and "assumable" knowledge.
173
Kaplan
Kaplan
174
115
Maxim of Quantity
Quantity [Gricc
[Gricc 1975])1
19751)~
A failure
responses is inappropriate in aa
failure to produce corrective
corrcctive indirect responscs
cooperative
cooperative conversation.
conversation.
3.2.3
3.2.3 Relcyance
Relevance to database
datnhase queries
queries
Many
capable of
of
NL query systems
systems stonewall,
stonewall, in part because they arc not capablc
Many NL
producing corrective
indirect
responses.
To
some
degree,
this
inability
results
responses.
corrective
from
language. NL qucstions
questions admit
from a yiew
view ofNL
of NL as
as a very high leyd
level fonnal
formal query language.
a wider range
do, and provide cues for sclccting
selecting
than fonnal
formal queries do,
range of responses
responscs tl,an
among
these
responses
that
are
generally
absent
from
fonnal
query
languages
among
generally
formal
[Kaplan
19781. The appropriateness of indirect responses is an important
[Kaplan 1978].
difference
system rhar
that
difference between NL questions and fonnal
formal language queries. AA NL sysretn
isis only
meaningless responses lo
to
only capable ofdirect
of direcl respomes
respurnes will necessarily produce meonit~gless
failed
failed presumprions.
presumptions. Unfortunately, the
foiled presuppositions.
presupposirions, and stonewall
slor~ewallon foiled
UUs are sut1lciently
sut?icienty complex that the uscr
domain of most realistic
realistic 1J1ls
domain
user of a NL query
facility
facility (most likely
likely a naive user)
uscr) will make incorrect presumptions in his or her
queries.
queries.
While
of interest from a
While the definition
definition of presumption given above may be of
linguistic standpoint, it leaves much to be desired for a computational theory.
linguistic
theory.
Although it provides
provides a descriptive
descriptive modd
model of certain aspects of
Although
of conversational
behavior, it does
docs not provide an adequate basis for computing the presumptions
behavior,
presumptions of
of
Ily limiting the domain of
reasonable way. By
aagiven
given question in a reasonable
of application to the
area of data retrieval,
retrieval, where typical
typical questions
qucstions are purely extensional, the linguistic
area
suucture of questions
questions encodes
encodes considerable infonnation
information about the questioner's
structure
questioner's
lhis structure can be exploited to compute a significant class
presumptions. This
presumptions.
dass of
of
responses. A
A technique
presumptions and provide appropriate corrective indirect responses.
for computing such
such responses in cases
cases where questions can be assumcd
for
assumed to be
thrust
of
this
work.
purely
extensional
is
the
main
purely extensional is
clchange)."
exchange)."
Kaplan
Kaplan
176
gonc into dctcrmining just what typc of knowledge is required, and how that
gone into detennining just what type of knowledge is required, and how that
knowlcdgc is to be rcprcscntcd, organized. acccssed, and uscd. Onc practical
knowledge is to be represented, organized, accessed, and used. One practical
problcm with evaluating systcrns that use a spccializcd knowlcdgc rcprcsentation
problem with evaluating systems that use a specialized knowledge representation
is that it is somctimcs difficult to dctcrmine if a particular failure is due to an
is that it is sometimes difficult to detennine if a particular failure is due to an
inadcquacy in thc formalism or simply an insufficient base of knowledge.
inadequacy in the fonnalism or simply an insufficient base of knowledge.
(Systcms that clcariy spccify semantics and infcrcncc rules, such as PIiLIQAl
(Systems that clearly specify semantics and inference rules, such as PHLIQAl
[Ijronncnbcrg rt.
19801. do not encounter this problcm.) In addition, the
[Bronnenberg ct. & 1980], do not encounter this problem.) ]n addition, the
collection and cncoding of thc appropriate knowlcdgc can be a painstaking and
collection and encoding of the appropriate knowledge can be a painstaking and
timc consuming task. Many NI, sysrclns that could be said to use spccialized
time consuming task. Many NL systems that could be said to usc specialized
knowlcdgc rcprcscntations (such as [Lchnert 19771 or Allcn's work described in
knowledge representations (such as [Lehnert 1977] or Allen's work described in
Chapter 2 of this volume) sharc a common approach: thcy dccompose the input
Chapter 2 of this volume) share a common approach: they decompose the input
into a suitable "scrnantic" rcprcsentation, and rely on various dcduction and/or
into a suitable "semantic" representation, and rely on various deduction andlor
reasoning mechanisms to provide thc intclligcnce rcquircd to draw the necessary
reasoning mechanisms to provide the intelligence required to draw the necessary
inferences. Infercnccs made in this way can be called domain2 drivcn inference,
inferences. Inferences made in this way can be called domain 2 driven inferences.
itself.
bccause thcy arc motivated by the domain3
because they arc motivated by the domain 3 itself.
While domain drivcn inferences are surcly essential to an understanding of NL
. While domain driven inferences are surely essential to an understanding ofNL
(and will bc a rcquircd part of any comprchensivc cognitive modcl of human
(and will be a required part of any comprehensive cognitive model of human
intelligence), the lexical and syntactic forms chosen by a speaker directly encode a
intelligence), the lexical and syntactic forms chosen by a speaker directly encode a
grcat deal of information that might otherwise bc derived from domain
great deal of infonnation that might otherwise be derived from domain
knowlcdge. Many of the variations and options available to a speaker in phrasing
knowledge, Many of the variations and options available to a speaker in phrasing
his or hcr utterances, while leaving the literal meaning unchanged, convey
his or her utterances, while leaving the literal meaning unchanged, convey
secondary information about the speakers beliefs and domain. This additional
secondary infonnation about the speakers beliefs and domain, This additional
information can drive the infercncing proccdures to detcct a speaker's
information can drive the inferencing procedures to detect a speaker's
misconceptions and other conversational characteristics (such as focus, as in
misconceptions and other conversational characteristics (such as focus, as in
example 1 above), without the nccd for domain drivcn inference. Inferences
example 1 above), without the need for domain driven inference. Inferences
driven from the particular phrasing of inputs can be called lanpuagg driven
driven from the particular phrasing of inputs can be called language driven
infercnceg.
inferences.
Language driven inferences have several useful properties in a computational
Language driven inferences have several useful properties in a computational
1. For example, to understand the statement "I bought a briercase yesterday, and today the handle
1. For example,
statement
"I boughtJrpically
a briefcase
broke o f f "toitunderstand
is necessarythe
to know
that briefcases
haveyesterday,
handles. and today the handle
broke off."
it
is
necessary
to
know
lhat
briefcases
~ have handles.
2. "Domain" hcre is mean1 10 include gencral world knowledge, knowlcdge about the specific
2. "Domain" here is meant 10 include general world knowledge, knowledge about the specific
contcxl. and inferential rules of a general and/or spccilic nature about h a t knowledge.
context, and
inferential
rules of
a general arc
and/or
~1Jecific nature about that knowledge.
3. Of
course. lhcsc
infcrcnccs
actually
made on the basis o f dcwrintions of the domain (the
3. Of course.
these
inferences
are
actually
made
the basis
ofWhai
descriptions
of the domain
(the
internal meaning rcprmntation) and not theondomain
i~wlC
is to be cvaluared
in such
systems is
internal thc
meaning
representation)
and not the
domain iL\Clf.
is to be evaluated in such systems is
sulicicncy
of that dcscriplion
in representing
h e What
domain.
the su mricncy of lhat description in represcnting the domain.
177
framework. First, being bascd on gcncral knowlcdgc about thc language, they do
framework. First, being based on general knowledge about the language, they do
not rcqiiire an infirsion of knowlcdgc to operatc in differing domains. As a result
not require an infusion of knowledge to operate in differing domains. As a result
thcy tend to bc transportable to ncw domains (ncw 1)13s, in thc casc of NL DB
they tend to be transportable to new domains (new DBs, in the case of NL DB
q~icry systcms). Sccond, thcy do not appcar to he as subject to runaway
query systems). Second, they do not appear to be as subject to runaway
infcrcncing [Roscnschcin 19761, i.e. the infcrcncing is drivcn (and hence
inferencing [Rosenschein 1976], i.e. the inferencing is driven (and hence
controlled) by thc phrasing of thc input Third, thcy can often achicvc rcsults
controlled) by the phrasing of the input TIlird, they can often achieve results
approximating that of domain d r i ~ c ninfcrencc tcchniqucs with substantially less
approximating that of domain driven inference techniques with substantially less
computationa1machinery and execution time.
computational machinery and execution time.
As a simplc cxarnplc, consider the casc of factivc vcrbs. I h c sentcncc "John
As a simple example, consider the case of factive verbs. The sentence "John
doesn't know that the ncatlcs brokc up." carries the infcrcncc that the Bcatles
doesn't know that the Beatles broke up." carries the inference that the Ikatles
broke up. Trcatcd as a domain drivcn infcrencc, this rcsult might typically be
broke up. Treated as a domain driven inference, this result might typically be
achicved as follows, Thc scntcnce could be parscd into a rcprcscntation indicating
achieved as follows. The sentence could be parsed into a representation indicating
John's lack of knowlcdgc of thc Ilcatles' breakup. Either immcdiatcly or at some
John's lack of knowledge of the Bealles' breakup. Either immediately or at some
suitable later time, a procedure might be invoked that cncodcs the knowledge
suitable later time, a procedure might be invoked that encodes the knowledge
"For someone to not know something, that something has to be thc case." The
"For someone to not know something, that something has to be the case." The
inferential procedures can thcn update the knowledge base accordingly. As a
inferenlial procedures can then update the knowledge base accordingly. As a
language drivcn infcrencc. this infcrcnce can be rcgarded as a lexical property, i.e.
language driven inference, this inference can be regarded as a lexical property, i.e.
that factive vcrbs presuppose their complements, and the complement
that factive verbs presuppose their complements, and the complement
immediately asserted, namely, that the Bcatles broke up (as in [Wcischedcl 19751).
immediately asserted, namely, that the Beatles broke up (as in [Weischedc11975]}.
(Notc 'that this process cannot be reasonably said to "understand" the utterance.
(Note 'that this process cannot be reasonably said to "understand" the utterance,
but achieves the same results.) Effcctivcly, certain infcrence rules have been
but achieves the same results.) Effectively, certain inference rules have been
encodcd directly into the lexical and syntactic structure of the language encoded directly into the lexical and syntactic structure of the language facilitating the drawing of the inference without resorting to general reasoning
facilitating the drawing of the inference without resorting to general reasoning
processes.
processes.
The CO-OP system, described below, dcrnonstrates that a language driven
The CO-OP system, described below, demonstrates that a language driven
inference approach to computational systems can to a considerable extent produce
inference approach to computational systems can to a considerable extent produce
appropriate NL behavior in practical domains without the ovcrhead of a dctailed
appropriate NL behavior in practical domains without the overhead of a detailed
and comprchensivc world modcl, other than a standard DB. By limiting the
and comprehensive world model, other than a standard DB. By limiting the
domain of discourse to DI3 queries, the lexical and syntactic structure of the
domain of discourse to DB queries, the lexical and syntactic structure of the
questions encodcs sufficient information about the user's bclicfs that 2 ~ienificant
questions encodes sufficient information about the user's beliefs that ~ significant
class ofprcsum~tionscan be corn~utedon a ~ u r c l vlan~uaggdrivcnb a a .
class of presumptions can ~ compu ted Qll !! ll!!.IDY language Qrllim~.
This section will summarize the dcsign of the CO-OP NL DD query system.
This section will summariz.e the design of the CO-OP NL DB query system.
CO-OP is intcndcd to providc cooperative rcsponscs to simple questions
CO-OP is intended to provide cooperative responses to simple questions
requesting data retricval. It operates with a typical COIIASYL DB. In addition to
requesting data retrieval. It operates witl1 a typical CODASYL DR In addition to
dircct answers, CO-OP is capable of producing a varicty of indirect responses,
direct answers, CO-OP is capable of producing a variety of indirect responses,
including corrective indircct rcsponscs. 'Ihe dcsign of the system is bascd on two
including cOffective indirect responses. The design of the system is based on two
Kaplan
Kaplan
178
h y porhcscs:
hypotheses:
179
response to a query is an N-placc relation rcalizcd by obtaining the refcrcnt ofthe
response to a query is an N-place relation realized by obtaining the referent of the
scts in the DR, and composing thcm according to the binary relations. Each
sets in the DB, and composing them according to the binary relations. Each
composition will have thc cffcct of selecting a subset of the current scts. The
composition will have the effect of selecting a subset of the current sets. The
subsets will contain the elements that survive (participate) in the relation.
subsets will contain the clements that survive (participate) in the relation.
(Actually, the responses are rcalizcd in a rnucn inore efficient fashion - this is
(Actually, the responses arc realized in a muen more efficient fashion . this is
simply a convenient view.)
simply a convenient view.)
As an examplc, consider thc query "Which students got Fs in Linguistics
As an example, consider the query "Which students got Fs in Linguistics
courses?" as diagrammed in Figure I.
courses?" as diagrammed in Figure 1.
(=y-=y")
STUDENTS
COURSES
180
Kaplan
The
'lhc MQL,
MQI., by elcoding
encoding snme
some of the syntactic
syntactic relationships present in the NL
query,
query, can hardly be said to capture the meaning of the question: it is merely aa
convenient
characteristics of
of the
convcnicot representation
rcprescntation formalizing
formalizing certain linguistic cl~aracteristics
query.
representation to generate inferences
' h e procedures
procedures that manipulate this reprcscntation
query. The
arc
these syntactic
are based on observations
observations of a general nature regarding thcse
relationshipS.
relationships. Consequently.
Conscqucntly, these
thcse inferences arc language driven inferences.
While
has its advantages
advantages and disadvantages,
disadvantages,
While the MQL,
MQL.. as implemented
implemcntcd in CO-OP 1x1s
it has
be present in any representation used for
has several
several characteristics
characteristicsthat ought to bc
similar
similar purposes.
First,
reflects closely
closely the surface
surface structure of the input (indeed, it is little
First, it reflects
more than a modilied
facilitates the capturing of
of surface
modified parse Iree),
tree), and so facilitates
syntactic
syntactic fcatures.
features. For example.
example, the representation
rcprescntation of corresponding passives and
actives
organi7..ation of
of the
actives isis not the same. "Ibis
'This has an important bearing the organiration
responses.
responses.
Second,
Second, it provides a level of dcscription
description useful for providing explanations that
a user
of an MQL
ccrtain to understand. This occurs mainly because each part of
uscr isis certain
expression isis labeled with a lexical item or phrase that the user just
just used (with
minor exceptions),
exceptions). Failures at lower levels of the system can always be localized
minor
to
subser of the MQL, and the offending subset can be
to the
the proccssing
processing of some
some subset
explained to the user in his or her own tenns,
terms.
Third,
Third, the MQL representation
reprcscntation of a query is invariant under differing
organirations of the underlying
underlying DB. Many of the organizational
organizations
organizational options available
DD designer
designer have no bearing on
nn either the range of
to a DB
of questions that can be
appropriately
posed
to
the
DB
or
on
the
content
of
the
responses
appropriately
of
responses they affcct
affect only
efficiency of retrieval.
retrieval. Consequendy,
Consequently, the options chosen by the DD
the efficiency
013 designer
ought to be transparent to the user. In CO-OP, the organization
organization of
of the response is
function solely of the MQL, and so is not affected by variations in the
a function
organization of the DB.
organization
structure is roughly proportional to the
Fourth, the complexity of the MQL slrUcture
complexity of the input This occurs mainly because the MQL corresponds closely
complexity
e10sely
to the
the surface
surface syntactic
syntactic slrUcture,
structure.
to
3.3.2 Computing
Computing corrective
corrective indirect responses
3.3.2
The crucial
crucial observation required to produce a reasonable set of
The
of corrcctive
corrective indirect
MOI,
ouery
presumes
non-emutincss
responses
is
that
the
responses is "that
MQL S!!.[Y
the non-emptiness of
of its connected
suhera~hs.Each connected subgraph corresponds to a presumption the user has
subgraphs.
discourse. If thc
made about the domain of discourse,
the uscr
user believed any connected subgraph
to represent
represent an empty
empty response,
response, slhe
s/he would also have to believe that the answer to
181
the entire qucry was cmpty. Hence s/hc would know a priori the answer to the
the entire query was empty. Hence slhe would know a priori the answer to the
qucstian, violating the convcntion that a qi~cstioncrshould leave a rcspondcnt a
question, violating thc convention that a qucstioner should leavc a respondent a
choice of dircct rcsponscs. The user must thercforc find the non-emptiness of each
choice of direct responses. The user must therefore find the non-emptiness of each
connccted subgraph assumable, i.e. not believe it to be crnpty. Consequently,
connected subgraph assumable, i.e. not believe it to be empty. Consequently,
should the initial query return a null rcsponsc, the control structure can chcck the
should the initial query return a null response, the control structure can check the
user's presumptions by passing cach conncctcd subgraph to the interpretive
user's presumptions by passing each connected subgraph to the interpretive
component to check its non-cmptincss. Noticc that cach conncctcd subgraph itself
component to check its non-emptiness. Notice that each connected subgraph itself
constitutes a wcll formed qucry. Should a presumption prove false, an
constitutes a well formed query. Should a presumption prove false, an
appropriate indircct response can be gencratcd, rather than a mcaninglcss or
appropriate indirect response can be generated, rather than a meaningless or
misleading direct rcsponse of "None."
misleading direct response of "None."
For example, in thc qucry of Figure 1, the connectcd subgraphs and their
For example, in the query of Figure I, the connected subgraphs and their
corrcsponding corrcctive indirect rcsponscs arc (the numbers represent the sets
corresponding corrective indirect responses are (the numbers represent the sets
the subgraphs consist of):
the subgraphs consist of):
1)
2)
3)
4)
1,2)
2,3)
3,4)
1,2,3)
2,3,4)
Suppose that there are no linguistics courses in the DB. Rather than presenting
Suppose that there are no linguistics courses in the DB. Rather than presenting
the direcf correct answer of "None.", thc control structure will pass each
the direct, correct answer of "None.", the control structure will pass each
connected subgraph in turn to be executed against the DB. It will discover that no
connected subgraph in turn to be executed against the DB. It will discover that no
linguistics courses exist in the DB, and so will respond with "I don't know of any
linguistics courses exist in the DB, and so will respond with "I don't know of any
linguistics courses." This corrcctive indirect rcsponse (and all responscs generated
linguistics courses." This corrective indirect response (and all responses generated
through this method) will enmil the direct answer, since they will entail the
through this method) will entail the direct answer, since they will entail the
emptiness of the direct rcsponse set
emptiness of the direct response set
Several aspects of this procedure are worthy of note. First, although the
Several aspects of this procedure are worthy of note. First, although the
selection of the response is depcndcnt on knowlcdgc of the domain (as encoded in
selection afthe response is dependent on knowledge of the domain (as encoded in
the DB system not as separate theorems, structures, or programs), &
the DB system not as separate theorems, structures, or programs), ~
computation af & prcsum~tion~ totallv indc~endenlgf domain Soecific
computation Qf lM presumptions is 1Q1ill1y independent ill QQm.ain ~
knowledee. Because these inferences arc driven by the MQL, the procedures that
knowledge. Because these inferences are driven by the MQL. the procedures that
determine the presumptions (by computing subgraphs) require no knowledge of
determine the presumptions (by computing subgraphs) require no knowledge of
the domain. Consequently, producing corrective indirect responscs from another
the domain. Consequently, producing corrective indirect responses from another
DB, or even another DB system, rcquircs no changes to the inferencing
DB, or even another DB system, requires no changes to the inferencing
procedures. Secondly, the mechanism for selecting the indirect response is
procedures. Secondly, the mechanism for selecting the indirect response is
identical to thc procedure for executing a qucry. No additional ~om~utational
identical to the procedure for executing a query. li2 additional computational
Kaplan
182
machincry need be
he invoked
invokcd ill
ln select ~
the appropriate
a~nronriatcindirect response.
resoonsc. Thirdly.
Thirdly,
machinery
co~nputational overhead involved in checking and correcting the users
the computational
prcsumptions is not incurred unless it has been determined
dctcrmincd that an indirect
presumptions
rcquircd. Should the query succeed
succccd initially, no ~enaltv
in
rcsponse may be required.
response
penalty ill
execution
oaid for the ability
abilitv 1Q
lo produce
~roduccthe indirect responses.
rcsoonscs. ]n
In
cxccutian time will be paid
increase in space overhead is a small control program to
addition, the only increase
produce the connected subgraphs. The linguistic
linguistic generation of the iodirect
indirect
rcsponse is essential1y
essentially free -- it is a small additiun
addition to the paraphrase component
response
already used in error detection.
thc most appropriate set of
The MQL also provides a means of selecting the
corrective responses when more than one is applicable. The presumptions of a
corrective
qucstion can be partially ordered according
according to an entailment relationship: the
question
failure of some presumptions
prcsumptions entail the failure
failure of others. For example.
example, if there are
failure
courses, which in turn entails
cntails that there are no
courses, then there are no Fs in courses,
no courses,
prcsumptions of a
Linguistics courses,
courses, etc. It is often the case that several presumptions
Fs in Linguistics
simultaneously, in part because of this partial ordering.
ordcring. In such
question fail simultaneously,
cases,
~ of
least li!i!ing
failh a
cases, the most appropriate response is to correct the kill
presumptions in this ordering.!
ordering?
among the
MQI, this ordering manifests itself as a subgraph relation amoog
. In the MQL,
chccking the
subgraphs of the MQL: some subgraphs are subgraphs of others. By checking
non-emptiness of the subgraphs in a suitable order, the most appropriate response
formulated. Thus,
Thus, in the example, it is possible for the system to produce a
can be formulated.
response
Linguistics
rcsponse such as "1 don't know of any Fs, and I don't know of any Linguistics
courses.", without producing the additional irrelevant facts
mere are no Fs in
facts that there
etc.
courses, no students got Fs, ete.
responses, produced in this fashion,
fashion, are language driven
Corrective indirect responses,
inferences,
infcrences, because they are derived directly from the structure
suucture of the query as
represented by the MQL. If the query were phrased differently, a different set of
presumptions would be computed.
computed. (This is not a drawback,
drawback, as it might seem at
first -- it insures that the response
will
be
in
terms
that
the user
rcsponse
uscr understands, since
the terms used in the explanation are those presented in the original query.) For
these reasons,
possible by a careful choice of
responses, made possiblc
reasons, corrective
corrective indirect responses,
representations and associated algorithms.
algorithms, are produced in a domain transparent
fashion with minimal system overhead using knowledge
knowledge already available in the
DB.
183
184
Kaplan
5Q:
trains leave for N.Y. this evening?
50: How many more uains
evening?
5R:
None. but there are 33 tomorrow morning.
SR: None,
In a computational
computational setting,
setting. some means of detecting the need for a suggestive
suggestive
focus of the current question is required.
indirect response and determining the focus
In human dialog,
dialog, there appears to be a conversational convention that
qucstions to avoid negative or trivial responses. A
speakers should phrase their questions
speakers
trivial response is one that denotes an empty set ("None."),
uivial
("None."), or the result of a
simple
simple predicate, such as COUNT, on an empty set ("Zero.").
("Zero."). By adopting this as
a principle of cooperation in discourse, it is possible for speakers
speakers to communicate
their expectations
expectations along with their questions.
questions. If a respondent is asked a question to
trivial, s/he is justified in assuming
which the response is negative or trivial.
assuming that some
(perhaps rather weak) expectation the speaker had has been violated
violated, and that the
conversation has temporarily
temporarily reached a dead end. It is then appropriate to
produce a suggestive
suggestive indirect response. llbat
h a t many questions carry
cany this expectation
can be illustrated by observing
obsening that a response of "Your assumption
assumption is correct"
correct." is
not a meaningless retort to the following:
Sunday, isn't it?
6: The A&P is open on Sunday,
7: The A&P isn't open on Sunday, is it?
senior?
8: Is John a senior?
9: Is John an underclassman?
underclassman?
A
effect occurs with questions other than yes/no questions.
questions Question
A related effect
10 carries
taken by students that live in
carries an expectation
expectation that some Friday classes
classes are takcn
185
the main quad dormitory - the answer "Nonc." indicates a violation of this
the main quad donnitory - the answer "None." indicates a violation of this
expectation. Similarly, 13 cxpccts that some il-52s have Ioggcd over 1000 hours
expectation. Similarly, 11 expects that some 8-52s have logged over 1000 hours
downtime an answer of "Zcro." indicates a violation.
downtime' an answer of "Zero." indicates a violation.
10: Which Friday classes are takcn by students that live in the
10: Whichmain
Friday
classes
arc taken by students that live in the
quad
dormitory?
main quad donnitory?
11: How many B-52s have logged ovcr 1000 hours downtime?
11: How many B-52s have logged over 1000 hours downtime?
These negative and trivial rcsponscs admit suggcstivc indircct rcsponses. For
These negative and trivial responses admit suggestive indirect responses. For
example, 8 might bc iollowcd with "No, he's a junior."; 10 with "Nonc, but here
example, 8 might be followed with "No, he's a junior."; 10 with "None, but here
arc thc classcs taken by students that live in any dormitory..."; and 11with "Zcro,
are the classes taken by students that live in any donnitory ..."; and 11 with "Zero,
but thcre arc 25 B-52s that have Ioggcd over 900 hours downtime." The
but there arc 25 8-52s that have logged over 900 hours downtime." The
appropriatcness of the various rcsponses dcpcnds on the dctcrmination of the
appropriateness of the various responses depends on the detennination of the
focus of the questions. It should be notcd that it is (perhaps equally) oRen
focus of the questions. It should be noted that it is (perhaps equally) often
appropriate to elaborate on questions with positive responses. In addition, it is
appropriate to elaborate on questions with positive responses. In addition, it is
occasionally ttuc that an elaboration aftcr a negative response appears to be
occasionally true that an elaboration after a negative response appears to be
superfluous, for example in "Have any B-52s logged ovcr 3000 hours downtime?".
superfluous, for example in "Have any B-S2s logged over 1000 hours downtime?".
What is of interest herc is that thc negative responses can serve as a conversational
What is of interest here is that the negative responses can serve as a conversational
cue for suggestive indirect responses.
cue for suggestive indirect responses.
Conscqucntly, the CO-OPcontrol structure attempts to produce a suggestive
Consequently, the CO-OP control structure attempts to produce a suggestive
indircct response when the dircct answcr to a question is negative or trivial. Once
indirect response when the direct answer to a question is negative or trivial. Once
a qucstion has been parsed into the MQL,
a question has been parsed into the MQL,
1) it is uanslatcd to a formal query that will produce a direct
1) it is translated to a fonnal query that will produce a direct
response, and an attempt is made to execute this against the
response, and an attempt is made to execute this against the
DB;
DB:
2) should thc result be an empty set (a null answer), the
2) should the result be an empty set (a null answer), the
prcsumptions of the qucstion are chccked in the hope of
presumptions
ofa the
question
are checked
the hope above:
of
producing
corrective
indirect
response,in
as described
producing a corrective indirect response, as described above;
Kaplan
186
dctcrmination
focus. Unfi~rtunatcly,
difficult and subde
subdc problem.
determination of thc
the focus.
Unfortunately, this is a difficult
question can be regarded as a special
To some
somc degree,
dcgrce, determining d1e
thc focus
focus of a qucstion
case of plan rccogllition.
qucstioncr will
recognition. An understanding of the intentions of a qucstioner
very effective
lead to a vcry
cffcctive strategy
stratcgy for
for determining
dctcrmining focus. (In
(In a sense, a complete
complete
enough undcrstanding
understanding would allow a respondent to ignore a questioner's
questioners
utteranccs
information is rarcly
utterances altogether!) Such information
rarely available to DD
Dil query systems,
available to CO-OP.
CO-OP. Since
Sincc CO-OP does
docs not maintain an
and quite simply is not available
cxplicit user model,
modcl, some
solnc simpler cues
cucs wcrc
focus dctcnnination.
dctcrmination.
explicit
were sought for focus
An alternative
altcrnativc method
mcthod of finding focus is to examine
cxaminc a sequence
scqucnce of questions,
qucstions,
when possiblc.
possible, to explicitly
wllcn
cvplicitly determine
dctcrminc what the likely
likcly focus
fwus is by finding what has
shiftcd and what has remained constant in the immediately
imlnediatcly prcccding
This
shifted
preceding dialog. Thi!
mcthod
locatc the focus
focus of thc
dialogs 2 and
method could correctly locate
the questions prcscntcd
presented in dialogs
above. A simplification of this approach is to compare the MQL rcprescntation
3 above.
representation
qucstion with that of thc
focus can then be
of the cunent
current qoestion
the previous one. The focus
diffcr bctwcen
qucstions.
sclccted from the prcscnrcd
selected
presented sets that differ
between the two questions.
lcavc a number of choices
choices for the focus,
focus, the
Should this tcchnique
technique still leave
resolved further
problem can be rcsolvcd
furthcr through some
somc syntactic
syntactic cues.
cucs. Questions are often
) the focus
unmarked cases
presented as "new" information.
information,
phrased so that (in unmarkcd
cases'l )the
focus is prcscntcd
and consequently
conscqucntly appears late in the question,
qucstion, as far as possiblc
(syntactically)
possible
from
the subject
fr"m thc
subject (This assumes
assumes that there is a tendency for new infonmation
information in
qucstions to be placed toward the end of a sentence).
sentence). For example,
example, the foci in
questions
dialogs
qucstions. (For a system that uses this
dialogs I1 and 3 both occur at the end of the questions.
observation to produce suggestive
suggcstivc indirect
indircct responses, though they do not use this
tenminology, see [Steedman
terminology,
[Steedman and Johnson-Laird 1976].)
19761.) In the MQL, this distance
distance
through the syntactic
syntactic parse tree tcnds
asgraphical
distance from
tends to manifest itself as
graphical distance
the presented set corresponding to the subject noun phrase. This occurs
occurs because
relationship betwecn
between the structure
of the close relationship
suucturc of an MQL expression
cxpression and the
syntactic structure of a parse tree. Relations in thc
the MQL tend to map one-to-one
one-to-one
phrases and elauses
A simple graph search algorithm can
with phrascs
clauscs in the question.
qucstion. A
truee focus.
therefore be used to locate or chose among likely candidates for the m
focus.
results; it
An error in the selection of the focus
focus docs not produce disastrous
disastrous results:
may, however,
however, be an inconvenience
inconvenience for a user to be confronted with unwanted or
irrelevant information.
information.
Experience with the CO-OP implementation has uncovered a more serious
serious
problem with generating
generating suggestive
suggestive indirect
indircct responses. Once dle
the focus
focus has been
chosen, it is important to vary it in a meaningful way. Generating a more general
1. Cases
C a m not overridden by
bv contcx
contextual
1.
tuaI considerations.
187
Kaplan
Kaplan
188
188
without thc associated names. The deeper problem hcrc is that NL qucstions
without the associated names. The deeper problem here is that NL questions
often do not explicitly indicate what infonnation should bc incorporated into the
often do not explicitly indicate what infonnation should be incorporated into the
response; the language is dcsigncd to share this burdcn with the rcspondent.
response; the language is designed to share this burden with the respondent.
These qucstions rcquirc Sun~ortivcIndirect Resnonscs - rcsponscs that provide
These questions require Supportive Indirect Responses - responses that provide
the supporting information ncccssary to intcrprct the answer. (Both REL
the supporting infonnation necessary to interpret the answer. (Both REL
phompson &. al. 19691 and LUNAR [Woods al. 19723 can provide limited
[Thompson et. ill., 1969J and LUNAR [Woods et. al. 1972] can provide limited
rcsponscs of this type.)
responses of this type.)
To insure that the appropriate supporting information is prcscnt, the query
To insure that the appropriate supporting information is present, the query
systcm could dump the cntire contents of the Dl1 in rcsponsc to any question,
system could dump the entire contents of the DB in response to any question,
sincc this would bc likcly to contain all information relcvant to the questioner's
since this would be likely to contain all infonnation relevant to the questioner's
nccds. Obviously, this would bc a less than optimal approach. The problem thcn is
needs. Obviollsly, this would be a less than optimal approach. The problem then is
to make an intclligcnt scltction of relevant information from the DB.
to make an intelligent selection of relevant infonnation from the DB.
Convcnicntly. the phrasing of the question provides excelIcnt guidance in this
Conveniently. the phrasing of the question provides excellent guidance in this
sclcction. Qucstioncrs tcnd to explicitly mention in their qucstions those aspects
selection. Questioners tend to explicitly mention in their questions those aspects
of the domain h a t arc relevant to their nccds. Convcmly, they tcnd to be vague
of the domain that are relevant to their needs. Conversely, they tend to be vague
with respect to irrelevant aspects of the domain. Vagueness is the dclction of
with respect to irrelevant aspects of the domain. Vagueness is the deletion of
information that may be relevant (but not essential given the context) to the
information that may be relevant (but not essential given the context) to the
sclect~onof a rcsponse.l
selection of a response. l
Consider the diffcrence between questions 14 and 15.
Consider the difference between questions 14 and 15.
14: Which students passed CS105?
14: Which students passed CSlOS?
15: Which students got a passing grade in CS105?
15: Which students got a passing grade in CS105?
While both questions appear to make the same request, 15 mentions grades
While both questions appear to make the same request, 15 mentions grades
while 14 does not, ('mat is, 14 is vague with respect to grades.) As a result, it is
while 14 does not. (Il1at is, 14 is vague with respect to grades.) As a result, it is
more appropriate to include thc grades along with the students in a response to 15
more appropriate to include the grades along with the students in a response to 15
than to 14. The use of vagueness serves as a cue indicating which aspects of the
than to 14. The use of vagueness serves as a cue indicating which aspects of the
domain are relevant to the user's needs, and hence may be appropriately
domain are relevant to the user's needs, and hence may be appropriately
incorporated into a response in the absence of other contextual cues to the
incorporated into a response in the absence of other contextual cues to the
contrary.
contrary.
This function of vagueness is used in forming the responses produced by
This function of vagueness is used in fonning the responses produced by
CO-OP.The systcm assumes that if a user explicitly mentioned some aspect of the
CO-OP. The system assumes that if a user explicitly mentioned some aspect of the
domain in his or her question, h e n that aspect may appear in the response.
domain in his or her question, then that aspect may appear in the response.
1. Vagueness should be distinguished from ambiguity, where multiple interpretations can be assigned
1. Vagueness
be distinguished
from
ambiguity,
where multipleis interprelations
can be assigned
to theshould
samc question.
As wilh
locus.
this characterization
a broad simplilication
of more subtle
to the same
question.
As with As
focus.
this characterization
a broad
simplification
subtleof other
linguistic
phenomena
defined
here. vaguenessiscovers
ellipsis.
hedging, of
andmore
a variety
linguisticphenomena
phenomenaallAs
defined
here. invagueness
covers
ellipsis.
hedging.
other
of which
rcducc
his domain
lo a lack
of dctail
usefuland
Tor a~ lvariety
c c t i n gaofresponse.
phenomena - all of which reduce in this domain to a lack of delail useful ror selecting a response.
189
(Convcrscly, aspects not rncntioncd do not appear.) This is achieved by observing
(Conversely, aspects not mentioned do not appear.) This is achieved by observing
that each aspcct mentioned will rcsult in a corresponding presented set in the
that each aspect mentioned will result in a corresponding presented set in the
MQL. 'lhc strategy is thcn simply to provide information on each presented set
MQL. The strategy is then simply to provide information on each presented set
that is not a singleton. 'ihus 14 above would produce a rcsponse containing only
that is not a singleton. 'Ibus 14 above would produce a response containing only
studcnts (CS105 is a singleton). while 15 would produce a table of student - grade
students (CS105 is a singleton). while 15 would produce a table of student - grade
pairs. Similarly, a qucstion such as "What arc the phone numbers of managers in
pairs. Similarly, a question such as "What arc the phone numbers of managers in
the marketing division?" would produce a response of not just phone numbers,
the marketing division?" would produce a response of not just phone numbers.
but also thc corresponding manager's names (assuming tllat this information is
but also the corresponding manager's names (assuming that this information is
avaiIable in the DB).
available in the DB).
Each lcxical itcm that may define a set has in the CO-OP lexicon an associated
Each lexical item that may define a set has in the CO-OP lexicon an associated
list of fields that arc to be printcd to designate the individuals in the set. Thus,
list of fields that arc to be printed to designate the individuals in the set Thus,
"students" above might result in the printing of both their names and social
"students" above might result in the printing of both their names and social
security numbers - but only if students were mentioned in the question.
security numbers - but only if students were mentioned in the question.
Since the selection of relevant infonnation to be incorporated into a
Since the selection of relevant infonnation to be incorporated into a
supportive response is determined by the phrasing of thc question, the technique
supportive response is determined by the phrasing of the question, the technique
is a language driven inference. The system is able to exploit the user's linguistic
is a language driven inference. The system is able to exploit the user's linguistic
competence to drive its inferencing procedures.
competence to drive its inferencing procedures.
3.4 CO-OPsample queries
Kaplan
Kaplan
190
190
realistic NL system, they are of lcsscr thcorctical intcrcst and are not dcscribcd in
realistic NL system, they are oflesser theoretical interest and are not described in
detail in the text. (An extcndcd discussion can bc found in [Kaplan 19791.)
detail in the text. (An extended discussion can be found in [Kaplan 1979J.)
the examples,
havc altered
becn alrcrcd
the intcrcst
of privacy.
In theInexamples,
names namcs
have been
in the ininterest
of privacy.
"Q" "Q"
(bclow) stands for question, "P" for paraphrase, and "R" For response. (Thc
(below) stands for question, "P" for paraphrase, and "ROO for response. (Th!
paraphrases are prcscntcd to the user for approval prior to exccution of the
paraphrases are presented to the user for approval prior to execution of the
query.) Commentary is in square brackets ([ I). (Some irrclevant paraphrascs and
query.) Commentary is in square brackets ([
(Some irrelevant paraphrases and
rcsponscs arc dcletcd in the interest of brcvity.)
responses are deleted in the interest of brevity.)
n.
Q : advises
Who advises
projectsin
area 36?
Q: Who
projects
in area 36?
P: Assuming
that thcre are projects that are in arca R36,
P: Assuming that there are projects that are in area # 36.
who advises those projects?
who advises those projects?
r h e system incorporates a powerful paraphrascr,l that pcrforms two types of
[The system incorporates a powerful paraphraser, 1 that performs two types of
disambiguation: lexical - it will explain its chosen intcrpretation for vague or
disambiguation: lexical - it will explain its chosen interpretation for vague or
ambiguous terms; and syntactic - it explains thc clausal dcpcndcncics dctcrmined
ambiguous terms; and syntactic - it explains the clausal dependencies determined
by the parser. It is also uscd for indirect responses, as illustrated below. This
by the parser. It is also used for indirect responses, as illustrated below. This
component has access solely to the MQL (parser output), and so requires no
component has access solely to the MQL (parser output), and so requires no
modification on new domains.]
modification on new domains.]
Q: Which programmers
from the ASD group are in
superdivision 5000?
superdivision 5OO0?
191
Look
Lnok for programmers that are from
from ASD group.
[This
Khis paraphrase illustrates
illusUatcs both types of disambiguation, although the lexical
disambiguation is probably unnecessary
unnecessary in this example.]
example.]
R: I don't know of
any ASD group. I don't know of any
ofany
programmers that are in supcrdivision
5000.
superdivision ##5000.
[TIlis
miis corrective indirect response has two components. Rather than simply
correcting all subgraphs of the MQL that denote empty referents in the OIl,
DB, the
system
system explains only the
thc appropriate ones.
ones. For example,
example, since there is no ASD
group,
group, it is obvious that there are no programmers in the ASO
ASD group. This
parsimony, a reflection of Grice's
"Maxim
of
Quantity"
[Grice
19751,
19751,is achieved
Grice's
by noting that the presumptions (subgraphs) of the MQL arc
are partially ordered
according
according to a "subgraph" rclation,
relation, i.e. some subgraphs are subgraphs of others.
The emptiness of a lesser subgraph in the ordering
ordcring will necessarily en!;lit
entail the
emptiness of all greater
grcalcr subgraphs. CO-OP
CO-OP cumputes
cumputcs and corrccLs
currccc. only the
Lhe least
failing
failing subgraphs in this ordering.
ordcring. Note that this technique is entirely domain
transparent: no additional domain driven inference is required.)
required.]
Q:
projects in oceanography does Nasa Headquarrers
Headquarters
Which projecls
Q: Which
sponsor?
P: (J
( I am
am assuming that "Nasa Headquarters" is a SPONSOR
NAME.)
(I am assuming that "oceanography" is a AREA OF
INTEREST,)
INTEREST.)
-- -~
-.
Which projects
p;ojects does Nasa Headquarters sponsor?
Look for projects that are in oceanography.
ah
R:
projects in oceanography that Nasa
R: I don't know ofany projccts
But you mIght
be interested in
Headquarters sponsors.
sponson.llut
m~ght~bcinterested
sponsors...
any projects that Nasa Headquarters sponsors...
Kaplan
Kaplan
192
(PROJECT/ACCT #)
(PROJECT TITLE)
(PROJECT/ACcr #)
(PROJECf TITLE)
35071040
35071040
35071048
35071048
Q:
R: I don'tthat
know
ofanyWirth
projects
tllat were
sponsored
by interested
NSF
Thomas
advises.
13ut you
might be
iliat Thomas
Wirth
advises.
Hut
you
might
be
interested
in any projects that Thomas Wirth advises, namely:
in any projects that Thomas Wirth advises, namely:
(ADVISOR TITLE]
(ADVISOR TITLE)
193
R: FAII-URE.
R: FAILURE.
Sorry, but I didn't understand you fullySorry, but I didn't understand you fullyI havc a vcry limitcd knowlcdge of English.
I have a very limited knowledge of English.
My bcst gucss as to what you meant is:
My bestWho
guesssponsors
as to what
you meant is:
projects?
Who sponsors projects?
Could you please rcphrase your qucstion and uy again...
Could you please rephrase your question and try again..
m i s is an example of failure in thc systcm. In principle, Ihc only kind of
[This is an example of failure in the system. In principle, the only kind of
failure is parse failure - thc intcrprctive componcnts can proccss any legal output
failure is parse failure - the interpretive components can process any legal output
of thc parscr. (In addition, the parscr will reach an interprctadon of any
of the parser. ([n addition, the parser will reach an interpretation of any
syntactically valid input, regardless of . its semantic wcll-fomcdness.)
syntactically valid input, regardless of its semantic wcll-formcdness.)
Conscqucntly, the only type of failure diagnostics that are required arc in the
Consequently, the only type of failure diagnostics that are required are in the
parsc. During thc parsc, a currcnt hypothcsis regarding thc input is maintained at
parse. During the parse, a current hypothesis regarding the input is maintained at
all times. In the cvcnt of a parsc failurc, thc bcst gucss (largest hypothesis)
all times. In the event of a parse failure, the best guess (largest hypothesis)
cncountcrcd is displayed for the user. in the hope that this will aid the user in
encountered is displayed for the user, in the hope that this will aid the user in
localizing thc nature of the problem. This approach, to try to explain what went
localizing the nature of the problem. This approach, to try to explain what went
right with the parse ratllcr than what went wrong. is rnotivatcd by h e fact that the
right with the parse rather than what went wrong, is motivated by the fact that the
point at which tllc error actually occurred is oftcn considerably bcfore the point at
point at which the error actually occurred is often considerably before the point at
which the error is detectcd - English is more than locally ambiguous. Any attempt
which the error is detected - English is more than locally ambiguous. Any attempt
to diagnose the error at Ihc point of failure is more likely to mystify the user than
to diagnose the error at the point of failure is more likely to mystify the user than
to help him or her re-phrase the qucstion. In this case, the system is unable to
to help him or her re-phrase the question. In this case, the system is unable to
process a negation preceding a prcpositional phrase.]
process a negation preceding a prepositional phrase.]
Q: doesn
Who'Idoesn
'f sponsorprojecu in the oceanography area?
Q: Who
sponsor projecls in Ihe oceanography area?
[Herc the user has re-phrased the question.]
[Here the user has re-phrased the question.]
P: (1 am assuming that "occanography" is an AREA OF
P: (I am assuming that "oceanography" is an AREA OF
INTEIlEST.) Assuming that therc arc projects that are in the
INTEREST.)
Assuming
aresponsor
projectsthose
that projects?
are in the
oceanography
area,that
whothere
doesn't
oceanography area, who doesn't sponsor those projects?
R:
(SPONSOR NAME)
(SPONSOR NAME)
Kaplan
Kaplan
194
194
r h i s example illustrates two points. Firsf that the system can process
[This example illustrates two points. First, that the system can process
negation. 'This is handlcd by the quantification routines, csscntially by counting
negation. This is handled by the quantification routines, essentially by counting
the numbcr of projects in oceanography sponsored by each sponsor and selccting
the number of projects in oceanography sponsored by each sponsor and selecting
those that sponsor exactly zero. This is a special case of quantification. Second,
those that sponsor exactly zero. This is a special case of quantification. Second,
the system must decide in such cases just what the sct "Who" should appropriately
the system must decide in such cases just what the set "Who" should appropriately
refer to. If it took the set of all people in the Dl3 (i.e. users, advisors,
refer to. If it took the set of all people in the DB (Le. users, advisors,
programmers, sponsors, ctc.) the responsc would include a great deal of
programmers, sponsors, etc.) the response would include a great deal of
extraneous information. CO-OP sclccts as the rcfercnt those individuals that
extraneous information. CO-OP selects as the referent those individuals that
could reasonably participate in the unncgated relationship. The resolution of
could reasonably participate in the unnegated relationship. The resolution of
vague referring expressions is made solely on the basis of rhe lexicon and DB
vague referring expressions is made solely on the basis of the lexicon and DB
schema.]
schema.]
Q : Which programmers are in division 41 1I and 4361?
Q: Which programmers are in division 4/// and 436/?
Q: Which
projects
are in6 area
6 and 27?
Q: Which
projects
are in area
and 27?
P:simplify
To simplify matters, I'm going to split your question into
P: Toseveral
matters,
I'm going
split your
parts.
I understand
yourtoqucsdon
to question
mean: into
several parts. I understand your question to mean:
Which projects arc in area #27? Which projects are in area
Which projects arc in area #217 Which projects are in area
# 6?
#67
195
the system
system interprets
interprets wide
wide scoped
seoped
[Mainly for its own convenience, the
conjunctions as presenting multiple questions. Notice
Notice that
that the
the effect
cfTect of
of the
the "and"
"and"
ratller like that of
of an "or".
"or". The system then
then answers
answers each
each question
question
in this case is rather
in turn.]
Q: Which
Which users work on projects
projects in area
area 33 that
that are
are in
in division
division
Q:
200?
P: Assuming that there arc projccts
projects that are in area #3,
# 3, which
which
users work on thoseprojects?
those projects? Look for users that arc in division
#200.
#200.
[Relative
difficult
[Relative clause
clause and prepositional phrase attachment is another difficult
of semantic tests in the parsing process. Once
Once again,
again,
problem that involves the use
usc of
sufficient information for disambiguation. Syntactically,
the DB schema provides suficient
"that are in division 2200"
modifies users
users or
or
it is unclear whether the relative clause "that
0 0 modifies
projects.
projccts. The system decides
decides that users is the most likely referent, by scoring the
of the
potential referents
referents on three dimensions.
dimensions. The first is the semantic relatedness of
of
the
resulting
MQL
relation,
in
this
case
(users,
divisions)
and
subject
and
object
subject
(projt;cts.
relatedness is measured by the distance through
through
(projects. divisions).
divisions). The semantic relatedness
the
the schema between the records
records denoting those items. In the schema,
DlST(PROJT,DIVIS) = 4 (recall that projects
D1ST(USER,DIVIS)
DIST(USEI1,DIVIS) = 2, while DISr(PROIT,DIVIS)
arc
are not organized into divisions). Second,
Second, the distance back in the question from
the
the clause
clause to
to the
the potential referent is measured, on the assumption that the
modifier isis most likely to
to be ncar
near its
its referent. Third, a prediction is made on the
modifier
basis of
of the
the information
information content of the lexical items designating
designating the relation (in
basis
this case,
case, the
the word "in" contains little predictive value, and does
afTect the
this
docs not affect
decision). This
This isis done
done by associating with each prepoiition
preposition and verb in the
decision).
lexicon aa (possibly
(possibly empty) list of probable subjects
subjects and objects.
objects. For example, a
lexicon
verb
like
"sponsor"
in
this
domain
predicts
a
sponsor
as
a
subject and a project as
verb like
an
an object.]
object.]
Q:
phone numbers?
Q: What
What are the
the advisor's
advisor'sphone
R:
(ADVISOR NAME)
NAME)
(ADVISOR
MOYER, ROSCOE
ROSCOE
MOYER,
OIIRIEN, JOANNE
JOANNE
OBRIEN,
ORSLAG, VANCE
VANCE
ORSZAG.
312-753-2184
312-753-2184
713-845-0365
713-845-0365
804-924-8675...
804-924-8675...
Kaplan
Kaplan
196
196
R:
(GROUP ID)
(GROUP ID)
AQM
AQM
HA0
HAG
SYS
SYS
UL
UL
R:
(USER #) (PERSON NAME)
(GROUP ID)
(USER #) (PERSON NAME)
(GROUPID)
8002
TAKAMINE, GARY UL
GARY UL
8002 8056 TAKAMINE,
AITKEN,
JOYCE
HA0
AITKEN, JOYCE
8056
HAO
8061
VALEN'T. JO
AQM
8061
VALENT.JO
AQM
8063
WALSH,MARIE
HA0
MARIE
HAO
8063 8067 WALSH,
WOIIKING, MARIE SYS
WORKING, MARIE SYS
8067
[This qucstion illustrates how the system organizes the responses according to
[This question illustrates how the system organizes the responses according to
the way the information is prcscnted by the question. This can bc contrasted with
the way the information is presented by the question. This can be contrasted with
197
the following question, which rcqucsts the samc information in a different form.]
the following question, which requests the same information in a different fonn.]
Q: Which
groups do prograttzmers work in?
Q: Which
groups do programmers work in?
R:
(GROUP ID) (USER #) (PERSON NAME)
(GROUP ID) (USER #) (PERSON NAME)
AQM AQM
HA0
HAO
SYS
UL
SYS
UL
8061
8056
8063
8067
8002
8061
8056
8063
8067
8002
VALENT, JO
VALENT,JO
AITKEN, JOYCE
AITKEN, JOYCE
WALSH, MARIE
WALSH, MARIE
WORKING, MARIE
WORKING, MARIE
TAKAMINE, GARY
TAKAMINE, GARY
R:
(SPONSOR NAME)
(SPONSOR NAME)
NASA GODDhRD SPACE CENTER
NASA GODDARD SPACE CENTER
NATIONAL SCIENCE FOUNDATION
NATIONAL SCIENCE FOUNDATION
[Quantification in CO-OP is done by a counting mechanism. While the
[Quantification in CO-OP is done by a counting mechanism. While the
handling of quantification in this system js a very partial solution to a difficult
handling of quantification in this system is a very partial solution to a difficult
problem, it is sufficient for handling simple existcntials, universals, negations, and
problem, it is sufficient for handling simple existcntials. universals, negations, and
counts. Scoping of nested quantifiers is simply taken left to right in the question.]
counts. Scoping of nested quantifiers is simply taken left to right in the question.]
Q: Which users work on everyprojecl in area 55?
Q: Which users work on every project in area 55?
Kaplan
Kaplan
198
R:
(USER)
(USER)
(PERSON NAME)
(PERSON NAME)
8000
PENNEYS, JEANNE
8000
PENNEYS, JEANNE
8002
TAKAMINE, GARY
8002
TAKAMINE, GARY
8067
WOIZKING, MARIE
8067
WORKING, MARIE
[Universal quantification (all, every, etc.) rcquircs executing more than one
[Universal quantification (all, every, etc.) requires executing morc than one
qucry against the DR. First, a query is executed to determine the size of the
query against the DR first, a query is executed to determine the size of the
universally quantified sct (in this case, the number of projects in area of interest
universally quantified set (in this case, the number of projects in area of interest
55). ?his value is substituted into a sccond qucry to produce thc appropriate
55). This value is substituted into a second query to produce the appropriate
response. Note that the paraphrase in the above example reflects the fact that in
response. Note that the paraphrase in the above example reflects the fact that in
English, universally quantified expressions are normally assumed to be
English, universally quantified expressions are normally assumed to be
non-empty.]
non-empty.]
programmers work in division 3513 or project
Q: Which
Q: Which
programmers work in division 3513 or project
436 10008?
43610008?
R:
(USER #) (PERSON NAME)
(DVSN NAME) (PROJECT TITLE)
(USER #) (PERSON NAME)
(DVSN NAME) (PROJECT TITLE)
8005
TAKAMINE, JOYCE ***'*
PRO DVLPMNT
T AKAMINE, JOYCE **.**
8005
PRO DVLPMNT
U.OF HAWAII
*****
8061
VALENT, 10
VALENT,JO
8061
U.OFHAWAII *****
8068
WONG. THOMAS
*****
PRO DVLPMNT...
*****
WONG, THOMAS
PRO DVLPMNT...
8068
[This question, while quite natural for English speakers to interpret. contains a
form of indircct reference. " H A 0 group" hcre does not actually refer to the
form of indirect reference. "HAO group" here does not actually refer to the
group itself, but rather to its members. If the system were very literal minded, it
group itself, but rather to its members. If the system were very literal minded, it
would find this question semantically ill-formed. since groups don't have
would find this question semantically ill-formed, since groups don't have
specialties. However, because the qucstion has been poscd, CO-OPassumes chat
specialties. However, bccause the qucstion has becn posed, CO-OP assumes that
the user must have mcant something, no matter how poorly expressed. This
the user must have meant something, no matter how poorly expressed. This
pragmatic effect is rcflcctcd in the system's ability to reach a most reasonable
pragmatic effect is reflected in the system's ability to reach a most reasonable
interpretation of virtually any syntactically valid input, regardless of its literal
interpretation of virtually any syntactically valid input, rcgardless of its literal
199
implausibility.
implausibility. In this
this case CO-OP
CO-OP discovers
discovers that the simplest path in the schema
from
from specialties
spccialtics to groups is through the programmers, and so it responds as
follows.]
follows.]
R:
(PERSON NAME) (SPECIALTIES)
AITKEN,
AITKEN. JOYCE LINEAR ALGEBRA
ENDS",
WALSH,
WALSII, MARIE GRAPHICS,
GRAPHICS. FRONT ENDS...
[perhaps
mis phenomenon is the system's
system's
perhaps a more interesting example of this
response
response to the
the following.]
following.]
Q:
Q: What
What is knowledge?
P: Display knowledge.
R:
(SPECIALTIES)
PLlB
PLIB MAINTENANCE
LINEAR AI-GEBRA
ALGEBRA
LIBRARY MAINTENANCE
...
MAINTENANCE...
3.5 Portability
Portability
3.5
Much work in AI
A1 has focused
focused on methods for organizing and representing large
Much
amounts of world knowledge.
knowledge. Motivating this research is an assumption that the
amounts
more knowledge
knowledge that can be incorporated into a system,
more
system, the more intelligent it.
its
behavior will
will be. While
While this assertion is almost certainly valid, it is equally
behavior
interesting to
to explore the question of how little
littleknowledge
,knowledge may be necessary to
to
interesting
particular tasks,
tasks, and how that knowledge may be partitioned
perform panicular
partitioned into
"general" and
and "specific"
"specific" components.
components.
"general"
reason, a design
design limitation was placed on CO-OP that all domain
For this reason,
spccific knOWledge
knowledge be encoded at the lexicalleve!.
lexical level. Only the information coded in
specific
the lexicon
lexicon Or
or already
already available
available in
in the
the DB
system could
DB system
could be
be used
used as
as sources
sources of
of
the
knowledge
about
the
domain.
All
other
aspects
of
the
system,
including
the
knowledge about the domain. All other aspects of the system, including the
generation of
of cooperative
cooperative responses,
responses, had
had to be totally free of
of any DB
DB specific
generation
knowledge. (That
(That is,
is, the
the programs
programs had
had to
to be
be completely
~ompletely domain
domain transoarcnt)
tronsoarent)
knowledge.
Though this
this limited
limited the
the type
type of
of questions
questions that
that could
could be
be handled
handltd to
to simple
simple
Though
200
Kaplan
1)1M
The kAi=.
lexicon. Some of the domain specific
specific words available
available to
1)
explicitly in the lexicon,
lexicon, along with
the users must be encoded explicitly
DB.
their definitions in terms of the DB.
2) The DR
DB schema. This has to be put in a special
special form to
facilitate
This task can be performed
facilitate path finding.
automatically if
if the schema is a tree structure (i.e.
(i.e. if
if no
a~~tornatically
ambiguous paths exist), and so a program could be written that
inputs a standard CODASYL Data Definition Language and
Outputs the needed structure. In the case of a graph structured
outputs
preferred paths is required.
schema,
schema, a selection of
ofpreferred
3) The DB itself.
itself, Obviously,
Obvio~sly. the new DB must be supplied.
4) lli
The item urint
names. A
special list (ITEMPRNAME in the
print l!J!lJlli.
Aspecial
program) associating
associating a print name with each field in the DB is
required for labeling the output and providing lexical
disambiguation in the paraphrase.
201
5) The Dl3 name and oassword. These are ncedcd to open the
5) The DR name and password. These are needed to open the
DB,
DB,
Kaplan
202
Kaplan
202
(ARCRFT NAME)
(ARCRFf NAME)
41500
44000
50000
58000
A-7
A-6
AV8A
F5
mis query gives a flavor of the type of data in the ONRODA DB. Notice that
[This query gives a flavor of the type of data in the ONRODA DB. Notice that
the system has produccd the information for cach aircraft - it is currently
the system has produced the information for each aircraft - it is currently
incapable of sclecting only the greatest of these.]
incapable of selecting only the greatest of these.)
Q: What is thefire rare of the M-61?
Q: What is the fire rate ofthe M-6J?
P: (I am assuming that "M-61"is a GUN NAME.) Display
P: (I the
am fire
assuming
"M-61" is a GUN NAME.) Display
rate of that
the M-61.
the fire rate of the M-61.
R: (GUNFIRE RATEj
R: (GUNFIRERATE)
3000
[M-61 appcars only as a value in Ule DB system. CO-OP has to infcr which field it
[M-61 appears only as a value in the DB system. CO-OP has to infer which field it
is likely to occur in. Hcrc it has correctly idcntificd it as a gun name. Tke same
is likely to occur in. Here it has correctly identified it as a gun name. The same
domain transparent procedures that rcsolvcd these references in the NCAR DB
domain transparent procedures that resolved these references in the NGAR DB
apply cqualIy well here.]
apply equally wcll here.)
203
Q:
slrut cume
curve radar that carry
At-6l guns?
Q: Which
Which ships have srmt
cary A!-61
["Strut
YStrut curve" is a type of radar.]
P: (l
(I am assuming that "strut curve" is a RADAR NAME.)
(I am assuming
assuming that "M-61" is a GUN NAME.) Which ships
have strut
suut curve radar? Look for ships that carry M-61
M-6i guns.
[fhis
detennine whether the relative
phis question is syntactically
syntactically ambiguous. To determine
clause "that carry M-61
M-61 guns" modifies "ships" or "radar" it is necessary to make a
semantic judgment. The system correctly chooses "ships",
"ships", as the paraphrase
semantic
indicates.)
indicates.]
Q:
ships?
Q: Which
Whichguns are on aircraft
aircraJi and
andships?
[This
F i s is a case of simple conjunction. Note the organization
organization of the answer.]
R:
(GUN NAME) (SHIP NAME) (ARCRFf
NAME)
(ARCRFTNAME)
MK
12
MKl2
NR-30
TARAWA
DODRY
OSA
A-4
SU-7B
SU-7B...
SU-7B..,
Q:
subs with
don't cany
carry MK-44 torpedoes?
Q: Which
Which mbs
with radar
radardon't
torpedoes?
R: 11 don't
don't know of any subs with radar. I don't
don't know of
of any
subs that carry MK-44 torpedoes.
[1bis
p i s is an example of a corrective response. Notice that the latter correction
entails
all Isubs don't carry MK-44 torpedoes.
torpedoes. Here the negated presumption
entails that &
does not entail an empty set, but rather the entire set, indicating that the
restriction was irrelevant)
irrelevant.]
Q:
Q: What
What is
is the deteclion
detection range ofships
of ships with torpedoes?
lorpedoes?
Kaplan
204
R:
(RADAR RANGE) (SHIP
(SHIPNAME) (fRPDONAME)
(TRF'DO NAME)
100NMI
lOONMI
140NMI
140 NM1
200 NMI
200NMI
AGERHOLM MK-.46
MK-46
FISKE
MK-46
MK-.46
MK-46
COCHRANE MK-.46
GRIDLEY
GRIDLEY
MK-48
mi
[Here,
[Here, the system produces a suggestive indirect response. It has designated hthee
torpedo type as the most likely focus.
focus, and consequentiy
consequently an<wers
answers the more general
question, as indicated.]
Q:
Q: What are the classes o[ships?
of ships?
R:
(SHIP CLASS)
(SHIP NAME)
HLUEBIRD
BLUEBIRD
KNOX
VIREO
WWNES
DOWNES
TRUElT
TRUETI
STEPANNY
SKORY
[This
p i s question requires the system to perform an inversion on the ship e1ass
class field
205
to present the data in a form consistent with thc request. Although the user has
to present the data in a form consistent with the request. Although the user has
presented the classes as entities, thcy are reprcscntcd in the DB as attributes of the
presented the classes as entities, they are representcd in the DB as attributes of the
ships. Responses in CO-OP are a Function of the phrasing of the query and the
ships. Responses in CO-OP are a function of the phrasing of the query and the
information in the DB, but not thc particular organization for the data chosen by
information in the DB, but not the particular organization for the data chosen by
the DB designer.]
the DB dcsigner.]
R:
(GUN FIRE RATE)
(SH tP NAME)
(SHIP NAME)
GARCIA
GARCIA
OWENS
OWENS
HIGBEE
HIGBEE
TARAWA ...
TARAWA...
18
40
3.6 Conclusion
3.6 Conclusion
In practical environments, the acceptance of NL DB qucry systems will be
In practical environments, the acceptance of NL DB query systems will be
decided in part by the degree to which thcy adopt the pragmatic discourse
decided in part by the degree to which they adopt the pragmatic discourse
conventions normally observed by human speakers. In human conversation, a
conventions normally observed by human speakers. In human conversation, a
variety of conventions, postuIates, rules, etc, facilitate smooth and effective
variety of conventions, postulates, rules, etc. facilitate smooth and effective
communication. The use of NL by a query system will imply to the users that
communication. The use of NL by a query system will imply to the users that
these conversational principles will be observed. Pragmatic conversational effects,
these conversational principles will be observed. Pragmatic conversational effects,
particularly those that relate to cooperation, will be essential to the behavioral
particularly those that relate to cooperation, will be essential to the behavioral
repertoire of successful NL systems.
repertoire of successful NL systems.
However. it is not the case that those effects are desirable for all users that wish
However, it is not the case that those effects are desirable for all users that wish
to query a DB. For some users. NL is not an appropriate means for interacting
to query a DB. For some users, NL is not an appropriate means for interacting
with a DB system. Some thoughts on tIlc types of users most likely to benefit by
with a DB system. Some thoughts on the types of users most likely to benefit by
NL query systems are included here.
NL query systems are included here.
YSerS
hi&-1evcI nr
If a user is unclear about what
&:
llim. Yiith high-level m: ~ ~: If a user is unclear about what
Kaplan
Kaplan
206
206
information s/hc wants or is unsure of what information is prcscnt in the DB, thc
information s/he wants or is unsure of what information is present in the DB. the
ability of NL to express vague qucstions and the appropriatencss of indirect
ability of NL to express vague questions and the appropriateness of indirect
responscs to NI, qucstions can ajd the uscr in locating rclcvant information. NL is
responses to NL questions can aid the user in locating relevant information. NL is
an appropriate vehicle for browsing through a DB, By contrast, users with specific
an appropriate vehicle for browsing through a DB. By contrast, users with specific
or detailed nceds may find NI, too imprccise for expressing their queries: various
or detailed needs may find NL too imprecise for expressing their queries: various
ambiguities, such as the interpretation of ncstcd quantifiers, may hinder rather
ambiguities, such as the interpretation of nested quantifiers, may hinder rather
than hclp in communicating with a Dl3 systcm. If a particular form and content is
than help in communicating with a DB system. If a particular form and content is
desired for the responses, a formal qucry language may be more cffcctive.
desired for the responses, a formal query language may be more effective.
Occasional or casi~alusers: For these users, the invcstmcnt of time and effort
Occasional or casual users: For these users, the investment of time and effort
to learn a formal query language may excecd the potential value of the
to learn a formal query language may exceed the potential value of the
information rctricvcd. For the user with a single question. the Fact that thc syntax
information retrieved. For the user with a single question, the fact that the syntax
and scmantics of NI. are alrcady known may be a major advantage to its use.
and semantics of NL arc already known may be a major advantage to its use.
Frcquent users, however, may be bctter off with a tcrse, spccial-purpose language:
Frequent users, however, may be better off with a terse, special-purpose language:
unrestricted NL may be too verbose and unstnicturcd for thcir needs. ?'he
unrestricted NL may be too verbose and unstmctured for their needs. The
relative cost of bccoming familiar with a particular query languagc falls off rapidly
relative cost of becoming familiar with a particular query language falls off rapidly
as frequency of use increases.
as frequency of use increases.
Naive users: Naive users arc unfamiliar with thc details of the DB structure,
Naive users: Naive users arc unfamiliar with the details of the DB structure,
and have no dcsire to lcarn how the information in the DB is organized in order to
and have no desire to learn how the information in the DB is organized in order to
appropriately phrase their qucrics. Whilc most formal query languages require
appropriately phrase their queries. While most formal query languages require
some knowledge of how the DD is normalized. NI, questions do not. NL provides
some knowledge of how the DB is normalized. NL questions do not. NL provides
a measure of data indcpcndcnce that most formal qucry languages lack,
a measure of data independence that most formal query languages lack,
facilitating access to data by naive users. On the other hand, users knowledgeable
facilitating access to data by naive users. On the other hand users knowledgeable
in the structuring of a DB may find this indcpcndence a hindrance - they cannot
in the structuring of a DB may find this independence a hindrance - they cannot
help but conceive their queries with rcspect to thcir undcrstanding of the details of
help but conceive their queries with respect to their understanding of the details of
the DB. The additional effort required to inap their vicw of the data onto a NL
the DB. The additional effort required to map their view of the data onto a NL
question, and hope that the systcm performs thc inverse mapping properly, may
question, and hope that the system performs the inverse mapping properly, may
simply bc an annoyance.
simply be an annoyance.
In summary, a NL DB query system is likely to be of most use to users who
In summary, a NL DB query system is likely to be of most use to users who
conceive of the system as an expert in its own structure and content, and are
conceive of the system as an expert in its own structure and content, and are
willing to share the task of formulating qucries and selecting the form and content
willing to share the task of formulating queries and selecting the fonn and content
of the responscs with the system itself. Uscrs who regard the system as a tool, and
of the responses with the system itself. Users who regard the system as a tool, and
wish to exert maximum control over its operation, are likely to find a NL DB
wish to exert maximum control over its operation, are likely to find a NL DB
query system of limited value,
query system of limited value.
This work has examined the feasibility of providing cooperative responses
This work has examined the feasibility of providing cooperative responses
from a portable NL DB query systcm. Beginning with a study of some of the more
from a portable NL DB query system. Deginning with a study of some of the more
obvious pragmatic aspects of cooperation in human discourse, a few principles
obvious pragmatic aspects of cooperation in human discourse. a few principles
have bccn projected onto the DB query domain, where computational approaches
have been projected onto the DB query domain, where computational approaches
have been explored. To assess thc effectiveness of the tcchniques developed, a NL
have been explored. To assess the effectiveness of the techniques developed., a NL
DB qucry systcm, CO-OP, has bccn implcmentcd and the results described. While
DB query system, CO-OP, has been implemented and the results described. While
it leaves open many questions, the design of CO-OPsuggests scvcral concepts of
it leaves open many questions, the design of CO-OP suggests several concepts of
207
potential linguistic and computational interest:
potential linguistic and computational interest:
3) I3y making the assumption that all aspccts of a question are making reference to
3) By making the assumption that all aspects of a question are making reference to
something in the DB. various types of indirect responses, most notably corrective
something in the DB, various types of indirect responses, most notably corrective
indircct rcsDonscs, can be produced in a domain transparent fashion from a NL
indirect responses, can be produced in a domain transparent fashion from a NL
DB query system.
DB query system.
4) Maintaining the lexical and syntactic integrity of the original question
4) Maintaining the lexical and syntactic integrity of the original question
throughout the interpretation process provides a means for explaining failures,
throughout the interpretation process provides a means for explaining failures,
errors, and responses in terms that the user is likely to understand. The use of an
errors, and responses in terms that the user is likely to understand. The use of an
intermediate rc~rcscntation that canturcs relevant linguistic consideration$
intermediate representation that captures relevant linguistic considerations
without rcflcctine arbitrarv organizational details &f & undcrlving nB is central
reflecting arbitrary organizational ~ ill ~ underlying 12!J. is central
to this approach. Given such a representation, the organization of data in the
to this approach. Given such a representation, the organization of data in the
responses can and should be independcnt of the organization of data in the DB.
responses can and should be independent of the organization of data in the DB.
m.mu
Kaplan
208
3.7 Acknowledgements
Achnoslcdgcmcnts
3.7
CHAPTER
CHAPTER
4 4
Natural
Language
Generation
a Computational
Natural
Language
Generation
as aas
Computational
an
Introduction
Problem:
Problem: an Introduction
David
D. McDonald
David
D. McDonald
4.1. l ntrodurtion
4.1. Intraduction
McDonald
210
elaborating the linguistic and rhctorical problcins that havc bccn dcalt with. I h i s
elaborating the linguistic and rhetorical problems that have been dealt with. This
discussion will givc thc rcadcr an idca of thc kinds of problcms that this rcscnrch
discussion will give the reader an idea of the kinds of problems that this research
has conccntratcd on antf whcrc thc focus of thc t h a ~ r ylics. Following that wc will
has concentrated on and where the focus of the theory lies. Following that we will
considcr what it rnciins lo llavc a computational inodcl of gcncration and skctch
consider what it means to have a computational model of generation and sketch
thc li~nitatiorlsthat havc bccn ilnposcd on it. 'I'hc main points of the modcl will
the limitations that have been imposed on it. The main points of the model will
thcn bc prcscntcd, including a walk-through of one of thc cxamplc outputs.
then be presented, induding a walk-through of one of the example outputs.
Finally lhc modcl will hc c{)nrraslcd briefly with carlicr gcncration tcchniqucs and
Finally the model will he contrasted briefly with earlier generation techniques and
thc utility of thc prcscnt conlputcr program discussed.
the utility of the presentLOmputer program discussed.
4.2. Itcsults for 'l'cst Spcakers
4.2. Results for Test Speakers
211
Ihc "linguistic conipi~ncnt"a i d the rcsr or tllc prtIccss can bc spccificd prcciscly,
the "Jinglli~tic component" and the rest of the process can be specified precisely,
and (2) h a t Ihc cx tc:il of thcir shclrcd ;~ssunlptionsabout rcprcscntalio~rsand
lmd (2) that the cxtC:lt of Ulcir shJred assumptions abollt representatioJls and
contingncics is snl;lll (otllcrivisc tlic linguistic componcnt \r.ould Invc to bc
conting.encies is SllWll (otherwise the linguistic component would ha\'c to be
1,lrgcly ril\vrittcn for c,?cli ncw spcrlker).
hlrgely rewritten for e;lcl1 new speaker).
'I'lic rcI;~tionsliipbctwccli lhc lil~gl~istic
componcnt and lhc largcr systcrn is
The rebtionshifl between lhe lingubtic component and the larger system is
skctdlcd in figurc 1.
sketched in figure 1.
I----------
EXl3ERT
PROGRAM
CENERA'IYIR
EXPERT
PROGRAM
GENJ<:RATOR
Spcukcr
Dictionary '
1,inpyistic
UoltUistk
Con~poncnt
Speaker
Component
McDonald
212
knowlcdgc 1113t
Uiat II :~Im
!m attributing herc
"spcakcr" will havc
bc incorporated
knowledge
here to a "speaker"
have to be
in a theoretically
thcorctically sound way. In any event,
cvcnt. it will bc
convcnicnt to think in terms
be convenient
thc speaker
spcakcr as thc
systcm tllat
Uiat ninkcs
dccisions about
of the
the part of thc
the cxpcrt
expert system
makes all thc
the decisions
what to say. providing the input to the
ihc linguislic
component.
linguistic
4.2.1.
4.2.1. The I)ilferr.nt
Different Input llcpresentations
Ilepresentations
213
I
FHI., "Framc-oricntcd. licprrscntation 1.aqguagc" was dcvclopcd by
FRI.. "Frame-oriented. Reprrscntation I.al}gllage" was developed by
Goldsicin and Robcrls [Cioldstcin and liobcrts 19741 as an cxpcrimcntal
Goldstein and Roberts [Goldstein and Roberts 1974] as an experimental
implcmcntation of "framc" idcss of Minsky [Minsky 19741. It was ilscd by
implcmrntation of "frame" ideas of Minsky 1M insky 1974]. It was used by
Winston as thc rcprcscntation for his program for ~nakitlg and cvaluating
Winston as the representation for his program for making and evaluating
analogies [Winston 19801. A dictionary was co~npilcdfor Winston's database on
analogies [Winston 1980]. /\ dictionary was compiled filr Winston's database on
thc play "Macbcth", from which texts wcrc dircctly produccd describing the
the play "Macbeth", from which texts were directly produced describing the
actors and major sccncs, Winston imposcd a rigid "casc-framc" discipline on the
actors and major scenes. Winston imposed a rigid "caseframe" discipline on the
ficlds of his ICIII. fraincs, making Lhcm vcry casy to translate into Ilnglish. ?his
fields of his FlU, frames, making them very easy to translate into English. This
madc i~ possiblc to conccntratc instcad on thc cohcrcncy of thc tcxt (as in the
made it possible to concentrate instead on the coherency of the text (as in the
sctnantic nct domain), and lo dcvclop a battery of gcncral linguistic
semantic net domain), and to deyelop a battery of general linguistic
transformations to dcal with propositional attitudes, subordinate clauscs, scntcncetransformations to deal with propositional attitudes, subordinate clauses, sentencelcvcl adjunction, and thctnatic firus, and to study explicitly planncd cataphor and
level adjunction, 4lnd thematic focus, and to study explicitly planned cataphor and
subordination (as in "B~cuuscI.adj1 hlacbelh penuallrd hit,! lo do it, Afacberh
subordination (as in "Becausc rad)' Macbeth persuaded him to do it, Macbeth
~lrurdrrcdDuncan '7).
murdered Duncan. ').
FRL
McDonald
2]4
215
was rcndcrcd as:
was rendered as:
"There is space 011 a surjbcefir a block ifafrd ortly $!hat surfoce is the
"There is space 011 a surface fur a block if and ollly if that surface is the
/able or if hns a clerrr lop:"
table or it has a clear top;"
Iliffcrcnt conventional intcrprctations of formulas wcrc cxpcrimcntcd with,
Different conventional interpretations of formulas were experimented with,
originally undcr cxplicit control of thc dcsigncr and latcr undcr program control
originally under explicit control of the designer and later under program control
i~singboth lookahciid at tllc linguistic decisions and simple tcsts of thc logical
using both lookahead at tile linguistic decisions and simple tests of the logical
stnrctilrc of the cxprcssions to dctcnninc whcthcr an intcrprctation would go
stmctllre of the expressions to detennine whether an interpretiltion would go
through. 'fhc same formula, say: "V(x) man(x) 3 rnortal(x)", can be
through, The same fOnTIlIla, say: "V(x) man(x):J morlal(x)", can be
undcrstood conventionally and rcndcrcd as: "All tr~o~
ow ~r~c~rtal",
or undcrstood
under~to()d conventionally and rendered as: "Allll1clI are /Ilortal", or understood
literally and rcndcrcd as: "l:or a11y ~ h i n g fthnt
,
rhir~gis a ntutl, /lrell ii is ~~ror~al".
literally and rendered as: "For any thillg. if that thillg is a mall. then it is mortal".
It docs n o t takc long, howcvcr, to cxhaust thc linguistic insights to bc gaincd
It docs not take long, however, to exhaust the lingui~tic insights to be gained
from looking at single fi)rn~ulasin isolation. A prcdicatc calculus formula is
from looking at single fomlUlas in isolation. /\ predicate calculus formula is
undcrdctcr~nincdwith rcspcct ro thc morc sopliisticatcd fomis of rcfcrcncc and
underdetermined with respect ro the more sophisticated f0I111S of reference and
qiianlificarion supported by nal~irallangilagcs, and iu conncctivcs and prcdicatcs
quantification supported by natural languages, and its connectives and predicates
can usually bc givcn many cqiialiy plausible rcndcrings. Whcn formulas appear in
can usually be given many equally plausible renderings, When fannulas appear in
isolation, thcrc is no molivation for using onc rcndcring or onc intcrprctation of a
isolation. there is no motivation for using one rendering or one interpretation of a
qiiantificr ovcr anothcr.
quantifier over another.
Onc way to providc thc nccdcd motivation is to look at formulas in thc context
One way to provide the needed motivation is to look at fonnulas in the context
of a proof. Figurc 2 shows a nati~ri~l
dcd~lctionproof followcd by thc tcxt that the
of a proof. Figure 2 shows a natural deduction proof followed by the text that the
logic domain's dictionary sclcctcd for it. ('Tllc first linc is a statcmcnt of the
logic domain's dictionary selected for it. enle first line is a statement of the
"barber paradox" crciitcd by Ijcrtrand Russcll as a popular rcndcring of thc sct of
"barber paradox" created by Bertrand Russell as a popular rendering of the set of
all scts paradox.)
all sets paradox.)
'Ihc lincs of thc proof arc passcd to Ihc program in scqucnce; the English text
The lincs of the proof arc passed to the program in sequence: the English text
sclcctcd for carlicr Iincs providcs a discoursc context to narrow the choices
selected for earlier lines provides a discourse context to narrow the choices
availablc to later oncs, directly controlling subscqucnt rcfcrcnccs to constants,
availa~le to later ones, directly controlling subsequent references to constants,
wriablcs intcrprctcd as gcncric rcfcrcnccs. and prcdicarcs and fonnulas used as
variables interpreted as generic references. and predicates and fannulas used as
descriptions. Furthcr motivation for text choice is provided by the labels that are
descriptions. Further motivation for tcxt choice is provided by the labels that arc
attachcd to certain lincs to rcflcct 'their rolc in thc structure of thc proof (c.g. "the
attached to certain lines to reflect their role in the structure of the proof(e.g. "the
assu~nption"or "a cot~~rrrdictir~n").
and by tIic logical infcrcnce rulcs that derived
assumption" or "a contradictio/l"). and by the logical inference rules that derived
thc lincs: a largc part of thc rcndcring of Lhc proof must hc an explanation. guided
the lines: a large part of the rendering of the proof must be an explanation. guided
by the infcrcnce rulcs, of how cach linc follows from thc carlicr ones.
by the inference rules, of how each line follows from the earlier ones.
McDonald
216
linel: prcmise
lincl:
premise
line2:
linc2:
3x (barber(x)
/\ Vy(shaves(x,y) ...
(barbedx) A
++ ""shaves(y,y)))
ishaves(y,y)))
existential
cxistcnti;il insUlntiation
instantiation (1)
barber(g) /\
A Vy(shaves(g,y) ... ""shaves(y,y))
-rshaves(y,y))
linc3: tautology
tnutology (2)
(2)
Vy shaves(g,y)'"
shaves(g,y) ++ ""shaves(y,y)
~shaves(y,y)
linc4: universal
uniucrsal instantiation (3)
line4:
shaves(g,g) ...
++ ""shaves(g,g)
-shaves(g,g)
lineS:
linc5: tautulogy
lautol(~gy(4)
(4)
linc6:
linch:
shaves(g,g) A
/\ ""shaves(g,g)
-shaves(g,g)
conoitiol1alizJtion
conditionali7,~tion(5,])
(5.1)
(barbedx) A
Vy(shaves(x,y)
-shaves(y,y)))
3x (barber(x)
/\ Vy(shaves(x,y)'"
""shaves(y,y)))
:J
3 (shaves(g,g)
(shaves(g,g) /\
A ""shaves(g,g))
-shaves(g,g))
line7:
(6)
linc7: reductio-ad-absurdum
rcducti~~-;~d-aI~surdu~n
(6)
...,3x
/\ Vy(shaves(x,y)
-3x (barber(x)
(barbedx) A
Vy(shaves(x,y) ... ""shaves(y,y)))
~shaves(y,y)))
Assume 111a/
w11o sllai>es
who
Ihal [here
Ihere is so111e
some barber who
shaves everyone who
doesn'l
shave himself
(and no one else). Call him
l ~ i t nGiuseppe.
Giuseppe
doesn'r sl~ave
l~iriiself(atid
NOH:
II>~O
doesti I' shave himself
hintself would
~vouldbe shoved
Noll', anyone who
doesn'l
shaved by
Giuseppe.
Giuseppz This
This would
u,ould include
irlclude Giuseppe
Giuscppr himself
I~ittnclf: 71101
7har is,
is he
11~ou1d
shol'c
i~itnsclf;if Gnd
and only if he did nol
not shave himself
hitnself;
would
shave himself
which
Ihallhe
conrradic/ion. This
This means
rneans /ha/
/he assumplionleads
assurnprion leads
~ 1 1 i cish a conlradiclion.
/o a conlradiclion.
conrradicrion. Therefore,
Tllerefore, ifir is false,
10
false. rl~ere
Ihere is no such
barber.
Figure 2 'Ibe
'lhc Barber
13arbcr Proof
Proof
lbc
I h c proofs that were
wcrc used
uscd in the logic domain were
wcrc selected
sclcctcd from a set of
uscd by Daniel
1)anicl Cbester
Chcstcr IChester
[Chcstcr 1976J
1976) in virtually the same
proofs that
thal had bccn
been used
task. The
'lhc choice
choicc was made
madc deliberately
dclibcratcly to permit
pcrinit a direct
dircct comparison of the
output of the
thc two systems on
011 the
thc same material-something that is rare in studies
languagc generation,
gcncration. Chester's
Chcstcr's version of the
thc "barber
"barbcr pr~xlf'
follows:'1
of language
pnx,r' is as fullows:
Suppuse
Ihal rhere
Ihere is ssome
Ihal /or
for eVeI)'
perSU/l rtre
Ihe barber
Suppose rhar
~ ~ barber
borbcr
r ~ c such rhar
ever). l~crsu!~
shalJes the
person iff
the person does nol
!-e/
stra~~es
rhc pcrso~r
ifjrhepcrso~r
!lor slim'!'
slra~vhimself.
hit~~sel/:
I.cr /IA denote
derrore such aa
borbrr. Now he ~hai,cs
lri~~rselfrjjhe
!lot shave himself,
hiuself: rhcrefire
barber.
shaves himself
iff he dues /lol
Iherefure a
co17rradiciio1ifjulluws.
u l l o ~ ~ Therefore
Therefire
~s
ifthere
/Ira1for
contradie/iull
if
there is sorne
sume barber such that
for every
1.
1. My
M y source for
lor Chesler's
Chnlcr's results
rcsulls is a personal
pcmonal communication
omrnunicalion with him in Novcmber
November of
or 1975;
1975: the
Ule
ellon on thc
Ihc logic
logic domain
darnam was completed
camplclcd inin Deccmber
kcrnber of ]977.
1977.
major cffort
217
person tile barber shaves the persotr 1 f f fhe person does rlor shave hitg~self
person the barber sho\'es the person iff the person does /101 sha\'e himself
rhar a co~ltradicliottfi~llowr Thus there is no barber such ~ h afor
l every
then a CO/llradiclio/l jiJllows. Thus there is I/O barher such Ihal for el'try
perTon /he bnrbcr shares rltel~erw~t
[ff the per so^? does tto! S ~ C I V himsrlj:
P
person Ihe barber shQl,('s the person iff the persu/l dol's 110/ shave himself
Chccrcr's program bclungs to thc "dircct translalion" school of ~latirral
Chester's program belongs to the "direct translation" school of natural
languagc gcncration systcrns (scc [Mann ct al. to appear]). It produced the
language gcncration systems (sec [Mann et a1. to appear)). It produced the
paragraph abovc by rccursivcly replacing thc formulas of thc proof with English
paragraph above by recursively replacing the formulas (If the proof with English
rcxt (aftcr editing it for production), cntircly on thc basis of I(~wlpropertics of the
text (after editing it for production), entirely on the basis of local properties of the
forn~ulas.'I'hc lack of culltcxtual input to tlic program's rcirli/ation decisions is
formulas. The lack of contextual input to the program's reali/.ation decisions is
rcflcctcd in its minimal trcatlncnt of subscqi~cntrcfcrcncc and thc occasional
renected in its minimal treatment of subsequent reference and Ule occasional
abruptness of transition fro111liiic to Iinc.
abruptness of transition from line to line.
At his point. I u ill usc tllc cxaniplc of thc barbcr proof to point out sornc of
t\t this point, I will use the example of the barber proof to point out some of
thc accomplishrncnts that arc c~nbodicdin Ihc current \crsiun of my gcncration
McDonald
218
Suhscqucnt rcfcrcncc
K n o ~ i n gwhcn not to ilsc a pronoun is vcry important
SUhSNjUent reference
Knowing when nol to use a pronoun is very important
in thc production of undcrstandablc tcxts. 'I'hus whilc t l ~ cbarbcr is idcntificd and
in the production of understandable texts. Thus while the barber is identified and
givcn a namc in first two scntcnccs, I-IC is not pronominali~cdin thc third and
given a name in first two sentences, he is not pronominalized in the third and
fourth bccausc those scntcnccs arc part of a ncw ncw discoursc structurc (the
fourth because those sentences arc part of a new new discourse structure (the
"sub;irgumcnt" cu~nposcdto casc Lhc transition to linc 4) whcrc thc discoursc
"subargument" composed to case the transition to line 4) where the discourse
ftxus is on the ~inivcrsillyquantificd variablc "y" rathcr than on thc barber
focus is on the universally quantified variable "y" rather than on the barber
"Giuscl~pe".Whcn thc f t ~ u shifts
s
to him in scntcncc five as a rcsult of the use o f
"Giuseppe". When the focus shifts io him in sentence five as a result of the usc of
thc intensifying rcflcxivc ("Giusepp~llimsey), hc can thcn h c pronomiiializcd in
the intensifying reflexive ("Giuseppr himsell'), he can then he pronominalized in
LIIC fuur instances in scntcnce six. (N.b. tllc nairlc "Giusq~e" was pickcd
the four instances in sentence six. (N.b. the name "Giuseppe" was picked
arbitrarily.)
arbitrarily.)
Descriptions may bc "pronominalized" as wcll as rcfcrcnccs. A t thc cnd of
Descriptions may be "pronominalized" as well as references. At the end of
scntcncc one, thc original dcscription of "y" (i.e. "everjwt~ewlho doem'f shave
sentence one, the original deSl:ription of "y" (i.e. "everyone who doesn't shave
h i t ~ ~ s eis~ 'rccapi~i~latcd
)
in the dcscription of thc complcmcnt sct as: '510one
himself') is recapillilated in the description of the complement set as: "flO one
ebe". 'I'hcn in tlic final scntcncc, thc original cuinplcx description of tllc barbcr is
ehe". '111en in the final sentence, the original complex description of the barber is
rcduccd to just thc adjcclivc "such':
reduced to just the adjective "such",
Functional lahcls
hat
'I'hc prclnisc fi~nctionsin thc pmof as "UI! assu~~~ption"
Functionallahcls
The premise functions in the proof as "all assumption" that
is to bc shown to be fdsc bccnusc it lcads to a contradiction. Sincc this role is
is to be shown to be false because it leads to a contradiction. Since this role is
known to thc audicncc (we bcgnn by saying ",lssutr~e tho/... (3, wc can use h e
k~own to the audience (we began by saying "Assume thai... .' ), we can usc the
IahcI l a ~ c r(scntcncc scvcn) as a succinct rcfcrcncc to the cntirc first linc. 'The
label later (sentence seven) as a succinct reference to the entire first line. The
logical schcma " A A - A " is 5imilarly lahclcd as "0 co~~tmdicrioi~".
Part of the
logicul schema" A 1\ -'A" is ~imilarly labeled as "0 cOlllradictiun". Part of the
concept of a labcl is the ability to include a literal rcndcring of thc labeled
concept of a label is the ability to inclUde a literal rendering of the labeled
cxprcssion as an appositive (final scntcncc). In thc logic domain's dictionary.
expression as an appositive (final sel1lence). In the logic domain's dictionary,
ap1:ositivcs arc triggcrcd if thc last literal rendering was not in the samc paragraph
appositives arc triggered if the last literal rendering was not in the same paragraph
or, as in this casc, if thc line is the conclusion of an argument.
or, as in this case, if the line is the conclusion of an argument.
Contcxt scnsitivc rcalizatiuns
Part of the linguistic conrcxt that is produced to
Context sensitive realizations
Part of the linguistic context that is produced to
guidc later dccisions is a rhetorical dcscription of thc discoursc structure. The
guide later decisions is a rhetorical description of the discourse structure. The
diffcrcnt terms of dlis structure will guidc dccisions at syntactic and morphological
different terms of this structure will guide decisions at syntactic and morphological
Icbcls: in sentcnccs one and three a contraction is uscd ("duc.m'l shavc13while the
levels: in sentences one and three a contraction is used ("duesl!'t shave') while the
same logical structurc in (he firnial contcxt created by thc conclwsiar~scntcnce of
same logical structure in the fUl11wl context created by the cunclusion sentence of
thc subargumcnt is not contracted Uorttlal bcing an cxpcrimcntal rhetorical
the subargument is not contracted (formal being an experimental rhetorical
fcaturc in thc grammar). Similarly thc conncctivc
is spclled out in a formal
feature in the grammar). Similarly U1C connective ++ is spelled out in a fannal
contcxt (sentcncc fivc), but in an unmnrkcd, inf(rrna1 contcxt il is understood as a
context (sentence five), but in an unmarked, informal context, it is understood as a
restriction on a variable and cxprcsscd as a rclativc clausc. In another case, the
restriction on a variable and expressed as a relative clause. In another case, the
samc quantificd variable ("y") is rcaliscd in thc unrnarkcd contcxt of scntencc one
same quantified variable ("y") is realized in the unmarked context of Sentence one
as "everyone", but whcn markcd in scntcncc thrcc as idcntifying a sct it is rcalized
as "everyone", but when marked in sentence three as identifying a set it is realized
as "anyone".
as "anyone",
Part of the discoursc context is the distance bctwccn phrascs. When a
Part of the discourse context is the distance between phrases. When a
contradiction is dcduccd from the immediately previous linc, as in lirie fivc, the
contradiction is deduced from the immediately previous line. as in line five. the
219
identification of
possible by
o f that deduction is given
giwn in
in the most
no st direct
dircct way possible
adjoining aa relative
rclativc clause
clausc to the
thc last
l;~stsentence;
scntcncc: when
whcn the
thc dependency
dcpclidcncy line is
is much
much
adjoining
carlicr (as in
i n line
linc six).
six). the
thc formula
fi~nnulafrom
frum the
thc line
linc is
is repeated
rcpcatcd and
i ~ n dthe
thc phrase "leads
earlier
(0"
lo" is used.
uscd.
aroid ~lmhiguity
lnll~igu~ty In
j\lte~nptsto amid
Attrmpts
" as a
inrcrprctation of
o f "....
"cr"
I n sentence
scntcncc one.
onc. the
thc imerpretation
rcs~rictionon
011 a
3 variable's
variilblc's range
rilngc must
inust include
includc some
samc phrase
phrasc to indicate
indicatc that the
restriction
entire
been specified
pan of
rangc has
has bccn
spccificd and
and not just aa pan
o f it.
it. Consequently
Conscqocntly the
thc "iff"iffcntirc range
entry"
alld
(comp!('n!rnt or
(An
cntry" is
i s designed
dcsigncd to S<ly
wy "(r('~1rkli{ln>
"(rr*~rir~ion>
a~rd<cotr~nlcn~cn~
or frs1ridiod)".
rr>~rir~ion>".
(An equivalent
cquivalcnt
technique
bccn to replace
rcplacc the
thc word "evCf"yone"
"rec~'orre"with
wilh "all
''all and only
ortly
tccliniqoc would ha\'e
h a ~ cbeen
(hose
who...
the
I ~ O S P me/l
IJICI
u'110r... ".) Because
IIcca~~ic
thc presentation
prcscntiltion of
o f this
tliis combined
colnbincd restriction
rcstriction should
be
donc carefully,
c;lrcfully. a
;I spccialmonitllring
spcci;~linnnik~ringmutine
routine is
is activated
activ;~tcdin
in <In
an attempt
attclnpt to avoid
bc done
intnlducing scope
scopc ambiguities
itmhiguitics in
in the
dic cDnjunction.
co~~junction.
thc basisof
thc point where
introducing
On the
basis of the
the
tlic conjunction
conjunction is
is JUJchcd
auachcd (i.e
(i.c... as the
thc direct
dircct object
objcct of
o f "some
"xorrle barber who shaves
X").
tlic projected
projccrcd contents
contcnts uf
i ~the
thr
f second
sccond arm
arm of
o f the
thc conjunction (3
(a noun
noun phrase).
X'). the
and
arm has ended
cndcd with a direct
dircct object,
objcct. thc
dic monitor decides
decides
and the
thc filct
litct that
tlii~tthe
thc first ann
that
possible tbat
be misinterpreted
conjoining with the
tli;~tit is
is possiblc
that tl,e
the second
sccond ann
ann will hc
misi~itcrprctcdas
ashlnjoining
imnicdiarc, lower
loucr direct
dircct object
objcct rather
ralhcr than with the
thc intended
intcnded one. It
I t causes
causes
morc immediate,
bc added
addcd around the
Uic second
sccond arm as one
u ~ i cway available
avail;~blcto it in
in
thc pJ:enLheses
parcnthcscs to be
the
tl1is
h i s case
caic to try and
and forestall
forcaall misinterpretation.
misintcrprctation.
4.2.3. !'I.ANNER-st)'le Assertions
'llic source
sourcc data structure
stmctl~rcin
in this domain was a KI:OSE
nctwork reimplemented
rcim]~lcmcntedas
'Ille
KL'O~E network
a set
sct of
o f binary relations
rclationt expressed
cxprcsscd as PLAI'\:'iER-style
Pl.kNYI:R-style assertions. A
A KLONE
KITONE net
consists
consists of
o f a set of
o f named
ni~mcdobjects
ol~jccoof
o f various
vilrious types
typcs (only
(only "concept" and
and "role"
"role" will
be shown
bere). linked together
he
shown hcrc),
togcthcr by the
thc relations:
relations: "subconcept".
"subconcept", "has-role",
"has-role",
"valuc-rcstricdon", and
and others
othcrs not shown. This
I h i s domain took the
thc network as input
"value-restriction".
an English
E~iglishdescription of
o f its literal
litcral contents;
contcnts; that is.
is, rather than
and produccd
and
produced an
intcrprct the
thc net
nct as a representation
rcprcscntation of
ofcertain
facts (c.g,
(e.g. "every
"evcrj phrase
head
interpret
certain facts
phrase has a head,
rr~odificr, and
arid an
arr interpretation...
ir~rerprerorior~
... ").
'), it is
is interpreted
intcrprctcd at its
irs literal
litcral level
level as a
a modifier,
o f KI
l(l..ONI;
objccts (e.g.
(c.g. "The roncrl~r
iflrhe lIel,
!re/, ilir has a
collection of
..O:"E objects
foncepl r,hrare
"hrase is the
Ihe /up
Iup oflhe
head
Ihal... ').
head role
rule fhar...').
Figure
paragraphs of
constructed for one
Figure 3 shows
sliows the
thc first paragraphs
o f the
thc text'
tcxt'conitruc~cd
onc of
o f the
dcvclop~ncntnetworks
nctworks in
in usc
llolt Beranek
llcranck and
and Ncw~nan
("IlI1N") during
Newman Inc. ("BIlN")
development
use at Bolt
thc spring of
o f 1979. (It represents
rcprcscnts aa first pass
an English
English
pass at aa conceptualization of an
tl,e
grammar.) The
I h c text
tcxt was created
crcatcd by scanning
scanning tbe
the net
nct depth-first
dcpth-first following its
grammaL)
dcvoting one
onc paragraph
cach concept.
conccpt. Each
12~cliparagraph
links,
ks. devoting
paragraph to each
paragrapb
"subconcept" lin
mentions
below) three
mcntions (or assumes-sec
assumcs-scc bclow)
thrcc facts
facts about about its concept:
conccpt: (1)
(1) the
name
namc of
o f the
thc concept(s)
conccpt(s) it
i t is
is aa sobconcept
subconcept of, (2)
( 2 ) the names
namcs of
o f its "roles"
"roles" and
and the
McDonald
220
~--~Vall/c'
Hcstriction
Valoc-llcslriclion
D:;;=-+-J-C:OI~CEPT:::>
--
1'/IJ.{fse
lis irller?,
inlem role rt~us!
I11USI he
be a
I'lilrisc is Ihe
/he lOP
lop of
of Ihe
/ / I ( ! IIfl.
11e1. 1:s
r o t w , and
niid ils a~dflu
role alid
~ l i iils
d/ sl!!:!Id
! ~ ! ~role
f ! nlris/
'2.!/{'{'/'I,
l110rJiflcr role
rolc
l11usl hc
be s*.rhrases.
S I I ~ L . ~ > I I C C arc
arrpj;,
~ / S PI!. !lJ!, mljUIICI..iJWWdIIISf.
I11s
Is subt'OlIccpls
a ! j ~.O~l&ck!w.
.
~and
n 1\'d(11'~11.o d .
~ c j f i0
..i/lli'.1l.
,
il!/c1'~.
110sIhe
/he IOles:
roles: L~Q.bJ..
1'/2
ilUfllJ. atidcoj&.
and l!1!.Q./U. &&/nus!
dj musl
-I'fi lias
bc a D./2,
~ 2 pm!
~, ma(jI 1!Lf,'!,
~3 ll!.lJ:.!J!
h u an dtai;!~.
be
l'f1ll.!Q!l. atr(l&
alll! ilJ1Q1J.i. ira E.
I!./!.. &k
Eo. is ~bet l0~liceI'I
c o ~.Ii care
arc
c ~ ~&
/s
(
~
~
~
L I V~
kciua
~ I I~
L i!J!, ,arid
III
oljl(;[$lWl!./!..
iJm~Ll'kl.I'/!'
1ili:.a
!(III
a/l d
nl:o~~~sriI~icc/~!n..
al10ul
.III bicc I{) p.
-O ~ I ! Ihas
! a m:l2i
&~i mle
,ole II'hich
which n111s1
I11USI he
be, a IIUI>~O~JB
humanp. a11dc7
and a
-O/j'crSOIl!'!l
~JLJ
mle
~v/~i,ic/t
IIINSI
br
O
I
I
~
1lHf2 role which 11111.11 be Oll g[.
I I J . Y I ~ ~ J IIobj
1~01v
~ ( ~ role
C I ~musl
111113/
' S be a 21.z~
p['I'clJrole aall
11
!lIslIbh'ill1!.s
3@IT.1llJ!, i1.7
ils &
ii!,
n
~
i
d
ils
role
a
&jsl.
in. (l11r1 il1l!f1. role sub}rr.1
-
...
ljl'urthcr pa(at!r~~rhs
pnrs;r:phs for
Tor the
...!lfurlh,or
rcsi
olpp3 rubmt~replsjj
n.:~1 ofpp's
subrnuccj)ts]]
&j
ciiio/I~er;ub"O/l[('pl
sub~.o~~cc.pt
(.f&r&~ ...
!J.I'
. isi.s anolher
ofi!lJJ.illf...
...
l[l;~nhvrpJr3gmrh~
p~ia;mphs f(lf
ior tlll'
tlic rest
rcsl llf
<,Ipli:3ar.'s
... [[fmtlwr
r,llr:Jsc's i~~hconccpls
slIbconcepts and
aud lhc
the ~~bronccpts
subconcCI)ls of
or
C;ldl
o( those in
wmll
1:igorc 3
3
Figure
if ;lny_ Cille
IKI1DCS
i:J.
221
already
the information that, e.g.,
alrcady known to the
thc audience,
audicncc, and as a consequence,
conscqucncc. thc
"phrase
is
a
COI/CCPI"
is
omitted
as
already
given.)
'>phrase
cu~rccp/"is
alrcady givcn.)
Var)
The
l'hc few
fcw paragraphs shown in the figure are
Var)ing
ing the
the paragraph
paragrap11structure
sufficient
the dictionary for this domain
suflicicnt to illustrate
illusuatc tlle
thc stylistic
stylistic heuristics that fhc
incorporates.
speaker as such:
such; its
incorporates. (Like
(I.ikc the
fhc logic
logic domain. this domain had no spcakcr
messages
directly
nets
coherent subnctworks.)
sUbnetworks.) In
mcssagcs were
wcrc comprised
cl~~npriscd
dircctly lif
i ~ KI,()I\'I:
KI;ONI;
f
ncls or cohcrcnt
each
the concept's rolcs
roles and thcir
their
tach of the
fhc first
first three
thrcc paragraph'i.
paragraphs, the
thc presentation
prcscntdtian of Uic
value-restrictiuns
done hy varying thc
the rhctorical
rhetorical
valuc-rcstricticlnsis
is given
gi\cn in a different
diffcrc~ltstyle.
stylc. It is donc
p:.lttcrn
roles thc
the conccpt
concept has. In fhc
the
tlic description
dcscription according
according to the
thc number of r(llcs
patlcrn of tile
first
the stylc
style choscn
chosen puls
puts cach
each role in aa
first paragraph,
paragraph. "phrilsc"
"phrasc" has three
thrcc roles
mlcs and thc
separate
The
second paragraph's
scp;lratc sentence:
scntcncc: "(role)
"<ro$> must
IIIUSI be (\alu('"rrslrirlion)".
< ~ ~ ~ ~ C . I C S I ~ ~ C I ~ O I'Ihc
I > " . sccond
concept
the use of
of a suni~narizing
summarizing scntcncc
scntence to
conccpt has
h;a more
morc than I..hrcc
Uircc roles,
rolcs, le<Jding
lcading to rhc
identify
their valuc-restrictions.
value-restrictions. 'fhc
The third
identify them
thcm as
as its
its roles
rolcs before giving fhcir
paragraph,
wilh
un thc
the "has-role" rclation.
relation,
p;~r;~graph.
will] only 1wo
two roles, uses scntences
scntcnccs based on
with
relative clause.
value-restriction embcdded
c~nbcddcdas a rclative
with each
each valuo-restriction
O~~~itting"given"i~~lor~~iation
Omitting
"ginn" information
Notc that thc
the sccond,
second, ihird.
"third, and fourth
p;lr;~graphsdo
do not start
sfart with a sentencc
scntcncc about what tlicir
paragraphs
their conccpt
concept is a subconcept
of.
text alrcady
already (in fhc
the last
of. This
'l'iiis isis bocause
bccausc that infonnation
information appears
appcars in the tcxt
sentence
dictionary cntry
entry that would make
prc\,ious paragraph)
pc~rapraph)and the dictionary
scntcncc of each previous
~ h cdecision
dccisim ttl
tu include
includc thilt
that information
infclrrnation decides
dccidcs that
thc
thilt it will bc
be still rcrnc~nbercd
remembered
and thus
thus would be redundant
rcdundant if included. Similarly in the sccond
and
second paragraph
whcrc there
thcrc isis a summary
summary sentence
scntcncc listing roles
rolcs of fhc
where
the conccpt
concept fie's,
llQ.'S, thc
the "has.roleW
"hasrole"
facts have
havc been
bccn left
lcft off
offof
tlic later
Iatcr sentences
sentcnccs since to leavc
facts
of the
leave lhcm
them in would have led
to an
an unacceptably redundant
rcdundant text.
to
hrying descriptions
descriptions with
aith context
'nie noun phrases constructcd
Var}"ing
'n1e
constructed to describe
along the
thc same
same lines
lincs as paragraphs, i.c.
rolcs vary
vary along
roles
i.e. fhcy
they includc
include facts or leave them
dcpcnding on what facts
facts have
havc already
alrci~dyappeared
appearcd in thcir
out depending
their paragraph and what
rcniain
to
bc
givcn.
'Thus
we
go
from
using
just
a
namc
remain to be given. Thus
just name to introduce aa role
(paragraph three)
thrcc) to giving
giving the
thc concept
conccpt that owns iit,l its name, and thc
(paragraph
the fact that it is
"rule" (in paragraph four).
four).
a "role"
'lhroughout the
thc example
cxample tcxt, grarnniatically-driven
Usingellipsis
Using
ellipsis
'Ibroughout
grammatically-driven cllipsis
ellipsis is
applicd to
to reduce
rcducc redundant
rcdundant verbs (paragraph two). and to mcrgc
applied
merge rclations
relations with
common arguments
arguments (paragraph one).
onc). These
'lhesc are
arc gcncral
common
general purpose transformations.
transfonnations,
syntactic and lexical propenies
properties of
triggcrcd by the syntactic
triggered
of the teats,
texts, indcpcndcntly
independently of
of the
contcnt of the reiations
relations involved.
content
McDunald
Mcl
) onald
222
222
4.3.
Computalional1\1odcl
4.3. A Comput
.ttionalModel
The
ability to
to speak
is as
natural to
to us
us as
as the
the ability
to see
'I'he abi1ity
speakis
as natural
abi1ity to
seeor
or to
to usc
use our
our hands
hands
to grasp
arc fast
fast,, we
grasp objects.
objects. We
We are
we arc
are accurate,
accurate, and
and we
we are
are unaware
unaware of
of the
the
how we
we do
it. 1 As
I\s easy
easy as
us to
mechanics of
mechanics
of how
do it.1
as it
it is
is for
for us
to speak,
speak, we
we k.now
know from
from
linguistic
ethnomethodological analysis
that the
linguistic and
and etIlnomethodo1ogica1
ana1ysisthat
the process
processis
is complex.
comp1ex. Even
Even if
if
we 1cave
leave aside
the question
of how
how we
we
asidetI1C
question of
we arrive
arrive at
at the
tI1CthoughLs
thoughts behind
behind our
our words
wordsand
and
look
just at
the "linguistic"
part of
1()ok just
at tI1C
" 1inguislic" part
of the
the process-selecting
pr(JCess
- se1ectingwords
wordsand
and constnlctions,
c()nstructions,
applying
grammatic;ll
rules.
and
producing
the
words
in
sequence-it
is clear
applying grammatic.a1rules, and producing tIle words in scquencc- it is
clear that
that
very
sophisticated
IlI1cs
me
being
followed.
Somehow
we
select
very sophisticated ru1cs are being fol1owed. Somehow we select one
one
lexical/syntactic
the many
many possible
possible alternatives,
1exical
/ syntactic comhination
comhinati()n from
from tllC
alternatives, managing
managing to
to
attend
simultaneously
to
the
potentials
of
the
different
constructions.
our
multiple
attend simu1tancous1y
to tI1Cpotentials ()f tI1Cdifferent constructions, our multiple
goals.
the cOllstmints
arbitrarily imposed
imposed by
goa1s
. and
and the
constraints arbitrari1y
by our
our grammar.
grammar. We
We follow
follow
conventions
utility only t()
to our audiences
conventionsof
of direct uti1ity
audiencesand
and actively
activc1ymainlllin
maintain elaborate
e1aborate
coherency
relations across
large stretches
cohcrcncy rclations
acrosslargc
strctchcsof
of discourse.
discourse.
Our
ability
ll1
do
all
this
with
such
facility
Our ability to do all tIlis with such facility needs
needsto
to he
bc explained.
explained. For
For this,
this, aa
sllltic
the rules
rules being
being f()llowcd
followed will not he
he sufficient:
static description
description of
of tI1C
sufficient: we
we must
must
explain what
what itit is
is about
way these
these rules
rules are
manipulated that
that
explain
ahout the
tI1Cway
arc reprcseltled
representedand
and manipulated
insures
that
Ule
process
of
language
productio))
is
tractable
and
gives
the
process
insurcs tI1atthe pr()Ccssof languageprodllction is tractlble and givcs tI1Cprocess
the charactcr
char<lcter that
that itit has.
has. In
In short,
we must
must develop
tile
short, we
develop aa compul<,tional
compuUttional model:
model: aa
simulacrum
processing steps
viewed
simulacrum whose
whosc processing
steps and
and representational
reprcscntational devices
devices when
when viewed
from tI1C
the intended
intended lcvcl
level of
of abstraction
those
from
abstraction we
we take
take to
to be
be isomorphic
isomorphic with
with those
operating
in
the
human
mind.
operating in tI1ehuman mind.
It is
is important
important to
to appreciate
that by
by "computational
we do
not simply
It
appreciatetI1at
"computational model",
model" , we
do not
simply
mean aa program
program whose
whose input/
input/output
bchavior matches
mean
output behavior
matches ~at
~ at of
of people
people (though
(though
that in
in itself
itself would
would be
be a
a considerable
considerable accomplishment
accomplishment).) . The
tI1at
The internal
internal structure
structure of
of
the
program-the
reasons
Il"lIl'
its
input/output
bchavior
is
what
it
is-is
critical
tI1Cprogram- the reasons" 'h}' its input/ output behavior is what it is- is critical to
to
its value
valuc as
as a
a model.
model. This
to the
for "strong"
its
'Ibis is
is comparable
comparable to
the requirement
reqllirement for
"strong" rather
rather
than "weak"
equivalence in
in constructing
constructing aa grammar
than
" wcak" equivalence
grammar for
for aa natural
natural language.
language. If
If the
the
is
to
be
truly
successful
(and
ultimately
to
be
a
source
of
testable
model
model is to be truly successful (and ultimately to be a source of testable
predictions). its
its behavior
behavior must
must follow
follow inescapably
rather than
predictions),
inescapablyfrom
from its
its structure
structure rather
than
from stipulated
rules:: itit will
will stand
as
of
because
any
from
stipulated
rules
stand
as an
an explanalion
explanation
of tile
t11e behavior
behavior
because
any
with comparable
C<lmp<Jrablc slruclUre
would be
be incapable
of
otherwise.
dcvice with
device
structure
would
incapable
of behaving
behaving
otherwise
. If
If
I. The
Thc nomlal
nOl111al speaking.
ralc for
ror I:nglish
I'nlish isis approximately
s}lIahles per
per second
words per
1.
...-peakingrale
approximalelyrour
four syllables
secondor
or 160
] 60 words
per
Book or
Records sreed
ror reading
readinll I:nglish
English bis400
words per
rer minute
minute.)
minute.. (I
tThe
Guincss-'i llook
minule
.heGuincs
of Rccords
speedI'Ccnrd
recordfor
400words
.) A
A
by IAibov
l.abov [I(I "bov
shown
study
studyby
Aibov196o]
1966
] ha~
has~
hownthai
that75%
75'foor
of everyday
everydayspcech
speechisis grammmical
grammaticalby
by any
anycrilerion.
criterion.
II' general
g.cneral rules
rules for
figure rises
If
for ellipsis
ellipsisand
and sclr-editing
self-editing are
are added.
added. Ihis
this figure
ri~es 10
to 90%
90% for
for non-academic
non-academic
elrerience.. IntrosrCClive
srcakcrs talking
talking about
about everyday
speakers
everydayexperience
Intro~-peclivereports
reportsor
of production
productionarrear
appear\0
to go
go no
no
deepcr than
than mentally
mentally "hearinj('
We appear
have no
deeper
"hearing" full
full rhrascs
phrases(or
(or alternative
alternativewords)
words) al
at aa lime.
time. We
appearto
lo have
no
conscious access
10
any of
or the
the actual
aClual a'i.>;cmbly
processes
such
the sequencing
of
conscious
access
lo any
a'iSCmbly
processes
suchas
asthe
sequencing
of the
the words
wordsor
or their
their
morphological specialization.
morphological
specialization
.
223
one can thcn indcpcndcnlly show that thc human language fiiclilty is structured in
one can then independently show that the human language facLilty is structured in
thc sarnc way as thc modcl (pcrhaps by comparing thc kinds of errors that Lhc two
the same way as the model (perhaps by comparing the kinds of errors that the two
systcrns rnakc), thcn onc will havc explained why pcoplc havc thc modcled
systems make), then olle will have explained why people have the modeled
bchavior. Conscqucntly, if our lnodcl is to bc compclling, we must limit its
hehavior. Consequently, if our model is to be compelling, we must limit its
computational powcr very carefully. A cornpu~ationalrnodc! that pcrmittcd the
computational power very carefully. to. compulJtional model that permitted the
usc of arbitrary proccdurcs (c.g. a'l'ul-ing rnachinc) would not bc intcrcsting as the
use of arbitrary procedures (e.g. a Turing machine) would not be interesting as the
basis of a tllcory bccausc all that it woi~ld cxplain would be that languagc
hasis of a theory hecause all that it would explain would be that language
production was ~ [ l t n p ~ ~ t a bsomctliing
lc:
wc alrcady bclicvc. Wc must look instcad
production was computable: something we already believe. We must look instead
for the wcakcst modcl Illat can d o tlic work: n modcl from whosc computational
for the weakest model that can do the work: a model from whose computational
propcrtics thc charactcrisrics of ht~manIangtiagc production wolrld inexorably
properties the characteristics of human language production would inexorably
foltow. I3y doing his, by rcslricting thc kinds of bchavior that our 1nodcI is
follow, By doing this, by restricting the kinds of behavior that our model is
capablc of, wc can cxtract non-trivial predictions from it and rnakc it subject to
capable of, we can extract non-trivial predictions from it and make it suhject to
empirical tests.'
em pirical tests. 1
4.3.1.
Charart~riling
the I)roblem
c
solving uhcn wc talk? "'l'alking" is
What conipurationi~lproblcrn is ~ h mind
What compU'lational problem is the mind solving when we t..lk? "Talking" is
of course a loose tcnn i~scdlo covcr miiny kinds of actiritics, cacll likcly to have its
of course a loose tenn used to cover many kinds of activities. each likely to have its
own rcquircn~cnlsin prtccssing timc. ncccssary mcmory, possibilities for editing.
own requirements in processing time. necessary memory, possibilities for editing,
or coi~scious involvcn~cnt. Wc know intuitively that thcrc is an cnormous
or co'nscious involl'ement. We know intuitively that there is an enormous
differcncc in bchavior bctwccn, say. writing a carcful cssay and holding a f u n y
difference in behavior between, say. writing a careful essay and holding a fuzzy
conversation over brcakfslst; so much so h a t thcrc is little rcason to bclicvc a
conversation over breakfast; so much so that there is little reason to believe a
privri that thcy posc idcritical problcms to thc human spcaker. In thc prcscnt
priori that they pose identical problems to the human speaker. In the present
rcscarch, I havc ficuscd on itj~nlcdia~c
speech, spoken (or writtcn) without
research, I have focused on immediate speech, spoken (or written) without
rchcarsal and with only a rough conscious knowlcdgc of what will bc said next.
rehearsal and with only a rough conscious knowledge of what will be said next.
'Il~crcis introspcctivc cvidcncc to suggcst that this modc of spccch is primary
cnlerc is introspective evidence to suggest that this mode of speech is primary
sincc cvcn in dclibcratc writing whcrc thcrc is amplc opportunity for editing and
since even in deliberate writing where there is ample opportunity for editing and
planning, it is thc common cxpcricncc that phrases and cvcn multiplc scnlcnces
planning, it is the common experience that phrases and e\'cn mul~iple sentences
"spring to mind" as immcdiatc pcrccpts without any conscious cffort having been
"spring to mind" as immediate percepts without any conscious effort having been
madc to fonn thcm from thcir constitucnt parts.
made to fonn thcm from their constituent parts.
Givcn this restriction on tllc modc of spccch to bc considcrcd, I take thc core
Given this restriction on the mode of speech to be considered, I take the core
of the "problem" for thc mind to bc thc re-cxprcssion of a dcli~nitcd,dclibcratcly
of thc "problem" for the mind to be the re-expression of a delimited, deliberately
sclcctcd "packcr" of information (including rcfcrcnccs, propositions, dcscriptions,
selected "packet" of information (induding references, proposilions, descriptions,
and probably spccitic rhclorical instructions) from its original form in the mind's
and probably speci fie rhetorical instructions) from its original form in the mind's
1. Ncithcr of which 1 inlcnd to do in this paper. As will bc clear. Lhc modcl alrcady mccls a number
1. Neither
of which Ipsychological
intend to do crileria
in this paper.
will be clear,
!he model
mcets
number
of "obv~ous"
such a\As
scqucnlial
produclion
and alrcady
indclibilily
(scca also
[Ucbnald
of "ohvious"
psychological
criteria such
a, sequential
production
and
indelihility
also (McDonald
19801).
11s lirsl non-obvious
applicalion
is cxpcc~cd
to bc in
a thcory
of Lhc(sec
mcchanisms
bchind certain
19HO]l. lis
lirst non-obvious
applicalion
is expected
to he cxchanp
in a theorycrrors
of theand
mechanisms
bchind
rertain19801 for
naturalty
occurring "spcch
crrors",
paflicularlp
blends. (Sw
[Garrc~t
naturallyanoccurring
errors",
parlicularly exchange
errors
and blends.
(Garrell 19&Oj for
ex~cnsive"speech
descrip~ion
01speech-errors.)
Ihis work,
however,
is still inIScc
progress
an extensive description of speech-errors.) lbis work, however, is still in progress.
McDonald
224
internal rcprcscntation
representation into a constrained.
intcrnal
constraincd. fixed-format
fixed-format language (e.g.
(c.g. English)
accurding
mapping.
according to a fixed.
fixcd, context-free,
contcxt-frcc, cunventional
convcnti~rnalmapping.
4.3.2, Lanpuapc
Language Generation
nedsionmaking
4.3.2.
Ccncr~tionas 1)ccision-mnking
What arc to bc
Uic primitive operations
operations of tlic
modcl-at what "grain size"
be the
the model-at
the gcncration
generation proccss?
process? Following
Followiog thc
the lead
will it characterile
charactcrilc thc
lcad of systemic
systemic
grammarians
lialliday [Halliday
19661 and Winograd [Winograd
19721, 1I
grammarians such as Halliday
IHalliday 1966)
IWinograd 1972].
vicw
uutpt~tofdle
of Ulc pr(rcss-the
view thc
the output
process-the natural languagc
language tcxt-;IS
text-as thc
the rcsult
result of
nfaa series
of decisions,
dccisions, the
thc set
sct of
~ r fpossible consistent decisions
dccisions hcing
dctcrmincd by the
being determined
langui~gc's
grammar. The
'lhc most rclcvant
aspccts of a generator
gcncrator will thcn
langUilgc's grammar.
relevant aspects
then be how it
gocs
goes about making those dccisions.
decisions. which is what thc
the thcory
theory is to dctcnninc. In
p;~rticular:
(I) wli;~t
dccisions arc to be
bc made,
madc. what prompts them
thcm and
particular: (1)
what kinds of decisions
what is the
thc nature of Uicir
(rt~tput:(2)
( 2 ) what
dccisions
tllCir output;
whal kinds of ink~rmation
information thc
the decisions
informatil~nwill vary in its accessibility
acccssihility and fonn
form according
according
rcquirc
require and how that information
process; (3)
~ hat dependencies
there arc bctwccn
between decisions
Uic slate
statc of the proccss:
(3) uliat
dcpcndcncics thcrc
decisions and
to d,e
they influence
innucncc the overall
made
ovcrall control stmcture
structure (c.g.
(c.g. arc decisions
dccisions madc
how thcy
n~ndctcnninistically
ordcrcd?): (4)
(1) to what extent
cxtcnt the
nondetenninistieally or arc tlicy
they ncccssarily
necessarily ordered?);
rcsulw
dccisions and the foreknowledge
firrcknowlcdge of planncd
dccisions are a
resullS of prcvious
previous decisions
planned decisions
represented
part of d,e
UIC generator's
gcncrator's state,
state, i.e.
i.c. is this kiod
kind of iofOlmation
infolrnation explicitly
cxplicitly rcprcscntcd
and accessible
currcnt decision-makers?
decision-makcn?
acccssiblc to currenl
4.3.3.
4.3.3. Rcstrictions
Restrictions on tl~c
the hlodcl
Model
Given
problem of translating
packet of expressinns/instructions
translating a packct
cxprcssions/instructions into a
Givcn d,e
tl~cproblcm
highly constrained,
arnstraincd, fixed
fixcd format language
language given a context-free
contcxt-free mapping (i.e.
(it. the
arc many known ways we could usc
use to solve it:
translation dictionary),
dictionary), there <Irc
uptimizcrs that worked
workcd on whole
Approaches liavc
have ranged from nondctcrministic
nondeterministic optimizers
paragraph-siled tcxts
texts at once
paragraph-sizcd
oncc (Moore
[Moorc 1981).
19811. to programs that have attempted to
model strcamof-consciousncss
were liahlc
liahle to intcrn~pt
intcrnlpt thcmsclvcs
themselves with new
modcl
strcam-of-c(rn~-iousncssand wcrc
phrase [Clippinger
cvery phrasc
[Clippingcr 1978).
19781. All approaches
approachcs have in
plans or constraints at every
commoo
the
notion
that
the
input
packet
or
"message"
will
be
thc
packct "mcssagc"
bc decomposed
dccomposcd into
common thc
looked up in thc
the dictionary
its component
componcnt clements;
clcmcnts: that the clements
clctncnts will be
bc lookcd
dictionary and
contcxt-scnsitivc decision
dccision made
madc as to how they
Uicy can bc
rcali~cdin the
UIC mget
a context-sensitive
be realized
target
language; and that these
the constraints of thc
the grammar
languagc;
thcsc realizations.
rcalintions. subject
subjcct to thc
message. arc
are picccd
pieced togcthcr
together into Uic
dlC uttcrancc.
utterance. The
and the overall goals
goals of the mcssagc,
nearly every
utterance to
approaches
approachcs ddiffer
i f i r in ncarly
cvcry other
othcr aspect,
aspect, e.g.
c.g. how large
largc an uttcrancc
eoostruct
once. how to control
the process and ordcr
order thc
the decisions,
dccisions, or how to
construct at oncc.
control thc
represent the grammar and implement
rcprcscnt
implcmcnt its constraints.
constraints.
In the interests
candidates. and becausc
because I bclicve
believe that
interests of narrowing the field
ficld of candidatcs,
225
thc resulting modcl is both Inarc perspicuous to the cnginccr and more interesting
the resulting model is both more perspicuous to the engineer and more interesting
to the psychologist, thc following additional computational li~nitationshave bcen
to the psychologist, the following additional computational limitations have been
stipulalcd in my thcory.
stipulated in my theory.
On-lint Operation
Thc input message is vicwcd by thc linguistic component
On'line Operation
The input mes.<;age is viewed by the linguistic component
as a strcnln of clcmcnts, thc spccific ordcr and chunking of the "mcssagc
as a stream of clements. the specific order and chunldng of the "message
clcrncns" bcing dictalcd by thc dictionary cntries dcsigncd for that particular
clements" being dictated by the dictionary entries designed for that particular
domain. 'I'hc co~nponcntinay bc conceptually (though not literally) dccomposcd
domain. The component may be conceptually (though not literally) decomposed
cers
tngcthcr. thc first taking thc nrxt clcmcnt of the
into /H-O t r a ~ ~ ~ d t ~ cascadcd
into IH'O transducers cascaded together. the first taking the n-ext clement of the
mcssagc strcam and converting it into a surfiicc structurc phrasc atcachcd to the
message stream and converting it into a surface structure phrase attached to the
trcc at thc point whcrc thc mcssagc clcmcnt was, and thc sccond thcn trakersing
tree at the point where the message element was. and the second then tra\ersing
that phrase and producing thc text from it (scc sccrion 4.1 bclow). '[he two
that phrase and producing the text from it (see section 4.1 below), The two
transducers arc constr;iincd to opcratc "on-linc", that is, thc output from thc first
transducers arc constrained to operate "on-line". that is, the Olllput from the first
transduccr must bc co~npletelyconsumcd by thc sccc~ndbcforc tllc first transduccr
transducer must be completely consumed by the second before the first transducer
movcs on thc ncxt mcssagc clcmcnt at the samc levcl.
moves on the next message clement at the same level.
lntlelihility
'lhc decisions of the first, "rcalizing" transduccr arc exhaustively
Indelibility
'Ihe decisions of the first, "realizing" transducer are exhaustively
rcprcsc~ltedin t l ~ csurfi~ccstnlcturc phrascs that it products. 'l'hc ;stions of the
represented in the surface structure phrases that it produces. 'lhe actions of the
second, "trcc-walking" transduccr arc thcn co~nplctclydictatcd by the structurc
second, "tree-walking" transducer are then completely dictated by the structure
and arinotation of thosc phrascs. Slrt$lr.e lrnrcrure is indelible. i.e. once a phrase
and al)notation of those phrases. .",'lIIface slruclure is indelible. Le. once a phrase
has bccn constnlctcd i~ndincorporated into thc ongoing surfacc structure trcc, it
has been constructed Olnd incorporated into the ongoing surface structure tree, it
can not bc rcmovcd o r cditcd (though it may bc augmcntcd). As a conscqilcnce of
can not be removed or edited (though it may be augmented), As a consequence of
indelibility and thc fiict that thc surfacc structilrc is organized as a strict trce
indelibility and the fact that the surface structure is organized as a strict tree
without loops. thc p r o m s will not bdckup-it is iinpossihlc to rctravcrsc carlier
without loops. the process will not bdtkup-it is impossihle to retraverse earlier
scctions of thc trcc oncc Ihc rcalizing transduccr has past through thcm. (Ihis
sections of the tree once the realizing transducer has past through them. (lbis
samc stipulation has bccn applied to thc recognition of phrascs by a parser
same stipulation has been applied to the recognition of phrases by a parser
[Marcus 19801 with intriguing results,)
[Marcus 1980] with intriguing results.)
McDonald
226
Rcal-time
?hc ovcriill pl-t~cssmust pcrforn~its computations in qrtusi-real
Real-time
The ovef;lll process must perform its computations in quasi-real
titue: that is. thc numbcr of opcrntions that takc placc bctwccn thc consumption of
ti/llC': that is, the number of operations that take place between the consumption of
any onc mcsslgc clc~ncnrin thc stream and thc ncxt 01-bc~wccnLhc output of two
anyone message elemelll in the stream and the next or between the Olltput of two
succcssivc words must bc no grciitcr than sornc fixcd maxi~nuttiunrclarcd to the
successive words must be no greater than some fixcd maximum unrelated to the
si7.c of thc input or output strcams. 'Ihis is a slrongcr ti~ncbound that rhc usual
size of the input or Olltput streams. This is a stronger time bound that the usual
onc of lincar timc. ;~ndrcflccrs thc intuition tliat thc prtxcss always procccds at a
one of linear time. and reflects the intuition that the process always proceeds at a
constant rate.
constant rate.
227
Dot11 thc dictionary and thc intcrfacc hnctions must be specifically dcsigncd
Both the dictionary and the interface functions must be specifically designed
for cach ncw spcakcr/cxpcrt program; [hcy arc thc repository of all the
for each new speaker/expert program; they arc the repository of all the
information I-cquircd to adapt dlc linguistic cornponcnt to such ncw domains.
in formation required to adapt th{; linguistic component to such new domains,
Givcn this niodularity, cspccially thc htnctional intcrfacc, wc can bc flcxiblc about
Given this modularity, especially the functional interface, we can be nexible about
thc clioicc of foi.m;rl rcprcscntntion for a mcssagc; we can usc whatever
the choice of formal representation for a message; we can usc whatever
rcprcsc~ltation is convcnicnt for Lhc spci~kcr's planning, typically the
representation is convenient for the speaker's planning, typically the
rcprcscntation uscd in LIic cxpcrt program (c.p. iis dcscribcd in section 1.1).
representation used in the expert program (e.g. as described in section I.l).
'I'hcrc is no prcsu~nptionthat rncssagcs should rcsult in tcxls of ally fixcd size.
There is no presumption that messages should result in texts of any fixed size.
With dic p r c s c ~ rcst
~ t spcakcrs, singlc mcssagcs havc produccd rcxts ranging from
With the presei1t test speakers, single messages have prodlll.:ed textS ranging from
singlc cxcli~niationsto n~ulti-paragraphdiscourscs. Ncithcr d o incssagcs havc to
single exchllllations to multi-paragraph discourses. Neither do messages have to
bc cqiialcd with turns in a conversation siiicc tllc liilguistic state of t l ~ ccornponcnt
be equated with turns in a conversation since the linguistic state of the component
is prcscrvcd bctwccn i~ctivationsand (I tcxt ciln bc "pickcd up ~ ~ l i citr cIcR off'.
is preserved between activations and a text can be "picked up where it left off',
Stri~cturnlly.mcsmgcs ha\c fi~tlcninto two broad classcs. 'Ihc simplcst just
Structurally, messages have fallen into two broad classes. The simplest just
consist uf'prc-cxisting cxprcssions wkcn dircctly from thc cxycrt's data b;ac. (All
cnnsist of pre-existing expressions taken directly from the expert's data base. (All
of thc ct)mplctcd tcst spcakcrs fcll into h i s class.) 'I'lic data basc cxprcssions
of the completed test speakers fell into this class.) The data base expressions
hcco~nc"instructions for w11at 10 say" ~ l ~ r o t ~(hc
g l ii n t c r ~ ~ ~ ' ~ t ;pmvidcd
~ t i o n by their
become "instructions for what 1O say" through the interpretation provided by their
dictionary cntrics, a i d tllc rcsult is a fairly litcral rcndcring of thc cxprcssion, the
dictionary entries, and the result is a fairly literal rendcring of the expression, the
structilrc of thc output tcxt following tllc co~npositionalstntcturc of thc input
structure of the Olltput text following tile compositional structure of tile input
cxprcssion. 1.-igurc 4 bcluw shows an cxi~mpkcxprcssion from Winston's data
exprc~sion. Figure 4 beluw shows an example expression from Winston's data
hasc of Shakcspcariiln ],lays (givc~iin 1;kI.) followcd by tllc tcxt that Ihc linguistic
base of Shakespearian pl<JYs (given in FRI.) followed by Ule text that the linguistic
program gcncratcs for it using tllc dictionary fi)r h a t domain.
program generates for it using tile dictionary for that domain.
In a simplc "dircct translation" mcssagc such as this. Ihc linguistic component
[n a simple "direct translation" message such as this, tile linguistic component
has somc ability to simplify and smooth Lhc tcxt by following dchult nhcs keyed
has some ability to simplify and smooth the text by following dcfhult rules keyed
by rhc Iucal linguistic structurc as thc tcxt is built; howcvcr, all of the choiccs of
by the local linguistic structure as the text is built; however, all of the choices of
ordcring and lcvcl of' dctajl arc fixcd by thc intcrnal structurc of thc data base
ordering and level of detail arc fixed by the internal structure of the data base
cxprcssion. Sincc rhc structurc of thc d a b basc is dctcrmincd by whatcver is
expression. Since the structure of the data base is detcnnined by whatever is
convcnicnt for the cxpcrt's intcl-nal co~nputationsrathcr than by considerations of
convenient for the expert's internal computations rather Ulan by considerations of
tcxt planning. mcssagcs of this sort arc ncccssarily Iirnircd in Ihc applicability.
text planning, messages of this sort arc necessarily limited in the applicability,
(Scc [Swartout 19811 for an extcnsivc discussion of this problcrn.)
(Sec [Swartout 1981] for an extensive discussion ofthis problem.)
228
McDlInald
(ma ( a k o (story))
(rna (aka (story
(part (macbeth)
(part (macbeth)
( l a d y - m a c beth)
(lady-mac
beth)
(duncan)
(duncan)
(macduff))
(macduff))(heath-scene)
(subpart
(su bpa rt (heath-scene)
(mu r d e r - s c e n e )
(mu rderscene)
(battle-scene)))
(battle-scene)))
229
(message1
(message1
(sequence
(sequence
(rnacbeth (murder (duncan))) :*5nurderma"
(mac beth (m u rde r (duncan))) ;"mlmfe,.ma"
(rnacbeth (become (king))) :",no-becomeking"
(mac beth (become (king))) ;"ma-become-king"
(lady-macbeth
(lady-macbeth
(persuade (mac beth (action murder-ma)))) ;"pemadrma"
(pe(lady-macbeth
rsuade (m ac beth
mu rde rma)))
;"penuadc-ma"
(hq(action
(ambitious))))
:"ombi~iovslm"
(lady-mac
beth
(hq
(am
bitious))))
;
"ambiliou:rlm "
(time-f rame (before-time-of-speech))
(time-f(focus
rame (befo
re-time-ofspeee h))
(macbeth))
(focu s(ancillary-facts
(macbeth))
(anc i lIaryfacts
((murder-ma (motive (ma-become-king)))
((mu rde
r-ma (motive (purpose
(rna. become-king)))
(cause (murder-ma)))))))
(persuade-ma
(persuadema (purpose (cause (murder-rna))))))
From rhc point of vicw of the spcaker/cxpcrt program, thc linguistics cornponcnt
From the point of view of the speaker/expert program, the linguistics component
is a subrou~ine. a subprr~cssthat thc spcakcr explicitly aclivatcs to rccr1iz.e an
is a subroutine, a subprocess that the speaker explicitly activates to realil.e an
individual rncssagc. I t is only activatablc by Ihc spczrkcr, i.e. it has no indcpcndcnt
individual message. It is only aClivatable by U1C speaker, i.c. it has no independent
cxistcncc as a parallcl prtrcss (though thc history of thc discoursc is conliguous
existence as a parallel process (though the history of the discourse is contiguous
across activations), and oncc activatcd runs to complclion indcpcndcntly of thc
across activations), and once activated runs to completion independently of the
rcst of thc systcm. 16 iiltcrnal state is a black-box, not dcsigncd to bc monitored,
rest of the system. ItS internal state is a black-box, not designed to be monitored,
intcrruptcd, or cditcd (thuugh it could bc "shut-off' and complctcly rcstancd).
interrupted, or edited (thuugh it could be "shut-off' and completely restarted).
McDonald
230
I Speaker-Export-program I
A
fif'F
c{b
;a "message"
**c?
B*' 7 E*'?
..
1 me
Time--
" "
***rcalized
.,.
1
It
If
\ff
It
"
Linguistic
Li nguistic
component
component
...
I1
Audience
Audience
l l ~ o u g indcpcndcl~tlp
l~
ct~l~tr.trllcd,
thc lin~uislic ctunponcnc is not cut o f f from
"1llOugh independcntly confrollcd, Ihe Ijn~l1isl ic component is not cut otT from
Ihc spcakcr wllilc it is ]>i~occssiug.I'i2ui.r: G sllo\vs dingr;~mmatic:~llyhow ttle
the ~peakcr whilc it is proCC~Sil1g. Figui\~ 6 shows diagr;nnmarically how the
comporicnt may ;it HII); poi111 ilsk tlic S P C ; I ~ C L .qucs[i(~~ls
;1110111 ;I specific 1ncss;Ige
component may at any point ask the speaker questions ;lbollt it specific message
1i11guislic;lliy(for
clclncrit in ordcr to dcrcrrnir~ch c t s tliiit arc o n l y i;i~poi,t:~~~t
clement in order to dc[crrnillC filcts thdt an~ only important linguistically (for
cxn~ilj>lc"pcrson" and "nu~nbcr") or 10 cipply tlo:r~:~i~l-h;l:;cd
tcsts to some
example "person" and "number") or to ~lppJy dOiTwin-ba:;ed tests to some
clcmcnt, in crfcct cxrcr:iling dlc nlcssngc. 111Lhc figi11.c.Lllc nicss;;:gcis rcduccd to
c!cmen!, in effect extcnding tile lTlesr,age. In 111C figure. the mesS<lgc is reduced to
its csscntials: 2 conipositc rul;~~ir~ii
orcr ol>jc.c~ssclcclcd tly (lie spcakcr/cxpcrt
its essentials: a composite rc1aLtllll over objcClS ~clCClCd by thc speaker/expert
progt.;tol (LC. A ( R(C,D), E, F)). Wc $cc it bl.okcn dosivnwitliil~dtc linguistic
program (i.e. A( O(C,O), E, F. We sec it brdCll (11)\\'n within the linguistic
compojrcnt Idycr by !nycr s ~ ~ r ~ ~i ni tg h
thc riwl rcliitlo11 A, B and E havc bucn
component layer by laycr sl.:.11"ling \\ lth the mot re!<tLJon A, Band E have been
rcfcr'rcd back to thc spc;~kcrfor fu:if~or clal~orntiori.wllilc T: i~iidtllco 0 wcre
n:fc!Teu back to the spcakCT for funhcf elaboration, while r- aill! 111CIl 0 were
rralii.cd dircclly in I.'riglish (indic;:tcd as It..." 111 lllc f7g1il.c).'l'lic spcakcr is
realized directly in Ellgli~h (inl!ical('d ;IS "... " ill the fig1:!"c). The >jwakcr is
ncccssiblc co!~tinuo~~sly,
but tlic tirrii~~g
nnil tlic co~nl~ulatiutial
co~ltcstin wl~ichit
a("ce~;sihle continuously, but I.he timing and the complll,ltioJ1:l1 context ill which it
is actually colu;~~ltcrl
arc dict:ltc11 cntircly I)y tlio lingl~istic: co~i~j~tincnt.
is actually con:;ultcd arc dictated entirely by the lingui,( Ie component.
l ' l ~ c spcnkcr has ~ i ocoiitrol o\cr Llic dclions of lllc li~~guistic
cr~lnponent
The Spl\lkcr has no control o\er the ;lCtions of the lingnistlc COll 1PQllcnt
t l ~ cspci~kcrconlinucs Lo bc aclivc
bcyond s~rpplyj~ig
it will: mcsu;l9cs; ivlic~hc~.
beYllnd slirplying it wi1l1 mcs~gcs; whether the spt'(lkcr continues to be aClive
231
wl~ilcthc componcnt is opcrating is not important to thc theory. Whcthcr it
while the component is operating is not important to the theory. Whether it
should have thc ability lo interrupt Lhc linguistic componcnt and rcstart it.
should have the ability to interrupt the linguistic component and restart it.
pcrliaps in rcaction to what it hc,~rswhilc "listcning" to Ihc ctrmponcnt's output, is
perhaps in reaction to what it heilrs while "listening" to the component's output, is
a qucstion wc will lcavc opcn. 'I'hcrc arc somc cvcnrt~alitics(such as structural
it question we will leave open. There are some evenlllalities (such as structural
:tmhigiiitics) that arc difficult to fi~rcsccwhcn rcidizing a nlcssilgc via a linguistic
ambiguities) that arc difficult to foresee when realizing a message via a linguistic
colnponcnt of this dcsign. and thcrc arc also potcntinl divisiotis ofcfiwt within thc
component of lIlis design. and there arc also potential divisions of e!Tort within the
spcakcr's ptanni~igprtccss wliicli [night bcncfit from a "fccdhack" dcsign of this
speaker's planning process ",hich might benefit from a "feedback" design of this
sort. Ilcforc dcvclopi~lgs~lcha dcsign. howcwr. it is critical to have a clcar
sort. Before developing such a design. however. it is critical to have a clear
understanding of thc kinds of linguistic information that arc n3turiilly available at
understanding of thi.:' kinds of linguistic information that arc nMurally aVJilable at
difrcrc~itstagcs in !hc prnd~~ction
prtxcss ;und of how thcy rclutc 10 thc vtxiibulary
di !Terent stages in the prodllction process and of how they relate to the vocabulary
c ~ fthc spcilkcr's pl;~nningprtrc~:ss-onc of'thc primc conccnis of m y rcscarch.
(If the speaker's planning process-one of the prime concerns of my research.
(Clippingcr i113d Ilrown [Clippingcr 1975, 1978 ; I<.Iir.own 19731 dcvclopcd a
(Clippinger and Brown [Clippinger 1975. 1978 ; R.Brown 1973] developed a
~ n t ~ dof
c l tlic production of psychodnalytic disct~ul'scthiit madc critical use of such
model (If the production of psychoanalytic discourse that made critical usc of such
ii fccdback dcsign, with ~ h cresult that it was ablc l o producc very natural
a feedhack design. with the result that it was able to produce very natural
hesitations and rcstarts in its monologue.)
hesitations and restarts in its monologue.)
4.5. 'I'hc Inlcrnal Struclurc of l l ~ e1,inguistic Con~ponent
4.5. The Internal Structure of the Linguistic Component
4.5.1. 'h casc;~dcof t\ro transducers
4.5.1. ',\ cascade of lI\O tr:lI1sduccrs
As a n automaton, tllc linguistic colnponcnt is bcst dcscribcd as ~ w ocascaded
As an automaton. the linguistic component is best described as two cascaded
of a sit~glr,dnra-dir~credcor~lroller.
rmt~scfucersfolded logelher utlder (he can~?iatid
transducers folded together under the command ofa singlr, data-directed CUI/troller.
'Illc first transduccr goes from the mcssige lo a surface structure Icvcl linguistic
'Ille first transducer goes from the message to a surface structure level linguistic
rcprcscntation of thc uttcrancc to bc produced-the "working" data structure of
representation of lIle utterance to be produced-the "working" data structure of
the linguistic cornponcnt-and
thc sccond gacs from Ihc surfacc structure
the linguistic component-and the sccond goes from the surface structure
praduccd by thc first to I'nglish tcxt. (Scc thc skclch in figure 7.)
produced by the first to English text. (See the sketch in figure 7.)
The "decisions", whosc dispositions are so important to this theory, arc made
The "decisions". whose dispositions are so important to lIlis theory, are made
allnost cxclusi~clyby thc first transduccr; thcy arc the dccisions that realize the
almost exclusively by the first transducer; lIley are the decisions that realize the
individual clcments of thc mcssagc thrc~ughthe sclcction of particular surface
individual clements of the message through the selection of particular surface
suucture phrascs (or rcfinc cxisting ones). l'hc sccand transduccr in cffcct
stl11cture phrases (or refine existing ones). The second transducer in effect
"cxccutcs" thc dccisions of thc first by intcrprcting thc surfacc structure as a
"executes" the decisions of the first by interpreting lIle surface structure as a
program of linguistic actions: printing words, annotating Ihc grammatical context,
program of linguistic actions: printing words, annotating the grammatical context,
recording thc history o f thc prwcss, and propagating gri~mmaticalconstraints.
recording the history of the process, and propagating grammatical constraints.
'I'hc bulk of my thcary of languagc production is contained in the
The bulk of my lIlcory of language production is contained in lIle
characteristics of Lhc surfikcc stnicturc rcprcscntation and the transducer that
characteristics of lIle surface structure representation and the transducer that
produccs utterances from it. l'hc transduccr from h e mcssagc to surface
produces utterances from it. The transducer from the message to surface
structurc~onccptually an cxtcnsion of thc spcakcr-will be less rigorously
structure---eonceptually an extension of the speaker-will be less rigorously
dcvclopcd: it is dcfincd chicfly by its rclalionship to thc first transducer-the
developed: it is defined chiefly by its relationship to the first transducer-the
canlroller-which gatcs its activities and imposcs filtcrs and constraints on its
controller-which gates its activities and imposes filters and constraints on its
McDonald
232
I Mcss:lgc
1. Mcss:lge
to
to
Dictionary
SurL~ce
Surface
Slruclurc
Struclure
_.
*6&+Fe-.
2. Surface
2. Surface
Str3ct11rc
Structure
to
to
're u t
Ted
.... --*---
......... .
Figure 7
1'WI) Transdw;crs
As input:; to llir ~mnsdilccrs.botll tllc mcssagc and rhc surhcc strucrorc arc
As inpllt~; to the transducers. both tJlC message and (he sur/ace strw.;tufe arc
Lrc:~tcd;IS 0)~11ly
ordcscd scqucll:ial strc:rrtls of d:~ta:tc~kcnsfrom thc s ~ r c a ~ arc
ns
trealeLl as tlllally lJI"ckred sequential stre;UtI$ of data: tnkells from the streams arc
prtrcsscd ut:c at a tinic, and arc proccsscd only o n ~ (i.~'.
c 1111' sIrc;llrts IICVCI- rcvcrsc
processed one at a time, and arc processed only ollce (Le. the streams never reverse
or loop). 'l'hc t v ~ ostrciIIlls ilrc prwcsscd "on-line", whidl nlcans that UIC nrctput
or !clOp). The two streams arc pr(}i;cs~cd '"oIl-line", which OlC,1I1S tll,!l the output
From tllc fil.!,t t~.:?~isduccr
Tor onc tokcn is cotr~plctclycotrsunlcd by tlic sccond
from the Iir~;t tr;lllsducer for onc token is completely cOllsumed by the sccond
~ransduc.crbclbrc lllc first niovcs on to its ~lcxttt~kcn. '!'llc t~.ansdr~ccn
pur sc arc
lr,l11sducer bel{)fi: the first movcs on to its next tnkcn. The transducers Jlfr se arc
only i~~tcrprel[~r.r.
'Thcy hnvc (11sitbility to rollow rhcir input strcams and to bind
only ill/C/pre/ers. They have the ability to follow their input streams and II) bind
certain prcdcfincd variables b ~ r tli~tlcclsc; tlicir u.ansJ~rcigpuwcrs dcl'ivc from
cert<lin prcdefincd varbb1c:s but liltle chI..': their transducing powers derive from
t w o bodics of pcrlrlnnctit infonn:i[ion, tiit "tlicrii)~i:iry" and thc "gramtti;~r", to
two bodks of pCrlT1iln~nt information, the "dklinl1ary" and the "grammar", to
which tllc tra~lsduccrswill di~patchi~ccordingto w l ~ i tthcy l i n d in thcir inpiit
which the Irall~,dllC'Cfs will di5patch Jccording to what they find in their input
wilh potential
strcanls. 'llic ciictionary :iss~riatcsclc~ticnufrorn t!ic ir~css;~jic
Slrl..'al1l~. The dicli<)lImy associates clelllenls from Ille IIlCSS;lgC with potential
rcaliying pllrasts, using a lingt~isric voc:ti)~llal-ydclinttl by Lllc grilmlnx; tllc
realizing phras~s, using a lingUistic vocabulary defined by the grammar; the
granlalar intcrprcls lllis vncabul;~~.y
ar~dcnforccs llic ct~nstrni~it.;
and conventional
gfal11111ar intcrprets this vocabulary ;lIld cnfnn:cs thc constrail1t:, and cnllvclHional
dcr~il5it specifics. 'I'llc prtzcdurcs and scllcln;~inin da:i;c tuo "librar-ics" do all of
dctJils it spccifics. The procedures <lno SChCItHla in these two "libraries" do all of
233
thc rcal work of the lingt~isticcomponcnt; thc transduccrs arc rcsponsiblc for
the real work of the lingui~tic component; the transducers arc responsible for
controlling whcn thc libraries are used and for maintaining Illc linguistic context
controlling when the libraries arc used and for maintaining the linguistic context
to which they rcfcr. ('Ihc dictionary and grammar consist of a largc numbcr of
to which they refer. ('Ille dictionary and grammar consist of a large number of
small proccdurcs that arc associatcd with individual tokcns that can appcar in the
small procedures that arc associated with individual tokens that can appear in the
data slrcarns (specific mcssagc clcmcnts, narncs of grammatical catcgorics, crc.).
data streams (specific message clements, names of grammatical categories, etc.).
Whcn a transduccr sccs onc of ~ C S Ctokcns, it "disparchcs to" thc associatcd
When a transducer sees one of these tokens, it "dispatches to" tlle associated
prt~cdurc(LC.calls it as a subroutinc) and waits until that prc~cdurchi~sfinished
procedure (i.e. calls it as a subroutine) and waits until that procedure has finished
its exccution hcforc going on to the ncxt tokcn.)
its execution before going on to the next token.)
'Ihc spccial propcrty of thir cascndc is that thc two transduccrs havc bccn
The special propeny of this cascade is that the two transducers have been
folded inm a singlc prtrcss: thc traversal of thc surfacc structurc. 'I'his prcccdurc
folded into a single process: the traversal of the surface structure. This procedure
can bc summarized as follows: 'l'hc mcss;~gcstarts out as thc solc constitucnt of the
can be summarized as follows: The message starts out as the sole constituent of the
root node of the surfacc structurc trcc: thc first transduccr thcn dccidcs which
root node of the surface structure tree; the first transducer tllen decides which
E~lglish phrasc shouId rcali~cirs dominant clcmcnt and Ihat phrasc, which
English phrase should rcalife its dominant clement and that phrase. which
incorporates at its fringc thc ncxt lcvcl of mcssagc subclcmcnts, replaces the
incorporates at its fringe tlle next level of message subelcments, replaces the
mcssag in that constitucnt position. A trce-travcrsal controllcr ([tic sccond
message in that constituent pOSition. A tree-traversal controller (the second
transduccr) now ti~kcschargc of thc prrxcss and procccds to travcrsc this newly
transducer) now takes charge of the process and proceeds to traverse this newly
constn~ctcdsurfacc struct~~rc
("(he ~ t r P 'fbllowing
)
its normttl top-down. Icft-toconstructed surface structure (" (he Iree") following its normal top-down, left-toright ordcr. As thc controllcr passcs ovcr it. thc lingt~isticannotation on thc trce
right order. As the controller passes over it, the linguistic annotation on the tree
triggcrs dispatches to thc prtrcdurally rcprcscntcd grammar and to thc dictionary
triggers dispatches to the procedurally represented grammar and to the dictionary
for thc realization of thc embcddcd mcss;lgc clcmcnts. ('lhc dictionary thus
for the realization of the embedded message clements. Cll1e dictionary thus
constitutcs thc rca1 contcnt of thc first transduccr.) J f a fringc constitucnt is a
constitutes the real content of the first transducer.) If a fringe constituent is a
word, it is printcd out as part of thc tcxt; if it is a mcsngc clcmcnl, it is rcalizcd,
word, it is printed out as part of the text; if it is a message clement, it is realized,
rcplaccd in thc trcc by thc new phrasc, and thc ncw phrasc thcn travcrscd as an
replaced in the tree by the new phrase. and the new phrase then traversed as an
cxtcnsion of thc surface structure.
extension of the surface structure.
I h c two transduccn can bc rcliably fo!dcd togcthcr bccausc of a wcll-fomcdThe two transducers can be reliably folded together because of a welHonnedncss condition I havc imposcd on thc suucturc of mcssagcs, thc "conslroinrness condition I have imposed on the structure of messages, the "conslrailllprecedes" stipulation, which dictatcs that the cnumcration ordcr of a mcssagcthe
precedes" stipulation, which dictates that the enumeration order of a message-the
position of mcssagc clcmcnts within thc input stream-must bc such that any
position of message clements within the input stream-must be such that any
message clcmcnt that makes rcfcrcncc to othcr clcrncnts in thc mcssagc must be
message clement that makes reference to other clements in the message must be
rcalizcd bcforc any of thosc clcrncnts are.
realized before any of those clements are.
'I'his condition is rcquircd bccausc of the stipulation that thc gcncration
This condition is required because of tlle stipulation that the generation
prmcss must bc i~ldelibk(cf. scction 2.3). 'l'hc lhcory insurcs indclihility by
process must be indelible (cf. section 2.3). The theory insures indelibility by
designing thc trcc-walking controllcr so that it is unablc to rctracc any part of the
designing the tree-walking controller so that it is unable to retrace any part of the
surfacc structurc trcc aftcr it has passcd thn~ughit oncc. Whcn couplcd with the
surface structure tree afLer it has passed through it once. When coupled with the
locality stipulation, this mcans that thc first transduccr is prohibited from
locality sLipulation. this means that the first transducer is prohibited from
arbitrarily scanning the mcssagc in search of potentially rclcvnnt subclcmcnts that
arbitrarily scanning the message in search of potentially relevant subelemcnts that
dcnotc constraints but must "wait" until thosc clcmcnts arc rcachcd in their
denote constraints but must "wait" until those clements arc reached in their
normal ordcr in Lhc mcssagc strcarn. Conscqucntly if a mcssagc includcs elements
nonna! order in the message stream. Consequently if a message includes clements
that should bc intcrprctcd as constraints on thc rcalimtion of othcr clcmcnts (for
that should be interpreted as constraints on the realization of other clements (for
exarnplc thcy might specify discourse focus or pick out attributes that are to be
example they might specify discourse focus or pick out attributes that are to be
McDonald
234
specially
be dealt
spccinlly contrasted),
contrastcd), then
thcn those
thosc constraining elements
clcmcnts should
slioold bc
dcalt with first
first so
dlat
be noted
that their implications can bc
notcd and incorporated into d,e
Ulc context
contcxt of the
thc later
decisions.
process had not bccn
been stipulated
to bc
be indelible,
dccisions. If d,e
Uic precess
stip~~latcd
indcliblc, d,en
chcn we
wc might
imagine
i~nagincordering
ordcring constraints haphal.ardly
hapIia?.ardly within the
thc mcsSt'lgc
mcsslgc and editing affected
affcctcd
parts of the
thc output text by backing up the
thc generator
gcncrator and restarting once a
cnnsrraint was noticed.
noticcd. Allowing even
cvcn buundcd
howcvcr (as for example
cxamplc
constraint
bounded backup however
tlic equivalent
cql~ivalcntof a well-fonned
wcll-fc~r~ncd
wot~ldremove
rcmovc the
thc prlrcss
with the
substring tahlc) would
process
thc "on-line" stipul<:llion
stipulation impossible
~ron1the
Uic rcalm
rcal~nof real-time
rc;~l-timeand would make the
from
to maintain.
maintain, not to mcntil)1l
mcntion requiring
rccluiring a considerably
considcrnbly increased
incrrascd incmory in order
ordcr to
ltl
rct;~in
all
potentially
rct~pcn;lhlc
st;~tcs
of
thc
prrrccss.
'lhc
"constrain-prcccdcs"
rcwin
reopen able St<ltcs
the process. '1l1C "constrain-precedes"
tlic stipulations
stipulation? of
o r section
scction 2.3 arc thus effectively
cffcctivcly working
condition and the
hypoUlcscs that
illat claim
cl;~imthat
tlli~tthe
thc appropriate
appn~pri;ltcprwssing
tradc-off within gcneration
gcncration
hypotheses
processing tradeoff
to supply heii\'i1y
lica\.ily planncd
canccptual messages
mcssagcs to
10 a relatively
rclativcly unsophisticated
is lO
planned conceptual
clcari linguistic
li~~guistic
gcncrator. rather
r;lthcr t!liill
than the
tlic other
othcr way round.
but quick and clean
generator.
llcprerenling linguistic
liog~~islir
context -- the tree
trce
4.5.2. Ucprcscnling
contcxt
In order
ordcr to understand
undcrsti~ndthe
dlc two transducers we
wc must
nus st understand
i~ndcrstandthe
Uic data structure
Ulat
hinds
them
together:
the
surface
struclUre
representation
that binds Uic~ntogcthcr: Illc surfacc structure rcplcscntation of tllC
Ulc uttcrance
uttcrancc
undcr
construction known for
and its
un~crconstructi~n
fur short as the
/he tree.
rrcc. Wc
We first dcscribe
dcscrihc its fonnat
fonnatand
relationship
to the
rcl;~tionsliil~
Uic grammar and the
Uic dictionary.
dictionary. We
Wc thcn
then movc
mauc on to a sketch
skctch of
of
thc
Uic controllcr.
contrullcr, showing how it travcrscs the
thc tree
trcc and how the trec
trcc is used
uscd to
indicate
proper routines
indicntc thc
Uic prupcr
routincs ttl
to dispatch to within thc
t l ~ cgrammar and dictionary.
Figure
Figurc 8 is a diagrmn
diagram illustrating
ilh~stratingthe
thc representation
rcprcscntaticin used
uscd for
fur the tree.
trcc. (It is not
a snapshot of the
itsclf: we will not sec
scc one
onc of those
Lhosc until the
thc main example.)
cxample.)
Uic tree
trcc itself;
Two
arc indicated: constifuent
co~isrifuoi/sfruc/Ure:
srrucrurr: dcfining positions
'I'wo kinds of structures aTC
within the
thc trec.
trcc, how they
thcy arc connected
conncctcd and how the
thc controller
contrullcr is to
Lo traverse
travcrsc them;
thcm;
and ggrammatical
properties those
r a ~ ~ z r ~ ~ alabels:
~ i c a ldefining
dcfining the
thc propcrtics
thosc positions arc intended
intcndcd to
have.
235
Grall111l111ica] IAlbels
COIlS/iluent slntclure
"nodt"
"cat('~,Jry"
------claus.
~
"slot"
f subject 1 [predicate ~
~
"slotname"
"slotname"~
A
~
vp
...;'lonlcnls..
the predicate
?"~tlcnls" of
o/rl!eprcdicarc
[ obiectl ]J
[[verb
verb ]J [objectt
shm'es
~"Cl1nl('nls"
vfrhe w!rb
I-'igurc
Ccnsiti~tcntSlnlclurc:
Smlcture: Positions
Figure 8 I<cprcscnti~lg
Representing Cctlsitutcnt
Positions and
nnd labcls
Labels
'l'lic
constiluc~~r
d r i ~ ~ t uiswindicated
i ~ ~ d i c i i t cgri~[iliic:~lly
d
bbyy tllc
The constituent
structure
graphic:llly
the pattcrn
pattern of trapc~oids
trapezoids
ill dlc
i d the
Uic brackcts
;~ndI)l-ckcts:
tr;~l~c/.,:ids iIHh:,HC
i111l/ci1[~1
tlic
"nodes" in
and
brackets: Uic
the lrapczdds
the "nudes"
tJ1C UCC.
tree. m
<Ind
br<Jckcls
indic~llc llw
pO'iitiol1s of
possible constilllcnts
within
the
nodes ,lIld
referred 10
10
illdicnlc
LIic 1io~iili01is
of possililc
i o l l ~ l i l i l c 1 1~
1 i~t l l i tllc
n nodcs
i111darc rclicrrcd
as
Th~ ;Klllal
the slots'
is "slots"
"sl<~ts"oorr "constituent
" c o ~ ~ s l i t i ~ cSIlI1S".
sl~nts".
nt
'l'lii.
.1ct11aIconstituents
c ~ n s t i t l 1 ~ 1lthemselves
h1c1~~n s c l ~ carc
s Ilic
"cantc~lts": for
lilr ccxampk
r : r ~ i ~ p lthc
s node
aoilc labeled
I:rbclcd "vp"
"vp" is tllc
"prcdic;itc constituent"
cunstitucnt"
"contcllts":
the
the "predicate
(abbrc\i;ltcd "lprcdic;lIc]")
"lprcdic;irc]") of
ol' the
tllc "dmsc
"cl.lllsc ncldc".
Ilcsidcs ,\;lnodc,
cu~ilcnlsof
of a
(abbreviated
node". Besides
node, tllc
the contents
slot tn:ly
may bc
be an word.
\\-ord. or ;
aI 1l1
.'~sag(' c1emcnt.,
they llJay
A. sllhtrcc
mcrs:lgc
~ICIIICIIL or
o r Ulcy
III;I~
bc empty.
cnil~ty. A
slllitrcc from
lirorn
g i \ - c 11adc
~ ~ 10
li-iogc ortllc
ofUic Lrcc
(Sincc the
Llic trcc
a;Igiven
node
to thc
the fringe
tree will bc
be rcfcrrcd
referred ~II
to ;ls
as ao phrirsr.
phrase. (Since
tree
ala,lys ggrowing
r i ~ w i o gthl?~u:Ii
action of
u f tJle
dlc first tln~~sduccr
is ahvays
throup.h thc
Lhe acti(lll
transducer rcpl:~cilig
replacing ~iicss:~gc
message
c l c ~ ~ ~ c\\,it11
n t s phra:cs.
Lllc rioti(~n
"fi'i~~gc"of
ol' tlic
trcc is a dynamic
dy~l;lmic one,
onc,
cletllents
with
phr'hes. the
notion ooff tllc
the "fringe"
the Iree
cClIlstalllly
<.'IS the
process
prm.:eeds.) GraJl1l1l;ltical
tlic ggeneration
c ~ i c r i ~ t i oproccss
n
proceeds.)
t i r a n r ~ ~ ~ ; ~ tlabels
1;1bcIs
icol
constantly changing
cli;~oping 3s
cilhcr label
l:~I)cl ~i<xlcs,
in w
l l i c l ~case
cnsc tllcy
rclicrrcd to SIas
: '\:.ltt'gorics"
"catc!:oricsU and
either
nodes, in
which
they may bc
be rclCrrcd
C
I ttrapezoid;
r : ~ ~ ~ ~ ' %or
u i clsc
d: ~
l i c yIahel
l i ~ h ccon,;titucnt
co11:;tituc11t
l
prinrcd
printed just abovc U
the
else
Lhey
slots, in \\rl~ich
which case
they arc called
inside Uic
tJle br:skcts.
bwckcts. A
node or
c i ~ l l c d"slt>t"n;IOles"
"sli~t-r~;mmcs"and lprinted
~ r i l i l c dinsidc
A nodc
o r slot may
Ulcy
Iiavc
I
l;~bcl.
haveCIIIIImore
than onc label.
Thi~; constituent
reprcs~:nlatit)n is diffcrclit
different from
most oll1ers
frvrn lnost
othcrs in
in the
'I'hi.:
a ~ o s t i t t ~ slrtlcturc
su-ucrurc
c~~t
rcplrs::~ilaLit~n
li~~guistic
lilctaturc
~ c c a ~ it~ scxlilici~ly
c
ldbcls tllc
collstitt~cntprisitir~os
linguistic
Iiter,1ture tbccallsc
explicitly
labels
the constituent
pllsitions rathcr
rather than
d,'Jining LIICIII
them in
terms of
relalive pusiriol~
position or
IlOdcs. ('lllis
n;uning
just dcfinit~g
i n tcrms
uf tJ1C
dic rcl:llirc
iil'llodcs.
('Illis explicit
cx~ilicittl;lming
of
grammar" [pcrlmutter
11rCOllstilucnts
a ) r l s t ~ t ~ ~ cis~ lalso
;ilso
t s done
dunc iinn so"called
s ~ ~ c a l l c"rcl:ttioual
"rcI;tti~i~~:~I
d
~,~;IIIIIII;~?
[I'c~lmuttcr and
I'ost:d 10
l o ;Jppear]
nl;pc:~r] <.lIIU
e ~ l dVwas
S
I;:
used ill
ill some
solnc early pllr;lsc
strilcturc systems. sec
scc [postal
[I'ostnl
Post:il
lIscd
phrase struclure
1963].)
"~lIhjcc[" cti~~stitr~cn!.
constitl!ent. for
alternatively bc
be defined
asd
19631.) The
'l'lic "w!jcc["
fur example,
cxamplc, could ;~lrcro;~tivcly
~ i c l i ~ i c;a
tJIC n
nOlln
phra:,c node directly under a clause
(:annot
Uic
i r m pllr;~:~c
cl:lasc !lude.
IIIIJ~. Slot'nalncs
Slot-~i;~~ncs
c a ~ i ~ i obe
r
disp?~iscd\\it11
Ihc prcscot
tlicnry. howcvcr.
uscd lo
Lo cmry
carry Ulc
disp('nscd
with in the
present thcllry.
how('ver, I)CC;IIISC
bCCiluse llicy
rJley arc lIsed
the
gr<lllllll;lIic.l1
p!'oprrlks ol"thc
posilions lllcy
they 1.1l
bhd
AttClJlpling to u.
use
gr.11nmi1lic31 p!'i~llcrtics
11litlic constituent
cu~~stilucnt
paai!io~is
1c1. ~\ttc~i~ptin::
;e
McDonald
236
'l'hc ~ I T I I ) O I o. ~f [Iris
~ ~ Ipat11
C C is ill tllc scqilcncc of gratnni;~ricrllInbcls that it dcfincs
Th 7 importance of this path is ill the sequence of gwmmaticallabcls that it defines
;ind in d ~ c~rnlclits
c
of tlic slolr at tllc trcc's fringc. I:;IcII 1;tbcl is nsstriatcd iu thc
;l1ld in Ihe contents of U'IC slot" at Ul(.' tree's fringe. F':leh lahel is i1ss11ciated in the
grainniar with a hct of j ~ ~ ~ ~ d citllcr
l l t ~of'
c its
~ .t)wn or prtxcdurcs of othcr labels
gr,llnmar with a set of procedures. either of its own or procedures of other labels
l h ~ arc
t contingc~iton it: tlicsc' proccdurcs ;trc lcfcrrcd w as grtntlt~lnrrnliliiles.
that arc contingent on it: these procedures arc H:ferred III as grammar-routilles.
'I'llc slorn;unc "wl>jcct", rt~rvx:~~nplc.has gri1111111arroutincs of its O*II (i.c.
The slomallle "subject", for cX:J1nple, has grammar routines of its own (i.e,
triggcl-cd I)y thc conu?~llcrwlwn tllc ~sul~jccc]
is rcacilccf. scc bclow) that handle
triggered lly the colllrollcr when Ule lsubjet.:!j i~ rt'aehed, see helow) that handle
oI's~~l)jcct
;11rd ~ 1 . in
b qucstir.)ns i111dthc insc~tivnuf
such tlli~igs;IS LIIC i~r~ctsioll
such thi ngs as l.he inversion of subje!.:t and vcrb in questions and the lnscltion of
~ h cfunction word "ii" in cxlr:~lroscJclauscs such as
c.u.xy rc, br corl/irst.d by uull
l.he function word "i," in cXlraposcJ clauses such as "irs cosy 10 be coli/used by all
rile r~rtirirrolug~~".
'I'llc collsti~ucntI~bclcd"sul~jcct"is lookcii For spccifically by
fhe Irm rinolog.l' ". The COllstilllcnt IJbekd "subject" is looked for specifically by
Ihc ~r:~~iin~ar-roi~tinc
bat pcrforn~ssul~jcct-vct'b;Igrcctncnt in tc~rscdcl;~uscs.and
the grallllllJr-routine that perfurms subject-verb agreement in tensed clauses, and
b y Uic n~orphologyroutine wllcn it nccds to tlc~a'n~it~c
whcdliura pronoun should
by the morphology l"l1utinc when it needs to determine wht,tlwr n pronoun should
bc in thc non~in;~tivc
case.
be in the nOlllinalive case,
4.5.3. '!lIe Coutroller
'Ilw algoritlrm Tor tllc ctrntrolicr is thc llcart of this Ulcory of gc~~craliun:
it is the
algnriUml for the controller is the heart of this U1cory of generation: it is the
sccu~ttlLr.nnsd~iccr,iulnpr~ctin: thc trcc position by posiliot~nnrl dlcrcby dicctling
$eculLd transducer, inlcrprctin~ the tree positiun hy position and thereby dictating
tllc ordcr of cvellts witllin tllc prtrcss, tllc co~~lcstuat
in61nnntion i~vailahlcto
tllC order ofcvcnls within the process, tile COlll!:xttlal infinmJtion available to
roulincs in the tlictionary or gr:rnfnar (LC. ivlrat parts of lhc WCC ~licyc:lil ac'ct~s),
routines in the dictil1l1ary or gr;l1l1lnar (i.e. ",hm parts of th~ trce they call access),
;tnd llic potcnli.11 scupc' of thc decisions ~ n i ~ d11yc tflo~csottti~~cs.
'I'hc algorith~n
,lI1d the potential ~;ClJpC of the deci~ions In<1dc by UJOSC routines. The algorithm
i t v l f is quilt simple sincc all ihc ctin~ntllcrtnust do is tmvcrsc Uic trcc and
il"c1f is quite simple si IKe all the contwllcr must do is tnl\ersc Ole tree and
iliaj):ltch ro I i l ~ ~ . ~roi~tincs
i ~ y acccrding to the hbcls on Lhc positians and the
dIspatch to lihrary routines accl'rding to tile lahels on the positil'I1S ;md the
'11l\~
237
contcntsof
diagra~nmcdin figures
figurcs 10,
10. 11,
11. and 12.
contents
of thc
the slots. It is diogrammed
The
enough,
'lhc import of the
thc algorithm lies
lics not
I N I ~in its flowchart
flowchart which is simple
simplc cnough.
but in thc
imposcs implicitly on
an the designer
dcsigncr of thc
the coastraints
constraints it imposes
the grammar and
dictionary.
Noo acriorz
actioJl can
it
tali be taken
rakerr by the
rlr? linguistic
lir~guisliccomponent
r o ~ ~ r / ~ o n ounless
uriless
rl
il is
dictionary. N
~peciflcal/y selected by
controller
SIIPCI~CUIIJ
bj this
/his cOfllro/ler
m ~ ~ r r u l l eal
a!r the time
rir~rcand place that
rhar the
rhe co~rrroller
dictates.
Thus all actions arc
diclares 'Thus
tire local to the controller's
contr~~llcr's
position and subject
suljcct to
contextual cunrr~~l.
control. No pan
part of the
tree is '\:isible"
the grammar or di<.:tionary
thc trcc
"bisiblc" to thc
dictionary
b r those
thosc parts spccificalJy
spccific;~lly pickcd
c~~ntrollcr'spointcrs
cxccpt for
except
picked out by the controller's
pointers and
prcarrangcd
granimar routincs
abovc and bchind
by Uic
the grammar
routines above
behind lllc
the
prearranged pointers p(lsiti<~ncd
positioned 11y
hicrarcliic;~lc<mstfllctions
constr~~ctions
cmbcddcd
co~itrollcr'sposition: this nicans
contn)lkr's
means tli;~t
that hierarchical
such as embedded
clauses
with lcft
left 10
dependencies such as
pronominalization or
clauscs or
11r rules witli
111 right dcpcndcncics
;IS pr~moniinali~.ation
cllipsis can bc
opp~)sitcdcpcndcncies
ellipsis
be trcatcd
treated naturally whilc
whik plicnomc~ia
phenomena with opposite
dependencies
must bc
be explicitly planncd
planned for
they will bc
be misscd.
missed. Similarly, since
a single
E I or
~ thcy
sincc only asinglc
the trcc
tree is seen
with thc
the multi-position
mulli-posillon buffcrs
buffers of
position in
i n thc
sccn at a time
timc (in conrrast
contrast \$,it11
la~~guagc
parsing systems
sy9tcms such as [Marcus
[Marcus 1980)),
19801). phcnomcna
natural language
phenomena that can
dislributcd vicw or
(ria
structure--such
sccn by prlrcsscs
only hc
be seen
processes uilh
v. ilh a distributed
a constirucnt
constituent structure-such
as structural ambiguities-cannot
be easily
t1iis system
ambiguilics-cannot bc
casily appreciated
apprcciatcd by lliis
systcm and will
typically go uncorrcctcd.
controllcr is thus
ilius tlic
c~nbodimcntof thc
uncorrected. This controller
the embodiment
the hypothesis
th,Jl (~nly
hierarchical and sequential
be appreciated
thal
1,nly hicrarchic;d
seql~cnti;tl dependencies
dcpc~ldcncicscan bc
:~pprcciarcdduring
immediate
being eithef
immcdiatc speech:
spccch: aU
all others bcing
ciUicr expressly
cxprcssly anticipated and planned fOf
for
ahead
time or lcft
left to a ppost
monitof to detect
compensate for later.
alicad of timc
~ ~ shoc
r monitor
dctcct and conipcnsatc
p:!sscd to thc
the linguistic
component, it
Initialization
lnitinliration
When a new message
mcssagc is pnsscd
linguistic component.
bccomcs
contcnts of the constant slot "root-constituent",
"root-constituent". If at that time the
becomes h
thee contents
component has finished
processing any earlier
messages then it will havc
have returned
Anishcd pr,xcssing
carlicr mcssagcs
tl,at position at thc
the top of the tree
trcc and the messages
mcssagc's processing will start
to tllat
immcdiatcly: alternatively
altcrnativcly should an earlier
carlicr mcssagc
immediately;
message still be in progrcss
progress it will
\lim not
be disrupted
to the ncw
new one until
uotilthe
one is finished
bc
disruplcd and nothing will hoppen
liappcn lo
the old onc
and thc
ule controller
controllcr completed
complctcd its (raversal
traversal back to tl,e
UIC root. lhe
I h c initial
initial state of the
controllcr is shown in Figure 10.
controller
McDonald
238
MUMBLE ( m e s s a g e )
MUMBLE (message)
nrgumcnt: a rncssagc
argument:
messagcnone
rcturna valuc:
return value: none
'Ihc initial trce:
'Ihe initial trec:
root-node
root-node
[root-constituent]
[ root-constituent]
START
1niriali-r.cController Variables:
Initialize COlllroller Variables:
c u rrent-node < = 'root-node
current-node
cu rrent-slot<= 'root-node
< = 'root-constituent
cu rrent-slot
c u r r e n t - c o n<t=e n'root-constituent
t s < = (Make-elmt-instance m e s s a g e )
cu rrent-contents
<=
A
(Make-elmtinstance message)
AStart thc
Start UIC Controller at
Controller at
"Dispatch on currcnt-contents"
"Dispatch on current-contents"
l h c controllcr decomposes
'I'hc Iilock-lcvcl Organimtion of l'hc ContralIer
The B1ock-Ie\'el Organil.ation of The Controller
lhe controller decomposes
in ro thrce rccu rsivc p r t ~ e d urcs namcd process nod^, I'r(~ccss-slot,and Dispa!ch.
into Lhree recursive procedures named Process-node, Process-slot, and Dispatch,
whosc dcfinitions arc givcn in thc ncxt two figures. 'Ihcy arc brcadcd in that
whose definitions arc given in the next two ligures. 'Jlley are threaded in that
ssmc ordcr: IJroccss-nodeciills Proccss-slot on cach o f its ilnmcdiatc constituents,
same order: Process-node calls Process-slot on each of its immediate constituents,
and
Process-slot in turn caHs 1)isp;ltch o n thc contcnt of UIC slot prcscntly being
and Process-slot in turn calls Dispatch on the content of the slot presently being
proccsscd, 'lhc rccursivc structure of the trcc is matchcd in the controller
processed. 'Ihe recursive structure of the tree is matched in the controller
algorithm by Ihc rccursivc call to Prtrcss-nodc from within Ilispatch.
algorithm by the recursive cal1to Process-node from within Dispatch.
'I'hc algorithm differs from thc standard rccursivc dcsccnt algorithm far
The algorithm differs from the standard recursive descent algorithm for
travcrsing a trce only in its "rcalizc and rcplacc" stcp within Dispatch: This step
traversing a tree only in its "realize and replace" stcp within Dispatch: This step
239
has thc cffcct of dynamically cxtcnding thc trcc cvcn whilc thc controller is
has the effect of dynamically extending the tree even while the controller is
travcrsi~lgit. 'Ihc cxtcnsion stops whcn a plirasc is sclcctcd that hi~sno further
traversing it. The extension stops when a phrase is selected that has no further
mcssagc clcrncnrs cnlbcddcd at irs fringc. 'I'hc dynamic cxtcnsion of lhc trcc as is
message clements emhedded at irs fringe. The dyn,nnic extension of the tree as is
thc kcy to thc progressive ~ P ~ I I F I ~ I Etcchniqllc
III
that char;ictcri7cs this theory: the
the key to the progrrssive rejillrment technique that characterizes this theory: the
cnlbcddcd mcssagc clcmcnts in cffcct constitute "dclaycd dccisions" that arc not
embedded message clements in effect constitute "delayed decisions" that arc not
takcn up until all of b c prior dccisions that might cffccr Illcin have bccn made
taken up until <111 of the prior decisions that might effect them have been made
and thcir constraints cstahlisl~cd(jll such dcpcndcncics hcing. by hypotlicsis.
and their cunstraints established (all such dependencies being. by hypothesis.
associated with pcaitions in tllc st~rfiiccsrnicri~rcahovc iiad bchind tllc cmhcddcd
associated with positions in the surface stl1lcture above and behind the embedded
clcmcrrt). 'I'his rcchniq~lcis akin to Lhc tcchniquc 0frjEln)~rjbittclitrg that is uscd in
clement), This technique is akin to the technique of deloyed binding that is used in
thc prt~cssingof somc progr,lnlrning Ianguagcs.
the processing of somc programming languages.
McDonald
240
Dispatch (current-contents)
'empty
ICEI'URN
RETURN
a WORII-INS'I'ANCI!
a WORD-INSTANCE
(Morpholog y-routine current-contents)
(Mo rpholog
y. rout ine eu r rent-contents)
:ntod!ry Ilrc wktrC'r p!inl 11;uiic ;ISnccdcd i u ~ d"say" it
:mncti ry
111(:
iUld
"say" il
RETURN
a NO1113
aNODE
(Proci?ss-node current-corrtcnts)
(Procoss-node e u rrent-contcnts)
SIIIII'UICr\i
l~ETURN
an M.M'I-INS'l'hN(:li
an ELMT-lNSTANCE
cvrroni-contents < = (Realize current-contents)
c;u rrenl-con tents <= (nealizc eu rrent-contents)
~
Aflcr-,4S~cr-l!~:1li~;~lion
Hcalizalion
(Dispatch current-t;ontents)
(Dispatch current- (;ontonts)
~ RIA'UKN
RElUHN
ERROR
241
Gating the
tlac First Transducer
Tnnqduccr
The
l h c first transducer is taken
takcn up cxclustvcly
cxclusivcly in the
di5p;rtch to the
thc function named "rc1Jlil.c".
"rcalilc". This
'l'his function contains
col~tainsthe procedures
dispatch
rcali~ati(mdecisions:
dccisions: Le.
i.c. criteria for
and heuristics that arc common to all realization
altcrnatc subsequent
subscqucnt reference
rcfcrcncc fnmls
fi~rn~s
'bone" or "such",
'kuch",
pronominalization. alternate
pronominalization,
such as "one"
for WH-mavement;
WH-movcmcnt: if nane
none of these
thcse apply, it dispatches to the
and "gap-creation" for
message
tncrsagc clement's
clcmcnCs dictionary
diction:lry entry
cntry and the
thc standard entry
cntry interpreter
intcrprctcr sketched
skctchcd
latcr in the
thc papcr
complctc specification
specification of tl,e
thc re"lization
realization prwcss,
sce
later
paper (for a complete
process, see
19801).
[Mcl)~nilld1980]).
[McDonald
Ebcry message
mcsstlgc element
clcmcnt tllat
d u t is embedded
c~nbcddcdin the
thc tree
trcc must eventoally
cvcntually ppass
as
Every
throogh
this step
stcp of the
thc controller,
controllcr. and tl,en
thcn and only then
lhcn will its English
through tllis
rcturn from
from the
thc function
fi~nctionRealize,
llcalixc, the
thc selected
sclcctcd node,
dccidcd on. On return
realization bc
be decided
thc tree
trcc in place
placc of the
thc original clement,
clc~ncnt,at which
word, or subclcmcnt is knit into U1C
point the
thc controller
controllcr loops
loops around
an~undand repeats
rcpcats the
thc dispatch
dispatch on the
thc new
ncw contents
contcnts of
Ihc slot.
the
.4 Last-stage,
last-stage, Marphalagicall'roeess
hlorpl~ologirall'roccss
A
When an English word is found as the
contcnts of a slot,
slm it is passed
p~sscdto a procedure
prcedurc named
namcd the
thc Alorpholugy
ruu~brefor
fur
contents
Morphology mUline
an) required
rcquircd specialization
spccialiration and from
from there
thcrc to the
thc output stream.
strc;lm. The
l h c Morphology
Morphology
any
routinc docs not have
havc much work to do in English: it is responsible
rcsponsiblc for the case of
routine
pruno~ins,for plural forms. verb
vcrb conjugation, contractions,
contractiuns. and the possessive. It
pronouns.
b:ascs its
ils decisions
decisions on properties
propertics it associates
associatcs with the
thc slot-na1nes
slat-names of the slot that
bases
(c.g. "subject"
"subject" is understood
undcrstood as forcing
forcing the nominative case) and
contains the word (e.g.
inhcritcd attachments
attachmcnts to the constituent structure marking such "cxtra"extraon inherited
information as tense.
tcnsc, aspect.
aspccL and negation. It is the routine within
constituent" infonnation
thc clearest representation
rcprcsentation of the notion "next word",
word, and as a
gcncrator with the
the generator
rcsponsiblc for grammatical
gra~nmaticalplicnomcna
dcpcndcnt on successive linear
result is responsible
phenomena dependent
vcrbal auxiliary,
auxiliary.
position such as the verbal
McDonald
242
E~llcr-node:
Enter'nude:
( ~ o r c a c hfealure
of current-node
(Forcilch do
feature
of current-node
(evaluate
(get-gramma r-routine 'cnter-node feat11re)))
do (evaluate (gotg r<lmrna r routine 'cnte r-nodc featu re)))
I
i
I~!:ivc-node:
I.~~:l\'('node:
\
) I:lT!'IJ!<N
HE'!'URN
Process-slot (current-slot)
Process-slot (ell rronl-slol)
iiI'gtlIl~~f~t:
it S I O ~
argullIcnr:
a slot
rc1lll.n
valuc:
relurn value:
nOllC
I~OIIC
eu rrontcontents
I:ntcr-s!ot:
Enter-slot:
(Foresch fealure of currctll-slot
(Dispatch current-contents)
(Disp .. tch current-contents)
Ix~~Yc-s~o~:
LcaH~'slot:
(Forcach foalure of current-slot
(Forc<lch foal u re of cu rrent-slot
do (evsluatc. (gel-grornn~ar-routine'leave-slot ieaturc)))
do (ev<Jluate (got-g r<::mnl <1 r- routine 'Ieaveslot feature)))
5
) 1II:l'UI:N
RETURN
Fipurc I?. 'I'lrc Controller: I'roccscmiiig
Ncdcs a;ld Slots
Figure 12 Tlh' ((mtrol1cr: ProCCS',illg Nodes ilnd Slots
243
Assoc.iatinp Gron~n:~r-roulincs
l\'ith Conslitucnt Struclurc 1,abcls
Thc
two
Associating Grammar-roulincs With Constituent Structure Labels
The
two
prtlccdurcs Process-node and Proccss-slot arc tllc primary placc wllcrc rhc library
procedures Process-l1(lde and Process-slot arc the primary place wherc the library
of activc prtccdurcs that consr.iiutcs tlic activc aspcct of dic grammar is uscd.
of active procedures that tons!itutcs the active aspect of thc grammar is used.
'Ihcsc proccdurcs arc rcfcrrcd to as gra~n~~iar-roulif~cs
and arc asstriatcd with
These procedures arc referred to as grall1l11G1~ruuti/lcs and are associated with
spccific grammatical labcls (also rcfcrrcd to as frorures). Grammar-routincs arc
specific grammatical labels (also referred to as f('atures). Grammar-routines are
fiirthcr spccificd by thc point in tlic controller's nlgoritli~nwhcrc thcy arc to bc
further specified by the point in the controller's algorithm where they arc to be
cxccutcd. markcd in thc flowcharts in bold rppc. Lhcrc arc fivc gcncric cvcnts in
executed. marked in the nllwcharts in bold type. Ulere are fhe gcneric cvcnts in
tllc trnvcrsiil of a trcc hll of thc irtivc pi~risof t l ~ gcncra~or's
c
I'nglicll grammar arc
the traversal of a tree All of the aClive parts of the gencrator's English grammar arc
asariatcd wit11 tlic I,~l~cls
attachcd to thc nodcs and slots of tlic surf;lcc slrirctnre.
associated with tJ1C lahels attached to the nodcs and slots of tlie surface structure.
'I'hcsc points correspond to firc gc~icric"c~cnts" in Lhc travcssal of tlic trce:
These points correspund to five generic "evcnts" in the traversal of the tree:
cntcri~igor lcaving ij nodc. cntcring or Icaiing a slot, and just aficr a rncssage
cntcring or leaving a node. entering or leaving il slot. and just after a message
clcmcnr 11as hccn rcalizcd I)ut bcfore ~ h crc;~li~ing
plirasc liiis bccn knit illto h e
clement has been realized but before the realizing phrase has been knit into the
trcc. As indicated in figlrrc 12, wl~cnonc c~fthcsc even& is rcnchcd. cach of the
trec. /\s indicated in figure 12, when one of these events is reached. each of the
lal)cls ascrsiatcd with thc currc~itnodc or slot is clicckcd fur a grammar-routine o f
lahels associated with the currelit node or slot is checked for a grammar-routine of
tli;~tcvc~lrtypc. whicli if found is immcdiatcly cxccutcd.
that cyellttype. which iffound is immediately executed.
Gralnmar-routincs rnay pcrli~nnany of ~ I i cfi~llowingactions:
Grammar-mutines may perflJlln any ofU)e following actions:
(1) Add functioll words directly into the output text stream;
McDonald
244
rcalization of evcry rncssagc clctncnt instance, evcry sclcctcd choicc, and cvery
realization of every message element instance, every selected choice, and every
dccision brought about by lhc grammar.
decision brought about by the grammar.
In sirmmary, thc currcnt contcxt of thc linguistic componcnt can be vicwcd as
In summ<lry, the current context of the linguistic component can be viewed as
a four dirncnsional array consisting of (1) the namc o f thc contmllcr cvcnt or
a four dimensional array consisting of (I) the name of the controller event or
subroutine prcscntly bcing cxccutcd, (2) thc valucs of thc thrcc controllcrsubroutine presently being executed, (2) the values of the three con trollervariables. (3) thc valucs of thc grnmn1;lr-variahlcs, and (4) thc rccords of the
variables, (3) the values of the grmnmar-variablcs, and (4) the records of the
discourse history, l'his rcprcscntntion of thc context will bc uscd in thc diagrams
discourse history. This representation of the context will be used in the diagrams
of thc main cxamplc.
of the main example.
4.6. An Example
t
on thc appar~itusof thc
This cxamplc should scrvc two purposes: first. to p ~ l flcsh
This example should serve two purposes: first, to put nesh on the apparatus of the
linguistic componcnt just discussed by showing how it acts as a systcm; and
linguistic component just discussed by showing how it acts as a system; and
sccond, to illustrate sornc of thc sorts of linguistic analysis that onc is lcad to as a
second, to illustrate some of the sorts of linguistic analysis that one is lead to as a
scicnlis working in tcrllis of this thcory of langi~agcgcncration. From thc point
scientist working in terms of this theory of limguage generation. From the point
of vicw of convcntioni~l,cotnpctcncc-bascd lingi~isricssolnc of thc analyscs that
of view of conventional, competence-based linguistics some of the analyses that
will bc skctchcdmay sccni ~tnusualor cvcn bizurc; this is pcrhrlps to be cxpectcd
will be sketched may seem unusual or even bizzare; this is perhaps to be expected
si~lccthc nccd to srnootllly intcri~ctwith an indcpcndcnt, non-linguistically based
since the need to smuothly interar:t with an independent, non-linguistically based
proccss (the spcakcr/cxpcr( program) has i~nposcdits own mark on thc analyscs
process (the speaker/experl program) has imposed its own mark on the analyses
cvcrywhcrc from thc timing of dccisions to thc dcciils of thc surface constitucnt
everywhere from the timing of decisions to the deuiils of the surface constituent
structure.
structure.
'Ihis cxamplc is drawn from thc logic domain dcscribcd in scctian 1.2, Wc will
This example is drawn from the logic domain described in section 1.2, We will
look at thc gcncration of thc last part of thc "barbcr proof': initially in
look at the generation of the last part of the "barber proof'; initially in
considcrablc dctail in ordcr to dcrnonstratc how thc cuntrollcr interacts with the
considerable detail in order to demonstrate how the cuntroller interacts with the
sclcctcd surfacc structurc. and thcn at a coarser lcvcl of dctail so as to conccntrate
selected surface structure, and then at a coarser level of detail so as to concentrate
on thc analyscs and thc motivcs behind thcm. ' h c cxamplc will actually bc only
on the analyses and the motives behind them. The example will actually be only
thc last two lincs of thc proof, but wc will put thosc lincs in contcxt first by
the last two lines of the proof, but we will put those lines in context first by
sketching the evcnts up to that point.
sketching the events up to that point.
Gcncration in thc logic domain is an cxamplc of "dircct translation". There is
Generation in the logic domain is an example of "direct translation". There is
no planning componcnt; instcad, mcssagcs are constili~tcd dircctly from the
no planning component; instead, messages are constituted directly from the
regular data structures of the domain, thc lines of thc prwf. This is the
regular data structures of the domain, the lines of the proof. This is the
characlcristic pattern of dircct translation systems (for cxamplc Iswanout 1977;
characteristic pattern of direct translation systems (for example {Swartout 1977;
Shorlliffc 19761). and it thc sourcc of thcir convcnicncc-sidc-stcpping an
Shortliffe 1976]), and it the source of their convenience-side-stepping an
elaborate planncr by taking advantage of thc organization alrcady in thc domain's
elaborate planner by taking <ldvantage of the organization already in the domain's
native data StNCtUrCS, as well as of thcir limitations-thcy lock thc gcncrator into a
native data structures, as well as of their limitations-they lock the generator into a
single lcvcl of abstraction and invariably lcavc many conceptual conncctions
single level of abstraction and invariably leave many conceptual connections
implicit. 13y translating first into a linguistic rcprcscnlation and thcn applying
implicit. By translating first into a linguistic representation and then applying
general grammatical rulcs and usagc heuristics, wc arc ablc to gcncrate a
general grammatical rules and usage heuristics, we arc able to generate a
smoothcr, more natural tcxt than carlicr gcncrators that translatcd dircctly into
smoother, more natural text than earlier generators that translated directly into
word strings: howcvcr, the overall form and contcnt of thc tcxt rcmain in the mold
word strings; however, the overall form and content of the text remain in the mold
245
set by tlrc input proof. It is safc to say that thc dircct translation tccl~niqueis
set by the input proof. It is safe to say that the direct translation technique is
pushcd hcrc to Lhc lilr~itsof its flucncy: filrthcr inipmvcolcnt~will only comc with
pushed here to the limits of its fluency: further improvements will only come with
thc .iddition of a planner with a knowlcdgc-basc ~Trhctoricalhcuristics.
the addition of a pbnner with a knowledge- basc of rhetorical heuristics.
'l'hc "~ncss;~gc"h a t startcd tlic gcnclitlor off wia lhc scvcll lincs of thc proof in
The "messagc" that started thc generator off was the seven lines of the proof in
scqucncc. Figure 13 is a snnpsllot of thc trcc just aftzr h i s mcssugc was rcccivcd
sequcnce. Figure 13 is a Sna[1ShOl of the tree just afl~r this messilge was received
and distributed i i i ~ orhc slois of a silnplc pnragmph; notc that Llic ordcr of thc
and distributed imo the slols of a simple paragraph; note til at tile order of the
lincs has bccrl yrcnrvcd in thc Icrt-lo-right scquc~iccof lhc slots. 'Ilic formulas
lines has been preserved in the lcn-to-right sequence of tile slots, '111C fOlmulas
l~avcbccn abhrcvialcd to just lllc namcs of rhcir lincs.
have been abbreviated to just the names ofthcir lines.
lincl: prcrtlise
premise
3 x (barber(x) A Vy(shaves(x,y)
ishaves(y,y)))
3x (barber(x) A Vy(shaves(x,y) ++ ""shaves(y,y)))
linc2: cxihtcnlii~linstantiation (1)
linc-2: existenti;ll inswntiatian (1)
barber(g1 A Vy(shaves(g,y) +) yshaves(y,y))
b.. rber(g) 1\ Vy(shaves(g,y) +- ""shaves(y,y
linc3: t;~otology(2)
lind: tautology (2)
V y shoves(g,y) +-, ~ s h a v e s ( y , y )
Vy shnvcs(g,y) ++ -'shaves(y,y)
lincil: u:~i\~crs~I
itls~antialion(3)
Iine4: univmal instantiation (3)
shaves(g,gl ++ ~ s h a v e s ( g , g )
linct:
lineS:
linc5: t;~utulogy(4)
Becomes:
Fjgur~
'lhc lixcd travcrsnl paltcrn of tllc contrullcr dictates dlnl thc text v;ill be
'111e fixed traversal pattern of the controller dil.:lates tllat the text will be
produccd incrcmcnt3lly fi)Itowing thc scqucncc of thc lincs. This guaranteed
produced incremenlJlly follO\~ing tlw sequence of the lines. This guaranteed
convcn!ioi\al scqucncc providcs
b,isis for a clironologicnl d~scolrrsccontext:
conventional seqllcnc~ provides the rasis for a chronolugical dIscourse context:
Ihc text for rhc iirst litic will havc 1)ccn sclcctcd a:?d pn~ducedbcfort. that of the
The telt for the firsl line will have heen selected a~~d produced befi.lfc that of t.he
sccond linc is bcgu!i, the sccond bcftnt thc tllii-d, and so on. On this b:~sis, a
5ccond line is bcgu!l, the second before the third, and so on. On ulis basis. a
modcl of w h ~ It ~ listcncr
C
will havc 11cardc3n be jnfcrrcd, and, couplcd wiUl a
model of what the listeller will have hCJrd can be interred. and, coupled witll a
(vcty simplc) made1 of what inf~rinccsIhc listcncr will makc or. can bc lcnd
(very simple) model of what infcr-enccs the listener will make or can be lead
McDonald
246
Assur~~e
rho1 ~herc
is sot~~cr
rtlho slmvcs everyone who
Assume
thal there
is some
barberbnrbcr
who shaves
everyone who
doc
st^
'1 .~i~rrr,r
h
irnsc~(otid
,lo otrc clsc). Cull lrirli Girtseppe.
docsll'l
shmc
himsclf(and
110 one elsc). Calf him'Giuseppe.
it: arljlot~r
tdlo (IOL~SII
'I s h n l ~
hi/nsclf~tn~tld
bc sl~nvcdby
Now, No
011)'011(' who docsl1'l shave himsclfwould be shaved by
Giuseppe.
771is~vo~rlri
irlcl~rde
Giuscppc
IlitllseCJ:
Thai
Giuseppe.
nlis
would
inelude
Giuseppe
himself
is,
he is, he
\r~ould
si~nvc
hirnseIJ:
qutld
oriljl iflre
did
notThai
sllar>e
hirnseK
wouldwhich
shaveishimself,
ifand onlyThis
if hetncatu
did nOIrhar
shave
himself,
a contrudicrion.
ilieass~rrnp/ion
leads
whichtoisaacotrirudictiot~.
cOIl/radie/ioll. Tlzcrefore,
This mcansi! lhat
Ihe
assumplion
leads
isJulsc, rttere is no such
/0 a cOlllradiction. Therefore. if isfalse. there is /10 such
. barber.
. barber.
I'igurc 14 The '13arbcr' Proof
Figure 14 The 'Barber' Proof
'Ihc third Iinc docs not appcar in thc tcxt per re at all since it is an obvious
The third line docs not appear in the text per se at all since it is an obvious
conclusion from what was known so far. 'I'hc fourth linc, on the othcr hnnd, has
conclusion from what was known so far. The fourth line. on the other hand, has
hccn expanded into a thrce scnlcncc "mini-argument" hccnusc of its importance
been expanded into a three sentence "mini-argument" because of its importance
to the proof and bccausc its logic may not bc obvious. 'Ibc fifth line, the
to the proof and because its logic may not be obvious. '1l1e fifth line. the
dcrivntion of thc contradictic.rr~. is intcrprctcd for its co~~vcntional
role, i.e.
derivation of the contradiction, is interpreted for its conventional role, Le.
announcing thc derivation of the contradiction. In the tcxt it is adjoined to the
announcing the derivation of the contradiction. In the text it is adjoined to the
prcvious scntcncc as a rclativc clausc-a kind of "rcnaming" spccch-act.
previous sentence as a relative clause-a kind of "renaming" speech-act.
We scc thc changes that rhcsc realization dccisions havc madc in Lhe trce in
We sec the changes that these reali/ation decisions have made in the tree in
figure 15. Only thc top nodcs of thc scntcnccs arc shown. 'Ihc controllcr is now
figure 15. Only the top nodes of the sentences arc shown. The controller is now
positioned at slot "dtl". and thc two firlal lincs of thc proof rcmain. In looking at
positioned at slot "do". and the two final lines of the proof remain. In looking at
Lhc gcncratinn ofthosc lincs, wc will hc hcgin with vcry cursory descriptions ofthe
Ule generation of those lines, we will be begin with very cursory descriptions of the
first fcw lincs to establish thc hasic pattcrn, thcn movc to vcry dcuilcd snapshots
first few lines to establish the hasic p~ltern, then move to very detailed snapshots
of thc controllcr and thc trcc fur scvcral dccisions that involve straight-forward
of the controller and the tree for several decisions that involve straight-forward
analyscs, and thcn back away from the dctail during the last line to highlight the
analyses, and then back away from the detail during the last line to highlight the
spccial kind of reasoning that gcncration can cntail. For fi~rthcrcxamplcs and a
special kind of reasoning that generation can entail. For further examples and a
thorough discussion of thc analyscs. scc [McDonald 19801.
thorough discussion of the analyses. see [McDonald 1980].
247
(dl)
tdll
[d2]
':--.
id21
[d3)
id31
"psi/ionoJlhe cotlrroller"
"positioll ofthe COli/roller"
4.6.1.
HCl'IJrshc J)l'SCClIt
McDonald
248
I.
process slale:
I.Controllcr
Controller pnrccss
slalc: "Dispatch"
"Disj~orch"
3. (;r;~t~~tadr
\'ari:~l~!es (Oll/Y
(ortly 3 orc
,O~WC)
.l
GnllHm~r Y:.uialJ!cs
arc SIshown)
ClirrCl1t~sCllfel1Ce
ct,rrcnl-sctllerlce
IdS]
?currcll[c1ause
clause
~~~~~~~subjrcl
currelltsubjec/~
[subjE'ct] predicate
Variables
2. COlltroller
C'o~~trollcr
\'ari;tblcs
............. this"~
.~~~~jj;pi~~r:===
currenl- rude
currellt/lude
lVE!rb)
[verb] [complement]
[con>plement]-current-SIOI furrent-slol
memr
))is('oursc histor!
history
4. I)incoarsc
i(il11plic:llil'nltl![...],linc6[...},
lilinlll[..], Ii6[..
ror~jllll[
..I. ......)
ronjllll[...],
clause
~flirretll-C01l1enIS
C'--<
[sul~jcrt[[predicate)
[predicntcl
[suhjertl
far1,ta1~89
10",,,,1,69
[verb]
[verb] [to-objj
[lo-obi]
lead
/nod conj101
conjlOl
Figure 16
l?igvrc
16 Snopshot
Sngpshot ol'the
oI't11c Cuntrol1cl"s
Contiollcr's Stilte
State
\Vith U
th~
st,lte.
lllC Ilcxt
the posiriuti
position uorr lthe
:\:itli
CI contlOlh.,r
c t ~ ~ i t in
~
i n ~its~ "Dbpatch"
"llispatcl~"
ll~r
st.11~.Cic
licnt Skp
sku11 (H,lil
lictiil tlic
hc
sni.lpshol
the earlier
nowch~lrtS). '1l11C
rccllrsivc caU
ca!l ltno lProc(,ss'nod~
' r t ~ c s s n o d c(r"fer
(rcicr 1to11 Uic
ciirlicr no\+,ch;~rts).
bc
sn,~y!JloL is an recursive
controller v;lii;~blc
variable "current-node"
reil')SigJ:l.~d to lhc
the !lew
node 13bclcd
bbc1cd "clause",
controlkr
"c11rrc111-tilrlc"is rcnaig~:cll
11cwnodc
"cliiusc",
we ('xr:cutc
Wilh
and wc
cnr:cotc allY
ally gr;l1l1mar-routine$
g r : ~ ~ ~ ~ ~ i ~ n r - r;lssodlled
;~ss~ri;itcd
~ ~ u t i l i c will1
s
kL1::cc label
lal?cl "l'1JlISC"
"clause" and lhe
Ihe
There ilrc
arc ;rrc!.ci~tl)
i''Jrc~,ciltly two oflhcsc:
o f Ulcsc: one
rinc for assigning
cCOlllf(\llcr-c\'cllt
u n ~ t ' ~ d l c r - c v"cntcrnodc",
"cnlcr-nodc".
c~~t
'1'llcl.c
the ClinenlSClHencc,
whil.:h
long~r 'lpplics,
tJwt rccursi\,cly
rCiOlIl'sivcly rc:tssigns
reassigns
tlic
CII~ICII~-S~I!~CIICL'. w
l l i ~ l nno
l o longs
npplics. and one
onc Ula:
Uic
gr:~nitil;~r-vnriabIc"cIIITe-Ill-c1ause",
"c~~rrcti~clousc".
which docs. From Ilcrc
conlrollcr
the grammar-v<lriablc
here thc
the controller
llliJvCS to II'IGccsssl(lt.
"current-slot"
the lirst
the cl.auscs
' ! ~ r c s s s l ~ ~fI:assigning
rcilssi:ning
L
"currclil-slot" to ll:c
liint or
ui' thc
clonse's
niorcs
constilt;Cnt slolS,
the "subject",
slots. Ihc
"subjcct", and "currcntcol1tel1ts"
"currcnt-conrciits" to formuta89.
liir1nula89. One
co~istill;ct~l
~Ipplics licrc
here to assign
grammar varijblc
vari;ib1c "currt:nt-subjcct"
to
grammar-routine :~pplics
gramn~iir-ioutinc
a!;sign the
Uic grammilr
"currznt-suhjccth W
the current-contents,
rrCU1SlVC ellvironments
such
danse
tlic
current-catitcnts. (Working
(\Vorking in
i n rccu~sivc
c~iviro~ltncnts
s ~ ~ c:a~,
I!; l i this clause
requires c.lfe
the ltiming
J5signlllcnls. By
the
thce
icrlr~ircs
c.lrc in
it1 ihc
i r n i ~ ~of
opf assignt~len~s.
I l y "dcbying"
" d c l ~ y i l ~ gd!c
" updating of
of h
pointer to thc
the subject
until nun.
now, wc
we haye
retained acro.s
tu Ule
tile Ilighcr
Jlig,hcr suhject
pointelsubjcct unlil
11aic retnincd
;icccss Lu
si~hjcct
where it war
W3') 11eC'(krl,
j4licrc
ncedcd, c.g..
c.g.. for [P(lt\:lUhll
~ o t c l ~ ~applications
i ~i ;! ~ l~ l i c i l t i oor
ot/i s'cquiv,licnt-np-dcletion
~ ~ ] ~ i v ~ t I c ~ ~ t - ~ i or
opr- d c l ~ t i o ~ i
C!i:US!~.)
cconjunction-reduction
o t i j u ~ ! c t i o n - r c d i ~LI;alll:~
~ l11:i: ~ ~lc\d
Ic\c.l
~ ~ "rlhc
o r i h c c!.:usc.)
ffl'm
Proccss-:;!I)t. !!le
the ('"U!Jlrol1cr
[)i~p.Jtch and
the "msg-elmt"
Fri>m l'rlrcc!;s-slot.
:u!~~rollcrcalls
c:lllr [>izp.ltcll
ilnd selects
sc:ccls ihc
"lnsg-clrnt"
casCo "he
J{c~\lil.c wiil
will iicw
now CO;I:IO~
eOiltloJ ltile
rl~,llizing 1)lrnsc
pllraSl: for
casc.
.fhc function
!i~l!ction l<e:!li~c
l ~ c";Clcctitl!l
l c c l i i ~of
ol f~i.iii i\:;~liring
249
formula89 which will thcn bccomc thc current-contcnu and 'thc cuntrollcr will
formula89 which will then become the current-contenL~ and the controller will
loop tllrough Dispatch again. 'Ihc rcaliration of fortnula89 involvcs apprcciating
loop through Dispatch again. 'Ihe realil.ation of formula89 involves appreciating
its rcdcsc~.iptionas an objcct with a special role in thc proof, i.c, "rlre assur~~ption".
its redescription as an object with a special role in the proof, i.e. "the assumption".
Its dictionary cntry is considerably more involvcd than avcragc; conscqucntly
Its dictionary entry is considerably more involved than average; consequently
mthcr than ltiok at thc intcrprctation of that cntry, wc will digrcss hcrc to consider
rather than look at the interpretation of that entry, we will digress here to consider
a morc "normal" cntry and how it fits into Rcillizc. 'Ihe rcdcscription tcchniquc
a more "normal" entry and how it fits into Realize. The redescription technique
itsclf will be dcscribcd latcr.
itself will be described later.
4.6.3. 'I'hc Rcali/arion I'roccss
4.6.3. The Ucalization Process
Figure 17 is a high-lcvcl flowchart of the function licalizc: It dividcs into two
Figure] 7 is a high-level flowchart of the function Realize: It divides into two
paths dcpcnding 011 whclher this is thc first instiincc of thc mcssagc clcmcnt to
paths depending on whether this is the first instance of the message clement to
appcar in [tic trec, in which casc wc go dircctly to its dictionary cntry, or whether
appear in the tree, in which case we go directly to its dictionary entry, or whether
this is a subscqucnt rcfcrcncc (as with formulit89). in which casc we apply various
this is a subsequent reference (as with formula89). in which case we apply various
hcuristics to dctcrminc if it should bc rcalizcd as a pronoun or somc other form of
heuristics to determine if it should be re;llized as a pronoun or some other form of
"subscqucnt rcfcrcncc". (Surnmirrizing iln cntirc firmula with t l x phrase "rhe
"subsequent reference", (Summarizing an entire formula with the phrase "the
assun~plion"is a form of subscqucnt rcfcrcnce.) ,
assumption" is a form of subsequent reference.)
Every cntry has a "lnatrix" decision. thc one that dctcrmincs what catcgory of
Every entry has a "matrix" decision, the one that determines what category of
phrasc will bc uscd, c.g. noun ptirasc or clausc, and may have an arbitrary number
phrase will be used. e.g. noun phrase or clause, and may have an arbitrary number
of othcr "refining" dccisions that can add additional fcaturcs to tfic phrasc or add
of other "refining" decisions tllat can add additional features to tl1e phrase or add
optional constituents.
optional constituents.
A dictionary cntry consists of a sct of possiblc "choices" and a sct of "dccision1\ dictionary entry consists ofa set of possible "choices" and a set of "decisionrulcs" to pick bctwccn thcm: A choice is a symbolic specification of phrases,
rules" to pick between them: A choice is a symbolic specification of phrases,
words, or subelemcnts of the elcmcnt being rcalizcd; A decision-mle has two
words, or subelcments of me clement being realized; A decision-rule has two
parts: onc, a list of prcdicatcs that may cxaminc bolh thc linguistic context and the
parts: one, a list of predicates that may examine both the linguistic context and the
contcxt of the speaker, and two. the choice that should bc sclccted if those
context of the speaker. and two, the choice tllat should be selected if those
prcdicatcs arc true. %hc bulk of thc realization process consists of interpreting the
predicates arc true. 'nle bulk of the realization process consists of interpreting the
decision-rules to sclcct a choice, thcn possibly going through Further sets of
decision-rules to select a choice, men possibly going through further sets of
decision-rules lo sce if h e grammatical or rhetorical contcxt dictates that the
decision-rules to see if the grammatical or rhetorical context dictates that the
choicc should bc transformed. Extcnsions to the trce occur whcn the sclccted
choice should be transformed. Extensions to the tree occur when the selected
choicc specifics a phrase.
choice specifics a phrase.
McDonald
250
MScrELMT
MAIN
STREAM
>-
SUBSEQUENT
S U B S E ~
REFERENCE
REFERENCE
no
seec
subseq
_no
ref. strategy
apply-transformotlona
....
'-------- ;
---------'
Figure 17
17 flowchart of
thc rrcali7.
c a l i.a~tion
~ ~ tprocedure
ion
ortlle
The \,ocabulary
vocabulary of thc
the specification comes
the .permanent
comcs from thc
,pcnnancnt knowledge
base in the grammar, part of
the legitimate
o f which is a listing of all of Ule
lcgiti~natecategories in
Lhe
language and for each
legitimate sequences
ofslolnamcs
the languagc
cacli category,
catcgory, of the
tllc Icgitimatc
scqucnccs of
slomamcs that it
can dominate.
dominatc. These
'l'hcse listings arc organized in terms of "constituent schcrnas'.
schcmas".
("Schemas"
they will
will be uscd
used as tcmplates
templates for the construction of
scnsc that thcy
("Schcmas" in the sense
"instances"
configurations in the trcc.)
tree.) Every choice has (at
"instances" of those category
catcgory configi~rations
least)
three parts:
pans: (1) a;I "phrase-schema" that defines
tree of
constitucnt-schcmas
Ici~st)thrcc
dcfincs a trcc
ofconstituent-scllcm~
possibly augmented
by
additional
labels
and
by
specific
words
from the English
augmcntcd
labcls
spccific
vocabulary;
parameters Uiat
tl,at will bc
be uscd
used to pick out
\ocabulary: (2) a list of formal
fnrmal paramctcn
subeJcments
being rcalizcd;
realized; and (3) a mapping from
clcmcnt bcing
subclcmcnts of the message element
parameters to slots
tbe fringe nfthe
paramctcrs
s111tsat the
of thc specified
spccificd phrase.
Figure 18
the entry,
the
Figurc
18 lists thc
cntry, choice, and grammar that are
arc required
rcquircd to realize (he
logical predicate shaves.
251
Message ElelllC/rt
shafCS(X,Y)
(matrix
default (clause-direct.objsct shaver "shave" shavee)))
dEofaull (clause-direc!-objec! shaver ",ha"e" shavee)))
(matrix
the C71oice
the Choice
Co~rslilriall(define-schema basic.clause
(define-schema basic-clause
scherrras
categories [clause)
schemas
categories (clause)
Com/ill/CIl/-
IVJJiClll'IWfJUCE;
A
clause
~
[ subject ] ~
[ predicate ]
[ subjer:t]
I
x [ predic:lte
vp
~
[ verb ] [ object1 1
[ verb I [objecH]
shave
Y
shave
\'
252
McDonald
of
when tl,e
taken. To
o f tl,e
the local
loc;tl variables of
o f the
ihc entry wlicn
t l ~ cchoice is tiri.cn.
T o produce new
constitllcnt
phrase'schema must
mllst bc
be ills~lntiated
constitacnt structllre
structure from the choice.
cboicc, its phrase-schc~na
insuntiatcd and
tlw
mapping
applied
to
fiJI
its
leaves
with
the
message
clements
Ihc mapping
ti11
lcavcs
thc nlessagc clc~ncntsthe entry has
sclected.
sclcctcd.
4.6.4. Contilluing
through the tree
4.6.4,
Continuing tltrot~gh
Once
tl,e pllrasc
phrase for formula89
been instantiated
tl,e tree.
Oncc Ule
ft1rnlul;lS9 has becn
instantiated and knit into
i n t o thc
trcc, the
controller
Process-node and bcgins
begins to trJVCrsc
controllcr SIas
; bclorc
hcforc rccurses
rccurscs on
o n I'roccsc-node
travcrsc the
dlc new
ncw noun
phras('. Nothing
Nothing ncw
new happens
phrase. so we
pItras1'.
Il;~ppcnswithin Llle
Lhc phrasc.
wc will move
lnovc on
o n to
t o the
d ~ cnext
snapshot,
moved
down
snapsllot, taken
takcn after the
lltc controller
controllcr has
11;s finished
finisl~cdwith the subject
subjcct and
a t ~ movcd
d
and through
tl,e
slot
"to-obj"
to
li1e
ohjcc!
constituent.
throllgli dle
"ttl-obj" tu Ihc ohjcct cc~nstitt~cnt.
I.
CoJltrolh~r
stare:
I.C
b ~ ~ t r n l lIlro('('ss
procrw
rr
slarc: "Dispatch"
"l)ispa!ch"
3. Gr:UlllII;n
I;r:~ttt~t~;tr\'miablcs
\':tri~hles (01/1)'
(O,,/J ('urrefl1-srnte17Ce,
currerrr-rmnnce.
currcTitsubj(xt.
currcar-.~ul!iccr.(Jlld
orld C/JrrCII('c!aIJse
C U N P I I I - C ~ O !or('
or?
, ~ ~shown)
IIZOW~)
~CI~currelll-sWlena
~CI/-SEI~ICP
[dB]
[dB]
clause
clause
[subject]
[sul~icctl[pre<1icnle]
[pr&icnle]
rlti.f
,/,is
vp
~
[verb] [complement]
4.
[verb] [complement]
ml'an
claus~
ml'nn
clause
CLirf(;1I1~cllluse---16;;;n:::;::;r.;:~
curr{'f11~~'Ubjecl
c u r r ~ ~ ~ l - s i ~ b j e ~ l l~uhjcl'11 Iprl'llicalc
~
np
vp
formulaR9
~ ~
~ {verbnto:db:]
I ~ P 05SlImptiOl/
n.xx,~.~~li,~!i
lcfin
Ihe
/mli
Said so fur:
UisroUlliC' history
1)irroilrrc
Itistory
(forulUla:l9[
(B,ntnul;JY[ ...].
. I , implkalionI1l1[...],
intplirrlionlll2[ ..&
Iim'II[...],
li#!cb[ ...I, fonjIUI[...],
co~~jllll[
...I....)
...)
2. ('onlrolll'r
C'oatrollrr Variahles
\'ari;~It!es
CUtT:,'//{-l1ode
currellt-slot
o,l~illll
C~lrrrlll-~ul1lcllls
l'oujllll_
t'ilrrCJI/-eVlllcnls
Noticc
111at we
wc havc
repl;scJ ihc
vcrh "'jtll
with its third
tltir(1 pcno!l
siltp.ul;~rfonn
Notice that
have not rcplaccJ
the verb
pcr..,on singular
cvctl dlaugll
tli:~tis the
t l ~ cform
fortn 111;
oppc.~rs in the
tllc output
oull>t!t ~trean1.
strcarn. fly
By design.
design, we
even
tllOUgh thal
thaLt apPcilrs
ltovc dcci(lcd
t11at Ulc
lcvcl of
ol'rvprr~cnt:~tion
c ~ l ~ i b i t cin
i nd Uw
UC
I lrcc
s l l ~ i l ~bc
l d the
!l<lVC
d~cidC'd that
tlle kvd
n'presC'!ll;\tion exhibited
trcc should
be
kvd nneeded
n:r~rC'm:e it;
\\"(: y.,iI1
lcvcl
c c d d by !he
!he library
li11r;try routines
nrt~tittcs!Jut
th.11 irfsrcncc
it: \tc
will Hot
ttot gr;]tuitollsly
gl'at~iil(lurly
"ttixJxc" its cOlltents
co!ttcnts or add
ad,! lahd'i
Iitbcls if
i f tth('y
l l f y ;ire
i i i ~ t $ktot do further
fi!rtlicr work
\iilrh in
i n Ule
the
"update"
~m' not p
g(lilli~
gr;irllnl;tr.
'lltc
c~,rrccl
1narpholo!:i1:.11
i
h
u
n
(11'
Uic
vcrh
w;ls
nccdccl
CIIIIY
for
the
grammar. The correll l11orpholo!,.iC<ll thnn or tJ1C vcrb W<'IS needed llnly
253
output text and so was not constructcd until thc morphology routine was passed
output text and so was not constructed until the morphology routine was passed
tllc word on its way to the tcxt strcarn. latcr grammatical rcfercnccs to thc vcrb
the word on its way to the text stream. Later grammatical references to the verb
arc going to be conccrncd with its gri~mmaticalproperties ralhcr Ih$n its
are going to be concerned with its grammatical properties rather than its
morphological oncs (for examplc whcthcr it can takc cotnplcmcnts, and if so
morphological ones (for example whether it can take complements, and if so
whcthcr thcy ihrc subject or objcct controlled) and thcse propertics arc by
whether they are subject or object controlled) and these properties arc by
convention ;esociatcd aith the roor fvtm of thc vcrb which is dlc onc in place in
convention associated with the roOl form of the verb which is the one in place in
thc vcrb slot. 'l'hc prcpasition "lo" kas introduccd by a grammar-routinc attnchcd
the verb slot. The preposition "to" was introduced by a gwmmar-routine attached
to thc labcl "to-obj" rather than having its own slot for thc a m c rcason: wc cxpixt
to the label "ta-obj" rather than having its own slot for the same reason: we expect
no othcr part of thc gcncratur, dicliunnry or grammar. to iiccd to know about h e
no other part of the generator, dictiunary or grammar. to need to know abollt the
prcscricc of that prcposition so wc makc our cxpcctation concrctc by having the
presence of that preposition so we make our expectation cnncrete by having the
prcposition complctclg "invi~iblc" within a grammar routinc rathcr rhan
preposition completely "invisible" within a grammar routine rather than
occupying a slot whcrc it could bc noticed.
occupying a slot where it could be noticed.
Hrdcscriptionaccordi~~gtofunc~ion 'l'hc last significant opcration bcfore
Redescription according to function
The last significant operation before
moving on to line7 is Ihc rcalization of conjlol, thc formula shaves(g,g) A
moving on to Iine7 is the realization of conjlOl, the formula shaves(g,g) /\
ishaves(g,g). as h c 131gli511phrasc ''0C O I I I ~ ( I ~ ~'Ihis
L ' /is~of
O~
coursc
I " . not a
-'shaves(g,g). as the English phrase "a cOIJlradic'ioll ". 'Illis is of course not a
fircral rcndcring of thc forniuln: that would havc bccn "He shaves hhbuevarrd he
literal rendering of the fomlllb: that would have been "lie shavcs himself alld he
docsn't shalle hitrrself'. Instcad it is a rcndcring of the cor~~~entiot~al
role h a t the
docsII" shal'e himself', Instead it is a rendering of the cul1l'enliollal role that the
fotmula playcd in thc proof at that point, i.c, an indication that a contradiction
fotmula played in the proof at that point. Le. an indication that a contradiction
had bcin dcrivcd. 'Ihis ability to rcalizc cxprcssions in terms of thcir functional
had been derived. '111is ability to realize expressions in terms of their functional
rcdcscriptions was also uscd in the rcalization of formula89 as "rhe assurt~piion"at
redescriptions was also used in the realization of formula89 as "the assumption" at
Ll~cbcginning of Lhc sentence.
the beginning of the sentence,
Rcdcscription is a way of seeing the same conccpt or opcration at multiple
Redescription is a way of seeing the same concept or operation at multiple
Ic\.cls simultaneously dcpending on one's intent, and has become an important
leYels simultaneously depending on one's intent, and has become an important
part of thc reprcscntatiunal "rcpenoire" of modern cxpcrt systems, whcre it is
part of the representational "repertoire" of modern expert systems, where it is
uscd in plan recognition and in dcfining lcvcls of abstraction (scc particulaly
used in plan recognition and in defining levels of abstraction (sec particularly
[Mark 19811). Intuitivcly, rcdcscriprion is associated with particular turns of
[Mark 1981}), Intuitively. red~scrip[ian is associated with particular turns of
phrasc in English such as appositivcs (as in the last scntcncc of this cxamplc) or
phrase in English such as appositives (as in the las[ sentence of this. example) or
some noun-noun combinations (c.g. "the role
conscqucntly, it is uscful to
some noun-noun combinations (e,g, "'he role pobi'): consequently, it is useful to
nl;rkc spccific arrangements for it within thc linguistics cornponcnt.
make specific arrangements for it within the linguistics component.
Ordinarily, rcdcscription would bc an opcration at a conccptual lcvel rather
Ordinarily, redescription would be an operation at a conceptual level rather
than a linguislic onc, and wc would cxpcct it to be explicitly indicated in the
than a linguistic one, and we would expect it to be explicitly indicated in the
mcssagc; however, since the prcscnt rnicrospcakcr has no real conccptual
message; however, since the present microspeakcr has no real conceptual
knowlcdgc of logic and starts with olily thc barc formulas of the proof, we must
knowledge of logic and starts with only the bare formulas of the proof. we must
coinpcnsatc by performing thc rcdcscription locally within thc dictionary. Ihe
compensate by performing the redescription locally within the dictionary. lbe
rclcvant par& of the dictionary arc thc entries for thc infcrcncc rulcs, thcsc being
relevant parts of the dictionary are the entries for the inference rules, these being
whcrc thc rnicrospcakcr's tacit knowlcdgc about thc structure of proofs resides.
where the microspeaker's tacit knowledge about the structure of proofs resides.
The rcdcscriptions of the individual formulas arc deduccd as thc entrics are
The redescriptions of the individual formulas are deduced as the entries are
intcrprcrcd and stored wihin thc linguistics cornponcnt on a spccial association
interpreted and stored within the linguistics component on a special association
list: the cntry for a Prcrnisc, for cxamplc, notcs that the formula on its line serves
list: the entry for a Premise, for example, notes that the formula on its tine serves
w'):
254
McDonald
4.6.5.
Ilcl"yin~
Decisions
One
direct-transl'ltion tccliniquc
tcchnique is that while the
Onc pproblem
r l ~ b l c ~tl1at
th;~t
n can arise with the
tlic dircct-transl;~tion
tlic data
d,~tauscd
(hcrc the
thc prcdic;~tc
fihr~nalstructure
s u u c t u ~ cof the
formal
Llsed in a mcssagc
mcssage (here
predicate calculus) may be
crinvcnicot within the
ihc dom,lin,
d o ~ n ; ~ i nit ,can be
bc at odds kith
convcnicnt
convenient
..dIh wliat
what would bc
be convenient
fur thc
gcacrator. Ncg;1tionl03,
thc I;~st
proof: is aJ case
c;~scin point.
for
the generator.
Negationl03, the
last line 11f
of thc
the proof;
-'3x (barber(x) 1\ Vy(shaves(x,y)'" -'shaves(y,y)))
25~
this polarity in the text, but from this valitagc point in the prdccss wc d o not yet
lhis polarity in the text. but from this vantage point in the process we do not yet
know what linguistic mechanism should bc used (e.g. an explicit "do" or an
know what linguistic mechanism should be used (e.g. an explicit "do" or an
emphatic "not"). Wc must tllus dclay thc dccision until wc know more, which
emphatic "not"). We must thus delay the decision until we know more. which
means h a t wc add an annotation to tlic formula wc cmbcd, cxpccting the
means that we add an annotation to the fonnula we embed. expecting the
annotation to bc recognized by later routines h a t will bc activc whcn the nccded
annotation to be recognized by later routines that will be active when the needed
information is known.
information is known.
As it happens, thc dictionary entry of the very next opcmtor in h e
As it happens. the dictionary entry of the very next operntor in the
decomposition, thc negation, has an altcr~~alivc
among its chbiccs that we have
decomposition, the negation, has an alternative among its choices that we have
dctclmincd in dcsigni~igthc grammar will scr-ye to cmph;airzc ncgativc polarity,
detellllincd in designing the grammar will serve to emphasil.e negative polarity,
LC. '?l,crdS. or llw nrga:alion> is Jrlsr". 'I'hc ncgation clmy is allowcd to sclcct this
Le. "<hndy or Illr m'l:alion) is !iI/Sf''', The negation entry is allowed to select this
choicc i f thc body of thc ncgstio~~.
tlic prcmisc lint fi~nnula89.call bc cvprcssed as
choice if the hody of the negatio!l. the premise line fonnula89. can be e~pressed as
s sitnplc noun phragc (no~ni~ializcd
clauscs arc disi~llowcd):that is the case here.
a simple noun phrase (nlJlninalized clauses are disallowed); that is the case here.
sincc forrnul;189 was just rcfcrrcd to in thc last scutcnce as "rhc assu))~l)lion"
and
since formula89 waS just referred to in the last sentence as "lhe assumpliull" and
that nominal form will carry ovcr. If only a clau~illrcaliz;rtion had bccu possible,
that nominal form will carry ovcr. If only a clausal renlization had becn possible.
then thc ncgation dccision would hahe bccn dclaycd as well.
then the negalion decision would ha\e been delayed as well.
McDonald
McDonald
256
256
[d7]
A
clause
~
[themalic-ndvsrb] [head]
[thematic-adverb) [head]
tl~crdore
therefore
n~I:I0l.....,
rnlphasize-polarily "ncg~live"
I~ll1ph~silt'Jlolaril)'
"n~J:alive"
Becomes:
Becomes:
{d7)
clause
[thematic-adverb) head)
clause
~
(head] [appositive]
[head)
clause
[appositive]
cIause\"cgl03,
nrg 1OJ
use-literal-form
~
luse-literal-form
[subject] [predicate]
[subject) (predicate)
rormul~"9
~
[verb] [pred.adj]
[verb] (prlld -adj]
"rn'u'"9
be
folst
257
draw appositives if thcir long forms havc not bccn mcnlioncd within three
draw apposItives if their long forms have not been mentioned within three
scntcnccs. 'Ihc samc hcuristic applics to pronaminslization decisions and is
sentences. '1l1e same heuristic applies to pronominJlization decisions and is
intcndcd to rcflcct whcn a rcfcrcncc has fadcd in memory-rcscarch on discourse
intended to reflect when a reference has faded in memory-research on discourse
structure should lcad to a morc principled critcrion, Thc fact that t l ~ cappositive
structure should lead to a more principled criterion. The fact that the appositive
has bccn planned at the lcvcl of thc clausc inhibits it from appcaring redundantly
has been planned at the level of the clause inhibits it from appearing redundantly
with the noun phrasc: h a t is, thc local dccision that would how produccd "The
with the noun phrase; that is, the local decision that would have produced "The
a.~su~t~ptiot~
rho1 !here is IIU such barb~ris fnlsc rhcre is no such h r b e r " is tiltercd
assumption 'hat there is I/O such barber is false, there is no such barber" is filtered
out by thc prcscncc of tllc highcr appositive in thc tree.
out by the presence of the higher nppositive in the tree.
4.6.6. Inttri~ctiosHufrvccn Ilccisions
4.6.6, Interaction IJctwccn Decisions
l'ravcrsing the trcc bcloa thc cliiusc that rcali~cdthe first instancc of ncgationl03
Traversing the tree below the clause that realized the lirst instance of negation103
is a sitnplc mattcr. ?'he cmbcddcd for~nu!a in rhc subjcct is pronorninalizcd
is a simple matter. The embedd~d formula in the subject is pronominali7.ed
and h e fact that both installccs wcrc in
bccausc of its proxitnity to its last it~s~;incc
because of its proximity to its last instance and the fact that both instances were in
subjjcct position (in cfl'cct a "poor man's" rulc oidiscoursc ftcus). 'Illc pronoun,
subject position (in efl'eet a "pom man's" rule of discourse foclIs). '1l1e pronoun,
vcrb, and adjcctivc are thcn passed 10 thc output strcam ;a thc controllcr moves
verb, and adjective are then passed to the output stream as the controller moves
Lhrough thcir constilucnt positions.
through their constituent positions.
At thc position of thc i~pposititc.Llle ncgation cntry this titnc passes its
At the position of the appositive, the negation entry this time passes ;ts
dccision down to 3 latcr pnrcss sincc its body has been specially annotated
decision down to a later process since its body has been specially annotated
bccausc of its app[)sitiilcfunction so as to block thc rcdcscriplion of fi~mula89as a
because of its applJsitive function ~o as to block thc rcdescriplion of furmula89 as a
noun phrasc. We go thcn l o thc ncxt Icvcl of ncg103, thc cxistcntial quantifier.
noun phrase. We go then 10 the next Ievcl of ncg103, the existential quantifier,
whcrc wc havc two choiccs: cithcr to pass thc rcaliation of tllc quantifier down to
wherc we have t\Yo choices: either to pass the realihltion of the quantifier down to
appcar as thc dclcmincr in Lhc realization of thc variablc (as in "S'un~eorreshaves
appear as the determiner in the realization of the variablc (as in "Sumeone shaves
i
hi111se!f') or to i ~ s cthc spccial cxistcntial construction
everyorle who d o ~ s'r~slra~~c
everyone who dorsl/'t shave himself) or to use the special existcntial construction
"There is". As onc [night imagine, this dccision is dcsigncd to be sensitive to the
"There is". As one might imagine, this dccision is designed to be sensitive to the
pcnding dccision on thc ncgation. and we sclcct the spccial construction since the
pending decision on the negation, and we select the special construction since the
ncgation would prccmpt the dctcrmincr and makc thc olhcr alternative
ncgation would preempt the determiner and make the other alternative
ineffective.
ineffective.
In English, clauses with the cxistcntial ''there" arc grammatically unusual
In English, clauses with the existential "there" arc grammatically unusual
because thc verb agrccs in numhcr with thc objcct rather than the subjcct. 'Ihis is
because the verb agrces in numher with the object rather than the subject. This is
handlcd hcrc via the samc mcchanisrn as prcscntly used in transfomiational
handfed here vb the same mechanism as presently used in transfornlational
grammars: l h c word "~hcre"is taken to bc a lexically fillcd "tncc" pointing to the
grammars: The word "thrre"is taken to be a lcxically filled "trace" pointing to the
logical subjcct of thc clausc. such that whcn thc grammar routinc that i~nplcmcnts
logical subject of the clause, such that when the grammar routine that implements
"subjwt-vcrb" agrccmcnt rcfcn to thc variable "current-subject" it is passed
"subject-verb" agrecment refers to the variable "current-subject" it is passed
transparently to thc objcct instcad. Cl'hc rcfcrcncc is not actually to the
transparently to the object instead. (The reference is not actually to the
"object"-it cantlot bc. sincc thc position of thc objcct has no1 yet bccn reached
"object"-it cannot be. since the position of the object has nOl yet been reached
by the controllcr and lhus ia contcnts cannot bc known. lnstcad thc trace points to
by the controller and thus iLS contents cannot be known. Instead the trace points to
Lhc mcssagc clcmcnt that would havc bccn thc subjcct if "ihcre" hsd not been
the message clement that would have been the subjcct if "there" had not been
uscd, and Lhc transformation that introduced the word "111ere" rcdircctcd that
used, and the transfonnatilJn that introduccd the word "t!lere" redirected that
clcmcnt lo Lhc objcct position.)
clement to the object position.)
258
McDonald
4.6.7. lle~liling
Iloles
Ile~liringMessage
Mcss:~gcElements
Elc~rcntsin (erms
tcrnls of their Roles
The
constituent of
of the "rhere"
"there" clause is yet
'lhc conjllnction
conjunction thJt
that is now tlle
thc object
objcct cunstitucnt
another
conventional
expression
within
lhe
proof.
anothcr convcnlional cxprcssion
lhc
barber(x) 1\ 'o'y(shaves(x,y)'" -'shaves(y,y)
'Il,e
011 the
~le variable X, which,
which. in other
'llic predication
predicatio~l"barber(x)"
"b;lrbcr(x)" is a;I restriction on
logical
other placcs
places in thc
the crprcssion.
expression. l'hc
The
logical notations,
nutations. would have
Ih:lvc appeared
appcarcd in otlicr
actual
the universally qoanlificd
quantified Rlr~nula.
formula. If
If we
actual "content"
"content" or
of the
tlic conjunct is just thc
\l.l1C'
W nol\1ing
we wuuld
would bc
be forccd
forced to rcalizc
realize it litcrally,
literally,
kncw
n~rlliingelse
clsc about
bout this conjunction,
coajunclion, wc
as
Gild
shaves rsegrj!le
fl'er)'one who doer117
dOCSI/'! skal'e
shGl'e hi~nself'.
himself'.
as in
in "Someone
" S ~ ~ I P is
~ IaoI barber
P
and he sha~ws
However,
of ihc
the conjunction apparcnt
apparent to
Howcvcr. if we m,lkc
makc the
Uic conventional
convcntionnl structure of
the
much inorc
more fll~cnt.
fluent. I3y
By labcling
labeling the
thc linguistic
linguistic component,
componcnt, we can be
hc inuch
predication dS
a
"description"
and
lhe
fonnula
as
a
"proposition".
bolh
as
"dcscription"
thc formula
"proplsition". both
predictahle
lake advantage of
of a gcncral
general purpose
vnrinblc X. we
wc can take
prcdictahlc on the variable
dictionary entry fi}r
for that
ilia1 combination.
u~mbination.
dictionary
As shown
shown in figure
figl~lr21. a description
dcscription and a proposition can bc
As
be conihincd
combined in
sc\'crat
(i.e. which of
of llic
the two clcrncnts
e1cm~t\ts is
~~ murc
mm~
sc\cral ways
ways according
according to
tu what is needed
nccdcd (i.c.
important.
etc.).
case sincc
since hcconjunction
the conjunction
imporwnt. which order
ordcr is more important,
i ~ n p o r t l ~ctc.).
~ t , In this casc
isis acting
where the two are
arc sct
set iIIp
modifiers in a
acting as
as an object.
c~bjcct.the
thc combination whcrc
~ pas modilicrs
phrasc denoting
dcnoting the
thc \ariable
tariablc is thc
noun phrase
the most appropriatc,
appropriate. and that
~lat is the phrase
rcplnccsconj88
is built and reploces
mnj88 in the tree.
that is
259
Combinations
Combinationsof a description and a proposition
predicated of the same object.
proposition[objec!
ldescription
descriptionlobjec!
II'ho
proposition
lnp
I,, object )
l1
tnOdifier
tnodifier
Ilfoposition
proposition
A barl't'r
doesn'f dove
5hal'e ltimsel/
himself
&ll.?r WilD
wlio slimes
1180t,es I'VefJ'Onc
e18ewneWilD
WIJO
do~sn',
Figure 21
This panicular
particular conjunction
conjunction has of course
coursc nppcarcd bcforc
This
before in the first line of
of
thc pHlof.
proof. We
WCarc
arc therefore
ihcrcforc dealing
dcilling with a subsequent
subscqucnt rcfcrcncc
the
reference ;lad
aod the
!he heuristics
that section
scction of the Realize
llcaliic function
function apply.
apply. llccousc
in !hat
Because of
of thc
!he distance of
of the
original
from
the
present
position.
the
conjunction
-should
not
be
original instance
inslltncc from thc prcscnt position. dic
shoold
pronominalized.
subsequent reference
prunominalizcd. however
howcvcr there
thcrc arc ofcoursc "intermediate"
"intcrmcdiatc" subscque~~t
stralcgics. One of these.
thcsc. particularly
parriculnrly appropriate to tllc
strategies.
the sl!lc
stylI:' of
of a mathematical
"such": this
this word can "pronominalized"
"pronomin;tlired" rhc
proof, isis the word "such";
proof,
tl'e modifying phrases
the reference,
rcfcrcncc, leaving its head and detenniner.
determiner.
of the
4.7. Contributions
Contributions ;md
and Limitations
1.imitations
4.7.
4.7.1. Specific
Specific Contributions
Contributions ofTbis
oll'his Research
4.7.1.
nlc computer
computcr program developed
dcvclnpcd in thjs
this rcscarch
lllC
research (src
(sec [Mrl)onald
fMd)onald 19811)
1981]) is
is the
linguistically competent
colnpctcnt natural
~~ntural
languagc production program that hhasa been
most linguistically
language
rcportcd
to
date.
'l'his
is
due
primarily
u1 thc
to
the advanccs
advances in
in thc
the computational
reported
date. This
19801 which havc
thcory of production
production reported
rcportcd here
hcrc and in IMcl)on;~ld
theory
IMcDonald 1980)
have simplified
Ihc process
proccss of representing
rcprcscnting linguistic rules
mlcs and usagc-heuristics.
the
usage-beuristics. In particular:
(1) 'Ibis
l h i s isis the
thc first theory to be specifically
spccilically designed for usc with source
(1)
1
that
use
different
reprcscniational
programs
programs that usc different representational systems.
systems.l
McDonald
McDonald
260
260
261
'Two othcr pcrspcctivcs have been takcn (scc [Mann ct al. to appcar]): one
Two other perspectives have been taken (see [Mann et al. to appear)): one
school can bc tcrmed gra~?tt~~arcon/rollcd
lincariza!iott a 4 ~rurr.slurion[Simmons
school can be termed grammar-controlled lineariza/ioll alld translalioll [Simmons
and Slocum 1972; Goldman 1974; Shapiro 19751; anothcr, largcr though less
and Slocum 1972; Goldman 1974; Shapiro 1975): another, larger though less
linguistically sophisticated school can bc tcrrncd production direclly porn program
linguistically sophisticated school can be termed produc/ion directly from program
dala [Swartout 1977; Chester 19761. Cl'wo othcr important systcms, [Clippinger
data [Swartout 1977: Chester 1976]. (lwo other important systems, [Clippinger
19781 and (13avcy 19741, fall into ncid~crof thcsc catcgorics as thcy both cmptoy
1978] and [Davey 1974), fall into neither of these categories as they both employ
cxtcnsivc grammars and vcst control with nun-grammatical proccsscs;
extensive grammars and vest ,control with non-grammatical processes;
unfortuna~cly.ncithcr has becn furthcr dcuclopcd.)
unforlunatcly. neither has been further developed.)
vcsts total control of me p r t ~ c s sin a topdown
'Ihc gmn~r~~~rco~irn~llrdl~dschool
The granll11ar-colltml!ed school vests total control of the process in a topdown
gcncrativc grammar. typically givcn as an augmcntcd triinsition nct ("A'~'N"). This
generative grammar, typically given as an augmented transition net ("UN"). This
grammar hypothcsizcs a way in which thc mcssage might bc rcalizcd, and then
grammar hypothesizes a way in which the message might be realized, and then
tcsts thc mcssagc to scc if that way is fcasiblc. It constnlcLs h c hypothcsizcd text
tests the message to see if that way is feasible. It constrllcl~ the hypothesized text
if thc tcst succccds; othcrwisc it backs up and considers thc ncxt grainmatically
if the test succeeds: otherwise it backs up and considers the next grammatically
possiblc realization. 'I'cxts arc pn~duccdas a sidc-cffcct of trnvcrsing thc ATN.
possible realization. Texts arc produced as a side-effect of traversing the ATN.
Comnarcd with using h e mcssagc decomposition itsclf to controt tllc prtxcss, this
Compared with using the message decomposition itself to control the process. this
tcchniq~~c
is incfficicnt at best, and at worst, allows thc possibility of producing
technique is inefficient ;]t best, and ;]t worst, allows the possibility of producing
totally confused tcxt should lhc .&I'N cvcr backup ovcr an arc-path that produced
totally confused lext should the ATN ever backup over an arc-path th<lt produced
words ( i t . it would start rcpcaiing itsclf without rcgard forcontcxt). Historically it
words (i.e. it would start repeating itselfwithout regard for context). Historically it
is the case that nonc of thcsc systcms has cvcr had occasion lo backup: we
is the. case that none of these systems has ever had occasion to backup: we
conjcc.lurc that the rcason for this is that thc spacc of possible mcssage
conjecturc that the reason for this is that the space of possible message
configurations dcalt with by thcsc systcrns is rclativcly small, making it possible to
configurations dealt with by these systems is relativcly small, making it possible to
dircctly cncodc thc spacc on thc arcs of rhc A'I'N grammar as tcsts for all of the
directly encode the space on the arcs of the ATN grammar as tests for all of the
possiblc contingencies. Wc predict that whcn thc contingcncics bccome too
possihle contingencies. We predict that when the contingencies become too
diverse to anticipate when thc grammar is written, that grammar-controlled
diverse to anticipate when the grammar is written, that grammar-controlled
systcms will metamorphose into a more mcssagc-controlled style.
systems will mctamorphose into a more message-controlled style.
I h c direcr production school is much closcr to thc philosophy underlying the
lbe direct production school is much closer to the philosophy underlying the
present work. 'Ihcir approach is to start with a data structure from the expert
present work. 'lbcir approach is to start with a data structure from the expert
program (thcir "mcssagc") and to evaluate it with a special "text gcncration"
program (thcir "message") and to evaluate it with a special "text generation"
evaluator just as in othcr circumstances thcy might evaluate it with, e.g., the
evaluator just as in other circumstances they might evaluate it with, e.g., the
)
in ordcr to cxccutc some function. 'Ihe structure of the
normal 1 ~ 1 ~ 1evaluator
normal uSP evaluator in order to execute some function. The slructure of the
rnessagc govcrns what gcncration proccsscs are run and in what sequence
message governs what generation processes are run and in what sequence
(invariably a strict depth-first scqucncc, translating arguments bcfore functions
(invariably a strict depth-first sequence. translating arguments before functions
and using thc intcrnal I,lSlpstack to rccord what to do ncxt and what to do with
and using the internal LISP stack to record what to do next and what to do with
"subtcxts" as thcy arc constructed). Thc "gcncration functions" for individual
"subtexts" as they are constructed). The "generation functions" for individual
kinds of program objccts asscrnblc tcx& by cmbcdding the tcxts produccd for
kinds of program objects assemble textS by embedding the texts produced for
their argurncnt objccts within a matrix tcxt; cnnccptually, gcncration firnctions
their argument objects within a matrix text; conceptually, generation functions
play cxactly the sarnc rolc as dictionary cntrics in thc modcl prcontcd hcrc. We
play exactly the same role as dictionary entries in the model presented here. We
suggest that thc difficulties thcsc systcrns facc-almosl complctc ignorance of
suggest that the difficulties these systems face-almost complete ignorance of
grammar, and an inability to produce tcxt that is not absolutcly isomorphic in
grammar, and an inability to produce text that is not absolutely isomorphic in
structurc to its mcssagc-could bc ovcrcomc if they were to adopt an
structure to its messagc-could be overcome if they were to adopt an
McDonald
McDonald
262
262
263
McDonald
264
4.7.4. What
\\'bat This
'll~isModel
hlodel Cannot Do
Efficicncy
thcrc arc certain
ccrtain kinds of
of potentially
potcntially
1:fficicncy has its
its pricc.
price. Ilccaosc
I3ecausc of its dcsign.
dcsign, thcre
useful
lbis
th3t this linguistic
linguistic component is intrinsically incapable of. Ihis
useful operations that
isis not L1ken
of
Llkcn as
as a failing,
failing, but as the necessary result of a deliberate distribution of
laSks
according
to
the
components
that
arc
architecturally
most
suited
to
wsks according to
components
are architecturally
perfonning
tl13t ihc
tl,e bulk of
of what
pcrf(~nningthem;
them: tl,at
Lhat is,
is. I claim (but will not justify here) U~at
this
this linguistic
linguistic component cannot do can be done better by other the components
that it will interact with. Specifically:
('[ratile
situations:: 'lhis
'lllis linguistic
nords to new situations:
Creative expression-fitting
expression-fitting old words
component
mean. Hy
By inverting its
conip~incntdocs not know what words mcan.
dictionary
circumstances a word could be
dictionary it could compute in what circumswnccs
used,
owo for interpreting these
tl,ese
uscd. but it has no means of its own
"circumstances"
if it is able to
"circu~nstanccs"and
2nd generalizing
gcnerali~.ingthem. (How could it if
be used
A
uscd with expert programs with different conceptualizations?) A
dictionary
dictionary entry selects
selects words reflexively according to its
precomputed
usc any sort of
of
prcct~mfiutedpossibilities:
possibilities: in particular, it docs not use
pnttcrn-matching on "semantic features", b~)th
pattern-matching
both because of
of the
cxpcnsc and because
bccause features
computational expense
features that capture useful
gcncralizations are
arc unlikely to be refined enough to
generalizations
to pickout
pickollt specific
words.
words.
hlonitoring itselF:
itself: It is generally easier to anticipate and forestall
Monitoring
forestall problems
planning than to monitor for them and thcn
by planning
then have to edit an
7his linguistic
linguistic component capitalizes on this rule
ongoing procedure. This
ongoing
from its process architecture the expensivestate
of thumb by omitting from
expensive state
history that would make editing through backup possible. The kinds
history
of unwanted effects
effects that are
arc difficult to avoid through
tllfOUgh planning
planning
(bccause they would require essentially full simulation) are
(because
coincidental stroctural
stn~cturalor lexical
lexical ambiguities: these
tl,ese rcquire
require a multimulti
coincidental
burner to detect (the
(the sort which is natural to
to parsers) and
constituent buffer
arc thus better noticed by "listening to oneself' and interrupting the
are
generator with new instructions when needed, rather thcn
then burdening
process with aalargc
that process
large buffer which will otherwise
otherwise go unused.
Recognizing when a message will un~voidahly
e ~ d to awkward or
una,oidahly llead
Reeogniling
ungra~nmaticaltext:
tcxt: Again,
Again, given the present dcsign
ungrammatical
design this possibility
cannot be foreseen
foreseen at the linguistic-level
linguistic-lcvcl without a complete
simulation (i.e. rehearsing to oneself). Either the speaker's messagemessage'
building heuristics
heuristics will be such that these problems just
just will not occur
(this isis almost
almost inevitable
inevitable when messages are planned and motivated
(this
detail in accordance
accordance with the "constraint-precedes" stipulation), or,
in detail
265
by planning the mcssage in t c m s of rhetorical predicates such as
by planning the meS!;<lge in terms of rhetorical predicates such as
"rnodifics" or "focus". potentially awkward pl~rasings will be
"modifies" or "focus", potentially awkward phrasings will be
forcsccn at the linguistic lcvcl and planned around by general rules.
foreseen at the linguistic level and planned around by general rules.
Ilcxsoning shout trade-oms caused Iry lirnitcd cxprcssihility: It car1 happen
I~casoning ahout tradc'orrs caused by limitcd cxprcssihility: It can happen
that thc inability to simultaneously cxprcss, say, both modality and
that the inability to simultaneously express, say, both modality and
subordillation will not bccomc apparcnt until thc realization of the
subordination will not become apparent until the realiz~tion of the
mcssagc is alrcady begun. 1'0 bc ablc to rcasscss thc rclative
mesS<lge is already begun. To be able to reassess the relative
importancc of thc mcssagc cIcrncnL! that promptcd thosc choices, this
importance of the message c!emenL<; that prompted those choices, this
linguistic componcnl would (1) nccd a common vocabulary with the
linguistic component would (1) need a common vocabulary with the
spcakcr in which to cxprcss the problcm (since what should bc done
speaker in which to express the problem (since what should be done
is ultimately thc spc;ikcr's decision). and (2) nccd to be aware of the
is ultimately the speaker's decision), and (2) need to be aware of the
porcntial prohlcm early cnough to be ablc to plan altcrnativcs.
potential prohlem early enough to be able to plan alternatives.
Without such a vocabulary, thc cotnponcnt must rely on the tacit
Without such a vocabulary, the component must rely on the tacit
spccificalion of rclati~c-inlportanccprovided in 11ic ordcring of the
specification of rcl<lthe-importance provided in the ordering of the
mess~gcrind thc spcakcr must be prcparcd for its lncssagcs to
message and the speaker must be prepared for its messages to
somctimcs not be rcalizcd camplctcly.
sometimes not he realized completely.
I'lanning by hack~vnrdschaining frooi desired linguistic cflccts: Onc cannot
Planning by backwards chaining from desired linguistic eff~cts: One cannot
give a spccific grammatical rclation as a high-lcvcl goal in a message
give a specific grammatical relation as a high'kvel goal in a message
and cxpcct this linguistic componcnt to pcrfonn thc mcans-ends
.and expect this linguistic component to perfonn the means-ends
analysis rcqi~ircdto bring it about: c.g. onc cannot give it instructions
analysis required to hring it about: e,g. one cannot give it instructions
such as: "thc subject of what 1 say ncxt should bc thc same as the
such as: "the subject of what I say next should be the same as the
dircct object that 1 just said". Such reasoning can rcquirc exponcntial
direct object that I just said", Such reasoning can require exponential
timc to carry out and a high processing ovcrhcad. I'hc cffccls of such
time to carry out and a high processing overhead. The effects of such
inslructions can somctimcs bc achicvcd "off-line" howcver, by having
instructions can sometimes be achieved "off-line" however, by having
the designer prccornputc the decision-spacc that the dclibcration
the designer precompute the decision-space that the deliberation
would cntail and thcn incorporate it into the cornponcnt's library as
would entail and then incorporate it into the component's library as
what would in effcct bc an cxtcnsion of thc rulcs of thc grammar.
what would in effect be an extension of the rules of the grammar.
('hc abovc instruction, for examplc, is roughly cquivalcnt to the
(The above instruction, for example, is roughly equivalent to the
existing f i ~ u heuristic.)
s
existing focus heuristic.)
CHAPTERS
CHAPTER 5
Focusing
Focusing in
in the Comprehension of Definite
Anaphora
Candace
Candace L.
L. Sidner
or I>dinitc Anaphom
In
. . TiIlCIl discourse,
people
to "point
"point hack" in the
I n spoken
spokcn and
illid .uri1tc11
di%~~(~rsc.
p c ~ p l clise
ISC certain
ccrtiti~iwords (11
to
the
people.
places,
objects.
times.
c\'ClHs
and
ideas mcntioncd
mentioned
discourse
context
disc[~~~rsc
contcxt 111 tlic pcoplc. placcs, ol~jjccts.
c\c~its idcas
there.
bad
callt'J .*I-ananhorJ, and I
lw
will
refer
pointing
hack device
dcbicc is c;lllcd
i l l rcfcr
tlicrc. The
'I'llc usc
t~scof
o r such
s ~ ~ cai~l pointing
i
to
in p,lrlicular,
pdrticular, dcfinitc
definite ;~n;lnhora
anaphora
111 'Words
wolds or phrases
plir;~ccsused
uscd in
in (hi)
lhi, way
w;~y as
as-: anarhors: in
include the
thc personal
pcr%~n;ilpronouns.
pronc,uns, ccrLain
ccrlai~luses
uscs of dcfinitc
itll:ludc
definite mlllti
noun phrascs,
phrases, and noun
ll1is
rC'sc~rchcrs havc
have dcfincd
drOned atJ1cc
phrases
phrascs containing
cunt;~i~iing
riris and
and thal.
rhnr. Tr~lditi{)]la1Jy,
'l'r;iditi~rn;~lly, rcscarchcrs
problem
as one
determining the a~itcccdcnt
antecedent ooff an
problcn~of Jll;:lpllor
a ~ ~ a l i l comprehension
a~mprclicnsion
~or
onc of dctcrniini~ig
an;~ph(iricexpressioll,
cxprcssion, that
tli:~tis,
is. determining
dctcr~niningto wliicli
which word or phrasc
phrase ;In
an anaphoric
anaphoric
ill
both artificiel
artificial intclligcncc
intelligence and
expression
cxprcs,ioil refers
rckrs or "points".
"points". Recent
I < c c c ~studies
snidich
~l
it1 botll
linguistics have
Iiatc demonstrated
dcmrlnstratcd the
thc need
nccd for
Lir a thcory
theory Rlr
for i~ndphor
anJphor co~nprchcnsion
comprehension
linguistics
accounts
fur the
tlic role
rolc of
o f syntactic and
and semantic
sclnantic cffccls,
~ l i i c laccounts
l
which
for
effects, as wcll
well as
JS infcrcntial
inferential
knowlcdsc in
in explaining
cxpl;~ininghow anaphors
an:~phorsarc undcrst(r~d.
understood. IInn !his
this cliaptcr
chapter a new
knowledge
thcory. based
bascd on
i1n the
thc concept
a~nccptof
o f focusil1g
firusing in
in thc
thenry,
the disc~lursc.
discourse, will be intruduccd
introduced to
cxplain the
Uic interpretation
intcrprctation of
o f definite
dcfinitc anaphors.
explJin
rhcory can
ciln be
bc given,
given. and before
bcforc even
cvcn thc
Ilcrorc aa theory
Before
the difticultics
difficulties in inlcrprcting
interpreting
anaphors can
can be
bc discussed,
dicct~sscd.we
wc must
must first rc-considcr
re~consider what an antcccdcnt
antecedent is. 'l'he
The
anaphors
tr;~ditiun;~ldefinition
dcfinilion encounters
cncountcrs difficulty
difticulty right from thc
the start:
start; iitt is founded on
traditional
the notion
notion that one
onc word
word in
i n an sentence
scntcncc refers
rcfcrs or points back to anotlicr
another word in the
the
( u m c or ,lIlother)
anothcr) sentence.
scntcncc. But
Hut words don't
don'l rcfcr
refer h:sk
back to othcr
other \yards
words [Morgan
(same
19781: people
pcoplc usc
i15cwords 10
111 refer
rcfcr to
10 objects.
objccts. In
In particular
1978):
particubr pcclplc
people usc anaphors to
rcrcr to
lo ohjects
ol~jccts.'hich
which have
i~il\'calready
:~lrcady been
bccn mentioned
mcntioncd in a discoursc.
fefer
discourse. Sincc
Since an
;~n;tplioric rhra~c
p1ir;lsc docs
(11,~snot
no1 refer
rcrcr to .10
;In ;jlltccedent,
antcccdc~it.one miglll
might want lo
10 claim th;it
that
anaphoric
tlic
i~ntcccdc~~t
i<lI1d
~ n dthe tln<lphor
;~napliorco-refef
co-rcfcr to some
somc ~
~
l
~
j
c
'I'his
c
t
.
definition
lboth
x ~ t lthe
i <lntC'ccJcl1t
object. This definition is
adcq~~atc
scntcllcc (I)
(I i below,
bclow.
adequate
for senteoce
( I ) II tllink
h i n k green
grccn ill2J2!>
;innlcs laste
tastc best
bcsl 'md
and [he)'
rhry makc
(I)
make thc
the bcst
best cooking
applcs too.
apples
not for
for discourse
discoursc 1)1,
111, wbere
whcrc there
thcrc is no antcccdcnt
though not
though
antecedent word in thc
the discourse
discourse
which co-rcfers
co-rcfcn with
with tlle
Uic pronoun
pronoun the)'.
rhey.
wbich
268
Sidner
269
'hcsc clcmcnts, prcscnt in thc mcrnorics of spcakcr and hcarcr, arc of course
'rllese clement", present in the memories of speaker and hearer, arc of course
rclatcd to othcrcognitivc clcmcnt$ in thcir mcmorics.
related to other cognitive element" in their memories,
What is Uic rclation of spccifications to thc rciil world? Onc might likc to
What is the relation of specifications to the real world? One might like to
claim that a rcfcrcncc rclation cxisls bctwccn spccificd cognitivc clcmcnts and
claim that a reference relation exists between specified cognitive clements and
objccts in rhc world, but sincc rcfcrring is what pcoplc do with words, h i s relation
objects in the world, but since referring is what people do with words, U1is relation
is problematic for cognititc clcmcnts. Instcad, spccifications will bc said ro
is problematic for cognithe clements. Instead, specifications will be said to
rccrcscnt the objccts rcfcrrcd lo, that is, tllcy bcar a wcll-structured
represent the objects referred to, that is, Uley bear a well-structured
cur~rcspondcnccto ohjccls in thc world. Applcs2. tlic spccification of gree~rapples,
correspondence to onjects in the world. I\pples2, the specificmion of green apples,
rcprcscnts thc ohjccts that arc grccn applcs. For phrascs such as ,Yat'nrrraC'laus,
represents the ohjects that arc green apples. For phrases such as Sallfa Claus.
whcrc thcrc is no rcal world objcct to rcprcscnt, a spccification rcprcscnts thc
where there is no real world object to represent, a specification represents the
propcrtics tlon~iallyasstriatcd with this imaginary pcrson.
properties normally associated with this imnginary person.
'I'llc phrasc 1hrj8 in ( I ) also specifics a cogni~ivcclcmcnt, namcly thc samc onc
The phrase fhey in (l) also specifics ,I cognitive clement, nnmely Ule same one
that grcrt, c~pplesdocs. Sincc thc two bcar thc salnc rclittion to thc rcprcscntation
that gr('en apples d(les. Since the two bear the same rc!ation to the representation
Applcs2. thcy will be said [o tso-~prc'fi~
that tncmory clcmcnt or altcrnativcly, h a t
I\pples2, they will be said III co-specifY lIlat memory clement. or alternatively, that
o f intcrprctation of
Ihc intcrprct;~tionof gr('eu npplr.~in (1) is'hc c ~ - s p ~ c f i e rthc
the interpretation of green apples in (1) is" lI1e cu-sprcijier of the interpretation of
rhcj: Co-spccifici~tion. ttnlikc co-rcfcrcncc. allows onc to construct abstract
fh(')'. Co-specification, unlike co-reference, allows one to construct abstract
rcprcscntalions and dcfinc relationships bctwccn thcm which can bc studicd in a
representations and define relationships between them which can be studied in a
computationnl framework. With co-rcfcrcncc no such use is possible. since the
computational framework. With co-reference no such usc is possible, since the
object .rcl'crrcd to exists in thc world and is not available for examination by
ohject ,referred to exists in the world and is not available for examination by
computational processes.
computational processes.
Evcn if a noun phrasc and a pronoun do not co-spcrify, thc spccification of a
Even if a noun phrase and a pronoun do not co-sperify, the specification of a
noim phr'asc may bc uscd to gcncratc Lhc spccification of a pronoun. For
noun phrase may be used to generate the specification of a pronoun. For
cxamplc, in 111 rkry docs not co-spccify with h c apparcnt antcccdcnt phrase a
example. in [)] they does not co-specify with tile apparent antecedent phrase a
nlotis~erIlarlcy 1200. ticrc h c anaphor f h c rcrcrs
~
to thc class of Harlcy 1200s of
mOils/a Harley /200. Here tile anaphor they refers to the class of Harley 12005 of
which thc apparcnt anteccdcnt (the neighbor's monstcr Harlcy 1200) is an
which the apparent antecedent (the neighhor's monster Harley 1200) is an
instance. 'Thus anaphor intcrprctation is not simply a mattcr of finding the
instance. Thus anaphor interpretation is not simply a matter of finding the
corresponding cognitivc clcrncnt which scrvcs as thc spccification of thc anaphor:
corresponding cognitive clement which serves as Ule specification of U1e anaphor;
somc additional proccss must gcncratc a spccification for Lhc anaphor from the
some additional process must generate a specification for the anaphor from the
rclatcd phrasc a tnonsfcr Harley 1200.
related phrase a mOllster Harley 1100.
'rhc conccpts of spccification and co-specification capturc thc "pointing back"
The concepts of specification and co-specification capture the "pointing back"
quality of antcccdcncc, and also pcnnit us to formulate an cxplani~tionof anaphor
quality of antccedence, and also pennitlls to formulate an explanati(ln of anaphor
intcrprctation which avoids thc pitfalls of thc conccpt of antcccdcncc. Anaphar
interpretation which avoids the pitfalls of the concept of anteccdence. Anaphor
intcrprctation can bc studicd as a computational prtrccss that i~scsLhc already
interpretation can be studied as a wmputational process that uses tile already
cxistirlg spccification of a noun phrasc to find thc spccification for an anaphor.
existing specification of a noun phrase to find the specification for an anaphor.
'lhc prtrcss uscs a rcprcscntation of lllc discourse prcccding thc anaphor which
"111e process uses a representation of the discourse preceding the anaphor which
cncodcs thc syntactic and scmantic rclationsllips in cach scntcncc as wcll as
encodes the syntactic and semantic relationships in each sentence as well as
Sidner
270
271
must bc
be distilled into
preserves its
richness wilhout
wiU10ut adding
inlo a form
fr~rrii that prcscrvcs
i t richncss
ovcrwhcl~ningcomplexity to thc
intcrprclation prwcss.
In addition,
i~dditios.researchers
rcscarchcrs
overwhelming
the interpretation
process. In
havc discovered
disctrvcrcd !liar
an;~pliorinterpretation
intcrprct;~tion involves
involvcs making
milking inferences,
infcrcnccs, sotne
of
have
that anaphor
some of
be complex.
complcx. each
cach of
o f which rnllst
choscn from a large base
of
which
which can he
must bc
be chusen
hase of
knowledge about objects.
people and
and things. '111e
practical deployment
knowlcdgc
ohjccts, pcoplc
'llic pr;~ctic;~l
dcploymcnt of
of
itlfcrcoti;~Icapal~ilitics
rcquircs control:
conlrol: knowing what
infcr whcn,
inferential
capabilities for any task requires
y,hat in
tn infer
when,
and knowing
knowing nlicn
Sincc thc
gcncrl~lcmuml
c~r~ilrtrl
problc~n is poorly
and
\o\'hcn to stop. Since
the general
problem
undcrst~rad,solutions
solu~itr~~s
Ito
O the
llic more
morc spccilic
pnrblcm of
o f controlling
a ~ n t n ~ l l i ninference
itikrcnce
g
understood,
specific problem
in
i~n:~phor
i~itsrprct;~tion
most he
hc provided
prtrvidcd by a theory
tlicory of an:~plior
intcrplrtatil~n.
anaphor
interpretation must
ana ph or interpretation.
I;
ncll as syntax and
and semantics
scm;~nlicson
un anaphor
a~i;~pIior
'lhc role
rolc of context
contcxt and
and infcrcncc.
The
inference. $as
",,'ell
l)ricl'I<~ok
a t these
tlrcsc explorations
inlcrprcl;~ti~ln
lhavc becll
bccn explored
cxpl,rrcd extensively.
cxtcnsi\.cly. A
interpretation
have
1\ brief
look at
will indicatc
indicate the ncccssily
necessity of
Research on anaphora
o f a new
~ncwapproach.
apprt~;sh. Ilescarch
anaphlrra falls into
four
catcgories:
kror broad catcgorics:
General
filr finding
licuristics Brr
finding antecedents
a~itcccdcnts
Gcncral heuristics
([Winograd
([Wi~ingrad1972])
19721)
Syntaclic and
and scmn~~ric
constraints on anaphora
anaphora
Syntactic
sema1ltic constraints
19631. [Woods
[Woodsct
19721.
([Kalr and Fodor 1963].
([Katz
et al. 1972].
[Chomsky
ICIilrmsky 1976J.
19761. [l.asnik
[I.assik 1976].
19761,
[Reinhart
[Ikinhart 1976].
19761. [Walker
[Walkcr 1976]),
19761).
o f infcrcncc
antcccdcnts
Use
Use of
inference to find antecedents
([Clr;~rniak1972].
19721, [Rieger
[Ricger 1974].
19741.
([Charniak
[Hobbs 1976])
19761)
in a discourse contcxt
context
Analysis of objects
objccts in
([Grt~sz
19771, [Lockman
[l.trkman 1978].
19781.
<{Gros7.1977].
[Ilcichman 1978].
19781, [Webber
[Wcbbcr 1978a].
1978al.
[Reichman
[Hobbs 1979])
19791)
rcvicw each
cach approach. IIw
i l l point out thc
o f each
each
Rather than
than review
will
the contributions of
catcgory
catcgory to a theory
thcory of
o f anaphor
anapllor interpretation.
intcrprctation.
Gcncral lhcurislics, as a means
incans of
o f choosing
ch(xrsing antecedents,
antcccdcnts, prcdict
rcliably in
General
heuristics,
predict reliably
in a
nunihcr of
o f typical examples.
cx;~mplcs. Howcvcr,
simplc characterization
cli;~ractcriz;~lionfits the
large number
However, no simple
wide variety
where lhcy
they fail
widc
varicty of cases
cascs whcrc
fail (see
(scc [Winograd
1Winograd 1972]
19721 and
and (Hobbs
[llobbs 1977]);
19771):
furtticnnorc, the
Ulc licurislic
i~ppmacliis not Lharrclically
grounded ami
and cannol
cannnt offer
offcr
furthennore.
heuristic approm;h
Lhcoretically grounded
approach to Uic
a unificd
unified approach
u1e phenomena.
phenomena.
Semantic
based on thc
the Katz-Fodor
Scmantic selectional
sclcctional restrictions,
restrictions. based
Katr-Fodor theory
thcory of
o f semantic
scmantic
rnarkcrs, and used
uscd by many computational linguists,
linguists, can
can reduce
reduce the
thc space
spacc of
of
markers,
climinatc all possibilities,
possible
antcccdcnts, but thcy
possible antecedents.
tl,ey cannot bc
be uscd
used to eliminate
possibilities. as the
Sidner
272
273
and thc naturc of thc knowlcdgc in thc SJ);ICC. Scvcral such spnccs, dubbcd "focus
and the nature of the knowledge in the space. Several stich spaces, dubbed "focus
spaccs." may bc rclcvaiit at a time atthough only onc is ccnlcrcd on for prtxcssing
spaces," may be relevant at a time although only one is centered on for processing
at any givcn timc. Grosz prcscntcd a proccdt~rcfor intcrprcting nun-pronominal
at any given time. Grosz, presented a procedure for interpreting non-pronominal
noun phrascs using thc fixusing and firus spacc notions. Rcichman [licichman
noun phrases using the focusing and focus space notions. Reichman [Reichman
19781 has cxpalidcd this pari~digm by dcscribing "context spaccs" which arc
1978] has expanded this paradigm by describing "context spaces" which arc
dclincatcd by thcir topics. Within a contcxl spacc. cnrirics rcccivc various focus
delineated by their topics. Within a context space. entities receive various focus
Icvcls: only pllrascs Lhat arc in high f t ~ u may
s bc pro11ominali;rcd. licichman's
levels: only phrases that arc in high focus may be pronominalized. Reichman's
work Icavcs open irnportant quc$tions: what is Lhc recognition proccdurc for
work lCJ"es open important quc~tions: what is the recognition procedure for
dclcr~nininga context space, Ilow docs onc idcntify its topic.'and how docs thc
delermini ng a context space, how docs one identify its topic: and how docs the
hcarcr dctcnninc thc inrcrl)~.ctationof a an:tphor. hilt is, lltrw docs n hcarcr dccidc
hearer detellnine the inlerpretation of a anaphor. that is, how does a hearer decide
which highly fi~uscdplirascs act ;IS thc co-spccificr of :I anaphor?
which highly focused phrases act as the co-specifier of a anaphor?
'1'0 sunllni~ri~c,
cilrrcnt rcscarch on ani~phor intcrprclation suggcsu h a t
To sUlllmaril.e, current research on anaphor interpretation suggests that
co~ttrolof' inferring and const~.ainrson rcpr-cscntation of discoursc arc ncccssary
control of inferring and constraints on representation of discourse arc necessary
nspccts of a thcory of anaphor intcrprctation. Gros~'approach indici~tcsthat onc
aspects of a theory of anaphor interpretation. Gros/ approach indicates that one
Inus[ also considcr wllict thc spcakcr is talking ahou[; RcicIim;~n'sanalysis shows
must also consider what the speaker is talking about; Reichman's analysis shows
that ccrtain pllrascs in tllc con\.crsation play a spccial rolc in inrcrprclation, whilc
that certain phrases in the colllersation playa special role in interpretation, while
Wcbbcr inclicatcs how h c rcprcscntation of quantilicrs aff'cc~ithc ilitcrprctation
Webber indicates how the representation of quantifiers aflCcl~ the interpretation
ofani~phor.All thcsc approaches support thc vicw that iincc hcarcrs do not ha've
of anaphor. 1\11 these aPPfLl3ches support the view that "since hearers do not have
privilcgcd acccss to a spcakcr's mind, other than Uuough what a spcakcr says;
privileged access to a speaker's mind. other than through what a speaker says;
imposing structure c ~ nthc spcakcr's discoursc will ~rovidca fra~ncworkfor
impo~ing structure 011 the speaker's discourse will provide a framcwork for
establishing thc intcrprctation of anaphors.
establishing the interprelatinn of anaphors.
5.1.2 'Ihc Focusing Approach to Anaphora
5.1.2 The Focusing Approach to Anaphora
Onc of the biisic uniLs of languagc co~nrnunicationis thc dikoursc. Informally
One of the basic unil~ of language communication is the diScourse. Infonnally
and intuitively, a discoursc is a conncctcd piccc of tcxl or spoken language of
and intuitively, a discourse is a connected piece of text or spoken language of
more than onc scntcncc spvkcn by onc or more spcakcrs. If such an informal
more than one sentence spoken by one or more speakers. If such an infonnal
definition is to bc hclpfirl at all, somc notion of what it means to bc "conncctcd" is
definition is to be helpful at all. some notion of what it means to be "connected" is
nccdcd. Whilc thcrc arc many diffcrcnt propcnics which contribute to discoursc
needed, While there arc many different properties which contribute to discourse
conncclcdncss, in this chapter I want to considcr only onc: thc spcakcr or spcakcrs
connecledness, in this chapter I want to consider only one: the speaker or speakers
talk about sornctl1ing. onc thing at a timc.
talk about somcthing. one thing at a time.
In it discoursc spcakcrs ccnlcr Lhcir uttcntion on a particular discoursc clcmcnl
In a discoursc ~peakers cellLer their attention on a particular discourse clement,
11 is U>C clc111cntwhich is clnboratcd by a portion
onc wliicli I will call Lllc &.
one which I will call the focus. II is the clelJlent which is elaborated by a portion
of tllc discou~~c.So~l~crimcs
spcakcrs' discourscs can bc quitc diflcrcnt; thcir
of the discourse. Sometimes speakers' discourses can be quite diflcrcnt; their
discourscs ilrc incohcrcnt or at lcast hard to Follow bccausc:
discourses arc incoherent or at least hard to follow because:
Sidncr
274
In a nutshcll, discourscs with lticsc propcrtics arc not conncctcd, that is, thcy lack
rn <l Ilutshell, discourses with these properties arc not connected. that is, they lack
;In clc~ncntwhich is f(xiiscd on. 'I'lic focus is tlicn onc of thc connccting thrcads
an element which is focused on. The focus is then one of the connecting threads
LI1;lt I ~ Y ~ aC tcxt
S
or a sct of utrcranccs a discourse.
thaI makes a text or a set of ullerances a discourse.
l'c~using is a discourse phcnomcnon rrrihcr than onc of single scntcnccs.
Focusing is a discourse phenomenon mther than one of single sentenccs.
, would cxpcct that it
Whcn a spcakcr irscs scbcral scntcnccs about onc f t ~ u s onc
When ,t speaker uses several sentences about one focus, one would expect that it
would nccd to bc rc-inlroduccd in cach scntcncc. tlowcvcr, rc-introduction is a
would need to be re-introduced in each sentence. However, re-introduction is a
redundant and rhus inc~ficicn~
proccss; in fact. spcakcrs d o not usc it. If
redundant and lhus inefficient process; in fact. speakers do not usc it. If
rc-iiitroduction is not uscd, and still hcarcn claim to know what is bcing talked
re-introduction is not used, and still hearers claim to know what is being talked
about, h c r c must be somc mcans by which thc discnursc rcrnains conncctcd. In
about, Lhere must be some mcans by which the discourse remains connected. ]n
fact. thcrc arc two ways. First, spccial words indicaic to ~ h hcorer
c
"that I aln still
fa~t, there arc two ways. First, special words indicate to the hearer "that I am still
t;liking about thc thing I talkcd about in thc prcvious scntcncc;" tradilionally
talking about the thing [ talked about in the previous sentence;" traditionally
rhcsc signals irrc callcd aiiilphoric cxprcssions. Sccond. ~pcakcrsrely on ~ I S S U I ~ C ~
tllese signals arc called anilphoric expressions. Second. peakers rely on assumed
sharcd knowlcdgc in discourscs: thc spcakcr assumes that somc conncctions
shared knowledge in discourses: the speaker assumes that some connections
bctwccn thc f t ~ u 2nd
s somc orher c!emcnts arc alrcady sharcd with thc hcarer so
between the focus and some OLher clements arc already shared with the hearer so
Uiat shc or hc nccd not explicitly state what they arc. Of course. thcrc js a risk that
that she or he need not explicitly slate what Uley arc. Of course. there is a risk that
rhc conncctions are no longer obvious. resulting in a sct of scntcnccs which simply
the connections arc no longer obvious, resulting in a set of sentences which simply
confusc the hcarcr.
confuse the hearer.
Now a possible line of investigation bccomcs clear. 'Ihe focus and the
Now a possible line of investigation becomes clear. The focus and the
assumcd sharcd knowlcdgc can bc uscd as one of thc chicf consiraints on h e
assumed shared knowledge can be used as one of the chief consiraints on the
choice of thc co-specification of anaphoric cxprcssions. liulcs governing an
choice of the co-specification of anaphorie expressions. Rules governing an
anaphor intcrprctcr can bc discovcrcd which usc thcsc two sourccs of constraints.
anaphor interpreter can he discovered which usc these two sources of constraints.
In thcsc rulcs tllc ftcus will play a central rolc as a sourcc of co-specification. 'Thc
In these rules Ule focus will playa central role as a source of co-specification. The
~ ~ X L and
I S thc struuturc of a s s ~ ~ m csharcd
d
knowlcdgc arc significant to rulcs
focus and the structure of assumed shared knowledge arc significant to rules
governing thc choicc of anaphors bccausc thcy capturc thc cffccts of what has
governing the choice of anaphors because they capture the e!Tects of what has
bccn ti~lkcdabout previously and what thc spcakcr has ;ssurncd is knowlcdgc that
been tllked about previously and what the speaker has assumed is knowledge that
is sharcd with thc hcarcr.
is shared with the hearer.
'I'his vicw of focusing and anaphora rcsts on four assumptions about thc naturc
This view of focusing and anaphora rests on four assumptions about the nature
of communication, cach of which is true in most situ;ltions. First, thc spcakcr is
of communication, each of which is true in most situations. l-'irs1, the speaker is
assumcd to bc communicating about something. 'I'his assumption implies that the
assumed to be communicating about something. This assumption implies thal the
spcakcr is not speaking gibberish, that thc utterance contains rcfcrring cxprcssions
speaker is not speaking gibberish, that the utterance contains referring expressions
275
Sidner
276
277
114-1 The axon may run for a long distance. scnding off sckcral
sid~brmlchrsalong thc way,
sidebranches along the way.
2 bcforc it tcrmina~csin an cvcn fincr nctwork of filarncnts, h e
2 before ittcrminates in an even fincr network of filaments, the
i~ntri~iml
arbor.
IfnI/ina! arbor.
3 hli~n'slongcst axon runs for scvcral fcct. froin the spinal column
3 Man's longcst axon runs for scveral feet. from the spinal column
to t n ~ ~ x lthat
c s control movcmcnts of thc tocs.
tll muscles that control movements of the toes.
4 In spitc of its grcitt Icngth, his axon, likc all ncrvc libcrs, is a pan
4 In spite of its great length. this axon. like all nerve fibers. is a part
of a singlc ccll.
of a single cell.
S It is living mattcr.
5 It is living maller.
135-1 I'm having a party tomorrow night;
I)5-1 I'm having a party tomorrow night;
2 it will bc likc the onc I had last wcck.
2 it will be like the one I had last week.
3 'I'liat w r t y was a big succcss
J That lli!L!Y was a big success
4 bccausc cvcryonc dauccd.
4 because everyone danced.
5 'lhis onc wit1 I~itvcbcttcr food.
5 '1l1is ~ will have better food.
6 I'vc ahkcd cvcryonc to bring something spccial.
6 I've asked eve:ryol1e to bring something special.
7 Want to cnmc?
7 Want to come?
If ftxusing is to bc vicwcd as p;trt of an;\phor comprchcnsion, somc prcxcss
I f focusing is to be \'iewed as part (If anaphor comprehension. some process
must choosc, in a rcliablc way, what I havc dcscribcd looscly as thc focus of thc
must choose, in a reliable way, what I have described loosely as the focus of the
discoursc. 'I'hc p~.occss will bc ~.cquircdto rnakc usc of rcprcscntations of
discourse. The process will be required to make usc of representations of
t linkcd to t)thcr mcniory clc~ncntshccausc pcoplc
clcmcnts of a scntcncc d ~ a arc
clements of a sentence that arc linked to other memory clements because people
sccni to LISC just SLICII information thcmsclvcs. In addition, an,intcrprctcr will
seem to usc just SllCh information themselves. In addition, an' interpreter will
makc LISC of the focus. as well as as syntactic, scmentic. and-gcncral knowledge
make use of the focus. as well as as syntactic. semantic. and- general knowledge
rcstrict~ons, in dctcnnining thc co-spccificr of thc anaphoric cxprcssions. 'lhis
restrictions. in detennining the co-specifier of the anaphoric expressions. 'Ibis
. fwl~singprwcss and thc unaphor intcrprctcr raises
brief dcscription of a f i ~ u s thc
brief description of a focus. the focusing process and the anaphor interpreter raises
sumc qucstions which must be answcrcd. What is thc fwus of thc discoursc, and
sume questions which must be answcred. What is the focus of the discourse, and
how is it dctcrmincd? What kinds of assumptions about thc stnlcturc of the
how is it determined? What kinds of assumptions about the structure of the
knowlcdgc must be madc in ordcr to usc a focus for dcfinitc anaphor
knowledge must be made in order to use a focus for definitc anaphor
disambiguation? What infcrcnccs arc uscd in thc prcdiction of co-specifiers?
disambiguation? What inferences are used in the prediction of co-specifiers?
How docs an anaphoric intcrprctcr use thc focus to intcrprct pcrsonal and
How does an anaphoric interpreter use the focus to interpret personal and
1his4harpronouns?
this/lhal pronouns?
l'hc answcrs to thcsc qucstions will bc providcd in thc ncw fcw scctions. As
The answers to these questions will be provided in the new few sections. As
thc tllcory unfolds. I will also support thc following claims ahout dcfinitc
the theory unfolds. J will also support the following claims about definite
anaphora.
anaphora.
1)4-1 The axon may run for a long distance. sending off several
Sidner
Sidncr
278
278
3. 1)istingui~liin.qand disambicu;~tinedcfinitc a n a ~ h o r a
3. Distinguishing
definite anaphora
I't~us, and
useddisambiguatin2
w i ~ hLIIC rcprcscntation
of knowlcdgc in (I), with Wcbbcr's
Focus, rcprcscn~ation
used with the representation
in (l), with Webber's
) and with infi)~.rnation describing
(scc Chaptcrof6knowledge
representation
Chapter
6) and such
with asinf{lrmation
describing
scnlcncc (sec
syntistic
constr;~ints,
c-con~lnand,
atid sc~nantic
sentence
syntactic
constraints,
stich
as
c'command,
and
semantic
sclcctional rcstriclions, 'can distinguish those dcfinitc
an;~phora
se1cctional restrictions,' Can distinguish those definite anaphora
goicmcd by discoursc cffccts. It can bc used tu dis~mhiguatcthcir
by discourse
effects. It can be used to disambiguate their
gll\erned
specifications
as wcll.
as
well.
specifications
'I'hc thcory of focusing ~nakcsccstain tcstnhlc prcdictions, oncs which arc
The theory of focusing makes certain test<lb1c predictions, ones which are
produccd by lhc prcrccsscs and intcrprclcrs that arc lo bc dcscribcd in this chaptcr.
produced by the processes and interpreters that are to be described in this chapter.
'I'licy will prcdict which rcprcscntations arc thc spccificatiuns of ;inaphors and
They will predict which representations arc the specifications of anaphors and
hat decisions arc n ~ i ~ dtoc find thc rcprcscnt;~tions. Sincc thc Lhcory rclics on
'" hal decisions arc made to find the representations. Since the Uleory relics on
rcl~rcscntationsof k11o\i lcdgc, ir also innkcs prcdictions and stlggcsts c{)nstraints
representations of knowledge, it also 1llilkes predictions and suggests constraints
on both thc structure and the contcnt of thcse rcprcscntations. 111 discussing all
on both the structure and the content of these representations. In discussing all
thcsc prcdictions, I will illusir;~lctllc advanccs of thc focusing Ihcory ovcr carlicr
these predictilltls, J will illtlstrilte the advances of the focusing theory over earlier
work as ~ c l as
l cxplain thc limit;~tionsof lhc thcosy for intcrprcting onc class of
work as well as explain the limiwtions of the thcory for interpreting one class of
dcfinitc anaphora.
definite anaphora.
5.2 The Definition of Focus
5.2.1 A Skclcfl of l l ~ cl'rocuss hlodcl of Focusing
5.2.1 ASkclch of the I)rocess Model of Focusing
279
116-1 1,ist ncck tllcrc wcrc somc nicc strawbcrrics in tllc refrigerator.
1)6-1 Last \'leek there were some nice strawberries in the refrigerator.
2 'l'hcy camc from our h o d co-np and wcrc unusually frcsh.
2 They came from our filOd co-op and were unusually fresh.
3 1 went to use tticm for dinncr, but somconc had c:ltcn thcm all.
3 I went to usc them for dinner, but someone had eaten them all.
4 I-i~tcrI discovcrcd it was Mark wllo had catcn Lhcm.
4 Later I discovered it was Mark who had eaten them.
5 Mark has ii hollow Icg. and it's irnpossiblc to kccp f w d around
5 Mark has a hollow kg. and it's impossible to keep food around
whcn his stomi~chnccds filling.
whcn his slOmach needs filling.
Suppose thc first focusing prtccss initially gi~csscsthat strawbcrrics are the
Suppose the first focusing process initially guesses that strawberries are the
ftcus in 116-1. Ncxt a pronoun ioterpr;tcr would apply n rulc h a t says "A
focus in 1)6-1. Next a pronoun interpr~ter would apply a rule that says "II.
pronoun that can bc rcplaccd by tlic fixus plirasc, with tlic. rcsulting scntcncc
pronoun that can be replaced by the focus phrase, VYith the. resulting sentence
rcrn;rining syntilctically acccptablc. co-specifics with d ~ cfczus. ~ ~ n l c ssome
s
remaining syntactically acccptable, co-specifics with the focus, unless some
pragmatic knowlcdgc rulcs out that co-spccificr." to dc~cnnincthat strawbcrrics
pragmatic knowkdge rules out that co-specifier," tll detennine that strawberries
can rcplacc / ~ I P Jin~ 136-2 with no syntactic fiiilurc. An infcrcncc proccss. govcrncd
can r('place the)' in 1)6-2 with no syntactic failure. An inference process, governed
by ~ h pronoun
c
intclprcler, could confirm that str~lwbcrricscan colnc from food
by the pronuun interprcter, could confirm that strawberries can comc from food
co-ops iuid can bc Srcsll; thi~tis, n o contrad~ctionin gcnccrl knowlcdgc rcsults.
co-ops and can be fresh; that is, no contradiction in gcneral knowledge results.
Finally, the third prtwss can confirm stratrbcrrics ;is Ulc f t ~ l r ssincc it lias bcen
Finally, the tJlird process can confirm strav. berries :15 tJle foclls since it has been
rc-mcntioncd and bcc;iusc t)thcr objccts mcntior~cdin 1%-1, tlic rcfrigcrator and
re-mentioned and becaus(' other objects mentioned in D6-1, tJle refrigerator and
[tic prctious wcck. wcrc no1 discusscd in 116-2.
the previous week, werc not discussed in 1)6-2.
.l'hc ftcusing rncch;tnisn~will bc a usclul ~hcorcticnl.toot only if it is cohcrcnt
The focusing mechanism will be a useful theoretical. tool only if it is coherent
ti) talk about solnc clcmcnt of thc discourse as bcing in focus. While our
ttl talk about some clement of the discourse as being in focus. While our
intuilions as spcnkcrs and hearers Icd us to bclievc that thcrc is somctlling wc talk
intuitions as speakers and hearers led us to believe that there is something we talk
about. tlic inr~~ition
is pr-ohlunatic bccirusc thcrc appear to bc rnany phcnomcna
about, tJle intuition is prohlematic because tJlere appear to be many phcnomena
which function in distinguishing what it is that somculic is talki~igabout. Onc
which function in distinguishing what it is that someone is talk ing about. One
such phcnomcnon for marking focus are syntactic constructions, such as
such phenomenon for marking focus arc syntactic constructions, such as
thcrc- inscrtion scntcnccs as ( 5 ) and clcft scntcnccs as (6).
there' insenion sentences as (5) and cleft sentences as (6).
( 5 )'l'hcrc once was a wisc old king who livcd on a mountain.
(5) There once was a wise old king who lived on a mountain.
(6) It w;s 111cbutlcr who kid~iappcdthc hcircss.
(6) It was the butler who kidnapped the heiress.
Another phcnonlcnon which marks ftrus is specch strcss and prosodics: it appears
Another phenomenon which marks focus is speech stress and prosodics: it appears
that thcsc mark what thc spcakcr is most intcrcstcd in talking about. In (7) if
that these mark what the speaker is most interested in talking about. In (7) if
contrastive strcss is put on Jcrcrny, thc hcarcr might cxpcct tllat thc ncxt sentence
contrastive stress is put on Jeremy. the hearer might expect th<lt the next sentence
will say Inorc about him.
will say more about him.
(7) 1 want o ~ l of
c JI:I<EMY'S picturcs.
(7) I want olle of JEREMY'S pictures.
In iln irpcorning scction thcsc and similar pl~cnomcn;~
will bc prcscntcd and
In an upcoming section these and similar phenomena will be presented and
analyzed in dctiiil for tlicir rolc in dctcrmining ftrus. In tllc thcory that will be
analyzed in detail filr their role' in determining focus. In the theory that will be
prescntctl. tlic firus %ill hc dcliricd as 1h;rt discor~rscclcmc~ltsclcctcd by thc
presented. the fiJeus will be defilled as that discourse ekmellt selected by the
c o n ~ p i r ~ ~ ~ iproccss:
o n a l thc prc~csswill bc dcfincd so tllitt it hkcs into account all
compu~llional process: the pn)(:ess will be defined so that it takes into account all
thc phcnotncna. I lcncc. thc ddinitic!n of fcxus will bc a function of thc thcory
the phenomena. llence, the dcflnitil!1l of f(Jeus will be a function of the theory
rather thitn ;ui indcpcnclcnlly dcfincd objcct.
rather than all independently defined object.
SidneI
280
cvent
event
place
place
time
time
is-a .
mccting (gcncric)
(gcneIlc)
participant
participant
topic
topic
purpose
purpose
~~eat.il~g
instmcc
inswnce
rnccring-w
ith-Stancrzyk
meeting-with-Stanoczyk
placc: SO1 tirnc; 'I'hursday-at-3 participant: Stanoczyk, 1,cwin
place: 801 .time: Thursday-at-3 participant: Stanoczyk. Lewin
'
281
'Ihc data structilrc must also suppon two kinds of hicrarcliical links with thc
The data structure must also support two kinds of hicrarchicallinks with the
ability to inhcrit o n both. Onc link cxprcsscs h c is-a kind of rclittion; it allows
ability tlI inherit on both. One link expresses the is-a kind of relation; it allows
prclpcrtics from thc nctwork dcscription of onc clclncnt to bc inhcritcd by
properties from the network description of one clement to be inherited by
anothcr. 'l'hus thc gcncric mrcting is a conccptual clcmcnt which is-a kind of
another. 'lllUs the generic mceting is a conceptual clement whkh is-a kind of
clrnt. and it inhcrits Lhc assr~intionsof placc and t i ~ n cfro~nIhc is-a rc1atiun. The
c\cnt, and it inherits the associations of place and time from the is-a relation. The
sccond rclation with inlicritancc ciipturcs thc notion of en instancc typc. I b i s
second relation with inheritance captures the notion of an instance type. This
I~
likc 111rcting and a pl~rticular
rclatio~l(ccurs bclwccn a C O I I C C ~ ~ U ; clcmcnl
relation occurs between a ConCel)llIal clement like mceting and a particular
lnccting likc nicctinp-rritll-St;lnocrjk. 'I'liis clclncnt is a particular copy or
meeting like mccling-"ilh-Sl:mOcl.}k. This clement is a particular copy or
insu~nccof its parcnt nodc. Elcmcnw that arc gcncrics rcprcscnt abstract kinds
instance of its parent node. Elements that are generics represent abstract kinds
.
may scrvc as thc spccification
~ l l i l ci~lstanccsrcprcscnt objccts in thc W L I I . ~ ~110th
\\hile instances repleSenl objects in the world. Both may serve as the specification
of a pl1r;lsc in discourse.
of a phrase in discourse,
'I'hc datic stl.uctrtrc nccdcd must havc othcr propcr~ics. It must also allow for
The data structure needed must have other properties. It must also allow for
thc clnbcddi~lgof structure wilhin stnlctnrcs, bccausc tl~csc rcprcscnt olhcr
the embedding of structure within structures. bel:ause these represent other
discou~.scclcmcnts srthjcct to discussion and rc-n~cntion.If wc arc told that John
di~course clements subject to discussion and relllention. If we are told that John
is cating an icc crcaln conc, h c rcprcscnlation must show that thc act of cating
is eating an ice cream mne, me representation must show thaI the act of eating
includcs rwo sub-structures. one rc1)rcscnting John and dlc othcr thc k c crcarn
includes two sub-structures, one representing John and the uther the icc cream
conc. I-inally thc data structure must allow for a n;~turalrcprcscnhtion of scope of
COlle. Finally the data structure must allow for a natLlral representation of scope of
quantilicrs: thcir rcprcscntation is considcrcd by Wcbbcr in this volume, and a
quantiliers: their representation is considered by Webber in this volume, and a
discussion of how that rcprcscntation may bc 11scd is givcn latcr in this chaptcr.
discus~ion of how that representation may be lIsed is given later in this chapter.
'I'hcsc data stnlcturc clraractcristics arc ncccssary for anaphor comprchcnsion
These d:lta stnlcture characteristics arc necessary for alwphor comprehension
bccausc loss of any characteristic has important cffccts on what anaphors can bc
because Illss of (lny characteristic has important effeus on what anaphors can be
co~nprchcndcd,as wc will scc in thc cxamplcs in h i s chapter.
comprehended, as we will see in the examples in this chapter.
'I'hc pmpcrtics of a nct structure cxprcsscd hcrc arc sonlctimcs pan of
The properties of a net structure expressed here are sometimes part of
Artificial Intclligcncc rcprcscntation Ianguagcs (scc K1:ONE [llrachman 19781,
!l.rtificial Intelligence representution languages (sec KLONF., [Brachman 1978].
OWI.. [Hawkinson 19781. KRL, Itjobrow and Winograd 19771 among othcrs).
OWl., [Hawkinsun 1978]. KRL (Bobrow and Winograd 1977] among others).
'I'hcsc propcrrics will be ncccssary in thc discussions of focusing and anaphor
These.- properties will be necessary in the discussions of focusing and anaphor
intcrprctation that follow, so ~ h i c h c v c rrcprcscntation language is uscd, it must
interpretation that follow, so whichever representation language is used, it must
havc thc fcaturcs mentioned for the focusing theory.
have the features mentioned for the focusing theory.
'I'hc illustration of ftrus in figurc 1 is slightly misleading bccausc it suggests
The illustration of focus in figure 1 is slightly misleading because it suggests
that thc f i ~ u sis only thc computational encoding of thc spccification of a
that the focus is only the computational encoding of the specification of a
particular noun phrase. In fact, thc proccss which cstablishcs thc spccification o f a
palticular noun phrase. In fact, the process which establishes the specification of a
noun phrasc in fixus must hakc acccss to thc syntactic and semantic forms of the
noun phrase in foctls must have access to the syntactic and semantic forms of the
phrasc. If thcy arc lcft out, somc anaphors wit1 appcnr ambiguous. when in fact
phrase. If they arc left out, some anaphors will appear ambiguous, when in fact
thcy arc not. Unncccssary ambiguity can bc illustrated using 117 bclow.
they arc not. Unnecessary ambiguity can be illustrated using 1)7 below.
117-1 'i'hc first lnan on thc mtwn bccamc a national hcro.
D7-1 The first man on the moon bCC3me a n:Jtional hero.
2 Ilue to his status, hc rodc ~n ticker tapc paradcs, met public
2 Due to his status. he rode in ticker tape parades, met public
officials and was chascd by autograph hunters.
officials and was chased by autograph hunters,
'Ihe focus of this example is the firs1 mall on the moon. Suppose that the
The focus of this example is the first man on the moun. Suppose that the
spccificarion of that phrasc is the focus as dcpictcd bclow, wirhout benefit of the
specification of that phrase is the focus as depicted below, without benefit of the
rcfcrring dcfinite noun phrase.
referring definite noun phrase.
282
Sidner
FOCUS:
NEIII, .\RRlS~I'HONG
I h k : colonel in U.S. Anny
Rank: colonel in U.S. Anny
I'itllic~: 3 childrcn
Father: 3 children
Acllicvcn~cnt: first moon walkcr
Achievement: first moon walker
NEIL ,\HI\ISTRONG
283
actually wcnt to thc movics froin among thc thrcc acton. Rulcs for annphors
actually went to the movie~ from among the three actors. Rules for anaphors
co-spccifying with actors will bc disc~isscdlatcr.
co-specifying with acWfS will be discussed later.
119-1 1 wantcd to go to thc movics on Saturday,
J )9-1 I wanted to go to the movies on Saturday,
2 John said hc'd comc too, but l%illdccidcd to stay homc and
2 John said he'd come too, hut Bill decided to stay home and
study.
study.
3 S o wc wcnt and af~crwardshad a bccr. ( n r = Johti, the sj>e(jker)
3 So we went and afterwards had a beer. (we::: John. tJl(' speaker)
5.2.3 I:indi~~g
'l'ht I )iscosrsc Focus
5.2.3 Finding Till' J)iscourse Focus
Sidner
284
1. 'Ihis is Ciruhcr's [Grubcr 19761 lcrm Tor the n~dionoithc object case oTa vcrb. Ilis hcory extends
1. This verb
is Gruber's
[Gruber
1976jlerm
for the
of the
object
casewhere
of a verb.
I lis theory
e~tcnds inside a
scrnanlics
to include
verbs
suchnotion
as the
ones
bclow
the theme
is Itmted
verb semanlics
to include
prcposi~ional
phrase.verbs such as the ones below where the theme is located inside a
prepositional phrase.
(IS) Wc wailcd out Ihc lhundcrstorm in a ntndown old shack.
(lS) We (16)
wailed
out the
thunderstorm
a rundown
I'lcase
rocus
on thc star orinIndia
in Ihe old
caseshack.
on h e l e k
H6) Please focus on the slar of India in the ca~e on the lell
IT the conccpc or thcme is used as h e dcrault choice for expccled focus, thcse examples fit naturally
If the concept
is used a~ the default choice for ex petled focus. these examples fit natu rally
within of
thetheme
framework.
within the framework..
285
Lhc initial scntcncc. For cxamplc. in D10-2. if the h t l k wcre uscd in placc of it,
the initial sentence. For example, in 010-2, if the lxtllk were used in place of it,
thc cxpcctcd rtxus would hc ovcrriddcn in favor of bank.
the expected fOt:us would he overridden in favor ofbank.
In ii scntcncc without a thcmc, that is. whcrc only non-thcmc prepositional
In a sentence without a theme, that is. where only non-theme prepositional
pllrascs arc prcscnt, tllcrc docs not appcar to hc a prcfcrcncc for cxpcctcd focus.
phrases arc present, there docs not appear to be a preference for expected focus.
Most otlicr tllcl~~i~tic
positions (insirulncnt. goal and locntivcs) do not offcr a
Most other thematic positions (instrument, goal and locatives) do not offer a
strong prcfcrcncc for fisus allhougli somc wcak prcfcrcnccs somctimcs nppcar.
strong preference for focus although some weak preferences sometimes appear.
'Ilicsc wcak prcfcrcnccs arc for goal and any posilion in which an indclinitc
'Illese weak preferences arc for goal and any position in which an indefinite
occurs. Ilowcvcr. it is difficult to kl~owhow rclii~blctlicsc prcrcrcnccsi~rcwithout
occurs. However, it is diflicultto know how reliable these preferences arc without
somc nic;lns of dctcr~rii~lilig
tlic rolc of stress and pn)sodlcs in lhcsc cases.
some means of determining the role of stress and prosodies in these cases.
'I'licrcL)rc, no cl~rimcwill he rnadc ahotit prcfc~rnccfor cxpcctcd fiwrus for thcsc
Therefore, no claims will he m;H.lc <Ihollt preference for expected focus for these
positions, I~istci~d
tlic algorithln li~rco~nputingcxpcctcd ftrcus bclow will rcly on
positions. 1nste,lJ the algorithm f(Jr computing expected focus below will rely on
a sinlplc sclicnlc o~scntcnccsurfr~ccosdcr Tor tllcsc tlic~naticpositions.
a simple scheme of sentence surface order for these thematic positions.
One tlicm;~ticposition that is 11ot prcfcrrcd for discourse f i r i ~ sis thc agcnt.
One thematic position that is not preferred for discourse focus is the agent.
Whcn a pn~nountmurs i n a nun-agcnt position. iuid in tlic prcccding scntcncc,
When i.l pronoun occurs in a non-agent position, and in the preceding sentence,
ht)[li ;in agcnt and ;I pllriisc in anorllcr tlicmviic position can bc its co-spccificrs.
both an agent and a phrase in another thematic position can be its co-specifiers.
thc agcnt is not prcfcrrcd, as is illustrated in thc cxarnpjc bclrlw. Hcncc in the
the agent is not preferred, as is illustrilted in the example below. Hence in the
chtticc of cxpcctcd firus, tlic agcnt is ordcrcd last among possiblc noun phrasc
choice of expected focus, the ilgent is on.lcred last among possible noun phrase
cboiccs.
etlOices.
1112-1 A group at tiXN dcvclopcd a high spccdtcchnical chip packer,
.D12-1 A group at HXN developed a high speedtcchnical chip packer.
2 'l'hc prcss gnvc it ravc rcvicws.
2The press gave it rave reviews.
'I'wo scntcncc ronns affecting focus do not dcpc~ldon thcrnntic position. Onc.
Two sentence fonns affecting focus do not depend on thematic position. One.
is-a vcrbs, takc thc subjcct of thc scntcncc as cxpcctcd focus.
is-a verbs, takc the SUbject of the sentence as expectcd focus.
1313-1'I'hc I'ersonal Assistant nrouni is a rcscarch grow that is
1)]3-1 The Personal Assistant Wl!!lli is i! research &t:Q!!2 thill ~
dcsicninq picccs of a ~crsoni~l
assiqtant maram.
J'
designing pieces ill.il personal assistant programj'
2 (a) Scvcral graduatc sli~dcntsand rcscarch faculty arc mcmbers
2 (a) Scveral graduate students ilnd research faculty arc members
of di.
of tli'
(b) * Sevcra! graduatc studcnts and rcscarch faculty are
(b) Several graduate studcnts and research faculty are
mcmbcrs of
J.
members of tlj'
Whilc thc prcdicalc norninativc is bcing asstriatcd with thc subjcct in is-a
While the predicate nominative is being associated with the subject in is-a
scntcnccs, it docs not co-specify with tlic suhjcct. Instcad thc st~bjcctis being
sentences, it docs not co-specify with the subjcct. Instead the subject is being
dcscrihcd as having sonlc particular propcstics. i~ndhcncc is fundamental to the
described as having
some particular properties, and hence is fundamental to the
discussion. 1
discussion. 1
Sklner
286
1 . I%y "scrnanlically nculral," I mean lhal Ihc sclntional rcslriclions on f i e thematic posil~onin
I. By "semanlically
neutral,"
mean
that the bcaror
sclcctional
restrictions
on loose.
the thematic posilion in
qualion do not
rule outIthc
usc orcilhcr
Ihc evcnl
olgclting
question do not rule out the usc of either bear or the event of gelling loose.
287
'I'hc Expcctcd Focus Algorilhm:
The Expected Focus Algorithm:
Choosc an cxpcctcd fixus as:
Choose an expected focus as:
I . 'Shc subjcct of a scntcncc if thc scntcncc is an is-a or a ~hereinscnion
I. The sllbject of a sentence jf the sentence is an is-a or a there-insertion
scn tcncc.
sentence.
This stcp prcsurncs inkrrmatiun from a parsc lrcc about what the
This step pres,umes information from a parse lree about what the
subjcct, and vcrb arc and about whcthcr the scnttnce is
subject, and verb arc and about whether the sentence is
thcrc-inscrtion.
there-insertion.
2. 'I'hc fir-st nlcmhcr of Ihc dcCiiilt cxpcctcd f t ~ i i s!is[ (Dlil: list), computcd
2. The fir~t member of the default expected focus lisl (DEF list), computed
from the t l i ~ t ~ l ircla~ions
i ~ i ~ oFtlic vcrb, as follows:
from the thematic relations of the verb, as follows:
~ r d c rthc sct of pliruscs in Ihc scntcncc using thc following
Order the set of phrases in the sentence using the following
prcfcrclicc schcma:
preference schema:
- llic~ncunlcss thc thclnc is a vcrb complcn~cntin which casc the
- theme unless the theme is a verb clJlnplement in which case the
tllcn~cfiom thc romplc~ncntis uscd.
tllel1le from the l'Omplement is lIsed.
- all olhcr thc~lli~tic
positions with thc agcnt last
- all other thematic positions with the agent last
- the tcrh phrase'
- tbe verh phrase
'I'his sicp rcquircs a lisi of thc surfrlcc ordcr of rhc noun phrascs, and
This slep requires a list of the surface order of the noun phrases, and
a dat;~strrlcturc which indicates wliicl~ noun phr:tscs fill wllich
a data structure which indicates which noun phrases 1111 which
tlicm;~licslots in dlc verb. Such a data s ~ ~ - u c t imust
~ ~ . c hc uo~nputcd
thematic slots in the verb. Such a data Slructure must he computed
by a casc framc tnccliatiism such as thc onc rcportcd in [Marcus
hy a case frame mechanism such as the one reported in [Marcus
19801.
1980].
'Il!c cxpcclcd focus algorithm is uscd to chansc thc discourse focus. An
'Il~e expected focus algorithm is used to choose tlle discourse focus. An
analogous algorithm to choosc thc actor focus call bc dcfincd. 'his algorithm
analogous algorithm to choose the actor focus can be defined. This algorithm
~ o u l choosc
d
John as thc cxpcclcd actor in thc scntcncc bclow.
would choose John as the expected actor in the sentence below.
(1 8) John radc his pony to t11c big mc;tdow at tlic cdgc of thc farm.
(18) John rode his pony to the big meadow at the edge of me farm.
Wcrc h a t scntcncc to be followcd by a scntcncc with a pronoun in agcnt position,
Were that sentence to be followed by a sentence with a pronoun in agent position,
as bclow. thc pronoun would confinn thc cxpccicd actor focus as thc actor focus.
as below, the pronoun would con finn the expccted aclOr focus as the actor focus.
(19) flc likcd to sing as hc rode.
(9) He liked 10 sing as he rode.
Latcr 1 will discuss the itltcraction bctwccn thc actor and discoursc foci.
Later I will discuss the interaction between the actor and discourse foci.
Whilc rhc cxpcctcd fixus algorithm can always choosc an cxpcctcd focus, its
While the expected focus algoritllm can always choose an expected focus, its
choicc may havc to hc ~.cjcctcd~ C C S I I S Cthc dcfihtilt position is ovcrriddcn by other
choice may have to he rejected because the default position is overridden by other
frrc~ors.'I'ypically, his tccurs whcn a pronoun, which docs not co-specify with the
factors. Typically, this occurs when a pronoun. which docs Ilot co-specify with the
cxpcctcd fixus, is ilscd in thc sccond scntcncc of thc discoursc. and no anaphor is
expected focus, is used in the second sentence of the discourse, and no anaphor is
uscd to co-specify with thc cxpcctcd fixus. In thc salnplc 1117. thc cxpcctcd fixus
used to co-specify with the expected ((lCUS. In the sample DI7, the expected fllCUS
is thc graduarion party, but in thc Following scntcncc the use of it Lo co-spccify
is the graduation party. but in the f(ll1owing sentence the usc of it to co-specify
with Cathy's housc indicatcs that thc focus is on thc house.
with Cathy's house indicates that the focus is on the house.
1117-1 Cathy wanrs to havc a big graduarinn party a t hcr house.
D17-1 Cathy wams to have a big graduation party at her house.
2 She clcancd it u p
2 She cleaned it up
3 so that thcrc would be room for everyone.
3 so that there would be room for everyone.
Sidner
Sidncr
288
288
'I'wo questions comc to mind: tlow can onc rccog~iizcthat the cxpcctcd focus
Two questions come to mind: How can one recognize that the expected focus
is not thc ftxus? How can an altcrnativc noun phrasc bc choscn ns thc focus?
is not the focus? How can an alternative noun phrase be chosen as the focus?
I{ccognition and sclcction both dcpcnd on thc ~ i s cof infcrcnccs about gcncral
Recognition and selection both depend nn the usc of inferences about general
knowlcdgc. I'or cx;tmplc. in I117 thc choicc of party for ir can bc rcjcclcd since
knowledge. For example. in 1)17 the choice of party for if can be rejected since
h;rving clcancd up an cvcnt wcluld bc rcjcctcd as incompntiblc with othcr
having cleaned up an event would be rejected as incompatible with other
kaowlcdgc about cleaning. I:ollowing tlic rcjcction of tllc cxpcctcd ftrus, a
knowledge about ckaning. Following the rejection of the expected focus, a
corrcct co-spccificr can bc sclcctcd bccsusc it is availuhlc in rllc p r c ~ i o ~Acnlcncc.
rs
wnect co-specifier can be selected bec;lllse it i~ avail<Jble in the previolls sentence.
'I'o find it. cnch a!tcr~intcdcfiiult focus ~ n ~ lb~
s t considcrcd ill turn. ilnlil one is
To find it. each alternate default focus Inust be considered in turn. until one is
found wliicli is not rcjcctcd o n thc hasis of gcncral knowlcdgc. 'Whcn thc fkusing
found which is not rejected on the hash; of general knowledge. 'When the focusing
PIIK'CSS runs agi~inaficr all tlic anaphors arc i~itcrprc~cd,
thc propcr firus can be
process nllls again after all lhe anaphors arc illlerpreted. the proper focus can be
clioscn.
chosen.
'I'hc dcfiiult cxpcctcd f t ~ u scan bc rcjcctcd only whcn Blc infcrcnce
The default expected focus can be rejected only when the inference
mccl~anismclcarly i~idicatcsthal llic prcdictcd co-spccificr is unacccplablc. 'I'hat
mechanism clearly indic<ltes thai the predicted co-specifier is unacceptable. That
is. thc infcrc~lccmust contradict givcn knowlcdgc from the discoursc or be
is. the inference must contradict given knowledge from the discourse or be
inco~np;~tiblc
w i ~ hotlicr gcncr;ll knowlcdgc. 'Ilic fact that a no1111phrasc besides
incompatible with other general know ledge. '1l1e lact U1at a noun phrase besides
hc acccpt;~blcas a co-spccificr is irrclcvant as long as the
Ihc cxpcctcd f t ~ u might
s
the expected focus might be acceptable as a co-specifier is irrelevant as long as the
cxpcctcd focus is acccp~thlc.For cxitmplc, in I111. rcpcatcd bclow, wliilc thc bull
expected focus b accep14lble. For example, in DlI. repeated below. while Lhe bull
night hc an acccp~ablca)-specifier for he, it is not considcrcd sincc tllc cxpcctcd
might he an n(cepwble co-specilier for he, it is not considered since the expected
fwos is acccpthlc.
f~lCus is ncceptable.
1311-1 Sirndy walked hcr dog ncar a bt~llOIIC day.
1)1]-1 Sandy walked her dog ncar a bull one day.
2 ilc ~ r o t ~ cquictly
d
along.
2 lie trolled quietly along.
A matter whicli is rclatcd to thc problcm of rcjccting a f i ~ u is
s how speakers
A matter which is related to the problem of rejecting a focus is how speakers
rccovcr from co-specification failures. Considcr thc following variation on 1)ll:
recover from co-specification failures. Consider the following variation on DIl:
1118-1 Si~ndywalkcd hcr dog ncar a bul!.
D181 Sandy walked her dog ncar a bull.
2 Shc saw how hc thscw back his grcat mcnacing horns.
2 She saw how he threw back his great menacing horns.
3 Hc certainly was aa unusual looking dog and thc namc "Little
3 He certainly was an unusunllooking dog and the name "Little
llull" fit him wcll.
Bull" fit him well.
Aftcr 1118-2 h c co-spccificr of hcsccms to bc thc bull mcntioncd in 1318-1. After
After Dl8- 2 the co-specifier of he seems to be the bull mentioned in DI8-1. After
thc third scntcncc, h c rcadcr is Iikcly to havc discovcrcd that thc dog has bccn the
the third sentence. the reader is likely to have discovered that the dog has been the
f i ~ u all
s along and that this discoursc is a bit hizarrc. Virginia Woolf [Woolf 19571
fllCus all along and that this discourse is a bit bizarre. Virginia Woolf[Woolfl957]
points out that litcraturc is intcrcsting for thc ways in which authors brcak rulcs.
points out that literature is interesting for tbe ways in which authors break rules.
'I'his chaptcr will only point out what rulcs can bc violatcd. Why thosc mlcs arc
This chapter will only point out what rules can be violated. Why those rules are
violatcd, and how ni~tiicspcakcrs rccovcr from violalions of thosc nrlcs remains to
violaled. and how nati\ e speakers recover from violations of those rules remains to
hc cxplaincd.
he explained.
In summary. tlic cxpcctcd ftxus can bc rcjccicd in favor of anotlicr phrasc in
In summary. the expected focus can be rejected in favor of another phrase in
~ h cdiscottrsc. Itcjcction is possiblc. only whcn thc prcdictcd co-spccitication
the discourse. Rejection is possible. only when the rredicted co-specification
hctwccn a dcfinitc anaphor and tlic cxpcctcd ftlcus is unacccptablc. 'I'hc rcjcctcd
hetween <l definite <Jnaphor and the expected focus is unacceptable. The rejected
plirasc must bc rcti~incdfor possiblc rc-in~roductionlater in the discourse.
phrase must he re14lined for possible re-introduction later in the discourse.
289
5.2.5 I t~rcrringand Focusing
5.2.5 Inferring :md Focusing
Confirming thc cxpectcd focus oftcn rcqiiircs infcrring somc truths and can be
Confinning the expected focus often requires inferring some truths and can be
quitc complcx, [Winograd 19721 citcs thc scntcncc "-I'hc city council rcfuscd h e
quite complex. [Winograd 1972] cites the sentence "The city council refused the
demonstrators a pcrmit bccausc thcy fcarcd violcncc," and hc dcscribcs some of
demonstrators" permit because they feared violence," and he describes some of
thc knowlcdgc nccdcd to dcrcrrninc thc antcccdcnt of [IIPJ!. ICharniak 19721
the knowledge needed to determine the antccedent of l!Iey. ICharniak 1972]
prcscnts nuincrous cxanlplcs of gcncral knowlcdgc, and [lsncr 19751 prcscnts onc
presents numerous examples of general knowledge, and [Isner 1975] presentS one
approach to handling infcrcncc for Wino~grad'sscntcnccs. 'Ihc crucial diffcrcncc
approach to handling inference for Winograd's sentences. '111e crucial difTerenee
bctwccn thcsc d~coricsand thc onc prcscntcd hcrc is tl~atthc fi~irsingprcrcss
between these thcories and the one presented here is that the focusing process
prcdicts a n anaphor's co-spccificr and tllcn an infcrring prtrcss confirnis thc
predicts an anaphor's co-specifier and then an inferring process confim1s the
prediction. A contradictic~nmay bc rcachcd. which indicates &at thc cxpcctcd
prediction. /\ contr,tdictioll may he reached, which indicates that the expected
bc rcjcctcd. 'I'hc infcrring rnny bc trivial: for 1318. he as co-specifying
f t ~ u musl
s
focus mUSl be rejected. The inferring may be trivial: for lJl8, !Ie as co-specifying
Sirndy's dog is rcjcctcd bccausc dogs d o not lravc horns. Whcn inferring is
Sandy's dog is rejccted because dogs do not have horns. When inferring is
complcx, fwusing is irdvantagcous.
complex, focusing is advantageous.
Iai~usingsimplifies thc infcrcncc proccss bccausc it dclimits thc beginning and
Focusing simplifies the inference process because it delimitS the beginning and
end proposilions that thc inrcrriog prcccss uscs, and it gotcrtls which infcrcncc
cnd propositions that the inferring process lIseS, and it governs which inference
can hc takcn back if a contradiction rcsults. Scl~cmcssuch as Isncr's dcpcnd upon
can he taken back if a contr<tdiction results. Schemes such as Isner's depend upon
unification to bind tllc pronoun rllry to a "constant" noun phmsc. Whilc [Ricgcr
unification to bind the pronoun (!Ir)' to it "constant" noun phrase. While [Rieger
19741 ncvcr stated 11ow pronouns arc to bc rcsolvcd for his systcm, his
1974] never stated how pronouns arc to be resolved for his system, his
mctIic;dology for infcrcncc si~ggcstsitsing unification in a manner similar to lsncr.
met];(ldology for inference suggests llsing unification in a manner similar to Isner.
I3y contract, focu5 tccliniqucs "bind" tllc pronoun to thc specification of thc focus
By contrast, focus techniques "bind" the pronoun to the specification of the focus
and dlcn look for an infcrcncc chain that supports thc resulting scntcnce.
and then look for an inference chain lhnt supports lhe resulting sentence.
For Winograd's scntcncc and its dual, both givcn bclow, thc use of actor focus
For Winograd's sentence and its dual, both given below, the use of actor focus
co-spccifics thc city council in both scntcnccs (actor focus
rulcs prcdicts that rhcl~~
rules predicts that (he)' co-specifics the city council in both sentences (actor focus
n ~ l e sarc pcnincnt bccat~sc ~ h c yis an aclor in Ihc scntcncc "Lhcy fcarcd
mles arc pertinent because they is an actor in the sentence "they feared
violcncc".);
violence" .)~
D19-I (a)'l'hc city council rcfuscd lo givc the womcn a pcnnit because
Dl9-l (a) The city council refused to give the women a pennit because
thcy fcarcd violcncc.
they feared violence.
(b) 'Ihc city council rcfuscd to givc thc womcn a pcnnit
(b) The city council refused to give the women a permit
bccausc thcy advncatcd revolution.
because lhey advocated revolution.
For 1319-la, thc infcrcncc chaitl from "city council fcars violcncc" to "city council
For Dl9-la,the infercnce chain from "city council fears violence" to "city council
rchscs to givc thc pcrmit" would be cstablishcd by reasoning of thc following
refuses to give the permit" would be established by reasoning of the following
form:
form:
Form o f rcc~soning:
Form of reasoning:
- Find chain of infcrcncc from (CC fcars violcncc) to (CC rcfuscs (CC givcs
- Find chain of inference ffllm (CC fears violence) to (CC refuses (CC gives
l'crmit Women)).
Permit Women).
If (X rcfusc (X givcs Y 2))is dcfincd as causcd by cithcr:
- If(X refuse (X gives Y Z) is defined as caused by eilher:
X is-Scltish or
X is-Selfish or
(X wants Y) or
(X wants Y) or
(X Dislikcs 2)or
(X Dislikes Z) or
Sidncr
Sidner
290
thcrc is cvcnt (W) and (W is-undcsirablc-to X) and ((% havc Y) --> Occur
there is event (W) and (W is-undesirable-to X) and Z have Y) _A) Occur
W)
W)
- 'Ihcn the chain of infcrcncc must bc found bctwccn (CC fcars violcnce)
- Then the chain of inference must be found between (CC fears violence)
291
1 will now state rhc focusing algorithm which confirms or rcjccts thc expcctcd
1 will now state tlle focusing algorithm which confirms or rejects tl1e expected
~ ( K ' U S(fOund by thc cxpcctcd f t ~ ualgorithm
s
applicd lo the firs[ scntcncc in the
focus (found by tl1e expected focus algoritl1m applied to tl1e first sentence in tlle
disoursc), and in thc casc o f rcjcction, dctcrrnincs which phrasc is to move into
discourse), and in tlle case of rejection. determines which phrase is to move into
~(K'LIS:it is U S C ~for all scntcnccs of a discoursc cxccpt thc fir& whcrc thc cxpcclcd
foclIs; it is used for all sentences of a discourse except the first, where tl1e expected
ftcus ;~lgorilhniappliiu. 'I'hc algorihm lnakcs usc of scvcrirl data structurcs: Lhc
f(Jells algorithm appliL's. The algoritllm makes usc of several data structures: tlle
currcnt ftxus (C'l:). Lhc altcrnalc f i ~ u list
s (AI.J;I.), which is initialized to cither
current focus ('1'"). tlle alternate focus list (I\LFI.), which is initialized to either
thc dcraull cxpcclcd lisus list (l1F.F) or rhc potcniial fisus list (PFI.); and the
tlle default expected tilCuS list (DEF) or the potential focus list (PFL); and the
ftrus stack: the. lattcr two stnlcturcs havc not yct bccn introduced, and their
focus stack; the.lauer two stnJctures have not yet been introduced, and their
Sidncr
Sid ncr
292
292
purposc. as wcll 2s thc condition givcn in stcps 2 and 9. will bc discussed below.'
purpose. as well as the condition given in steps 2 and 9. will be discussed below.!
In d ~ calgorithm, thc tcrrn "discoursc initial" applies 10 t l ~ calgorithm's first
In the algorithm, the term "discourse initial" applies to the algorithm's first
opcration, which is on thc sccond scntcncc o f a discuursc, whilc "in progress"
operation, which is on the second sentence of a discuurse. while "in progress"
applics to any o f thc latcr scntcnccs.
applies to any of the later sentences.
'I'hc I:ocusi~~g
.\lgorithm
The Focusing Algorilhm
discourse
flX:US
I. Stcps 2 and 9 makc usc of rocus scls. 'Ihcsc will bc discuscd lartcr in thc chaplcr. S ~ c p7 of h c
I. Steps 2 and 9 make use of focus scts. These will be discus.~ed uHter in the chapter. Step 7 of the
algorilhm makcs usc or implici~spccilrca~ion.a conccpt which wilt bc no1 discurscd here: Ihe stcp is
usea-uof rc
implicit
specilication.
a concept
willspcdlicalion
be nOl discus.~ed
here: the
stepconsidering
is
algorithm
makes lo
includcd
complclcncss
of thc algoril
hm. which
lmplicil
is in~porlant
whcn
included to a'i5U re complcleness of the algorithm. Implicit !>flcdlicalion is important when considering
thc intcrprciation or rull dclinitc noun phrascs. a topic not includcd in [his chaplcr. I!or a disntssion.
full definitc noun phrases, a topic nol included in this chapleT. I'or a discussion.
the interprctation
sce [Sidncrof19791.
see (Sidner 1979).
293
Sidner
294
no fociunspccificd
mentioned. rel<tin
the CFtheas ftlCUS.
any
pronouns,
missingForco-spccificr
unspecified pronouns. the missing co-specifier
condition holds.2
condition holds. 2
'1'0 illustrate thc firusing algoriihm in action, ils bchavior will hc traccd during
To illustrate the fllCllsing algorithm in action, its behavior will be traced during
thc recognition of the initial ~ O C L Ior
S 116, wllic11 is rcpcatcd below:
the recognition of the initial focus of 1)6, which is repeated below:
1120-1 I .as[ wcck Ihcrc wcrc somc nice strawbcrrics in thc refrigerator.
D2D-l I.ast week
were
some
in me
refrigerator.
2 'l'hcyU1ere
cainc
from
our nice
foodstrawberries
co-op and wcrc
unustially
frcsh.
2 They came from our food co-op and were unusually fresh.
3 1 went to use h c m for dinncr. but sornconc had catcn thcm all.
3 1 went to usc IDem for dinner, but someone had eaten them all.
4 Iatcr I discovered it was Mark who had catcn thcm.
4 I.<lter I discovered it was Mark who had ealen mem.
5 Mark has a hollow lcg. and it's i~npossiblcto kccp food around
5 Mark has
a hollow
leg, and
irs impossible
to keep food around
when
his stomach
nccds
filling.
when his stomach needs filling.
D2O-I is a thcrc-inscrtit~nscntcncc, so stcp 1 of the cxpcctcd f o c ~ ~algorithm
s
D20-] is a U1ere-insertion sentence, so step 1 of the expected focus algorithm
indicates that the cxpcctcd focus is thc subjccl of thc scntcncc, that is, some
indicates that the expected [llCUS is me subject of the sentence, that is, some
s~mwberries.' I l c fi~usingalgorithm contains a nore that thc currcnt focus is set to
strawberries. The focusing algorithm contains a note that the current focus is set to
thc cxpcctcd focus, and thc Al.FI. be sct to the othcr phrascs in 1120-1, that is, last
the expected focus, and the ALFL be set to the Olher phrases in D20-1, mat is, last
week and the refiigeralor and thc vcrb phrase. I h c statc of the entirc focusing
week and the refrigermor and the verb phrase. 111e slate of lhe entire focusing
process at thc point in which D20-2 is cncountcrcd, is iltustratcd in Figure 2.
process at the point in which D20-2 is encountered, is illustrated in Figure 2.
1. Obligatory rclalions arc caws ofa verb lhal must he filled or thc mntcnce is odd as in "John sold."
L Oblillatory
relations are
cases
ofnot
a ~erb
that must
be filled
is odd as in "John
Non-obligatory
caws
nccd
bc lillcd:
in "John
~ or
~athe
book,"
l sentence
d onc non-obligatory
casesold."
is the pCMn
Non-obligatory
need was
not sold.
he filled: in "John sold a book," one non'obligatory ca.~e is the pcoon
lo whomcases
(hc book
to whom Z
theSce
book
was sold.
section
4.4 or his chapicr lijr a discussion of missing co-specifier uses,
2 See section 4.4 of this chapler for a discuS1;ion of missing co-specifier uses.
295
&ntenee:
D2~2
Sincc spcakcrs do not always talk about just onc thing. thc focusing process must
Since speakers do not always talk about just one thing, the focusing process must
provide for thc focus of thc discourse to changc. In M, thc strawbcrrics are
provide for the focus of the discourse to change. In 06, the strawberries are
discusscd for a fcw scntcnccs, and thcn the discussion moves to thc pcrson who ate
discussed for a few sentences, and then the discussion moves to the person who ate
thcm. Accounting for this movemcnt is ncccssary for anaphor interpretation
them. Accounting for this movement is necessary for anaphor interpretation
bccausc thc ncw discourse elcmcnt rnay be co-spccificd in latcr scnlcnccs. 'I'his
because the new discourse clement may be co-specified in later sentences. This
proccss of moving thc ftrus will bc callcd ftxlus movcmcnt. 'I'hc discnursc bclow
process of moving the focus will be called focus movement. The discourse below
illustratcs firus movcnicnt from nleeling to oflfic-e(in 1121-3) and back u) ~tleering
iIlustratcs focus moycment from meeting to office (in 1)21-3) and back to meeting
(in 1122-4).
(in 1)21-4).
1121-1 1 want to scl~cdulca mccting with Harry, Willic and Fdwina.
))21-1 I want to schedule.i! mcctingj with JHarry, Willie and Edwina.
2 Wc can usc my ufficq.
2 We can USC!!lY officci'
3 Ujkind of small,
3.l1Jii kind of small,
4 but &c rnccting. won't last long anyway.
4 but~~j won'tJ last long anyway.
Sidner
296
1.
1. Note that
lhat itir cannol
cannot ro'spccify
m-spedfy with
wiUl the
Ule meeting
mecling because
bemuse lhe
h e tense
tense of
af the
Ule sentence
m t e n c e rules
~ l e oot
out
r the
lhe
co-specification.
rnrpedfieation.
297
occur in one scntcncc. somc mcans of choosing a potcntial focus is nccded.
occur in one sentence. some means of choosing a potential focus is needed.
Syntactic clcft constructions, as I have noted, indicate focus movcmcnt. In
Syntactic cleft constructions, as I have noted, indicate focus movement In
addition, onc would cxpcct that thc thcmc should bc the prcfcrrcd position for a
addition, one would expect that the theme should be the preferred position for a
potcntial focus. In fact the samc ordcr used for dcfault cxpcctcd foci can bc used
potential focus. In fact Ule same order used for default expected foci can be used
for potcntial firi, cxccpt that thc phrasc which confinns thc fixus is not included
for potential foci. except that the phrase which confinns the fiJeus is not included
in thc list bccausc it cannot bc a potcntial focus. In English scntcnccs, phrascs
in the list because it cannot be a potential f{JeLlS. In English sentences, phrases
which rncntion ncw information tcnd to occur towards thc cnd of thc scntcncc,
which mention new information tcnd to lJecur towards the cnd of the sentence,
wliilc old infcwmatian cxcurs at thc beginning: this niclllod of choosing potcntial
while old information occurS at thc beginning: this meUlOd of choosing potential
ftxus capturcs tliat bcllavior.
focus caplUrcs that behavior.
In summary. thc algori~limfor dctcrlnining rllc polc~itinlfirus list (PFI.) is:
In summary, the algorithm for determining Ule potential focus list (PFJ.) is:
1.
Sidncr
Sidner
298
298
n ~ l c .Whcn two diffcrcnt anaphors co-specify with Lhc focus and a potcntial focus
rule. When two different anaphors co-specify with the focus and a potential focus
rcspcctivcly, and only onc of thc anaphors is a pronoun, thcrc is no certainty about
respectively, and only one of the anaphofS is a pronoun. there is no certainty about
which will bc focuscd on in thc ncxt scntcncc. Yct, thcrc sccms to bc a prcfcrcncc
which will be focused on in the next sentence. Yet, there seems to be a preference
for thc f t ~ uto
s bc markcd by thc pronoun use. In othcr words, pronouns scern to
for the focus to be marked by the pronoun usc. In other words, pronouns seem to
support a filcus more strongly than anaphoric dcfinitc noun phrases do. Consider
support a focus more strongly than anaphoric def1nite noun phrases do. Consider
thc casc below:
the case below:
1324-1 1 got a ncw hat
1)24-1 I got a new hat..
2 and I dccoratcd it with a big rcd bow.
2 and I decorated it with a big red bow.
3 (a) I think thc how will brighlcn it up a lot. (it = thc hat)
3 (a) I ulink the how will brighten it up a lot. (it = the hat)
(h) I rhink it will bl-ightcn up thc hat a lot. ( i l = the bow)
(b) I think it will brighten up ule hat a lot. (it = the bow)
4 If not, I think 1'11 still usc it.
4 If not, I think I'll still usc it.
Aftcr 1124-2, d ~ focus
c
is thc hat, eel-spccificd by if, and the potcntial fixus list
After 1)24-2, Ule focus is the hat. co-specified by iI, and the potential focus list
inch~dcsa big rcci buw. t'irhcr fonn of 1124-3 uses anaphors which co-spccify the
includes a big red buw. Eiuler fonn of D24-3 uses anaphors which co-specify the
hat and ihc bow. 1124-4 is syn~acticallyi~ndscmnntically ncutral on thc choicc of
hat and the bow. 1)24-4 is syntactically and semantically neutral on ule choice of
hat or bow as antcccdcnt of ir. llowcvcr. if thc scqucncc 1124-3a and 1324-4
hat or bow as antecedent of if. However. if Ule sequence 1)24-3a and 1)24-4
occurs. the i~co-specifics hat, wliilc if 112.1-3b is uscd. thc bow is sliglltly prcferred.
occurs, the if co-specifics hat, while ifD24- 3b is used, the bow is slightly preferred.
'l'his cxamplc suggcsts that i~nlikcthc gcncrnl casc. thc clcrncnt co-spccificd by the
This example suggests thilt unlike the gcneral case, the dement co-specified by the
pronoun should bccomc thc fixus. 1-lic sccond condition for focusing is:
pronoun should become the focus. The second condition for focusing is:
Whcncvcr both the current focus and a potcntial f r ~ u arc
s co-spccificd but only
Whenever both the current foclls and a potential focus are co-specified but only
onc of them is co-spccificd by a pronoun, ttic focus is dctcrnlincd by thc pronoun
one of them is co-specified by a pronoun, the focus is detemlined by the pronoun
co-spccificr. 'I'his condition appcars in the ft~usingalgorithmas stcp 3.
co-specifier. This condition appears in the focllsing algorithm-as step 3.
Steps 2 and 9 of the focusing algorithm distinguish a feature uniquc to focus
Steps 2 and 9 of the focusing algorithm distinguish a feature unique to focus
confirmation. The focus sct initialization steps in d ~ cfocusing algorithm are
confirmation. The focus set initialization steps in the focusing algorithm are
dcsigncd to rccognizc a discoursc situation illustrated in D25.
designed to recognize a discourse situation illustrated in D25.
1325-1 lohn and Mary sat on thc sofa and playcd cards.
])25-1 John and Mary sat on the sofa and played cards.
2 Flcnry rcad a book.
2 flenry read a book.
3 At 10 p.m. thcy went to Jocy's I3ar to hcar a ncw r t ~ group.
k
3 At 10 p.m. they went to Joey's Bar to hear a new rock group.
Aftcr D25-1 and 2, Lhc fixus of this discoursc is not sofa cards or book. It
After D25-1 and 2, Ule filCUS of this discourse is not sofa, cards or book. It
appears that D25 is about John, Mary and Hcnry and what thcy did for an
appears that 025 is about John. Mary and Henry and what they did for an
evening. In othcr words, thc focus in D25 is collcctcd ovcr scveral scntcnces. The
evening. In other words, the focus in 025 is collected over several sentences. The
cxpcctcd focus algorithm will choosc soh as cxpcctcd fixus, wliilc thc focusing
expected focus algorithm will choose sofa as expected fiJeus, while the focusing
algorithm without stcp 9 would confirm the cxpcctcd focus sincc no anaphors
algorithm without step 9 would' confirm the expected focus since no anaphors
occur in 1125-2. '1'0 capturc thc ftsuscollcction and to prcvcnt confirmation whcn
occur in 1)25-2. To capture the f(Jeus collection and to prevent confirmation when
no anaphors occur in thc ini~ialpart of a discoursc, fixus sct collcction is uscd. In
no anaphors occur in the initial part of a discourse, f(Jeus set collection is used. In
collccting focus scts. discoursc itcms in the salnc thcn~atic position in each
collecting focus sets. discourse items in ule same ulematic position in each
scntcncc arc collcctcd as onc sct. For 1125, this rncthod makcs scts of (1)cards
sentence arc collected as onc set For D2S, this method makes sets of (I) cards
and book, (2) John, Mary and Hcnry. and (3) sitting, playing and reading actions.
and book, (2) John, Mary and Henry, and (3) sitting, playing and reading actions.
Whcn 1125-3 is proccsscd through the ftxusing algorithm, they co-spccifics with
When 1)25-3 is processed through the f(Jeusing algorithm, they co-specifics with
John, Mary and Hcnry, and so thc thrcc will bc chosen as discoursc focus.
John. Mary and Henry, and so the three will be chosen as discourse focus.
'I'hc informal description of focus movcmcnt given hcre illustrates that focus
The informal description of focus movement given here illustrates that focus
299
confirmation and focus rnavcmcnl arc similar behaviors. So similar arc these
confirmation and focu~ movement arc similar behaviors. So similar arc these
bchaviors h a t thcy can bc fonnally dcscrihcd by one algorithm. That algorithm
behaviors that they can be fonnally described by one algorithm. That algorithm
providcs a statcmcnt of control flow and the dclails of thc conditions for
provides a statement of control flow and the details of the conditions for
ctmfinning or rcjccting a focus.
confinning or rejecting a focus.
'I'hc significancc of :I single algorithm for both pnxcsscs must not be
The significance of a single algorithm for both processes must not be
o\crlookcd; thc onc algorithm providcs n uniform trcatmcnt of two phcnomcna
OIerlooked: the one algorithm provides a uni!ixrn treatment of two phenomena
which at first gliincc appcar unrcl;itcd, namely, cxpcctcd f i ~ u confirmation
s
and
which at fiN glance appear unrcl<l!ed. namcly. expected focus confinnation and
ftrus Inovclncnt. l:i~rlllcrmorc. thc onc algoritJ31n indicatcs just how the two
focus movement. Furthermore. the one algorithm indicates just how the two
proccsscs arc silnilar. 'I'hc similarit!, in cxpcctcd firus cttlifirrnation and focus
processes arc similar. The similarity in expected focus confirmation and focus
cxpcctcd foci and
moycnlcnt can bc cxtcndcd bcyond tllc p;~rallclbctwccn dcri~~ilt
movement C,1n be extended beyond the parallel between default expected foci and
potcnti;rl f i ~sincc
i
thcrc arc syntactic structures which mark fwus rnovcmcnt just
potential foci since there arc syntactic structures which mark focus movement just
as thcrc arc syntactic forms which mark initial fi~cus. hlorc impo~tantly,the
as there arc syntJctic ulrms which mark initial focus. More impol1antly, the
focusing algorithm shows h a t cxpcctcd f i ~ u confirtnation
s
i ~ n dfocus movcmenr
focusing algorithm shows that expected focus confirmation and focus movemem
arc both prczcsscs which rcqiiirc additiotlal mention of rhc clcmcnt in focus to
arc both processes which require additional mention of the clement in focus to
contirm h c choicc.
Cllnfinn the choice.
300
Sidner
FOCUS:
FOCUS:
I . Current implcmcntn~ionsor thc ftrusing p r c n x have becn dcsigncd with a simple lasl-in fial-out
I. Current implementations of the focusing (lroc($.~ have been designed with a simple last-in first-out
stack to pop loci unlil thc propcr focus is four~d.sincc this lypc of slack rcflecls Ihc popping back in a
stack to pop foci until the prollcr focus is found, since lhis typc of slack rcflcc15 Ihe popping back in 11
d~scourse.
discourse.
301
what is bcing
being talked
because of
talkcd about bccausc
of its distinguishing noun
noun phrase
phrasc head.
Pronouns
can bc
be used
Pronounscan
used as wel1.
wcll, but their usc is more
morc restricted
rcstrictcd and governed
govcrncd by the
interpreter
intcrprctcr rules,
rulcs, which prevent
prcvcnt a pronoun from co-specifying a stack item
itcm if
if the
discoursc or actor foci
firi arc acceptable
acccptablc co-specifiers.
co-spccificrs. The
I l i c interpreter
intcrprctcr roles,
ntlcs, in
in
discourse
conticction with the
thc foclis
k r u s algorithm.
algorid~m,impose
impost a kind
kind of
o f :'Il.1cKCd
stickcd focus
firus constraint
connection
that maintains
foclis
In
mainbins localized
l~~calizcd
firus movement.
~n~~\,cmcnt.
I n 1127
1127 a
n pronoun
pronoun may be
bc used
i ~ s c dto pop
b:.lck
uld focus.
fizus. There
'I'hcrc the
tlic focus
fucus begins
bcgins on career
carccr in
in law and moves
tnovcs to the
back to an
an old
o f dle
thc speaker,
spcakcr. with
will1 a potcnti;tl
k r u s of
o f (heir
/heir jobs. 'Il,e
l l i c i(i/ in
i n the last
friends of
potential focus
scntcncc co~spccifics
a]-specifics with a law career
cdrccr <lnd
and rccstablishcs
rc-cstablishcs it as focus.
sentence
1)27-1
1127-1 A: Have
Havc you ever
c w r thought
ihought of
o f a;Icareer
carccr in
in law?
law?
2 Il:
~llked with
Ii:I have
havc some
sumc friends
fricnds who
whc~arc lawyers.
lawycrs, and I've
i'vc C~lkcd
thc~nabout thcir
thcirjobs,
f i ~me.
r
them
jobs. but I don't think it's filf
In
because the
at D268
somc readers
rcadcrs find \)26
I126 difficult
diflicult bcc:~usc
tlic transition
transiti~~n
1126-8 is
I n contrast
contrast some
;~brupt for
f i ~ ra
;I pronoull
prn1101111
bc used,
ilscd, especially
cspcci;tlly if
i f anyway
n!lynsay is
is dl'1clcd
dclctcd from the
too abrupt
to be
sentence;
/he bouk
Luok in
in place
pl;sc of
of it;I seems
sccnls to be
bc more
morc acceptable.
acccpti~hlc. 'Illis
'lhis suggests that
scnicncc; the
stacked focus
firus constraint should be
bc modified
niodificd so that
h a t pronouns arc
are not
the stacked
111 co-specify with items
itcms lllorC
111orcthan one
onc position
position back in
in the stack.
intcrprctcd to
1l;lthcr than propose
propusc such
sucll a modification. some
somc additional evidence
cvidcncc about
Rather
discourse
disaursc must first bc
hc considered.
considcrcd.
Thcre
'lhcre arc circumstanccs
circu~nswnccswhcrc the stackcd
swckcd focus
firus constraint,
constraint. in
in its original fonn
form
or modified
modified as suggcsted
suggcstcd above,
ahow. fails to account
account adequately
adcquatcly for,
for some
solnc language
language
bchavior.
behavior. In
I n some
sumc discourses, a pop back to an
an old focus
focus can
can occur with a
pronoun
thc pronoun
pronoun could co-specify with the discourse
discoursc or actor
pronoun cven
cvcn though the
focus.
focus, and
and even
cvcn when
whcn many
Inany foci
f w i intcf\'cnc.
intcrvcnc. In
I n thcse
thcsc cases
cascs the non-local, popping
back movcment
rnovcnlcnt occurs
occun not only
unly because
hccausc a pronoun
pronoun was used, but
hut also because
bccause the
hearcr
hcarcr is aware
awarc of
o f other
othcr stmcturcs
stnlclurcs that help
hclp him
him or her
hcr to disccrn where
whcrc to
t11pop to.
to.
These
'lhcsc slructurcs
svucturcs make
makc it
i t possible
passiblc to usc a pronoun
pr1111ounto move
movc the
thc focus back without
confusion.
below. ili / co-specifics with the
cx;~mplcbelow,
thc pump, not the ratchet
ratchet
confusion. In
I n d,e
the example
discussion when
whcn \)289
1128-9 occurred. It
It appears that in
wrcnch. which was under discussion
wrench.
convcrsations,
conversations, words suchsuch as ok in
in combination
combination with a discoursc
discoursc that rcflccts
rcflccls the
lnsk
possible for speakers
wsk structure
structurc defined
dctincd by [Grosz
[Grosz 1977J.
19771, make
makc it possiblc
speakers to use
pronouns
lo co-spccify
CII-spccify with a phrase
phrasc from an earlier
carlicr part of
o f thc
the discourse.
pronouns La
Sidner
302
To make clcarcr how Ihc focusing algorithm is uscd For fixus movcrncnt, let us
To make clearer how the focusing algorithm is used for focus movement, let us
trace thc action of the algorithm on thc following cxamplc discourse.
trace the action of the algorithm on the following cxample discourse.
1129-1Alfrcd and 7~1har
likcd to play baseball.
D291 Alfred and Zohar liked to play baseball.
2 'l'hcy playcd it cvcryday aftcr school beforc dinncr.
2 They played it everyday after school before dinner.
3 Aftcr lhcir gamc, Alfrcd and % ~ h ahad
r ice crcam cones.
3 After their game, Alfred and Zohar had icc cream cones.
4 lhcy tastcd rcally good.
4 They tasted really good.
5 Alfrcd always had rhc vanilla supcr scooper,
S Alfred always had the vanilla super scooper,
6 while Zollar tricd thc flavor of thc day cone.
6 while Zohar tried the flavor of the day cone.
7 Afrcr thc concs had bccn eaten.
7 After the cones had been eaten,
8 thc boys wcnt home to study.
8 the boys went home to study.
Using thc cxpcctcd focus algorithm. thc cxpcctcd f i ~ u for
s 1129-1 is baseball
Using the expected focus algorithm, the expected focus for ])29-1 is baseball
(it is t l ~ cthcn~cof thc vcrh complcmcnt). 'I'hcrc arc two pronouns in 1329-2, but
(it is the theme of the verb complement). There are two pronouns in ])29-2, but
only onc is considcrcd by thc discourse firusing algorithm bccausc [hey is in agcnt
only one is considered by the discourse focusing algorithm because they is in agent
position. As shown bclow, bcc:rusc ir co-specifics with thc cxpcctcd firus. the
position. As shown below, because it co-specifics with the expected focus, the
cxpccrcd f t ~ u iss confirmcd its ftxus.
expected focus is confirmed as focus.
303
Sidncr
304
305
pr(Im1lln.
;lnd then
thcn if
i f that focus
firus is unaccept.lhle,
unacccpt~blc,checks alternate
altcrnatc candidates
candidatcs in
in
pronoon, and
Hy "end-of-sentence
"cnd-of-scntcncc pr~rcssing."
Incan that prcmauns
intcrprctcd
turn. By
processing," I1 mean
pronouns arc not interpreted
Uic entire
cntirc sentence
scntcncc has
has heen
hccn syntactically
syntactic:~llyand semantically
scni:~nticallyinterpreted.
intcrprctcd. Both
until O,e
u f these
Uicsc criteria can
can he
hc given
givcn lip
up without undermining
undcrtnining the
thc focusing
hrusing theory.
theory. One
One
of
col~ldcl1\'ision
cn\.ision processing
prl~cssingin
in parallcl
pnrallcl by checking
checking the
thc foci and alternates
altcrnatcs and
and then
could
determining
thosc candidates
candidatcs
dclcrrnining the
thc pronuun's
prt~nuun'sspecification
spccification from an
;in ordering of
i ~allf those
that
for interpretation
th:tt meet
111ccrthe
thc criteria of
o f choice:
ch~~icc:
intcrprct;ltiun of
o f pronouns
pronoonc before
bcfurc the
Uic end
cnd of
of
the
lonccivahly
tlic sentence.
SCII~CIICC.
~ i ~ ~ i c c i v a the
tlic
b l y pronollll
p r ~ n o u ninterpreter
intcrprctcr could
col~ldchouse
chousc a specification
spccification
frum <l\'<lilahlc
;~v;iili~hlcin
inL~nn;ltion
;~ndthen
Uicn review
rcvicw it as more
murc {)f
Of the
Ulc sentence
scntcncc is
is
from
furmation and
prtrcsscd.
'lhcsc
two
ituplicit
pr~rccssing
assuniptic~ns
1h;l
v
c
hccn
tnadc
bccnuse
processed. These
implicit processing assumptions h;:IVC been made because
Uicy simplify the
tllc account
accoltnt of
o f focusing
fixusing and
and because
hccausc they
tlicy reflect
rcflcct an implemented
i~nplcmcntcd
they
version
syslc~ilwith focusing.
fircusing. Further
FurUicr research
rcscarcll will indkate
indicatc whether these
vcrsion of
o f a system
assumptions
assuniptions arc too
t i x ~strong--ifso,
sm~ng--ifso. the
thc focusing
firusing theory
tlicory can be
hc revised.
rcviscd.
5.3.1 Using
I:ocss for
lor Pronoun
I'ronot~nInterpretation
Intcrprut:~tionRules
Rules
Using Focus
1.0begin
hcgin the disclission.
discussion. let
lct us consider a
3 pronoun
pro~iouninterpretation
intcrprctatian rule
rulc that follows
follws
To
natur;llly from the
thc discussion
disct~ssionof
o f focus
firus and focus
firus mov,me11l
moverncnl in
in the
thc first half of
o f this
naturally
chaptcr. This
'l'his rule
rulc is not adequate
adcquatc for reasons
rcasons II will discuss
discuss hclow
chapter.
below and will be
revised
rcviscd over
ovcr the
thc course
coursc of
o f the
thc section.
scction.
R1:
Ill: If O,e
UCI pronoun under
undcr interpretation
intcrprctation appears
appcars in
in a sentence
tlrcm:~licrelation
rclalion uther
uthcr than agenl,
agcnc choose
choosc the
Uic discourse focus
focus
thematic
as the
Uic co-specifier
co-spccificr unless
unlcss any of
o f the syntactic,
syntactic. semantic
scmantic and
infcrcntial knowiedge
knowlcdgc constraints
constraints rule
rulc out the choi.ce.
chaise. If the
inferel1lial
pronoun appears
appcars in
in agent
agcnt position. choose
choosc the actor focus
focus as
pronoun
co-speciJier
thc same
wme way.
co-spccilicr in
in the
lluling out a co-specifier
co-spccificr on the hasis
hasis of
o f syntactic
synwctic and semantic
scmantic constraints
constraints is
Ruling
accomplished
thc various
various syntactic
synwclic relationships
relationships and restrictions
rcstrictions
accomplished by computing the
(such as Lasnik's
l.asnik's disjoint feference
rcfcrcncc fules)
rules) and by use
usc of
of semantic
scmantic selC'ctional
sclrctional
rcstrictions (such
(sucll as those
t l i ~ ~ sof
ocf Fodor
141dor and
and Kat/.
Katr discussed in
in the
thc beginning
beginning of
o f this
restrictions
chaptcr) on sentence
scnrcncc thelnutic
Uic~natic categories.
catcgorics.
knowlcdgc, the
chapter)
For inferential knowledge,
pr~rcssdiscLissed
discuwcd earlier
carlicr is
is used.
uscd.
inferring pnK'ess
When
sugjicstcd co-speciJier
co-spccilicr for
fur a pronoun
pronoun must be
hc given
givcn up,
up. R1 docs not
Wllcn a suggested
suggcst how to proceed
pnxccd in
in using
using either
ciUicr the
thc ,(ctor
istor or the
thc discourse focus
f < r i ~ to
s find
suggest
somc other
otlicr choice
choicc for a co-specifier.
co-spccilicr. It
I t is here
hcrc the
thc potential
are used, as
some
potential foci arc
illustrated
below,
illustratcd in
in the example
cxamplc bclow.
Sidner
306
307
constraints on syntax, scmsntics and itifcrcntial knowlcdgc. Howcvcr, some
constraints on syntax, semantics and inferential knowledge. However, some
additional pragmatic critcria nccd to bc addcd to dcal with thc intcractions
additional pragmatic criteria need to be added to deal with the interactions
betwccn actor and discoursc foci. bccausc a pronoun in agent position may
between actor and discourse foci, because a pronoun in agent position may
co-specify with thc discoursc fcxus rathcr than thc actor focus! Onc such examplc
co-specify with the discourse focus rather than the actor focus! One such example
of this observation is below.
of this observation is below.
D30-1 I haven't sccn Jcff for scvcral days.
D30-1 I haven't seen Jeff for several days.
2 Carl thinks hc's studying for his exams.
2 Carl thinks he's studying for his exams.
3 but it's obvious tonic Illat hc wcnt to thc Capc wilh Linda,
3 but it's obvious to me that he went to the Cape with Linda.
Although Carl is thc actor focus aftcr 1130-2. and he in D3O-3 is an agent
Although Carl is the actor focus after D30-2, and he in D30-3 is an agent
tlicmatic rclation in thc cmbcddcd scntcncc. thc prtipcr choice Tor tile co-spccificr
thematic relation in the embedded sentence, the proper choice ror the co-specifier
of ht. is Jcff. Italhcr than suggcst that Carl is considcrcd as thc co-spccificr and
of he is JetT. Rather than suggest that Carl is considered as the co-specifier and
thcn is rulcd oul (which sccms unlikely as thcrc arc no syiitactic, semantic or
then is ruled out (which seems unlikely as there arc no syntactic, semantic or
knowlcdgc constraints whicli climinatc it). onc might ask whclhcr Carl is ever
knowledge constraints which eliminate it), one might ask whether Carl is ever
cvcn considcrcd for prt~cssing. I n D30 it appcars not. 'Ihc discussion is about
even considered for processing. In D30 it appears not. 'Ille discussion is about
Jcff. who is introduced first. whilc Ulc actors sccln rclativcly incidcntal. In general
JetT. who is introduced first, while the actors seem relatively incidental. In general
such cascs arc rcsolvcd bccausc if the disr-oursc focus is animate and is established
such cases arc resolved because if the discourse focus is animate and is established
carlicr in thc discoursc than tl~cactor f i ~ u s thc
. dieoursc focus takcs prcccdence
earlier in the discourse than the actor filCUS, the discourse focus takes precedence
cvcn for pronouns in agcnt position. In csscncc this approach takcs thc discourse
even fOf pronouns in agent position. In essence this approach tilkes the discourse
focus as pritnary, the discoursc focus being what thc spcakcr is tajking about so far
focus as primary, the discourse focus being what the speaker is talking about so far
whitc ihc actor focus is Ltlc locus of infomiation about actions in thc discoursc.
while the actof focus is the locus of information about actions in the discourse.
'Ihis prcccdcncc rulc docs nor indicatc what to do whcn thc discoursc focus
'\his precedence rule docs not indicate what to do when the discourse focus
and actor focus arc both animatc, havc thc same gcndcr, number and person, and
and <Ictor focus arc both animate, have the same gender, number and person, and
arc both cstablishcd during the samc scntcncc of the discoursc. lntcrcstingly
are both established during the samc sentcnce of the discourse. Interestingly
cnough, pcoplc somctimcs havc difficulty choosing intcrprctations in such
enough. people sometimes have difficulty choosing interpretations in such
circumstances. In D31-2a below, he co-specifics with Johrt.(thc actor focus) but jf
circumstances. In D31-2a below, he co-specifics with John(the actor focus) but if
1131-2b followcd D31-1, he may co-specify with cithcr John or Mike (the expected
DJl-2b followed DJH, he may co-specify with either John or Mike (the expected
discourse focus).
discourse focus).
D31-1John callcd up Mikc yesterday.
031-1 John called up Mike yesterday.
2 a. Iic wanted to discuss his physics homework.
2 a. He wanted to discuss his physics homework.
b. Hc was studying for his driver's test.
b. He was studying for his driver's test
In these cases, nativc speakers rcport that thc co-spccificr for thc pronoun is
In these cases, native speakers report that the co-specifier for the pronoun is
ambiguous. If the pnlnoun fills an agcnt rclation, thc actor focus is prcfcrrcd, but
ambiguous. If the pronoun fills an agent relation, the actor focus is prefcrrcd, but
this prcfcrcncc is not a strong onc. It appcars h a t in such cascs thc ambiguity may
this preference is not a strong one. It appears that in such cases thc ambiguity may
not hc casily rcsolvcd unlcss additional information about thc two fwi is known
not be easily resolved unless additional information about the two foci is known
that stipulates that thc scntcncc is truc ofonly one.
that stipulates that the sentence is true ofonly one.
What about thc intcrprctation of posscssivc pronouns such as their in 1129-3?
What about the interpretation of possessive pronouns such as their in D2931
Sidner
Sidncr
J08
308
'l'hc general rulc for posscssivcs' can bc formulatcd as: if thc discoursc fixus and
The general rule for possessives! can be formulated as: if the discourse focus and
actor ftlcus wcrc not cstablishcd in thc samc scnrcncc of thc discourse, Lhcn the
actor focus were not established in the same sentence of the discourse, then the
discoi~rscfocus is thc co-spccificr (if acccptablc on thc untal grounds); if d ~ e
discourse focus is the co-specifier (if acceptable on the usual grounds); jf the
diroursc focus was unacceptable. thc actor focus is chcckcd for acccptability and
discourse focus was unacceptable, the actor focus is checked for acceptability and
that failing, thc po[cntial discoursc foci arc considcrcd; if both wcrc cstablishcd in
that failing, the potential discourse foci arc considered; if both were established in
thc snmc scntcncc, tlic usc will bc ambiguous. As in thc intcrprctation of pcrsonal
the same sentence, the usc will be ambiguous. As in the interpretation of personal
pronouns, this n ~ l cshows that tllc discourse ftxus takc prcccdencc as the
pronouns, this rule shows thilt the discourse focus take precedence as the
co-spccificr. Unlikc thc casc of pcrsonal pronouns. {heir is nlnbiguous bctwccn
cospccifier. Unlike the case of person'll pronouns, their is ambiguous between
agcnts and othcr discoursc clcmcnts: whcn thc p~.cccdsnceof ihc discoursc focus
agents and other discourse clements: when the precedence of the discourse focus
cannot bc cst~l~lishcd,
ambiguity occurs. Only when lhc discoursc fixus is not a
cannot be established, ambiguity occurs. Only when the discourse focus is not a
possihlc co-spccificr. ciln the actor f t ~ u I)c
s cclnsidcrcd witliout ambiguity.
possible co-specifier. can the actor focus he considered without ambiguity.
Anotlicr sourcc of a~nbiguityoccurs whcn an actor and onc potcntial actor are
Another source of ambiguity occurs when an aClor and one potential actor are
both prcscnt in s prciio~~s
scntcncc wlicrc thc discoursc ftxt~sis a non-animate
both present in a pre\iOllS sentence where the discourse focus is a non-animate
clcmcnt. An cxa~nplcis givcli bclow. Supposc that scntcncc (21) bclow followcd
clement. An eX'lmple is given below. Suppose that sentence (21) below followed
c;rh of (22). (23), (24) and (25).
each of(n), (23), (24) and (25).
(?I) klc knows;~lot about high cncrgy physics.
(21) He knows a lot about high energy physics.
(22) l'rof. llarby will tcll hlonty ahout tlic ncutron cxpcrimcnt.
(22) Prof. Darby will tell Monty about the neutron experiment.
(23) Prof. I3arby will lccti~rcMonty on h c ncutron cxpcrimcnt.
(23) Prof. Darby will lecture Monty on the neutron experiment.
(21) Prof. 13arhy will Iiclp Monty with L ~ nc~~trgn
C
cxpcriment.
(24) Prof. Darby will help Monty with the neutron experiment.
(25) I'rof. Ilarby ill tcach Monty about thc ncutron cxperimcnt.
(25) Prof. Darby will te<Jeh Monly about the neutron experiment.
Somc nativc spcakcrs find all of thcsc scntcncc pain ambiguous, while some
Some native speakers find all of these sentence pairs ambiguous, while some
nativc spcakcrs find only thc pair (22) followcd by (21) ambiguous. These
native speake~ find only the pair (22) followed by (21) ambiguous. These
cxamplcs arc surprisingly similar to D31. Fiow do snmc spcakcrs dccidc that he
examples arc surprisingly similar 10 D31. How do some spcakers decide that he
co-spccifics with hlonty or Prof. Ilarby? It appcars that lhcy makc a comparison
co-specifics with f\1onty or Prof. Darby? It appears that they make a comparison
and chonsc bctwccn thc actor focus and thc potcntial actor on the basis of
and choosc between the actor focus and the potential actor on the basis of
cvidcncc for thcir prcfcrrcd intcrprctation. When that cvidcnce is not
evidence for their preferred interpretation.
When that evidence is not
forlhcuming, informants arc confuscd. Such a bchavior suggests that tllc inferring
forthcoming, informants arc confused. Such a behavior suggests that the inferring
proccss postuintcd thus far should bc capable of a spccial judgrncnt whcn given
process postulated thus far should be capable of a spccial judgment when given
onc actor and onc potcnrial actor; it must wcigh its findings, and clioosc onc of the
onc actor and onc potential actor; it must weigh its findings, and choose one of the
t w o candidates as supcrior.
two candidatcs as supcrior.
What kind of cvidcncc can bc uscd in such cascs? 'l'hc hcarcr knows that thcrc
What kind of evidence can be used in such cases? The hearer knows thaI there
is sonic pcrson namcd Monty, who is probably malc, that thcrc is a professor
is some person named Monty, who is probably male, that there is a professor
n;~mcdl)arby, possibly nialc as well, that Ilarby is giving information to Monty
namcd Darby, possihly male as well, that Darby is giving information to Monly
~ I H I I I SOIIIC
~
physics cxpcrimc~it,tllc cxpcrimcnt bcing mnrkcd by thc dcfitiite
ahout somc physics cxperimcnt, the cxperiment being marked by thc definite
ai-tick ;aknt~wnto llic hcarcr, as wcll as somc critcria for dctcnnining who knows
article as known to the hearer, as wcll as somc criteria for detennining who knows
a lot about high cncrgy physics in thc context of tcaching. hclping, or lecturing.
a lot about high encrgy physics in the (wntext of teaching, helping, or lccturing.
1. Somc spcciaf cases rcquirc a Tcw addllional rulcs (ha1 arc discuswd Tully in [Sidncr 19791
1. Some SJlccial ca~ require a few additional rules thaI arc discussed fully in [Sidncr ]979].
309
A c o m p u t ~ t i o ~syslcm
li~
that makes such judgmcnts musf havc a vcry rich
II. computatiollal system that makes such judgments must have a very rich
knowlcdgc basc (that is. to know that Monty is 3 ~nalcnamc. that prclfcsson can
knowledge base (tilat is, to know that Monty is a male name, that professors can
bc malt. Lhnt professors arc cxpcrts. fiat neutron cxpcri~ncnts arc physics
be male. Ulat professors are experts, that neutron experiments are physics
cxpcri~ncnts)and hc ahlc to draw infcrcnccs from that basc. Nonc of this is
experiments) and he ahle to draw inferences from that base. NOlle of this is
surprising: howcvcr, a co~npotationalframework Tor carrying out such subtle
surprising: however, a computational framework for carrying out such subtle
judgnlcnts is still hcyond thc state of tllc art. 'l'llc mattcr of wcighing cvidcnce to
judgments is still heyond the state of the art. Tile matter of weighing evidence to
is si~nitarto thc sc~nanticcl~oiccincchanism
dccidc bctwccn t w o ciitidid;~~cs
decide hetween two candidates is similar to the semantic choice me(;hanism
postulated by [Marcus 19801 fi)r p'lrsing ccriain kinds of slructurcs such as
postu1<ned by [M,lrcUS 1980) for parsing certain kinds of structures such as
~ ~ ' c p o ~ i t i ophrnscs.
n;~I
'Ibis dcvicc, iihcn i~ltachingprcposilional phrascs. asks the
prepositional phrases. 'Illis device, when allaching prepositional phrases. asks the
scm;i~lticprtrccssor ; ~ h t ~its
u t prcfcrcr~ccsfur making tlic attachincnt.
semantic processor ah(llit its preferences for making the atwchmenl.
In sttinmary. tliis kind of ambiguity can hc gcncrillly handlcd with thc
[n summary. this kind of amhiguity can be genemlly handled with the
following condition.
rllllowing condition.
POTENTIAL ACTOR AMIlIGUITY CONDITION:
pronouns can bc found in [Sidncr 19791. In this scction wc havc considcrcd thcir
pronouns can be found ill lSidner 1979). In this section we have considered their
motivation and gcncral form. 'I'hcsc rulcs rcprcscnt what can bc said about
motivation and general fonn. These rules represent what can bc said about
pronoun intcrprctation in thc abscncc o f any additional information in knowledge
pronoun interpretation in the absence of any additional infonnation in knowledge
rcprcscntation bcyond that suggcstcd in thc discussion of co-specification. 1.0
representation beyond that suggested in the discussion of co-specification. To
intcrprct ccrtain pronouns, such as thosc whcrc a co-spccifying phrasc docs not
interpret certain pronouns, such as those where a co-specifying phrase docs not
prcccdc thc pronoun in thc discourse. as in Lhc previously givcn 111, we must
precede the pronoun in the discourse, as in the previously givcn Ill, we must
considcr how knowlcdgc is structurcd and rcprcscntcd. It is to this mi~ttcrthat wc
consider how knowledge is structured and rcpresented. It is to this maller that we
now turn.
now turn.
Sidner
310
1. Some inrormanls find this example unacceptable hglish because they cannot dcdde whether she
1 Some informanLs find this example unacceptable r:nglish because they cannol dcdde whether she
in DJZ-2 is Wendy or someone else.
in D322 is Wendy or someone else.
311
A
possible reading
A tilird
third representation
rcprcscntatioli (R3) (which is a possiblc
rcading only
for s"me
be derived from Rl
somc speakers)
spci~kcrs)can bc
R1 by interpreting
intcrprcting a
crayon a'S
as a unique
uniquc set
sct having
hi~vingonly one
onc member.
mcmbcr. This
Ibis
corresponds to the
thc reading
rcading that there
thcrc was only one
onc crayon
crayon given
(in a collective
collcctivc way) to ti,e
Uic set
sct ofgir!s
ofgirls IJruce
nmcc knows.
Supposc now that 1)321132-1
adjudicates the two
Suppose
I is vague. and no processing
processing of it adjudic<ltes
readings
rci~dings(RI)
( I t l ) <Ind
and (R2)
(112) for a
o crayon.
rrnj,o~l.When
Whcn ti,e
tllc focus
f ~ r l l salgoritilm
algl)rithm runs after
aRcr 1)32-1
1132-1
is
is processed.
pr~rcsscd.a crayon will become
bccomc the
thc discourse
dicoursc focus.
fixus. but
hut its representation
rcprcscnlation either
cithcr
will not distinguish
between (l{1)
both as possible
possible
disting~lisll bctwccn
(1<1) and
and (R2)
(112) or will list botll
representations.
a
rcprcscntations.ll When
Wlicn the
thc pronoun
pronoun interpreter
i~itcrprclcr uses
llses iLS
ils rules
rules to choose
cb~~ose
ro-spccificr for
fi)iilir in
in 1)32-2.
1132-2, hotli
rci~dingsmust bc
a\,aili~hlc. The
'llic set
sct reading
rcading (RI)
(K1)
{'o-specifier
both readings
be available.
bc eliminated
climinatcd immediately
im~ncdiatcly because
bccausc of
o f syntactic
sylltac~icconstraints
colistraints that
Uiat rule out a
may be
co-spccilicr fi,r
fi1i il
ir with tbat
that reading.
rcading. so
st, (R2)
(112) is
is left.
Icft. forcing
forcing a reading
rcading for each girlas
{'o-specifier
girl as
the
prototypic girl. If
reading,
I f however.
howcvcr, a speaker
spcakcr accepts
accept? (R3)
(113) as an alternative
altcr~~ativc
rcading.
thc prou)typic
ti,en
botb one
Uicn 1)32
1132 is ambiguous
amhipuous since
sincc bc~th
onc crayon
crayon (R3)
(1<3) and
and a prototypic crayon (R2)
(112)
arc available
awilahlc to
10 focusing.
focusing.
To
sheh in
'1'0 determine
detcrminc ti,e
thc actor focus
f i ~ u sspecification
spccificati~lifor ~
in
r D32-2,
D32-2. the ambiguity
bctwccn Wendy
Wcndy and
and each girl must be
bc resolved
rcsolvcd since
sincc this is
is a case
casc of pOlcntiai
potcntiai
between
intcrpl-cution is
is three
thrcc ways ambiguous:
anihiguous: there
thcre is
is
ambiguily. Actually
Aculally the interpretation
actor <)mbiguilY.
Wendy,
each
girl
interpreted
as
a
set
as
in
(RI)
and
(RJ),
or
as
a
prototype
as
in
intcl.pretcd
sct in (111) and (R3).
in
Wcndy. rnch
(112). The
'l'hc set
sct reading
reading may
m.ly be c1imill<llcd
clilnin;~tedimmediately
immcdiatcly because
bcc;~~lsc
s11e
is singular. For
(R2).
she is
thc anaphor interpreter
intcrprctcr to choose
chtrosc between
bctwccn the remaining
remaining two, the
thc infcning
inferring
the
pnxcss must rule out one
onc of
o f the
tllc readings
rcadings and
and find the
thc other
othcr acceptable.
acccplablc. Many
process
hearers
choosc between
bctwccn Wendy
Wcndy and the
thc prototypic girl: in
in fact numerous
nulncrous
hcarcrs cannot
cannot choose
native
because neither
spcakcrs find
find D32
1132 odd. presumably because
ncithcr choice
choicc is particularly
native speakers
sensible
people do scnsihlc for them.
thcm. On the
thc whole,
whole. the focusing
focusing rulcs suggest
suggcst just what pcople
they
tl~cyrule otlt
out several
scvcral readings
rcadings (all of
o f which can apply after
aRcr D32-1)
1132-1) and U,en
U ~ c nthey
fail to choose given
givcn the discourse actor ambiguity.
Anothcr case
casc of
ofscmantic
in D32,
1132, is illustrated in
in
Another
semantic ambiguity, similar to the one in
1133 below.
D33
below.
J)33-j
buy a vegomatic,
1133-1 Sally wanted
wilntcd to huy
vcgomatic.
Shc had
hitd seen
sccn it advertised
advcrtiscd on TV.
I'V.
2 She
A l'cgomatic
may he
A
vrgo~~iaric
hc interpreted
intcrprctcd specifically,
spccifically. to
111 mean
mcan there
thcrc is
is one
onc particular
panicular
I.I. (kIth
Iknh (Martin
[Manin 19791
19191and
2nd Wehber
Wrhber (Chapler
[Chaplcr 6 or
or Ihis
lhis volume]
ralurnc] have asked
%Led whether
vhclher a sentence
xnlcncc which is
ambiuous
amblgunus between
bclween scveral
rrcral rcading.~
reading must be
be represented
rcprcenlcd hy several
avcral different
diffcrcnl structures.
nrucluw. one
onc forcach
for each
reading.
rcading Martin
hlanin proposes
proposa rcpn.'SCntations
rcpre\cnlalinns Ihal
lhal preserve
prcscrvc ambiguity
mhiguily until
until some
wlmc processor
processor demands a
refinement.
approoeh or
an al!C'rnrttivC'
still an
rrlincrncrlt Whether
Whclhrr this
Lhisapprmch
aran
allcrnnlivc rC'prC'scntation
rcprcwnlalion listing
lisling all readings
reading5 is best
b m is
isaill
open question.
qucslion.
Sidncr
Sidncr
312
312
I . 'lhc tcrms .'non-specific" and "spccilic" are tradi~ionalsemantic exprcsions which bcar no rclalion
I. The lerms
"non-specific"
and "speci/ic"A are
tradilionalrcading
semantic
which bear
no relalion terms
to "specify"
and "spccihcalion."
non-specific
of exprcs.~ions
o dog corresponds
in computational
to "specify"
and "specificalion."
A non-specific
rcadin~ of a dog corresponds in compulationallclTns
to a rcprcsenlalion
or an inslancc
orthc gcncric
dog. Lhal is. what is rcprcsenlcd is a dog which has Lhe
to a rerrcsenlalion
of anofinslance
of thedog
generic
L~, what is represented is a dog wh ich has the
charactcrislim
Ihc gcncric
- Lhaidog.is.thai
an animal
with four Icgs. a rail. mcdium s i ~ c .brown.
characterislics
ofbarks,
the generic
like.- thaI is. an animal wilh four legs, a tail. medium size. brown.
friendly,
and thedog
friendly. barks, and the like.
3D
spccification of "it" must bc gcncratcd using only part of thc phrasc froin D34-1.
specification of "if' must be generated using only part of the phrase from D34-1.
Ihis cxnmplc sccms problcmatic bccausc it placcs much wcight on the
lllis eX:lmple seems problematic because it places much weight on the
infcrring prnccss to dccidc that ccrlain rcadings arc odd. Howcvcr. this is likcly to
inferring process to decide that certain readings are odd. However, this is likely to
be just whcrc the wcight of tllc decision ougllt to bc; many nativc spcakcrs find
be just where the weight of the decision ought to be; many native speakers find
1134 slightly bizarrc bccausc thcir first rcading is that Sally had sccn a vcgomaric
D34 slightly bizarre because their first reading is that Sally had seen a vegomatic
with thc broken bladc advcrtiscd on 'I'V. In fact, it appcars that whcn a spccific
with Ule broken blade advertised on TV. In fact, it appears that when a specific
is introduccd, the spcakcr can dircct
indcfinitc noun phrasc such as a 11>~011ra/ic
indefinite noun phrase such as a l'egol!wlic is introduced, the speaker can direct
attention LO the non-spccific rcading morc rcadily with a plural pronoun, as shown
attention to the non-specific re<Jding more re;ldily with a plural pro'loun, as shown
bclow.
below.
(26) Sllc had seen thcrn advcrtiscd on TV.
(26) She had seell them advcrtised on TV.
'l'hc plural non-spccific rcading as in (26) hits bccn incorporatcd in thc pronoun
The plural non-specific re<luing as in (26) has been incorporated in the pronoun
intc~prctationrules. but Lhc gcncration of spccific;~tionsfor tllc singulars in more
interpretation rules. but Ole generation of specifications for the singulars in more
unusual cascs. such as 1134-2, has not: it is difficult to rccognizc that a
unusual cases. such as \)34-2, hilS not; it is difficult to recognize that a
spccification should bc gciicratcd whcn thc infcrring pnmss rejccts thc two
specification should he generated when tJle inferring process rejects the two
rcadings. tlowcvcr th;ti pcoplc f~ltcrin such cnscs.suggcsts that somc additional,
rcadings. However that people falter in such casessuggests that some additional,
possibly gcncral, problcin solving bcllavior is rclcvant to the propcr trcatmcnt of
possibly general, problem solving behavior is relevant to the proper treatment of
thcsc cascs.
th esc cases.
I:,xaniplcs such its 1134 arc pcrplcxing for anothcr reason; they arc cxamples of
Examples such as 1)34 arc perplexing for another reason; they arc examples of
what I will call. following IFahlman 19791. the "copy phctiomcnon". The
what I will call. following [Fahlman 1979]. the "copy phenomenon". The
ambig;ity ccnters around rhc fact that thcrc can bc many copics of an abstract
ambiguity centers around Ole fact that there can be many copies of an abstract
prototype. Autoinobilcs, computcr programs. airplane flights and money are
prototype. Automobiles, computer programs. airplane flights and money are
othcr common cascs of cntitics which exhibit the copy phcnomcnon. In 1135, the
other common cases of entities which exhibit lhe copy phenomenon. In ))35, the
intcrprctation of i/ dcpends on whcthcr the speaker is rcfcrring to a particular
interpretation of if depends on whether the speaker is referring to a particular
flight or thc normal Sunday flight, a copy of which occurred on /his Sunday.
flight or tl1e normal Sunday flight, a copy of which occurred on Ihis Sunday.
D35-1 TWA 384 was so bumpy this Sunday I almost got sick.
035-1 TWA 384 wa$ so bumpy this Sunday I almost got sick.
2 It usually is a very smooth flight.
2 It usually is a very smooth flight
Note that the i/ cannot co-specify with thc particular flight an this Sunday,
Note that lhe if cannot co-specify with the particular flight on Ihis Sunday.
Howcver, it is possiblc that b e spcakcr intcndcd TWA384 to rcfcr to a particular
However, it is possible lhat tl1e speaker intended TWA384 to refer to a particular
flight; if this is so,thc spcakcr mixcd thc spccific and non-spccific interpretations
flight; if tl1is is so, tl1e speaker mixed the specific and non-specific interpretations
for the co-spccif cr of if. just as in 1334-2.
for the co-specifier of il. just as in D342.
Anothcr characteristic usc of anaphora is the bound variable casc given by
Another characteristic use of anaphora is the bound variable Gase given by
t'artcc [Parlcc 1972. 19781. In 1136 bclow, him co-specifics with Archibald. while if
Partee [Partee 1972, 1978]. In D36 below, him co-specifics witl1 Archibald. while if
hinlselfwcrc uscd, it would constitute a rcflexivc usc of every man.
himselfwere used, it would constitute a reflexive use of every man.
1136-1 Archibald silt down on the floor.
D36-1 Archibald 54lt down on the floor.
2 bvcry man put a flowcr in front of him.
2 Every man put a flower in front of him.
In linguistic thcory, bound variables arc assumcd to be rcprcscnted in sentence
In linguistic lheory, bound variables arc assumed to be represented in sentence
semantics: when uscd in conjunction with syntactic disjoint rcfcrcnce rulcs,
semantics: when used in conjunction with syntactic disjoint reference rules,
pronouns within Ihc scopc of thc quantifier can bc disunguishcd from non-scoped
pronouns within the scope of the quantifier can be distinguished from non-scoped
oncs. Since the pronoun intcrprctct takes account of these conditions, it can easily
ones. Since the pronoun interpreter takes account of lhese conditions, it can easily
choose a propcr co-specifier for him in D36 in tcrms of the focus, but for himselfjt
choose a proper co-specifier for him in D36 in terms of the focus, but for himse/fit
Sidner
Sidncr
314
314
bill rccognisc the bound rclation to every man. I t is crucial to thcsc cases that part
will recognize the bound relation to every man. It is crucial to these cases that part
of thc rcprcscntntion of the intc~prctationof scntcncc phrascs is some scope of
of the representation of the interpretation of sentence phrases is some scope of
quantification, cspccially whcn it is unambiguous.
quantification, especially when it is unambiguous.
She had worn an old brown coat and a grecn scarf over her
She had
worn an old brown coat and a green scarf over her
head.
head.
I n this cxa~nplc,sllc co-specifics with the old lady discussed previously. If Potok
In this example, she co-specifics with the old lady discussed previously. If Potok.
liad told of a discussion hctwccn thc spcakcr and a fcmalc tcachcr, it would no
had told of a discussiol1 between the speaker and a female teacher. it would no
longcr bc possiblc 10 tell that she was co-spccisying with thc old woman. 'Ihe
longer be possible [0 tell that she was co-specifying with the old woman. The
rcading of slre as tcirchcr ~rlightbc a bit surprising bccausc what Ulc tcachcr is
reading Df she as tcacher might be a bit surprising because what the teacher is
wcaring was not relevant to thc previous conversation, but it certainly is not the
wearing was not relevant to the previous conversation, but it ccrUlinly is not the
I . "In ~e lkginning" by Chain] I'otok, page 212, chapter 4.I:awcett Publicalions.Inc. Conn. 1975.
I. "In \he IJcginning" by Chain] POlok, page 212, chapter 4. Fawcett Publications, Inc. Conn.1975.
315
casc that an inferring proccss would dccidc h a t tcuchcrs do nbt wcar old bmwn
case that an inferring process would decide that teachers do not wear old brown
coats and so forth.
coats and so forth.
'Thc slacked f ~ u constraint
s
is not shtcd dircctly within thc ftrusing
The stacked focus constraint is not stated directly within the focusing
algorithm. Rathcr it is implicit in its function. '171~following situation is rulcd out
algorithm. Rather it is implicit in its function. '111e fil110wing situation is ruled out
by Uic manncr in which a ficus is uscd: A pronoun cannot co-spccify with a
by the manner in which a focus is used: 1\ pronoun cannot co-specify with a
IS
a cLII.rcIit l i ~ u iss a11; I C C C P ~ ; I ~CO-spccificr,
~C
sincc that current
str~ckcd~ ~ X T Lwlic~l
stJcked focus when a curren! focus is an acceptable co-specifier, since that current
f t ~ i i swill bc take11 as thc inlcrprct:rlion, and tllc slilckcd focus will ncvcr come
focus will be taken as the interpretation, and the stacked focus will never come
into considcratioo. 'lllc stl~ckcd ftxus constraint is a conscqucncc of the
into consideration. 'Ille stacked focus constraint is a consequence of the
Inovcmcnt of fixus in thc fwus machincry.
movement of )(US in the focus machinery.
'I'hc stackcd firus constriiint. howcvcr, may bc twcrriddcn. An astonishing sct
The stacked fucus constraint, however, may be llverridden. An astonishing set
19741. [I)ci~tsch19751). Onc
of cxamplcs was idcntificd by CJrosr (SCC [I)CIIISC~
of examples was identified by GroSI (sec [Deulsch ] 974], [DeutSlh 1975]). One
such cxamplc was givcn previously (1128). and ;cnothcr is sllown bclow.
such eX<lmple was given previously (1)28), and another is shown below.
A: 011cof thc bolts is stuck and I'm trying to usc both h c plicrs
1\: One ill the bolts is stuck and I'm trying to usc both the pliers
Sidner
Sidner
316
316
or somc other wcll spccificd structure which guidcs the hcarcr in understanding.
or some other well specified structure which guides the hearer in understanding.
Without this stnlcturc rhc l~carcrhas no mcans for choosing somctllii~gotllcr than
Without this structure the hearer hils no means for choosing something other than
thc currcn t ftrus as co-specifier.
the current focus as co-specifier.
317
missing cil-spccificr, and this recognition forins tllc basis of thc condition to which
missing co-specifier, and this recognition forms the basis of lhe condition to which
the fi~usingalgorid~mrcsptmds.
the focusing algnritllm responds.
'Ihcrc arc many iiscs of pronouns wlicrc thc pronoun has n o co-spccificr in the
There arc many uses of pronouns where the pronoun has no co-specifier in lhe
prcccding discourse. whcrc tlic pronoun is not uscd to co-spccify forward. and
preceding discourse. where the pronoun is not used to co-specify forward. and
whcrc it is not uscd in conji~nc~ion
with sotnc action such as pointing. Onc sucl~
where it is not used in conjunction with some action such as pointing. One such
usc, pronutuls that spccify a gcncric from a non-spccific reading. has alrc;idy bcen
usc, pronouns t11<.lt specify a generic from a non-specific reading. has already been
considcrcd. fiuwcver. such a cast is disting~rislicdfrom tlic ones givcn bclow
considered. Huwever. slIch a case is distinguished from thc (Jncs given below
bcc;~uscthe pronoun is not uscd t o specify a gcncric which is gcncmlcd from the
because the pronoun is not used 10 specify a generic which is generated from the
f i ~ u s c dnoiln phrase: cithcr thc pronouns hclow spccify willlout a gcncrating
focllsed noun phrase: rather the pronouns hclow specify without a generating
p11r;lsc. 'I'hc cxa~nplcsgi\cn hclow ;ire ii-o~nscvcril wurccs: thc fist tllrcc are
phrase. The examples giYen bdow arc from se\er<.ll sources: lhe first t1lree are
from I1'ost;il 19691. dic iilurth fron~[Cllafc 19721. thc fifth from diiilogucs collcctcd
from 11'05tall%9). the lilUrth from [Chafe 19721. the fifth from dialogues collected
for tlic Phl. sysrcnl ([Sidncr 10781). and the last was spokefi by a lccturcr at a
for the P/\ I. systell] ([Sidner 1978]). and lhc last was spoken by a lecturer at a
prc~cntnt~on
this autllor i~ttcndcd.
presentation this author attended.
1137-1 1 s,~whlr. Smiih tlic othcr day; you know. & dicd last ycar.
I)371 I SdW Mr. Smith the other day; you know. she died last year.
2 John is ;in olpti;~n.IIc ~nisscsIhcm vcry 111llch.
2 John is an orphan. He misses them very Illuch.
3 1'1-0-Castro pcoplc clon'~bclicvc hc is a monstcr.
3 Pro-Castro people don't believe he is a monster.
4 1 wcnt lo a conccr~liist 11igIlt. 'I'hcv playcd I~cctliovcn'sninlh.
4 I went to a concert last night. They played Beethoven's ninlh.
5 1 want to rncct wit11 llrucc ncxt wcck. Plcasc arrange for us.
5 1want to meet with Bruce next week. Please arrange .it for us.
6 1 ~ ~ s to
c dbc qi~itca tcnnis player. Now whcn I gct together
6 I- used to he quite a tennis player. Now when I get together
with ~ h young
c
guys lo play, I can hardly gct it ovcr the net.
with lhc young guys to play, I can hardly gel.i! over lhe net
With thc cxccption of 1137-1. most hcarcrs arc able to say which is the intcndcd
With lhe exception of 1)37-1. most hearers arc able to say which is the intended
spccificatifln of thc prulioun in thc cascs abovc; 1337-1 can bc understood if the
specification of the pronoun in lhe cases above: 1)37-1 can be understood if the
hcarcr is infonncd that Mr. Smith had a wifc. I ~ O H ' C Y Cs~o. ~ n eof thcsc, cspccially
hearer is infonned that Mr. Smilh had a wife. However. some of lhesc, especially
1 and 2, arc so odd that most hcarcrs rcad thc scntcncc scvcral timcs bcfore
1 and 2. are SlJ lJdd Ulal most hearers read lhe sentence several times before
comprchcnding. Hcarcrs arc dividcd on the acccptability of 3, and most hcarcrs
comprehending. Hearers are divided on lhe acceptability of 3, and most hearers
find 4 and 5 acccptahlc. Such examplcs, as far as I can tell, do not occur naturally
find 4 and 5 acceptahle. Such examples. as far as I can tclf, d~ not occur naturally
in writtcn samplcs.
in written samples.
WI-iilc thcsc uscs of pronouns can bc rccognizcd by the nllc intcrprctcr, how
While lhese uses of pronouns can be recognized by lhe mle inlcrpreter, how
thcy arc undcrslood rcrnains a mysrcry. IWcbbcr 1978a. 1978b1 provides some
they arc understood remains a mystery. [Webber 1978a. 1978bl provides some
additional constrainrs on Lhcir usc. t.Iowewr, thc ftxusing approach provides
additional conslraints on their usc. However, U1C focusing approach provides
somc basic struclurc h a t hclp to providc ;In cxplanation. In all thc rnul!i-scntcnce
SlJIne basic structure lhat help to provide an explanation. In all the multi-sentence
cascs. ~ h pronoun
c
specifics sonicthing which is cloxly asst~iatcdailh Lhc fixus.
c;lses. the pronoun specifics something which is closely aSSllCiated with lhe focus.
More cxplvna(ion is rcquircd sil~ccspcakcts do nu( i~ndcrstiindthose pronoun
More explanation is required since speakers do not lInderst;1lld lhose pronoun
iisos which sccm to hc rclatcd to thc non-cxisrcncc of an objcct (such as John's
uses which seem to he rclat('d to the non-existence of an ohject (such as John's
piircnts in light of Jol~n's~ ~ r l ~ h s n l ~ o oWh;~lcvcr
d).
thc Inanncr in which hcarcrs
parenl" in light of John's orph;mhoodl. Whatever the manner in which hearers
rccovcr sl~ccilicationsfor such prt~nouns.somc principles arc nccdcd that govern
recover spccificalilJns for such pronouns, some principles arc needed that govern
why somc pronouns arc acccplahlc and others arc not.
why some pronouns arc acceptable and others arc not.
Sidner
318
..
1. In certain caws a special audicncc may have dilfercnl rcsponscs lo thc parallelism above. I:or
1. In ccrtain
cases
a special
audience
mayflowers
have different
re;ponscs
to bchare
the parallelism
abovc.
For special
cramplc.
bohnists
who
know whal
are ncar olhcrs
might
dilrcrenuy.
Ilul crcn
e~ample, botanists who know what nowers are ncar others might beha~e dilferenlly. But cI'en special
audicnccs must somctimcs t~scgcncral cechn~qucs. Such is Ihc caw in thc 1139 cramplc. -use
usc general
audiences
must sometimcs
Whitierlcars
are imaginary
flora. tcchnlques. Such is thc case in the D39 e~ample. because
Whitierlcars are imaginary flora.
319
semantics
scmantics of most
mas1 and mine.
mitie. After
Aftcr D40-1,
I>l0-1. the
thc initial focus
kxus is the car radiator (that
is, the
thc prototypic car radiator).
radiator). Using the
thc focusing
fixusing niles,
ntlcs, it will be taken
takcn to
co-specify
co-specify with that radiator. But
I3ut this prediction is incorrect;
incorrect; itil co-specifics
co-spccifics with
tl,e
tlic radiator of the
thc speaker's
spcakcr's car.t
car.'
frce bolt hook_
hook.
1140-1On most cars the radiator has a free
D40-1
2 But
l3ut on mine.
minc. it has
hi~sa floating
floating bolt hook.
The
'lhc usc of it11 here
licrc is simi1.lf
similar to tnc
ttic instance
insolncc of a generic
gcncric for itif in the example
example of
chc vcgomatic
vcgomatic with
will1 the
tlic broken
brokcn cuuing
cutting blade.
bladc. What makes
makcs it different
diffcrcnt is that
the
))40-2
semantic fonn which is similar to ))40-1.
1110-2 has an
;in underlying
undcrlyingscma~itic
1140-1. D40-1
D40-1 specifics
specifics
a universal set
sct of cars and says something about one
onc of the
tlic parts for those cars;
can;
1)40-2
1x0-2 specifics
spccilics a set
sct of one thing. the
thc speaker"s
spc;~kcr'scar. and says
says something
sotnctliing about a
part
p;irt of it;
it; the
thc speaker's
spcakcr's car is related
rclatcd to the
ihc universal
univcrs;~lcar by instantiation.
instanliation. Thus
'Thus itif
in 1)40-2
1140-2 is not pointing to some instance of the
thc prototypic radiator; it co-specifics
with the
Lhc radiator of the
thc speaker's
cpcakcr's car,
car. but the
thc co-specification seems
sccms to come
comc about
partly due to tl,e
Uic representation
rcprcccntation of tl,e
die radiator
radiaror in 1)40-1.
1x0-1. The similarity in the
underlying semantics
scmantics of ))40-1
1140-1 and ))40-2
1x0-2 must bc
uscd in interpreting
intcrprcting the
be used
pronoun.
Onc might wish to construct some
solnc special
spccial purpose
purpasc mechanism that looks
looks for
One
similarities
siniili~ritiesin structure between
bctwccn two sentences.
scntcnccs. This
'This method
mcthod is
js doomed
doomcd for two
reasons.
parallelism exists
rcasons. First,
firs^ parnllclistn
exicts in many aspects
aspccts of language,
languagc, and it happens
happcns at
arbitrary levels
lcvcls of stnlcture.
stntcturc. Second.
Sccond. at any given
givcn level.
lcvcl. tl,e
~Iicproblem of
arhitrary
modcls of language since
recognition of parallelism has plagilcd
plagued computational models
such models
firs suggested.
suggcstcd. For example,
cxamplc, parsing of English sentences
scntenccs
modcls were
wcrc first
containing conjunction is as yet
problem. Methods
yct an unsolved problcm.
Mcthods tried
tricd for parsing
cnnjunctions,
thosc of [Woods
[Woods 1973]
19731 in LUNAR, fail
fail because of
conjunctions, such as those
ovcrgcncralization.
I~ccognition
of
parallelism
is
still
beyond
computational
overgncralization. Recognition
stilt
theory.
The fact that interpretation
parallelism has failed for other aspects
intcrprctation of parallclis~n
aspects of
cotnputational models
modcls of language only indieates
indicates that the
thc problem is a deep one.
computational
An extended
be shown here.
cxamplc in [Sidoer
[Sidncr 1979],
19791, which will not bc
hcrc, indicates
cxtcndcd example
tllat
be found bctwccn
between whole
dial parallel constructions may he
wholc paragraphs in a discourse.
discourse.
Such constructions affect
affcct the
thc interpretation
intcrprctation which speakers
spcakcrs and hearers
licarcrs choose
choose for
anaphors;
parallel stnleture
between two
anaphors: hearers
hcarcrs seem
sccm to L1ke
takc advantage of the
thc parallcl
structure bctwccn
paragr;lphs
paragrz~phsof a discourse
discourse in deciding
dcciding what was meant.
One
possible consequence
conscqucticc of these
lhcsc observations
obscrvations could be that the focus
focus
Onc possiblc
shouldbc
yct unspecified
unspccificd mechanism
algorithm should.
algoritllm
be abandoned in favor of some as yet
1.
example.
1. The author
author thanks
Ulanks R.C.
R.C. Moore
Mmrc for
for suggesting this
thisexample.
Sidner
320
In an earlier section. it was pointed out that sometimes speakers discuss several
conccpts at oncc without indicating that thcy arc doing SO. Gcncrillly this causes
concepts at once without indicating that they arc doing so. Generally this causes
thc discoursc to bc confusing cnougll to prcvcnt thc hcarcr fiom understanding.
the discourse to be confusing enough to pre\'ent the hearer from understanding.
Somctimcs howcucr pcoplc discuss morc than onc Lhing wiihour confusion. How
Sometimes however people discuss more than one thing without confusion, How
is that possiblc? Olic such case has already hccn prcscntcd; in t l ~ clast scction, it
is that possible? One such case has already been presented; in the last section, it
was sl~ownth31 an actor ftcus may hc prcscrlt in many discoiirscs in addition to
was shown that an actor focus may be present in many discourses in addition to
thc discoursc f t ~ u s .It is also possible to havc co-nrcscnt discoursc foci within the
the discourse focus. It is also possible to have co-present discourse foci within the
discourse.
discourse.
In this scction 1 will first dcscribc in morc dctail rhc conccpt of co-prcscnt foci.
In this section I will first describe in more detail the concept of co-present foci.
'I'hcn I will illustratc hctw i~ can cxplain thc usc of /hi5 and f h a ~in discourse.
Then I will illustrate huw it can explain the usc of this and that in discourse.
t-inally I will discuss ~ h crulcs of intcrprctiition for thcsc anapllors and their
Finally I will discuss the rules of interpretation for these anaphors and their
rclation lo thc firusing ;~lgo~-ilIlrn.I will show thaL wlicn uscd togcthcr in a
relation (0 the focusing algorithm. I will show that, when used together in a
diwo~~rsc,
tlri\ ;rnd !ha, kccp ~ h clirus on t ~ ol~jccts
o
at oncc, whilc when uscd
disl:our~e, this and Ihat keep the !i.)Cus on two objects at once, while when used
scpi~ratcly,/hi.\gcncrally niovcs tlic f t n u ~and rhnr docs no[.
separately, this generally movcs the focus and that docs nol.
Just w h ~ tis mcanl by co-~rcscnt
Wlicn morc than onc clcmcnt is
Just what is meant by co-present fhci? When more than one clement is
introduced in a dixicoursc ;~ndci~chis discussed rclativc to thc othcr or rclativc to a
introduced in a dismUfsc and each is discussed relative to the other or relative to a
class in which both trcur, tfic discoursc is said to bc maintaining co-prcscnt foci.
class in which both occur, the discourse is said to he maintaining co-present foci.
An cxamplc will bc hclpful for i~ndcrstandinghow this bchavior occurs.
An example will be helpful for underst<lnding how this behavior occurs.
m?
321
1l5-1
115-1 I'm having
ha\.ing a party tomorrow night;
night:
likc ule
Uic one
onc II had last week.
wcck.
2 it will be like
3 That party was a big success
succcss
4 bccausc
because everyone
cvcryonc danced.
5 This one
better food.
onc will have
havc bcttcr
food.
6h I've
I'vc asked
askcd everyone
c\,cryonc to bring something
so~ncthingspecial.
spccial.
7 Want to come?
Two dilTerent
parties arc wlked
dircrcnt partics
talkcd abollt;
about; that is.
is, both of Ulem
U~cmarc in
ill focus.
fixus. To
indicatc that
th;~tthe
thc speaker
spcakcr wants to discuss both, thaI
rhar is introduced
inuoduccd to co-specify
co-spccify
indicate
with the
d ~ cone mentioned
mcntioncd second.
sccond. The
'l'hc second
sccond party
pirrty is used
uscd as a means
Incans for
c(lmparison
compi~risonto the
thc first:
first: hence
hcncc Ihis
111;s indicates
ind~catcsthe
dic main concern
conccrli of tile
UIC speaker
spcakcr while
thaI
sccondary concern.
conccm.
rhor a secondary
lhe sample
samplc case below, the
thc speaker
speaker would be pointing
pointingat
meh painting
painling as the
lhc senlence
renlenee is utlered.
ullemd.
I. In the
at each
(2Y
)m
ir a Van Gogh:
Gogh: !b.!! one is
isaa Renoir.
Kenoir.
(29)
This
painting i..
lZ [Halliday
58~59.
[Ilalliday and Hasan
llasan 19761.
19761. op. cit.,
cir.. pages 58-59
Sidncr
Sidncr
322
322
establish a focus on the party tolnorrow night according to thc ftrusing algorithm.
cstablish a focus on thc party tomorrow night according to the focusing algorithm.
Among thc potcntial foci of D5-2 is (he one I hod last week. 135-3 indicatcs that
Among tllC potential foci of 052 is the one I had last week. D5-3 indicates that
thc spcakcr wants to siiy morc about thc potcntial filcus whilc maintaining Bic first
tI1e speaker wants to say more about me potential fiJeus whilc maintaining tlic first
focus: this is acco~nplishcdby nlcans of using rhnr illstcad of rhc to co-spccify with
focus; tI1is is accomplished by means of using that instead of the to co-specify with
tire party last wcck. If [he had bccn uscd. it would causc thc hcarcr to s ~ ~ p p o s e
til\: party last weck. If the had bcen used, it would causc the hearer to suppose
initially in prvccssing that lllc spcakcr was talking about tlic upco~njngparty; thcn
initially in proccssing that the speaker was talking about the upcoming PLlrty; then
tlic hcarcr would nccd to rcjcct thc choice bccausc of thc tcnsc of thc vcrb. Thafis
tile hearer would need to reject the choice because of the tense of the verb, ThaI is
a much clcarcr tncans of tclling thc hcarcr which one is undcr discussion.
a much clearer means of telling tile hearer which one is undcr discussion.
How is thc first f i ~ um;~intiiiicd
s
in thc f(rusing algorithm for 135? Whilc two
How is the first filCUS maintaincd in thc focusing algorithm for J)5? While two
discoursc f i ~could
i
he introdl~ccd.a sitnplcr clloice is available; thc first focus
discourse foci could he introduced, a simpler choice is available; thc first focus
could bc stackcd at thc top of thc. ftcus suck. Whcn a noun phnlsc wit11 [his as
could be stacked at lie top of the fllCus stlck. When a noun phrasc with {his as
dctermilicr is cncountcrcd. thc co-prcscnt fixus from thc stack could h e n be
determiner is encountered, We co-present focus from the stack could then be
choscn as the co-spccificr.
chosen as the co-specifier.
I h c kcrncl of thc rulc b r this and thnr is:
The kernel oftl1e rule for Ihis and that is:
(his i s a dctcrmincr uscd for main f t ~ u s that
.
is. [his + <noun
Ihis is a determiner used for main focus, wat is, {his + <noun
phrase> dctcrmincs I n i n ftcus, wliilc ~Aar (noun phrase>
phrase>
determineswith
main
focus, while
phrase>
co-spccifics
a potcntial
or oldthat
f ~ u+s (noun
.Howcrcr,
if thc focus
co-specifics
with
a
potential
or
old
focus.
However,
if
the
focusphrase
has bccn mcn~ioncdusing rhal, thcn a rlrisdcfi~iitcnoun
has been
mentiolled
then
a this definite noun phrase
must
co-spccifyusing
with thai,
an old
focus.
must co-specify with an old focus.
D5 is a casc whcrc ttic primary focus (tomorrow night's party) is stackcd, in favor
D5 is a case where the primary focus (tomorrow night's party) is stacked, in favor
of a discussion of a sccond clcmcnt in thc discoursc. signalled by t l ~ cuse of rhal. It
of a discussion of a second clemcnt in the discourse, signalled by tile use of (hal. It
moues into focus, and w h c ~thc~ first party is discussed again, lhis must bc uscd.
movcs inw focus, and when the first party is discussed again, this must be used.
An cxamplc of the nonnal n ~ l cinstantiation is givcn below in D41. First Hilda's
An example of the nonnal mle instantiation is given belllw in D41. First Hilda's
plan is in focus and thcn thc speaker's own plan. 'I'hcrcaftcr Hilda's plan is talked
plan is in focus and then tile speaker's own plan. Thereafter Hilda's plan is talked
aboutmsing that.
about'using that.
D41-B: What arc the plans for thc banana raid?
041 B: What are the plans for the banana raid?
A: According to Hilda's plan, you and I stay hcre until
A: According
plan, you
I stay
untilI think
evcryonc to
clscHilda's
is in position.
1 don'tandmuch
like ithere
bccause
everyone else is in position. I don't much like it because I tI1ink
wc'll miss all thc action. I tl~inkI'vc got a bcttcr plan: we'll be
we'll miss
all the party,
action.and
I think
I've got a better plan: we'll be
Eloise and Hilda thc scarch party. With
thc guidc
We guide party, and Eloise and Hilda the search party. With
this plan, wc'll bc in o n thc action. Wcll, what d o you think,
wis plan, we'll be in on the action, Well, what do you think,
isn't il a bctlcr p l ~ nthan that onc?
isn'[ it a better plan Wan tllat one?
'I'hc rule for rhisand 01ar rcflccts thc locus of h c spcakcr's conccrn. In 115 the
Thc rule for this and that reflects tI1e locus oflle speaker's concern. In))5 me
first thing introduced is t l ~ cchicf conccrn. whilc in 1M1,A indicatcs conccrn with
first Wing introduced is the chief concern, while in ))41, A indicates concern with
hcr own plan rathcr than Hilda's. In gcncral, when thc spcakcr uses hat as
her own plan rather Wan Hilda's. In gcneral, when the speaker uses that as
dctcrmincr for a dcfinitc anaphor that co-specifics with the f t ~ u s Ihe
, spcaker is
detennincr for a dcfinitc anaphor tI1at co-specifics with the f(Jeus, the speaker is
indicating that chicf conccrn lies with anOthcr clcmcnt onc that has prcviously
indicating fuat chief concern lies witl1 anoller clement, one that has previously
323
bccn in ftxus, but that has bccn "put on hold" until thc spcakcr finishcs with the
been in focus, but that has been "put on hold" until the speaker finishes with the
clcrncnt mcntioncd by thc rhnl phrase.
element mentioned by the that phrase.
'Ihe abovc explanation is incomplete. Spcakcr conccrn functions slightly
The above explanation is incomplete. Speaker concern functions slightly
differently whcn thc spciikcr and hcarcr arc not focuscd on thc same elcmc~~ts.
differently when the speaker and hearer arc not focused on the same elements,
and cspccially whcn r h i . ~and /hat co-spccify clcmcnts of diffcrcnt typcs. In DS
and especially when this and thaI co-specify elements of different types. In D5
and 1341, spcakcr and hcarcr arc ft~uscdon what thc spcakcr makcs lhc focus of
and \)41, speaker and hearer arc foc~sed on what the speaker makes tJle focus of
thc discnursc. Ilowcvcr, in somc dialogues, the spcz~kcrand hcarcr do not always
the discourse. However, in some dblogues, the speaker and hearer do not always
sharc focuscd itcms. Considcr 1142 bclow.
share focllsed items. Consider \)42 below.
1342-1 A: 1.ct.s flip a coin i~ndscc whocalls it.
1)42-1 1\.: l.et's nip a coin and sec who calls it.
2 11: Hcads.
2 B: Heads.
3 A: 'I'hat's what it is. ( * 'I'his is what it is.)
3 1\.: That's what it is. (. This is what it is.)
'I'hc f(xus of Uiis dialoguc is the coin which is hcing flipped. Il has a sccond
The focus of this dblogue is the coin which is being flipped. n has a second
focus which is thc rcsull of thc loss. Wlicn A speaks of 13's ftcus, A uscs that to
focus which is the resull of me toss. When I\. speaks of Irs focus, I\. uses thai to
rcfcr to it: [hir cannot bc so uscd. Whcn scvcral cxamplcs arc considcrcd, the
refer to it this cannot be so used. When several examples arc considered, the
propcr fo~mulationof thc rulc bccotncs clear: whcn spcakcr and hearer (as a
proper fOlmulation of We rule becomes clear: when speaker and hearer (as a
sccond spcakcr) havc diffcrcnt f(rus, usc rhar as Ihc dctcrrnitlcr of a dcfinitc noun
second speaker) have different focus, usc Ihat as tJle detennincr of a definite noun
plirasc that co-spccitics thc hcascr's focus, and use /his for Lhc spcakcr's focus.
phrase that co-specities the hearer's focus, and usc Ihis for tJle speaker's focus.
That in IW2 is used non-co-prcscntly. Non-co-prescnt 11stsof [his and rho/ are
ThaI in 1)42 is lIsed non-co-presently. Non-co-prcsent uses of this and Ihal are
thosc whcrc thc iinaphors spccify discoursc clcmcnts that rcprcscnt t w o different
liose where lie anaphors specify discourse elements liat represent two different
In
typcs 'of objcas and whcrc thcre is only one ftrus pcr spcakcr.
types' of objects and where there is only one focus per speaker.
In
nun-co-prcscncc, this co-specifics an clcincn[ which bccotncs h c fmus whilc rhar
non-co-presence, this cO'specifics an clement which becomes lie focus while that
co-specifics somc othcr discoursc elcmcnt which stays out of focus in spitc of the
co-specifics some other discourse element, which SL.1yS out of focus in spite of the
anaplioric tcnn. Sincc non-co-prcscnt uscs conccrn focus movcment, let us
anaphoric tenn. Since non-co'present uses concern focus movement, let us
discovcr just how thcy bchavc in discourse.
discover just how they behave in discourse.
5.4.3 This and Thor in Focus hlovcment
5.4.3 This and That in Focus Movement
When considering thc bchavior of this in discount, onc may observe that a this
When considering the behavior of this in discourse, onc may observe that a this
dcfinite noun phrasc movcs thc focus to whatcvcr is spccificd by the hcad noun of
definite noun phrase moves the focus to whatever is specified by the head noun of
thc dctinitc noun phrasc. As the rulcs in the discussion of pronouns indicate,
the definite noun phrase. As the rules in the discussion of pronouns indicate,
usually thc focus mows to Ihc Icading potential firus in thc potcntial focus list
usually Ule focus moves to the leading potential focus in tJle potential focus list
Yct somctitncs thc f t ~ u smovcs to thc cntirc tlcscription givcn by thc prcvious
Yet sometimes the focu~ moves to the entire description given by the previous
scntcnccs: somctimcs, surprisingly. thc fixus docs not rc;rlly mnvc in Ihc scnsc
sentences; sometimes. surprisingly, lie filCUS does not really move in fue sense
that a ncw clcmcnt is co-spccificd: thc samc clcmcnt is spccificd but from a
that a new clement is co-specified; the same clement is specified but from a
diffcrcn
t pcrspcctivc.
different
perspective.
Thir
dcfinitc anaphora arc uscd in four ways dcpcnding upon the typc of noun
This definite anaphora arc used in four ways depending upon the type of noun
phrasc hcads that thcy and thc f t ~ u contain.
s
'Ihc cascs arc cnumcratcd bclow
phrase heads that they and the f(lCUS contain. 'Ibe cases are enumerated below
with
sample
illustrative
tcxls.
with sample illustrative texts.
324
Sidncr
Thc focus
focus and rhi~dcfinitc
nouhs: thc
a,-specifier
1. The
lhis definite anaphor have thc
the same head nouns:
the co-specifier
Rlr
mcmbcr of the
dlc potcnlial
Rrus list and can be
bc chosen just as
for thc
the anaphor is a member
potential focus
co-spccificrsarc; move the
t l ~ cfocus
focus to co-specifier.
pronoun co-specifiers
co-spccificr. 1
1M-17hc
nln for a long distance, sending
scnding off
offseveral
D4-11bc axon niuy
may nlO
several
sidcbro~lchrs
sidebranchfs along thc
the way.
bcforc it terminates
tcl-~ninatcsin an cvcn
nctwork of tilarner-ls,
Filamccts, the
2 before
even finer
finer network.
rrni~i~arl
urbor. (FOCUS:
(FOCUS: thc
ten"illal
arbllr.
d,e axon)
3 h1;ln's
sc\,cral feel,
fcct. from thc
column
Man's ~nc!Ilongest axon
~xon runs for several
the spinal cotumn
111
rnuxlcs that control
conlrol movements
mo\,cmcnts ofthe
ofthc toes.
to muscles
441n
In spite
spitc of its grcnt
m
ncrvc fibers,
fibers, is a pan
great Icngth,
length. ~
this axon.
like all.oerve
part
cell. (NEW
(NI:W FOCUS:
1:OCUS: Man's longest axon)
o f a single cell.
of
5 It is living matter.
2. The
'lhc focus
foclls and this
rhis definite
dcfinitc anaphor
aniiphor have
h;lvc different head
hcad nouns: the focus
focus
bc considered
considcrcd as a co-specifier
co-spccificr of thc
{his definite
dcfinitc an.aphor
anaphor bcfo~~.should b"
the this
before other
potcntinl
fcrus is an acceptable
;~cccptablcco-specifier.
co-specifier, thc
focus does
docs not mm'c.
move.
potential foci. If die
the focus
the focus
))43] Consider the roomful of
electronic equipment
makes up a
1143-1
ofclcctronic
cquipmcnt that m;ikcs
modem, high-spccd
high-speed digital computer.
modcni.
2 Rack after
;lftcr rack of transistors. diodes,
diodcs. magnetic
magnetic core memories.
nlcmories.
magnctic
mcmorics-magnetic film
film mcmories-3 all laced
togedler by an intricate system
laccd u~gcthcr
system ofwiriog
of wiring many miles in
length.
length.
4 llnaginc
& m,
every thing in it, shrunk to about the size
Imagine.l!J
Lillilll. and everything
of a cigarette
the room)
cigarcttc package. (FOCUS: thc
i?nan,clougl2ill.
& to a clever elcctrical
electrical
5 Now suppose we give 1h.b marvelous
en.gineer, a man working,
working, however.
however, not in our own midcenturv.
rnidccnturv.
cnginccr,
but about d,e
hut
thc year 1900.
1900.
6 Wc
We prcscnt
present our gift
dcmonstratc a few ofdle
of the remarkable feats
feats it can perform:
7 and demonstrate
perfonn:
several
scvcral hundred thousand additions in one second...
sccond...
8 We lc;~vc&
le,lve this tantalizing
tantalirine,-device with the suggestion
suggestion that he try
package...
to find out what's inside the cigarette package
...
1
head. Choose
the co-specifier
3. This
Tliis definite
dcfinitc anaphor has an empty
cmpry hcad.
Choosc thc
co-spccificr from the
potential focus list.
list but order
ordcr the verb phrasc
clioicc. Move the
phrase predication as first choice.
325
focus to the co-s~ccificr.~
focus to the co-specifier. 2
1144-1 Sincc howcvcr, thc intcrprctation has bccn put forward as a
D44-1 Since however, the interpretation has been put forward as a
hypothcsis,
hypothesis,
2 somc wcight will bc addcd to it
2 some weight will be added to it
3 if it can bc shown to havc an antcccdcnt probability. (FOCUS:
3 if it can be shown to have an antecedent probability. (FOCUS:
the intcrprclation)
the interpretation)
4'l&
is what I shall cndcavnr to do in the remaining pages.
4 This is what I shall endeavor to do in the remaining pages.
(NEW IzOCUS: show rhat thc intcrprctation has antcccdcnt
(NEW FOCUS: show that the intcrpretation has antecedent
probability)
probability)
,
4. A this dcfinitc anaphor occurs inside of a qt~antificdplirase. 'Ihc /his dcfinitc
4. A this definite anaphor occurs inside of a quantified phrase. The this definite
anaphor takcs its co-spccificr from thc quantificd variable; such cascs arc similar
anaphor takes its co-specifier from the quantified variable; such cases arc similar
to thc bound vari;tbIc pronouns discusscd in ~3 a r l i c r .I~h c focus docs not move.
to the bound variable pronouns diSCllssed in earlier. The focus docs 110t move.
(In thc cxarnplc4bclow. Lhc quantificd phriisc and fhisanaphora arc undcrlincd.)
(In the example4 below. the quantified phrase and Ihis anaphora arc underlined.)
1M5-1Wc can. thcrcforc. asscxiate wit11 & aoint ncar the earth a
])45-1 We can. therefore. associate with
Illiin.t ncar Lhe earth a
vector g whicli is lhc accclcration that a body would cxpcrience
vector g whicll is the acceleration that a body would experience
if it wcrc rclcascd at this aoint.
if it were released at this 1lQiul.
2 We call g thc gru~sfario~mlfilr/sfrrt~g/lr
at the poinr in question.
2 We call g the gravitational field slrcng/h at the point in question.
Why arc this anaphora prtrvidcd as a signalling hchavior whcn if and definite
Why arc Ihis anaphora provided as a signalling behavior when it and definite
anaphora using /ha arc availablc? 'Ihc cascs citcd abovc pc~micLhc conclusion
anaphora using Ih(' arc available? The cases cited above permit the conclusion
h i t the spcakcr nccds a way to signal fcxus mobcmcnt whcre it and the anaphora
that Lhe speaker needs a way to signal focus movement where il and Ihe anaphora
would kccp thc focus on an cxisting discourse clcmcnt. Somctimcs the speaker
would keep the focus on an existing discourse clement. Sometimes Lhe speaker
also nccds a way to signal a ~ I C Wview of the focus (case 2); in such cascs it could
also needs a way to signal a new view of the focus (case 2); in such cases il could
not. provide this signal, and rhe noun phrascs arc too easily takcn to be ncw items
not provide this signal, and the noun phrases arc too easily taken to be new items
in the discourse, rathcr Lhan the focus from a diffcrcnt description. Only in
in the discourse, rather Lhan the focus from a different description. Only in
quantificd phrascs can rllis i~napliorabc used without moving focus; as thcse cases
quantified phrases can this an<Jphora be used without moving focus; as these cases
arc wcll marked by thc quantifier, the hcarcr can distinguish them as a spccial
are well marked by the quantifier. the hearer can distinguish them as a special
case.
case.
In contrast to /his, llrat uscd non-co-prcscntly singles out an clcmcnt of the text
In contrast to this, lhat used non-co-presently singles out an clement of the text
for rc-mention without causing a focus movcmcnt. Thc focus may move later, but
for re-mention without causing a focus movement. The focus may move later, but
another anaphor must causc thc move.
another anaphor must Cause the move.
Ihcre arc two kinds of nun-co-prcscnt rkar, which will bc called new mention
ll1ere arc two kinds of non"co-present that. which will be called new mention
rum
2. This cxamplc is from Ihomzi A. Goudge. [Goudge 19691 7ke lhoughr ojCS. Peirce. Dover
2. This Publications.
example is from
A. Goudge. [Goudge 1969) The Thought of c.s. Peirce. Dover
Inc., Ihomas
New York,
page 326.
Publi~lions.lne., New York, page 326.
3. Quanlificd phrase patterns also use that in a similar way.
3. Quantified
phrase
patterns
also and
use David
rhat in Ilalliday
a similar [Rcsnik
way. and Ilalliday 19661. Physics: ParlI. John Wiley
4 1:rorn
Kcwick.
K&ca
4. Fromand
Resnick. Roben and David lIalliday (ResniK and Halliday 1%6]. Physics: Part I. John Wiley
Sons. lnc. Ncw York. page 405.
and Son.... Inc. New York, page 405.
Sidner
Sidncr
326
326
& and prcvious mention tho[. New rncntion thar dcscribcs an clerncnt which
J.h.m, New mention that describes an clement which
has not bccn mcntioncd previously in fhc tcxt. It signals a new discoursc element
has not been mentioned previously in the text. It signals a new discourse element
and can bc uscd without confusion as long as no other dcfinitc noun phrases with
and can be used without confusion as long as no other definite noun phrases with
thc samc noun hcad as thc l h a ~phrasc cxist in the discoursc. Two examples of
the same noun head as the lhal phrase exist in the discourse. Two examples of
ncv rncntion tho1 arc givcn below.'
new mention that arc given below. 1
1346-1 l h i s is a coursc in biology.
))46-1 This is a course in biology.
2 Iiiology studics thosc crititics that arc callcd or~anisms:mcn,
2 Biology studies those entities that are called organisms: men,
worms, ycast cclls. hactcrial cclls arc organisms.
worms, yeast cells. bacterial cells arc organisms.
3 Some organisms arc unicellular,
3 Some organisms arc unicellular,
4 somc ilrc multi-cellular.
4 some arc multi-cellular.
1347-1 In Marigold's gardcn, roscs grow cvcry whcre.
))47-1 In Marigold's garden, roses grow everywhere.
2 Shc likcs roscs of thc Fastcrn gorgc variety morc than thosc of
2 She likes roses of the Eastern gorge variety more than those Qf
thc Wcstcrn shore.
1h Western
3 so shcshore,
has a lo1 of thcm in hcr collcction.
3 so she has a lot of them in her collection.
4 'lhcy grow to priyc winning sl~apcsand sizes.
4 They grow to prile winning shapes and sizes.
A prcuious rncntion rhor phrasc takcs as its co-spccificr thc intcrprctation of
A previous mention tlwl phrase takes as its co-specifier the interprctation of
sornc pllrasc. mcnrioncd prcviorlsly in thc discoursc An cxamplc2 is givcn below.
somc phrase, mentioned previously in the discourse. An examrle2 is given below.
1148-1 If hlNhlS11 is rcfcrrcd to by I3 citl~cras "thc may& of San
D48-1 If MNMSD
referred
to by D either as "the mayor of San
Ilicgo"is or
"1)'s neighbor,"
Diego" or "I)"s neighbor,"
2 r h a l nodc 'MNMSIY rcprcscnis thc individual rcfcrrcd to.
2 then node 'MNMSD' represents the individual referred to.
3 'Illc problcrn is that only looking at that nodc providcs no
3 'llle problem
is that
only
looking atth"t
provides no
of thc
dil'fcrcnccs
in thcnode
two rcfcrcnccs t o MNMSD,
rcflcction
reflection of the differences in the two references to MNMSD,
4 cvcn though the surfacc IlEFNPs d o cxprcss lhis difference.
4 even though the surface DEFNPs do express this difference.
5 Ftrus spaces provide a mcans of rcprescnting this diffcrcnce.
S Focus spaces provide a means of representing this difference.
In 1348, that nod? co-specifics with thc nodc of D48-2. If thc rcst of the
In D48, that node co-specifics with the node of 048 2. If the rest of the
discoursc is ignored, 1M8-3 would havc bccn cqunlly acccptablc using this.
discourse is ignored, D48-3 would have been equally accept1blc using this.
liowcvcr, the author dtxls not want to focus on that node sincc in the next scntence
However, the author docs not want to focus on that node since in the next sentence
shc uses {his dflere~lceto focus.
she uses this dijJerence to focus.
'I'hc important question about prcvious mention thar is why it exists at all in
The important question about previous mention that is why it exists at all in
English. I t is clcar from 1)48 why tlris cannot bc uscd, but what about rhe or if? In
English. It is clear from 1)48 why this cannot be used, but what about the or if! In
thc cxarnplcs I havc found, it in place of thal is ambiguous in indicating what
the examples I have found, it in place of that is ambiguous in indicating what
objcct is bcing rcfcrrcd to. T ~ inP placc of that sccms to bc possible, but has a
object is being referred to. The in place of that seems to be possible, but has a
ccnain c f f i t . Supposc 1M8-3 wcre:
cenain effect. Suppose 1)48-3 were:
(30) 'l'hc prohlcm is that only looking at thc nodc providcs no
(30) The prohlcm is that only looking at the node provides no
rcflcction of thc dill'crcnccs in thc two rcfcrcnccs Lo MNMSI) ...
reflection of the dif1crences in the two references to MNMSD...
'Ihc LISC of thr forces a movcmcnt o f fmus from thc pcrson to thc nodc, whcn
'[he lise of the forces a movement of focus from the person to the node, when
I . 'thc first cxa~nplcwmcs trorn I.uria. S.E. [IL.uria 19751 Thir~ySixI.ecturer in Biologv, Cambridge:
I. The first
I.uria.
S.E.of[Luria
M ITexample
Prm, 3. comes
11 alsofrom
contains
a usc
dciflic1975]
this. ThirtySix Lecturer in Biology, Cambridge:
MIT Prcs.~,
It also
contains
a usc
this.
2. 3.I:rorn
[Cirma
19771.
pageof8deictie
2
2. From (Grosl. 19771. page 82
327
what thc author actually wants to turn hcr attcntion to is thc diffcrcnccs in thc two
what the author actually wants to turn her attention to is the differences in the two
rcfcrcnccs to thc person. In othcr words, an intcrvcning, and in this case
references to the person. In other words, an intervening, and in this case
unncccssary, focus movcmcnt occurs.' t~lcnccrhar scrvcs a uscful function in the
unnecessary, focus movement occurs.! Hence that serves a useful function in the
languagc; it allows the spcakcr re-mcntion discoursc clcmcnts without them
language: it allows the speaker re-mention discourse clements without them
becoming thc focus of thc spcakcr's (and thcrcforc thc hcarcr's) attcntion.
becoming the focus afthe speaker's (and therefore the hearer's) attention.
5.4.4 Using thc I~ocusRlovc~tlcntAlgorithm
5.4.4 Using thc Focus 1\ lovclllcnl Algorithm
'To concludc this scction ilnd illustrate how thc anaphor inlcrprctcr and the
To conclude this section and illustrate how the anaphor interpreter and the
ftrusing algorithm fitnction for /his and rhar anaphora, thcir bchavior will be
focusing algorithm function for this and that anaphora, their behavior wilt be
illustrated o n an cxamplc which uses !his and lhar non-co-prcscntly.
illustrated on an example which uses this and 'hat non-co-presently.
1149-1 Onc day Iiill's fathcr bougl~tIjill a ncw softball.
])49-1 One day Bill's fJlher bought Bill a new softball.
2 Ilill and his fricnds playcd with it daily.
2 Hill and his friends plJyed with it daily.
3 Not long aftcr Ilnrry was givcn a hardball by his uncle.
3 Notillng after Harry was given a hardball by his uncle.
4.l'his ball. allowing morc spccd and accuracy than Bill's, bccame
4 This ball. allowing more speed and accuracy than Ilill's, became
the boys' choicc for all thcir baseball games.
the boys' choice for all their baseball games.
5 That bo~hcrcdIlill's father
5Thal bothered Bill's father
6 bccatlsc hc didn't Iikc to scc I3ill ncglcct his toys.
6 bccause he didn't like to see Bill neglect his toys.
?bc cxpcctcd f i ~ u sof 1149 is a sofiball. I t is confirmed by the use of it in
Tbc expected focus of 1)49 is a softball. It is confirmed by the use of it in
1349-2. 1339-3 introduces a hardball, which is a potcntial Focus fi~rthc discourse.
])49-2. 049-3 introduces a hardball, which is a potential filCUS filr the discourse.
Sincc /his dcfinitc anaphom must use thc potcritial fccus list as a sourcc for
Since this definite anaphora must use the potential focus list as a source for
co-spccificrs (whcn no co-pi-cscnt that is prcscnt), and since thc source passes
co-specifiers (when no co-present that is present), and since the source passes
synlactic. semantic. and infcrcncc critcria. this boll in 1x94 is chosen to co-specify
syntactic, semantic, and inference criteria, this ball in D49-4 is chosen to co-specify
with the putcntial focus of a hardball. Whcn rlle focusing algorithm runs after
with the potential focus of a hardball. When the fllCusing algorithm runs after
D49-4, thc focus must move because of the use of this. In Lhc ncxt scntence a
D49-4, the focus must move because of the use of this. In the next sentence a
noun phrasc consisting only of rhar occurs. 'Thc potcntial focus list of the previous
noun phrase consisting only of that occurs. The potential focus list of the previous
sentcncc contains /he boys' choice, their baseball games and tllc predication
sentence contains the boys' choice, their baseball games and the predication
cxprcsscd by the vcrb phrase, l l l c rules for that prcdict that the last mcmber of
expressed by the verb phrase. 111e rules for that predict that the hist member of
hc potential ftxus list is thc co-specifier of thar, a predication, which is what,
the potential focus list is the co-specifier of that, a predication, which is what,
intuitively, rhar co-specificswith.
intuitively, that co-specifics with.
lhjs scctjon has introduced and dcvclopcd thc notion of co-prcscnce in
This seclion has introduced and developed the notion of co-presence in
discoursc, for undcrstanding thc use of {his and rh~rranaphora. Co-presence is a
discourse, for understanding the usc of this and that anaphora. Co-presence is a
mcans for talking about two or rnorc discoursc clcmcnts that are rclatcd to each
means for talking about two or more discourse clements that are related to each
othcr, Dccausc Ianguagc is spokcn in a lincar dimension, and pcrhaps bccause
other. 13ecausc language is spoken in a linear dimension. and perhaps because
1. There is anolher reason for using !hat. Ihe context which precedes the tcxt oTIM8 makes reference
1. Theretoisaanother
for using
that.
contc~t which precedes the tc~t orI)48 makes reference
Lgure inrca.~on
h e tcxt.
7b!
node The
doubles
as a dc~aicphraw. Ihis cnanlple wggcsls hat here is an
to a figure
in the te~l
That
node doubles
a dciClie
c~ample suggCl>1S that there is an
imporlant
relation
bclwecn
focusingasand
dcixis.phrase.
a matterlbis
demanding
hnher research.
important relation belween focusing and deixis. a matter demanding further research.
328
Sidner
pcoplc havc troublc paying closc attcn~ionto two things at oncc, it is not rcally
people have trouble paying close atlention to two things at once, it is not really
possible to focus on both clc~ncntssimultaneously. lnstcad, two clcments are set
possible to focus on both clements simultaneously. Instead, two elements are set
up for discussion and considcrcd in turn using thc normal Ftzusing process.
up for discussion and considered in turn using the normal focusing process.
Co-prescncc cascs arc wcll signallcd in languagc bcl~rtvior,pcrhaps to prcvcnl
Co-presence cases arc well signalled in language behavior, perhaps to prevent
cunfusion for hcarcr. Sincc hcarcrs arc somctimcs confuscd by single focus, it is
confusion for hearer. Since hearers arc sometimes confused by single focus, it is
not surprising that co-prcscnt ftri most bc signallcd clcarly cnough so that some
not surprising that co-present foci must be signalled clearly enouh so that some
of thc ptttcntial confi~sionis rcduccd. It may wcll bc that Lhc signalling is
of the potential confusion is reduced. It may well be that the signalling is
ncccssary for tlic spcakcr as wcll. to hclp kccp track of what IIC or shc is trying to
necessary for tile speaker as well, to help keep track of what he or she is trying to
say. 'lhis is mcrc speculation until focusing is applied to thc gcncnuion of
say. This is mere specu1<Jtioll until focusing is applied to the generation of
languagc. and a thcor): of its bchuvior is givcn. Somc rcscarch in this direction is
language, and a theory of its behavior is given. Some research in this direction is
discussed by h1cl)onald in this volumc.
discussed by McDonald in this volume.
In contrast to co-prcscnl f c ~LISC
i of his and rhclr noun phrascs, nun-co-prcscnt
In contrast to co-present foci usc of this and {har noun phrases, non-eo-present
uscs of thcm allow thc spc;~kerto indicntc whicll of all thc things she or hc has
uses of them allow the speaker to indicate which of all the things she or he has
mcntioncd is most i~npol-tant to thc discussion. This and rhnr uscd in
mentioned is most important to the discussion. This and Ihat used in
non-co-prcscncc allow thc spcakcr to point at thc rclcvant malcrial with the lcast
non-eo-presence allow the speaker to point at the relevant malerial with the least
confusion. Hcncc thc real diffcrcncc in h c s c uscs is a difTcrcncc in Ule spcakcr's
confusion. Hence the real difference in these uses is a difference in the speaker's
plans for, and thc hcarcr's mcans of dcciding. what will bc talkcd about.
plans for, and tile hearer's means of deciding, what will be talked about.
5.5 Conclusions
5.5 Conclusions
In this chaprcr thc conccpt of ftxus has bccn defined and thc rolc of focusing in
In tllis chapter tile concept of focus has been defined and the role of focusing in
understanding discoursc has bccn illustrated. '1'0 formalize and clarify this
understanding discourse has been illustrated. To fonnalize and clarify this
bchavior, I havc dcscribcd algorithms for finding thc focus and for moving the
behavior, ] have described algorithms for finding the focus and for moving the
focus as the discoursc progrcsscs. 'l'racking the movemcnt of thc f t m s includes a
focus as the discourse progresses. Tracking the movement. of tile focus includes a
means of distinguishing tllc prcscncc of morc than onc f t ~ u in
s thc discourse; the
means of distinguishing the presence of more than one focus in thc discourse; the
focusing algorithm tracks both thc discoursc ftrus. thc chicf ctcmcnt of discussion
focusing algoritllm tracks both the discourse focus, the chief clement of discussion
and the actor focus, thc cllicf actor in a portion of the discourse.
and the actor focus, tile chief actor in a portion of tllC discourse.
Focusing providcs thc foundation for a theory of anaphor intcrprctation. The
Focusing provides the foundation for a theory of anaphor interpretation. The
foci and aspccts of thc focitsing algorithm togcthcr with linguistic rulcs for disjoint
foci and aspects of the focllsing algorithm together with linguistic rules for disjoint
rcfcrencc and sclcctional restrictions, and with rcprcscntations of ncrwork
rcference and selectional restrictions, and with representations of network
relations and scntcntial scopc information provide an account of thc intcrprctation
relations and sentential scope information provide an account of tile interpretation
of many uscs of dcfinite anaphora. i3otb thc rulcs and rcprcscntations havc bccn
of many uses of definite anaphora. Both the rules and representations have been
shown to bc compatible with thc proccss of L~using,and ncccssary to focusing in
shown to be compatible with the process of focusing, and necessary to focusing in
providing information rclcvant to dctcrmining Lhc co-spccificr of a pronoun. In
providing in formation relevant to determining the co-specifier of a pronoun. In
most cascs, thc focus itsclf providcs a co-spccificr, and in somc cascs, a gcncrator
most cases, the focus itsclf provides a co-specifier, and in some cases, a generator
for thc spccification.
for the specification.
Focusing simplifies a crucial stcp of anaphor interpretation. In choosing a
Focusing simplifies a crucial step of anaphor interpretation. In choosing a
co-specification, i~lfcrcnccsabout knowlcdgc of thc evcryday world arc necdcd,
co-specification. infcrences about knowledge of the everyday world arc needed.
and focusing has contributed a mcans of controlling thc inferring process.
and focusing has contributed a means of controlling the inferring process.
329
Because
I%ccausctl,e
thc pronoun interpreter
intcrprctcr predicts a co-specifier
co-spccificr and then
thcn asks for
confinnation
based on the prcscncc
presence of contradictions in the
confirmation or rejection
rcjcction bascd
thc inferring
infcrring
process. tl,e
process is colli
rolled by the focus
In prcvious
previous
Uic inference
infcrcncc pnxcsc
conlrollcd
f i ~ u machinery.
machincry.
s
pnrcsc.
Al natural langungc systems
systcms interpretation
inlcrprclatioli resulted
rcsul~cdfrom
from binding of free
frcc variables
Ainallirallanguage
howcvcr, many inferences
infcrcnccs had to hc
thcn "undone"
during inferring: however,
be drawn and then
due
binding choices.
eliminates
duc to incorrect hinding
choiccs. Tbe
'l'hc focusing
focusing approach
appr~~acli
climinatcs this kind of
tiS
blind hinding and
arid unbinding
i~nl~inding
:IS well
wcll as shortening
shortc~lingtJ1C
thc inference
infcrcncc chain search.
sct~rch.
In this chapler
cliaplcr II hale
hi~vcabo
;II\o illustrated
illustr;~tcdthe value
valuc of concept of co-presence
co-prcscnce for
interpreting
and thaI.
i~i[crprctinglhc
~liclise
usc of this
rhisand
rhar. Co-presence
Co-prcscncc is the
thc means
mc;~nJby which
wliicli a speaker
spcakcr
C~1ll
be focused
and it has bccn
been shown
can bc
fi~cuscdon more
inorc than
d1;11ione
onc thing in a conversation.
convcrst~lio~i.
tIi;~trhi,
rhar used
uscd co-presently
c~vprcscntly<.t110,"",
allou, the
dic maintenance
rnaintcnancc of two foci,
foci, one of main
that
,his and thaI
concern
conccrn and the
thc other
(~lhcrof secondary
sccondnry concern
co~iccrnto the
thc speaKer.
spcakcr. This
This and that
rhar lIsed
used
nun-co-prcscntlg. that
Iht~tis.
ic, when
whcn only one
onc of the
thc two types
typcs of noun phrases is found,
found,
non-eo-presently,
also indicate
indicatc main
~iiainconcern
conccrn (this),
(rhi~),or secondary conccrn (thaI)
(rhar) relative
rclativc to some
other
otlicr focus.
fixus.
111e
by Grosz.
cascs described
dcscrihcd hy
Grosz, and the
thc need
nccd for parallelism
I h c focus
f(rus "popping" cases
undcrscorc the
thc role
n ~ l cof higher
liighcr discourse
discoursc structures
struclurcs in focus
ftrus intcllJrctation.
intclprct;~[ion. The
underscore
(iros?
Gros?. examples
cxnmplcs \"iobtc
viol;~tc the stacked
st;~ckcd theus
firus constraint. but arc comprehensible
comprchcnsible
knowledge of task
wsk structures.
structures. The
'lhc p.~rallclism
bccaosc
tlic speaker
spcakcr relics
rclics on knowledge
b(':caus~ the
parallelism
cxamplcs show that
tliat some allditional
addition;il structure
structul-c is also used
uscd in
ill understanding
undcrstanding
examples
dirourccs. While
\\'liilc ftx:us
firus popping makes
nit~kcsusc
use of the focus
firus algorithm.
;~lgoriUim.the parallel
discourses.
structure cases seem
sccni to rely
rcly on a mechanism
mcclianisn~which is dfrferent
dirfcrcnt in kind.
This ch<.lpter
chaptcr further
filrthcr specifics
spccilics the nature
naturc of focusing
firusing as
;IS it relates
rclalcs to a theory
thcory of
definite
dclinitc anaphor
anaplior illlerpretation.
intcrprctalion. /\
A focus-based
focus-b;scd theory
thcory with stipulations for syntax,
syntax.
infcrcntial knowledge.
knowlcdgc, pm\,idcs
cxplanatory theory
lhcory
semantics and inferential
provides a prcdictivc
predictive and explanatory
anaplior interpretation.
intcrprctatiun. The
'rlic theory
tha~ryis prcdictivc
stipulatcs legal
lcgal and
of anaphor
predictive bccausc
because it stipulates
illcgal pronoun uses
uscs as well
wcll as their
thcir interpretations:
intcrprctations: it is explanatory
cxplanatory bccausc
illegal
because it
hinges on the
thc focusing
focusing algorithm using <tl1aphora
anaphora as signals
signals of what is being
dirusscd. while
whilc the
thc syntactic,
syntactic, semantic
scmantic and inferential
infcrcnti31 knowledge
knowlcdgc used in
discussed.
interpreting
provide for changes
intcrprcting anaphora providc
changcs in the foci
foci of discourse,
discoursc, changes that
rcflcctcd in pronoun usc.
use.
arc reflected
Focusing
be a necessary part of the tlleory
thcory of the
thc pragmatics of
Fcrusing seems
sccms to hc
language.
bnguagc. In his well
wcll known William James
Jamcs lcclUres.
Icclurcs, [Grice
[Gricc 19751
19753 defined
dcfincd several
several
maxims
of conversation.
n~;~xims
convcrsatio~i,one of whieh
which was the
thc maxim of relevance.
rclcvancc. Grice
~ r i c 1c 'says
about this maxim:
Sidner
330
5.6 hcknowlcdgcmcnts
5.6 Acknowledgements
'I'hc rcscarch rcportcd in this paper was supported in part by dlc Advanced
The research reported in this paper was supported in part by the Advanced
licscarch Projccts Agency undcr contract No. N0014-77-C-0378. licscarch
Research Projects Agency under contract No. NOOI4-77C-0378. Research
rcportcd hcrc was also donc at thc Artificial Intclligcncc 1,aboratory of the
reported here was also done at the Artificial Intelligence Laboratory of the
~assachuscttsInstitute of 'l'cchnology. Support for thc hboratory's artificial
Massachusetts Institute of Technology. Support for the laboratory's artificial
intclligcncc rcscarch is provided in part by thc Advanccd IZcscarch Projects
intelligence research is provided in part by the Advanced Research Projects
Agcncy of the Dcpartmcnt of Dcfcnsc undcr ONli contract N0014-75-0643.
Agency of the Department of Defense under ONR contract N0014-75-0643.
CHAPTER 6
CHAPTER6
So What
W e Talk
About
Now?
So What
CanCan
We Talk
About
Now?
Bonnie
Lynn
Webber
Bonnie
Lynn
Webber
6.1 Introduction
6.1 Introduction
Wcbbcr
Webber
332
332
was only of intcrcst within thc singlc scntcncc and cvcn that, only with rcspcct to
was only of interest within the single sentence and even that, only with respect to
Icxical, syntactic and/or scmantic constraints on what a pronoun could be
lexical, syntactic and/or semantic constraints on what a pronoun could be
asstciatcd with. witliirr /hat SUI)IP sentence.
associated with, within Ihat same sentence.
1:ive ~ o l l ~is~clcarly
rs
not an iwdatcd cxamplc. Onc can quickly discovcr many
Fil'e Jollars is clearly not an isolated example. One can quickly discover many
instances wherc a singlc scntcncc admits a varicry of things avail:iblc for
instances where a single sentence admits a variety of things available for
suhscqucnt annphora. bascd on what sccms to bc thc samc phrasc. For cxamplc,
subsequent anaphora, based on what seems to be the same phrase. For example,
'I'hc I~liodcsianridgcback down thc block bit
The Rhodesian ridgeback down the block bit
me yestcrduy. Its rcally a vicious bcast
me yesterday. Its T('ally a vicious beast.
333
undcrstanding continuous discourse and thc phcnomcna common to it.
Surprisingly cnough, such proccdurcs wcrc not con?plctc fililurcs, and cven
Surprisingly enough. such procedures were not complete failures, and even
had modcst succcss. I.UNAII. for cxarnplc, was ablc to make use of its logical
had modest success. LUNAR. fllr example, was able to make usc of its logical
rncaning rcprcscntation to dcal with such ncrn-obvious anaphoric rcfcrcnces as
meaning represeJ1lation to deal with such non-obvious anaphoric references as
D3-1130 any salnplcs contain bis~nuthand rulhcnium?
03-1 Do any samples contain bismuth and ruthenium'!
2 YES
3 Givc me rheir ovcrall analyses.
3 Give me their overall analyses.
whcrc it corrcclly intcrprctcd "lhcir" in D3-3 to rcfcr to thc sct of samples which
where it correctly interpreted "their" in D3~3 to refer to the set of samples which
contain bismuth and ruthenium. In forming this sct dcscriplion -.i.c., "samples
contain bismuth and ruthenium. In forming this set description -' i.e., "samples
that contain bismuth and rulhcnium" - I.UNAR ignorcd its own answer to
that contain bismuth and ruthenium" - I.UNAR ignored its own answer to
qucstion D3-1. 'Ibus in cxamplc 1M. it would incorrcctiy proposc "samplcs that
question ))3- L 'Ibus in example D4, it would incorrectly propose "samples that
cantain bismuth and rutl~cnium"(mthcr than "samplcs") as thc intcrprctrrtion of
contain bismuth and ruthenium" (rather than "samples") as the interpretation of
"they".
"they".
D4-1130 any samples contain bismuth and ruthcnium?
04-1 Do any samples contain bismuth and ruthenium'!
2 NO
2ND
3 ?hcn what d o rheycontain?
3 '111en what do they contain?
So, to summarize thc thcn-current situation in linguistics and A1 natural
So, to summarize the then-current situation in linguistics and Al natural
languagc undcrstanding, thc formcr cithcr wcrcn't intcrcstcd in discoursr:
language understanding, the former either weren't interested in discourse
anaphora or tricd to handlc it by eithcr string or structure matching - clcarly
anaphora or tried to handle it by either string or structure matching - clearly
inadcquatc - whilc Lhc lattcr attcmptcd to dcal with it using whatcvcr ad hoc
inadequate - while the latter attempted to deal with it using
ad hoc
, whatever
rncthods could bc graftcd onto rcprcscntations primaril;
dcsigncd for othcr
2YES
Webbcr
Webber
334
purposes. What my rcscarch has bccn dircctcd at thcn is (1) a definition of what a
purposes. What my research has been directed at then is (1) a definition of what a
tcxt makcs availablc for anaphora that can acco~nmodatc~ h ckinds of cxarnples
text makes available for anaphora that can accommodate the kinds of examples
presented abovc and also bc amcnablc to computational trcatmcnt and (2) within
presented above and also be amenable to computational treatment and (2) within
that computational trcatmcnt, a characterization of fcaturcs of a rcprcscntational
that computational treatment, a characterization of features of a representational
formalism (or sct of rclatcd formalisms) Illat would most cficicntly support the
formalism (or set of related formalisms) that would most efficiently support the
proccdurcs.l 'Ihat is. I havc attcmptcd to articulatc what a tcxt rnakcs available
procedures.] That is. I have attempted to articulate what a text makes available
for annphora in tcrms of thc structurc (as opposcd to contcnt) of its scntenccs, as
for anaphora ill terms of the structure (as opposed to content) of its sentences, as
thcy arc rcprcscn~cd in such a fonnaIism. I.ikc any othcr structure-bascd
they arc represented in such a fonnalism. l.ike any other structure-based
undcrslanding stralcgy. this would have thc advantage of bcing colnrnon to all
understanding strategy. this would have the advantage of being common to all
uscrs of a l;uigi~agc.whcthcr thc contcnt wcrc complctcly u~ldcrstoodor not.
users of a language, whether the content were completely understood or not.
As for thc rcmnindcr of this papcr, thc first part is bascd on my thcsis rcscarch
I\s for the remainder of this paper, the first part is based on my thesis research
[Wcbbcr 1978a1, aidlough it has profitcd from rcccnt work wirh R. Ilobrow on a
[Webber 1978a], although it has profited from recent work with R. Bobrow on a
naturat Idngr~agc i1ltcrf;lcc wc call PSI-KI.ONF, for "l'arsing and Scrnantic
natural language interf:lce we call i'SI-KLQNr~ for "Parsing and Semantic
Intcrprctatioa in KI,-ONE" [Iiohrow and Wcbbcr 1980a. 1980b, 19811. I have
Interpretation in KLQNE" [Bobrow and Webber ]980a. 198Gb, 1981]. I have
a150 changed my tcminolcigy somc~)llat.to cmpllasizc thc commonality of this
aho changed my terminology somewhat, to emphasize the commonality of this
work with thc complcmcntary sct of issucs discussed by Sidncr (this volumc). 'Ihe
work with the complementary set of issues discussed by Sidncr (this volume). llte
sccond part o f thc papcr cont;tins an approach to "onc" anaphc~rathat diffcrs
second part of the paper contains an approach to "one" anaphora that differs
substantially from that prcscntcd in [Wcbbcr 1978aI. 'Ihis new approach has the
substantially from that presented in [Webner 1978a]. 'Iltis new approach has the
attractive fcaturc of reducing two scparatc difticult problems inlo h c same (albeit
auractive feature of reducing two separate difficult problems into the same (albeit
still difficult) one.
still difficult) one.
I'hc approach I have adoptcd to identifying what a tcxt makes availablc for the
llte approach] have adopted to identifying what a text makes available for the
in~crprctatioilof dcfinitc pronoun and "oncW-anaphorais bascd on the notion of a
interpretation of definite pronoun and "one"-anaphora is based on the notion of a
"discoursc model". I h c assumption is that onc objcctivc of discourse is to talk
"discourse model". 2 The assumption is that one objective of discourse is to talk
about somc situation or statc of thc rcal or some hypothetical world. 'To do this, a
about some situation or state of the real or some hypothetical world. To do this, a
spcnkcr must havc a mcntal modcl of that situation or state. Ibc ensuing discourse
speaker must have a mental model of that situation or state. The ensuing discourse
is thus. at one Icvcl, an attcrnpt by thc spcakcr to dircct thc listcncr in synthcsizing
is thus. at one level. an attempt by the speaker to direct the listener in synthesizing
a similar "discoursc modcl" and by that. acquirc information about the spcakcr's
a similar "discourse model" and by that. acquire information about the speaker's
situation or statc. (In this scnsc, I am equating "undcrstanding" with
situation or state. (In this sense, I am equating "understanding" with
"synthcsizing an appropriate rntdcl".)
"synthesizing an appropriate model".)
1. As [llankarncr and Sag 19761 point out, both definite pronoun and "one" anaphora may be
L As [Ilankamer
both definite
pronoun
and lhcy
"one"
anaphora "control"
may be by the
"controlled"and
by Sag
l h i n19761
g olhcrpoint
than out.
UIC prcvious
tcxt. In
panicular,
dcrnonstrale
"controlled"
by
Ihing,s
other
Ihan
Ihe
previous
text.
In
particular.
!hey
demonstrate
"control"
by the what
spatio-lcmporal conlcxl that spcakcr and listcner share. In Lhis papcr. I shall only be discussing
spatio-temporal
context
that speaker
share.make
In this
paper, I shall only be discu~;ng what
tcxts (and,
to a lim~ted
cxlcnt,and
whatlistener
infercncc)
available.
leKlS (and.2. toThis
a limited
extent, that
whathas
inference)
make available.
is a ndion
bcen cxplorcd
in cognitive psychology to a p l a i n h e infcrcnccs Ihat
2. This pcoplc
is a nolion
thaIunderstanding
has been explored
in Tor
cognitive
to explain
the inferences
draw in
t c x t See.
examplc.psychology
[Collins. Brown,
and Iarkin
19771. that
people draw in understanding texl See, for example, [Collins, Brown, and Larkin 1917].
335
Informally, a discoursc model (DM) may bc dcscribcd as thc sct of cntitics
Infonnally, a discourse model (DM) may be described as the scl of entities
"naturally cvokcd" (or in Sidncr's terms, "spccificd") by a discoursc, li'nked
"naturally evoked" (or in Sidner's terms, "specified") by a discourse, linked
togcthcr by thc rclalions they pilrticipatc in. I havc callcd thcsc things "discourse
together by the relations they participate in. I have called these things "discourse
cntitics", and Sidncr lias callcd thcm "cognitive clcrncnts". In linguistics, they
entities", and Sidner has called them "cognitive clements". In linguistics, they
harken back to what [Karttuncn 19761 has callcd "discoursc rcfcrcnts". 'lhe
harken back to what [Kamunen 1976) has called "discourse referents", The
sltcrnatc tcnninologics h a t Sidncr and I havc ndoptcd rcst on wanting to kcep
alternate terminologies that Sidner and I have adopted rest on wanting to keep
"rcfcr" a scparalc tcchniciil Icrin. That is. "rcfcrring" is what pcople do with
"refer" a separate technical term. That is, "referring" is what people do with
language. Ihoking and accessing discoursc cntitics arc what tcxts/discourscs do.
language. Evoking and accessing discourse entities arc what texts/discourses do.
A discoursc cntity inhabits a spcakcr's discoursc modcl and rcprcscnts sotncthing
1\ discourse entity inhabits a speaker's discourse model and represents something
thc spcakcr has rcfcrrcd to. A spcakcr refus to so~llcthingby uttcranccs that
the speaker has referred to. 1\ speaker refffs to something by utterances that
cithcr erloke (if first rcfcrc11cc) or nccess (if subscqucnt rcfcrcncc) its
either evoke (if first reference) or access (if subsequent reference) its
corresponding discoursc cntity.
corresponding discourse entity.
'1'0 illuslratc thc notion of cntidcs "naturally cvokcd" by a discoursc, considcr
To illustrate the notion of entities "naturally evoked" by a discourse, consider
the following scntcncc.
the following sentence.
115-1 Fach 3rd-gradegirl brought a brick to Wcndy's house.
1)5-1 beh 3rd-grade girl brought a brick to Wendy's house.
-lhcn cansidcr cach of thc f[)llowing continualiuns. In cach casc, I would label
Then consider each of the following continuations. In each case, I would label
what is HCCCSSC~by thc dcfinitc pronoun an cnlity "iiaturally cvokcd" by scntence
what is accessed by the definite pronoun an entity "naturally evoked" by sentence
115 -1.' As thc rcadcr can scc, such cntitics may havc dcscriprions appropriate'to
1)5 -1.] I\s the reader can sec, such entities may have descriptions appropriate'to
individuals, scts, stuff, cvcnts, activities, etc.
individuals, sets, stuff, events, activities, etc.
in thcsc conlinualions should be taken as indicatine the same urect Tor bolh
I. 'lhe svmbol
1. The svmbol
;:, in these continuations should be taken as ind icaling the same largel for both
expressions.
expressions.
Wcbbcr
336
Webber
336
337
directly infcrrablc from it), s/hc may d o so with a dcfinitc anaphor (pronoun o r
directly inferrable from it}, sthe may do so with a definite anaphor (pronoun or
NP). In so doing, thc spcakcr assumcs (1) that on t l ~ cbasis of thc discoursc thus
NP). In so doing. the speaker assumes (1) that on the basis of the discourse thus
far, a similar cntity will bc in (or "dircctly" infcrrablc from) thc listcncr's growing
far. a similar entity will be in (or "directly" inferrable from) the listener's growing
I3M iind (2) that thc listcner will bc ;tblc to re-access (or infer) that cntity on rhe
DM and (2) that the listener will be able to re-access (or infer) that entity on the
basis of thc spcakcr's cucs. (For cxamplc, pronouns arc less o f a cuc than
basis of the speaker's cues. (ror example. pronouns arc less of a Clle than
anaplloric NPs.) l h c problcm ~ h c n ,at Icnst for dcfililc anaphor5 is idcntifying
anaphoric NPs.) "Ihe problem then. at least for definite anaphor~ is identifying
what discoursc cntitics ;I tcxt naturally cvokcs.
what discourse entities a text naturally evokes.
What charactcrizcs a discoursc cntity? hly minimal vicw is that ;I discourse
What characterizes a discourse entity? My minimal view is that a discourse
cntity is a "conceptual coatlioclk" (a tcrm coincd by WiIlialli Woods) (111 which to
entity is a "conceptual coathoo~" (a term coined by William Woor,is) on which to
hang dcscriplions of thc cntity's rcal world or hypothctical world corrcspondcnt.
hang descriptions of the entity's real world or hypothetical world correspondent.
As soon as a discoursc cntity is evokcd. it gcts a dcscription. Oicr thc course of
As soon as a discourse entity is evoked, it gets a description. Over tile course of
thc tcxt. t l ~ cdcscriptions it rcccivcs arc dcritcd from both Ihc content of the
the tex!. the descriptions it receives arc derived from both tile content of the
spcakcr's utlcranccs a i d thcir position within the discoursc, as wcll as whatever
speaker's utLerances and their position within the discourse, as well as whatever
gcncral or spccific infornmation about tlic disco~~rsc
cntity thc listcncr can bring to
general or specific information abOllt the discourse entity the listener can bring to
bcar. (For cxamplc, as Lhc tcxt convcys thc passagc of timc. a dcscription likc "the
bear. (For example, as the text conveys the passage of time. a description like "the
16-ycar old girl that ..." might changc to "tlic 20-ycar old girl that....") Illese
16-year old girl tIlat..." might change to "the 20-year old girl thaL...) 'I11ese
dc~riptionsprovidc part of Ihc mcans by which a lisrcncr can dccidc tllc intcndcd
descriptions provide part of the means by which a listener can decide the intended
hrgct of subscqucnt dcfinitc anaphora (the othcr bcing providcd by focusing
target of subsequent definite anaphnrJ (the other being provided by focusing
rncchanisms. as discusscd by [Grosy. 19771 and Sidncr in Chapter 5 of this
mech~nisms. as discussed by [GroS/. 1977] and Sidner in Chapter 5 of this
volume). What 1claim is a spccial status for thc initial dcscription (11)) that tags a
volume). What I claim is a special St..1tus for tile inilial description (II)} that tags a
ncwly cvokcd discoursc entity. (Exilmplcs of such Irk Follow the cquivalcnce
newly evoked discourse entity. (Examples of such IDs follow the equivalence
symbol (=)in thc continuations to cxamplc D5-1above.)
symbol (=) in the continuations to example D5-1 above.)
In what way is a discoursc entity's I13 spccial? For onc thing, it is Lhe only
]n what way is a discourse entily's ID special? For one thing. it is the only
inf~nnationabout an cntity that can, from Ihc first and wifhout question, b e
infonnation about an entity tIlat can, from tile first and without question. be
assurncd to bc sharcd (though not ncccswrily bclicvcd) by both spcakcr and
assumed to be shared (though not necessarily believed) by both speaker and
listcncr alikc. 'lhus, a t Last initially, ir is an infcrencc that thc spcakcr can assume
listener alike. Thus, at least initial1y, it is an inference tIlat tile speaker can assume
Uic listcncr both capable of and likcly to makc. That UIC spcakcr nccdn't believe
the listener both capable of and likely to make. That Ule speaker needn't believe
thc dcscription for it to be cffcctivc is discusscd in [Pcrrault and Cohcn 19811 -the description for it to be effective is discussed in [Perrault and Cohen 1981] -h a t thc listcncr nccdn't bclicvc it cithcr is discussed in [Webbcr 1978bl. The
that the listener needn't believe it eimer is discussed in [Webber 1978b]. The
important thing is that it is shared, and hcncc uschl.
important tiling is that it is shared, and hence useful.
Now this view of discoursc undcrstanding docs not prccludc discourse entities
Now this view of discourse understanding docs not preclude discourse entities
from bcing cvokcd by othcr things than thc tcxt. In fiict, 1 will arguc that certain
from being evoked by other tIlings than the text. ]n fact. I will argue tIlat certain
typcs of discoursc cntitics must bc dcrivcd from othcr oncs infcrcntially. In
types of discourse entities must be derived from other ones inferentially. In
p3rticuI;lr, I will arguc that it is thc simplcst way of accounting for anaphoric
particular, I will argue that it is the simplest way of accounting for anaphoric
acccss to "gcncric sct" discoursc cntitics. 1 will show that, from any discourse
access to "generic set" discourse entities. I will show that, from any discourse
cntity (cxccpt, in gcncral, oncs cvokcd by a proper noun phrase, namc, title, ctc.),
entity (except. in general. ones evoked by a proper noun phrase. name, title, ele.),
thc spcakcr can prcsumc that a listcner is capablc of deriving a discourse cntity
me speaker can presume that a listener is capable of deriving a discourse entity
corrcsponding to onc of a limited numbcr of gcncric sets to which the rcfcrcnt of
corresponding to one of a limited number of generic sets to which the referent of
thc original discourse cntity belongs.
tile original discourse entity belongs.
I h c problcms I sct out to solve - idcntifying what a tcxt makes available for
The problems I set out to solve - identifying what a text makes available for
Wcbber
Webber
338
dcfinitc pronotlo nnaphora (and, it turns out, for "onc" anaphora as wcll) and
definite pronoulI anaphora (and, it turns out, for "one" anaphora as well) and
dcvcloping computationally fcasiblc ways for making thcm available in an N1-U
developing computationally feasible ways for making them available in an NLU
syslcm wcrc Lhus transformed into (1) idcntifying thc discoursc cntitics a text
system were thus transformed into (1) identifying the discourse entities a text
cvokcs and (2) ascribing to thcm appropriatc Ills. What I discovcrcd was that
evokes and (2) ascribing to them appropriate IDs. What I discovered was that
thcsc things dcpcnd hcavily on co,rrbirtaroric fcaturcs o f a scntcncc.' Moreover,
these things depend heavily on combinatoric features of a sentence. 1 Moreover,
thcsc fcaturcs can bc capturcd in Ihc structure o f a rcprcscntational formalisln (as
these features can be captured in the structure of a representational formalism (as
opposcd to itr lcxical content). and can bc tlic basis for prt~cdurcswhich identify
opposed to it5 lexical content), and can be the basis for procedures which identify
thc cntitics cvukcd by ij tcxt and dcrivc tllcir Ills. What I havc rcalizcd more
the entities evoked by a text and derive their IDs. What I have realized more
rcccntly is discussod in Section 6.5: namely, that thc sc~tiantic problem of
recently isdiscllsscd in Section 6.5: namely, that the sel1lantic problem of
intcrprcting "onc" anaphora2
can be rcduccd to thc alrcady considcrcd problcrn
interpreting "one" anaphora 2 can be reduced tD the already considered problem
of identifying possihtc "sct-typc" rcsolvants fur dcfinirc plural anaphora.
of identifying possihle "set-type" resolvant,> for definite plunll anaphora.
assumptions, I want to
Ilchrc finishing this statcmcnt of my f~~ndamcntal
Before finishing this statement of my fundamental assumptions, I want to
comment on whcrc 1 scc cvoking ond labcling discour-sc cntitics fhting into thc
CDmment on where I sec evoking and labeling discourse entities fitting into the
1. Combinatoric lcaturcs are disarssed in the ncxt scction. Bricfly put, the o m s I am considcring are:
1. Combinatoric features are discussed in the next scction. Brieny put, the OITeS I am considering are:
iteration "A window was ~estedin each house" implies Ihc speaker is viewing the situaljon in
iteration"A window was lested in each housc" implies the speaker is viewing Ihe situation in
tcrms of one tcsdng per house.
terms of one testing per house.
dcncndcncy 'A window was tesled in mch house" implies undcr one inlemrctation that the
dependency
"A window
testedoninIhc
each
house" house:
impliesa under
i laswialcd
l inlemrellltion
that the
parlicular
mndowwas
dcpcnds
particular
window
with housel
was tested
particular
on the
particular
house:
a window
a5.'iOCialcd
with housel
was Icsted
in window
housel. depends
a window
associaled
with
housc2
was leslcd
in hnuse2.
elc. Under
a dimerent
in housel.
a
window
a'isociated
with
housc2
was
tested
in
housc2.
etc.
Under
a
different
inlcrprcblion. thc particular window is indcpcndcnl of h e house. Ihc same one tested
interpretation. the particular window is indcpendenl of the house. ll1e same one tested
throughoul
throughout
cardinalily "'1-wo windows wcrc lcslcd in each house" implies Tor any given htusc lherc were two
cndinalily "Two windows were tested in each housc" implies for any given house there were two
windows Ic~lcd.whcrc h c two arc distinct From one anothcr. N o h h e scnlcnce docs not (on
windows tl'Slcd. where the two are distinct from one another. Notice the scntenee docs not (on
its own) imply anyd~ingahaul the cardinalily of thc cnlirc window SCL
its own)
anY~ling aboul the cardinality of the entire window SCl
2. as imply
opposcd
lo h e ~ynrflcfic.prohlcm o i chanclcrizing where "onc(s)" can and cannot occur. a
2. as opposed 10 the ~}'nlaeri(' prohlem of characlcrizing where "onc(s)'" can and cannot occur. a
prhlcm cd intcrcsl to transiormalional grammarians, cf. Seelion 6.5.
problem 3.of Clauscs
intcrest m
toatransformational
grammarians,
Seetionmms,
6.5. as may vcrb phrases. For example.
y dso cvokc discourse
entities ofcr.various
J. Clauses may also evok e discourse entities of various son~, as may verb phrases. For example,
Slir file disso1vpd)~aslinro theflour, [hen knead the douahfor 10 minutes or until elasrk
Stir Ille dis.to/vl'd )'Fa.tl inlo thejlour, IIlen knead the dough/or 10 minutes or until elastic.
'Ihc discourn cniity dwribablc as "[he dough" is cvokcd by Ihc lira clause, or ralher. the reader's
'Ille discoUTSC
entity describable
as "the
dough"
is evoked
the lirstinclause,
or rather, the reader's
understanding
or it. I lowcvcr,
I will
be ignoring
suchbyexamplcs
this discussion.
understanding oril. However, I will be ignoring such examples in Ihisdiscussion.
339
Webber
340
Aftcr this, I shall show onc way in which combinntoric aspects of a scntcnce
After this, ] shall show one way in which combinatoric aspects of a sentence
can be aniculatcd in a logical formalism, and hcncc provide a ntructural basis for
can be articulated in a logical formalism, and hence provide a 5tructural basis for
forming appropriatc discoursc cntity Ills. I h i s is illustrated bricfly in Scction 6.4,
forming appropriate discourse entity IDs. lhis is illustrated briefly in Section 6.4,
ancr which 1 discuss thc derivation of "gcncric sci" discourse cntitics from specific
afier which I discuss the derivation of "generic set" discourse entities from specific
ones and thc usc of both in understanding "onc" anaphora.
ones and the usc of both in understanding "onc" anaphora.
6.3.1 The ncrinilc/lndefinitc Distin"clion
1, As I mcniioncd carlicr. dcfinilc descrip~ionscan be uscd in two ways: they can be u r d like dclinite
L As I mentioncd
two ways:
they canmodcl
be used
like definite
pronouns 10earlier,
acccscdefinitc
cntiticsdescriptions
prcsumcd Lcan
o bebeinused
h e in
lislcncr's
d~smursc
or lhcy
rran be used to
to access
cntities inlo
presumed
to beItinis the
discourse model
or theythat
canis be
used to
pronounsevoke
new cniitics
that modcl.
Ihe listener's
kttcr use ordcfiniic
dcscriplions
rclcvant
here.
evoke new entitics into that modeL It is the latter use or dcfinite descriptions that is relevant here.
341
dcscription fornrs tlic entity's initial description (ID).
description fomls the entity's initial description (lD).
Noticc h a t forming thc sccond conjunct rcquircs all cllipscd vcrb phrascs in
Notice that forming the second conjunct requires all cllipsed verb phrases in
thc scntcncc bc rccovcrcd. If not, a scntcnce like
the sentence be recovered. Ifnot. a sentence like
1110-1 A woman wliom Wcndy knows is too.
DIO-l A woman whom Wendy knows is too,
would cvokc a discoursc cntity which coilld only bc dcscribcd as the
would evoke a discourse entity which could only be described as the
jusl-t~iet~lioncd
wo~)la)ln~hot~r
lV~11dykilows n,Jro i.s loo. 'lhis is not vcry useful from
just-mc/ltioncd woman who/H Wendy kllows who is too, This is not very useful from
thc point of vicw of reasoning about cntitics.
the point of view of reason ing about entities.
Iiowcvcr, a morc important rcason for requiring Lhc rccovcry of clIipscd vcrb
However, a more important reason for requiring the recovery of ellipsed verb
phrrlscs is that doing so LniIy rcvcal othcr noun pllrascs that should bc aswxiatcd
phrases is that doing so may reveal other noun phrases that should be associated
with discoursc cntiiics. Failurc to do w may result in subscqucnt dcfinitc
with discourse entities. Failure to do so may result in subsequent definite
anal)liora fiiling to havc rcfcrcnts. For cxamplc.
anaphora failing to have referents. For example.
111 1-1 John didn't bokc a cnkc for Wcndy. On thc othcr hand. Elliot
I>11-l John didn't bake a cake for Wendy. On the other hand. Elliot
did 0,but stic didn't like it.
did 0, but she didn't like it.
0 = bakc a cakc for Wcndy
o= bake
a cake for Wendy
il = (tic ']j~s~-ii~c~~tioncd''
cake that Elliot
it = the "just-mentioned" cake that Elliot
biikcd for Wcndy
baked for Wendy
If tllc cllipscd vcrb j)hrasc has not hccn rccovcrcd by thc tirnc thc bur clause is
Iftlle ellipsed verb phrase has not heen recovered by tlle time the but clause is
bcing prt~csscd.rhcrc \rill hc 110 way of accounting fur thc pronoun ir. (I inguists
being processed. there will hc no way of accounting fm the pronoun it. (I inguists
havc used thc tcrm "missing antcccdcnt" [Grinder and Postal 19711 to dcscribe
havc used tile tcrm "missing antecedent" [Grinder and Postal 1971] to describe
this situation. in wllicli thc "antcccdcnt" of a dcfinitc pronoun is not cxplicit,
this situation. in which fie "antecedent" of a definite pronoun is not explicit.
bcing somehow "containcd" in an cllipscd connitucnt.)
being somehow "contained" in an ellipsed constituent.)
'I'hc siimc chiiractcrislic bchi~viorof dcfinitcs and indcfinitcs just discussed for
The same characterisLic behaYior of detinites and indefinites just discusscd for
singular noun phrascs holds for plural noun phrascs as wcll. 'Ihc rcfcrcnt of the
singular noun phrases holds for plural noun phrases as well. The referent of lhe
dcfinitc plural pronoun the): likc thc rcfcrcnt of a dcfinitc singular pnlnoun, must
definite plum! pronoun ,he)', like the referent of a definite singular pronoun, must
satisfy a uniql~cdcscription sharcd by spcakcr and listcncr. While both indefinite
satisfy a uniquc description sharcd by speaker and listencr. While both indefinite
and dctinitc plural noun phrases in contcxt may evokc uniqhcly dcscribablc sct
and definite plural noun phrases in context may evoke uniql.lely dcscribable set
cntitics, lhc proccdurc for forming thcir descriptions again differs in the two cascs.
entities. tile procedure for forming tl1eir descriptions again differs in fie two cases.
Consider, for cxamplc, thc following scnrcnccs. (Comments arc in parcnthescs):
Consider. for cxample, the following sentences. (Comments are in parentheses):
1112-1 I mw Ihc guys from "Ycs" on 'I'V tonight. ( I saw all of thcm.)
Dl2-1 I saw tlle guys from "Yes" on TV tonight. (I sawall offiem.)
2 1 SAW tllc live guys from "Ycs" on 'l'V tonight. (I saw all of them
2 I saw tile five guys from "Yes" on TV tonight. (I sawall of them
- that is, tivc.)
- that is, five.)
3 1 saw a11 fivc guys from "Ycs" on 'rV tonight. (Usually they're
3 I sawall five guys from "Yes" on TV tonight. (Usually they're
only around in twos and Ihrccs.)
only around in twos and threes.)
4 1 saw somc guys from "Ycs" o n 'I'V tonight. (I didn't scc thcm
41 saw some guys from "Ycs" on TV tonight. (I didn't sec fiem
all.)
all.)
5 1 saw four guys from "Ycs" on '1'V tonight. ('lhcrc arc more
51 saw four guys from "Yes" on TV tonight. C1here are more
than four guys in Ycs.)
than four guys in Yes.)
'Ihc first thrcc scntcnccs cach contain a dcfinitc plural noun phrase.
"he first three sentences each contain a definite plural noun phrase.
Corresponding to that noun phrase, a discoursc cntity will bc cvokcd into the
Corresponding to that noun phrase. a discourse entity will be evoked into the
listcncr's discoursc modcl which can bc dcscribcd appropriately as rhe (ser ufl guys
listener's discourse model which can be described appropriately as the (sci of) guys
from 'Yes'. Clhc sccond two scntcnccs providc thc cardinality of that sct as wcll.)
from 'Yes'. ((he second two sentences provide the cardinality of that set as well.)
Wcbbcr
Webber
342
342
This can bc vcrificd by following cithcr of thcsc scntcnccs by '"'I'hcy wcrc bcing
This can be verified by following either of these sentences by '''They were being
intervicwcd by Dick Cavctt" and considering what is ncccsscd by /hey. Ihc last
interviewed by Dick Cavett" and considering what is accessed by they. lhe last
two scntcnccs, on thc othcr hand, cach contain an indcfinitc plural noun phrase.
two sentences, on the other hand, each contain an indefinite plural noun phrase.
'Thc only appropriate dcscription for thc discoursc cntity that cach of thcse noun
The only appropriate description for the discourse entity that each of these noun
phrascs in context cvokcs is something likc the ~ U S / - ! I I P I I ~ ~ Oset
I I Fof~ guys /TOm
phrases in context evokes is something like Ihe jushllel1tiol1ed set of guys [rom
'Yes' thut I SOU' 011 TI' tot~ight.'l'his is bccausc cithcr scntcncc is cansistcnt with
'Yes' that J saw 011 TV lonight. This is because either sentence is consistent with
thcrc bcing othcr mcrnbcrs of "Ycs" whom I didn't scc o n 'I'V tonight, as wcll as
there being other members of "Yes" whom I didn't see on TV tonight, as well as
othcr mcmbcrs whom I did scc but w1~1rn I don't mean to include in my
other members whom I did sec but whom I don't mean to include in my
st-~tcmcnt.'('l'hc last scntcncc simply providcs additional cardinality information
statement. (The last sentence simply provides additional cardinality information
about that sct of guys frum 'Ycs' that I saw.)
about that set of guys from 'Yes' that I saw.)
6.3.2 Qu;~ntilicrScoping
6.3.2 Quantifier Scoping
l'hc phcnorncnon to bc discussed hcrc has traditionally callcd "quantifier
The phenomenon to be discussed here has traditionally called "quantifier
scoping" or "quantificr ordcring", aftcr its fo~n~ulation
in thc lint ordcr predicate
scoping" or "quantifier ordering", after its formulation in the first order predicate
calcul~~s.
b r l y in thc devclupmcnt of'l'ransfomational G n m ~ ~ l aitr was
, found to
calculus. Early in the development of Transformational Grammar, it was found to
pusc a problcm for Ihc trcatmcnt of passivizatiun as an "optional" (it.,
pose a problem for the treatment of passivization as an "optional" (i.e.
sc~nanticallyneutral) transformation. 'Ihc problcm can bc illustrated by the
semantically neutral) transformation. '111e problem can be illustrated by the
minimally diffcrcnt pair of sentcnccs
minimally different pair of sentences
343
Wcbbcr
Webber
344
344
rcflccts thc considcrablc (and growing) body of cvidcncc that there is wide
reflects the considerable (and growing) body of evidence that there is wide
variation in pcoplc's grasp of quantificd scntcnccs: certain aspccts of thcm sccm to
variation in peoplc's grasp of quantified scntences: certain aspects of them seem to
bc undcrstood casily and consistcntly. evcn whcn the scntcnccs arc prcscntcd with
be understood easily and consistently. even when the sentences arc presented with
no contcxt (c.g., "thrcc <x>'sW implies. to people that Lhcre arc at lcast thrce
no context (e.g., "three <x)'s" implies. to people that there arc at least three
scparatc things dcscribablc as an <x>, ncvcr just onc or two things, each
separate things describable as an <x>. never just one or two things, each
dcscribablc in rnorc than onc Hay as an <x>). Certain othcr a$pccts pcoplc have
describable in more U1an one way as an <x. Certain other aspects people have
troublc grasping at all [\'an1 chn 19781 - much lcss trying to sclcct which ofscvcral
trouble grasping at all [Vanl ehn1978] much less trying to select which of several
altcrnativc rcadings is intcndcd (c.g., t l ~ cdata basc qucrics bclow that [I'homas
alternative readings is intended (e.g" the data base queries below that rrhomas
19761 rlskcd Ilirman sul)jcctsto answer.
1976] asked human subjects to answer.
- out
I'rint out any dcpartmcnts that scll cvcry articlc that some
. Print
anyrnakcs.
department~ that sel1 every article that some
company
company makes.
345
Plural noun phrnscs may providc infonnation about two scparatc things: a set and
Plural noun phrases may provide infonnation about two separate things: a set and
its mcmbcrs. For example,
its members. For example,
1117-1 thrcc dottcd lincs which intcrscct at point P
J)] 7-1 three dOlled lines which intersect at point P
2 chc hrcc dottcd lincs which intcrscct at point P
2 the Uuee dolled lines which intersect at point P
Dolled is a prupcrty of cach individusl line. Three, on thc othcr hand, supplies
DOlled is a property of each individual line. Three, on the other hand. supplies
information about tlic cardinality of thc scts of lincs which satisfy tllcse
infonnation about the cardinality of the set" of lines which satisfy these
descriptions. Morcovcr. UIC rclalivc clausc - which itlter.scc/ of poi17r /' - docs not
descriptions. Moreover. the relative clause - which illterscrt at point r - docs not
dirccdy restrict wllictl indi\idu:il lincs bclong to thcsc scts, but ralllcr spccilies a
directly restrict which individual lines belong to these sets, but rather specifies a
propcrty of apprt~priatcscts uf thrcc lincs. I'~.cnominal. prcpositioniil and clausal
property of appropriate set~ of three lines. Prenominal, prepositional and clausal
mod~ficrswi~hina noun phrase may all bc uscd to dcscribc cithcr a sct as a unit or
modifiers within a noun phrase may all be used to describe either a set as a unit or
thc sct's individual mcmbcrs.
the set's individual members.
For handling anaphora, a distinction must be drawn bctwcen sct and ~ncmber
For handling anaphora, a distinction must be drawn between set and member
infonnation within a plural noun phrasc. both for dcscribing lhc crility i[ cvokcs
infonnation within a plural noun phrase, both for describing the entity it evokes
and for dcscribing host cntitics cvokcd by any cmbcddcd noun phrascs.
and for describing those entities evoked by any embedded noun phrases.
Considcr tlic following scntcnces.
Consider the following sentences.
1118-1'I'hrcc mcn who tricd to lift a piano dropped it.
D18-1 Three men who tried to lift a piano dropped it.
2'l'hc thrcc lncn ~ h tricd
o to lift a piano dropped it.
2 The three men who tried to lift a piano dropped it
3 'l'hrcc mcn who tricd to lift a piano dropped them.
3Three men who tried to lift a piano dropped them.
4 ' l a c dlrcc lncn who tricd to lift a piano droppcd thcm.
4 The three men who tried to lift a pi.mo dropped them.
In the first two scntcnccs, thc rclativcclausc convcys information about the set
In the first two sentences, the relative clause conveys information about the set
of mcn as a unit. 'ihus it can bc undcrstotd as acccssi~igthc discourse entity
of men as a unit. Thus it can be understood as accessing the discourse entity
dcscribablc as the jusf-met~tionedpiunowhich rl~ejusl-,ner71iolzed three met1 tried lo
describable as the just-mentioned piano which the just-mentioned three men tried to
18. Howcvcr, in Ule sccond two scntcnces, thc rclativc clause convcys information
lift. However, in Ule second two sentences. the relative clause conveys infonnation
about cach mcmbcr of thc sct. 'Ilus /hey can bc understood as accessing the
about each member of the set Thus Ihey can be understood as accessing the
entity dcscribablc as rhe jusr-lllenrioned pianos, each of which one of the
entity describable as the just-mentioned pianos, each of which one of the
just-mentioned men tried to 18.'
just-mentioned men tried 10 lift.l
1. Thme four sentences hint at another distinction that must be madc in order to identify discourse
1. These
four sentences
at another
distinction
must be made in order to identify discourse
a noun
phrasethat
&curs crnbcddcd in a rclalivc clause (as a piano dm3
enlilia
adcquatclyhint whclhcr
entities above)
adequately'
whether
noun phrase
embedded
in a relative clause (a~ a piano does
or in the
matrixascnlcnce.
This isoccurs
discussed
a1 lenglh in [Wcbbcr 1978~1and p c b b c r l978bl.
above) or in the matrix sentence. This is discussed at length in [Webber 1978c] and (Webber 1978b].
346
Webber
one of three distinct scnscs that a sentence containing a plural noun phrasc can be
one of three distinct senses that a sentence conlaining a plural noun phrase can be
uscd to convcy. '!he hrcc scnscs 1 call distrihutivc, collcctivc and coniunctivg
used to convey. The three senses I call distrihutive. collective and conjunctive.
Cansidcr for cxample
Consider for example
(collective reading).
3. t l ~ the
~ t total of rosc-buying boys is three and the total
3. thatnumber
the totalofofroses,
rose-buying
boyswhich
is three
the total
cach of
wasand
bought
by some
numberrose-buying
of roses, boy,
each
of
which
was
bought
by
is fivc (cotijuncrive readirlg). 'I'his some
implics that
rose-buying
boy, is cithcr
five (conjunctive
reading).
Thisnot
implies
tllc speaker
does not know
or does
care that
to tell the
the speaker
either
knowupor with
does roscs.
not careAsto irtellis the
listencr
howdocs
boysnot
match
the least
listenercommital
how boys
match up with
it is wllcn
the least
dcfault
thcrc is no
interpretation,
it mayroses.
be tllc As
commital
interpretation,
contextual
bias. it may be the default when there is no
contextual bias.
It is important for the listener to identify the intended sense because of their
It is important for the listener to identify the intended sense because of their
diffcrcnt implications. That is,
different implications. That is,
- If Ihe cxamplc is understood distributively. it implics that
- If the
example
is understood
distributively,
that
each
of thc boys
owns fivc roses
as a rcsult itofimplies
thc transaction.
each of the boys owns five roses as a result of the transaction.
Distinguishing these implications is important not only for reasoning but for
Distinguishing these implications is important not only for reasoning but for
347
anaphora as well, as thc following pairs of sentcnces show:
anaphora as well, as the following pairs of sentences show:
1119-1' h c thrcc boys ordered a hrgc anchovy pi7.m.
D191 The three boys ordered a large anchovy pi1.Za.
2 Bccausc of thc hcavy traffic, ir was dclivcrcd cold,
2 Decause of the heavy traffic, it was delivered cold.
1120-1 'The thrcc boys cach ordcrcd a largc anchovy p i n .
1)20-1 The three boys each ordered a large anchovy pizza.
2 Dccausc of the hcavy traffic, [hey wcre dclivcrcd cold,
2 Decause of the heavy traffic, they were delivered cold.
l3ccausc English has a diffcrcnt pnlnoun for accessing a sct than a n individual,
Because English has a different pronoun for accessing a set than an individual,
thc distributivc usc of a plural must bc distinguished from a conjunctive or
the distributive usc of a plural must he distinguished from a conjunctive or
collcctivc usc. Olily whcn a plural is uscd to convcy distributivc quantification can
collective usc. Only when a plural is used to convey distributive quantification can
it changc the discoursc cntity cvokcd by a singular noun phrasc within its scope
it change the discourse entity eyoked by a singular noun phrase within its scope
from an individual to a sct. 71'l~is
mcans that a diffcrcnt pronoun would be uscd to
from an individual to a set. This means that a different pronoun would be used to
rcfcr to it.
refer to it.
Spccifically. in 1)19 il acccsscs a discoursc cntity appropriately dcscribcd as !he
Specifically. in D19 if accesses a discourse entity appropriately described as the
In 1120. /hey ~CCCSSCSthe
j u s / - ~ r i r ~ ~ / i oIurge
t ~ e da~rchoq.pizza/hat rlre Boj.5 ord~r~rl.
jus/-!I1rllfiol1cd large mre/lOl')' pizza 'hal ,Ire bo)'s ordeml. In 1)20. they accesses the
sct cvokcd by t l ~ csame noun phrasc, this tirnc dcscribablc as the sel of
set evoked by the same noun phrase. this time describable as the set of
jusf-~net~~io~~ed
Iorge ot~cholyp i i ~ nccich
~ ofwhich ~ l a ordered
s
by olie of rhe three
just-mell{iolled large anrho!')' pizzas, each ofwhirh was ordCfrd by ol1e of {he three
boj~s. 'Thc gcncral issuc is getting appropriatc descriptions. In thc original
boys. 'The general issue is getting appropriate descriptions. In the original
cxamplc,
example,
'lhrce boys bought fivc roses.
Three boys bought five roses.
dcpcnding on which sensc of three boys thc spcakcr mcans to convcy. the
depending on which sense of three boys the speaker means to convey. the
description appropriatc to the discourse entity cvokcd by jive roses will be
description appropriate to the discourse entity evoked by five foses will be
somcthing like
something like
Webber
Webber
348
348
1. The formalism nctually hcing uscd lo implcmcnt Ihc5c idcas i s K1;ONL a uniform language bascd
I. The on
formalism
actually
heing usedinhcr~lancc
(0 iml'llemenl!hese ideas is KL-ONE. a uniform language based
be idca
of Wuclurcd
nctworks [Ilrachman 1978. 19791. K1,-ONE has scvcral
on the idea of structured inheritance networKs [Ilraehman 1978. 1979]. KL-ONE has several
advanlagcs ovcr even a lypcd fin[-ordcr prcdicatc c~lculus( I 01'C) rormalism: bcing a non-linear
adl/antages
over c\'en aittyped
(I DPe) formalism: being a non-linear
allowsfirst-order
Tor pirlinlpredicate
ordcringcalculus
of dcpcndcncia.
(In Lhc IaIC. IcR-lo-righi ordcring
rcprcantalion.
representation. il allows for p~rlial ordering of dependencies. (In the F01'C. len-to-righl ordering
risidly dcfincs dcpcndcncies.) Morcovcr, il will allow us to rcprmnt -- in tcrms of mappings-- all and
rigidly defines
Moreover, it currently
will allowknown.[Robrow
us to represcnl and
.. inWcbbcr
terms of1980aI
mappings -- all and
only thedependencies,)
combinaloric informalion
only !he combinatoric information currently known.IBobrow and Webber 198Oa)
349
whilc thc noun phrasc a paar~ur[hat I.tfejendy gave lo a gorillrr ciln bc rcprcscntcd as
while the noun phrase a peanut that Wendy gave 10 a gorilla can be represented as
3x:,\(u:Pcanut)[(3y :Gorilla). Gave(Wcndy,u.y)]
I argucd in Scction 6.3 that in order to fonn appropriatc Ills, it was ncccssary to
I argued in Section 6.3 that in ordcr to fonn appropriate IDs, it was necessary to
distinguish whclhcr a noun phrasc was singular or plural. dcfinitc or indefinite.'
distinguish whelher a noun phrase was singular or plural. definite or indetinite.l
One way to d o so is to use a typcd existential quantificatit~nalopcrator ("there
One wny to do so is to use a typed existential quanlificatil1nal operator ("there
exist$", o r 3) for indcfinitc NPs and anothcr opcrator I!, to bc rcad "thcre exists
exisl<;", or 3) for indefinite NPs and another operator' 31, to be read "there exists
a unique" - for dcfinitc Nl's. 110th arc o f h e form
a unique" - for definite NPs. Both are of the form
<operatorXvariable):<S>
1. The rollowing discussion contains a more uniform treatmcnr of delinite and indefinite n w n
I. The following discussion contains a more unifonn tre30ncnt of definite and indefinite noun
phrases than that prcscnlcd in mcbber 1978al. Ilowever. it dm not ailcn~pllo capture the notion of
phrases undermnstraincd
than thal presentcd
in (W cbbcr 1978a]. Ilowe~er. it docs not attcmpt to raplure the notion of
combinatoticr
underconstraincd oombinatorics.
Wcbbcr
Webber
3x: Hat
3x: Hat
3!x:h(u:Hat)Saw(Suc,u)
3!x:>..(u: Hat)Saw(Sue,u)
3x: A(u:Hat)Rcd(u)
3x:>..(u:Hat)Red(u)
350
a hat
ahat
the hat Sue saw
the hat Sue saw
a rcd hat
a red hat
35\
prcdicatc truc
argitmcnt is a set
sct of men
mcn such that the set
sct of them
thcm lifted
liftcd a
is a predicate
true if
jf its argument
othcr hand,
piano. On the other
set(A(v:Man)[(3y:Piano). L(v,y)))
'
I.(v.y)]
(i) 3x: A(v:sct(Ma11))[(3y:l'iano)
A(v:set(Mml[(3y:Piano) L(v,y)]
some
somc men
mcn who
whit (togclhcr)
(togchcr) lifted
liftcd a piano
(ii)
(ii) 3x: set(iI(v:Man)[(3y:Piano)
sct(~(v:Ma11)[(3y:l'iano)L(v,y)))
I.(v,y)])
some
lifted a piano
somc men
mcn who (each)
(cach) liftcd
Definite plurals can bc
be rcprcscntcd
represented like definite
Dcfinitc
dcfinitc singulars
singulars using the "unique
cxi!itcntial"
cxistcntial" operator.
(iii) 3!x: il(v:seI(Man[(3y:Piano)
h(v:sct(Man))1(3y:Piano) l.(v,y)]
l.(v.y)]
O,e
men who (together)
Ulc lncn
(togcthcr) lifted
lincd a piano
(iv) 3!x: seI(A(v:Man)[(3y:l'iano)
sct(A(v:M;1n)[(3y:l'iano) L(v,y)))
I.(v,y)])
(cach) lifted
liftcd a piano
the lncn
meo who (each)
In (iv) the definiteness
the pluial
pluia! should be intcrprctcd
interpreted as indicating
dcfinitcncss of thc
indicating O,e
the lolal
rota1
set ofall
of all and only thosc
those individuals
individuals (in
(in the context)
contcxt) satisfying
satisfying the given
giwn predicate.
etc.),
included in
rwo metl,
met the
rhe (wo
rlvo men.
~r~err,
ctc.), can be
bc includcd
Cardinality, if
if specified
spccificd (c.g. (wo
these rcprcscntations
representations simply by using the cardinality "[
=
thcsc
"1 ["1" and equality
equality "="
operaton. For example,
cxamplc, parallel to (i)-(iv)
(i)-(iv) above are
operators.
3x:
A(v:set(Man))[(3y:Pig), L(v,y)
/\ [v[=3)
3x: A(v:sct(Man))[(3y:ap).
I.(v.y) A
lvl=3]
(togclhcr)liftcd
"thrcc rncn
"three
men who (together)
lifted a pig"
3x: il(u:set(A(v:Man)[(3y:Pig).
L(v,y)]([ul=3)
~(u:sel(h(v:Man)[(3y:Pig).I.(v.~)l))[lul=3]
"three
men who (cach)
hfted a pig"
"thrcc mcn
(cacb) liflcd
1.
lhat prcdicales
predicates like I.l.. ("lifC')
to both
both individuals
individuals and sets,
1. 1
1 am assuming
asuming U~ai
("lift") can
can be
bc applied
applicd la
a s , with the
appropriate
semantics falling
evaluation. This
"semantic ovcrloadlng"
overloading" is well-known
appropriate semantics
Salling out at
a1 cvalualion.
l h e nolion
nottan of
oS"seman11c
thee programming
in B
programming languages
hnguages lilerature.
lilcrature.
Webber
352
31x:
v:set(Man[(3y:Pig) . I.(v,y) 1\
Ivl = 3]
3!x: h(
A(v:sct(Man))[(3y:l'ig).
A lvl=3]
"the three
lhrce men
mcn who (together)
(togcthcr) lifted a pig"
31x:
L(v,y)])[Jul = 3]
3!x: h(u:sct(h(v:Man)[(3y:Pig)
A(~l:set(A(v:Man)[(3y:Pig)L(v,y)]))[lul=3]
"the
"thc tluee
Uircc men who (e;,ch)
(cr~cli)lifted a pig"
At this
be ppuulcd
the abscncc
absence of
of universal
this point,
point. the
Uic reader
rcadcr might bc
u ~ ~ l cabout
d
thc
quantifiers
V's
tllUS
far,
given
that
ill
clemen
wry
logic,
the
swndard
quantificn - V's - tlius far, givcn
in clcmcnciry logic.
standard practice is
to usc
'1l1C slnndard
standard cxamplc
example of
of this is
usc them
thcm to represent
rcprcsclit plural noun phrases. 'lhc
atc a pi7.1..a
pins
h c h man ate
F,ach
Vx: Man)(
Man)( 3y: Pizza)
P~iza)Ate(x,y)
Atc(x,y)
( 'Ix:
misses the point that such sentences are
Howcvcr, this misses
However,
arc rarely meant
mcant to
to imply true
truc
llicy imply that the
univcrsality. Rathcr they
univcrsality.
thc prcdicatc holds ofevery
of every rncmber
member ofsome
of some
nlorc limited
limited set
sct that the
thc speaker and listener
listcncr jointly rccognix.
h a t is, "each
more
recognize. ''mat
<xY
is
morc
correctly
interpreted
as
"each
of
thc
<x>sW
i.e.,
in
tcrms
of
<x>" is morc corrcctly intcrpreted "cach ofthc <x>s" - i.e.,
of a definite
dcfinite
sct and a univcrsal
univcml quantificr
quantifier ovcr
over that sct:
set:
sct
353
blCh man
a pilla,
(3!w:atesct(Man))(Vxcw)(
3y: Pizza) Alc(x,y)
As with o ~ h c dcfinitc
r
noun plirnscs, it is tlic task of pragmatics lo figure out what
As with other definite noun phrases, it is the task of pragmatics to figure out what
acctiunts for thc dcfinitcncss, including dcfinitcncss by virtue of being the
accounts for tile definiteness, including definiteness by virtue of being the
totrll/univcrsal sct of rhi~igsdcscribablc as an <x>. In thc ft~llowingdiscussion - as
total/universal set of things describable as an <x>. In the following discussion - as
in [Ibhrow and Wcbbcr 19811 - I will bc rcprcscnling cxplicitly thc dcfinite set
in [Bobrow and Webber 1981}- I will be representing explicitly the definite set
asst~iatcdwith "cach Nl's".
associated with "each NPs",
tIcrc tllc rcprcscntation for thc clausc is simply thc opcn formula
Here the representation for the clause is simply the open formula
Gave(x,y,z)
Wcbber
Webber
354
354
will vary according to how individual constants arc substituted for thc variablcs in
will vary according to how individual constants are substituted for the variables in
thc pattcrn. 'I'hc quantifier collar, on thc othcr hand, can bc vicwcd as a
the pattern. The quantifier collar, on the other h,md, can be viewed as a
co~~zbirraroricspec/'Jca~iot~which dctcrmi~~cswhat ordcrcd con~binntions of
cOlllbinaloric specification which. determines what ordered combin<ltions of
constants can bc assigncd to thc variablcs to instantiatc or stamp out copies of the
constants can be assigned to the variables to inst<lntiatc or stamp out copies of the
pattern. Among Lhc co~nhinatoricconstraints on individual instantiations arc the
pattern. Among the combinatoric constraints on individual instantiations arc the
thrcc carlicr mcntioncd factors - dcpcndcncy, distribution and cardinality. It is
three earlier mentioned factors' dependency, distribution and cardinality. It is
with rcspcct to thcsc thrcc factors1 that thc rulcs for forming appropriate spccific
with respect to these three factors! that the rules lr forming appropriate specific
(as opposcd to gcncric) discoursc cntity Ills can bc spccificd.
(as opposed to generic) discourse entity IDs can be specified.
.
In what follows, I will first prcsclit two rulcs for forming Ilk whcrc thc
In what follows, J will first present two rules for forming IDs where the
cvoking iiolln phrasc (NP) is no1 dcpcndcnt on any iteration, and dlcn two rulcs
evok ing JlOlln phrase (NP) is not dependent on any iteration. and then two rules
for forming 11)s whcn h c r c is such a dcpcndcncy. 'I'hc Ills arc filrmcd by a
for forming IDs when there is such a dependency. The I\)s arc formed by a
prwcdurc which movcs across a clausc rcprcscntation Icft-to-right, applying
procedure which moves ,ICTllSS a clause representation left-to-right, applying
wllichcvcr rulc matchcs in ordcr to idcntify tltllc ncxt discoursc cntity. and thcn
whichever rule matches in order to identify the next discourse entity, and then
rcwriting thc rcprcscnti~tionin tcrrns of h a t cntity in ordcr to rcmovc sonlc of
rewriting the represent,ltion in terms of lIlat entity in order to remove some oft'le
qusntificational cotnplcxily. ('l'his should hccomc clcarcr' through tllc cxamplcs.)
quantific.ltional complexity. (This should hecome clearer through tlle examples.)
Onc filrthcr now bcforc bcgiluning: in [Wcbbcr 1978i11, I nccdcd six tulcs in
One further noll~ before beginning: in [Webber 1978'1], I needed six rules in
ordcr to account for thc salnc data as hcrc. 'I'hc current reduction comcs from a
order to <lccount for the same data as here. The current reduction comes from a
more uoiform trcatmcnt of dcfinitc and indcfinitc Nl's - cf. Scction 6.4.2 - and a
more uniform treatment of definite and indefinite NPs - cf. Section 6.4.2 - and a
trcntmcnt of "cach" NPs as itcnting ovcr sornc definite (possibly
tr9atment of "each" NPs as iterating over some definite (possibly
discoursc-dcfinitc) sct - i.c.. rcprcscnting "cach" Nl's as a quantihcr scqilcnce of
discourse-definite) set - i.e., representing "each" NPs as a quantifier sequence of
thc form (3! V). cf. Scction 6.4.3
the fonn (3! \f). cf. Section 6.4.3
Non-itcratcd Contcxts
Non-iterated Contexts
Hcrc wc considcr discoursc cntitics cvokcd in thc following two contexts
Here we consider discourse entities evoked in the following two contexts
( 3x : <type) . p(x)
( 3!x : <type , P(x)
1. disregarding other factors like lense. negation and modality [Webber 1978a]
1. disregarding other factors like tense, negation and modality [Webber 1975al-
355
I will also not make cxplicit any quantifiers to thc right of the onc of iritcrcsf
I will also not makc explicit any quantifiers to the right of the one of interest,
absorbing thcm rather illto the predicate for simplicity - i.e..
absorbing them rather into the predicate for simplicity - Le.,
( 3xQ) QlIantyQlIant k
,\(u)[QlIant2... Quant k . Pu:...] x
P(x)....::>
( 3xQ)
= deft 3xQ) . P'(x)
Itulc 1: 'l'hc first n ~ l ca1,plics in tllc following indefinite contexts (lcft column) to
Rule 1:--The first nile applies in the following indefinitc contexts (left column) to
product discourse cnritics with Ills as in tl~cright column. ('l'hc cl;rusc being
produce discourse entities with IDs as in the right column. (The clause being
prtrcsscd is labclcd S and i sti~ndsfor t l ~ ciota Ft~nctionuscd in forming dctinite
processed is labeled Sand i stands for the iota function used in forming definite
descriptions.)
descri ptions.)
( 3xQ). p(x)
( 3x: sct(Q)). P(X)
( 3x: sct(Q))(VXEX) . p(x)
~ule
Wcbber
Webber
356
(3!x:Q). p(x)
(3!x:sct(Q). P(X)
( 31x:sct(Q)('v'XEX). P(x)
iX:Qx
iX:QX
iX:QX
This nllc is very sirnylc, assigning to cach discourse cntity. associated with an
This mle is very simple. assigning to each discourse entity. associated with an
indcpcndcnt dcfinitc NP. simply that description. I f thc dcfinitc is mcnnt t o b e
independent definite NP. simply that description. If the definite is meant to .be
annphoric, lhcn this ncw description must bc compatible with oncs already
a'naphoric, then this new description must be compatible with ones already
a~tributcdto ~ l i ccntity so spccificd. If not, Ulc 113 is assigncd to thc ncwly evoked
attributed to the entity so specified. If not. tile IJ) is assigned to the newly evoked
cntity. Rule 2 is intcndcd to co\cr all thc following cascs:
entity. Rule 2 is intended to cover all the following cases:
--
357
Sornc boy kisscd a girl hc likcd.
Some boy kissed a girl he liked.
( 3x1: Ii~y)(3x~:h(l1:Girl)
[l.ikcd(xl,u)]) K(xl.xZ)
( 3x I: Boy)(]x2: x(u:Girl) [Liked(x 1,u))) K(xI,x2)
Aftcr thc first quantifier (3xl: Boy) is prcxcsscd and its asst~iatcddiscourse cntity
i\fter the first qllantifier (3xI: Boy) is processed and its associated discourse entity
(el) idcntificd, w c can rcwritc this wK in tcms of cl, thcrcby rcmuving thc first
(cI) idcntified, we can rewrite this wff in tenns of el' thereby removing the first
quantilicr LC.,
quantifier Le.,
Iterated Contexts
Hcrc wc consider discourse cntitics cvokcd in the remaining two contexts
Here we consider discourse entities evoked in the remaining two contexts
and
and
( 3yl~el)-.(3ykEek)
where el...ck are sct-typc discourse entities. How arbitrary wffs can bc rewritten
where cI...ek are sct-type discourse entities. How arbitrary wITs can be rewritten
Wcbber
358
Webber
358
in the above rather gcncral format will bc justified after Rulcs 3 and 4 are
in the above rather general format will be justified after Rules 3 and 4 are
presented.
presented.
Rule
J:The third rule applies in the following three indefinite contexts (left
Rule
I: 'l'hc third rulc applies in thc following thrcc indcfinitc contcxts (left
(VYl"'Yk)(]x:Q). p(x)
(VYl"'Yk)(3X: sct(Q. P(x)
(VYl'''Yk)(3X: sct(Q))(\fx: X). p(x)
Each cat
atc a mouse~(u:Mousc)[Saw
it saw.
y,u]) .Ate y.x
(Vycl)(3x:
(VyEel)(3x: ,I,.(u:Mouse)[Saw Y,u]). Ate y,x
(wlicre cl is the discourse cntity associated with the dcfinitc set o f cats) the
(where e1 is the discourse entity associated with the definite set of cats) the
discoursc cntily dcscribable as
discourse entity describable as
h e sct of tliings each of which is a rnausc and for cach of which
!.he set thcrc
of things
eachwho
of which
is a mouse
andwhich
fOf each
whichby S
wasof
evoked
is a cat
saw it and
atc it and
there is(Xa 1cat
who
saw
it
and
ate
it
and
which
was
evoked
by
( ~ Y E cMousc(x)
~)
A Atc(y,x) A Evokc(S.X)) S
(x I (3yEel) Mouse(x) 1\ Ate(y,x) 1\ Evokc(S,X}}
and associating with thc indefinite plural in a clause like
359
IXI(3Y]"'Yk)' Qx]
IXI(3Yl"'Yk)' sCl(Q)X]
[XI(3y],..y}.). sct(Q)X]
It has the effect of associating with the NP "the steering wheel" in the first
clause
Wcbber
Webbcr
360
thc sct of things. cach of which is associated with somc car and
the sel of things. cilch of which is associilted with some car and
is !he sct c ~ two
f front whccls for that car
is !he set (If two front wheels for that car
(YI (3xEcl) h(u:scl(Whccl))[lul= 2]Y A Havc(Y.x))
(YI (3xEe1) A(u:scl(Wheel[Iul = 21Y A Havc(Y.x
where cl is again thc discoursc cndty associatcd with thc dcfinite sct ofcars.
where el is again the discourse entity associated with the definite set ofcars.
13efarc winding up &his prcscntalion, 1 want to take up thc restriction in Rules
Before winding up this presentation. I want to take up the restriction in Rules
3 and 4 h a t a distributive contcxt can bc rcprcscntcd simply as a quantifier collar
3 and 4 that a distributh'e context can be represented simply as a quantifier collar
of the form
of the fonn
361
Not all dcfinitc plural anaphora are intcnded to spccify particular scts of <x>s
Not all definite plural anaphora are intended to specify particular sets of <x>s
evoked by a tcxt. Othcrs sccm intcndcd to spccify scts that onc c o ~ ~characterize
ld
evoked by a text. Others seem intended to specify sets that one could characterize
roughly as "thc sct of Lhings dcscribablc as an <x>". 'I'hcsc sct cntitics I have
roughly as "the set of things describable as an <x>". These sct entities I have
callcd "generic sca", although I do not mean to imply thcrcby ha[ <x> nccd be
called "generic sets". although 1 do not mean to imply thereby thaC <x> need be
any son of "natural gcnus". For cxamplc, just as a dcfrnitc plural anaphor may
any sort of "natural genus", For example, just as a definite plural anaphor may
spccify a particular scts of <x>s like
specify a particular sets of <x>s like
1121-1 I scc thrcc Japancsc cars outside.
])21-11 sec three
carsbclong
outside.
2 110 Japanese
any of thcm
to you?
2 Do any of them belong to you?
Webber
362
D22-1
each boy a grccn'r-shirt
green T-shirt at
1122-1 Last
I.ast week
wcck Wendy
Wciidy again bought cach
Maey's.
Macy's.
2 She's always buying them.
them;;
just melltiolled
green T-shirts,
T-shirt.~ each o
oJwhich
Wendy
rhej~rst
ttietlriotledgreen
f which kVe~~dy
!hem = the
bought
at
Alacy'sJor
some
bay)
bough! a! Aiacyifor SUIIIC boy)
a definite
may also specify
generic sct
set entity like
dclinitc plural anaphor
nnaphorrnay
specify a;I gcncric
D23-1
the parking lot.
1)23-I I1see
scc seven
scvcn Japanese
Japancsc cars in thc
22'llicy'rc
'Illey're really
rcally selling
sclling like hot cakes.
the)'
ttrej, == Japanese
Japat~esccars
can
D24-1
green 1'-shirt
T-shirt at Macy's,
Macy's.
1124-1 Last
1.ast week
wcck Wendy
Wcndy bought
buught each
cach boy a grccn
2 She
gives
them
to
everyone.
Shc givcs thcrn cvcryonc.
them;:;;
!he111 grew
green T-shirts
T-shirts
'lhc imponant
important questions
qucstions regarding access to gcncric
The
generic sets arc thus:
1.
interpreted as specifying a
1. When is a definite
dcfinitc plural anaphor intcrprctcd
set?
generic set?
2. Is there
thcrc a limit on the
thc generic
gcncric scts
h a t"aa dcfinitc
2.
sets tthat
definite plural
specify, and if so,
so. in what way is that limit related
anaphor can specify,
thc material
matcrial prescnt
prcscnt in the
thc tcxt
to the
text and whcrc
where it is located?>
Aspccts of these
thcsc questions arc discussed
discusscd in Sidner
Sidncr (Choptcr
Aspects
(Chapter 55 of
of this volume)
volume) and
[Sidncr 1979].
1979). In particular.
p;lrticular. she
shc shows that it is the clcmcnts
(Sidner
clements jg
in 1il!;ill aatt any
timc that
h a t arc the
thc major (if not thc
particular time
dle only) tcxtual
textual source of
of gcncric
generic set
cntitics. Reflecting
llcflccting this.
this, she
shc augments
augmcnts her anaphor resolution hcuristics
entities.
heuristics for
dcfinitc plural anaphora to try generic
gcncric set rcw~lvants
definite
rcsolvants bascd
based on the clc~ncnts
clements in
frrus
at
thc
particular
timc.
'lhc
cornplcmcntary
problcm
that
1
havc
focus the
time. The complementary problem dlat I have considered
considered
briclly here
hcrc is tllat
Uiat of characterizing this "bascd
and want to discuss briefly
"based on" relation
relation
hctwccn foclIsed
fcruscd clemenL~
clcnicnLs and generic
gcncric set
sct cntitics
between
entities and hcncc,
hence, the range of
of gencric
generic
363
sct cnlitics that can and cannot bc accesscd?
set entities that can and cannot be accessed.l
For cxarnplc, thc cntity dcscribablc as the set ofjus~-n~et~tiot~ed
grcerr T-shirts
For example, the entity describable as the set o/just-mentioned green T-shirts,
each of which Il'endj~ gave 10 sonle boj, can givc risc ro an cntity appn)priatcly
each of which Wendy gave to some boy can give rise to an entity appropriately
dcscribablc as greet1 T-shirts as in 1124 abovc, or cvcn T-shirts as in 13-23 below,
describable as green T-shirls as in 1)24 above, or even T-shirts as in 1)-23 below,
but not shirls, conort rfiings.ctc. If onc o f thc lattcr is rcquircd to understand an
but not shim, col/all things. etc, If one of the latter is required to understand an
uucrancc, it is distinctly biTarrc.as in cxamplc 13-25.
utterance, it is di~tinctly bizarre, as in example 1)-25.
1125-1 I.ast wcck Wcndy bought cacli boy a grccn '1'-shirt at Macy's.
/)25-) Last week Wendy bought each boy a green T-shirt at Macy's.
2 Shc prcfcrs thc~iiin mtwc subducd colors, but lhcsc wcrc on
2 She prefers thelll in more subdued colors, but these were on
salc.
sale,
1126-1'I'hc grccn '1'-shirt yclo gavc mc is lovely.
/)26- I The green T-shirt you gave me is lovely.
2 ?? I3ut I prcfcr thcin wid1 long slccvcs and a button-down collar.
2?? But I prefer them with long sleeves and a button-down collar.
111en1 shirr5
thelll == shirts
I would likc to claim that Lhc listcncr call gcncratc ncw gcncric-sct cntitics,
I would like to claim that the listener can generate new generic-set entities,
whosc 11)s arc bascd on gcilcralizations of a rccent dcscl.iption thc listcncr has
whose IDs are based on generali/.ations of a recent description the listener has
cilhcr hcard or dcrivcd. 'll~cscgcncraliza~ionswill bc lilllitcd to oncs that the
either heard or derived, '111ese generalizations will be limited to ones that the
listcncr can, with sumc ccrt;iinty, assulnc that lhc spc;tker avumcs that s/hc the
listener can, with some certainty, assume that the speaker assumes that s/Ile the
Ijstc~lcr call (and will) makc. 'l'hat is, thcy will rarcly dcpcnd on world knowlcdgc
listener' can (and will) make. That is, they will rarely depend on world knowledge
cvcn a typc/inhcritancc liicrarchy, sincc that cannot bc assumcd to bc sharcd.
. even a type/inheritance hierarchy, since that cannot be assumed to be shared.
As for thc dcscriptiolls that are subjcc~to such gcncralizations. 1 agrcc with
~s for the descriptions that are subject to such generalizations, I agree with
~ i d n c rthi~tthcy arc rclalcd to notions of focus - what tlic spcakcr'is talking about
Sidner that they are related to notions of focus - what the speaker' is talking about
and in tcrms of. Such available dcscriptions can includc nr)t only (1) thc Ills
and in terms of. Such available descriptions can include nut only (l) the IDs
dcrivcd for and ascribable to all thc focuscd discourse cntitics. but also (2) those
deri~ed for and ascribable to all the focuscd discourse cntitics, but also (2) those
dcscriplions in thc tcxt which don't cvokc or acccss discuursc cntitics. To scc this,
dcscriptions in the tcxt which don't evoke or access discourse entities. To sec this,
considcr thc following two cxarnples.
consider the following two examples.
D27-1 Wcndy bought somc 'T-shirts ycstcrday,
D27-1 Wendy bought some T-shirts yestcrday.
Usually shc chargcs thcm. but ycstcrday, shc paidcash.
Usually she charges them, but ycsterday. shc paid cash.
then?G T-shirrs Wendy buy3
them == T-shirts Wendy buys
1128-1 Wcndy wouldn't buy a grccn T-shirt, bccause thcy always run
D28-1 Wendy wouldn't buy a grcen T-shirt, because they alwa:ys run
in the wash.
in the wash.
theti E green T-shirfs
them == green T-shirts
In U27 UIC gcncric sct accesscd by rhon is not dcscribablc by a gcncralization
In 027 the generic set accessed by them is not describable by a generalization
1. Thcrc arc olhcr dcfini~cplural anaphors ha1 a c m to target cnlilics corrcspondirig lo Ihc "natural
1. Therescl"
are lo
other
definile
pluralindividual
anaphors that
seem pcrhaps
to larget entities
corresponding
10 the I'rincc
"natural(personal
which
a givcn
belongs.
in a given
conlcxL I31cn
set" 10 communication)
which a given has
individual
perhaps examplc
in a given
conlexL orspokcn
IJlen Prince
(personal
poirltcd belongs,
out Ihc rollowing
in a lranscripl
narralivc:
communication) has pointed out the rollowing example in a transcript or spoken narrative:
I wcnt lo pick up Jan Ihc olhcr day. You know, they live in thai big house on Vine.
I wentto piek up Jan the other day. You know, they live in thaI big house on Vine.
Ilcre rhey .seems LO a c c w Jan's nalural "living" scl - i.c.. hcr family. Ilowcvcr. I don't plan to discuss
I1ere ,hey
seems
access Jans natural
"living" sel
I.e.,inrcrenlial
her ramily.proccss
I1owever,
don't plan
discuss(and not
hcrc
Ihc to
charactcrisiics
and boundaries
or .that
thatI nrakcj
suchtoenLitics
here the other
characteristics
and boundarics
or and
that allows
inrcrential
processLOthat
makespresume
such entitics
(and not
ones) available
to the listener
Lhe spcaker
corrcclly
that availabilily.
other oncs) available to the listener and allows the speaker to corrcctly prcsume thaI availability.
Webbcr
Webbcr
364
364
365
Although in most cascs it is casy to distinguish anaphoric from formal or numcric
Although in most cases it is easy to distinguish anaphoric from formal or numeric
"onc" on surfacc syntactic grounds alone, it is possible for thcrc tu bc syi~tacrically
alone, it is possible for there to be syntactically
"one" on surface syntactic grounds
ambiguous cascs in tcxt, 1 c.g.,
ambiguous cases in text,l e.g.,
- Sincc anyonc can choosc his favorite numbcr, I want one.
- Since anyone can choose his favorite number, ] want one.
Sincc John has a cat and I don't, I want one.
Since John has a cat and I don't, I want one.
. w
condition: 2 = 5
condition: 2 = 5
1,2,3,4, one .6
==- 1.2,3,4, Number
one ,6
Number
whcrc a NOM inhcrits its fcaturcs (c.g, count NUMI%I:R, clc.) from thosc of its
where a NOM inherits its features (e.g. count. NUMBER, ctc,) from those of its
1. In speech. h e ambiguity may not arise bemuse anaphoric "one" is uns~rwcd,while Ihe other Lwo
1. In ~T'ccch.
ambiguity
uscs ofthc
"anc"
aren'l. may not arise because anaphorie "one" is ullStressed. while thc othcr Lwo
uses of "onc" aren'l
Wcbbcr
Webber
366
366
hcad noun. Informally, thc above transformation states that a' NOM constitucnt
head noun. Infonnally. the above transfonnation states that a" NOM constituent
prcccdcd by an adjcctivc or dcfinitc dctcnnincr, wliosc licad is a count noun, can
preceded by an adjective or definite detenniner. whose head is a count noun, can
is singular or
bc rcplaccd by "onc" o r "ones" (dcpcnding on whcthcr d ~ NOM
c
be replaced by "one" or "ones" (depending on whether the NOM is singular or
plural in NUMIEK) if an identical NObl appears carlicr in the scntcncc. I h i s
plural in NUM HER) if an identical NOM appears earlier in the sentence. This
transformation is mcant to account for cxa~nplcslike
transformation is meant to account for examples like
1129-1 1 prcfcr thc stripcd tic you got from your aunt to the paislcy
D29-11 prefer the striped tic you got from your aunt to the paisley
onc.
one.
'l'hc problcm with this structt~ral-identityaccount is not only that it is limitcd
The problem with this structural-identity account is not only that it is limited
to indii idual scntcnccs. hut h a t it is not cvcn an adcquntc syntactic account a t that
to indh idual sentences, but that it is not even an adequate syntactic account at that
Icvcl. Considcr for cxamplc is thc following.
level. Consider for example is the following.
1130-1 If Mary offcrcd you a ncw Porschc and Sally offcrcd you a '68
D30-1 If Mary offered you a new Parsche and Sally offered you a '68
Morgan. wliicli onc would you choosc?
Mnrgan, which one would you choose?
Undcr no analysis docs lhis scntcncc nlcct thc aructural conditions of Rakcr's
Under no analysis docs this sentence meet the structural conditions of Raker's
rulc: rather n,hich V I I P means ro~lglily"whicl~mcrnbcr of thc sct consistin: of the
rule: rather which 011(' means roughly "which member of the set consisting of the
ncw I'orschc Mary offcrcd yo^^ and tllc '68 hlorgtn Sally on'crcd you". Bakcr's
new Porsche Mary offered you and the '68 Morgan Sally oncred you". Baker's
approach has nothing to say about 1his.I In tcxt linguistics, a particularly clear
approach has nothing to say about this. l In text linguistics, a particularly clear
falhcit purcly discursive) analysis of both dcfinitc pronoun and "onc" aniiphora is
(albeit purely discursive) analysis of both definite pronoun and "one" anaphora is
prcscntcd in [tlalliday nnd Hasnn 19701, tvhcrc thc primary concern is with h e
presented in [Halliday .md Has.1n 197(1]. where the primary concern is with !he
notion of "cohesion" what makcs a tcxt l~oldtogcthcr. what makcs it morc than a
notion of "cohesion" what makes a text hold together, what makes it more than a
random sct of scntcnccs. According to thc authors. botli dcfinitc pronouns and
random set of sentences. According to the authors. both definite pronouns and
"onc(s)" can instantiate typcs rrf cohcsivc relations: thc fornlcr, thc rclation of
"one(s)" can instantiate types of cohesive relations: the former, the relation of
"rcfcrcncc", thc latlcr, the rclation of "substitulion". "llefcrcncc". as Halliday
"reference", the latter, the relation of "substitution", "Reference", as Halliday
and Hasan use the term, rclatcs a tcxt clcmcnt likc a dcfiiiitc pronoun and
and Hasan usc the term, relates a text element like a definite pronoun and
...something
clse by rcfcrcncc to which it is intcrprctcd in the
...something
else by reference
to which
it is interpreted
in therelation
givcn instance.
Kcfcrcnce
is a potentially
cohesive
given because
instance.thcReference
a potcmiaJly
cohesive
thing that isscrvcs
as the sourcc
of tlle relation
interpretation
becausemay
the itsclf
thing bc
thatanserves
as the
of the interpretation
of source
tcxt [Idalliday
and Hasan 19761.
clcmcnt
may itself
be
an
clement
of
text
[Halliday
and Hasan 1976],
pp.308-9.
ppJOg-9.
Except for their terminology, Halliday and Hasan's gcncral position on definite
Except for their tenninology, Halliday and Hasan's general position on definite
anaphora and its rcla~ionto the djscoursc is not all that far from that which i have
anaphora and its relation to the d}scourse is not all that far from that which I have
bccn attempting to formalize.
been attempting to formalize.
"Substit~ition" on thc othcr hand, is
"Substitution" on the other hand, is
1. baker poses an additional constraint on ''one" anaphora in (Raker 19791 - eWeclively. a
L Baker"transderivalional
poses an additional
constraint
on "one"
anaphora
in (l\aker
1979] .trans~ormalional
effectively, a rulcs
constraint"
arbitrating
bclween
optional,
applicable
"transderi_alional
constraint"
arbitraling
belween
optional,
applicable
transrormational
rules.cxamplcr
Ilowever. this still lreals "one"anaphora purcly inira-scntenlially and !,[ill docs not addrcss
However.
thisasstilllrea15
"one" anaphora purely intra'scntentially and still docs not address examples
such
DU)above.)
such as D30 above.)
367
formal (lexicogrammatical)
(lcxicogrammatiol) relation.
rclation, in which a form
form (word or
a formal
Uic usc of a grammatical signal
spccificd through tl,e
words) is specified
gonc bcforc.
bc recovered
rccovcrcd from
from what has gone
before.
indicating that it is to be
The source
sourcc of recovery
rccokcry is the text,
tcxt, so that the relation
rclation of
Tbe
inhcrcntly
substitution is basically an cndophoric onc. It is inherently
cohcsivc, since
sincc it is the
thc preceding
prcccding text
tcxt that provides
providcs the relevant
cohesive,
cnvironmcnt in which the
thc presupposed
prcsupposcd item
itcm is located
environment
[Flelliday and Hasan
Hawn 1976];
19761; p.308.
[Halliday
pJ08.
S o unlike
i~nlikcdefinite
dcfinitc pronouns. "one(s)"
"onc(s)" establishes
cstablishcs cohesion
cohcsion simply at the
thc level
lcvcl of
So
smlcturc. Thus
'l3us except
cxccpt for
fix not confining
confining itself
itsclf to the
thc single
wording and syntactic structure.
scntcncc and being
bcing more
ink~rcconcC'rncd
conccrncd with the filnction
"onc(s)" than with its
sentence
function of "0I1C(5)"
syntax, Halliday and Hasan"s
llasan's account of "onc(5)"
"onc(s)" anaphora
snaphora still
still mirrors
formal syntax.
flaker's.
Raker's.
In [Webber
[Wcbbcr 1978a),
1978al. I1 lOok
took.an
to fonnalizing
fonnali~ingwhat
whit1 a text
tcxt makes
makcs
'an approach 10
available
loo far from
from Halliday and Hasan's. I
a\;~ilablcfor "one"-anaphora that was not too
hascd that work on the
thc view
kicw that
tliat what "one"
"onc" accessed
acccsscd was a "description" that the
based
fclt was available
availablc to the
thc listener.
listcncr. Such descriptions
dcscriptions can bc
spcakcr felt
speaker
be made available
thc speaker's
spcakcr's and hearer's
hcarcr's shared
sharcd spatio-temporal
spatio-tcmporal context.
contcxt as in two pcoplc
by the
people
peering into a geology exhibit
pccrir~g
cxliibit case
casc and one
onc saying
saying to the
thc other.
othcr, "Even
"llvcn larger ones
wcrc.found
thc Marc
hlilrc Cambrium."
Caml)rium." However,
Howcvcr, a speaker can usually rely
rcly more on
were
found in the
descriptions
slhe
has
uttered
being
available
to
tl,e
listener.
lIence.
uttcrcd bcing availablc thc listcncr. Ilcnce, the most
dcscriptions s/hc
likcly place to look for descriptions
dcscriptions accessible
acccssiblc to "one" anaphora is the text.
likely
iny 1978a approach.
approach, I came to
IO feci
feel
With more thought about the problcms
problems in my
simplcr account was possihlc.
cllrrcnt approach to
Bl anaphoric "onc(s)"
"one(s)"
possible. My current
that a simpler
rcduccs it to tl,e
Uic earlier-discussed
carlicr-discussed problcm
idcntifying the
thc possiblc
rcsolvanu of
possible resolvanlS
reduces
problem of identifying
?his approach is bascd
thc intuition that "one"
dcfinitc plural anaphors.
anaphors. This
definite
based on the
phrascs
indicatc to a listener
listcncr selection
sclcction from
from a set.
set. That
Ihat is,
is. tl,e
UIC interpretation
phrases always indicate
bc the same as the
thc interpretation
intcrpretation of "one of them".
them".
of anaphoric "onc" should be
lhis reduces
rcduccs the
thc problem
problcm to the
thc (still nontrivial)
non-trivial) one of identifying
idcntifying the
thc set-type
sct-type
This
spccific and generic)
gcncric) that this implicit "them"
"thcm" can
can access.
acccss?l
discourse entities (both specific
l h i s way of treating "onc" anaphora may seem
sccm fairly obvious
obvious h~re:
hcrc: however,
rIbis
obviousncss only follows
follows from considering the
thc sets
scts a text
tcxt makes
makcs available
available for
its obviousness
acccss and realizing
rcalir.ing that these
thcsc selS
scts -- both specific
spccific and generic
gcncric -- must also be
access
providc an account of definite
dcfinite anaphora. As for the evidence,
"around" to provide
1. Evidence
Evidcncc for
lor this approach
appraach also comes
comcs from
lrom Baker
Dakcr [Baker
[Dakcr 1978].
1978]. Ills
llis rewrite
rcwrilc rules
mlcs given
given above
above 1.
require
+count" . i.e.,
rcquire lhe
Ihc "one" constituenl
canslilucnl to
lo be interpretable
inarprclable a<;
as having the
thc feature"
feature "+counl"
ie.. to be capable
capable of
specifying a set.
X", docs
set. A
A mass
mass tcnn
term X, cxccpt
exccpl whcn
when interprclcd
inlcrprclcd as "types of
ofX",
d m nol
no1 spccify
specify a set.
r L It is
also
X" case - e.g.
alsc not
no1 open
apcn to
lo "onc"
"one" anaphora.
maphora. except
exccpl in this "types
"typcs of
olX"
eg.
lave red wine, especially ones
ancs thaL
Ulal have
have been aged properly.
properly.
I love
Wcbber
Webber
368
{hem = wines
[hem == wines
369
1138-1 Suc gavc cach boy a grccn hat
D38-1 Sue gave
each boy
a green hat
2 Usually
shc pays $8 apiccc for thcm.
2 Usually she pays $8 apiece for them.
1139-1 I saw 7 Japancsc station wagons today.
D39-1 I saw2Ihcy
7 Japanese
wagons
muststation
really bc
sellingtoday.
like hotcakes.
2111ey must really be selling like hotcakes.
Wcbbcr
Wcbbcr
370
6.6 Conclusion
6.6 Conclusion
1 am writing Illis papcr thrcc ycars aftcr cornplcting and publi'shing thc rcsults of
am writing this paper three years after completing and publfshing the results of
my thesis rescurch. Sirlcc thcn thc othcr authors rcprcscntcd in this volume have
my thesis research. Since then the other authors represented in this volume have
finislwd tllcir rcscarch as wcll. I havc tl~usbccn ablc to bcncfit from their
finished tI)eir research as well. I have tl1US been able to benefit from their
invcstigations in rcthinking my past rcscarch and co~nposingthis papcr. l have
investigations in rethinking my past research and composing this paper. I have
also bcncfittcd from their many LISC~LII
commcnts on i t
also benefitted from their many useful comments on it
What 1 llavc prcscntcd hcrc is an approach to idcntifyi~gwhat particular kinds
What J have presented here is an approach to identifyi~g what particular kinds
of noun phrascs makc available to talk about ncxt (dcfinitc anaphora) or in tcms
of noun phrases make available to t3lk about next (definite anaphora) or in tcoos
of ncxt (onc anapl~ora). In some way, this must be part of any spcakcr's
of next (one anaphoral. In some way, this must be part of any speaker's
knowledge o f the language. If s/hc wants to talk about (or in tcnns of) something
knowledge ofLhe language. Ifs/he wants to talk about (or in terms of) something
and havc thc listcncr follow, s/he'must abcy the rules prcscntcd here at least more
and have the listener follow, s/he "must obey the rules presented here at least more
oftcn than not. Sidncr also presents sjmilar rulcs, relating to sentcncc organization
often ilian not. Sidner also presents similar rules, relating to sentence organization
- some positions inviting thc infcrcnce morc than othcrs that thc associated entity
- some positions inviting ilie inference more than others that the associated entity
will bc talked about or in tcrms of ncxt. McDonald's concern is to provide these
will be talked ahout or in terms of next. McDonald's concern is to provide these
typcs of knnwlcdgc for rcal timc gcncration.
types of knowledge for real time generation.
It is cxciling to bc working in this arca, and it is my fccIing that results of
It is exciting to be working in this area, and it is my feeling that results of
J
1. I am aware that it seems pcrfcctly fine to say "You know. the one who broke his ankle?". In that
1. I am aware that it seems perfectly fine to say KYOli know. the onc who broke his ankle''', In that
case, it would sccm thal "one" is sclccling from the generic set or Jans that h e speaker prcsumes the
case, it would seem thaL onc" is selecting from the generic set of Jans that the speaker presumes the
lis~cncrlo know (and pmibly confuse). I low proper names evoke gcncric sels is an object of Furlher
li~tener to know (and possibly confuse). Ilow proper names evoke generic sets is an object of further
study.
study. 1 Work being carried out at b l t hranek and Newman Inc. and the University of Pennsylvania
2 Work being carried out at Bolt Beranek and Newman Inc. and the University of Pennsylvania.
371
BIBLIOGRAPHY
Aho.
Tlic Theory
'Ihcorv ill
of I';!rsina.
'l'ranslati~in.,md
Com~iling
Aha. A. and
aod Ullrnan,
Ullman. J. 1972 The
Parsing Tennsl"t;on
and Comoiling
VoI.
I'rcnticc-Hall.
Vol. 1. Englcwood
Englewood Cliffs. NJ: Prentice-Hall.
Alio,
119731The
').he 'Illeory
'llieorv Q[J.'illiing.
i,fl'arsi~i&'l'ranslntiiin,and
Cornoiline.
Aho. A. and Ullmnn.
Ullman. J. 119731
Transbtiim and Compiling,
Vol. 2.
Englewood Cliffs.
Vul.
2, Englcwo~id
Cliffs, NJ: Prealice-Hall.
l'rcnticc-liall.
Allcn,
lo .S'pcrch
'I'R 121179.
121/79.
Allen. J. 1979 A I'QII
I'/an Nirsed
/lased Apl~rnach
Approach 10
Speech Acl
Act R~cug~ririun.
Recognition. TR
of Compu[cr Science,
Scicncc. University
Univcrsity ofToronto.
of'l'oronto, Toronto.
I)cpartnicnt ofComputcr
!)r.:partmcnt
J.F.. and Perrault.
Analyzing inteillion
ill ullerances,
Artificial
Allen. J.I.'..
Allcn.
l'crrault. C.R.
C.11. 1980
1980 il~ral)ui~rg
i~~re~lrio~r
in
urrerairces, iZnificial
Intelligence 15(3):
Intclliecnce
15(3): 143-178.
143-178.
Austin.
J.L. 1962.
1962, How To
'1.0 l2ll'lllings
ilo lliines With Words.
Austin, 1.L.
Words. New York: Oxford Ullivcnity
University
Press.
Prcss.
w.
Baker. C.L.
Syntactic theory
the projecliuri
projection prub1r111,
problrm, 1.inauistic
Linguistic !..lliu!iI.Y
Bakcr,
C.L. 1979
1979 Sy~rlacric
~ h n , r yand rhe
10(4):
lO(4): 533-581.
Baranovsky.
1973 .SOIIIP
h~urislicsfur
of
Daranovsky. S. 1973
Some heuristics
jar Aulomaflc
Automalic Deleclion
Detection and Resolu~ion
Resolution of
A~raphora
Scicnccs. University of
Anaphara in Discourse. M.S.
M.s. thcsis.
thesis. Ucpt.
Dept. of Computer Sciences.
Texas. Austin.
Barwisc.
1. 1982 Situations and Attitudes, Cambridge,
Cambridgc, MA: Bradford
Barwise, J. and Perry. J.
Books.
Bclnap.
Stccl. 1976 The
Thc Logic
I.oeic ill
of Oueaions
Belnap. N.D.,
N.D.. and T.B. Steel.
Ouestions and Answcrs,
Answers. Yale
University Press,
Univcrsity
Prcss, New Haven.
klavcn, Conn.
Belnap. N.D.,
N.D., 1963
Analysis oj
I'reliminary Report,
1963 An
Ana/ysis
of Questions:
Quesrionr
Preliminary
Hclnap.
TM-12871000/00.
'I'M-1287/000/00. Systems Development
Dcvclopmcnr Corp.
Corp., Santa Monica, Ca..
Ca..
Bcrwick.
co~~slrainrs
Procccdinq~2f
~f
Berwick. R. and K. Wcxlcr
Wexler 1982 Parsing
I'arsing constraints
and bindirrg,
binding. Proceedings
the First West Coast
Coost Conference
thc
Confcrcncc on Formal
F~irmalLinguistics,
1.incuistics. Stanford University.
-
374
Bibliography
llibliugraphy
Berwick, ll.
Models of
Language
Ilcrwick,
R. and Weinberg,
Wcinbcrg, A,
A. 1982
1982 The
The Role ofGrammars
ofGrairrtttars in
itr Alodels
of 1,oitguage
Use,
Use, Cognition.
Connition, in press.
Bobrow, D.G.
An Overview
KRL
Knowledge
nobrow.
1l.G. and Winograd, T. 1977
1977 AII
Overvtew of
of K
R L A Kitowledge
Representation l.aflguage,
Cl~tnitivcScience. 1(1):346.
1(1):3-46.
Represerr~a~ion
Iarrguage, Cognitive
m,
Bobrow, RJ.
PSI-KLONE: l'arsitrg
Parsing and
semantic
R.J. and Webber,
Wcbbcr. BL
11.1. 1980a.
'1980a. PSI-KI.ONl?:
atrd semntlric
Hc~brow,
i1ilerprelation
in
the
BUN
natural
language
understanding
system,
Proc.
1980
it!rerpre/n/ion it! [he B l l N rrarural larrguage underrlorrdittg sysrettt,
I980
CmadiJIl
Socictv for COlllputJlional
Conrr111tational Studies
Stlrdics Q[
( ~ f Intelligence
lntclliacricc Conference.
Cdnndian Society
pp.13l-l42.
pp.131-142.
RJ.
Knowledge represemation
for
K.J. and Webber,
Wcbbcr. B.L
H.I.. 198Gb.
1980b. K~tow/edge
represeir~riotr for
processing.
Prot.
Pmc. 1st Annual
Anno;ll Conference
C11nfcrcncc ill
,F the
thc American
Amcrican
s~Jllfatic/scmantic
j ~ r l a / c / s c ~ ~ t tprucnsitrg,
~lic
I\ssociation
Aisoci;rlion fill
for Artificial
Artifici;~l InlcOigcncc,
Inlcflincncc, Stanford University,
Univcrsity, Stanford CA, pp.
316-323.
Bobrow,
nobrow,
Brachman,
A srnicrurnl
structural paradigm for
for representing
Ilrachman, R. 1978
1978 A
represerrritrg knowledge, Technical
Report
3605,
Bolt
Beranek
&
Newman
Inc..
Cambridge
MA.
I<cpon 3605.13olt Ikranck & Ncwman
Brachman,
Rrachman, R.
K. 1979
1979 0/1
011 the
ihe epistemological
episrr,rtological status
slarus of
of semantic
semarrric networks,
neiworks, in
Associative Networks. ed.
cd. N. Findler.
I-indlcr. New York: Academic Press, pp. 3-50.
3-50.
nradg.
1981 Compoter
C ~ ~ m ~ uVision,
t c r Amsterdam: North-Holland:
North-Holland.'
Brady, Michael
Micbael 1981
Brcsnan. 1.
J. 1978
1978 AA realistic
realis~ictrans!unnatioflal
rrnnsfontra~ionni grantntar
1.insuistic l11cory
111eorv S!.lliJ
and
Bresnan,
grammar in Linguistic
Hallc, J. Bresnan,
13rcsnan. G.
G. Miller,
Millcr, MIT
MI'I' Press,
Prcss, pp. 1-54.
I'svcholosical Reality. cd. M. Halle,
Psychological
m,
nronnenbcrg. W..
W., Bunt
H., Landsbergen,
Landsbcrgcn, S.,
S.. Seha,
Scha. R..
R.. Schoenmakers,
Schoenmakers. W., and
Bronnenberg,
Bun~ H.,
Utteren, E.
System P
PHLlQAI,
Van Utteren.
E 1980
1980 The
The Question-ANswering
Ques/io~~-ANsweritrg
Sysrem
H L I Q A I , in Natura!
Langoage
Systems. cd. L. llloe
l a n n u a g Ouestion
Ouestim Answering Svstcms,
Bloc Munich,
Munich. Germany: Hanser
Hanscr
Publishing.
I'ublishing.
375
Brown,
Brown. G.P.
G.P. 1980
1980 Characterizing
Characrerizi~rgindirect
indirecl speech
speech acls,
ac/$ Am.
Am, 1.:.
1, Como. Linguistics
I.inauisticp
6(3-4):
150-166.
6(3-4):150-166.
Mulliple-Body 111rcrup1s
IlIlerupls ill
Discourse Gelleratioll,
B.S.
Brown,
Brown. R.
It. H. 1973
1973 Use
Usr oj
of liul~ipk-Body
it1 Discourse
Ge~rera~ion,
Dissertation,
Massachusetts
Institute
ofTechnology.
ofl'cchnology.
1)isserration.
Bruce,
Jar lIalurallanguage,
Hruce. B.C.
I1.C. 1975
1975 Case
Case systems
sysrenrs for
~mluralIn~rguage.Technical Report
Rcport 3010, Bolt
Rolt
Beranek
Camhridge MA.
Ilcranck &
& Newman Inc.,
lnc..Camhridgc
firucc,
Plans and
social actiun,
Bruce. B.C.
B.C. 1981
1981 Plarrs
arrd sociul
ocriu~l. Thcorctic~l]
'fheoretic:~l Issues
issucs .in
in Reading
Comprehension.
cd.
R.
Spirt). B.
II. Bruce,
Ilrucc, and-W.
and W. Brewer.
Ilrca,cr. Hillsdale.
Hlllsdalc, NJ: Lawrence
lawrence
C , ~ m ~ r c h c n s icd.
~ ~ nR.
. Spiro,
Erlbaum
Associates. Publishers.
Krlbaurn Associatcs.
I'ublishers.
BurtD.n,
Semanfic gro~rl~~rnr:
grammar: AII
An ellgillecring
Ilurton, R.R.
R.K. 1976
1976 .Te~~lo~lric
engirrecri~lgtechnique
rech~rique
ffor
i r constructing
cu~~srruc~i~ig
nalura/language
systems.
~aliurollnr~guogrunderstanding
u~ldersra~ldi~rg
s)'s/cr~r$nBN
IIIIN Report
Rcport 3453.
Chafe,
Discourse Structure
Human K~ron,ledgc.
Kllawledge, Lan.ua.e
Slrucrurr and
orrd Hur~ro~r
1.anguaee
Chafe. W. 1972 Discourse
Comprehension
and the
the Acquisition
Comprchcn~it~n
&
Acattisition ill
d Knowledge
Knowlcdee ceL
cd. J.n.
J.B. Carrol and R.O.
R.O.
Freedle.
Frccdlc. Washington:
Washingu~n:Winston, ppAI-69.
pp.41-69.
Charniak, E. 1972 Toward
Mudel OJChildrcll's
Slory Comprehellsiol,
M.LT,
A.L
To~varda hlc~del
Of Childrrn'sSror)~
Corrrprehensio~r,
M.I.T. A.I.
Lab
-266, Cambridge,
l a b TR
1.R-266,
Cambridge. MA.
Chester,
oj
Fonllal
Proofs
illlo
Ellglish, i\nificial
Artificial
1976 The
The Trallslation
Tra~lsln~io~r
of F
o n ~ ~ aProo/s
l
i111o English,
Chester. I1 1976
Intelligence 7(3),
hdl&mc
7(3). pp.261-278.
Chomsky, N. 1955
1955 The Logical
l.oeical Strocture
Structure of Lingoistie
1.ineuistic lheary
ihroru reprinted
rcprintcd in part
1975, New York: Plenum.
Plcnum.
1975,
Chomsky, N. 1965
1965 Aspects
Asoccts illUle
of thc Theory
Thcarv illSyntax,
ofSvntax. Cambridge, MA: MIT Press.
Presq.
Chomsky. N. 1973
1973 COfldiliollS
Condiriu~rs0"
011Transfanna/ions.
Tronsfonnalio~ls.in !l
A Festschrift
Fcstschrift for Morris
S.I<. Anderson
Andcnon and P. Kiparsky New York: tlolt.
Halle, cd. S.R.
1I01~ Ilinchan,
Rinehart, and
Winston.
m,
1975 Reneetions
l < c f l c c t i oon
n s Language,
I.anauaec,
~
Ncw
Rntheon.
Chomsky. N. 1975
Chomsky,
New York: Pantheon.
Chomsky. N. 1976 Conditions
Conditions on Rules oj
of Grammar,
Grammar. Linguistic
1.ineuistic Analvsis
Chomsky,
Analysis
2(4):303-351,
2(4):303-351.
Chomsky,
Chomsky.
N. 1977
1977
Fonn illl.!!
m hm
ad
Q.Il
Interpretation, New
m.
York:
York:
Bibliography
376
North-Holland.
Chomsky. N. 1977
1977 all
011 u~h-frru~~en~e~~r
Svntax. ed. P. CulicoYer.
Culic~iver,A.
wh-movemell! in Formal Syntax.
Akmajian.'l:
Prcss.
Akmajian, T. Wasow, New york: Academic Press.
nn Government
Go\,crnment and
nnd Binding.
Hinding. Dordrecht:
1)ordrccht: Foris.
1981 l.cctllrcs on
Chomsky. N.
N.1981l.cclores
Faris.
tlallc, M.
hl. 1968
1968 :I!J
' l l c Soond
Sound Pattern
Pattcrn ill
of English,
Eti~lish,Ncw
New York:
Chomsky. N. and Halle.
Harper and Row.
Harpcr
Referellce lDiaries.
Tinlap-2:
'Illeoretieal
Clark. H.H. and M;~rsh;~ll
Marshall C. 1978
1978 Rcferrncr
l i n r i ~ ~in. ~ 'l'inlnn-2:
,
'llicorctical
in Natural 1.:1l1guagc
131ieui1ec Processing.
Proccssing. 7:57-63.
757-63, New York: Association for
Issucs
Issues in.
Computing Machincry
ofCompuwti~malLinguistics.
1.inguistics.
Machinery and h
thee Association ofCompuwti(Jnal
Clippinger,
Speaking with hlmry
MUIlY Tongues:
Problems in Aludcling
Modeling
Clippingcr. J. H.
tl. 1975
1975 Speaki~rg
Turigucr: '\'011/(,
SUIIIPI'roblerirs
Speakers of
Ae!ual lliscuurxc,
Discourse. in B.
Nash-Webber and R. Schank.
Si~mkrrr
of Acrual
11. Nash-Wcbbcr
Scliank. cds.. Tinl,p-I'
Tinlao-l:
'l'hcoreticill Issues in.
in Natural II.anguage
anruasc Proccssing.
Theoretical
Processing, Ncw
New York: Association for
Computing Machincry
Machinery and tl,e
Linguistics. pp.
thc Association of Computational
Cumputational I.ingoistics,
68-73.
68-73.
A computer
comnuter model ill
of
Clippingcr. J. H. 1978
1978 Mcanina
Clippinger.
Meaning and 1)iscounc:
Discourse: !l
psychoanalytic speech
Baltimore. MD:
psvchoan;~lvtic
soecch and cognition
c<~enitionHaltimore.
MI): Johns Hopkins University
University
Prcss.
Press.
Codasyl 1977
1977 Data Base
nasc Task G
r o w ill
of CODASYI.
C0I)ASYI. Prorramminc!
Codasyl
Group
Programming lanruage
Language
N.Y., 1971.
Committee Report, ACM Publications,
l'ublications, N.Y.,
1971.
Cohcn, P.R.
P.II. 1978
1978 all
O n k~roairrg
l'R 118,
118. Dept of
Cohen.
kllowillg wha1
what IU
/0 soy:
say: pla~ming
plollllillg speech acrs
act~ TR
Computer Science.
Science. U. Toronto.
Colc. P..
1.1.. 1975
1975 Syntax
Svntax i!lli!
and Semantics
Semantics. Vol. 3.;
3: Speech
Socech Acts. NY:
Cole.
p .. and Morgan. lL.
Academic Press.
Collins, A.,
A.. Drown, J.S., and larkin.
Larkin, K. 1977
Inference in rexl
text understanding,
1977 Infermce
urrderslanding.
Technical Report CSR-TR-40,
'l'cchnical
CSR-I'R-40, Center
Centcr for the
thc Study of Reading,
Reading, Champaign IL.
Cooper, W. and Paccia-Cooper.
Pacda-Cooper, I.
J. 1980
1980 SvnU!
Svntax ill!Jl
and
Harvard University Press.
Prcss.
~. Cambridge,
&.K.&.
Cambridge. MA:
377
Fikcs. R.E.. and Nilsson. N.J. 1971 STRIPS: A tlrnj~lpproachro rhe applicarion of
Fikes, R.E., and Nilsson, NJ.1971 STRIPS; A new approach to the applica/ion of
~hcoret~t
provi~lgto problctt~s;olaitlg. Artificial Intcllincncc 2:89-205.
theorelll provillg to problem soll'l'ng. Artificial Intelligence 2:89-205.
Fillmore. C. 1968 The casefir case, in Universals i11 1-in~uisric'Theory, ed. E.. nach
Fillmore, C. 1968 The caseJilr case, in Universals in Linguistic '[neary, cd. E. Bach
and G'. Hams, New York: Holt, Ilinchart. and Winston, pp. 1-90.
and G. Hanns, New York: Holt, Rinehart. and Winston. pp. 1-90.
Fi tlmnrc. C.F. 1971 I,ecrurcs o n .Deixis, Unpublished manuscript, 1,inguistics
Fillmore. c.F. 1971 Lectures on Deixis, Unpublished manuscript, Linguistics
D;partrncnt, Univcnity of California, Bcrkclcy.
I\:partment, University of California, Berkeley.
Fodor, J. 1975&J I .anmaec oflhouaht. Ncw York: Thomas Crowcll.
Fodor. J. 1975111e l.anguage ofThought, New York: Thomas Crowell.
Fodor, J. 1982 The Modularity 1)f Mind, unpublished manuscript.
Fodor, J.1982 The Modularity illMind, unpublished manuscript.
Bibliography
378
Gcrritscn, Rob, 1978 S1:'FD Referettce Atanual, Vcrsion COO B04 draft,
Gerritsen, Rob, 1978 SE/:'D Reference Manual, Version COO
B04 draft,
lntcrnational Data Iiasc Systcrns, Inc., l'hiladclphia, Pa., 19104.
Intcrnational Data Ba5e Systems, Inc., Philadelphia, Pa. 19104.
Goldman. N.M. 1974 Co~~lpuler
Get~eru/iotiof Nurural Language fiotn a Deep
Goldman, N. M. ]974 Computer Generation of Natural Language from a Deep
Ca~~ceprual
Base, I't1.1). 1)isscrtation. Sun ford University.
CO/laptual Base, Ph.D. Dissertation, Stanford Univcrsity.
Gordon, I)., and 1.akoff. G. 1975 Cotlvcrsa/ionol posrulflres in Svntax and
Gordon, D., and l.akoff, G. 1975 Conversational postulatl's, in Syntax and
Scinnntics, Vol. 3; Snccch W. cd. 1'. C d c and J.1.. Morgan. Ncw York:
Semantics. Vol. J; Snmh .1ill. cd. P. Cofe and 1.1.. Morgan. New York:
Acadcmic Prcss. pp.83-106.
Academ ic Press. pp.. 83-1 06.
Goudgc, '1. 1969, 'l'lic 'I'hou~htufC.S. I'circc. New York: 13ovcr Publications.
Goudge. T. 1969, The ThoughtillC.S. Peirce, New York: Dover Publications.
Svil[nx and Scrnantics, Vol. 1;Spcech
Gricc, H.P. 1975 I.ogic- o11d cor~~lo:~fltiu~~.
Grice. H.P. 1975 I,og;c alld COllversatioll. Syntnx and Semantics, Vol. J..;, Speech
P.
Colc
and
].I_.
Morgan.
Ncw
York:
Acadcmic Prcss, pp. 41-58.
Acrs.
cd.
Acts. cd.
P. Cole and 1.L Morgan. New York: Academic Press, pp. 41-58.
Grinder, J.. and Postal, P. 1971 hiissing atitecedct?ts, 1.in~uistic Inauiry
Grinder, 1.. and Postal. P. 1971 Missing antecedellts. Linguistic !.lliI..Yl.n:.
2(3):269-312.
2(3):269-3]2.
379
Press.
Hankamer.
Deep and
Ilankamcr, J.,
J.. and Sag,
Sag. I.1. 1976 ilrep
arrd surface
surfncr anapllOra.
arrallhora, Linguistic
1.ineuistic Inquiry
7(3):391-426.
7(3):391-426.
Harris. I..
R. December.
RO/lOT.~)'stelll:
Natura/l.anguage Processing
Pracessing
I.. R..
I)cccmbcr. 1978 The
The ROIIOT
Sysrerrr: N~rruralI.arrguoge
Applied ta Uara
/Jala nusr
Base Query.
Aplrliedru
Query. in Proceedings
Procccdines of
u f the
h c ACM 78. Washington D.C.
Hendrix,
language processing.
Hcndrix. G.
G. 1977 llumon
ilurrrarr engi1lrering
errgineerirrg ffor
i r applied natural
rrarural brrguoge
Proc.
l'nic. 5th Inh
Int. J.L
J t , Conf. on Artificial
Artificial Intelligence,
lntclliacnc~
Hendrix.
Hcndrix. G.
G. G.
G.. E. D.
I). Sacerdoti,
c c r d o t i D.
I Sagalowicz.
Sngalowicz. and J. Slocum.
Sl(xum. June,
Junc, 1978.
1978.
IJeI'eluping a Narriral
Natllra/l.anguage
InterIaee tu
Data. hCM
ACM Transactions
IIei~eIopi~~g
Larrgunge Irrrerface
lo COII/plex
(bnrl~lexi)oru.
'l'ranwctionp QJl
pl!
Dotabase
No.2.
Ijatabasc Systems.
Svstcm, Vol. 3. No.
2.
m,
Higginbotham,
Pronouns and
Bound Variables.
Higginhotham. 1.
J. 1980 Pronourrs
arrd noutld
i'uriables. Linguistic
l.inai~isticInquiry, 11(4).
Il(4).
pp. 679-708.
679-708.
Hintikka.
Belief. Ithaca. New York: Cornell
Ilintikka. J. 1963
1903 Knowledge
Knowlcdac and l%clicf.
Corncll University
Press.
Hobbs, J. R. 1976 Prorrvun
Pronoun Resolu~iorr,
Resolution. Rcscarch
Research Report #76-1,
#76-1, City College,
City University
Univcrsity of New York. New
Ncw York.
Hobbs,
Examp/es oj
Elusive At~receder~ls
Antecedents porn
from Published Texis,
Hobbs. J. R. 1977 38 Exantples
of l7lusive
Texts,
Rcport #76-2.
#76-2, City College.
Collcge, City University
Univcrsity of Ncw
York: New York.
Research Report
New York:
Hobbs. J. R. 1979 Cohamce and Co-reJerence. Cognitive Science. 3(1): 67-90.
and Answers,
Answers. Ph.D.
Bibliography
Bibliography
380
Isner, I).
D. W. 1975
"Understanding"
Through
Representation and
Isncr.
1975 Understanding
Undrrsro~~di~ig
"Und~rs~andii~g"
Through Rrpresen~otion
Reasoning, Ph.1).
Ph.D. dissertation.
Pittsburgh. 1)cpartmcnt
Department of
Rwso~rirrg.
disscrotion. University of I'iwburgh.
o f Computer
Science.
Jackcndoff, 1
1972 Scmnntic
Grammar. Cambridge.
Cambridge.
R.1. 1972
Semantic lntcmrctati~m
Interpretation hGcnerativc
in Generative Grammar,
Ma: M
MIT Press.
Joshi. A. K. and
and R. Wcischcdcl,
1977 Compulation
Co~~ipu~aliotr
of a suhcloss
of inferences:
itrfrretrces:
Weischedel, 1977
ofa
subclass of
l'rrsupl~osiliu~i
J. Q[Comp
of C ~ m n .I-inguistics.
I.ineuistic$.
PresuppositiO/l and l~~rroilrner~l.
/;"111ailmCf11. in
in Ani,
Am. L
firesnan, ]981
Lexical-fulIctjollal Grammar:
fonnal System fur
for
Kaplan. R.
I<.and
and J. Brcsn:~n,
1981 L.exirol-fuircrioi~ol
Groiirr~rar:.A fur~~rolSys~oii
Gnirn~~iorical
Repr~sertruriuri.M
1.1' Center
Cc~itcrfor Cognitive
Cog~iitivcScience
Scic~iccOccasional
Occasional Paper
Gmmmmicaf
RepreSenlaliull,
MIT
#13.
#13.
Kaplan, S.L
Dij)/orellce
Natural I.o,,guog~
J.onguage and
High revel
Kaplan.
S.J.. 1978
1978 On
0 1 1 rthe
l i D~$jiirr~rcr
~
R
IBelll'{,{>11
PP
I PI,
Nnrurnl
o,ld High
/.eve/
Quev I.mrguogr~.
I'rucccdincs of
nftlic
Washington D.C.
Query
I.al/guages. in
in Proceedings
the ACM 1978. Washington
Kaplan.
1979a Cooperati"e
Cooperurir'e Rr.~potisrs
1.anguage
Kaplan. S. J.,
J.. 1979a
RespollSes froin
from ao Porroblc
Portable Nurural
Natural l.anguage
Ilorn
Ouerj SysIe111,
1'h.l). I)isser(~tion.
as
Dala Rase
Rase Query
Syslel1l. Ph.D,
Disscrt<'.ltion. U.
U. of Pennsylvania,
Pennsylvania, available ,as
Stanford Heoristic
ticuristic Programming
Prtrgramnri~igProject
I'n~jcct Rcport
Co~nputcrScience
Science
Stanford
Report HPP-79-19.
HPP-79-19. Computer
I)cpartmcnr,
Stanford Univcrsity,
Stanford. Ca. 94305. July. 1979.
Department. Stanford
University. Stanford.
Kaplan. SJ..
Joshi. 1979
A Techniquefor
Alanaging the
Kaplan.
S.J., F..Mays.
E.Mays, and
and AX
A.K. Joshi,
1979 A
Techniquefur ilatrngirig
rhe Lexicon
iin
ti a
o Narurol
latrguogc 111ter~h-e
lo a Changing
Chotrgilrg Dora
Narurall.anguagc
liller/ace 10
Data Rosr,
Base, in
in Pri~cccdinep
Proceedings of&
of ~
IJCAI
1979,'rokyo.
!JeAI
1979.
Tokyo.
Japan.
-Kantuncn,
I..1976
1976 Discourse
in Svntax
and Semantics.
Scrnantics, vol. 7.
7.. ed. 1
Kantunen. L.
Discourse refire~~/~,
referelils. in
Syntax and
J..
McCawlcy.
Prcss, pp. 363386.
363-386.
McCawley. New York: Acadcrnic
Academic Press.
Karttuncn,
L.. 1977
Presupposition and Li~rguistic
Linguistic COlltext,
Qfi n1JK
Kantuncn, L..
1977 Prcsupposirion
Corllext, in.
i n Proceedings
P r ~ ~ c c c def&g
~
Prcsn~~osirion.
lmnlicatur~. (A.
Texas Confcrcncc
Conference ill! I'crfiirmativcs.
]>crformativcs Presupposition
i!llii Implicature.
Ilogcrs.
13. Wall,
and J. P. Murphy. Ed.).
Ccntcr for Appliid
12inguistics,
Rogers. II.
Wall. and
Ed.). Center
Applie'd Linguistics.
Arli~igton.
Arlington. Va.
381
Rinchart, and Winston, N.Y,pp. 45-52.
Keenan, E.L., and Hull, RD., 1973 The !.ogical Presuppositions ofQuestions and
Atiswen. in Prasu~~ositioncn
h Philosonliic und I.innuistik, (Pclofi and Frank,
Answers. in Prasuppositioncn in Philosophic und Lingllistik, (Petofi and Frank,
Ed.), Atlicnaum Vcrlag, I'rankhn.
Ed.), tl.thenaum Verlag, Frankfurt.
Krauwcr, S. and I.. dcs Tombcs 1980 Fitlire slare ~rurtsdziccrsas a rflodel oJ(he
Krauwcr, S. and L. des Tombes 1980 finile slate lrallsducers as a model of the
Iut~guagcpmccssor lJ~rccl~tWorking l'a~crsin I .inn~~istics.
no. 19
language processur Utrecht Working Papers in Linguistics. no. 19
LakoiT, R. 1974 Ret~zarksU I I This atid Tho!, P r c z c c d i n ~~f fie 'I'cntli licaional
Lakoff. R. 1974 Remarks 011 This and That. Proceedings ill1!:l Tenth Regional
Mcctin~of thc Chicago I-innuistic Socicty. cd. 1-aGaly,Fox and Ruck. Chicago:
Meetin\! of the Chicago Linguistic Society, cd. LaGaly,Fox and Ruck. Chicago:
Chicago 1,inguistics Socicty, lCk345-356.
Chicago Linguistics Society. H}:345356.
lashlcy, K. 1951 The Problc~nof Serial Order it) Beha~lior,in ],.A. Jeffrcss (ed.)
Lashley. K. 1951 The Problem of Serial Order in Eehm'ior, in L.A. Jeffress (cd.)
Ccrcbral Mcchanisrns I3cllavior: 'lhc ljixon Svm~osium.Ncw York: Wiley.
Cerebr<11 Mechanisms in Behavior: The Hixon Symposium. New York: Wiley.
Lasnik, H. 1976 Remarks on C'o-ref2ret1ce,1.inalistic Analvsis. 2(1):1-22.
Lasnik, H. 1976 Remarks on C(Y reference, Linguistic Analysis, 2(1): 1-22.
1 , c h n c ~W. 1977a Humat] at~dcotnpu~u~ional
yue~lioa atlswrritlg, Cnanilive
Lehnen. W. 1977a Human and computational questiun answering, Cognitive
Scicncc ](I). pp. 47-73.
Science 1m, pp. 4773.
I.chncrt, W., 1977 The I'rixess cfQucsrioti Atrsweri~lg,'l'li 88, Ilcpt. of Computer
Lehnert, W., 1977 The /'roass o!Ques(ion Answering, TR 88, Dept of Compuler
Scicncc, Yalc University.
Science. Yale University.
pf
Bibliography
382
1.ibernian.
1967 Intonation.
Cambridge, MA: Mn
MI'I
Libemlan, P. 1967
Inlonation. I'ercc~tion.
Perception. and lanaoane.
Language. Canibridge,
Press.
I.ightfoot,
Hornstcin, N. 1981
1981 Exolanation
I.inauistics, 11Jndon:
lnndon:
Lightfoot, I).
D. and Hornstein.
Explanation ill Linguistics,
langrnan.
l.ongman.
1). 1978
1978 COf/textual
Lb~trrxr~ral
Refirenre
irt Nnrurnl
1.ockman.
Lockman, A. D.
Reference Rcsolurion
Resolution ill
Natural Ignguage
l..{lnguage
Proassing, I)cpt.
Dept. of Computer Science
Rutgers University, New
Scicncc TR-70.
'IR-70. Rutgcrs
I'rocccsirtg.
Hrunswick.
Brunswick, N.1.
N.J.
Imia, S. E. 1975
Thirty-,ix I.eetures
in Biology.
I.uria.
1975'l'hirtv-six
l.cct~~rcs
Hiolonv. Cambridge,
Carnbridgc, MA: MIT
MII' Press.
Prcss.
Mann, W. C"
C.. Bates.
Iiatcs. M"
M.. GroSI,
Grosf. B.
H. J..
Mcllonald. D.
I). I)..
McKcown, K. R. and
J" McDonald.
D" McKeown.
prcss. The
The Srarr
Tex/ Gefleration.
Getleroriun. Asrericon
f
Swartout. I. in press.
.)'tafe ofrhr
of/he Arr
Art irr
in Text
American .lourt~al
Journal u
of
Computationall.inguistics.
Co~ttpu~aric~rtoll.biguisrics.
Mann, W. C. and M~oorc,
Moore, J. 1981
Multiparagraph Text.
Mann.
1981 Computer
Cotttpurer GenPration
Getlrroriotr of
of hlulliparr~graph
Texr.
American Jourtxil
Journal o
ofCompulativnal
Unguis/ies,
1), pp. 17-29.
17~29.
Arrlerican
f C'o~rrpumriu~~nl
/.irrguis/ics,7(
7(1).
Marcus. M. 1980
1980 f!
A TIlcory
l l i m r v !!f
of Syntactic
Svntactic Recognition
Rccoenition for Natural L,mguagc.
1.aneuag.
Cambridgc:
Cambridge: MI'I
M IT Prcss.
Press.
Project. Technical Report, Information
1981 The
The Consul Projrcr.
information Sciences
Mark, W. 1981
Institute,
Institute. Marina Dcl
Del Ray, California.
Martin, W. 1979
Role~ Co-Descriptors
FomlOl Represerr/arion
Representation of
Martin.
1979 Roles
Co-Dcscriprors and the
rhe Fomtal
of
Quol/tified
Expression~ Laboratory for Computer
Qunrlrijied English Expressions
Cvmputcr Science, TM-139,
'I'M-139,
M.LT., Cambridge, MA.
M.I.T.,
The Grammar o
offQuantification,
dissertation, MIT
May, R. 1977
1977 ihc
Ouantification, PhD disscrtation,
])epanment of Linguistics.
Department
McAllester, D. 1\.
Mail/tenance Sysrnn.
,~)stem. MIT Al
AI Lab
McAllcstcr,
A. 1978
1978 A Three
T h r e ~Valued
Valued Truth
Tmrh Alairl/ma~rce
Mcmo
473, Cambridge, MA.
Memo 473.
McDonald. D.M. 1977
Subsequent reference:
reference: synracric
syntactic artd
al/d rherorical
rhetorical constrain
Is,
Mcl~onald.
1977 Subsequent
consrroinls,
Processing. 7:64-72,
in Tinlap-2: Theoretipl
Thcorctical ~ ill
in ~
Natural l.anguage
l a n e u a ~hxm,hg.
7:H-72, New
Assuciation of Computational
York: Association fur Computing
Cornpuling Machinery and Association
Linguistics.
I.anauaae Generation 3
a Er=
Process ~f
McDonald D. D. 1980a Natural
Natural l.anguage
~ ~
J1f
383
Morgan,
. L. 1978 Toward a Rational Model of Discovrse C m p d ~ *
Morgan,
J. L. 1
1918
Toward a Ratiotull Model of DiscOll~ Com~nsion.
lhcorctical
in Natural
7:109-114, New
IinlaD-2: lbcorcticaJ ~ m&1uW 'ADgyAee Processiua. 7:109-114, New
Yo&:
Association for Computing Machinery and Association of ComputaW
Yorl: Association for Computing Machinery and Association of Computational
Linguistic&
Unguistics.
Nash-Webbcr B. and Schank R. 1975, I'inlan-1: Thcorctical Issucs in Natural
Nash-Webbcr B. and Schank R. ]975. Tinlap-l: Theoretical Issues in Natural
Tanguaec Processing, New York: Association for Computing Machinery and the
Language Processing. New York: Association for Computing Machinery and the
Bibliography
384
Solvine Methods
Mclhods in Artificial
Artificial Intelligence.
lntclliecncc. Ncw
Nilsson,
1971 Prohlcm
Nilsson. N.J. 1971
Prohlem Solving
New York:
McGraw-HilI.
McGraw-14ill.
Nilsson,
1980 I'rincinlcs
l'ioga Press.
Nilsson. N.J. 1980
Principles of Artificial
Artificial lntclliecncc
Intelligence I'alo
Palo Alto.CA:
AltoCA: Tioga
Partcc.
1%.ti.
1972 Opacit)'.
Opacirl: Co-referenre
Cu-rcfcrrrrrr arid
of ~
atural
Partee. II.
H. 1972
and Pmrluur~s
Pronou",; Semantics ill
Natural
I.aneuaec
I). Davidson
I)a\,idwn and G.
G . tiarman.
415-441.
I.anguage cd. D.
Harman. Iloston:
Iloston: I(cidcl.
Reidel. pp. 41
5-441.
Partcc.
B. H. 1978
1978 Mourld
Olher Arra/~hors
'lhcorctical
Partee. II.
Bound Variables orid
and Other
Anaph()r~ 'rinlnn-2:
Tinl,p-2: Theoretical
Iwl~cs
in
N;~tl~ral
I.aneuazc
I'rc~ccssing,
7:79-85,
Ncw
York:
Association
Issues in Natur<ll Language Processing,
New
Association for
Cornputati[mal
Linguistics.
Computing Mechincry
Machinery and Association of Com
pula tiona1Linguistics.
I'ostal. P.
P. M. in preparation. Relational Grammar.
Pcrlmuttcr,
Perlmutter. I).
D. M. and Postal.
R., Allcn.
atid Cohen.
Cohcn, P.R. 1978'
1978'Sprech
Pcrrault.
Allen. J.F..
J.I'.. and
Speech acts as a basis for
for
Perrault. C. R.
undrrsranding
cohrrmcr. Tinlap-2:
'l'inlan-2: Theoretical
'I'licorctical lssucs
in Natural
understanding dialogue coherence,
Issues ill
I.aneuaec
I'roccssine, 7: Ncw
Language ProL:cssing.
New York: Asa~cialion
Association fur
for Computing Machinery and
Association of Computational
Computational Linguistics.
L.inguistics, pp. 125-132.
125-132.
Pcrrault,
Allcn. J.F. 1980
1980 A plan-basedanalysis
of indirecr speech acls
L
Perrault, C. R.. and Allen,
plan-based anal)'sis ofindirect
aCI~ 1.
ASSOC,
C m , Linguistics
l.ineuistics 6(3),
6(3),pp. 167-182,
167-182.
Assoc, ~
Pcrrault,
Cohcn. P. 1981
1981 11's
ow11 good,
irouns
Perrault, C. It..
R.. and Cohen.
It's for
for your own
good, Elcmcnts
Elements of D
Discourse
Undcrstandinp,
Joshi, 1l.L.
H.L. Webber
Webbcr and I.A.Sag.
I.A. Sag. New York:
Understanding. eds. A.K. Joshi.
I'rcss, pp.217-230.
Cambridge Univcrsity
University Press.
pp.217230.
1969 Arrophuric
l=ifth Regional
Ilcaional Meeung
Mccttnn 2f~
o f b
R~stal.
Postal. P. 1969
Anaphoric Islands.
Islands. l'ancrs
Papers from thc
the Fifth
Chicaeo
Ilinnick. Davison,
I)avisan, Green
Grccn and Morgan.
Chicago I.ineui?lic
I.inguistic m,
Society, cd. Binniek,
Morgan. Chicago:
Scricty. 5;205239.
5:205-239.
Chicago 1.inguistic
Linguistic Society.
Prince. E..
8, 1978
1978 A Comparisoll
Conlparison of
of Wh-clefts
M'h-cleJs alld
and 11-clejls
I.aneuap,
It'clefts in Discourse.
Discourse, '-angllage,
54, Vo1.4.
Rcichman.
1978 COnl'ersalional
Con~,ersolionalcoherency.
S c h x . 2(4):283-327,
2(4):283-327.
Reichman, R. 1978
coherency, Coenitivc
Cognitive~,
1976 The
The Synracric
of Anaphorq
Reinhan,
Reinhart, T. 1976
Syntactic Domain of
Anaphoro. Ph.D. dissertation,
dissertation,
385
w,H.
Kciter, R.. 1978 On Cluscr! World Data fluses, in I.onic and Ilata
Reiter. R.. 1978 On Clused World Data Bases, in Logic and Data Bases, H.
Gallairc and J. Minkcr (Eds.). I'lcnum Prcss. N.Y.
Gallaire and 1. Minker (Eds.). Plenum Press, N.Y.
Rcsnick and Halliday 1966 Phvsics: Pan 1, New York: John Wiley.
Resnick and Halliday 1966 Physics: Part 1, New York: John Wiley.
Bibliography
386
w,
Schiffcr, S.R.
S.R. 1972
1972 Meaning., London:
London: Oxford Univcrsity
Prcss.
Schiffer,
University Press.
Goodson. J.I..
1979 The
The plan
recognition
Schmidt. C.F..
C.F.. Sridharan,
Sridharan, N.S..
Schmidt,
N.S., and Goodson,
J.L. 1979
plall recogllitioll
proble~n:
inlerseclion o
f arlijcial
inlelligence alld
and psycholog:
problem: An
All illtersectioll
of
artificial ill/elligence
psycholog)~ Artificial
Intelligence
45-83.
Intclliecnc~9(1), pp. 45-83.
R.. 1969
1969 Speech
Snccch Acts
Acts. an Essay ill
in thc
Pliilosoohv of
Scarlc. J. R.,
the Philosophy
of i.anaua%.
Language. New
Searle,
York: Cambridgc
Prcss.
Cambridge Uni\.crsity
University Press.
Svntax and Semantics
Semantic$ Vo1.
3: Speech
Soecch
Scarlc, J.R. 1975
1975 Indirecl
Searle,
Indirect speech acls
acts. in Syntax
Vol. 1;
Acts.
P.
Colc
J.I..
Morgan.
Ncw
York:
Acadcmic
Prcss,
pp.
Acts,
cd.
Cole
and
New
Academic
Press,
59-82.
Sliapiro. S. C. ]975
1975 Gel/frat/on
Gei~eralioflas Porsh~g/,o~s
~ l w o r kinto
irrto a Lir~ear
Shapiro.
Parsing from a N
Network
Linear String.
A~~rericar~
C ( ~ ~ ~ ~ p u l a llirguis~ics.
idr~ai
Microfiche 35.
American Jourf~aluf
Journal ojCol/1pulaliullall.inguistics,
Microfiche
Sliattuct-Hufnagcl 1979
1979 Senlolce
rlrodel based on speech
errors
Shattuck-Hufnagel
Sentence produdion:
production: a model
speech e"ors
Hillsdale, NJ:
E. Walkcr,
Walker, Hillsdale,
lawrcncc
Erlbaum, pp. 295-342.
Lawrence Erlbaom,
295-342.
polrrnrs.
Scntcncc Processing
Proccscing cd. W. Cooper
Coopcr and
pllllerll5, in Sentence
m.
1976 Computer
Coni~utcr lIased
Raccd Mcd~cal
C~,nsultations: MYCIN.
Shortl~ffc. E. H. 1976
Medical Consultations:
Shortliffe,
Amstcrdam.'lhc
o l l a Inc.'
Inc.
d
Amsterdam, The Ncthcrlands:
NeOmlands: I'.lscvicr
Elsevier North ~
Holland
C.1.. 1978
1978 The
The Use
Use of
of Focus as a Tool forDisa~flbiguorio,r
ofDefinile Noun
Sidncr, C.L.
Sidner,
for Disambiguatioll ofDefinite
Phrases,
ll~corcticalIssues
lssucs ill
in Natural I.anguage
1.aneuanc Processing,
Proccssing. 7:86-95,
7:Sb-95.
Phras", l'inlao-2:
Tinlap-2: neorctical
New York:
Yurko ACM and ACL.
C. 1979
1979 Towards
Towards a cor?~pulationol
of dejinile
Sldncr. C.
computational iheory
theory of
definite anaphora
anaphora
Sidncr,
cornprehension
m Clgllsh
discourse, Technical
Tcchnical Report
Rcport 537. MII'
comprehension in
English discourse,
MIT Artificial
Intelligence Laboratory.
Laboratory, Cambridge MA.
lnlclligcnce
Silverman. H. 1975
1975 A Digilalb
Therapy Advisor, Technical
Tcchnical Rcport
Digilalis Therapy
Report 143, Project
Silverman,
Massachu~etts.
MAC, Massachusetts
Institute
of
Technology,
Cambridge,
Massachusctts lnstitulc ofTcchnology, Cambridge. Massachusetts.
Shxurn. I.
1972 Generating
Generarir~gEnglish discourse
discoursefrorn
Simmons. R. F. and Slocum,
Simmons.
J. 1972
from semantic
rrcrworks
C o m m u t ~ i c a till
i ~lh
tlic
~ nM:M
ACM
~ ~ f 15,
15. 10,
10,891-905.
Iletworks. Commuoications
891-905.
Solan, I..
Anaphora in Child language.
Language, Ph.D. dissert.1tion,
Dept. of
1.. 1978
1978 Anaphura
disscrtation. Dcpt.
1.inguisrics.
Linguistics, Univcrsity
University of Massachusctts,
Massachusetts, Amhcrsf
Amherst, MA (available through the
Graduate Linguistic Student
Graduatc
Studcnt Association, University
Univcrsity of Massachusetts).
Massachusetts).
Johnson-Laird P. N.,
June. 1976
1976 A Pmgramma~ic
Steedman, M. I.,
Steedman,
J" and Johnson-Laird,
N., June,
Programmatic Theory
387
of
of Linguistic
Lirrguislic
Perfom1al1ce,
Pefomiarice,
presented
prcscntcd at
the Stirling
thc
Conference on
Confercnce
Psycho]
ingu ist ics.
Psvcholineuistics.
Stenning,
pragmatics, in 1.ineuistic
l.inguistic Theory
K.. 1978
1978 AI/aphara
Arlaphora as all
a11 approach
approach 10
lo progrrlalics
Stcnning. K.,
and
Psychological
Reality,
cds.
M.
Halle,
J,
Bresnan,
and
G.
Miller.
and Psvclioloeical
Hallc. J. IIrcsnan,
G . Millcr. Cambridge:
MIT
MIT press,
press.
m,
Strawson,
and
ill sprech
spcech oaels.
Philosoohy of
of
Strawson. P.F.
P.F. 1971
1971 Intentioll
Irrr~rrrior~
arsi cOI1l'ClItion
corn,err/iur~in
c ~ sin &'1l1C
?
I'hilosoohy
Language,
Uni,ersity 1'
I'ress:pp,
23-38,
I.anauaec, cd.
cd. lR,
J.R. Searle
Scarlc New
Ncw York: O'ford
Oxford Uni\crsity
rcss:pp. 23-38.
Swartout,
Advisur vvirh
lVilh lHxp/(maliolls.
Technical
Report
SwartouL I.1. ]977
1977 A Digitalis
1)igirolis Thtrapy
Tlrrrapy Advisor
~ x p i ~ n ~ u r i'l'cchnical
ur~s
Rcport
l.aboratory
Institute of
of Tcchnolugy.
Technology,
1.aburatory for
fur C",nputer
Co~nputcr Science,
Scicncc. Massachusetts
Massachusctls Institutc
Cambridge,
Carnbridgc. Massachusetts.
Massachusctls.
Swartout.
alld
}uslijicatiol1S
of Expert Cwrsul~ing
Consulting
Swartout. I.1. 1981
1981 Produ<'ing
I'roiiu<.brgE'(planat;olls
~.~pIa~ro~
orrd
i ~ Jus/r~coriorrs
~
rrr
,f,fxperI
Programs,
Computer Scicnce.
Science,
Pmgran~s. Technicai
l'cchnical Report
llcport 251,
251. Laboratory
I.aboratory for Cornputcr
Massachusctls Institute
lnstilutc ofTcchnulogy,
of'l'ccl~~~~~lugy.
Carnbridgc. Mas~rhuxtts.
MassachusetLS
Cambridge.
MaSSi.lChusctts.
Thomas,
asking, 'I'cchnical
Technic"l rcpurt
report 11C5866.
RC5866,
Thomas. J. e.,
C.. 1976
1976 Qualltifiers
Quorrrfiers alld
arid questioll
yursriorr askirig,
IIIM 'fhomas
'Thomas J,I. Watson Research
Ilcscarch Ceoter.
Ccntcr.
111M
I.cckcrnan. Il.H,
H.H. I)ustcrL
'l'hornpson. 1'.1'"
F.P.. P.e.
P.C. Lockcman,
Thompson,
J)oster~ R. I)cvcrill,
DeYerill, 1969 REL:
REL: AA
Rapidly EXlensible
Ex~ensihleI.angllage
1.arrguage System,
S~,srorl,in Procccdina
F thc
Rapidly
Proceedings ,ill
tlle 24th ACM
Nnriunal Conference,
Confcrcng. N.Y.,
N.Y.. N.Y.
N.Y.
National
Van Lehn,
Lchn. K. A. 1978
1978 Detennillillg
Derennirrir~gthe
/he Scope uffirrglish
Van
oj r:l/glish Quorrlijiers,
Qual/lifiers. AI-1'R-483,
AI-TR-483.
Artificial
Institute of
of Technology,
Technology.
Artificial Intelligence
Intelligence Laboratory.
Laborao~ry, Mass;,chusetts
Mass;rchusctts lnstitutc
Cambridge, MA.
Cambridge.
Walker, D.
D. E,
E. 1976
1976 Speech
Speerh UI/derstal/ding
U~rdersla~~ding
Research,
Walker.
Research, Final Technical Report,
Report,
Projcct 4762.
4762. Artificial
Artificial Intelligence
lnlclligcnce Center.
Ccntcr. Stanford
Sunford Rcscarch
Project
Research Insritute.
Institute, Menlo
Park. CA.
CA.
Park.
Walk D.
D. 1..,
I.., July. 1978
1978 All
An Ellglish
Fnglish l.anguage
1.anguoge Queslion Answrirlg
Waltz.
Answering Sysrcrn
System for
Jar a
large Relatiollal
Relarionol Database,
Da~ahosc,Communications
Communic;~tionsof ihc
o c . Corno.
Large
the aAssoc,
Compo Mach.,
Mach.. 21(7),
21(7).
pp. 526-539.
pp.526-539.
Wasow, T. 1977
1977 TransJonnations
Tronsfinna~ior~s
and the
/he Lexicon,
Wasow.
LexiCfJll. in Formal
Fonnal Svntax,
Syntax. ed.
cd. P.
Culicovcr,A. Akmajian. and
and'r.
Wasow, Ncw
CulieDYer,
T. WasDw.
New York: Academic Press.
Wcbber, B. L., 1978a
1978a A Fannal
Formal Approach 10
lo Discourse Anophora,
Webber.
AllapllOra. Technical
Tcchnical
Bibliography
388
Annphora in
Wilcnsky. li. 1978 1,ircusillg it1 rhe Co~rrprclrensrorr of U~Jir~ire
Wilensky, R. 1978 Focusing in Ihe Comprehension of Definite Anaphora in
Urrtlers~ot~Jirrg
Goc~l-lhtwdS/orios,lJh.13. Lhcsis. Yalc Univcrsity.
Understanding Gllill IJiI5l'd Siorics. Ph,D. thesis. Yale University.
I . i ~ i ~ u i sl~lai~iry,
~ic
8(1), pp. 101-139.
Williams, F.. 1977 I)iscoursea~rJlugi~~alfi)~~~,
Williams, E.1977 fJiscollrsealld logicalform, Linguistic 1.llilli.iJ:y, 8(1), pp. 101-139.
Willams, F,. 1980 I'redict7liui1,1.in~~listic
Inquiry. 11(1), pp.203-238.
Wiliams, E. 1980 Predication. I.inguistic Inquiry, ll(l), pp.203-238.
Winograd, '1'. 1972 Undctstandinp Natural I.aneuaec. Ncw York: Acadcmic Press.
Winograd, '1'.1972 Understanding Natural Language. New York: Academic Press.
Winston, P. H. 1980 l.canring and Reasorritrg by Analog)#:ihe details AIM-520,
Winston, P. H. 1980 I.earning and Reasoning by Analogy: the details, AIM-520,
Artificial lntelligcncc laboratory, Massachusctls Instituie of Tcchnology,
Artificial Intelligence Laboratory, Massachusclts Institute of Technology,
Cambridge. Massachusetts.
Cambridge, Massachusctts.
Woods, W. 1967 The Ser~ranticsfor a Question Rt~~weri~ig
syslenl, Ph.D.
Woods, W. 1967 The Semantics for a Question Answering system, Ph.D.
disscrtation, Harvard Univcrsity. Rcport NSF-19, Aikcn Computation
disscrtation, Harvard Univcrsity, Report NSF'19, Aiken Computation
laboratory, Iiarvard Univcrsity. (rcprintcd by Garland Publishing. 1979)
Laboratory, Harvard University. (rcprinted by Garland Publishing. 1979)
Woods. W.A. 1970 Transirio~tnelwork gron~ntarsfir t~oiurallanguage analysi~
Woods. W.A. 1970 Transilion network grammars for nalural language analysis,
Comm. Assnc. Corno. Mach. 13(10), pp. 591-606.
Comm. Assoc. Compo Much, 13(10), pp. 591-606.
389
Woolf, \I. 1957 A Room of One's Own. Ncw York: Harcourt, Rrace and
Woolf, V. 1957 !1 Room of One's Own. New York: Harcourt, Brace and
Jovanovich.
Jovanovich.
Index
In 117-120
focus. 283
283
Actor focus.
52.77,107.108,
110,112-113,
116-117, 118,119, 120-125,127
52.
77.107-108.110,
112-113,116-117,118,119,120-125,127
i111dindirect
indirect speech
spcccll acts,
BC~S,
148-150.155
and
148-150,
155
spcccli acts,
acts, 135-136
135-136
and speech
Agent
Agen~
69. 76,
76. 77.
77. 80
SO
Allernative, 69,
Alternatiye,
meanings and
and logical
logicill foml,
fornl, 43,
43.73-74
meanings
73-74
null alternative,
alteroatire, 135
135
ill planning,
planning. 113.
113. 122,
122.126.128-129,130.
131
in
126.128-129,130.131
Ambiguous paths,
paths. 69.
69.200.201
Amhiguous
200, 201
hnaphor resolution,
resolution. 2(,7-269,
267-269,270-273,331-334
Anaphar
270-273, 331-334
Annphora, bound Yariable,
variable, 314,
314,325
An.phora,
325
Anaphoric
An.phoric reference
and computaiional
101
computational constraints.
constraints. 101
Antecedent, 267,341
267, 341
Assurnablc
132-173, 181,207
Assumable knowledge,
knowledge, 172-173.
AlN,
261. Seealso
ATN,261.
See also LUNAR
LUNAR
Attribute
database.
database. 63-64.65.70.205
63-64,65, 70,205
Backward
ll.ckward anaphora.
anaphora, 316-317
Belief
llelief suucturc,
structure, 149-150
Belief.
176,187,207,291
llelief, 33.75,76,83,111,116-117,167-170,172.
33, 75, 76, 83,111,116-117,167-170,172,176,187,
207,291
Index
Co-indexing. 38, 39
Cu-indexing. 38, 39
and communicativc behavior, 27
and communicative behavior, 27
and predication co-indexing, 53-54
and predication co-indexing, 53-54
Co-prescnt discounc foci, 320
Co-specification, 269
Co-spc:cifter, 269
Co-spedfy,269
392
393
Co~~vcrsationnl
Maxim ofquantity (Gricc). 175.182, 191
Conversational Maxim of Quantity (Grice), 175,182, 191
Coopcrativc conversation
Cooperative conversation
conventions, 171,172-175
conventions, 171, 172-175
dcfiniiion. 167-168
definition, 167-168
Coopcrativc rcsponsc, 167-171
Cooperative response, 167171
Corrcctivc indircct rcsponsc, 174-175,207
Corrective indirect response, 174-175,207
cotnputation of. 180-183
computation of, 180-183
definition, 174
dcfinition,174
examples OK 181,190,191
examples of, 181, 190, 191
Database ( I N ) schema
Database (DB) schema
in infcrcncing. 194,195.200
in infercncing, 194, 195, 200
as knowledge source, 170,178,201
as knowledge source, 170,178,201
as uscr model, 200
as user model, 200
Ikfinitc anaphora. 170,268,276. 331.337-339
Definite anaphora, 170,268,276, 331,337-339
Definite plural anaphora, 338,361-362,367,370
Definite plural anaphora, 338,361-362.367,370
Definite pronoun anaphora, 332.338
Definite pronoun anaphora, 332, 338
Direct rcsponsc, 168,169,171-174. See also Indirect response
Direct response, 168. 169.171-174. See also Indirect response
Discourse and generative grammar, 30
Discourse and generative grammar, 30
394
Index
I )iscou rsc
Discourse
crcation of discoursc cntitics, 42-44
creation of discourse entities, 42-44
clcmcots, 280
clements, 280
cntitics. 37, 104,335.280
entities. 37, 104,335.280
focus, 282
focus, 282
fonning descriptions of, 354-363
fonni ng descriptions of, 354-363
gcncric scls, 347
generic selS, 347
liaking of discoursc cnti~ics.50-63
linking of discourse enliLies, 50-63
Discourse model, 280,306,334-336
Entity
Entity
database, 178, 179,205
database, 178, 179, 205
discourse, 37,104,336,280
discourse, 37, 104,336,280
Expectation, 126
395
Expected
Expected focus
focus rejection,
rejection, 287
287
Expected
Expcctcd focus,
focus.283
283
Extensional
Extcnsio~ialquestions,
questions.175,
175. 183,200
183.2W
Extensional semantics, 64, 71
Focus,
273,274
Focus,32,
32,273,274
confirmation,
276, 289,299
confirmation,55-56,
55-56,276,289,
299
current
current focus,
focus, 51
51
in
183-185
in suggestive
suggestiveindirect
indirect response,
rcsponse,183-185
it-clefts,
59
it-clefts, 59
movement,
296, 302
movement, 295,
295,296,302
movement,
300
movemcnt, local,
local, 299,
299,300
movemcnt, non-local,
non-local,301
301
movement,
popping,
329
popping. 299,
299.329
potential
potential focus,
focus,52-53
52-53
set,:298
sets;298
space,272
space,272
stack,57-59
57-59
stack,
stress,61-62
61-62
stress,
task
task structure,
structure,63
63
topicali7.ation,
59
topicali7ation,59
tracking,278,303
tracking. 278,303
wh-questions,59,
59,185
wh-questions,
185
Focusingalgorithm,
algorithm,283,
283,291,294
Focusing
291, 294
Focusing,274,275,278,303
274,275,278,303
Focusing,
Forwardco-specifiers,
co-specifiers,315,
315,316
Forward
316
Gencration
Generation
specificationof
ofNoun
Phrase,269
269
specification
Noun Phrase,
Genericset
set
Generic
in
discourse,
337
in discourse, 337
396
Index
Ilclpful bchavinr. 107. 167-172. See also Corrcctivc indircct rcsponse,
Helpful behavior, 107, 167-172. See also Corrective indirect response,
Suggcs~ivcindircct rcsponse
Suggestive indirect response
277,288,289-291.308
INFORMIF, 136
INFORMIF, 136
INFORMREF, 136
INFORMREF,136
Inlcnsional language, xxii
Intensional language. xxii
397
Invcrtcd fields
Inverted fields
in qucry, 196,204-205
in query, 196,204-205
Is-a verbs, 285
LUNAR, 188,331-333
LUNAR,188,331-333
190,191,195
190,191,195
Index
Parallelism, 318-320,329
Pancr, 92-97.190
Parser, 92-97, 190
398
399
Panin& 92-97.179.190-199
Parsing. 92-97,179, 19(H99
Path finding in databasc. 198-199,200,201
Path finding in database, 198-199,200, 201
Perlocutionary farce. 171,207
Perlocutionary force. 111. 207
PHIdIQA1.176.333
PHLlQAI,176,333
Plan consuuction rules, 124
Plan construction rules. 124
PUNNEIZ, 219.220
PLANNER, 219-220
Planning and discourse, 34,74-83
Planning and discourse. 34. 74-83
297
Preconditions, 111,118
Preconditions, Ill, 118
400
Index
Prcdicatc calculus
Predicateascalculus
a rcprcscntation for natural Ianguagc, 342,348-353
as a representation for natural language, 342,348-353
Prcncx normal fonn, 44-45,46.48,353-354
Prenex normal fonn. 44-45, 46.48, 353-354
Prcscntcd scl178,186, 189
Presented set. 178. 186, 189
Preamption. 173-174,175.177.181-182,185,191.203,207
Prcsumption.
173-174.70-73
17S, 177,181-182,185,191. 203,1J11
in discourse,
in discourSe,70-73
Presupposition, 73,172-174.175.207
73,172-174,175,207
Prcsup~tion,
401
R F L 188
RF.L.188
REQUFSf. 136
Rule of construal, 42
Rule of construal, 42
Ssuucture (surfacesou~turc),60,61
S-strueture (surface structure). 60. 61
ScnScaca
Sentenca
combinatoric f a m m of. 337-339
combinatoric features of. 337-339
Sentential, 286
402
Index
Speech act,
75-77,and
108.dircct
11] 119,
135,acts,
171comparison of, 76,119
indircct
spccch
indirect and direct speech acts, comparison of, 76, 119
Stonewallitl& 168,175
Stonewalling, 168,175
Scrcss, and focus, 6162
Suess. and focus, 61-62
Subjacency, 51
403
Vagucncss
Vaguencss
in qucrics, 187-189,190,205-206
in qucries, 187-189, 190.205-206
Vcrb phrase ellipsis, 341
Verb phrase eJJjpsis, 341
Wants. 116
Wants,U6