Top Banner
Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation: University of Edinburgh, School of Informatics Telephone: +44 131 650 4190 Fax: +44 131 650 4587 e-Mail: [email protected] running heading: D-LTAG Keywords: discourse structure, discourse connectives, discourse parsing, lexi- calized grammar, anaphora
41

Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

Sep 06, 2018

Download

Documents

dangdan
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

Author: BonnieWebber

Title: D-LTAG: ExtendinglexicalizedTAG to Discourse

Affiliation: Universityof Edinburgh,Schoolof Informatics

Telephone:+441316504190

Fax: +441316504587

e-Mail: [email protected]

runningheading:D-LTAG

Keywords: discoursestructure,discourseconnectives,discourseparsing,lexi-calizedgrammar, anaphora

Page 2: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

Abstract

This papersurveys work on applying the insightsof lexicalized grammarstolow-level discourse,to show the value of positing an autonomousgrammarfor low-level discoursein which words (or idiomatic phrases)are associatedwith discourse-level predicate-argumentstructuresor modification structuresthatconvey their syntactic-semanticmeaningandscope.It startsby describinga lexicalizedTreeAdjoining Grammarfor discourse(D-LTAG). It thenreviewsaninitial experimentin parsingtext automatically, usingbotha lexicalizedTAGand D-LTAG, and then touchesupon issuesinvolved in how lexico-syntacticelementscontribute to discoursesemantics.The paperconcludeswith a briefdescriptionof thePennDiscourseTreeBank,a resourcebeingdevelopedfor thestudyof discoursestructureandsemantics.

Page 3: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 1

1 Introduction

For many linguists,syntax– thestructuralregularitiesof a languagethatprojectthemeaningsof wordsontothoseof utterances– stopsat thesentenceboundary.Materialoutsidethatboundary– i.e., thepreviousdiscourse– is simply contextthatmay(or maynot) licenseaparticularconstructionof linguistic interest.

Of course,discoursetoo hasstructuralregularities.Evenin just theareasofformal andcomputationallinguistics, therehave beenseveral attemptsto pro-ducea rigorouscharacterizationof the regularitiesof discoursestructure. Forexample,

� McKeown (1985) took the regularitiessheobserved in the structureofdefinitionsandencodedthem into schemata, which could then be usedto automaticallygeneratedefinitionsof conceptsunderlyinga databasemodel.

� Mann & Thompson(1988), observingregularities in the semanticandpragmaticrelationshipsholdingbetweenadjacentclausesandtakingthemto hold recursively betweenlargerunitsof discourseaswell (i.e., clauseslinked togetherby suchrelations),codified their observationsabouttheresultingstructuresin asystemcalledRhetoricalStructureTheory(RST).RST hasprovided an underpinningfor work in Natural LanguageGen-eration(Hovy, 1988;Moore,1990)andmorerecently, in summarization(Marcu,2000).

� Grosz& Sidner(1986)focusedonspeaker intentionsasastructuringprin-ciple for discourse,with a structuraldominancerelationholdingbetweenonediscoursesegmentandthosesegmentsthatsupportedits purpose,anda structuralprecedencerelation betweena discoursesegmentand thosewhosepurposesrequiredprior satisfaction.

� Sibun (1992) stressedthe aleatorystructureof expository discourse,asdemonstratedin thedescriptionsof houseandapartmentlayoutscollectedby Linde (1974), as well as similar descriptionsshe collectedherself.Sibun showed how this structurecould be modelledas the output of asemi-deterministicprocessreactingsequentiallyto propertiesof theworld(viewedaspotentiallycomplex graph)thatit wascalleduponto describe.

Page 4: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

2 B O N N I E W E B B E R

� Asher& Lascarides(1998),in SegmentedDiscourseRepresentationThe-ory focusedon reasoningasan underpinningto discoursestructure,ex-plainingbothdiscoursestructureandthe interpretationof discoursephe-nomena(e.g.,anaphorresolutionandpresuppositiongrounding)asa by-productof reasoning(eithermonotonicor defeasible)aboutthewaythatapropositionconnectsto anaccessiblespeech act discoursereferent in thediscoursecontext. Constraintson what is accessiblemeanthattheresult-ing discoursehasa treestructure.

While all thesenotionsof structureapplyto discourse,amorebasicquestion– tied to syntaxat thesentence-level – is whethersuchsyntaxdoesstopat thesentenceboundaryor whetherthekind of syntacticregularitiesoneseesat thephraseandsentence-level, thatactwith wordsto convey meaning,extendbeyondthesentenceinto discourse.

Herewe seework by Gardent(1997),Polanyi & van denBerg (1996)andSchilder(1997). They, like Asher& Lascarides(1998),wereconcernedwithbothdiscourseprocessinganddiscoursesemantics– how eachnew segmentof adiscoursewouldbecorrectlyattachedto anevolving, interpreteddiscoursestruc-ture,suchthattheinterpretationof thecurrentstructurewasalwaysavailable.Ofparticularinteresthereis thattheseresearchersusedtheadjoiningoperationfromTreeAdjoining Grammar(Joshi,1987)anda relatedsister-adjoiningoperationin their work, asa way of constructingdiscoursestructuresincrementallyfromasequenceof sentencesandclauses.

But theseresearchersdid not explicitly addressthe way in which syntaxmight extendbeyondthesentence,which is essentiallytheconcernof theworkthat Aravind Joshiand I and someof our colleaguesand studentshave beencarryingout, in looking at lexicalisedgrammars for discourse.

In a lexicalized grammar, structurehas a more intimate associationwithwords than it doesin a phrasestructure grammar. For example,LexicalizedTreeAdjoining Grammar(Schabes,1990)differsfrom a basicTAG in associat-ing eachentry in the lexicon with thesetof treestructuresthatspecifyits localsyntacticconfigurations.1 Someof thesetreestructurescancombinewith one

1Other lexicalizedgrammars includeCombinatoryCategorial Grammar(Steedman,1996)andDependency Grammar(Melcuk, 1988). Lexicalizedgrammarshave provedto bea signif-icant tool in thetheoreticalunderstandingof clause-level phenomenaandhave spurredcompu-tationaldevelopmentof robust,wide-coverageparsersfor NaturalLanguagetext (Bangalore&Joshi,1999;Hockenmaier& Steedman,2002;Clark & Hockenmaier, 2002).

Page 5: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 3

anothervia substitution, while othersmakeuseof TAG’sadjoiningoperationinorderto produceacompleteanalysis(cf. Section2).

In 1997,workingwith DanCristea(Cristea& Webber, 1997),I noticedthatifonewantedto “parse”discourseincrementallyin a TAG framework (followingGardent(1997) andSchilder(1997)), onealso neededto exploit substitution,aswell as the adjoining operationthat they werealreadyusing. This wasbe-causeit wasnecessaryto associatea discourseconnective suchas“on the onehand” with a treestructureinto which a subsequent,not necessarilyadjacent,sentencemarked by “on the other (hand)” or othercontrastive marker, wouldthensubstitute,ratherthanadjoin. This broughttheframework closerto a lexi-calizedTAG, andled Aravind Joshiandmyself to begin to explorewhethertheinsightsof lexicalized grammarscould also be appliedto low-level discourse– that is, whetheronecould have an autonomousgrammarfor low-level dis-coursein which words (or in somecases,idiomatic phrases)were associatedwith discourse-level predicate-argumentstructuresor modification structuresthat conveyed their syntactic-semanticmeaningand scope(Webber& Joshi,1998).

Thisexplorationhascontinuedover thelastsix years,engagingtheattentionandefforts of several studentsandcolleagues– (Creswellet al., 2002;Forbeset al., 2001;Forbes& Webber,2002;Forbes,2003;Forbes-Riley et al., 2004;Miltsakaki et al., 2003,2004;Prasadet al., 2004;Webberet al., 1999a,b,2000,2001,2003).Someof whatwebelievehasbeengainedthroughthis explorationis specificto a lexicalizedapproachto discourse,while othergainshave beentruly new and generalinsightsinto the way in which discoursestructureandsemanticsprojectfrom lexico-syntacticelements.Wehopethereaderwill graspbothsortsof benefitsfrom this brief survey paperandfrom thepapersit drawson. In particular, wehopeto show that:

� The approachprovides a uniform way for lexico-syntacticelementstocontribute to the syntaxandsemanticsof both the clauseanddiscourse,openingup the(still to berealised)possibilityof sentenceprocessingandlow-level discourseprocessingbeingcarriedout in anintegratedfashion.

� The approachshows that low-level discoursestructureandsemanticsisnot simply amatterof attachingeachnew clauseor sentenceinto thepre-viousdiscoursethroughits discourseconnectives:thereareotherwaysinwhichdiscourseconnectivescancontributeto discoursecoherence.Thesecontributionscantheninteract,allowing certaincomplex featuresof dis-

Page 6: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

4 B O N N I E W E B B E R

courseto becomputedthroughtheinteractionof simplermechanismsthatareoperationalelsewhereaswell.

� Theapproachallowsoneto reliablyannotatealargecorpuswith low-leveldiscoursestructure,in whichthebasisfor annotationdecisions– discourseconnectives(viewedaspredicates)andtheir arguments– is clear.

Thepaperaimsto demonstratethesebenefits,surveying work carriedout in thislexicalized approachto discourseand floating somenew ideasas well. Sec-tion 2 illustrateswhat it meansto have a lexicalizedTAG for discourse– a D-LTAG – and how it relatesto lexicalized TAG at the clause-level. It therebyshows how D-LTAG providesa uniform way for lexico-syntacticelementstocontribute to both theclauseandthediscourse.Section3 presentsa brief lookat our first experimenton analysingdiscourseautomaticallywith respectto D-LTAG (Forbeset al., 2001). This work usesthe samechart-basedleft-cornerLTAG parser(Sarkar,2000)for bothsentenceanddiscourseprocessing,takingthesequenceof derivationtreesproducedfrom sentence-level analysisandout-putting a derivation treefor the discourseasa whole. It is a first steptowardsintegratingsentenceprocessingandlow-level discourseprocessing.

Section4 briefly describeshow looking at text from a D-LTAG perspective– which requiresone to associatea compositionalsemanticconstructionwitheachelementof lexicalizedsyntax– hasforcedusto look morecarefullyat justhow lexico-syntacticelementscontributeto discoursesemantics.Theresultsaresurprising:while somediscourseconnectivescontributea relationshipbetweenadjacentdiscourseelementsasexpected,otherscreateananaphoricrelationbe-tweena discourseelementandthe discoursecontext. Still others,suchas forexampleandfor instance, contributeby abstractingover thenearestpredication,be it clause-level or discourse-level, andaddingtheresultto thediscoursecon-text (Webberet al., 2003). All of these– alongwith thewaysthey caninteract– are describedbriefly in Section4. Section5 describesthe Penn DiscourseTreeBank(http://www.cis.upenn.edu/� pdtb),a resourcebeingdevelopedfor thestudyof discoursestructureandsemantics.Finally, Section6 speculateson thefutureof D-LTAG.

It shouldbestressedthatthefocusof this work is propertiesof the low-levelstructureandsemanticsof monologicdiscourse.It doesnot addressissuesofhigh-level rhetoricalstructure(e.g.,standardforms of narrative, argumentationor exposition),intermediate-level discoursestructurein termsof speaker inten-tions, or dialoguestructure (e.g.,question-answerpatterns,patternsof clarifi-

Page 7: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 5

VP

V NP

P

like

NP

P

by

NP

NP

like

like

like

like

S

S

NP

NP

VP

V NPε

S

NP

S

NP

NP

NP*

VP

PPVP*

PP

VP

V PP

(a) (b) (c)

P

(d) (e)

Figure1: Elementsof thetreesetof like

cationdialoguesor of expositionandacknowledgement,etc.). Thusit doesnotposeanalternative to theoriesof intermediate-or high-level discoursestructureor dialoguestructure,but rathera necessarysubstratefor suchtheories,similarto thatof sentence-level syntaxandsemantics.

2 D-LTAG: Lexicalized TAG for Discourse

In a lexicalizedTree-AdjoiningGrammar(LTAG), a word is associatedwith asetof treestructures(its treeset), onefor eachminimal syntacticconstructionin which the word canappear. For example,within the treesetof like is onetree(Fig. 1a)correspondingto simpleSVO orderfor transitive verbs,asin Theboyslikeapples, anothertreecorrespondingto topicalizedOSVorder(Fig. 1b),asin Applestheboyslike, anda third treecorrespondingto thesimplepassive(Fig. 1c), as in Applesare liked by the boys. All thesetreesrealizethe samepredicate-argumentstructure,with oneNPargumentfor the“lik er” andasecondNP argumentfor the“lik ee”. The treesetalsoincludesa treecorrespondingtolike astheprepositionalheadof anNP post-modifier(Fig. 1d),asin appleslikethisoneandanothertreecorrespondingto likeastheprepositionalheadof aVPpost-modifier(Fig. 1e),asin Singlikea bird.

The above syntactic/semanticencapsulationis possiblebecauseof the ex-tendeddomainof locality of a lexicalizedgrammar:Whenlike is simplycharac-terizedasa verb(or a prepositionor a noun)in a non-lexicalizedgrammar, the

Page 8: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

6 B O N N I E W E B B E R

NP

NP

NP*

PP

P

like det N

applean

VPNP

npr

JohnVP NP

ate

S

det N

applean

S

VPNP

npr

JohnVP NP

ate

PPNP

P

like

NP

Figure2: An auxiliaryPPtreeadjoiningto aninitial NP tree

informationaboutthesyntacticconfigurationsit canappearin andhow its inter-pretationcombineswith thatof otherelementsin thosesyntacticconfigurationsis spreadoutacrossotherpartsof thegrammarratherthanbeinglocalisedin oneplace.

In anLTAG, therearetwo kindsof elementarytreestructuresthatcanappearin a treeset: initial treesthat reflectbasicfunctor-argumentdependenciesandauxiliary treesthat introducerecursionandallow elementarytreesto be mod-ified and/orelaborated.Fig. 1(a–c)are all initial trees,while (d) and (e) areauxiliary trees. The specialsymbolsusedin thesetrees(

�and � ) relateto the

two operationsby whichtreescancombineto form moreextendedderivedtrees.�indicatesa substitutionsitewhereanelementarytreecansubstituteinto a de-

rivedtree,providedthelabelat its root matchesthatof thesubstitutionsite.Forexample,an NP treeanchoredby thepropernounJohncansubstituteinto anyof thesubstitutionsitesin Fig. 1. � indicatesanadjunctionsite (or foot node),wherean auxiliary treecanadjoin into a root, leaf or non-terminalnodeof anelementaryor derivedtree,againprovidedthat its label (thesameasthatof itsrootnode)is alsothesameasthelabelof thenodeto which it is beingadjoined.Fig. 2 shows the like PPtreefrom Fig. 1(d) anda treecorrespondingto “Johnateanapple”,alongwith theresultof adjoiningthefirst treeinto thesecondatits secondNP node.Additional examplesof adjoiningcanbefoundthroughoutthepaper, aswell asin otherpapersin this volume.

Now, one way of projectingthe insightsof lexicalized grammarinto dis-coursewould be to have a singlegrammarthat mappedlexical itemsinto dis-coursestructuresdirectly.2 Sucha radicalstepwould not be impossible.How-

2I thankoneof theCognitiveSciencereviewersfor pointingthis out.

Page 9: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 7

ever, we have not thoughtthroughits many consequencesin detail, given thatone would not want to lose the generalizationsthat have beencapturedovermany yearsof work in lexicalizedsentence-level grammars.

Instead,we have simply positeda separateLTAG for discourse(D-LTAG)thatusesthesameoperationsof substitutionandadjoining. While thereis someoverlapbetweenthetwo (e.g.,bothproviding ananalysisof subordinateclause– mainclauseconstructions,thereonestrikingdifference:While LTAG requiresawidevarietyof differentelementarytreesto describeclause-level structure,wehave foundthatD-LTAG requiresvery few elementarytreestructures,possiblybecauseclause-level syntaxexploits structuralvariationin waysthat discoursedoesn’t. For example,like is associatedin theXTAG grammar(XTAG-Group,2001)with 28elementarytreessuchasFig.1(a–c)in whichit servesasaverban-chor. In contrast,a subordinateconjunctionsuchasbecause, which in D-LTAGis a discourse-level predicatethat takes two (clausal)arguments,is associatedwith only two elementarytrees– thesametwo asevery othersubordinatecon-junction. Thus, all the elementarytreesso far identified as being neededforD-LTAG arepresentedin this shortsection.

The root nodeof anelementarytreein D-LTAG is a discourseclause(Dc).At eachsubstitutionsite,a basicclausecanbesubstitutedor a derivedtree. (Abasicclauseis treatedasanatomicunit with features,just asword or lemmaisin a sentence-level grammar.3) Other leavesareadjunctionsitesor the lexico-syntacticelementsthatanchorthe tree. Herewe will first look briefly at initialtreesin D-LTAG andthe rangeof lexical items that anchorthemand therebyserve as the predicateof discourse-level predicate-argumentstructures(Sec-tion 2.1). We thenlook at auxiliary treesin D-LTAG andthe lexical itemsthatanchortreesthatelaboratetheongoingdiscourse(Section2.2).

2.1 Initial treesin D-LTAG

D-LTAG associatesinitial treeswith a varietyof lexico-syntacticelementsthatserve aspredicateson clausalarguments:subordinateconjunctionsandothersubordinators; the lexico-syntacticanchorsof parallelconstructions;someco-ordinateconjuctions;andevensomespecificverbforms.

3We areonly beginningto explorethis aspectof LTAG now at thediscourselevel. In LTAG,eachnodein atreehasanassociatedfeaturestructure thatcan,alongwith thenodelabel,beusedto constrainpossiblesubstitutionsand/oradjunctionsat thatnode.While suchfeaturestructuresarenot discussedin this paper, see(Forbes-Riley et al., 2004).

Page 10: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

8 B O N N I E W E B B E R

Dc

DcDcsubconj

(a)

Dc

Dc Dcsubconj

(b)

α:subconj_mid α: subconj_pre

Figure3: Initial trees(a-b)for asubordinateconjunction.

In LTAG (XTAG-Group,2001), subordinateconjunctionssuch as if, al-though, sinceand so that anchorauxiliary treesbecausethey are outsidethedomainof locality of the verb, headingclausalor VP adjuncts. In D-LTAG,however, it is predicatesonclausalargumentsthatdefinethedomainof locality.Thus,at the discourse-level, subordinateconjunctionsanchorinitial treesintowhich clausessubstituteasarguments.Fig. 3 shows the initial treesfor post-posedsubordinateclauses(a) andpreposedsubordinateclauses(b). In this andotherfigures,Dc standsfor “discourseclause”,

�indicatesa substitutionsite,

and � subconj� standsfor the particularsubordinateconjunctionthat anchorsthetree.

Similar to subordinateconjunctionsarewhatQuirk et al. (1972)call subor-dinators – lexical itemssuchas in order for, in order to, and to (which headpurposeclauses) andby (which headsa mannerclause). Thesealsoanchorini-tial treesin D-LTAG, while anchoringauxiliary treesin LTAG. They only differfrom subordinateconjunctionsin having a non-finite(untensed)clauseasoneargumentandafinite (tensed)clauseastheotherone.

Dc

On theone hand

Dc Dc

α:contrast

On theother hand

Figure4: Initial treefor aparallelcontrastiveconstruction

D-LTAG alsoassociatesinitial treeswith thelexical anchorsof parallelcon-structionssuchas

(1) On theonehand, Johnis generous.On theotherhand, he’s hardto find.

Page 11: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 9

The initial treefor this parallelconstructionis shown in Fig. 4. It is associatedwith both the lexical anchorson the onehandandon the other (hand). Whilein LTAG, theseidiomaticprepositionalphraseswould anchorseparateauxiliarytreesthatadjoinat thesentence-level, in D-LTAG, they bothserveasanchorsforthe sameinitial tree,keepingthe two discourseclauses(Dc) that substitutein,within thesamedomainof locality. Therearesimilar multiply-anchoredinitialtreesfor disjunction(“either”... “or”...), addition (“not only”... “but also”...),andconcession(“admittedly”... “but”...).

Therearealsoinitial treesanchoredby coordinateconjunctionsthatconveya particularrelationbetweentheconnectedunits,suchasso, conveying result.Its initial treeis shown in Fig. 5. In contrast,we take thecoordinateconjunctionandto anchoranauxiliary tree,asdiscussedin thenext section.

Dc

DcDc

α:so

so

Figure5: Initial treefor thecoordinateconjunctionso.

Finally, thereis motivationfor takingtheimperative form of supposeasan-choringan initial tree in D-LTAG. This differs from LTAG, whereverbssuchas supposethat take sententialcomplementsare taken to anchoran auxiliarytreerootedin an S-node,asshown in Fig. 6a. This analysisprovidesa naturalway for LTAG to handlethesyntacticphenomenonof long-distanceextraction(XTAG-Group,2001),illustratedin sentencessuchas“Who doesJohnsupposelikesbeans?” (wherewho is the subjectof likes) and “Who did the elephantthink the pandaheardthe emusaysmellsterrible?”, wherewho is the subjectof smells. With respectto this clausalanalysis,theauxiliary treefor supposeinExample2, wouldadjointo therootof thetreefor theclause“an investorwantsto sell astock. . . ”.

(2) Supposeaninvestorwantsto sellastock,but not for lessthan$55.A limitorderto sell couldbeenteredat thatprice.

At the discourselevel, the motivation for taking imperative supposeto an-chor an initial treewith two substitutionsites(Fig. 6b), is that it correspondsmorecloselyto its discourse-level predicate-argumentstructure:Onesubstitu-tion sitewill befilled by its sententialcomplement,which specifiesa hypothet-ical or counterfactualcondition,while thesecondwill befilled by a subsequent

Page 12: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

10 B O N N I E W E B B E R

Dc

DcDc

S

NP

ε

VP

V

suppose

suppose

*S

(a) (b)

Figure6: LTAG andD-LTAG treesfor imperativesuppose.

discourseclausewhichshouldbeevaluatedunderthatcondition– here,“A limitordercould be enteredat that price”. This is equivalentto the discourse-levelpredicate-argumentstructureof the subordinateconjunctionif. As with if, thesecondargumentof supposeneednot beanassertion.It caninsteadbea com-mand(Example3) or aquestion(Example4),asin theseexamplesreturnedfromGoogle:

(3) Supposethatthemarket is semi-strongform efficient,but not strongformefficient. Describea tradingstrategy thatwould resultin abnormallyhighexpectedreturns.

(4) Supposethat you want to sendan MP3 file to a friend, but your friend’sISPlimits theamountof incomingmail to 1 MB andtheMP3file is 4 MB.Is thereaway to handlethis situationby usingRFC822andMIME?

Of course,imperative supposedoesn’t always play this discourserole, whichleadsto ambiguityin D-LTAG analysesasto whethera particulartokenof sup-poseprojectsaninitial treeinto thediscourse,or justanchorsasimplediscourseclause,I will mentionothersourcesof ambiguityin thenext section.

Onefinal point here.In all our D-LTAG papersto date,we have talkedasifwordsanchorbothLTAG treesandD-LTAG trees.Becauseit is often thecase(aswith suppose) thatonly whena lexical item occursin a particularstructuralconfigurationthat it shouldbeassociatedwith a particulartreein D-LTAG, it ismoreaccurateto talk in termsof anchoredLTAG treesanchoringD-LTAG trees.This is, in fact,how our initial parserfor D-LTAG operates,aswill bedescribedin Section3.

Page 13: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 11

(a) (b)

Dc

Dc

Dc

Dc<dadv> *

Dc

<conn>*

Figure7: Auxiliary treesin D-LTAG. � conn� standsfor any explicit coordi-nating conjunctionor null connective (φ). � dadv� standsfor any discourseadverbial.

2.2 Auxiliary Treesin D-LTAG

Auxiliary treesin anLTAG introducerecursionandallow elementarytreesto bemodifiedand/orelaborated.Auxiliary treesin D-LTAG do the same(Webberetal.,1999a,b,2003).Herewedescribetheauxiliary treesthatwehavetakentobepartof D-LTAG andthenreflecton thejustificationfor thesedecisions.

Thefirst useof auxiliary treesin D-LTAG is in connectionwith descriptionsof objects,events,situationsandstatesthatextendover severalclausesin a dis-course. Suchextendeddescriptionsare formed with coordinateconjunctionsand/orunrealized(null) connectives. Thus,D-LTAG hastakenbothcoordinateconjunctionsandnull connectivesto anchorauxiliary trees– cf. Fig. 7a. Whensucha treeis adjoinedto adiscourseclauseandits substitutionsiteis filled withanotherdiscourseclause,thelatterextendsthedescriptionof thesituationor en-tity conveyedby the former.4 Suchauxiliary treesareusedin thederivationofsimplediscoursessuchas(5):

(5) a. Johnwentto thezoo.b. He tookhiscell phonewith him.

This derivation is shown in Fig. 8. To the left of thearrow ( � ) aretheele-mentarytreesto becombined:T1 standsfor theLTAG treefor clause5a,T2 forclause5b,andβ:unrealized, for theauxiliary treethatconnectsadjacentclauseswithout anovert connective. In thederivation, the foot nodeof β:unrealizedisadjoinedto the root of T1 and its substitutionsite is filled by T2. The resultis shown to the right of � . (A standardway of indicatingTAG derivationsisshown under � , in the form of a derivationtree in which solid lines indicate

4This simplerecursionis relatedto dominanttopic chaining in (Scha& Polanyi, 1988)andentitychainsin (Knott et al., 2001).But null connectivesarealsocompatiblewith theinferencethata strongerrelation(suchasexplanation) holdsbetweendiscourseclauses.If suchaninfer-encedoeshold, thenit wouldno longerbeacaseof dominanttopic chainingor entitychains.

Page 14: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

12 B O N N I E W E B B E R

β:unrealised

β:unrealised*

T1

T2T1 T2

τ10

3

τ2

φφ

Figure8: D-LTAG derivationof Example5

adjunction,anddashedlines,substitution.Eachline is labelledwith theaddressof theargumentat which theoperationoccurs.τ1 is thederivationtreefor T1,andτ2, thederivationtreefor T2.)

We have positeda secondtype of auxiliary tree for D-LTAG, shown inFig.7b. Thisoneis anchoredbyadiscourseadverbialsuchasinstead, otherwise,then, in contrast, therefore, for example, nevertheless, etc.Whatis strikingaboutthis treeis thatit is associatedwith only asinglediscourseclause,while boththeinitial treesin Fig. 3–6andtheauxiliary treein Fig. 7aareassociatedwith twodistinctdiscourseclauses.Thatonly asinglediscourseclauseis involvedin thissecondauxiliary tree(Fig. 7b) follows from our argument(Webberet al., 2003)thatdiscourseadverbials,by andlarge,establishananaphoriclink betweentheinterpretationof theclauseto which they adjoinandthepreviousdiscourse.

But note that adverbialssuchasthesecould be interpretedwith respecttothe discoursewithout being distinct elementsof discoursegrammar, as is thecasewith the demonstrative pronouns(“this” and“that”): While a demonstra-tive pronounis taken to refer to anabstract objectevokedby thepreviousdis-course(Webber, 1991;Asher,1993),in subjector objectposition,it is partof thepredicate-argumentstructureof theverb, sowould not automaticallybepartofthediscoursegrammar. (In otherpositions,demonstrative pronounscontributeto adjunctson the verb,but that doesnot make themautomaticallypart of thediscoursegrammareither.)

Now in LTAG, the reasonthat adverbialsanchorauxiliary treesis becausethey areoutsidethepredicate–argumentstructureof theverb,contributingmod-ifiers likemannerof action(e.g. “swiftly”), frequencyof actionsor events(e.g.,“annually”),speaker attitudetowardseventsor situations(e.g.,“unfortunately”),etc.5 If onetook thecomparablepositionin D-LTAG, thendiscourseadverbials

5Syntactically, LTAG doesn’t distinguishbetweendiscourseadverbialssuchas insteadand

Page 15: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 13

would anchorauxiliary treesin thediscoursegrammarif they wereoutsidethepredicate–argumentstructureof any nearbydiscoursepredicate– i.e.,any struc-tural connective(includingthenull connective)or otherdiscourseadverbial.

Sotherearetwo questions:(1) Shoulddiscourseadverbials,whichareinter-pretedwith respectto discoursein away thatclausaladverbialsarenot (Forbes,2003),be treatedaspartof thediscoursegrammar;and(2) if they should,is itauxiliary treesthatthey anchor?

Discourseadverbials like insteadand otherwisebelong in the discoursegrammarbecausethey relatedtwo abstractobjectsin thesamewayasdoclausaladjuncts,asin Example6:

(6) a. Insteadof stayinghome, Johnwentto thezoo.

b. After cleaningthesnowoff hiscar, Johnwentto thezoo.

c. Becausehefelt boredat home, Johnwentto thezoo.

The only differencein Example7 below is that oneof the abstract objectar-gumentsto insteadis provided anaphorically– in this case,from the clausalsubjectof theprevioussentence.Hence,we take discourseadverbialsto belongto discoursegrammaraswell asto sentence-level grammar.

(7) Goingto thebeachsoundedboring. Instead, Johnwentto thezoo.

As for the secondquestion,it is possiblethat thesediscourseadverbialsshouldbe taken to anchoran initial tree (asdo subordinateconjunctions),butonewhosefirst argumentmustbe recovedanaphorically. Discourseadverbialslike for exampleand for instanceshow why the projectedstructuresshouldbetakento beauxiliary trees.In (Webberet al., 2003),we show that theseadver-bialscanoperateon discoursepredicates,asin Example8:

(8) Johnbrokehisarm,sofor example,hecan’t cycle to work now.

Here,thestructuralconnective so is interpretedasrelatingthe interpretationof“John broke his arm” and “he can’t cycle to work now” – the latter being aconsequentof the former. The discourseadverbial for examplemodifiesthe

clausaladverbialssuchasswiftly, annuallyor unfortunately. They areall associatedwith thesamesetof auxiliary treesbecausethey canall appearat thesamepositionswithin the clause.Forbes(2003)givesanextensive analysisof thefeaturesof anadverbial that leadit to beinter-pretedasa discourseadverbialratherthanaclausalmodifier.

Page 16: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

14 B O N N I E W E B B E R

extentof theconsequence– thelatterbeingbut oneexampleof theconsequencesof theformer. Sotheseadverbialsserve asadjunctsto discoursepredicates,andhenceasanchorsfor auxiliary trees.Notice,of course,that thepredicateneednotbeexplicit, asin Example9:

(9) Youshouldn’t trustJohn.For example,henever returnsanything.

Here,one infers that John’s lying is meantto be an explanation for why oneshouldn’t trusthim, with for examplemodifying its extent– thatit’ sonly oneofpossiblymany reasons.

Nothing elsein this paperdependson whetherdiscourseadverbialsshouldbemodelledasauxiliary treesin bothsentence-level LTAG anddiscourse-levelD-LTAG, but thereadershouldbeawarethatit is aquestionwhoseanswertellsuponhow onethinksaboutdiscoursegrammar.

I turn now to the topic of lexical ambiguity in D-LTAG, noting that thereareothersourcesof lexical ambiguitybeyond thosementionedin Section2.1.One is associatedwith the fact that adverbialscan appearin one structureinwhich they arediscourseadverbials(dependingonthediscoursefor partof theirinterpretation),asin 10a-b,andin otherstructuresin whichthey areindependentof thediscourse,asin 10c-d.

(10) a. Instead, Johnateanapple.

b. Otherwise, youcanforgetdessert.

c. Johnateanappleinsteadof apear.

d. Mary wasotherwiseoccupied.

In thesecases,the clause-level analysisservesto disambiguatewhetheror notthelexical item functionsat thediscourselevel.

Anothersourceof ambiguityis invisibleat theclauselevel. It stemsfrom thefactthatmany of theadverbialsfoundin secondpositionin parallelconstructions(e.g.,on the other hand, at the sametime, nevertheless, but) canalsoserve assimplediscourseadverbialson their own. In the first case,they will be oneofthe two anchorsof an initial tree,suchas in Fig. 4, while in the second,theywill anchorthesimpleauxiliary treeshown in Fig. 7(b). This lexical ambiguityleadsto local ambiguityat thediscourselevel. That is, while thereis only oneconsistentglobal analysisof thediscourse,an incrementalparser, working left-to-right, facesa choicethat canonly be decidedbasedon materialthat comeslater. This is somethingthatclause-level parsersfaceona regularbasis.

Page 17: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 15

For example,in thefollowing passage,at thesametimeservesasthesecondanchorof an initial tree expressingcontrast,whosefirst anchoris on the onehand.

(11) Brooklyn College studentshave an ambivalent attitude toward theirschool. On the onehand, thereis a senseof not having moved beyondtheambianceof theirhighschool.This is particularlyacutefor thosewhoattendedMidwoodHigh Schooldirectly acrossthestreetfrom BrooklynCollege. . . .At thesametime, thereis agooddealof self-congratulationatattendingagoodcollege. . . .

However, in thefollowing minor variationof Example11, at thesametimeanchorsan auxiliary tree that elaborateson the positive aspectsof attendingBrooklyn College,with on the other handservingasthe secondanchorof theinitial treethatexpressescontrast.

(12) Brooklyn College studentshave an ambivalent attitude toward theirschool.On theonehand, thereis a gooddealof self-congratulationat at-tendingagoodcollege.At thesametime, they know they’resaving moneyby living athome.Ontheotherhand, thereis asenseof nothaving movedbeyondtheambianceof theirhighschool.

D-LTAG analysesdonot introduceany kind of localor globaldiscourseam-biguity that is not presentin the original discourse.As with ambiguityat theclause-level, discourseambiguityis a problemthatparsersmustpunton or dealwith, asI will discussbriefly in thenext section.As with clause-level ambiguity,discourseambiguity is a problemthat will probablybe bestsolved by parsersusinga combinationof statistics(favoring analyseswith thehighestpriorsandtextualevidence)anddiscoursesemantics(favoring analysesthatmake referen-tial andrelationalsensein thecurrentcontext). All suchwork is in thefuture.

3 A parser for D-LTAG

Discourseparsinginvolvesanalyzingadiscourseaccordingto adiscoursegram-mar – in our case,D-LTAG. To date,we have carriedout a singleexperimentwith discourseparsing(Forbeset al., 2001)thatshows thatthesameparsercanbeusedfor both clause-level LTAG andD-LTAG. While it doesnot pretendtohave any psycholinguisticvalidity, it doesbring up someaspectsof discourseprocessingworthcommentingon further.

In this work, a chart-basedleft-cornerLTAG parser, LEM (Sarkar,2000)makestwo passesthroughthetext, thefirst producingXTAG derivationtreesfor

Page 18: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

16 B O N N I E W E B B E R

Tree

Extractor

TreeMapper

Discourse Input

Generation

and

Tree Selection

Clausal

Derivations

Tree Database

Clausal Tree

LEM

Derivation Structure

for

Discourse

Representations

LEM

Input Discourse

(Sentence Parsing) (Discourse Parsing)

Figure9: Two-passsentence/discourseparsingusingLEM

eachsentencefrom thesequenceof elementarytreesassociatedwith its words,the secondproducinga D-LTAG derivation for the discourseasa whole fromthe sequenceof elementarytreesassociatedwith its discourseconnectivesandclausalderivations.Theflow of processingis shown in Fig. 9.

For eachsentencein the discourse,LEM usesits chart to recordpossiblederivation trees for the sentenceaccordingto the XTAG grammar(XTAG-Group,2001). To producea singleanalysis,heuristicscanbe usedto decidewhich elementarytreeto assignto eachword (to dealwith lexical ambiguity),andto choosewhereto attachmodifiers(currently, thelowestattachmentpoint)to dealwith structural ambiguity. Eventually, statisticswill replaceheuristicsinthis process.6

Thesequenceof derivationtreescorrespondingto thesequenceof sentencesin the discourseis input to a TreeExtractor (TE), which extractstwo sortsofthingsfrom eachone:(1) thederivationtreefor eachclausein thesentence,and(2) eachelementarytreeanchoredin adiscourseconnective. This is donein two

6A separateversionof thediscourseparserusesLexTract(Xia et al., 2000)at thesentence-level andLEM at thediscourse-level. LexTractprovidesuniqueTAG derivationsfor sentencesinthePennTreeBank,sothatheuristicsarenot neededto selecttreesor chooseattachmentpoints.This just avoidssever ambiguityproblemsat thesentence-level,in orderto focuson discourse-level processing.The processfollowing the useof LexTract to produceuniquesentence-levelderivationsis thesameasin Fig. 9.

Page 19: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 17

passes– thefirst, to identify thediscourseconnectives,andthesecond,to detachclausalderivationsfrom their substitutionand/oradjunctionnodes.Thefirst – atop-down traversalof thederivationtree– considersboth lexical andstructuralpropertiesof eachlexical itembecause,asnotedearlier,

� lexical itemsthat canserve asdiscourseconnectivescanalsobe usedinother ways (e.g., insteadcan serve as an NP post-modifier– “an appleinsteadof apear”;andcanserveasanNPconjunction).Solexical featuresaloneare insufficient to determinewhethera particulartoken is actuallyservingasadiscourseconnective in aparticularcontext.

� LTAG doesnot distinguishbetweenclausaladverbialslike frequentlyanddiscourseadverbialslike otherwise. So structuralfeaturesalonearealsoinsufficient.

Sofrom thesentence

(13) While shewaseatinglunch,shesaw a dog.

TE extractsthe two clausalderivationsandoneelementarytreeanchoredin adiscourseconnective shown below. With clause-medialdiscourseconnectives,

saw

she dog while

she was lunch

a eating

,

Extractor

saw

she dog

a

(i) eating

she was lunch

(ii)

,

(iii) while

asin

(14) Susanwill thentakedancinglessons.

TE makesa copyof the derivation andreplacesthe discourseconnective withan index, to retain its clause-internalposition. This is becauseclause-medialadverbialsappearto berelevantto InformationStructure(Steedman,2000),andthus their position in the clauseis importantto preserve.7 So in Example14,TE extractsa singleclausalderivation andoneelementarytreeanchoredin adiscourseconnective,asshown in Fig. 10.

7While onedoesindeedwant to identify, for InformationStructure,wherea clause-medialadverbialoccursin clausestructure,doingit via this copy-and-replacemechanismis specifictothisparticularimplementation.A processthatinterleavedclausalparsingwith discourseparsingwould,presumably, identify amedialadverbialwhereit occursandprocessit at thatpoint.

Page 20: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

18 B O N N I E W E B B E R

dancingwill

then

take

lessonsSusan

(ii) thentake

lessonsSusan

dancingwill

{then}

Extractor(i)

Figure10: Applicationof TE to thederivationtreeof Example14.

Tree Mapping appliesto the outputof TreeExtraction,to mapsentence-levelstructuraldescriptorsof connectiveelementarytreesto theirdiscourse-levelstructuraldescriptors.(Notethatthisembodiesthesuggestionat theendof Sec-tion 2.1thatit is not lexical itemsthatanchorD-LTAG trees,but ratheranchoredLTAG trees– e.g., only otherwiseasan S-adjoiningadverbial, and not as anadjective-adjoiningadverbial.)

Therole of thenext stageof theprocess,DiscourseInput Generation(DIG)is to producea sequenceof lexicalized treeswhich can be submittedto LEM

for discourseparsing.Thesequenceof lexicalizedtreesconsistsof theconnec-tive elementarytreesobtainedfrom Tree Mapping andtheclausalelementarytreescorrespondingto the clausalderivationsobjtainedfrom the Tree Extrac-tor. Whenthereis no structuralconnective betweenclausalunits,DIG insertsanauxiliary treewith anemptylexical anchorinto theinput sequence.

Ambiguity is handledat thediscourselevel muchin thesameway asat theclauselevel – a singletreeis chosenfor eachconnective andthe lowestattach-mentpoint is selected.(In addition,adjunctionin initial treesis only allowedat their root node.)Lowestattachmentheuristicsareillustratedin Example15.Thereasonfor selectingthis exampleis that the interpretationof they in thefi-nal sentenceseemsto vary with the analysisselected,andso canbe usedasadiagnosticfor thatprocess.(15) Johnis stubborn.(T1)

His sisteris stubborn.(T2)His parentsarestubborn.(T3)Sothey arecontinuallyarguing. (T4)

Fig. 11 shows theoutputfrom DIG for this example.Thefive possiblederiva-tionsfor this exampleareshown in Fig. 12,correspondingto five derivedstruc-turesshown in Fig. 13. Structure(i) canbeparaphrasedas

Johnandhis sisterarestubborn.His parentsarestubborn.Sothey�hisparents� arealwaysarguing.

Page 21: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 19

T2T1 T3 T4

so* φ * φ

β:φβ:φ α:so

Figure11: Treesthatserveasinputto LEM’sdiscourseparsingfromExample15:“John is stubborn.His sisteris stubborn.His parentsarestubborn.Sothey arecontinuallyarguing.”

(iii)(ii)(i) (iv) (v)

α:so

β: φ

τ1

β: φ

τ2

τ3

τ4

α:so

α:so

τ4τ3

β: φτ2

β: φ

τ1 α:so

τ4

β: φ

τ1

τ2 β: φ

τ3

β: φ

τ1β: φ

τ1

β: φ

τ2

τ3

τ4

α:so

β: φ

τ2

τ4τ3

Figure12: Potentialdiscourse-level derivationtreesfor Example15.

(i) (iii)(ii)

(v)(iv)

T1 φT2

soT3 T4

φT1

φT2 T3

soT4

φT2T1

T3

soT4

φ

φ

T1

φT2

soT4T3

φT1

soT4

T2 φT3

φ

Figure13: Derivedstructuresfor discourseparsingof Example15.

Page 22: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

20 B O N N I E W E B B E R

Structure(iv) canbeparaphrasedas

Johnis stubborn.His sisterandhis parentsarestubborn.So they�hissisterandhisparents� arealwaysarguing.

while structures(ii), (iii) and(v) canall beparaphasedas

Johnandhis sisterandhis parentsarestubborn.Sothey�thewhole

family� arealwaysarguing.

Most readerswill take eitherthis or the interpretationassociatedwith structure(i) as the correctinterpretationof Example15, while having no feeling as towhich of thestructureshasgivenriseto it. Our discourseparser, however, onlyconsiderstheuniquederivationin which (i) for aninitial tree,adjunctionis onlyallowedat therootnode,while (ii) for all othertrees,only thelowestadjunctionis allowed. This meansthat the discourseparseronly producesderivation (v)andderived tree(v) for Example15, which happily accordswith the onethatmostreaderscanget. Nevertheless,a morerobust treatmentof both lexical andstructuralambiguityshouldbepursued.

Thereis onemoreproblemthataparserfor discoursemustaddress– thatofdiscourseembeddedin indirectspeechor apropositionalattitude,asin (16)and(17).

(16) Thepilotscouldplayhardballby notingthatthey arecrucialto any saleorrestructuringbecausethey canrefuseto fly theairplanes.

(17) Epigenesistsbelievedthattheorganismwasnotyetformedin thefertilizedegg. Rather, it aroseasa consequenceof profoundchangesin shapeandform duringthecourseof embryogenesis.

In bothexamples,thesententialcomplementof theverb(notein (16)andbelievein (17)) mustitself beanalysedasa discourse,extendingin thecaseof (17) tothenext sentenceaswell.

Ourinitial solutionto thisproblemresembles,in part,ourtreatmentof imper-ativesupposein Example2. I havealreadymentioned,in discussingimperativesuppose, thatin LTAG, verbsthattakesententialcomplementsdosoin theformof an auxiliary treethat adjoinsto the objectclause(cf. Fig. 6a). In D-LTAGhowever, wepositan initial treefor imperativesupposethattakestwo discourseclausesasarguments.For indirect speechandpropositionalattitudeverbs,wearefollowing a suggestionfrom Aravind Joshiandpositingsomethingsimilar:

Page 23: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 21

Dc

Dc

φS

NP VP

v X_i

_i

ε

Figure14: ProposedD-LTAG initial treefor propositionalattitudeandindirectspeechverbs

an initial treeanchoredby thepropositionalattitudeor indirectspeechverbthathasa covert argumentthat is coindexed with the (overt) clausalcomplementintroducedby thecomplementizer(Fig. 14). So,

(18) JohnbelievesthatMary is tired.

is analyzedas

(19) JohnbelievesX i that�Mary is tired� i.

Thereis cross-linguisticevidencefor suchananalysiscomingfrom Hindi8,wheretheX i maybeovertly expressed,asin

(20) raamyesamajhtaahai ki sitathakii-huii haiRamthis believes is thatSitatired isRambelievesthis thatSitais tired

Thediscourseanalysisof Example16would theninvolve thetreesshown inFig. 15,whereT1 representstheanalysisof “The pilotscouldplayhardball”,T2representstheanalysisof “they arecrucialto any saleor restructuring”,andT3,theanalysisof “they canrefuseto fly theairplanes”.

Similarly, the discourseanalysisof Example17 would involve the treesshown in Fig. 16,whereT1 representstheanalysisof “the organismwasnotyetformedin thefertilized egg” andT2, theanalysisof “it aroseasa consequenceof profoundchanges. . . ”.

8RashmiPrasad,personalcommunication

Page 24: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

22 B O N N I E W E B B E R

by

Dbecause

by

prop-att

because

1 3

3

1

1 3

T1

T2 T3c

prop-attDc

DcS

v X_i

because

φ_i

NP VP

εnoting

byDc

α:

α:

α:

α:

T2

α:

α:

T3

T1

Figure15: D-LTAG derivationof Example16 – “The pilots couldplay hardballby notingthatthey arecrucialto any saleor restructuringbecausethey canrefuseto fly theairplanes.”

rather*

0

T2

prop-attDc

Dc

Dc

unrealisedprop-att

unrealisedDc

Dc

rather

rather

φ

NP VP

v X_i

3

0

S_i

εbelieve

* φ

3

T1

β:α:

β:

β:

β:

α:

T2

T1

Figure16: D-LTAG derivationof Example17 – “Epigenesistsbelievedthat theorganismwasnot yet formedin the fertilized egg. Rather, it aroseasa conse-quenceof profoundchangesin shapeandform duringthecourseof embryoge-nesis.”

Page 25: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 23

Neither this view of propositionalattitudeand indirect speechverbs,norimperative suppose, nor the (local) ambiguitycausedby discourseconnectivesthatcanappearin morethanoneD-LTAG tree,have yet beenincorporatedintotheparserdescribedearlier. I expectthatwhenthey are,we will discover otheraspectsof low-level discourseanalysisthatneedexploring.

4 Differences between discourse connectives in D-LTAG

As shown in Section2,D-LTAGdistinguishesbetween(1) structuralconnectivesthatanchorinitial treesandconvey discourse-levelpredicate-argumentrelations;(2) structuralconnectives(including the null connective) that anchorauxiliarytreesand that elaboratethe precedingdiscourse;and (3) discourseadverbialsthat anchorauxiliary treesandcontribute predicate-argumentrelationsdistinctfrom (but thatmayinteractwith) thoseconveyedby structuralconnectives.

Webberet al. (2003)argueextensively thatwhile structuralconnectivesanddiscourseadverbialsmaybothconvey discourse-level predicate-argumentrela-tions,they gettheirargumentsin differentways.Structuralconnectivesgetboththeir argumentsfrom the discourseclausesto which they arestructurallycon-nectedin thediscourse,asin thefollowing9

(21) a. Because�HealthcareactuallyowesHealthVest$4.2million in rentand

mortgagepaymentseachmonth� , �the amountdueabove the amount

paidwill beaddedto thethree-yearnote.�b. Even though critical,

�it was just the kind of attention they were

seeking.� So�they fired backat theGoldmanSachsobjectionsin their

own economicsletter, “The BMC Report.” �On the otherhand,many discourseadverbialsget only oneargumentfrom theclauseor sentenceto which they areadjoinedandtheotheranaphoricallyfromtheprecedingdiscourseasin

(22) a.�If thelight is red� , stop.

Otherwise,�continuedown theroad.�

b. One greatdifferencedistinguishedthe Soviet and Germansystems:�therewasno Soviet equivalentof thedeathcamps� . Peoplesentenced

9Following theconventionsusedin thePennDiscourseTreeBank(Section5), argumentsarebracketted,while connectivesareunderlinedandin bold.

Page 26: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

24 B O N N I E W E B B E R

to deathin the Soviet Union weregenerallyshotbeforeenteringthecampnetwork. Applebaumestimatesthesevictims at just underonemillion during the Stalin years. Instead,

�Soviet prisonerswere ex-

pectedto earn their keep by contributing to the creationof SovietSocialism� .

c. A personwho hates�to sit watching television� might instead

�try

skydiving� .Empirical evidencefor this distinctionbetweenstructuralconnectivesanddis-courseadverbialscomesfrom Creswelletal. (2002),whodescribeanannotationexperimentin which annotatorswereaskedto identify theminimal text unit intheprecedingdiscoursecontainingthesourceof the“left-hand” argumentof thefollowing nineconnectives:

� Resultatives:asa result,so,therefore� Additives:also,in addition,moreover� Concessives:nevertheless,yet,whereas

The datacamefrom Brown corpus,WSJcorpus,Switchboardcorpus,and58transcribedoral historiesof online SocialSecurityAdministration(SSA) OralHistory Archives10. Theresultsshowedavarietyof distributionpatterns:

� Soalwaystook the immediatelyprecedingsentenceor sequenceof sen-tencesasits left argument.

� NeverthelessoftentookXP (i.e.,phrasal)arguments.� Thereforeoftentook its left-handargumentfrom asubordinateclause.

Connectivesthatpatternedwith soweretakento bestructuralconnectives,whiletheothersweretakento gettheir “left-hand” argumentanaphoricallyfrom interalia a non-adjacentclause,a relative clause,etc. – that is, from a clausethat isnot structurallyconnectedto thediscourseadverbial.

The problemsof dealingwith thesetwo typesof discourseconnective dif-fer. With structuralconnectives, one hasto rely on the parserto associateaconnective with its intendedarguments:An incorrectattachmentdecisionwill

10http://www.ssa.gov/history/orallist.html

Page 27: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 25

meanan incorrectargumentassignment.With anaphoricconnectives,aswithany anaphor, onemustdevelopaprocedurefor resolvingthem.

Now it is well-known thatdifferentanaphorsdisplaydifferentpatternsvis-a-vis the distribution and type of their antecedents:plural pronounsallowsplit antecedentswhile singularpronounsdo not; definitenounphrases(NPs)commonlyallow antecedentsrelatedthroughbridging while pronounsdo soonly rarely; theantecedentsof demonstrative pronounscommonlyderive fromclauses,while thoseof personalpronounsmostcommonlyderivefrom NPs;etc.In the caseof a discourseadverbial, if its “left-hand” argumentis anaphoric,thenoneneedsto articulateaprocedurefor finding its antecedentandfrom that,deriving its argument.

We do not think all discourseadverbialswill patternexactly the samevis-a-vis their antecedents,sowe areproceedingon a case-by-casebasisto gatherdataon how they patternandon what featuresare relevant to that patterning.The preliminarystudywe have carriedout on the discourseadverbial instead(Miltsakaki et al., 2003)illustrateswhat is needed.HereI will summarizeandelaborateon thatstudyandcommenton how wearenow proceeding.

Insteadcomesin two forms: (i) abareadverbial,asin

(23) InsteadJohnateanapple.

and(ii) modifiedby an“of ” PP, asin

(24) Johnateanappleinsteadof apear.

(25) Johnspenttheafternoonat thezoo insteadof at themuseum.

With an “of ” PP, bothargsof insteadderive structurally: thefirst from themodified phrase(e.g., “an apple”) and the secondfrom the “of ” PP (e.g., “apear”).Semantically, thatsecondargumentis a salientbut unchosenalternativeto thefirst, with respectto thegivenpredication.This is basicto theinterpreta-tion of insteadin bothits modifiedandbareforms.

As a bareadverbial, insteadcontinuesto get its first argumentstructurally,but its secondargument– thesalientbut unchosenalternative– mustbederivedanaphorically, from thediscoursecontext. But not everycontext providesalter-natives:

(26) a. Johnfoundit hardto eatanapple.Insteadheateapear.

b. Johnfoundit easyto eatanapple.#Insteadheateapear.

Page 28: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

26 B O N N I E W E B B E R

c. I told, expected Johnto eatanapple.Insteadheateapear.

d. John told, expected meto eatanapple.#Insteadheateapear.

As far as I am aware, there is no theoretical account of what types ofphrases/clausessuggestalternativesthatlicense“instead”.11

To begin to discoverthisempirically, pairsof annotatorsseparatelyexamined100successive instancesof bareinsteadin thePennTreeBankandrecordedtheminimal text spancontainingthe antecedentof its anaphoricargument. Therewasagreementin 97/100cases,andtheother3 caseswereexcludedfrom furtheranalysis.

We thenchosefeaturesto annotatethatwe hadobserved in serendipitouslyencounteredinstancesof instead:

� clausalnegation

(27) Johncouldn’t sleep.Instead,hewrotecode.(Verbal neg)

(28) Noonecouldsleep.Instead,everyonewrotecode.(Subj neg)

(29) Johnatenoneof hisspinach. Instead,hefed it to his frog. (Obj neg)

� presenceof amonotone-decreasingquantifier(MDQ)

(30) Few studentslike to do homework. Instead,they would ratherparty.

(31) Studentsseldomsleepin class.Instead,they takenotesassiduously.

� presenceof amodalauxiliary (Modal)

(32) Youshouldexercisemore.Insteadyou sit likeacouchpotato.

� whethertheantecedentis embeddedin a higherclause(Embed)

(33) Johnwantedto eatapear. Instead, heateanapple.

(34) Chrysler officials resistedcutting output. Instead, they slapped$1000cashrebateson vehicles.

11Forbes(2003)shows that thesearenot the samealternativesthat underpinthe semanticsof focusparticlessuchas“only” and“even”. On theotherhand,thereareclearly relationshipsbetweenthem,as“Only Johnateanapple.Insteadtheotherboysatepears.”

Page 29: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 27

Features YES(of 97) NO (of 97)Verbalneg 37(38%) 60 (62%)Subjneg 5 (5%) 92 (95%)Obj neg 10(10%) 82 (85%)MDQ 1 (1%) 96 (99%)Modal 12 (12%) 85 (88%)Condit 1 (1%) 96 (99%)Embed 57(59%) 40 (41%)

Figure17: Distributionof Featuresof theAntecedentof instead

Antecedents PCAsFeatures YES(of 97) NO (of 97) YES(of 169) No (of 169)Verbalneg 37(38%) 60 (62%) 21 (12%) 148(88%)Subjneg 5 (5%) 92 (95%) 8 (5%) 161(95%)Obj neg 10(10%) 82 (85%) 6 (4%) 139(82%)MDQ 1 (1%) 96 (99%) 0 (0%) 169(100%)Modal 12 (12%) 85 (88%) 17 (10%) 152(90%)Condit 1 (1%) 96 (99%) 0 (0%) 169(100%)Embed 57(59%) 40 (41%) 14 (8%) 155(91%)

Figure18: Distributionof Featuresof thePCAsof instead

(35) PaineWebberconsidered recommendingspecificstocks.Instead, itjust urgedits clientsto stayin themarket.

Theresultsareshown in Fig. 17.12

We theninvestigatedwhetherotherclausesthat don’t serve asantecedentsfor instead, whichwecall “potentiallycompetingantecedents”or “PCAs”, havea similar distribution with respectto thesefeatures.As in Soonet al. (2001),we limited potentially competingantecedentsto onesoccurring betweentheanaphorand its true antecedent.Here,PCAswerefinite or non-finiteclausesinterveningbetweeninsteadandits trueantecedent.For the97tokensof insteadon which annotatorsagreed,this produced169 PCAs. The distribution of thesamesevenfeaturesfor thesePCAsis shown in Fig. 18.

12Antecedentscould display one or more compatiblefeatures– e.g., both Subj neg andModal.

Page 30: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

28 B O N N I E W E B B E R

Therearesomeobvious differencesbetweenthe antecedentsandPCAsofinstead. First,asshown in thefollowing summaryof clausalnegationfeatures

Antecedents PCAsFeatures YES(of 97) NO (of 97) YES(of 169) No (of 169)Verbalneg 37(38%) 21 (12%)Subjneg 5 (5%) 8 (5%)Obj neg 10(10%) 6 (4%)

clausalnegationwasfoundto beover2.5timesmorecommonin theantecedentof insteadthanin PCAs– 52/97times( � 53%)versus35/169times( � 20%).

Second,focussingon theembed feature

Antecedents PCAsFeatures YES(of 97) NO (of 97) YES(of 169) No (of 169)Embed 57(59%) 14 (8%)

theantecedentof theanaphoricargumentof insteadwasfoundto beoverseventimesmorefrequentlyembeddedin ahigherverbthanaPCAwas– 57/97times( � 59%)vs 14/169times( � 8%).

On theotherhand,for thefeaturesrelatedto theantecedentbeingin a con-ditional (condit) or containinga monotonicallydecreasingquantifier(MDQ),thereisn’t enoughdatato draw any conclusions.The featurerelatedto thean-tecedentcontaininga modalauxiliary (Modal) doesnot, assuch,seemat allpredictive.

Subsequentto this study, we reviewed the dataanddecidedthat this initialfeaturesetshouldberefinedin at leastthefollowing ways,to widenthediffer-encebetweenantecedentsandPCAs.

1. Although the embeddingfeatureis stronglypredictive, we realisedthatnot all embeddingcontexts suggestalternativesto their embeddedclauses.Inparticular, someembeddedPCAs(but noembeddedantecedentsof instead) wereembeddedunderfactive verbslike know. It is well-known that factive verbspresupposethetruth of their embeddedclause(Kiparsky & Kiparsky, 1970),asin

(36) Johnknows thatFredeatsmeat.

They thereforedo not provide alternativesthat canserve asantecedentsfor in-stead, cf.

Page 31: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 29

(37) Johnbelieves/*knowsthatFredeatsmeat.InsteadFredeatstofu.

Therefore,we shouldannotatea featureon the embeddingverb, identifyingwhetheror not it is factive, to excludeclausesembeddedunderthelatteraspo-tentialantecedents.Sincethereis only asmallnumberof factiveverbs(althoughthey arerelatively common),sucha featurecould be annotatedautomatically,with high reliability.

2. Certainverbsappearto suggestalternatives,independentof whethertheclausealsocontainsexplicit negation,a monotonically-decreasingquantifier, amodalauxiliaryor clausalembedding.Considerthefollowing examples.

(38) JohndoubtedMary’s resolve. Instead, he thoughtshewould give up assoonasheleft.

(39) NBC is contemplatinggettingout of thecartoonbusiness.Instead, it may“counter-program”with shows for anaudiencethatis virtually ignoredinthattimeperiod:adults.

(40) Investorshave lost theirenthusiasmfor thestockmarket. Instead, they arebuyinggovernmentbonds.

(41) But respectabilitystill eludesItaly’s politics. Instead, it has the phe-nomenonof Mr. Berlusconi.

Many additionalsuchverbshave cometo our attention. They appearto fallroughly into two classes,althoughneithercorrespondsto any known thesaurusor WordNetclass.Thefirst class– includingdoubt, refuse, deny, preclude, etc.–appearsto containanelementof implicit negation,andmightbecallednegativepropositionalattitudeverbs. Thesecondclass– includingstop, lose, get out of,change, drop, giveup, elude, etc.– mightbecallednegativestatechangeverbs.They indicatethat in thesituationafter theeventconveyedby theclause,someearlier featureof the situationno longerholds. This featurethenseemsto beavailableasanalternative to theindicatedchange.

While verbsin bothclassesappearto suggestalternatives,thecompositionof theseclassesremainsto bespecified.Sowe mustacquiretheir membershipconcurrentlywith carryingout annotation.

3. Evenmoreof achallengeto automaticidentification,is thefactthatotherlexico-syntacticelementsthat do not fall into a priori classesappearable tosuggestalternativesaswell. In thefollowing examplefrom thePennTreeBank

Page 32: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

30 B O N N I E W E B B E R

(42) Thetensionwasevidenton WednesdayeveningduringMr. Nixon’s finalbanquettoast,normallyan opportunityfor recitingplatitudesabouteter-nal friendship. Instead, Mr. Nixon remindedhis host,ChinesePresidentYangShangkun,thatAmericanshaven’t forgivenChina’s leadersfor themilitary assaultof June3-4 that killed hundreds,andperhapsthousands,of demonstrators.

eithertheadverb“normally” or thenoun“opportunity” appearsto beasufficienttriggerfor alternativesandhencetheuseof instead:

(43) Normally, weeatpastaonTuesday. Instead, tonightwe’re having fish.

(44) Johnhadthe opportunityto buy a cheapusedcar. Instead, he boughtascooter.

Sowhile it is clearthatweshouldbroadentherangeof featuresbeingconsidered,it is not clearhow to go aboutidentifying them,exceptby noticing themin thecontext of instead.

Finally, I shouldcommenton relationalfeaturesthatderive from thepair ofstructuralandanaphoricargumentsto instead– for example,whetherthe twohavethesamesurfacesubject(asin most,but notall, of theexamplesabove),orrelatedsubjects,asin Example45.

(45) In an abruptreversal,the United StatesandBritain have indefinitely putoff their planto allow Iraqi oppositionforcesto form a nationalassemblyandaninterimgovernmentby theendof themonth.Instead, topAmericanandBritish diplomatsleadingreconstructioneffortsheretold exile leadersin ameetingtonightthatalliedofficialswouldremainin chargeof Iraqforanindefiniteperiod,saidIraqiswhoattendedthemeeting.

While relational features appearrelevant to resolving instead, they were notincludedin our original featureset.But it is clearthatrelationalfeaturesshouldbe includedas well. The context in which we will examinetheseand otherfeaturesis thePennDiscourseTreeBank.

5 Penn Discourse TreeBank

ThePennDiscourseTreeBank(http://www.cis.upenn.edu/� pdtb)aimsto dofordiscoursewhatthePennTreeBankhasdonefor sentence-level processing– thatis, to provide a sharableresourcefor thedevelopmentof automatedtechniquesof discourseanalysisandgeneration.Thevalueof a TreeBankcomesfrom the

Page 33: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 31

“knowledge” addedto it, over and beyond its sequenceof sentences.Whencompleteandreleased(aroundNovember2005),it is expectedto haveapproxi-mately20,000annotationsof the250typesof explicit connectivesidentifiedinthecorpus,and10,000annotationsof implicit conenctives(seebelow).

Creatingthe PennDiscourseTreeBank(PDTB) involvesmanuallyidenti-fying, annotatingandassessinginter-annotatoragreementon (a) all discourseconnectivesin the PennTreeBank,and(b) the text segmentsfrom which eachconnective draws its arguments(Miltsakaki et al., 2004). While the PDTB re-flects the theoreticalbiasof D-LTAG in termsof a lexical basisfor discourseanalysisanddifferenttypesof discourseconnectives,the instructionsto anno-tators13 only requirethemto identify theminimal spansof text whosemeaningis involved in the useof a particularconnective. Thesespansmay cover interalia anembeddedclause,asin thefirst (anaphoric)argumentto insteadin Ex-ample46,a previous(non-adjacent)clause,asin thefirst (anaphoric)argumentto otherwisein Example47,or theimmediatelyprecedingsentenceor clause,asin Example48.

(46) AnneCompocciawanted�to beanun� .

Instead,�shefoundherselfin prisonfor embezzlingcity funds� .

(47)�If thelight is red� , stop.

Otherwise,�just continuedown theroad.�

(48)�Thereareno separateraftersin aflat roof� ;

instead,�theceiling joistsof thetop storysupporttheroofing.�

(Otherpossibilitiesinclude the immediatelyprecedingdiscourse,a string thatdoesn’t correspondto anexisting syntacticconstituent,or evena discontinuousstring.)

PDTB annotationis producedusingWordFreak14, anannotationtool devel-opedby Tom Morton andthenmodifiedby JeremyLacivita to satisfytheneedsof PDTB annotation.To supportmulti-level analysis,annotationis renderedinXML as“stand-off ” annotation,alignedwith similar stand-off versionsof thePennTreebanksyntacticannotationand the predicate-argumentannotationofPropBank(Kingsbury & Palmer,2002). In thefirst tranchof connectivesto beannotatedwere the discourseadverbialsinstead, otherwise, nevertheless, as a

13http://www.cis.upenn.edu/� pdtb/manual/pdtb-tutorial.pdf14http://www.sourceforge.net/projects/wordfreak

Page 34: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

32 B O N N I E W E B B E R

resultandtherefore, andthesubordinateconjunctionsbecause(both aloneandwhenprecededby partly, in part, only, just or largely), although, eventhough,when(bothaloneandwhenprecededby just, only, evenor largely) andsothat.

In addition, the PDTB is annotatingimplicit connectivesbetweenadjacentsectionswith no explicit connective betwenthem. Here,the two sentencesaretaken to be the two arguments,andthe annotatorsareasked to provide, wherepossible,anexplicit connectivethatcapturestheinferredrelationbetweenthem.For example,

(49)�The $6 billion that some40 companiesare looking to raisein the year

endingMarch 31 compareswith only $2.7 billion raisedon the capitalmarket in thepreviousfiscalyear� . IMPLICIT-(In contrast)

�In fiscal1984

beforeMr. Gandhicameto power, only $810million wasraised� .Thefinal versionof thePDTBwill alsocontaincharacterizationsof thesemanticrolesassociatedwith theargumentsof eachtypeof connective, similar to bothPropBankannotationof thesemanticrolesof verbs(Kingsbury & Palmer,2002)andNomBankannotationof thesemanticrolesof nouns(Meyerset al., 2004).Suchrole annotationswill allow softwarerunningover thePDTB to distinguishbetweendifferentsensesof aconnective(e.g.,temporalversusconcessivewhile)or, for example,to backoff to all connectivesthatsharethesamesetof semanticroles.

Furtherdiscussionof thePDTB,its annotationguidelinesandlevelsof inter-annotatoragreementcanbefoundin (Miltsakaki et al., 2004)and(Prasadet al.,2004).

ThePennDiscourseTreeBankis not thefirst or only effort to annotatedis-coursestructure.Efforts to do sostartedover tenyearsago,asa way of provid-ing empirical justificationfor high-level theoriesof discoursestructure(Grosz& Sidner, 1986;Moser& Moore,1996). Althoughmuchtime andenergy wasdevotedto thework (Di Eugenioet al., 1998),theresultshave not beenwidelyusedin the computationalarena,unlike the PennTreeBank. It is hopedthatcurrenteffortswill not suffer this fate.

The work closestto the PennDiscourseTreeBankin Englishis the corpusdevelopedby CarlsonandMarcu and their colleagues(Marcu, 1999;Carlsonetal.,2002)basedonRhetoricalStructureTheory(Mann& Thompson,1988).15

RSTis a theoryof discourseanalysisthatholdsthat (1) thereis a specifiedset

15For German,thereis now a similar effort to annotatediscourseconnectivesaspart of thePotsdamCommentaryCorpus(Stede,2004).

Page 35: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 33

of rhetoricalrelationsthatcanhold betweenadjacentunitsof discourse;(2) ad-jacentunitsof discoursearerelatedby a singlerhetoricalrelationthataccountsfor thesemanticor pragmatic(intentional)senseassociatedwith theiradjacency;(3) unitssorelatedform largerunits thatparticipatein rhetoricalrelationswithunits that they themselvesare adjacentto; and (4) in many, but not all, suchjuxtapositions,oneof theunits(thesatellite)providessupportfor theother(thenucleus),whichthenappearsto bethebasisfor rhetoricalrelationsthatthelargerunit participatesin. Giventheseprinciples,themainaspectsof RSTannotationare(1) demarcatingtheelementarydiscourseunitsthatunderpintherepresenta-tion; (2) identifying how they fit togetherinto larger spans;and(3) annotatingtheparticularrhetoricalrelationthatholdsbetweenelementsthat form a largerspan.

The RST-annotatedcorpus16 differs from the PennDiscourseTreeBankinseveralways– themostsignificantbeingthedifferencein theoreticalperspec-tive. The RST-corpusis basedon an a priori set of rhetoricalrelations,andannotatorsaregivenspecificinstructionsasto wheneachshouldbe chosenastheannotationfor a text. In contrast,thePDTB is groundedin thecorpusitself:While annotatorsmaybe instructedasto whento considera particulartokenadiscourseconnective(asopposedto e.g.awh-complementizeror a relativepro-noun),onceatokenis judgedto beaconnective,theannotators’job is to identifyits two argumentsin thecorpus.Operationally, this meansthatRSTannotationstartswith identifying discourseunits and then selectingwhat rhetoricalrela-tionsholdsbetweenthem,while PDTB startswith identifying connectivesandthenwhatit is thatthey connect.

We arenot downplaying the importanceof having an annotatedcorpusofcoherencerelationsassociatedwith adjacentdiscourseunits. But we believethat the taskof producingsucha corpuscanbemadeeasierby having alreadyidentifiedthehigher-orderpredicate-argumentrelationsassociatedwith explicitdiscourseconnectives.They canthenbefactoredinto thecalculationor removedfrom thecalculation,asappropriate(Webberet al., 2003).

6 Conclusion

This paperhasreviewed our work on a lexicalizedgrammarfor low-level dis-course,explainingwhathasmotivatedthework andwhatit achieves,including

16distributednow by theLinguistic DataConsortium,http://www.ldc.upenn.edu

Page 36: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

34 B O N N I E W E B B E R

� allowing us to make specificgeneralizationsabouthow lexico-syntacticelementscontribute to the syntaxand semanticsof both the clauseanddiscourse,andhow thosecontributionsmayinteract.

� openingup the (still to be realised)possibility of allowing sentencepro-cessingandlow-level discourseprocessingto beintegrated.

� allowing usto developalarge,reliablyannotatedcorpusin whichthebasisfor annotationdecisions– discourseconnectives (viewed as predicates)andtheirarguments– is clear.

For thenext few years,thePennDiscourseTreeBankis the futureof D-LTAG.It will provideaGoldStandardfor furtherparserdevelopmentfor D-LTAG, andthroughits integrationwith thePennTreeBankandPropBank,enablethedevel-opmentof data-intensive,probabilisticmethodsfor resolvinganaphoricconnec-tives.It will undoubtedlybeasourceof interestingdataandinterestingideasformany yearsto come.

Acknowledgements

Much of thematerialin this articlederivesfrom talksI havegivenandfrompapersco-authoredby membersof theD-LTAG group— CassandreCreswell,KatherineForbes,Eleni Miltsakaki, RashmiPrasadandAravind Joshi,at theUniversity of Pennsylvania,andmyself, at the University of Edinburgh. Thepaperhasalsogainedfrom on-goingdiscussionsamonggroupmembers,in con-nectionwith thedevelopmentof thePennDiscourseTreeBank,from commentsfrom Mark Steedman,and from suggestionsfrom FernandaFerreiraand twoother(anonymous)CognitiveSciencereviewers.ThePennDiscourseTreeBankprojectis partially supportedby NSFGrantEIA 0224417(Joshi).

References

Asher, N. (1993). Referenceto Abstract Objectsin Discourse. BostonMA:Kluwer.

Asher, N. & Lascarides,A. (1998).Thesemanticsandpragmaticsof presuppo-sition. Journalof Semantics, 15(3), 239–300.

Bangalore,S.& Joshi,A. (1999).Supertagging:An approachto almostparsing.ComputationalLinguistics, 25(2), 237–265.

Page 37: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 35

Carlson,L., Marcu,D., & Okurowski,M. E.(2002).Building adiscourse-taggedcorpusin theframework of rhetoricalstructuretheory. In Proceedingsof the2nd SIGdialWorkshoponDiscourseandDialogue, Aalborg, Denmark.

Clark, S. & Hockenmaier, J. (2002). Evaluatinga wide-coverageCCG parser.In Proceedingsof the LRECWorkshopon BeyondParseval, pp. 60–66,LasPalmas,Spain.

Creswell,C., Forbes,K., Miltsakaki, E., Prasad,R., Joshi,A., & Webber, B.(2002).Thediscourseanaphoricpropertiesof connectives.In ProceedingsoftheDiscourseAnaphora andAnaphorResolutionColloquium, Lisbon,Portu-gal.

Cristea,D. & Webber, B. (1997). Expectationsin incrementaldiscoursepro-cessing. In Proceedingsof the 35th AnnualMeetingof the AssociationforComputationalLinguistics(ACL97/EACL97), pp.88–95,Madrid,Spain.

Di Eugenio,B., Jordan,P. W., Moore, J. D., & Thomason,R. H. (1998). Anempiricalinvestigationof proposalsin collaborativedialogues.In Proceedingsof COLING/ACL’98, pp.325–329,Montreal,Canada.

Forbes,K. (2003).Discoursesemanticsof s-modifyingadverbials.Ph.D.thesis,Departmentof LinguisticsUniversityof Pennsylvania.

Forbes,K., Miltsakaki, E., Prasad,R., Sarkar, A., Joshi, A., & Webber, B.(2001).D-LTAG System– discourseparsingwith a lexicalizedtree-adjoininggrammar. In ESSLLI’2001Workshopon Information Structure, DiscourseStructureandDiscourseSemantics, Helsinki,Finland.

Forbes,K. & Webber, B. (2002).A semanticaccountof adverbialsasdiscourseconnectives.In Proceedingsof Third SIGDialWorkshop, pp.27–36,Philadel-phiaPA.

Forbes-Riley, K., Webber, B., & Joshi,A. (2004).Computingdiscourseseman-tics: Thepredicate-argumentsemanticsof discourseconnectivesin D-LTAG.Submittedto theJournalof Semantics.

Gardent,C. (1997). Discoursetree adjoining grammars. Clausreport nr.89,Universityof theSaarland,Saarbr̈ucken.

Page 38: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

36 B O N N I E W E B B E R

Grosz,B. & Sidner, C. (1986). Attention, intentionand the structureof dis-course.ComputationalLinguistics, 12(3), 175–204.

Hockenmaier, J.& Steedman,M. (2002).Generativemodelsfor statisticalpars-ing with combinatorycategorial grammar. In Proceedingsof 40th AnnualMeetingof theAssociationfor ComputationalLinguistics, PhiladelphiaPA.

Hovy, E. (1988). Generating Natural Language underPragmaticConstraints.HillsdaleNJ:LawrenceErlbaumAssociates.

Joshi,A. (1987). An introductionto TreeAdjoining Grammar. In A. Manaster-Ramer(ed.),Mathematicsof Language, pp. 87–114.Amsterdam:JohnBen-jamins.

Kingsbury, P. & Palmer, M. (2002).Fromtreebankto propbank.In Proceedingsof the 3rd International Conferenceon Language Resourcesand Evalution(LREC), LasPalmas.

Kiparsky, P. & Kiparsky, C. (1970). Fact. In M. Bierwisch& K. E. Heidolph(eds.),Progressin Linguistics, pp.143–173.Mouton.

Knott, A., Oberlander, J.,O’Donnell, M., & Mellish, C. (2001). Beyondelabo-ration: Theinteractionof relationsandfocusin coherenttext. In T. Sanders,J.Schilperoord,& W. Spooren(eds.),Text Representation:Linguisticandpsy-cholinguisticaspects, pp.181–196.JohnBenjaminsPublishing.

Linde, C. (1974). The linguistic encodingof spatialinformation. Ph.D.thesis,Departmentof LinguisticsColumbiaUniversity.

Mann,W. & Thompson,S. (1988). Rhetoricalstructuretheory:Towarda func-tional theoryof text organization.Text, 8(3), 243–281.

Marcu,D. (1999). Instructionsfor manuallyannotatingthediscoursestructureof texts. Availableat http://www.isi.edu/˜marcu/software/manual.ps.gz, Ac-cessedon 28 May 2004.

Marcu,D. (2000).Thetheoryandpracticeof discourseparsingandsummariza-tion. MIT Press.

Page 39: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 37

McKeown, K. (1985). Text Generation: Using DiscourseStrategiesand Fo-cusConstraints to Generate Natural Language Texts. Cambridge,England:CambridgeUniversityPress.

Melcuk, I. (1988).Dependencysyntax:TheoryandPractice. Albany NY: StateUniversityof New York.

Meyers,A., Reeves,R.,Macleod,C.,Szekely, R.,Zilinska,V., Young,B., & Gr-ishman,R. (2004).TheNomBankproject:An interimreport.In NAACL/HLTWorkshopon Frontiers in CorpusAnnotation, BostonMA.

Miltsakaki, E., Creswell, C., Forbes, K., Joshi, A., & Webber, B. (2003).Anaphoricargumentsof discourseconnectives: Semanticpropertiesof an-tecedentsversusnon-antecedents.In EACL Workshopon ComputationalTreatmentof Anaphora, Budapest,Hungary.

Miltsakaki,E.,Prasad,R.,Joshi,A., & Webber, B. (2004).Annotatingdiscourseconnectivesandtheir arguments. In NAACL/HLT Workshopon Frontiers ofCorpusAnnotation, BostonMA.

Moore, J. (1990). Participating in ExplanatoryDialogues. BostonMA: MITPress.

Moser, M. & Moore,J. (1996).Towardasynthesisof two accountsof discoursestructure.ComputationalLinguistics, 22(3), 409–419.

Polanyi, L. & van denBerg, M. H. (1996). Discoursestructureanddiscourseinterpretation. In P. Dekker & M. Stokhof (eds.),Proceedingsof the TenthAmsterdamColloquium, pp.113–131,Universityof Amsterdam.

Prasad,R., Miltsakaki, E., Joshi,A., & Webber, B. (2004). Annotationanddataminingof thePennDiscourseTreeBank.In ACL WorkshoponDiscourseAnnotation, Barcelona,Spain.

Quirk, R., Greenbaum,S.,Leech,G., & Svartik, J. (1972).A Grammarof Con-temporary English. Harlow: Longman.

Sarkar, A. (2000). Practicalexperimentsin parsingusingtree-adjoininggram-mars.In Proceedingsof the5th TAG+ Workshop, pp.193–198.

Page 40: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

38 B O N N I E W E B B E R

Scha,R. & Polanyi, L. (1988). An augmentedcontext free grammarfor dis-course.In Proceedingsof the12th InternationalConferenceonComputationalLinguistics(COLING’88), pp.573–577,Budapest,Hungary.

Schabes,Y. (1990). Mathematicaland computationalaspectsof lexicalizedgrammars.Ph.D.thesis,Departmentof ComputerandInformationScience,Universityof Pennsylvania.

Schilder, F. (1997). Treediscoursegrammar, or how to get attachedto a dis-course. In Proceedingsof the SecondInternationalWorkshopon Computa-tional Semantics, Tilburg, Netherlands.

Sibun, P. (1992). Generatingtext without trees. ComputationalIntelligence,8(1), 102–122.

Soon,W. M., Ng, H. T., & Lim, D. (2001). A machinelearningapproachtocoreferenceresolutionof noun phrases. ComputationalLinguistics, 27(4),521–544.

Stede,M. (2004). The PotsdamCommentaryCorpus. In ACL WorkshoponDiscourseAnnotation, Barcelona,Spain.

Steedman,M. (1996). SurfaceStructure and Interpretation. CambridgeMA:Linguistic Inquiry Monograph30,MIT Press.

Steedman,M. (2000).Informationstructureandthesyntax-phonologyinterface.LinguisticInquiry, 34, 649–689.

Webber, B. (1991). Structureandostensionin the interpretationof discoursedeixis. LanguageandCognitiveProcesses, 6(2), 107–135.

Webber, B. & Joshi,A. (1998).Anchoringa lexicalizedtree-adjoininggrammarfor discourse.In Coling/ACLWorkshoponDiscourseRelationsandDiscourseMarkers, pp.86–92,Montreal,Canada.

Webber, B., Joshi,A., & Knott, A. (2000). Theanaphoricnatureof certaindis-courseconnectives.In MakingSense:fromLexemeto Discourse, Groningen,TheNetherlands.

Page 41: Author: Bonnie Webber Title: D-LTAG: Extending lexicalized TAG to Discourse …pdtb/papers/cogsci04.pdf · 2004-06-04 · Title: D-LTAG: Extending lexicalized TAG to Discourse Affiliation:

D - L T A G 39

Webber, B., Knott, A., & Joshi,A. (2001). Multiple discourseconnectivesina lexicalizedgrammarfor discourse.In H. Bunt, R. Muskens,& E. Thijsse(eds.),ComputingMeaning(Volume2), pp.229–249.Kluwer.

Webber, B., Knott, A., Stone,M., & Joshi,A. (1999a). Discourserelations:AstructuralandpresuppositionalaccountusinglexicalisedTAG. In Proceedingsof the36th AnnualMeetingof theAssociationfor ComputationalLinguistics,pp.41–48,CollegeParkMD.

Webber, B., Knott,A., Stone,M., & Joshi,A. (1999b).Whatarelittle treesmadeof: A structuralandpresuppositionalaccountusinglexicalisedTAG. In Pro-ceedingsof InternationalWorkshoponLevelsof Representationin Discourse(LORID’99), pp.151–156,Edinburgh.

Webber, B., Stone,M., Joshi,A., & Knott, A. (2003). Anaphoraanddiscoursestructure.ComputationalLinguistics, 29(4), 545–587.

Xia, F., Palmer, M., & Joshi,A. (2000). A uniform methodof grammarex-tractionandits applications. In Proceedingsof the SIGDAT ConferenceonEmpiricalMethodsin Natural LanguageProcessing(EMNLP), HongKong.

XTAG-Group (2001). A Lexicalized Tree Adjoining Grammar for En-glish. Tech. Rep. IRCS 01-03, University of Pennsylvania. Available atftp://ftp.cis.upenn.edu/pub/ircs/technical-reports/01-03,Accessedon 28 May2004.