Top Banner
06/23/22 CPSC503 Winter 2007 1 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini
49

6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

Dec 19, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 1

CPSC 503Computational Linguistics

Lecture 10Giuseppe Carenini

Page 2: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 2

Knowledge-Formalisms Map(including probabilistic formalisms)

Logical formalisms (First-Order Logics)

Rule systems (and prob. versions)(e.g., (Prob.) Context-Free

Grammars)

State Machines (and prob. versions)

(Finite State Automata,Finite State Transducers, Markov Models)

Morphology

Syntax

PragmaticsDiscourse and

Dialogue

Semantics

AI planners

Page 3: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 3

Next three classes

• What meaning is and how to represent it

• How to map sentences into their meaning

• Meaning of individual words (lexical semantics)

• Computational Lexical Semantics Tasks– Word sense disambiguation– Word Similarity– Semantic Labeling

Page 4: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 4

Today 16/10

• Semantics / Meaning /Meaning Representations

• Linguistically relevant Concepts in FOPC / POL

• Semantic Analysis

Page 5: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 5

SemanticsDef. Semantics: The study of the meaning of words,

intermediate constituents and sentences

Def1. Meaning: a representation that expresses the linguistic input in terms of objects, actions, events, time, space… beliefs, attitudes...relationships

Def2. Meaning: a representation that links the linguistic input to knowledge of the world

Language independent!

Page 6: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 6

Semantic Relations involving Sentences

Paraphrase: have the same meaning• I gave the apple to John vs. I gave John the apple• I bought a car from you vs. you sold a car to me• The thief was chased by the police vs. ……

Same truth conditions

Entailment: “implication”• The park rangers killed the bear vs. The bear is dead• Nemo is a fish vs. Nemo is an animal

Contradiction: I am in Vancouver vs. I am in India

Page 7: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 7

Meaning Structure of Language

• How does language convey meaning?– Grammaticization

– Display a partially compositional semantics

– Display a basic predicate-argument structure (e.g., verb complements)

– Words

Page 8: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 8

GrammaticizationConcept Affix

• Past• More than one• Again• Negation

• -ed• -s• re-• in-, un-, de-

Words from Nonlexical categories• Obligation• Possibility• Definite, Specific• Indefinite, Non-specific• Disjunction• Negation• Conjunction

• must• may• the• a• or• not• and

Page 9: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 9

Common Meaning Representations

FOL

Semantic Nets

FramesConceptual Dependency

I have a car

Page 10: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 10

Requirements for Meaning Representations

e.g, • Does Maharani serve vegetarian food? -> Yes• What restaurants are close to the ocean?-> C and Monks

• Sample NLP Task: giving advice about restaurants– Accept queries in NL– Generate appropriate responses by

consulting a KB

Page 11: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 11

Verifiability (in the world?)

• Example: Does LeDog serve vegetarian food?• Knowledge base (KB) expressing our world

model (in a formal language)

• Convert question to KB language and verify its truth value against the KB content

Yes / No / I do not know

Page 12: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 12

Canonical Form

Paraphrases should be mapped into the same representation.

• Does LeDog have vegetarian dishes?• Do they have vegetarian food at LeDog?• Are vegetarian dishes served at LeDog?• Does LeDog serve vegetarian fare?• ……………

Page 13: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 13

How to Produce a Canonical Form

• Words have different senses – food ___– dish ___|____one overlapping meaning sense– fare ___|

• Meaning of alternative syntactic constructions are systematically related server thing-being-served– [S [NP Maharani] serves [NP vegetarian dishes]] thing-being-served server [S [NP vegetarian dishes] are served at [NP Maharani]]

Page 14: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 14

Inference and Expressiveness• Consider a more complex request

– Can vegetarians eat at Maharani?– Vs: Does Maharani serve vegetarian food?

• Why do these result in the same answer?

• Inference: System’s ability to draw valid conclusions based on the meaning representations of inputs and its KB

• serve(Maharani,VegetarianFood) => CanEat(Vegetarians,At(Maharani))

Expressiveness: system must be able to handle a wide range of subject matter

Page 15: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 15

Non Yes/No Questions

• Example: I'd like to find a restaurant where I can get vegetarian food.

• Indefinite reference <-> variable serve(x,VegetarianFood)

• Matching succeeds only if variable x can be replaced by known object in KB.

What restaurants are close to the ocean?-> C and Monks

Page 16: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 16

Meaning Structure of Language

• How does language convey meaning?– Grammaticization

– Display a partially compositional semantics

– Display a basic predicate-argument structure (e.g., verb complements)

– Words

Page 17: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 17

Predicate-Argument Structure

• Subcategorization frames specify number, position, and syntactic category of arguments

• Examples: give NP2 NP1, find NP, sneeze []

• Represent relationships among concepts

• Some words act like arguments and some words act like predicates:– Nouns as concepts or arguments: red(ball)– Adj, Adv, Verbs as predicates: red(ball)

Page 18: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 18

Semantic (Thematic) Roles

• Semantic Roles: Participants in an event– Agent: George hit Bill. Bill was hit by George– Theme: George hit Bill. Bill was hit by George

Source, Goal, Instrument, Force…

This can be extended to the realm of semantics

• Verb subcategorization: Allows linking arguments in surface structure with their semantic roles• Mary gave/sent/read a book to Ming

Agent Theme Goal• Mary gave/sent/read Ming a book Agent Goal Theme

Page 19: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 19

Non-verbal predicate-argument structures

• Semantic (Selectional) Restrictions: Constrain the types of arguments verbs take– George assassinated the senator– *The spider assassinated the fly

Selectional Restrictions

• A Spanish restaurant under the bridge

Under(SpanishRestaurant,

bridge)

Page 20: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 20

First Order Predicate Calculus (FOPC)

• FOPC provides sound computational basis for verifiability, inference, expressiveness…– Supports determination of truth– Supports Canonical Form– Supports compositionality of meaning– Supports question-answering (via variables)– Supports inference– Argument-Predicate structure

Page 21: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 21

Today 16/10

• Semantics / Meaning /Meaning Representations

• Linguistically relevant Concepts in FOPC / POL

• Semantic Analysis

Page 22: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 22

Linguistically Relevant Concepts in FOPC

• Categories & Events (Reification)• Representing Time• Beliefs (optional, read if relevant to your project)

• Aspects (optional, read if relevant to your project)

• Description Logics (optional, read if relevant to your project)

Page 23: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 23

Categories & Events

• Events: eg. “Make a reservation”– Reservation (Speaker,Joe’s,Today,8PM,2)– Problems:

• Determining the correct number of roles• Representing facts about the roles associated

with an event• Ensuring that all and only the correct

inferences can be drawn

• Categories:– VegetarianRestaurant (Joe’s) - relation vs. object– MostPopular(Joe’s,VegetarianRestaurant)

Reification– ISA (Joe’s,VegetarianRestaurant)– AKO (VegetarianRestaurant,Restaurant)

Page 24: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 24

MUC-4 Example

INCIDENT: DATE 30 OCT 89 INCIDENT: LOCATION EL SALVADOR INCIDENT: TYPE ATTACK INCIDENT: STAGE OF EXECUTION ACCOMPLISHED INCIDENT: INSTRUMENT ID INCIDENT: INSTRUMENT TYPEPERP: INCIDENT CATEGORY TERRORIST ACT PERP: INDIVIDUAL ID "TERRORIST" PERP: ORGANIZATION ID "THE FMLN" PERP: ORG. CONFIDENCE REPORTED: "THE FMLN" PHYS TGT: ID PHYS TGT: TYPEPHYS TGT: NUMBERPHYS TGT: FOREIGN NATIONPHYS TGT: EFFECT OF INCIDENTPHYS TGT: TOTAL NUMBERHUM TGT: NAMEHUM TGT: DESCRIPTION "1 CIVILIAN"HUM TGT: TYPE CIVILIAN: "1 CIVILIAN"HUM TGT: NUMBER 1: "1 CIVILIAN"HUM TGT: FOREIGN NATIONHUM TGT: EFFECT OF INCIDENT DEATH: "1 CIVILIAN"HUM TGT: TOTAL NUMBER

On October 30, 1989, one civilian was killed in a reported FMLN attack in El Salvador.

Page 25: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 25

Subcategorization frames• I ate• I ate a turkey sandwich• I ate a turkey sandwich at my desk• I ate at my desk• I ate lunch• I ate a turkey sandwich for lunch• I ate a turkey sandwich for lunch at my

desk

no fixed “arity”!

Page 26: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 26

Reification Again

• Reification Advantages:– No need to specify fixed number of

arguments for a given surface predicate– No more roles are postulated than

mentioned in the input– Logical connections among related

examples are specified

“I ate a turkey sandwich for lunch” w: Isa(w,Eating) Eater(w,Speaker)

Eaten(w,TurkeySandwich) MealEaten(w,Lunch)

Page 27: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 27

Representing Time• Events are associated with points or

intervals in time.• We can impose an ordering on distinct

events using the notion of precedes.

• Temporal logic notation: (w,x,t) Arrive(w,x,t)

• Constraints on variable tI arrived in New York(t) Arrive(I,NewYork,t) precedes(t,Now)

Page 28: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 28

Interval Events• Need tstart and tend

“She was driving to New York until now”

tstart,tend ,e, i ISA(e,Drive) Driver(e, She)

Dest(e, NewYork) IntervalOf(e,i)Endpoint(i, tend) Startpoint(i, tend)

Precedes(tstart,Now) Equals(tend,Now)

Page 29: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 29

Relation Between Tenses and Time• Relation between simple verb

tenses and points in time is not straightforward

• Present tense used like future:– We fly from Baltimore to Boston at 10

• Complex tenses:– Flight 1902 arrived late– Flight 1902 had arrived late

Page 30: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 30

Reference Point

• Reichenbach (1947) introduced notion of Reference point (R), separated out from Utterance time (U) and Event time (E)

• Example:– When Mary's flight departed, I ate lunch– When Mary's flight departed, I had eaten

lunch

• Departure event specifies reference point.

Page 31: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 31

Today 16/10

• Semantics / Meaning /Meaning Representations

• Linguistically relevant Concepts in FOPC / POL

• Semantic Analysis

Page 32: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 32

Semantic Analysis

Syntax-drivenSemantic Analysis

Sentence

Literal Meaning

DiscourseStructure

Meanings of words

Meanings of grammatical structures

Context

Common-SenseDomain knowledge

Intended meaning

FurtherAnalysis

INFERENCE

Page 33: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 33

Compositional Analysis

• Principle of Compositionality– The meaning of a whole is derived from

the meanings of the parts

• What parts?– The constituents of the syntactic parse

of the input

• What could it mean for a part to have a meaning?

Page 34: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 34

Compositional Analysis: Example

• AyCaramba serves meat

),()^,()^( MeateServedAyCarambaeServereServinge

Page 35: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 35

Augmented Rules

• Augment each syntactic CFG rule with a semantic formation rule

• The class of actions performed by f will be quite restricted.

)}.,....({... 11 semsemfA nn • Abstractly

• i.e., The semantics of A can be computed from some function applied to the semantics of its parts.

Page 36: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 36

Simple Extension of FOL: Lambda Forms

– Lambda-reduction: variables are bound by treating the lambda form as a function with formal arguments

)(),( yCountryyxyInx

)())((

SallyPSallyxxP

)(),())((),(

yCountryyBCyInBCyCountryyxyInx

)(),( yCountryyBCyIn

)(),())((),(

CANADACountryCANADABCInCANADAyCountryyBCyIn

)(xxP– A FOL sentence with variables in it that are to be bound.

Page 37: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 37

Augmented Rules: Example

– PropNoun -> AyCaramba– MassNoun -> meat

• Attachments{AyCaramba}{MEAT}

assigning constants• Easy parts…

copying from daughters up to mothers.

– NP -> PropNoun– NP -> MassNoun

• Attachments{PropNoun.sem}{MassNoun.sem

}

Page 38: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 38

Augmented Rules: Example

• Verb -> serves

• {VP.sem(NP.sem)}• {Verb.sem(NP.sem)

),(^),(^)(

xeServedyeServereServingeyx

Semantics attached to one daughter is applied to semantics of the other

daughter(s).• S -> NP VP• VP -> Verb NP

lambda-form

Page 39: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 39

Example

• S -> NP VP• VP -> Verb NP• Verb -> serves• NP -> PropNoun• NP -> MassNoun• PropNoun -> AyCaramba• MassNoun -> meat

• {VP.sem(NP.sem)}• {Verb.sem(NP.sem)

• {PropNoun.sem}• {MassNoun.sem}• {AC}• {MEAT}

),()^,()^( xeServedyeServereServingeyx

MEAT

MEAT

…….

y y

AC

Page 40: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 40

Full story more complex• To deal properly with quantifiers

– Permit lambda-variables to range over predicates. E.g.,

)(. xPxP

)yMenuyHad(e,

)xRestaurantxeHaver

eeHaving

)(

)(,(

)(

– Introduce complex terms to remain agnostic about final scoping

Page 41: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 41

• Similarly to PP attachment, number of possible interpretations exponential in the number of complex terms

Solution: Quantifier Scope Ambiguity

• likelihood of different orderings• Mirror surface ordering• Domain specific knowledge

• Weak methods to prefer one interpretation over another:

Page 42: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 42

Attachments for a fragment of English (Sect. 18.5)

• Sentences• Noun-phrases• Verb-phrases• Prepositional-phrases

Based on “The core Language Engine” 1992

Page 43: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 43

Integration with a Parser• Assume you’re using a dynamic-

programming style parser (Earley or CYK).

• Two basic approaches– Integrate semantic analysis into the

parser (assign meaning representations as constituents are completed)– Pipeline… assign meaning representations to complete trees only after they’re completed

Page 44: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 44

Pros and Cons• Integration

– use semantic constraints to cut off parses that make no sense

– assign meaning representations to constituents that don’t take part in any correct parse

• Pipeline– assign meaning representations only to

constituents that take part in a correct parse

– parser needs to generate all correct parses

Page 45: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 45

Next Time

• Read Chp. 19 (Lexical Semantics)

Page 46: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 46

Non-Compositionality

• Unfortunately, there are lots of examples where the meaning of a constituent can’t be derived from the meanings of the parts

- metaphor, (e.g., corporation as person)– metonymy, (??)– idioms, – irony, – sarcasm, – indirect requests, etc

Page 47: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 47

English Idioms

• “buy the farm” • “bite the bullet” • “bury the hatchet” • etc…

• Lots of these… constructions where the meaning of the whole is either – Totally unrelated to the meanings of the parts

(“kick the bucket”)– Related in some opaque way (“run the show”)

Page 48: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 48

The Tip of the Iceberg– “Enron is the tip of the iceberg.”NP -> “the tip of the iceberg” {….}

– “the tip of an old iceberg”– “the tip of a 1000-page iceberg”– “the merest tip of the iceberg”

NP -> TipNP of IcebergNP {…}TipNP: NP with tip as its head IcebergNP NP with iceberg as its head

Page 49: 6/3/2015CPSC503 Winter 20071 CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

04/18/23 CPSC503 Winter 2007 49

Handling Idioms– Mixing lexical items and grammatical

constituents– Introduction of idiom-specific constituents– Permit semantic attachments that introduce

predicates unrelated with constituents

NP -> TipNP of IcebergNP {small-part(), beginning()….}

TipNP: NP with tip as its head IcebergNP NP with iceberg as its head