Top Banner
What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute Carnegie Mellon University
48

What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

Dec 21, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

What is the Jeopardy Model?A Quasi-Synchronous Grammar

for Question Answering

Mengqiu Wang, Noah A. Smith and Teruko Mitamura

Language Technology InstituteCarnegie Mellon University

Page 2: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

2

The task

High-efficiency document retrieval

High-precision answer ranking

Who is the leader of France?

1. Bush later met with French president Jacques Chirac. 2. Henri Hadjenberg, who is the leader of France ’s Jewish community, …3. …

1. Henri Hadjenberg, who is the leader of France ’s Jewish community, …2. Bush later met with French president Jacques Chirac. (as of May 16 2007)

3. …

Page 3: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

3

Challenges

High-efficiency document retrieval

High-precision answer ranking

Who is the leader of France?

1. Bush later met with French president Jacques Chirac.

2. Henri Hadjenberg, who is the leader of France ’s Jewish community, …3. …

Page 4: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

4

Semantic Tranformations

Q:“Who is the leader of France?”

A: Bush later met with French president Jacques Chirac.

Page 5: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

5

Syntactic Transformations

Who leaderthe Franceofis ?

Bush met Frenchwith president Jacques Chirac

mod mod

mod

Page 6: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

6

Syntactic Variations

Who leaderthe Franceofis ?

Henri Hadjenberb , who leaderis the of France ’s Jewish community

mod mod

mod

mod

Page 7: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

7

Two key phenomena in QA

Semantic transformation leader president

Syntactic transformation leader of France French president

Q A)|( QAP

Page 8: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

8

Existing work in QA

Semantics Use WordNet as thesaurus for expansion

Syntax Use dependency parse trees, but merely

transform the feature space into dependency parse feature space. No fundamental changes in the algorithms (edit-distance, classifier, similarity measure).

Page 9: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

9

Where else have we seen these transformations?

Machine Translation (especially in syntax-based MT)

Paraphrasing Sentence compression Textual entailment

F E)|( FEP

Page 10: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

10

Noisy-channel

Machine Translation

Question Answering

S E)()|()|( EPEFPFEP

Q A)()|()|( APAQPQAP

Language modelTranslation model

retrieval modelJeopardy model

Page 11: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

11

From wikipedia.org: Jeopardy! is a popular international television

quiz game show (#2 of the 50 Greatest Game Show of All

Times). 3 contestants select clues in the form of an

answer, to which they must supply correct responses in the form of a question.

The concept of "questioning answers" is original to Jeopardy!.

What is Jeopardy! ?

)|( AQP

Page 12: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

12

Jeopardy Model

We make use of a formalism called quasi-synchronous grammar [D. Smith

& Eisner ’06], originally developed for MT

Page 13: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

13

Quasi-Synchronous Grammars Based on key observations in MT:

translated sentences often have some isomorphic syntactic structure, but not usually in entirety.

the strictness of the isomorphism may vary across words or syntactic rules.

Key idea: Unlike some synchronous grammars (e.g. SCFG,

which is more strict and rigid), QG defines a monolingual grammar for the target tree, “inspired” by the source tree.

Page 14: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

14

Quasi-Synchronous Grammars In other words, we model the generation of

the target tree, influenced by the source tree (and their alignment)

QA can be thought of as extremely free translation within the same language.

The linkage between question and answer trees in QA is looser than in MT, which gives a bigger edge to QG.

Page 15: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

15

Jeopardy Model Works on labeled dependency parse trees Learn the hidden structure (alignment between Q and

A trees) by summing out ALL possible alignments

One particular alignment tells us both the syntactic configurations and the word-to-word semantic correspondences

An example…

question answer

answerparse tree

questionparse tree

an alignment

Page 16: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

BushNNP

person

metVBD

FrenchJJ

location

presidentNN

Jacques ChiracNNP

person

whoWP

qword

leaderNN

isVB

theDT

FranceNNP

location

Q: A:$

root$

root

root

subj obj

det of

root

subj with

nmod

nmod

Page 17: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

BushNNP

person

metVBD

FrenchJJ

location

presidentNN

Jacques ChiracNNP

person

whoWP

qword

leaderNN

isVB

theDT

FranceNNP

location

Q: A:$

root$

root

root

subj obj

det of

root

subj with

nmod

nmod

Page 18: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

BushNNP

person

metVBD

FrenchJJ

location

presidentNN

Jacques ChiracNNP

person

isVB

Q: A:$

root$

root

root root

subj with

nmod

nmod

root)|P(root

noNE)|P(noNE

VBD)| P(VB

Our model makes local Markov assumptions to allow efficient computation via Dynamic Programming (details in paper)

given its parent, a word is independent of all other words (including siblings).

Page 19: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

BushNNP

person

metVBD

FrenchJJ

location

presidentNN

Jacques ChiracNNP

person

whoWP

qword

isVB

Q: A:$

root$

root

root

subj

root

subj with

nmod

nmod

child)-parent|P(subj

person)|P(qword

NNP)|P(WP

Page 20: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

BushNNP

person

metVBD

FrenchJJ

location

presidentNN

Jacques ChiracNNP

person

whoWP

qword

leaderNN

isVB

Q: A:$

root$

root

root

subj obj

root

subj with

nmod

nmod

child)-tgrandparen|P(obj

noNE)|P(noNE

NN)|P(NN

Page 21: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

BushNNP

person

metVBD

FrenchJJ

location

presidentNN

Jacques ChiracNNP

person

whoWP

qword

leaderNN

isVB

theDT

Q: A:$

root$

root

root

subj obj

det

root

subj with

nmod

nmod

)word-same|P(det

noNE)|P(noNE

N)|P(DT

Page 22: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

BushNNP

person

metVBD

FrenchJJ

location

presidentNN

Jacques ChiracNNP

person

whoWP

qword

leaderNN

isVB

theDT

FranceNNP

location

Q: A:$

root$

root

root

subj obj

det of

root

subj with

nmod

nmod

)child-parent|P(of

location)|P(location

JJ)|P(NNP

Page 23: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

23

6 types of syntactic configurations

Parent-child

Page 24: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

BushNNP

person

metVBD

FrenchJJ

location

presidentNN

Jacques ChiracNNP

person

whoWP

qword

leaderNN

isVB

theDT

FranceNNP

location

Q: A:$

root$

root

root

subj obj

det of

root

subj with

nmod

nmod

Page 25: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

Parent-child configuration

Page 26: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

26

6 types of syntactic configurations

Parent-child Same-word

Page 27: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

BushNNP

person

metVBD

FrenchJJ

location

presidentNN

Jacques ChiracNNP

person

whoWP

qword

leaderNN

isVB

theDT

FranceNNP

location

Q: A:$

root$

root

root

subj obj

det of

root

subj with

nmod

nmod

Page 28: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

Same-word configuration

Parent-child configuration

Page 29: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

29

6 types of syntactic configurations

Parent-child Same-word Grandparent-child

Page 30: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

BushNNP

person

metVBD

FrenchJJ

location

presidentNN

Jacques ChiracNNP

person

whoWP

qword

leaderNN

isVB

theDT

FranceNNP

location

Q: A:$

root$

root

root

subj obj

det of

root

subj with

nmod

nmod

Page 31: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

Parent-child configuration Same-word configuration

Grandparent-child configuration

Page 32: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

32

6 types of syntactic configurations

Parent-child Same-word Grandparent-child Child-parent Siblings C-command(Same as [D. Smith & Eisner ’06])

Page 33: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

Parent-child configuration Same-word configuration Grandparent-child configuration

Child-parent configuration Siblings configuration C-command configuration

Page 34: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

34

Modeling alignment Base model

)child-parent|P(of

location)|P(location

N)|P(N

Page 35: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

BushNNP

person

metVBD

FrenchJJ

location

presidentNN

Jacques ChiracNNP

person

whoWP

qword

leaderNN

isVB

theDT

FranceNNP

location

Q: A:$

root$

root

root

subj obj

det of

root

subj with

nmod

nmod

Page 36: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

BushNNP

person

metVBD

FrenchJJ

location

presidentNN

Jacques ChiracNNP

person

whoWP

qword

leaderNN

isVB

theDT

FranceNNP

location

Q: A:$

root$

root

root

subj obj

det of

root

subj with

nmod

nmod

Page 37: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

37

Modeling alignment cont.

Base model

Log-linear modelLexical-semantic features from WordNet,Identity, hypernym, synonym, entailment, etc.

Mixture model

Page 38: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

38

Parameter estimation

Things to be learnt Multinomial distributions in base model Log-linear model feature weights Mixture coefficient

Training involves summing out hidden structures, thus non-convex.

Solved using conditional Expectation-Maximization

Page 39: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

39

Experiments

Trec8-12 data set for training Trec13 questions for development

and testing

Page 40: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

40

Candidate answer generation

For each question, we take all documents from the TREC doc pool, and extract sentences that contain at least one non-stop keywords from the question.

For computational reasons (parsing speed, etc.), we only took answer sentences <= 40 words.

Page 41: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

41

Dataset statistics Manually labeled 100 questions for training

Total: 348 positive Q/A pairs 84 questions for dev

Total: 1415 Q/A pairs 3.1+, 17.1-

100 questions for testing Total: 1703 Q/A pairs 3.6+, 20.0-

Automatically labeled another 2193 questions to create a noisy training set, for evaluating model robustness

Page 42: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

42

Experiments cont.

Each question and answer sentence is tokenized, POS tagged (MX-POST), parsed (MSTParser) and labeled with named-entity tags (Identifinder)

Page 43: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

43

Baseline systems (replications) [Cui et al. SIGIR ‘05]

The algorithm behind one of the best performing systems in TREC evaluations.

It uses a mutual information-inspired score computed over dependency trees and a single fixed alignment between them.

[Punyakanok et al. NLE ’04] measures the similarity between Q and A by

computing tree edit distance. Both baselines are high-performing, syntax-based,

and most straight-forward to replicate We further enhanced the algorithms by augmenting

them with WordNet.

Page 44: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

44

ResultsMean Average

PrecisionMean Reciprocal

Rank of Top 1

Statistically significantly better than the 2nd best score in each column

28.2% 23.9% 41.2% 30.3%

Page 45: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

45

Summing vs. Max

Page 46: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

46

Conclusion We developed a probabilistic model for QA

based on quasi-synchronous grammar Experimental results showed that our model

is more accurate and robust than state-of-the-art syntax-based QA models

The mixture model is shown to be powerful. The log-linear model allows us to use arbitrary features.

Provides a general framework for many other NLP applications (compression, textual entailment, paraphrasing, etc.)

Page 47: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

47

Future Work Higher-order Markovization, both horizontally

and vertically, allows us to look at more context, at the expense of higher computational cost.

More features from external resources, e.g. paraphrasing database

Extending it for Cross-lingual QA Avoid the paradigm of translation as pre- of post-

processing We can naturally fit in a lexical or phrase

translation probability table into our model to model the translation inherently

Taking into account parsing uncertainty

Page 48: What is the Jeopardy Model? A Quasi-Synchronous Grammar for Question Answering Mengqiu Wang, Noah A. Smith and Teruko Mitamura Language Technology Institute.

48

Thank you!

Questions?