Top Banner
Synchronous context-free grammar
141

Edinburgh MT lecture 12: Synchronous context-free grammar

Jul 25, 2015

Download

Technology

alopezfoo
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

Page 2: Edinburgh MT lecture 12: Synchronous context-free grammar

Phrase-based models

•Exact decoding is NP-hard.

•As a consequence of arbitrary permutation…

•…but real permutations are not arbitrary!

•Parameterization of reordering is weak.

•No generalization!

Page 3: Edinburgh MT lecture 12: Synchronous context-free grammar

Garcia and associates .

Garcia y asociados .Carlos Garcia has three associates .

Carlos Garcia tiene tres asociados .his associates are not strong .

sus asociados no son fuertes .Garcia has a company also .

Garcia tambien tiene una empresa .its clients are angry .

sus clientes estan enfadados .the associates are also angry .

los asociados tambien estan enfadados .

la empresa tiene enemigos fuertes en Europa .

the company has strong enemies in Europe .the clients and the associates are enemies .

los clientes y los asociados son enemigos .the company has three groups .

la empresa tiene tres grupos .its groups are in Europe .

sus grupos estan en Europa .the modern groups sell strong pharmaceuticals .

los grupos modernos venden medicinas fuertes .the groups do not sell zanzanine .

los grupos no venden zanzanina .the small groups are not modern .

los grupos pequenos no son modernos .

Page 4: Edinburgh MT lecture 12: Synchronous context-free grammar

Garcia and associates .

Garcia y asociados .Carlos Garcia has three associates .

Carlos Garcia tiene tres asociados .his associates are not strong .

sus asociados no son fuertes .Garcia has a company also .

Garcia tambien tiene una empresa .its clients are angry .

sus clientes estan enfadados .the associates are also angry .

los asociados tambien estan enfadados .

la empresa tiene enemigos fuertes en Europa .

the company has strong enemies in Europe .the clients and the associates are enemies .

los clientes y los asociados son enemigos .the company has three groups .

la empresa tiene tres grupos .its groups are in Europe .

sus grupos estan en Europa .the modern groups sell strong pharmaceuticals .

los grupos modernos venden medicinas fuertes .the groups do not sell zanzanine .

los grupos no venden zanzanina .the small groups are not modern .

los grupos pequenos no son modernos .

Same pattern:NN JJ → JJ NN

Phrase-based models do not capture this generalization.

Page 5: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

Page 6: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

Page 7: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S

Page 8: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S

Page 9: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S

NP VP

Page 10: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S

NP VP

Page 11: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S

NP VP

watashi wa

Page 12: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S

NP VP

watashi wa

Page 13: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S

NP VP

NP Vwatashi wa

Page 14: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S

NP VP

NP Vwatashi wa

Page 15: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S

NP VP

NP Vwatashi wa

hako wo

Page 16: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S

NP VP

NP Vwatashi wa

hako wo

Page 17: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S

NP VP

NP Vwatashi wa

hako wo akemasu

Page 18: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S

NP VP

NP Vwatashi wa

hako wo akemasu

Page 19: Edinburgh MT lecture 12: Synchronous context-free grammar

Context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S

NP VP

NP Vwatashi wa

hako wo akemasu

watashi wa hako wo akemasu

Page 20: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

Originally: syntax-directed translation (Lewis & Stearns 1966; Aho and Ullman 1969)

Page 21: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S → NP VPNP → INP → the boxVP → V NP V → open

Originally: syntax-directed translation (Lewis & Stearns 1966; Aho and Ullman 1969)

Page 22: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP VPNP → watashi waNP → hako woVP → NP V V → akemasu

S → NP VPNP → INP → the boxVP → V NP V → open

Originally: syntax-directed translation (Lewis & Stearns 1966; Aho and Ullman 1969)

Japanese is SOV. English is SVO.

Page 23: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

Originally: syntax-directed translation (Lewis & Stearns 1966; Aho and Ullman 1969)

Page 24: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

REQUIREMENT:one-to-mapping

between source and target nonterminals,

indicated by coindexes

Originally: syntax-directed translation (Lewis & Stearns 1966; Aho and Ullman 1969)

Page 25: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

Page 26: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

Page 27: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

Page 28: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

Page 29: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

NP VP NP VP

Page 30: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

NP VP NP VP

Page 31: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

NP VP NP VP

Page 32: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

NP VP NP VP

watashi wa I

Page 33: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

NP VP NP VP

watashi wa I

Page 34: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

NP VP NP VP

NP V V NPwatashi wa I

Page 35: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

NP VP NP VP

NP V V NPwatashi wa I

Page 36: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

NP VP NP VP

NP V V NPwatashi wa I

Page 37: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

NP VP NP VP

NP V V NPwatashi wa I

hako wo the box

Page 38: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

NP VP NP VP

NP V V NPwatashi wa I

hako wo the box

Page 39: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammar

S → NP1 VP2 / NP1 VP2

NP → watashi wa / I NP → hako wo / the boxVP → NP1 V2 / V2 NP1

V → akemasu / open

S S

NP VP NP VP

NP V V NPwatashi wa I

akemasu openhako wo the box

Page 40: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammarS S

NP VP NP VP

NP V V NPwatashi wa I

akemasu openhako wo the box

Page 41: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammarS S

NP VP NP VP

NP V V NPwatashi wa I

akemasu openhako wo the box

watashi wa hako wo akemasu

Page 42: Edinburgh MT lecture 12: Synchronous context-free grammar

Synchronous context-free grammarS S

NP VP NP VP

NP V V NPwatashi wa I

akemasu openhako wo the box

watashi wa hako wo akemasu I open the box

Page 43: Edinburgh MT lecture 12: Synchronous context-free grammar

Translation as parsing

watashi wa hako wo akemasu

Page 44: Edinburgh MT lecture 12: Synchronous context-free grammar

Translation as parsingS

NP VP

NP Vwatashi wa

akemasuhako wo

watashi wa hako wo akemasu

Page 45: Edinburgh MT lecture 12: Synchronous context-free grammar

Translation as parsingS

NP VP

NP Vwatashi wa

akemasuhako wo

S

NP VP

V NPI

open the box

watashi wa hako wo akemasu

Page 46: Edinburgh MT lecture 12: Synchronous context-free grammar

Translation as parsingS

NP VP

NP Vwatashi wa

akemasuhako wo

S

NP VP

V NPI

open the box

watashi wa hako wo akemasu I open the box

Page 47: Edinburgh MT lecture 12: Synchronous context-free grammar

Preliminaries

S → NP VPNP → watashi wa NP → hako woVP → NP V V → akemasu

Transform source grammar into Chomsky normal form:all productions in form X → w or X → YZ.

Page 48: Edinburgh MT lecture 12: Synchronous context-free grammar

Preliminaries

S → NP VPNP → watashi wa NP → hako woVP → NP V V → akemasu

Transform source grammar into Chomsky normal form:all productions in form X → w or X → YZ.

S → NP VPNP → X Y

X → watashi Y → wa NP → Z W Z → hako W → woVP → NP V V → akemasu

Page 49: Edinburgh MT lecture 12: Synchronous context-free grammar

Preliminaries

S → NP VPNP → watashi wa NP → hako woVP → NP V V → akemasu

Transform source grammar into Chomsky normal form:all productions in form X → w or X → YZ.

S → NP VPNP → X Y

X → watashi Y → wa NP → Z W Z → hako W → woVP → NP V V → akemasu

Q: how do synchronous productions interact with

this transformation?

Page 50: Edinburgh MT lecture 12: Synchronous context-free grammar

Decoding

Page 51: Edinburgh MT lecture 12: Synchronous context-free grammar

Decoding

•A binary-branching (i.e. CNF) grammar can produce a Catalan number of parses of an input sentence.

Page 52: Edinburgh MT lecture 12: Synchronous context-free grammar

Decoding

•A binary-branching (i.e. CNF) grammar can produce a Catalan number of parses of an input sentence.

O((2n)!

(n + 1)!n!)

Page 53: Edinburgh MT lecture 12: Synchronous context-free grammar

Decoding

•A binary-branching (i.e. CNF) grammar can produce a Catalan number of parses of an input sentence.

•Dynamic programming to the rescue!

O((2n)!

(n + 1)!n!)

Page 54: Edinburgh MT lecture 12: Synchronous context-free grammar

ParsingNN → duck, pato

PRP → I, yo

VBD → saw, vi

PRP$ → her, ella

NP → PRP$1 NN2, PRP$1 NN2

VP → VBD1 NP2, VBD1 NP2

S → PRP1 VP2, PRP1 VP2

PRP → her, su

VB → duck, agacharseSBAR → PRP1 VB2, PRP1 VB2

VP → VBD1 SBAR2, VBD1 SBAR2

Page 55: Edinburgh MT lecture 12: Synchronous context-free grammar

ParsingNN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

Page 56: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

Page 57: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

Page 58: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

Page 59: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

Page 60: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

PRP0,1 � (w1 = I) ⇤ (PRP ⇥ I)Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

Page 61: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

PRP0,1 � (w1 = I) ⇤ (PRP ⇥ I)Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

Page 62: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

Page 63: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

Page 64: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

Page 65: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

PRP2,3

Page 66: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

PRP2,3

NN3,4

VB3,4

Page 67: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

Xi,j � Yi,k ⇤ Zk,j ⇤ (X ⇥ Y Z)

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

PRP2,3

NN3,4

VB3,4

Page 68: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

Xi,j � Yi,k ⇤ Zk,j ⇤ (X ⇥ Y Z)

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

PRP2,3

NN3,4

VB3,4

Page 69: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

Xi,j � Yi,k ⇤ Zk,j ⇤ (X ⇥ Y Z)

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

PRP2,3

NN3,4

VB3,4

Page 70: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

Xi,j � Yi,k ⇤ Zk,j ⇤ (X ⇥ Y Z)

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

PRP2,3

NN3,4

VB3,4

Page 71: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

Xi,j � Yi,k ⇤ Zk,j ⇤ (X ⇥ Y Z)NP2,4 � PRP$2,3 ⇤NN3,4 ⇤ (NP⇥ PRP$ NN)

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

PRP2,3

NN3,4

VB3,4

Page 72: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

Xi,j � Yi,k ⇤ Zk,j ⇤ (X ⇥ Y Z)NP2,4 � PRP$2,3 ⇤NN3,4 ⇤ (NP⇥ PRP$ NN)

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

PRP2,3

NN3,4

VB3,4

NP2,4

Page 73: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

Xi,j � Yi,k ⇤ Zk,j ⇤ (X ⇥ Y Z)

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

PRP2,3

NN3,4

VB3,4

NP2,4

Page 74: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

Xi,j � Yi,k ⇤ Zk,j ⇤ (X ⇥ Y Z)

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

PRP2,3

NN3,4

VB3,4

NP2,4 SBAR2,4

Page 75: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

Xi,j � Yi,k ⇤ Zk,j ⇤ (X ⇥ Y Z)

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

PRP2,3

NN3,4

VB3,4

NP2,4 SBAR2,4

VP1,4

Page 76: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

Xi,j � Yi,k ⇤ Zk,j ⇤ (X ⇥ Y Z)

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

PRP2,3

NN3,4

VB3,4

NP2,4 SBAR2,4

VP1,4

Page 77: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

Xi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)

PRP0,1

Xi,j � Yi,k ⇤ Zk,j ⇤ (X ⇥ Y Z)

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR VBD1,2

PRP$2,3

PRP2,3

NN3,4

VB3,4

NP2,4 SBAR2,4

VP1,4

S0,4

Page 78: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

Page 79: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

Page 80: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

I saw her duck

NP

VP

PRP VBD PRP$ NN

S

Page 81: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

I saw her duck

NP

VP

PRP VBD PRP$ NN

SNP

VP

PRP VBD PRP$ NN

S

yo vi su pato

Page 82: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

I saw her duck

SBAR

VP

PRP VBD PRP VB

S

Page 83: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

I saw her duck

SBAR

VP

PRP VBD PRP VB

SSBAR

VP

PRP VBD PRP VB

S

yo vi ella agacharse

Page 84: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

Analysis

Page 85: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

Analysis

nodesO(Nn2)

O(Gn3) edges

Page 86: Edinburgh MT lecture 12: Synchronous context-free grammar

Wait a second!

Page 87: Edinburgh MT lecture 12: Synchronous context-free grammar

Wait a second!

•Phrase-based MT is NP-hard because of permutations (there are a factorial number).

Page 88: Edinburgh MT lecture 12: Synchronous context-free grammar

Wait a second!

•Phrase-based MT is NP-hard because of permutations (there are a factorial number).

•SCFGs also permute sentences.

Page 89: Edinburgh MT lecture 12: Synchronous context-free grammar

Wait a second!

•Phrase-based MT is NP-hard because of permutations (there are a factorial number).

•SCFGs also permute sentences.

•But the decoding algorithm is polynomial…

Page 90: Edinburgh MT lecture 12: Synchronous context-free grammar

Wait a second!

•Phrase-based MT is NP-hard because of permutations (there are a factorial number).

•SCFGs also permute sentences.

•But the decoding algorithm is polynomial…

•What are we giving up for this efficiency?

Page 91: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

X → b, bX → c, cX → d, d

Page 92: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

X → b, bX → c, cX → d, d

Page 93: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

What permutations of a b c d can this

grammar produce?

X → b, bX → c, cX → d, d

Page 94: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

b a c d

b a d c

b c a d

b c d a

b d a c

b d c a

What permutations of a b c d can this

grammar produce?

d a b c

d a c b

d b a c

d b c a

d c a b

d c b a

a b c d

a b d c

a c b d

a c d b

a d b c

a d c b

c a b d

c a d b

c b a d

c b d a

c d a b

c d b a

X → b, bX → c, cX → d, d

Page 95: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

b a c d

b a d c

b c a d

b c d a

b d a c

b d c a

What permutations of a b c d can this

grammar produce?

d a b c

d a c b

d b a c

d b c a

d c a b

d c b a

a b c d

a b d c

a c b d

a c d b

a d b c

a d c b

c a b d

c a d b

c b a d

c b d a

c d a b

c d b a

X → b, bX → c, cX → d, d

Page 96: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

b a c d

b a d c

b c a d

b c d a

b d a c

b d c a

What permutations of a b c d can this

grammar produce?

d a b c

d a c b

d b a c

d b c a

d c a b

d c b a

a b c d

a b d c

a c b d

a c d b

a d b c

a d c b

c a b d

c a d b

c b a d

c b d a

c d a b

c d b a

X → b, bX → c, cX → d, d

Page 97: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

b a c d

b a d c

b c a d

b c d a

b d a c

b d c a

What permutations of a b c d can this

grammar produce?

d a b c

d a c b

d b a c

d b c a

d c a b

d c b a

a b c d

a b d c

a c b d

a c d b

a d b c

a d c b

c a b d

c a d b

c b a d

c b d a

c d a b

c d b a

X → b, bX → c, cX → d, d

Page 98: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

b a c d

b a d c

b c a d

b c d a

b d a c

b d c a

What permutations of a b c d can this

grammar produce?

d a b c

d a c b

d b a c

d b c a

d c a b

d c b a

a b c d

a b d c

a c b d

a c d b

a d b c

a d c b

c a b d

c a d b

c b a d

c b d a

c d a b

c d b a

X → b, bX → c, cX → d, d

Page 99: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

b d a c

What permutations of a b c d can this

grammar produce?

c a d b

X → b, bX → c, cX → d, d

Page 100: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

b d a c

What permutations of a b c d can this

grammar produce?

c a d b

X → b, bX → c, cX → d, d

Page 101: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

b d a c

What permutations of a b c d can this

grammar produce?

c a d b

X → b, bX → c, cX → d, d

inside-outside alignments

Page 102: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

b d a c

What permutations of a b c d can this

grammar produce?

c a d b

X → b, bX → c, cX → d, d

inside-outside alignments

X → X1 X2 X3 X4, X2 X4 X1 X3 X → X1 X2 X3 X4, X3 X1 X4 X2

Page 103: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

b d a c

What permutations of a b c d can this

grammar produce?

c a d b

X → b, bX → c, cX → d, d

inside-outside alignments

X → X1 X2 X3 X4, X2 X4 X1 X3 X → X1 X2 X3 X4, X3 X1 X4 X2

No equivalent binary-branching SCFG

Page 104: Edinburgh MT lecture 12: Synchronous context-free grammar

PermutationsX → X1 X2, X1 X2

X → X1 X2, X2 X1

X → a, a

b d a c

What permutations of a b c d can this

grammar produce?

c a d b

X → b, bX → c, cX → d, d

inside-outside alignments

X → X1 X2 X3 X4, X2 X4 X1 X3 X → X1 X2 X3 X4, X3 X1 X4 X2

No equivalent binary-branching SCFGComplexity of many problems is exponential in rank

Page 105: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing as deductionXi,i+1 � (wi+1 = w) ⇤ (X ⇥ w)Xi,j � Yi,k ⇤ Zk,j ⇤ (X ⇥ Y Z)

[X ! w][wi+1 = w]

[i,X, i+ 1]

[X ! Y Z][i, Y, k][k, Z, j]

[i,X, j]

For sentence w1…wn, grammar G with nonterminals N

[i,X, j] 8i, j 2 0, ..., n,X 2 N

[X ! w] 8X ! w 2 PG

[wi = w] 8i 2 1, ..., naxioms:

items:inference rules:

[1, S, n]goal:

[X ! Y Z] 8X ! Y Z 2 PG

Page 106: Edinburgh MT lecture 12: Synchronous context-free grammar

From proof to (pseudo)codeInput: w1…wn, grammar Gfor i in 1,…,n: for X->w_i in P(G): chart[i-1,X,i] := truefor l in 2,…,n: for i in 0,…,n-l: j := i+l for k in i+1,…,j-1: for X->YZ in P(G): if chart[i,Y,k] and chart[k,Z,j]: chart[i,X,j] := truereturn chart[0,S,n]

Page 107: Edinburgh MT lecture 12: Synchronous context-free grammar

That’s nice, but…

•We need probabilities.

•We need to compute the most probable parse.

•We need to compute expectations.

Page 108: Edinburgh MT lecture 12: Synchronous context-free grammar

Most probable parse

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

Page 109: Edinburgh MT lecture 12: Synchronous context-free grammar

Most probable parse

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

Page 110: Edinburgh MT lecture 12: Synchronous context-free grammar

0.7

Most probable parse

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

Page 111: Edinburgh MT lecture 12: Synchronous context-free grammar

0.3 1.0

1.01.0

1.0

0.7

Most probable parse

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

Page 112: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Most probable parse

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

Page 113: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Most probable parse

I1

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

Page 114: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Most probable parse

I1

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

0.06

Page 115: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Most probable parse

I1

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

0.06

Page 116: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Most probable parse

I1

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

0.06

Page 117: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Most probable parse

I1

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

0.8

Page 118: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Most probable parse

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

0.8

Page 119: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Most probable parse

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

0.80.56

Page 120: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Most probable parse

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

0.80.56

Xi,j = max(Xi,j , Yi,k � Zk,j � p(X ⇥ Y Z))

Page 121: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Rule expectations

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

Page 122: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Rule expectations

I1

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

Page 123: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Rule expectations

I1

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

0.06

Page 124: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Rule expectations

I1

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

0.06

Page 125: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Rule expectations

I1

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

0.86

Page 126: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Rule expectations

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

0.86

Page 127: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Rule expectations

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

NN → duck

PRP → I

VBD → saw

PRP$ → her

NP → PRP$ NN

VP → VBD NP

S → PRP VP

PRP → her

VB → duckSBAR → PRP VB

VP → VBD SBAR

(1.0)(1.0)(0.3)(0.7)(1.0)(1.0)(1.0)(1.0)(0.8)(0.2)(1.0)

0.860.602

Page 128: Edinburgh MT lecture 12: Synchronous context-free grammar

1.0 0.3

0.3 1.0

1.01.0

1.0

0.7

Rule expectations

I1 saw2 her3 duck4

PRP0,1

VBD1,2

VP1,4

PRP$2,3 NN3,4

NP2,4

PRP2,3 VB3,4

SBAR2,4

S0,4

0.860.602

Xi,j = Xi,j + (Yi,k � Zk,j � p(X ⇥ Y Z))

Page 129: Edinburgh MT lecture 12: Synchronous context-free grammar

Similarities

Xi,j = Xi,j + (Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = max(Xi,j , Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j � Yi,k ⇤ Zk,j ⇤ (X ⇥ Y Z)

Page 130: Edinburgh MT lecture 12: Synchronous context-free grammar

Similarities

Xi,j = Xi,j + (Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = max(Xi,j , Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = Xi,j ⇤ (Yi,k ⇥ Zk,j ⇥ (X � Y Z))

Page 131: Edinburgh MT lecture 12: Synchronous context-free grammar

Similarities

Xi,j = Xi,j + (Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = max(Xi,j , Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = Xi,j ⇤ (Yi,k ⇥ Zk,j ⇥ (X � Y Z))

h{T, F},_, F,^, T i

Page 132: Edinburgh MT lecture 12: Synchronous context-free grammar

Similarities

Xi,j = Xi,j + (Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = max(Xi,j , Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = Xi,j ⇤ (Yi,k ⇥ Zk,j ⇥ (X � Y Z))

h{T, F},_, F,^, T i

hR+,max, 0,⇥, 1i

Page 133: Edinburgh MT lecture 12: Synchronous context-free grammar

Similarities

Xi,j = Xi,j + (Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = max(Xi,j , Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = Xi,j ⇤ (Yi,k ⇥ Zk,j ⇥ (X � Y Z))

h{T, F},_, F,^, T i

hR+,max, 0,⇥, 1i

hR+,+, 0,⇥, 1i

Page 134: Edinburgh MT lecture 12: Synchronous context-free grammar

Similarities

Xi,j = Xi,j + (Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = max(Xi,j , Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = Xi,j ⇤ (Yi,k ⇥ Zk,j ⇥ (X � Y Z))

Xi,j = Xi,j � (Yi,k ⇥ Zk,j ⇥R(X ⇤ Y Z))

h{T, F},_, F,^, T i

hR+,max, 0,⇥, 1i

hR+,+, 0,⇥, 1i

Page 135: Edinburgh MT lecture 12: Synchronous context-free grammar

Similarities

Xi,j = Xi,j + (Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = max(Xi,j , Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = Xi,j ⇤ (Yi,k ⇥ Zk,j ⇥ (X � Y Z))

Xi,j = Xi,j � (Yi,k ⇥ Zk,j ⇥R(X ⇤ Y Z))

h{T, F},_, F,^, T i

hA,�,0,⌦,1i

hR+,max, 0,⇥, 1i

hR+,+, 0,⇥, 1i

Page 136: Edinburgh MT lecture 12: Synchronous context-free grammar

Similarities

Xi,j = Xi,j + (Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = max(Xi,j , Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = Xi,j ⇤ (Yi,k ⇥ Zk,j ⇥ (X � Y Z))

Xi,j = Xi,j � (Yi,k ⇥ Zk,j ⇥R(X ⇤ Y Z))

h{T, F},_, F,^, T i

hA,�,0,⌦,1isemiring

hR+,max, 0,⇥, 1i

hR+,+, 0,⇥, 1i

Page 137: Edinburgh MT lecture 12: Synchronous context-free grammar

Similarities

Xi,j = Xi,j + (Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = max(Xi,j , Yi,k � Zk,j � p(X ⇥ Y Z))

Xi,j = Xi,j ⇤ (Yi,k ⇥ Zk,j ⇥ (X � Y Z))

Xi,j = Xi,j � (Yi,k ⇥ Zk,j ⇥R(X ⇤ Y Z))

boolean

tropical

inside

h{T, F},_, F,^, T i

hA,�,0,⌦,1isemiring

hR+,max, 0,⇥, 1i

hR+,+, 0,⇥, 1i

Page 138: Edinburgh MT lecture 12: Synchronous context-free grammar

Parsing as weighted deductionFor sentence w1…wn, grammar G with nonterminals N

[i,X, j] 8i, j 2 0, ..., n,X 2 N

[X ! w] 8X ! w 2 PG

[wi = w] 8i 2 1, ..., naxioms:

items:inference rules:

[1, S, n]goal:

[X ! w] : u [wi+1 = w] : v

[i,X, i+ 1] : u⌦ v

[X ! Y Z] : u [i, Y, k] : v [k, Z, j] : y

[i,X, j] : u⌦ v ⌦ y

[X ! Y Z] 8X ! Y Z 2 PG

Page 139: Edinburgh MT lecture 12: Synchronous context-free grammar

From proof to (pseudo)codeInput: w1…wn, grammar Gfor i in 1,…,n: for X->w_i in P(G): chart[i-1,X,i] := u(X->w_i)for l in 2,…,n: for i in 0,…,n-l: j := i+l for k in i+1,…,j-1: for X->YZ in P(G): chart[i,X,j] += chart[i,Y,k]*chart[k,Z,j]return chart[0,S,n]

Page 140: Edinburgh MT lecture 12: Synchronous context-free grammar

Semiring parsing•Viterbi, inside, boolean (Goodman 1999)

•Expectation and variance semirings (Li & Eisner 2009)

•Feature expectations

•Minimum Bayes Risk

•Gradients, etc.

•minimum error upper envelope (Kumar et al. 2009)

Page 141: Edinburgh MT lecture 12: Synchronous context-free grammar

Remaining questions

•How do we get the grammar?

•How do n-gram language models fit in?

•Is this really a plausible model of translation?

•Does it actually work? Why or why not?