Top Banner
Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from the paper. Critique and snarky remarks, however, are original.
27

Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Dec 21, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Statistical Phrase-Based TranslationAuthors: Koehn, Och, Marcu

Presented by Albert BertramTitles, charts, graphs, figures and tables were extracted from the paper.Critique and snarky remarks, however, are original.

Page 2: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Motivation

We have a new way of learning phrase translations, but…

“What is the best method to extract phrase translation pairs?”

Page 3: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

What to do, what to do

Compose a framework for consistent comparison

Implement each algorithmCompare the results

Page 4: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Evaluation Framework

PhrasesModels involved

Language modelStatistical model for translationDistortion model

Decoder

Page 5: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Evaluation Framework: Phrases

We all know what phrases are right?NP, VP, wait, what? Oh.

Here, they’re generic spanning and non-overlapping subsequences of words.Are these guys really linguists?

Page 6: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Evaluation Framework: Models

Language Model Trigram usually p(en|en-1,en-2)

Translation Model Argmaxe p(e|f) = argmaxe p(f|e)p(e)

ebest = argmaxe p(f|e)pLM(e)ωlength(e)

p(f|e) is decomposed into

Page 7: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Evaluation Framework: Models

Distortion Modeld(ai – bi-1)

ai = start position of the foreign phrase translated into the ith English phrase

bi-1 = end position of the foreign phrase translated into the (i-1)th English phrase

Learned from the joint probability model Ping told us about

Page 8: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Evaluation Framework: Decoder

Left-to-right incrementalStack-based beam searchEstimates future costsSame decoder used in all experiments

Page 9: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Baseline Experiments

Word-based alignmentSyntactic phrasesPhrase alignments

Page 10: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Baseline: Word-based Alignment

Learn the phrases from word alignments

Page 11: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Baseline: Syntactic Phrases

Learn only syntactically correct phrasesStart with the word based alignment Prune out the phrase pairs which aren’t

subtrees in the parsed sentences for either language.

Page 12: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Baseline: Phrase Alignment

Marcu and Wong, 2002 Yes, this is the paper Ping just

presented.

Page 13: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Experiment Background

Europarl and BLEUTraining corpus of 10, 20, 40, 80,160

and 320 kilo-sentence pairs

Page 14: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Baseline Results

Notice the bottom row there? Comparing these models is like taking a 5-year old to a chess tournament.

Page 15: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Baseline Results

Page 16: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

More Experiments

Weighting Syntactic PhrasesMaximum Phrase LengthLexical WeightingPhrase Extraction HeuristicSimpler Underlying Word-Base ModelsOther Languages

Page 17: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Experiments and Results

Weighting Syntactic PhrasesDouble the count on syntactic phrases

Is that sufficient? Insufficient post-analysis on this oneThe same BLEU score

Were the translations in better syntax?Did the translations at least use more syntactic

phrases?

Page 18: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Experiments and Results

Maximum Phrase Length

Page 19: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Experiments and Results

Lexical WeightingLexical probability distribution

Lexical Weight

Page 20: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Experiments and Results

Lexical WeightingExample

Page 21: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Experiments and Results

Lexical WeightingWith multiple alignments

Extended to fit this model

Page 22: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Experiments and Results

Lexical Weighting Improvement: .01 BLEU

Page 23: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Experiments and Results

Phrase Extraction HeuristicAlign Bidirectionally

Note this gives two different word alignment setsStart with the intersection of the two setsAdd possible alignments

Only if they’re in the union of the setsOnly if they connect at least one previously

unaligned word

Page 24: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Experiments and Results

Phrase Extraction HeuristicAlgorithm

Start with the first English wordExpand only directly adjacent alignment pointsMove to the next English word, repeat.Finally add non-adjacent alignment points which

meet the heuristic criteria.

Page 25: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Experiments and Results

Page 26: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Experiments and Results

Simpler Underlying Word-Base Models IBM models 1-4

Page 27: Statistical Phrase-Based Translation Authors: Koehn, Och, Marcu Presented by Albert Bertram Titles, charts, graphs, figures and tables were extracted from.

Experiments and Results

Other Languages