Top Banner
A Novel Rule Refinement Method for SMT through Simulated Post-Editing Sitong Yang 1,2 , Heng Yu ?1 , and Qun Liu 1,3 1. Key Laboratory of Intelligent Information Processing. Institute of Computing Technology, Chinese Academy of Sciences 2. University of Chinese Academy of Sciences 3. CNGL, School of Computing, Dublin City University 2014/12/23
68

A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Apr 23, 2021

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

A Novel Rule Refinement Method for

SMT through Simulated Post-Editing

Sitong Yang1,2, Heng Yu?1, and Qun Liu1,3

1. Key Laboratory of Intelligent Information Processing. Institute of Computing Technology, Chinese Academy of Sciences

2. University of Chinese Academy of Sciences

3. CNGL, School of Computing, Dublin City University

2014/12/23

Page 2: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

2014/12/23

Post-Editing

Pros & Cons

Our method

Data set & Experiment

Conclusion & Furture Work

Page 3: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Post Editing(PE)

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 4: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Post Editing(PE)

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Automatic post editing

Page 5: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Post Editing(PE)

2014/12/23

MT

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 6: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Post Editing(PE)

2014/12/23

MT Post Editing (SMT)

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 7: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Post Editing(PE)

2014/12/23

MT Post Editing (SMT)

Result

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 8: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Post Editing(PE)

2014/12/23

MT Post Editing (SMT)

Result

Multiple stream

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 9: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Post Editing(PE)

2014/12/23

MT

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 10: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Post Editing(PE)

2014/12/23

Post Editing MT

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 11: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Post Editing(PE)

2014/12/23

Better MT Post Editing

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 12: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Post Editing(PE)

2014/12/23

Single stream

Better MT Post Editing

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 13: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

2014/12/23

Post-Editing

Pros & Cons

Our method

Data set & Experiment

Conclusion & Furture Work

Page 14: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Pros & Cons

2014/12/23

Pros:

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 15: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Pros & Cons

2014/12/23

Pros:

• Better adaptation

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 16: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Pros & Cons

2014/12/23

Pros:

• Better adaptation

• No additional burden for SMT

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 17: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Pros & Cons

2014/12/23

Pros:

• Better adaptation

• No additional burden for SMT

Cons:

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 18: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Pros & Cons

2014/12/23

Pros:

• Better adaptation

• No additional burden for SMT

Cons:

• Expensive

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 19: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Pros & Cons

2014/12/23

Pros:

• Better adaptation

• No additional burden for SMT

Cons:

• Expensive

• Hard to learn

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 20: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

2014/12/23

Post-Editing

Pros & Cons

Our method

Data set & Experiment

Conclusion & Furture Work

Page 21: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Our method

2014/12/23

We Learn from PE results to enhance the original SMT Model.

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 22: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Our method

2014/12/23

We Learn from PE results to enhance the original SMT Model.

• Simulated Post Editing

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 23: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Our method

2014/12/23

We Learn from PE results to enhance the original SMT Model.

• Simulated Post Editing

• Error-Driven Frame work

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 24: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Our method

2014/12/23

We Learn from PE results to enhance the original SMT Model.

• Simulated Post Editing

• Error-Driven Frame work

Error Detection Rule Extraction Rule Filteration

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 25: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Simulated PE

2014/12/23

Daniel [2010] formulated the task of simulated post-editing, wherein pregenerated reference translations are used as a stand-in for actual post-editing.

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 26: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Simulated PE

2014/12/23

Machine Translation

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 27: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Simulated PE

2014/12/23

Machine Translation

Human Post Editing

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 28: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Simulated PE

2014/12/23

Machine Translation

Human Post Editing

PE

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 29: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Simulated PE

2014/12/23

Machine Translation

Human Post Editing

PE

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Expensive

Page 30: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Simulated PE

2014/12/23

Machine Translation

Human Post Editing

Reference

PE

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 31: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Simulated PE

2014/12/23

Machine Translation

Human Post Editing

Reference

PE

SiPE

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 32: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Simulated PE

2014/12/23

Machine Translation

Human Post Editing

Reference

PE

SiPE

Cheap

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 33: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Error-Driven Rule Refinement

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 34: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Error-Driven Rule Refinement

2014/12/23

This man lived a dog ’s life

这个 人 生活 潦倒

Src:

Tgt:

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 35: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Error-Driven Rule Refinement

2014/12/23

This man lived a dog ’s life

这个 人 生活 潦倒

Src:

Tgt:

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 36: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Error-Driven Rule Refinement

2014/12/23

This man lived a dog ’s life

这个 人 生活 潦倒

Src:

Tgt:

Alignment Error!

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 37: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Error-Driven Rule Refinement

2014/12/23

This man lived a dog ’s life

这个 人 生活 一只 狗 的 生活

这个 人 生活 潦倒

Src:

Tgt:

MT:zhege ren shenghuo yizhi gou de shenghuo

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 38: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Error-Driven Rule Refinement

2014/12/23

This man lived a dog ’s life

这个 人 生活 一只 狗 的 生活

这个 人 生活 潦倒

Src:

Tgt:

MT:

这个 人 生活 潦倒Ref:zhege ren shenghuo liaodao

zhege ren shenghuo yizhi gou de shenghuo

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 39: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Error-Driven Rule Refinement

2014/12/23

This man lived a dog ’s life

这个 人 生活 一只 狗 的 生活

这个 人 生活 潦倒

Src:

Tgt:

MT:

这个 人 生活 潦倒Ref:zhege ren shenghuo liaodao

zhege ren shenghuo yizhi gou de shenghuo

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 40: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Error Detection

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 41: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Error Detection

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Editing distance

Page 42: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Error Detection

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

TERplus

Synonym match Stem match

Phrase substitution Shift

Deletion Word substitution

Insertion

Editing distance→

Page 43: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Error Detection

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 44: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Error Detection

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

0

1000

2000

3000

4000

5000

6000

7000

8000

9000

Y T Ps S D Ws I Y T Ps S D Ws I 0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

Y: Synonym match

T: Stem match

Ps: Phrase subst i t ut i on

S: Shif t

D: Delet i on

Ws: Word subst i t ut i on

I: Insert i on

SiPE Distribution SiPE Precision

Page 45: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Error Detection

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

0

1000

2000

3000

4000

5000

6000

7000

8000

9000

Y T Ps S D Ws I Y T Ps S D Ws I

small sample hard to learn

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

Y: Synonym match

T: Stem match

Ps: Phrase subst i t ut i on

S: Shif t

D: Delet i on

Ws: Word subst i t ut i on

I: Insert i on

SiPE Distribution SiPE Precision

Page 46: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

Y: Synonym match

T: Stem match

Ps: Phrase subst i t ut i on

S: Shif t

D: Delet i on

Ws: Word subst i t ut i on

I: Insert i on

Error Detection

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

0

1000

2000

3000

4000

5000

6000

7000

8000

9000

Y T Ps S D Ws I Y T Ps S D Ws I

SiPE Distribution SiPE Precision

Page 47: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Rule extration and Filteration

2014/12/23

Filteration

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 48: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Rule extration and Filteration

2014/12/23

Filteration • C (words of Context )

• P (words of Source side Substitution part)

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 49: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Rule extration and Filteration

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

这个 人 生活 一只 狗 的 生活MT:

这个 人 生活 潦倒Ref:zhege ren shenghuo liaodao

zhege ren shenghuo yizhi gou de shenghuo

C=1 P=4

Extration Monolingual rule:

生活 一只狗的生活 ||| 生活 潦倒

Page 50: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Rule extration and Filteration

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

这个 人 生活 一只 狗 的 生活MT:

这个 人 生活 潦倒Ref:zhege ren shenghuo liaodao

zhege ren shenghuo yizhi gou de shenghuo

Extration Monolingual rule:

生活 一只狗的生活 ||| 生活 潦倒

Page 51: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Rule extration and Filteration

2014/12/23

Extration Monolingual rule:

生活 一只狗的生活 ||| 生活 潦倒

This man lived a dog ’s life

这个 人 生活 一只 狗 的 生活

Src:

MT:

这个 人 生活 潦倒Ref:zhege ren shenghuo liaodao

zhege ren shenghuo yizhi gou de shenghuo

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 52: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Rule extration and Filteration

2014/12/23

Extration Monolingual rule:

生活 一只狗的生活 ||| 生活 潦倒

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 53: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Rule extration and Filteration

2014/12/23

Extration Monolingual rule:

生活 一只狗的生活 ||| 生活 潦倒

Original Bilingual rule :

lived a dog ‘s life ||| 生活 一只 狗 的 生活 ||| 0.5 0.0149508 0.4 7.97148e-06 2.718

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 54: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Rule extration and Filteration

2014/12/23

Extration Monolingual rule:

生活 一只狗的生活 ||| 生活 潦倒

Original Bilingual rule :

lived a dog ‘s life ||| 生活 一只 狗 的 生活 ||| 0.5 0.0149508 0.4 7.97148e-06 2.718

New rule:

lived a dog ‘s life |||生活 潦倒 ||| 0.5 0.0149508 0.4 7.97148e-06 2.718

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 55: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Filtering Criterion

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 56: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Filtering Criterion

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

calming the emotions of

calming the feelings of

C=2 P=2

the beginning of the new

the opening

C=1 P=3

C=2 P=1

between the faculty members and

between teachers and

C=2 P=3

to open up and

opening up and

Page 57: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Filtering Criterion

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

calming the emotions of

calming the feelings of

C=2 P=2

the beginning of the new

the opening

C=1 P=3

C=2 P=1

between the faculty members and

between teachers and

C=2 P=3

to open up and

opening up and

Page 58: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Filtering Criterion

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

calming the emotions of

calming the feelings of

C=2 P=2

the beginning of the new

the opening

C=1 P=3

C=2 P=1

between the faculty members and

between teachers and

C=2 P=3

to open up and

opening up and

Page 59: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Filtering Criterion

2014/12/23

• Should Contain More Context ( c>=2 )

• More Accurated Substitution ( 2<=p<=5 )

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 60: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

2014/12/23

Post-Editing

Pros & Cons

Our method

Data set & Experiment

Conclusion & Furture Work

Page 61: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Experiments-Setup

2014/12/23

• Baseline system: Moses:a state-of-art phrase-based SMT system

Hiero: Hierarchical phrase-based system

• Word-alignment tool: GIZA++

• Language model: SRILM toolkit

• MT evaluation metric: Case-insensitive Bleu-4, Ter-plus

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 62: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Experiments-Setup

2014/12/23

• Data Set:

• SiPE dataset:

Training-set: 10-fold cross validation

1 10 9 8 7 6 5 4 3 2

SMT MT Result

10 9 8 7 6 5 4 3 2 1 10 group

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Domain Language Training-set Dev-set Test-set

News C2E 240k Nist02 Nist04 Nist05 Nist06

Medical C2E 560K 1000 1000

Page 63: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Main Results:Rule Refinement Method

2014/12/23

• new

• medical

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

System Bleu TERp

04 05 06 avg 04 05 06 avg

moses 32.02 29.00 27.18 29.34 61.47 64.04 66.45 64.47

balanced 33.47 30.02 28.80 30.76 60.24 59.37 63.89 61.77

hiero 34.10 29.89 28.78 30.92 59.55 62.73 64.84 62.37

balanced 34.09 29.87 28.81 30.92 59.56 62.75 64.85 62.38

System Bleu TERp

moses 29.64 66.06

ours 30.15 63.80

hiero 29.48 63.53

ours 30.26 62.57

Page 64: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Main Results: Rule Filteration

2014/12/23 Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

BLEU Score

Rule-size(1000)

Page 65: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

2014/12/23

Post-Editing

Pros & Cons

Our method

Data set & Experiment

Conclusion & Furture Work

Page 66: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Conclusion & Furture work

2014/12/23

Conclusion: • a novel rule refinement method for SMT.

• a simulated post-editing paradigm to efficiently collect the training data.

• TER-Plus for translation error detection.

• a simple and effectively heuristic algorithm for rule-filteration.

• both phrase-based and syntax-based SMT systems.

• gains an overall improvement of 1.4 BLEU point without using any additional resources.

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 67: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Conclusion & Furture work

2014/12/23

Furture work:

• test our method on more complex translation models.

• produce more powerful feedbacks to improve SMT systems.

Sitong Yang, Heng Yu, and Qun Liu. A Novel Rule Refinement Method for SMTthrough Simulated Post-Editing

>> Post Editing >> Pros & Cons >> Our method >> Data set & Experiment >> Conclusion & Furture Work

Page 68: A Novel Rule Refinement Method for SMT through Simulated ...tcci.ccf.org.cn/conference/2014/ppts/nlpcc/ppt126.pdfA Novel Rule Refinement Method for SMT through Simulated Post-Editing

Thanks for your attention!

2014/12/23