Top Banner
Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval Johns Hopkins University Center of Language and Speech Processing Summer Workshop 2000 Opening Ceremony The MEI Team July 17, 2000
20

The MEI Team July 17, 2000

Jan 03, 2016

Download

Documents

shaine-ashley

Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval Johns Hopkins University Center of Language and Speech Processing Summer Workshop 2000 Opening Ceremony. The MEI Team July 17, 2000. MEI Team. Senior Members Students. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: The MEI Team July 17, 2000

Mandarin-English Information (MEI):Investigating Translingual Speech Retrieval

Johns Hopkins University Center of Language and Speech Processing

Summer Workshop 2000Opening Ceremony

The MEI Team

July 17, 2000

Page 2: The MEI Team July 17, 2000

MEI Team

• Senior Members

• Students

Helen Meng Chinese University of Hong KongErika Grams Advanced Analytic ToolsSanjeev Khudanpur Johns Hopkins UniversityGina-Anne Levow University of MarylandDouglas Oard University of MarylandPatrick Schone US Department of DefenseHsin-Min Wang Academia Sinica, Taiwan

Berlin Chen National Taiwan UniversityWai-Kit Lo Chinese University of Hong KongKaren Tang Princeton UniversityJianqiang Wang University of Maryland

Page 3: The MEI Team July 17, 2000

Outline

• Motivation

• MEI project overview

• Research challenges

• System architecture

Page 4: The MEI Team July 17, 2000

Motivation

• Monolingual speech retrieval applications are emerging, e.g.– http://speechbot.research.compaq.com

source: www.real.com, Feb 2000

529

1367

English

OtherLanguages

Internet-accessibleRadio and Television Stations

Page 5: The MEI Team July 17, 2000

Translingual Speech Retrieval

• Allow anyone to find information that is expressed in any spoken language

Page 6: The MEI Team July 17, 2000

The Big Picture

Search Refinements

Speech-to-SpeechTranslation

English Spoken Documents

Retrieval Engine

English Text Query

English-to-ChineseTranslation

Mandarin News Broadcasts

Mandarin AudioIndexing

Retrieved MandarinSpoken Documents

Page 7: The MEI Team July 17, 2000

Related Work• TREC Spoken Document Retrieval

– Close coupling of recognition and retrieval

• TREC Cross-Language Retrieval– Close coupling of translation and retrieval

• TDT-3 Topic Tracking– Coupling recognition, translation and retrieval– Using speech recognition transcripts

Page 8: The MEI Team July 17, 2000

Research Challenges• Multi-scale audio indexing

– Multiple feature sets capture more information

• Multi-scale translation– Lexicon and pronunciation are complementary

• Multi-scale retrieval– Combination of evidence can add robustness

The MEI Project• Closely couple recognition and translation

– for the purpose of retrieval

Page 9: The MEI Team July 17, 2000

Task and Corpus

Query byExampleEnglish

ExampleNewswire

Stories

MandarinAudio

Collection

• TDT Corpus

• Challenge: using an English story as the query, find related Mandarin audio documents

Page 10: The MEI Team July 17, 2000

Multi-scale Mandarin Processing: Phonological Considerations

• Chinese is a syllable-based language• Mandarin is the official Chinese dialect

– ~400 base syllables, 4 lexical tones + light tone

• Syllable structure (CG)V(X)– (CG): onset, optional, consonant+medial glide

– V: nuclear vowel

– X: coda, glide / alveolar nasal / velar nasal

– ~ 21 initials, 39 finals

• Circumvent the OOV problem

Page 11: The MEI Team July 17, 2000

Multi-scale Mandarin Processing:Linguistic Considerations

• Characters (written) -> syllables (spoken)• Degenerate mapping

– /hang2/, /hang4/, /heng2/ or /xing2/

– /fu4 shu4/ (LDC’s CALLHOME lexicon)

• Tokenization / Segmentation– /zhe4 yi1 wan3 hui4 ru2 chang2 ju3 xing2/

Page 12: The MEI Team July 17, 2000

Multi-scale Audio Indexing

Initial/Final

Preme/Core Final

Phones

/iang//ji/

/j//ang/

/i/ /a/ /ng//j/

Preme/Toneme, Initial/Tonal-Finalsyllable /jiang/, single-character word, part of a multi-char word

Page 13: The MEI Team July 17, 2000

• Word-scale– Phrase-based, e.g. “the White House”– Dictionary-based [Levow & Oard 00]– Lexical gaps: parallel corpora, comparable corpora

• Subword-scale – Named entities: transliteration by cross-lingual phonetic

mapping– Northern Ireland /bei2 ai4 er3 lan2/– Kosovo (/ke1-suo3-wo4/, /ke1-suo3-fo2/, /ke1-suo3-fu1/,

/ke1-suo3-fu2/)

Multi-scale Translation

Page 14: The MEI Team July 17, 2000

Cross-Lingual Phonetic MappingNamed entity Jiang Zemin, Kosovo

Syllabify Pinyin Spelling E.g. jiang ze min

English Pronunciation Lookupor

Letter-to-Phone Generation

English Phones, e.g. k ao s ax v ow

Cross-lingual Phonetic Mapping

Chinese Phones, e.g. k e s u o w o

Syllabification

Chinese syllables, e.g. ke suo wo

Page 15: The MEI Team July 17, 2000

Multi-scale Retrieval

• Word-scale exploits lexical knowledge– Enhances precision

• Subwords can achieve complete coverage – Enhances recall

• Combination of evidence may be best– If a good merging strategy can be found

Page 16: The MEI Team July 17, 2000

Merging Strategies

• Loose coupling– Separate retrieval runs

– Merge ranked lists [Voorhees 1995]

• Tight coupling – Unified indexing of words and subwords

– Single ranked list

– [Ng 2000]

Page 17: The MEI Team July 17, 2000

Multi-scale Retrieval Techniques• Subword-scale

– Syllable lattice matching [Chen, Wang & Lee 2000]

– Overlapping syllable n-grams [Meng et al. 1999]

– Syllable confusion matrix [Meng et al. 1999]

• Word-scale– Structured queries [Pirkola 1998]

– Structured translation [Sperer & Oard 2000]

• Robust Retrieval– speech recognition errors

– translation / transliteration ambiguities

Page 18: The MEI Team July 17, 2000

Research Plan

• “MEI: Mandarin-English Information”, Proceedings of the TDT Workshop, 2000.

• “Mandarin-English Information (MEI): Investigating Translingual Retrieval” Proceedings of the NAACL Workshop on Embedded Machine Translation, 2000

Page 19: The MEI Team July 17, 2000

Named Entity

Tagger

Phrase tagging

Unknown words and phrases

Unknown words and phrases

English to Chinese translation dictionary

Term Translation

Spoken Mandarin document

s

Spoken Mandarin document

s

Dragon Mandarin

ASR

Query

Pro

cess

ing

Query

Pro

cess

ing

Query

Pro

cess

ing

Query

Pro

cess

ing

Docu

ment

Pro

cess

ing

Docu

ment

Pro

cess

ing

Docu

ment

Pro

cess

ing

Docu

ment

Pro

cess

ing

Query to Query to INQUERYINQUERY

Document Document to INQUERYto INQUERY

Character n-gram generation

Character n-gram

generation

Mandarin-English Information: Investigation Translingual Speech RetrievalMandarin-English Information: Investigation Translingual Speech Retrieval <http://www.glue.umd.edu/~meiweb><http://www.glue.umd.edu/~meiweb>Johns Hopkins University, Center for Language and Speech Processing, JHU/NSF Summer Workshop 2000

MEI Team : Helen MENG (CUHK), Berlin CHEN (National Taiwan University), Erika GRAMS (Advanced Analytic Tools), Sanjeev KHUDANPUR (JHU/CLSP), Gina-Anne LEVOW (University of Maryland), Wai-Kit LO (CUHK)Douglas OARD (University of Maryland), Patrick SCHONE (Department of Defense), Karen TANG (Princeton University), Hsin-Min WANG (Academia Sinica), Jianqiang WANG (University of Maryland)

MEI Team : Helen MENG (CUHK), Berlin CHEN (National Taiwan University), Erika GRAMS (Advanced Analytic Tools), Sanjeev KHUDANPUR (JHU/CLSP), Gina-Anne LEVOW (University of Maryland), Wai-Kit LO (CUHK)Douglas OARD (University of Maryland), Patrick SCHONE (Department of Defense), Karen TANG (Princeton University), Hsin-Min WANG (Academia Sinica), Jianqiang WANG (University of Maryland)

Word sequence

Charactern-gram

sequence

INQUERY

Ranked List of Possibly

Relevant Documents

Ranked List of Possibly

Relevant Documents

Translated words and

phrases

Translated words and

phrases

Relevance Assessments

Figure of Merit

Figure of Merit

Scoring

Query Term

Selection

As of Sunday July 9, 2000

Word sequence

Character n-gram

sequence

Segmented Chinese Text

Segmented Chinese Text

Input English

text query

Input English

text query

Page 20: The MEI Team July 17, 2000

END