Initiation of Standardization on Network-based Speech-to-speech Translation at ITU-T SG16
National Institute of Information and Communications Technology, Japan
Satoshi Nakamura, Chiori Hori

Contact: Satoshi Nakamura, Chiori Hori
Organization: NICT
Country: Japan
Tel: +81-774 95 1370
Fax: +81-774 95 1308
Email: [email protected] [email protected]

INTERNATIONAL TELECOMMUNICATION UNION
TELECOMMUNICATION STANDARDIZATION SECTOR
COM 16 – C 196 – E
STUDY PERIOD 2009-2012
October 2009
English only
Original: English
Question(s): 7, 21, 22/16
Many Languages All Over the World
http://en.wikipedia.org/wiki/List_of_language_families
Breaking Language Boundaries
Language boundaries are one of the causes of barriers to mutual understanding.
Speech-to-Speech Translation (S2ST) technologies are an effective means of removing language boundaries between people who speak different languages.
S2ST technologies have been studied.
[Figure: S2ST processing flow, Japanese to English. All modules draw on corpora.]
Input (Japanese speech): 「私は学校に行く」
Speech Recognition (ASR)
  Convert to Japanese phoneme sequence: "w", "a", "t", ... (w a t a sh i w a g a xtu k o o n i ...)
  Convert to word sequence using lexicon and grammar: 私は 学校に 行く
Machine Translation (MT)
  Convert to English word sequence: 「私は」⇒ "I", 「学校に」⇒ "to school", 「行く」⇒ "go" (I to school go)
  Reorder word sequence according to English grammar: "I" "to school" "go" ⇒ "I" "go" "to school" (I go to school)
Speech Synthesis (TTS)
  Select appropriate waveform for English text
Output (English speech): "I go to school"
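The three stages in the figure (ASR, then MT, then TTS) can be sketched as a chain of functions. This is a toy illustration only, not the presenters' system: the function names, the word table, and the reordering rule are assumptions made for this example, and the "speech" input is plain text standing in for audio.

```python
# Toy illustration only: real ASR, MT, and TTS modules are statistical
# systems trained on corpora. Every name and table below is hypothetical.

# Word-level translation table taken from the slide's example sentence.
WORD_TABLE = {"私は": "I", "学校に": "to school", "行く": "go"}


def recognize(speech: str) -> list[str]:
    """Stand-in for ASR: the 'audio' is text already split into words."""
    return speech.split()


def translate(words: list[str]) -> str:
    """Stand-in for MT: translate word by word, then reorder."""
    english = [WORD_TABLE[w] for w in words]  # ["I", "to school", "go"]
    # The Japanese example is verb-final, so move the last element
    # (the verb) to the position right after the subject.
    subject, *rest, verb = english
    return " ".join([subject, verb, *rest])


def synthesize(text: str) -> bytes:
    """Stand-in for TTS: return bytes in place of a waveform."""
    return text.encode("utf-8")


def s2st(speech: str) -> bytes:
    """Run the three stages in the order shown in the figure."""
    return synthesize(translate(recognize(speech)))


print(s2st("私は 学校に 行く").decode("utf-8"))  # I go to school
```

The point of the sketch is the module boundary: each stage consumes only the previous stage's output, which is what makes it possible to host the stages on separate servers.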
[Figure: an S2ST client sends speech through the ASR, MT, and TTS modules, and synthesized speech is returned to an S2ST client. Labels in the figure:]
Data format for ASR and MT results
Communication protocol among modules
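To make "data format for ASR and MT results" concrete, the following is a minimal sketch of one possible result record serialized as JSON. It is not the format proposed to ITU-T: the class name, field names, language codes, and the confidence field are all assumptions introduced for illustration.

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class ModuleResult:
    """Hypothetical record passed from one S2ST module to the next."""
    module: str        # producing module, e.g. "ASR" or "MT"
    language: str      # language of `text` (codes are an assumption)
    text: str          # recognized or translated text
    confidence: float  # illustrative score in [0, 1]


def encode(result: ModuleResult) -> str:
    """Serialize a result for transport between modules."""
    return json.dumps(asdict(result), ensure_ascii=False)


def decode(payload: str) -> ModuleResult:
    """Rebuild the record on the receiving module."""
    return ModuleResult(**json.loads(payload))


asr_out = ModuleResult("ASR", "ja", "私は学校に行く", 0.9)
wire = encode(asr_out)
assert decode(wire) == asr_out  # the record survives a round trip
print(wire)
```

A shared record like this is what lets an MT server from one organization consume the output of an ASR server from another without a custom adapter for each pair.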
Lexicon for overall S2ST systems
An example of a lexicon for overall modules in S2ST systems
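Columns: Entry, then per language (Japanese, Korean, Chinese, English) the attributes Surface, Pronunciation, Accent. "・・" marks cells elided on the slide.

Entry: Osaka
  Japanese  Surface: 大阪   Pronunciation: おおさか   Accent: 4モーラ0型 (4 morae, type 0)
  Korean    Osaka ・・
  Chinese   Surface: 大阪   Pronunciation: ダーバン, Daban, Da4ban3   Accent: 四声三声 (4th tone, 3rd tone)
  English   Surface: Osaka   Pronunciation: Ōsaka, ɔː s a k a

Entry: Tokyo
  Japanese  Surface: 東京   Pronunciation: とうきょう   Accent: ・・・・
  Korean    ・・
  Chinese   Surface: 東京   Pronunciation: トンジン, Tong1jing1   Accent: ・・
  English   Surface: Tokyo   Pronunciation: Tōkyō   Accent: ・・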
Global standardization of the lexicon format, together with a system to collect and provide lexicons for all languages, is required to maintain a reliable lexicon for overall S2ST systems.
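A shared lexicon of this kind could be represented as one entry per concept, with a surface form, pronunciation, and accent for each language. The sketch below is only one possible layout under stated assumptions: the class name, the language codes ("ja", "zh", "en"), and the lookup helper are hypothetical, and only values that appear on the slide are filled in (Korean is omitted because the slide elides it).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class LexiconForm:
    """One language's view of an entry (attribute names follow the slide)."""
    surface: str
    pronunciation: str
    accent: Optional[str] = None  # None where the slide gives no value


# Hypothetical in-memory lexicon keyed by entry, then by language code.
LEXICON = {
    "Osaka": {
        "ja": LexiconForm("大阪", "おおさか", "4モーラ0型"),
        "zh": LexiconForm("大阪", "Da4ban3", "四声三声"),
        "en": LexiconForm("Osaka", "ɔː s a k a"),
    },
    "Tokyo": {
        "ja": LexiconForm("東京", "とうきょう"),
        "zh": LexiconForm("東京", "Tong1jing1"),
        "en": LexiconForm("Tokyo", "Tōkyō"),
    },
}


def lookup(entry: str, language: str) -> Optional[LexiconForm]:
    """Return the form of `entry` in `language`, or None if it is missing."""
    return LEXICON.get(entry, {}).get(language)


form = lookup("Osaka", "zh")
print(form.surface, form.pronunciation, form.accent)
```

Keeping all languages under one entry lets ASR, MT, and TTS modules resolve the same proper noun consistently, which is the motivation the slide gives for a common format.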
Asian Network-Based S2ST System by A-STAR Consortium
1 National Institute of Information and Communications Technology (NICT), Japan
2 Electronics and Telecommunications Research Institute (ETRI), Korea
3 Chinese Academy of Sciences (CASIA), China
4 National Electronics and Computer Technology Center (NECTEC), Thailand
5 Agency for the Assessment and Application of Technology (BPPT), Indonesia
6 Center for Development of Advanced Computing (CDAC), India
7 Institute of Information Technology (IOIT), Vietnam
8 Institute for Infocomm Research (I2R), Singapore
Server Location for Network-based S2ST
Speech Translation using Distributed Service Servers
Example: From Korean to Thai Speech Translation
[Figure: a speech translation service client connected to distributed ASR, MT, and TTS servers; each server type appears more than once.]
① Speech recognition (Korean): Speech (Korean) → ASR server → Text (Korean)
② Language translation (Korean→Thai): Text (Korean) → MT server → Translated text (Thai)
③ Speech synthesis (Thai): Translated text (Thai) → TTS server → Synthesized speech (Thai)
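The Korean-to-Thai example can be sketched as a client that picks one server per step from a registry. This is an illustration under stated assumptions, not the A-STAR implementation: the registry, its keys, the language codes, and the stub servers (plain functions that wrap their input in a label) are hypothetical stand-ins for remote services.

```python
from typing import Callable

# A server stub takes the previous step's output and returns its own.
Server = Callable[[str], str]

# Hypothetical registry: (service, language or language pair) -> server.
SERVERS: dict[tuple[str, str], Server] = {
    ("ASR", "ko"): lambda speech: f"text(ko)<{speech}>",
    ("MT", "ko-th"): lambda text: f"text(th)<{text}>",
    ("TTS", "th"): lambda text: f"speech(th)<{text}>",
}


def translate_speech(speech: str, source: str, target: str) -> str:
    """Run steps 1 to 3, choosing a server for each step by language."""
    steps = [("ASR", source), ("MT", f"{source}-{target}"), ("TTS", target)]
    data = speech
    for key in steps:
        if key not in SERVERS:
            raise LookupError(f"no server registered for {key}")
        data = SERVERS[key](data)
    return data


print(translate_speech("audio", "ko", "th"))
# speech(th)<text(th)<text(ko)<audio>>>
```

Because each step is looked up independently, adding a language only requires registering new servers; the client logic does not change.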
S2ST Client and Server
Scope of Standardization
Table: Draft Roadmap to develop standards for network-based S2ST

Draft: F.S2STreqs
Title: Functional Requirements for Network-based S2ST
Scope: Definition of network-based S2ST; functions and service requirements of network-based S2ST
Target Date: During this Study Period (2009-2012)

Draft: H.S2STarch
Title: Architectural Requirements for Network-based S2ST
Scope: Functional architectures, mechanisms and interface of network-based S2ST
Target Date: During this Study Period (2009-2012)
Conclusion
We would like to invite more people to join the standardization activities on network-based S2ST systems.
By leveraging standardization, network-based S2ST systems can cover more languages.