Preparing for the 2008 Beijing Olympics : The LingTour and KNOWLISTICS projects MAO Yuhang, DING Xiao-Qing, NI Yang, LIN Shiuan-Sung, Laurence LIKFORMAN, Christian BOITET, Gérard CHOLLET, Alain GOYE, Eric LECOLINET, Jacques PRADO Presented here by Gérard CHOLLET chollet @ tsi.enst.fr GET-ENST/CNRS- LTCI
23
Embed
Preparing for the 2008 Beijing Olympics : The LingTour and KNOWLISTICS projects
Preparing for the 2008 Beijing Olympics : The LingTour and KNOWLISTICS projects. MAO Yuhang, DING Xiao-Qing, NI Yang, LIN Shiuan-Sung, Laurence LIKFORMAN, Christian BOITET, Gérard CHOLLET, Alain GOYE, Eric LECOLINET, Jacques PRADO - PowerPoint PPT Presentation
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Preparing for the 2008 Beijing Olympics :
The LingTour and KNOWLISTICS projects
MAO Yuhang, DING Xiao-Qing, NI Yang,LIN Shiuan-Sung, Laurence LIKFORMAN,
Christian BOITET, Gérard CHOLLET, Alain GOYE, Eric LECOLINET, Jacques PRADO
Multimedia, multilingual information server Information kiosk Intelligent Camera Bilingual Voice Communicator
Needs and relevance A PDA for tourists and travelling businessmen
Conclusions and Perspectives
Rationale for the IP-KNOWLISTICS
Logistics for knowledge in a specific domain (OG) Language independent knowledge representation
and management Multimedia (text, speech, image, video) Multimodal access (text, speech, pen, visual I/O) Distributed multilingual, multimedia server
accessible from mobile terminals (phone, PDA, PC,…) and kiosks
Primarily targetted for tourist applications initially 2008 Beijing Olympics as a field trial
Technical developments
Language independent knowledge representation (using conceptual graphs and an Intermediate Representation Language like the ‘Universal Networking Language’)
Tools for enconversion and evaluation Generation in 12 target languages Multilingual Speech Synthesis and Recognition VoiceUNL-based interactive dialog agent ‘Intelligent camera’ with Chinese character recognition Cross-language ‘Multimodal communicator’ on a PDA Cross-language lexical access
Searching for character regions (text has uniform color) Multi-channel decomposition Connected components analysis Grouping of components Alignment analysis (number of horizontally or
vertically aligned components) Text identification (language independant features :
size, alignment,…)
Detection rate : 84 % False alarm rate : 5.6 %
Cross-language Multimodal Communicator
Use of a visual display (e.g. on a PDA) to mediate the dialogue between 2 persons speaking different languages.
Recognition of short utterances, display of a word graph, selection of keywords, visualisation (and synthesis) of the translation of key words and groups of words.
Specialised lexicon for dialog acts in typical touristic situations (in a restaurant, at the hotel, medical assitance, in the street, in public transport, about the Olympic games,…)
UMTS access to an information server offering maps, photographs, video sequences, web browsing, …
Automatic Speech Recognition in Multiple Languages
Sharing of acoustic models between languages to simplify extensibility to other languages.
Combination of phone models and adaptation from small amounts of data in new languages.
Model adaptation to user and environmental situations.
French
ChineseSharedacousticmodels
Language specific models
Knowledge representation
A formal language for representating the meaning of natural language sentence.
UNL (Universal Networking Language) introduced to describe natural language semantics.
Language-independent context indexing for cross-language information retrieval.
Use of conceptual hierarchy of UNL to address the inherent ambiguity of natural languages.
A set of semantic relations (linking concepts together) for a structured information pattern.