Scanalu2002 23.5 Jan Alexandersson Experiences from large NLP Projects Jan Alexandersson German research center for Artificial Intelligence GmbH Stuhlsatzenhausweg 3, Geb. 43.1 66123 Saarbrücken Tel.: (0681) 302-5347 Email: [email protected]www.dfki.de/~janal
74
Embed
Scanalu2002 23.5 Jan Alexandersson Experiences from large NLP Projects Jan Alexandersson German research center for Artificial Intelligence GmbH Stuhlsatzenhausweg.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Scanalu2002 23.5 Jan Alexandersson
Experiences from large NLP Projects
Jan Alexandersson
German research center for Artificial Intelligence GmbHStuhlsatzenhausweg 3, Geb. 43.1
Source LanguageSource Language Target LanguageTarget Language
Scanalu2002 23.5 Jan Alexandersson
The VerbMobil Case
SyntacticAnalysis
WordStructure
WordStructure
Direct Translation
Syntactic Transfer
SemanticTransfer
Interlingua
SemanticStructure
SemanticStructure
SemanticAnalysis
SemanticGeneration
SyntacticGeneration
SyntacticStructure
SyntacticStructure
MorphologicAnalysis
MorphologicGeneration
Source LanguageSource Language Target LanguageTarget Language
Speech
Signal
Speech
Signal
ProsodicAnalysis
ProsodicAnnotation
Scanalu2002 23.5 Jan Alexandersson
The Graphical User Interface
Scanalu2002 23.5 Jan Alexandersson
Focuses of Speech Recognitionin Verbmobil
RobustnessMultilinguality
LargeVocabulary
DaimlerChrysler
RWTHAachen
University ofKarlsruhe
Scanalu2002 23.5 Jan Alexandersson
General Speech Recognition Task
GermanGerman
EnglishEnglish
JapaneseJapanese
Audio Signal Recognizers Word Hypotheses Graph
interface between acoustic and linguistic processing
Scanalu2002 23.5 Jan Alexandersson
What Linguistic Analysis Really Needs
• Syntactic Boundaries He saw ? the man ? with the telescope Prosody cannot help
• Dialog Act Boundaries No, I have no time at all on Thursday. D But how about on Friday? Dialog acts are pragmatic units that chunk the input into units which can be processed alone.
• Prosodic Syntactic Boundaries Of course ? not ? on Saturday Syntactic boundaries that correlate to the acoustic-phonetic reality; help during analysis within one chunk/dialog act. Important in spontaneous speech with elliptical utterances.
Scanalu2002 23.5 Jan Alexandersson
Speech Signal Word Hypotheses Graph
Multilingual Prosody ModuleProsodic features:F0 duration energy ....
Search SpaceRestriction
Parsing
Dialog ActSegmentation and
Recognition
Dialog Understand.
Constraints forTransfer
Translation
LexicalChoice
GenerationSpeech
Synthesis
SpeakerAdaptation
BoundaryInformation
BoundaryInformation
BoundaryInformation
BoundaryInformation
SentenceMood
SentenceMood
AccentedWords
AccentedWords
Prosodic FeatureVector
Prosody in Verbmobil
Scanalu2002 23.5 Jan Alexandersson
Facts about Repairs in the Verbmobil Corpus
• 21% of all turns in the Verbmobil corpus (79 562 turns ) contain at least one self correction
• The syntactic category is preserved in most cases(For example: Out of a sample of 266 verb replacements, 224 are again mapped to verbs)
• Repairs take place in a restricted context(in 98% the reparandum consists of less than 5 words)
• Repair sequences underlie certain regularities
Scanalu2002 23.5 Jan Alexandersson
The Understanding of Spontaneous Speech Repairs
I need a car next Tuesday oops Monday
Original Utterance Editing Phase Repair Phase
Reparandum Editing Term Reparans
Recognition ofSubstitutions
Transformation of theWord Hypotheses Graph
I need a car next Monday
Scanalu2002 23.5 Jan Alexandersson
Architecture of Repair Processing “On Thursday I cannot no I can meet äh after one”
Scanalu2002 23.5 Jan Alexandersson
Multiple Approaches• Mono-cultural approaches are dangerous
– humans vs. viruses diversity– Microsoft vs. ILOVEYOU and copycats alternative software solutions
• Some sources of errors in a speech translation system– external
• spontaneous speech: not well formed, hesitations, repairs• bad acoustic conditions• human dialog behavior
– internal• knowledge gaps in modules• software errors• probabilistic processing
Use multiple engines, varying approaches on various stages of processing
Scanalu2002 23.5 Jan Alexandersson
• Exclusive alternatives: three different 16 kHz German speech recognizers with various capabilities
• Competing approaches:
– three parsers: HPSG, Chunk, Statistical
– five translation tracks: case-based, dialog-act based, statistical,
substring- based, linguistic (deep) semantic translation• Needed: selection and combination of results from competing tracks
– parsers: combination of partial analyses in the semantic processing modules
– translation: pre-selection module
Multiple Approaches in Verbmobil
Scanalu2002 23.5 Jan Alexandersson
Multiple Translation Tracks - Approaches and Advantages
• Case-based: – Approach: uses examples from the aligned bilingual Verbmobil corpus– Advantage: good translation if input matches example in corpus
Verbmobil – The BookThere are over 600 refereed papers on the various aspects of and achievements in Verbmobil.
Wolfgang Wahlster (ed.):
"Verbmobil: Foundations of Speech-to-Speech
Translation"Springer-Verlag Berlin Heidelberg
New York. 679 Pages
ISBN 3-540-67783-6
Scanalu2002 23.5 Jan Alexandersson
What is...
?
http://smartkom.dfki.de
Info
rmat
ion
, Ap
plic
atio
ns,
Peo
ple
User(s)
UserModeling
DiscourseManagement
IntentionRecognition
InteractionManagement
ModeAnalysis
Language
Graphics
Gesture
Sound
Media InputProcessing
Media OutputRendering
Reference Architecture for Multimodal Systems
Context Management
ExpectationManagement
User ID
Bio
met
rics
Application Interface
Integrate
Respond
Request
Terminate
Initiate
T
A
V
G
G
ModeCoordination
PresentationDesign
Multimodal ReferenceResolution
Multimodal Fusion
A
A
V
G
G
ModeDesign
Language
Graphics
Gesture
Sound
AnimatedPresentation
Agent
Select Content
Design
Allocate
Coordinate
Layout
UserModel
DiscourseModel
DomainModel
MediaModels
TaskModel
Representation and Inference, States and Histories
ApplicationModels
ContextModel
ReferenceResolution
Action Planning
2 Nov. 2001Dagstuhl SeminarFusion and Coordinationin Multimodal Interactionedited by: M. Maybury
Scanalu2002 23.5 Jan Alexandersson
User specifies goal
delegates task
cooperate
on problems
asks questions
presents results
Service 1 Service 1
Service 2Service 2
Service 3Service 3
IT Services
PersonalizedInteraction
Agent
Situated Delegation-oriented Dialog Paradigm: Collaborative Problem Solving
Scanalu2002 23.5 Jan Alexandersson
The Main Modules on the Control GUI
Scanalu2002 23.5 Jan Alexandersson
More About the System• Modules realized as independent processes• Not all must be there (critical path: speech or graphic input to speech or graphic
output)• (Mostly) independent from display size • Pool Communication Architecture (PCA) based on PVM for Linux and NT
– Modules know only about their I/O pools– Literature:
• Andreas Klüter, Alassane Ndiaye, Heinz Kirchmann: Verbmobil From a Software Engineering Point of View: System Design and Software Integration. In Wolfgang Wahlster: Verbmobil - Foundation of Speech-To-Speech Translation. Springer, 2000.
• Data exchanged using M3L documents • All modules and pools are visualized here ...
Scanalu2002 23.5 Jan Alexandersson
The Real Story
Scanalu2002 23.5 Jan Alexandersson
Frame Languages
Object-oriented ModelingPrimitives
Frame Languages
Object-oriented ModelingPrimitives
NL/MM-Semantics
More formal SemanticsSubsumption, Inferences
NL/MM-Semantics
More formal SemanticsSubsumption, Inferences
W3C Standards
XML Schema/DTDs
W3C Standards
XML Schema/DTDs
M3LM3L
The “Glue“ - M3L: XML based Multimodal Markup Language
Domain Knowledge
NL/MM Representation
Pool Pool Pool. ... .
XML schema XML schema XML schema
Scanalu2002 23.5 Jan Alexandersson
Validation of Dialogue Systems
Analysis
Generator
DatabaseDM
ASR
Synthesis
Dialoguemodel
• Project ValDia (DFKI – DaimlerChrysler ULM)
• Tool for validation of Dialogue Models/Managers (DM)
Automatic
Manual
Scanalu2002 23.5 Jan Alexandersson
Validation of DM• Even slight changes can make test suites for DM invalid
(but not for parser, recognizer, …) • Put persons in front of the complete system
+ We will eventually find errors- It is time consuming
- For some scenarios impossible to exhaustively validate a DM
- What module failed to perform its task?- Combination of errors?
the whole system has to be put together
Scanalu2002 23.5 Jan Alexandersson
Validation of DM• ValDia approach: Replace test person and I/O modules
with ValDia
DatabaseDM
Analysis
Generator
ASR
Synthesis
Dialoguemodel
Scanalu2002 23.5 Jan Alexandersson
Experiences• ValDia detects errors
• Logical:
– Combination of greet und request leads to goal conflict in DM – DM hang!
• Technical:
– After about 500 Dialogues DM crashed due to erroneous memory handling
Scanalu2002 23.5 Jan Alexandersson
What is
Scalability?
Scanalu2002 23.5 Jan Alexandersson
What is Scale (-able)?
• WordNet (1.6):– Noun scaling has 3 senses
• (grading) the act of arranging in a graduated series• act of measuring, arranging or adjusting according to a scale• ascent by or as if by a ladder
– Verb scale has 8 senses• measure by or as if by a scale; "This bike scales only 25 pounds• pattern, make, ... or estimate according to some rate or standard• take by attacking with scaling ladders• (surmount) -- reach the highest point of• climb up by means of a ladder• scale, descale -- remove the scales from; "scale fish"• measure with or as if with scales; "scale the gold"• size or measure according to a scale
Scanalu2002 23.5 Jan Alexandersson
Scaling what/how?
Bigger
Better
Faster
Robuster
PrecisionCoverage
Multilinguality
Cheaper
Depth
Scanalu2002 23.5 Jan Alexandersson
Coverage
Linguisticconstructions
Domain,Task,
Application
Sub-Languages,Type of Lang.
Multilingual,Cultur
Interactionstyle
SIZE
RobustnessDepth
Speed
Scanalu2002 23.5 Jan Alexandersson
Who are we scaling for?
• EU
• NSF
• BMBF
• Industri
• ...
Basic research Research Prototypes
Applied research / Product development
``Real´´ Systems
Scanalu2002 23.5 Jan Alexandersson
Experiences VerbMobil
• ``Many´´ people has said:– With 15-20 persons on one spot I would make a VerbMobil of my
own. But muuuuuch better/cheaper/...• This is not true!