1 Multi-agent architectures that facilitate apprenticeship learning for real-time decision making: Minerva and Gerona David C. Wilkins Center for Study of Language and Expertise Stanford University David Fried Department of Computer Science University of Illinois at U-C November 5, 2005 Supported by ONR: N00014-00-1-0660, N00014-02-1- 0731
31
Embed
David C. Wilkins Center for Study of Language and Expertise Stanford University David Fried
Multi-agent architectures that facilitate apprenticeship learning for real-time decision making: Minerva and Gerona. David C. Wilkins Center for Study of Language and Expertise Stanford University David Fried Department of Computer Science University of Illinois at U-C November 5, 2005 - PowerPoint PPT Presentation
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
1
Multi-agent architectures that facilitate apprenticeship learning for
real-time decision making: Minerva and Gerona
David C. WilkinsCenter for Study of Language and Expertise
Stanford University
David FriedDepartment of Computer Science
University of Illinois at U-C
November 5, 2005Supported by ONR: N00014-00-1-0660, N00014-02-1-0731
2
Outline
• Goal– Expert shells multi-agent capabilities
• Minerva – medical diagnosis (1992-1994)– Apprentice program observes expert, improves agent
• Genona – ship damage control (2002-2005)– Apprentice program observes student, improves student
• Summary and conclusions
3
Expert Shells -> Multi-Agent Capabilities
• Traditional performance capabilities– Correct solution, Efficient problem solving
• Research philosophy– Critiquing & apprenticeship should be natural artifact of shell architecture– Same apprenticeship method should support both learning and tutoring– Unified arch for dimensions of expertise is approach to cognitive modeling
4
Apprenticeship Learning Paradigm
Problem
Human Expert Problem Solver Agent Actions Learning Actions Program
KN Differences
• Situated Learning: within context of problem solving• Good for knowledge refinement of human or expert agent
5
Apprenticeship Learning Challenges
• Global credit assignment – Does good explanation of human action exist?– Challenge: some explanation usually exists
• Local credit assignment– What KN difference creates good explanation?– Challenge: Many repairs will create explanation
• Variance among human problem solvers– How to distinguish between allowable variations among
human problem solvers (who among other things often disagree) and variations that suggest knowledge errors
• Solution– Minerva shell architecture
6
Minerva-Based Apprenticeship Learning: Domain of Neurology Diagnosis
1. Debra Arbed, a 39 year old black female.2. Chief complaint is headache, nausea, vomiting, stiff neck.3. Headache duration? 6 hours.4. Headache severity? 4 on scale of 0-4.5. Fever? No. 6. Recent seizures? No.7. Visual problems? No. 8. Headache onset? Abrupt.30. Final diagnosis is subarachnoid hemorrhage.31. Secondary dx is acute bacterial meningitis.
7
Evolution of Decision-Making Expert Shells:Separation of Different Knowledge Types
Domain Kn
Task Kn
Sched Kn
Inference
Inference
Inference
Program
Minerva(1992)
Odysseus2(1994)
Neomycin(1982)Guidon2(1987)
Odysseus(1988)Mycin
(1972)GuidonTieresias(1978)
Task Kn
Domain Kn Domain Kn
8
Domain, Task, and Scheduling KN are Distinct
• Domain KN: vocabulary and predicates mention domain• Task KN: no mention of domain (e.g., medicine):
• Damage Control Assistant (DCA)– Responsible for overall crisis management– Makes damage control decisions– Coordinates investigation and repair teams
13
“Damage Control Assistant” ExpertiseHow to get decision-making practice?!
• Expertise requires practice – Time-critical decision-making – High stress, information overload– Uncertain and incomplete information
• “Whole task” practice difficult to acquire– Actual ship crises infrequent – Realistic practice expensive and dangerous – Rotation cycle is 2-3 years
14
The DCA Decision-Making Task:Fires, Smoke, Floods, Ruptures, etc
• Event to DCA: fire observed in compartment 1-174-0-L
• Event to DCA: pipe rupture observed compart 1-191-0-Q
• Action by DCA: send repair party to compart 1-174-0-L
• Action by DCA: go to General Quarters (GQ)
• Action by DCA: start fire pump #3 on port side
• Critique to DCA: Error of omission: must request permission of CO to turn on fire pump during GQ
• Action by DCA: Close firemain valve 3-274-2
• Critique to DCA: Error of commission: valve 3-274-2 does not isolate pipe rupture
compartment”,[compartment = Compartment, station = Station], _, _)
23
Graph Modification Operators (cont)
THENaction(modify, correct, 5120, “Fight fire in space”,
[compartment <- Compartment, station <- Station], _, A)goal(modify, addressed, 7118, “Apply fire suppressant”,
[compartment <- Compartment], _, G)END RULE
…
END GMO
24
Meta-GMO Question Types• About 100 templates cover all past instructor-student QAs
– “Why” questions for justifying CSG nodes (12)• “Why should I have ordered firefighting?”
– “What” questions for retrieving expert recommendations (32)• “What should I have done after I got the fire report?”
– “What if” questions to get critiques on hypothetical actions (4)• “What if I ordered fire boundaries to be set?”
– “When/How” questions to explain domain rules (9)• “How do you determine what repair locker has jurisdiction?”
– “When/What/Is” questions evaluate conditions and relations (26)• “Is there a starboard fire pump on at 3:00?”
– More complex questions involving chaining and inference (14)• “How can I satisfy the preconditions for dewatering?”• “If I ordered smoke boundaries, what could I do then?”
25
Meta-GMO Example• “When is it appropriate to order firefighting?”• Question ECL 9300 “when action”
“There are two conditions under which you should order firefighting.
“First, when you receive a report that electrical and mechanical isolation has completed, you still need to extinguish the fire in that compartment, you have either active desmoked the compartment or do not need to active desmoke the compartment, and either there is no halon or halon has failed, find the best repair locker for that compartment, and order that repair locker to fight the fire in the compartment.
“Second, when you receive a report that halon has failed, you have either isolated the compartment or the compartment cannot be isolated, and you have either active desmoked the compartment or do not need to active desmoke the compartment, find the best repair locker for that compartment, and order that repair locker to fight the fire in the compartment.”
27
In English(intelligent translation)
“There are two things that might trigger ordering firefighting. The first is a report of electrical and mechanical isolation achieved, and the second is a report that halon has failed.
“The first case only applies when you need to extinguish a fire. You also need to have active desmoked the compartment, if necessary, and if the compartment has halon, it has to already have failed.
“In the second case, you must have active desmoked if necessary and isolated the compartment if possible.
“In both cases, you should send the best repair locker for the compartment to fight the fire.”