Top Banner
Georgia Tech / Mobile Intelligence 1 Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems DARPA MARS Kickoff Meeting - July 1999
21

Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Mar 18, 2016

Download

Documents

amory

Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems. DARPA MARS Kickoff Meeting - July 1999. Georgia Tech College of Computing Prof. Ron Arkin Prof. Ashwin Ram Prof. Sven Koenig Georgia Tech Research Institute Dr. Tom Collins - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 1

Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot

Architectural Software Systems

DARPA MARS Kickoff Meeting - July 1999

Page 2: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 2

Personnel Georgia Tech

– College of Computing Prof. Ron Arkin Prof. Ashwin Ram Prof. Sven Koenig

– Georgia Tech Research Institute

Dr. Tom Collins Mobile Intelligence Inc.

Dr. Doug MacKenzie

Page 3: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 3

Impact Provide the DoD community with a platform-

independent robot mission specification system, with advanced learning capabilities

Maximize utility of robotic assets in battlefield operations

Demonstrate warfighter-oriented tools in three contexts: simulation, laboratory robots, and government-furnished platforms

Page 4: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 4

New Ideas Add machine learning capability to a proven robot-independent architecture with a user-accepted human interface Simultaneously explore five different learning approaches at appropriate levels within the same architecture Quantify the performance of both the robot and the human interface in military-relevant scenarios

Page 5: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 5

Adaptation and Learning Methods

Case-based Reasoning for:– deliberative guidance

(“wizardry”)– reactive situational- dependent

behavioral configuration Reinforcement learning for:

– run-time behavioral adjustment– behavioral assemblage

selection Probabilistic behavioral

transitions– gentler context switching– experience-based planning

guidance

Available Robots and Available Robots and MissionLabMissionLab Console Console

Page 6: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 6

AuRA - A Hybrid Deliberative/Reactive Software Architecture

Reactive level– motor schemas– behavioral fusion via

gains Deliberative level

– Plan encoded as FSA

– Route planner available

Page 7: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 7

1. Learning Momentum Reactive learning via dynamic gain alteration

(parametric adjustment) Continuous adaptation based on recent

experience Situational analyses required In a nutshell: If it works, keep doing it a bit

harder; if it doesn’t, try something different

Page 8: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 8

2. CBR for Behavioral Selection

Another form of reactive learning Previous systems include: ACBARR and SINS Discontinuous behavioral switching

Page 9: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 9

3. Q-learning for Behavioral Assemblage Selection

Reinforcement learning at coarse granularity (behavioral assem-blage selection)

State space tractable Operates at level above

learning momentum (selection as opposed to adjustment)

Page 10: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 10

4. CBR “Wizardry” Experience-driven

assistance in mission specification

At deliberative level above existing plan representation (FSA)

Provides mission planning support in context

Page 11: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 11

5. Probabilistic Planning and Execution

“Softer, kinder” method for matching situations and their perceptual triggers

Expectations generated based on situational probabilities regarding behavioral performance (e.g., obstacle densities and traversability), using them at planning stages for behavioral selection

Markov Decision Process, Dempster-Shafer, and Bayesian methods to be investigated

Page 12: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 12

Integration with MissionLab

Usability-tested Mission-specification software developed under DARPA funding (RTPC/UGV Demo II/TMR programs)

Incorporates proven and novel machine learning capabilities Extends and embeds deliberative Autonomous Robot

Architecture (AuRA) capabilitiesArchitecture Subsystem Specification Mission OverlayArchitecture Subsystem Specification Mission Overlay

Configuration Editor

Communications Expert

User Data Logging

Hummer Groundstation

MissionLab Console

Runtime Data Logging

Reactive Behaviors

Hardware Drivers

Low-level Software

Robotic Hardware

"Robot" "Robot" "Robot" "Robot"

RUNTIME

EXECUTIVE

PREMISSION

IPTIPT IPT IPT

IPT

IPT

Real-time Specification

Page 13: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 13

Development Process with Mlab

Behavioral SpecificationMissionLab

Simulation Robot

Page 14: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 14

MissionLab Example: Scout Mission

Page 15: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 15

MissionLab EXAMPLE: LAB FORMATIONS

Page 16: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 16

MissionLab Example: Trashbot (AAAI Robot Competition)

Page 17: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 17

MissionLabReconnaissance Mission

– Developed by University of Texas at Arlington using MissionLab as part of UGV Demo II

– Coordinated sensor pointing across formations

Page 18: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 18

Evaluation: Simulation Studies

Within MissionLab simulator framework Design and selection of relevant

performance criteria for MARS missions (e.g., survivability, mission completion time, mission reliability, cost)

Potential extension of DoD simulators, (e.g., JCATS)

Page 19: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 19

Evaluation: Experimental Testbed

Drawn from our existing fleet of mobile robots

Annual Demonstrations

Page 20: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 20

Evaluation: Formal Usability Studies

Test in usability lab Subject pool of

candidate end-users Used for both

MissionLab and team teleautonomy

Requires develop-ment of usability criteria and metrics

Page 21: Multi-Level Learning in Hybrid Deliberative/Reactive Mobile Robot Architectural Software Systems

Georgia Tech / Mobile Intelligence 21

Schedule

Milestone

Demonstration of all learning algorithms in simulation

Initial integration within MissionLab on lab robots

Learning algorithms demonstrated in relevant scenarios

MissionLab demonstration on government platforms

Enhanced learning algorithms on government platforms

Final demonstrations of relevant scenarios with govt. platforms

Oct Jan Apr

GFY04Jan Apr JulJul Oct

GFY01 GFY02 GFY03Jul Oct Jan AprJul Oct Jan Apr