Top Banner
Eric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June 27, 2016 Exploratory Technical Workshop on Safety and Control for AI Carnegie Mellon University
79

Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Jul 05, 2019

Download

Documents

lamdien
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Eric Horvitz

Reflections on Safety

and Artificial Intelligence

Pittsburgh, PAJune 27, 2016

Exploratory Technical Workshop on Safety and Control for AICarnegie Mellon University

Page 2: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

AI & Safety

Constellation of methods referred to as Artificial Intelligence will touch our lives more closely and intimately

AI moving into high-stakes applicationsHealthcareTransportationFinancePublic policyDefense

Much to do on principles, methods, and best practices

Page 3: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Safe AI Systems

Relevance of Multiple Subdisciplines

Page 4: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Safe AI Systems

Relevance of Multiple Subdisciplines

MultiagentSystems

Mechanism Design

Planning

SensorFusion

ControlTheory

Machine Learning

Metareasoning

RobustOptimization

HCI

Verification

Security

Page 5: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Safe AI Systems

Relevance of Multiple Subdisciplines

MultiagentSystems

Mechanism Design

Planning

SensorFusion

ControlTheory

Machine Learning

Metareasoning

RobustOptimization

HCI

Verification

Security

Safety-Critical Systems

Page 6: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

safetyˈsāftē/

noun1. the condition of being protected from or unlikely to cause danger, risk, or injury

Page 7: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

safety-critical ˈsāftēˌkridək(ə)l/

adjective1. systems whose failure could result in loss of life, significant property damage, or damage to the environment.

2. designed or needing to be fail-safe for safety purposes.

Page 8: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

adjectiveincorporating some feature for automatically counteracting the effect of an anticipated possible source of failure

noundevice or practice that, in the event of a failure, responds or results in a way that will cause no harm, or at least minimizes harm.

fail-safe \ˈfāl-ˌsāf\

Page 9: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Fail-safe

George Westinghouse, 1869Train braking system

Fail-safe designAir brakes

Brakes held "off" actively by healthy system

Brakes naturally resort to “on” if any failure of braking system

Page 10: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Fail-safe

George Westinghouse, 1869Train braking system

Brakes held "off" actively by healthy system

Brakes naturally resort to “on” if any failure of braking system

Fail-safe practiceFull-power throttle on arrested landing

Page 11: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Fail-safe

George Westinghouse, 1869Train braking system

Brakes held "off" actively by healthy system

Brakes naturally resort to “on” if any failure of braking system

Fail-safe planFree return trajectory

Page 12: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Fail-safe

George Westinghouse, 1869Train braking system

✓ Mechanism✓ Practice✓ Plan

Brakes held "off" actively by healthy system

Brakes naturally resort to “on” if any failure of braking system

Fail-safe planFree return trajectory

Page 13: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Fail-safe

George Westinghouse, 1869Train braking system

✓ Mechanism✓ Practice✓ Plan

✓ Monitoring

Brakes held "off" actively by healthy system

Brakes naturally resort to “on” if any failure of braking system

Page 14: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Growing interest in issues & directions with AI in real-world settings

Grappling with uncertainty and more general incompleteness

AAAI President’s address (2008), “Artificial Intelligence in the Open World.”

AAAI President’s address (2016), “Steps Toward Robust Artificial Intelligence.”

AI in the Open World

E. Horvitz. Artificial Intelligence in the Open World, AAAI President’s Address, Chicago, IL, July 2008.

T. Dietterich, Steps Toward Robust Artificial Intelligences, AAAI President's Address, Phoenix, AX. February, 2016.

Page 15: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Open-world complexity incomplete understanding

Uncertainties & poor-characterization of performancePoor operating regimes, unfamiliar situations

Challenges of transfer across time & space

Special Considerations with AI

Rich ontology of failures

Numerous failure modalities New attack surfaces (e.g., machine learning attack)Self-modification & gaming (e.g., modify reward fcn)Unmodeled influences

Challenge of coordinating human-machine collaborations

Operational opacity

Page 16: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

AI & Open-World Complexity

Frame problem

How to tractably derive consequences of an action?

Qualification problem

Understanding preconditions required for actions to have intended effects

Ramification problem

Understanding all important effects of action

Page 17: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

AI & Open-World Complexity

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

Page 18: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

AI & Open-World Complexity

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

Page 19: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Decision making under uncertainty & incompleteness

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Page 20: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Decision making under uncertainty & incompleteness

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Page 21: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Decision making under uncertainty & incompleteness

Page 22: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Decision making under uncertainty & incompleteness

Page 23: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Decision making under uncertainty & incompleteness

Page 24: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Decision making under uncertainty & incompleteness

Page 25: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Decision making under uncertainty & incompleteness

Page 26: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Fang, et al., 2015

Direction: Learn about abilities & failures

Performance

Successes & failures

p( fail | E, t)

Predictive model of

confidenceImage

H1

H2

H3

W1

W2

W3

W4

Input s

H3

Caption:

a man holding a tennis

racquet on a tennis court

H1

H2

H3

W1

W2

W3

Input t1

H3

W4

Deep learning about deep learning performance

Page 27: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

InferredState

InferenceReliability

Inference Inference

Performance

Successes & failures

p( fail | E, t)

Predictive model of

confidence

Direction: Learn about abilities & failures

Toyama & H. 2000

Page 28: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

InferredState

InferenceReliability

Inference Inference

Performance

Successes & failures

p( fail | E, t)

Predictive model of

confidence

Direction: Learn about abilities & failures

Toyama & H. 2000

Page 29: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Inference modality A

Inference modality B

Inference modality C

Direction: Robustness via analytical portfolios

Toyama & H. 2000

Page 30: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Robustness via analytical portfolios

Toyama & H. 2000

Page 31: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Robustness via analytical portfolios

Toyama & H. 2000

lights out

jolted camera

facing away

periph. distraction

back. subtract color based motion decay

Perceptual modalitiesUnmodeled situations in open world

Joint inference

Page 32: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Vary model structure, parameters, inferences

Direction: Understanding robustness via sensitivity analyses

Page 33: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Vary model structure, parameters, inferences

Direction: Understanding robustness via sensitivity analyses

Page 34: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Robust optimization to minimize downside

Tamar, 2015; Chow, et al., 2014; per Dietterich, AAAI lect. 2016

Robust optimization under uncertain parameters

Risk-sensitive objective

e.g., conditional-value-at-risk budget

Methods trade upside value

for reducing probability of

costly outcomes

Page 35: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Learn about unknown unknowns

Data, experience, rich simulations

Detect anomalies, unexpected variations, distributional shifts

Meta-analysis & transfer

Human engagement

Page 36: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Learn about unknown unknowns

“Beat the Machine” (Attenberg, Ipeirotis, Provost 2015)

Data, experience, rich simulations

Detect anomalies, unexpected variations, distributional shifts

Meta-analysis & transfer

Human engagement

Page 37: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Predict previously unseen destination

Destinations, E1..,En, t

Day 1 Day 14

Predict new distinctions, combine open- & closed-world models

Krumm, H., 2006

Direction: Learn about unknown unknowns

Page 38: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Predict previously unseen destination

Destinations, E1..,En, t

Day 1 Day 14

Predict new distinctions, combine open- & closed-world models

Direction: Learn about unknown unknowns

Krumm, H., 2006

Page 39: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Predict previously unseen destination

Destinations, E1..,En, t

Day 1 Day 14

Predict new distinctions, combine open- & closed-world models

Direction: Learn about unknown unknowns

Krumm, H., 2006

Page 40: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

2

1.5

x

1

0.5

00

0.5

y

11.5

20.2

0

-0.2

-0.4

-0.6

-0.8

-1

z

Direction: Joint modeling of key dimensions of error

Example: Learn about errors of perception & control

Probabilistic models of control

Probabilistic models of sensing 𝝋𝐨𝐛𝐬𝐭𝐚𝐜𝐥𝐞

𝝋𝐫𝐨𝐥𝐥

Sadigh & Kapoor, 2016

Page 41: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Proposed trajectory

Sample 1 Sample 2 Sample n

S.D.Mean

S.D.

𝝋𝐫𝐨𝐥𝐥

𝝋𝐨𝐛𝐬𝐭𝐚𝐜𝐥𝐞

Direction: Joint modeling of key dimensions of error

Sadigh & Kapoor, 2016

Page 42: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Proposed trajectory Trajectory safe if:

Σ

Σ Σ+> 1 − 𝜖

S.D.Mean

S.D.

Σ

𝝋𝐫𝐨𝐥𝐥

𝝋𝐨𝐛𝐬𝐭𝐚𝐜𝐥𝐞

Direction: Joint modeling of key dimensions of error

Sadigh & Kapoor, 2016

Page 43: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Proposed trajectory

Sample 1 Sample 2 Sample n

Trajectory safe if:

Σ

Σ Σ+> 1 − 𝜖

S.D.Mean

S.D.

Σ

𝝋𝐫𝐨𝐥𝐥

𝝋𝐨𝐛𝐬𝐭𝐚𝐜𝐥𝐞

𝐩 > 𝟏 − 𝝐

Direction: Joint modeling of key dimensions of error

Sadigh & Kapoor, 2016

Page 44: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Joint modeling of key dimensions of error

Value of refining models & system- Value of additional data- Value of enhancing sensors- Value of better controller

Sadigh & Kapoor, 2016

(video)

Page 45: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Joint modeling of key dimensions of error

𝐩 > 𝟏 − 𝝐

Sadigh & Kapoor, 2016

(video)

Page 46: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Joint modeling of key dimensions of error

𝐩 > 𝟏 − 𝝐

Fail-safe

Sadigh & Kapoor, 2016

(video)

Page 47: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Joint modeling of key dimensions of error

Sadigh & Kapoor, 2016 (video)

Page 48: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Verification, security, cryptography

Verification

Cryptography

Security

Safe AI Systems

Page 49: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Static analysis

Run-time verification

Whitebox fuzzing

Cybersecurity to protect attack surfaces

Appropriate use of physical security, isolation

Encryption for data integrity, protection of interprocess comms.

Direction: Verification, security, cryptography

Page 50: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Difficult to do formal analysis of large-scale system

Analysis & execution considers info. from running system

Direction: Runtime verification

Satisfy or violate desired properties?

Identify problem, future problem

Engage human

Take fail-safe action

Page 51: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

State’

Page 52: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward

State’

Page 53: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Learning

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward

State’

Page 54: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Learning

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward

State’

Reinforcement

Page 55: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

State

Learning

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward Reinforcement

State’ Perception

Page 56: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Learning

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward

State’ Perception

Reinforcement

Adversary

Page 57: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Learning

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward

State’ Perception

Reinforcement

Adversary

Self-modification

e.g., see: Amodei, Olah, et al., 2016

Page 58: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Run-time verificationStatic analysis

Learning

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward

State’ Perception

Reinforcement

Adversary

Reflective analysis• Operational faithfulness• Ensure isolation, detect mods• Identify external meddling

Page 59: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Human-machine collaboration

Models of human cognition

Transparency of state, explanation

Mastering coordination of initiatives

Page 60: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Human-machine collaboration

Air France 447 (June 2009)

China Airlines 006 (Feb 1985)

747 dives 10,000 in 20 seconds. 5g, supersonic.

Unrecoverable stall.

Page 61: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Human-machine collaboration

Air France 447 (June 2009)

China Airlines 006 (Feb 1985)

747 dives 10,000 in 20 seconds. 5g, supersonic.

Unrecoverable stall.

Page 62: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Machine learning & inference

Human cognition

Machineintelligence

How to best work together for safety?

Rich spectrum of autonomy

Direction: Human-machine collaboration

Kamar, Hacker, H., 2012

Page 63: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Designs for mix of initiatives

Machine learning & inference

Human cognition

Machineintelligence

How to best work together for safety?

Rich spectrum of autonomy

Direction: Human-machine collaboration

Kamar, Hacker, H., 2012

Page 64: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Human-machine collaboration

Page 65: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Human-machine collaboration

Infer challenges with machine competency

Page 66: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Infer challenges with machine competency

Direction: Human-machine collaboration

Infer human attention

Page 67: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Human-machine collaboration

p(a

tte

nti

on

sta

te|E

)

Time

Infer human attentionInfer challenges with machine competency

Page 68: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Human-machine collaboration

Continual prediction of trajectories

p(a

tte

nti

on

sta

te|E

)

Time

Infer human attention

Page 69: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Human-machine collaboration

Continual prediction of trajectories

p(a

tte

nti

on

sta

te|E

)

Time

Infer human attention

Page 70: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Human-machine collaboration

Continual prediction of trajectories

p(a

tte

nti

on

sta

te|E

)

Time

Infer human attention

Page 71: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Direction: Human-machine collaboration

Safety-assuring mixed-initiative planner

- Driver’s attention over time

- Latency of human input

- Latency tolerance of situation

- Cost & influence of alerting driver

- Custom language, ongoing dialog

Gain driver attention t

Slow to defer need t’

Implement failsafe t’’

Page 72: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

• Phases of study, testing, reporting for rolling out new capabilities in safety-critical domains (akin to FDA clinical trials, post-marketing surveillance)

• Disclosure & control of parameters on failure rates, tradeoffs, preferences

• Transparency & explainability of perception, inference, action

• System self-monitoring & reporting machinery

• Isolation of components in intelligence architectures

• Detecting & addressing feedback of system’s influence on self

Direction: Develop Best Practices for Safe AI

Page 73: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

• Standard protocols for handoffs, attention, awareness, warning, in human-machine collaborations

• Policies for visible disclosure of autonomy to others (e.g., indication to others that a car is currently on automated policy)

• Fail-safe actions & procedures given predicted or sensed failures

• Enhancing robustness via co-design of environment & systems

• Testing for drift of assumptions, distributions in domains

• Special openness & adherence to best practices for data, learning, decision making for applications in governance & public policy

Direction: Develop Best Practices for Safe AI

Page 74: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Addressing concerns of public

Significant differences of opinion, including experts

Direction: Address concerns about “superintelligences”

Page 75: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Addressing concerns of public

Significant differences of opinion, including among experts

Direction: Address concerns about “superintelligences”

Page 76: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Alan Turing Script, BBC broadcast, 1951

Addressing concerns of public

Significant differences of opinion,

Direction: Address concerns about “superintelligences”

Page 77: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

Alan Turing Script, BBC broadcast, 1951

Addressing concerns of public

Significant differences of opinion,

Direction: Address concerns about “superintelligences”

“For it seems possible that once the machine thinking method had started, it would not take long to outstrip our feeble powers.

…they would be able to converse with each other to sharpen their wits.

At some stage therefore, we should have to expect the machines to take control in the way that is mention in Samuel Butler’s Erewhon.”

Alan Turing, 1951

Page 78: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June

• Do we understand possibilities?

• What kind of research should done proactively?

• Can we “backcast” from imagined poor outcomes

• Designs of clear ways to thwart possibilities, ease concerns

Addressing concerns of public

Significant differences of opinion, including experts

Direction: Address concerns about “superintelligences”

Page 79: Reflections on Safety and Artificial Intelligenceerichorvitz.com/OSTP-CMU_AI_Safety_framing_talk.pdfEric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June