Top Banner
Machine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford University IJCAI - July 13
46

Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Apr 15, 2018

Download

Documents

dinhthien
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Machine Learning and Decision

Making for Sustainability

Stefano Ermon

Department of Computer Science

Stanford University

IJCAI - July 13

Page 2: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Vision

2

Technology

Push

Society

Pull

Big Data

Artificial Intelligence

Sensing revolution

Crowdsourcing

Page 3: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Challenges in reasoning about complex systems

Operations Research

Management Science

Economics

Three major challenges:

1. High dimensional spaces: need to consider many variables

2. Uncertainty: limited information, need to use stochastic models

3. Preferences and utilities: need to consider optimization criteria

Statistics

Computer Science Engineering

Physics

Page 4: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Yield mapping [Ongoing]

Computational Sustainability

Uncertainty

Materials discovery[UAI-16, AAAI-15, SAT-12]

Large scale Poverty mapping

[AAAI-16]

High dimensional spaces

Preferences and utilities

4

Migrations[AAAI-15]

natural resources management [UAI-10, IJCAI-11]

Page 5: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Research Agenda

5

Foundations

Applications

inference by hashing and optimization

[ICML-13, UAI-13, ICML-14, UAI-15, AISTATS-16, AAAI-16, ICML-16]

bridging statistical physics and computer science

[CP-10, IJCAI-11, NIPS-11, NIPS-12]

sampling[NIPS-13, UAI-12, AAAI-16]

Materials discovery[UAI-16, AAAI-15, SAT-12]

Large scale Poverty mapping

[AAAI-16] Migrations[AAAI-15]

Page 6: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Why poverty?

• #1 UN Sustainable Development Goal

– Global poverty line: $1.90/person/day

• Understanding poverty can lead to:

– Informed policy-making

– Targeted NGO and aid efforts

Page 7: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Data scarcity

7

• Expensive to conduct surveys

• Poor spatial and temporal resolution

• Questionable data quality

Page 8: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Satellite imagery is low-cost and globally available

8

• Challenge: Lots of useful information, but data is unstructured

Shipping recordsInventory estimates

Agricultural yieldDeforestation rate

• Many cheap, unconventional data sources: remote sensing, phones/smartphones, crowdsourcing, …

• Remote sensing is becoming cheaper and more accurate

Economic indicators

Page 9: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Standard supervised learning won’t work

- Very little training data

- Nontrivial for humans

Poverty, wealth, child mortality, etc.Model

Input Output

Page 10: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Transfer learning overcomes label scarcity

Transfer learning: Use knowledge gained from one task to solve a different (but related) task

Page 11: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Transfer learning bridges the data gap

A. Satellite images C. Poverty measures

Data gap!

Page 12: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Transfer learning bridges the data gap

A. Satellite images C. Poverty measures

Data gap!

B. Proxy outputs

Model

Plenty of data!

Page 13: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Transfer learning bridges the data gap

A. Satellite images C. Poverty measures

Data gap!

B. Proxy outputs

Model

Plenty of data!

Less data needed

Page 14: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Nighttime lights as proxy for economic development

Page 15: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Nighttime lights as proxy for economic development

Page 16: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Nighttime lights as proxy for economic development

Page 17: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Training data on the proxy task is plentiful

> 300,000 training images

training images sampled from these locations

Low nightlight intensity

,

High nightlight intensity

,

(

(

)

)

Labeled input/output training pairs

Page 18: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Images summarized as low-dimensional feature vectors

Inputs: daytime satellite images

{Low, Medium, High}

Outputs: Nighttime light intensities

Convolutional Neural Network

(CNN)

Page 19: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Images summarized as low-dimensional feature vectors

Have we learned to identify useful features?

f1

f2

f4096

PovertyNonlinear mapping

Inputs: daytime satellite images

{Low, Medium, High}

Outputs: Nighttime light intensities

Convolutional Neural Network

(CNN)

Target task

Page 20: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Model learns relevant features automatically

Satellite image

Filter activation map

Overlaid image

Urban Non-urban Water Roads

Page 21: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Model learns relevant features automatically

Page 22: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Poverty is NOT spatially independent

Uganda poverty rates (2005)

How to model spatial correlations?

Page 23: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Use Gaussian Process to model spatial correlations

23

Ou

tpu

t va

lue

r(

x)

space xspace x

Ou

tpu

t va

lue

r

(x)

How to take images into account?

Page 24: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Linear GP model

24

Key Idea: combine GP with CNN

Features from CNN

Gaussian process layer

Location 1 Location n…

Page 25: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Predicting household asset-based wealth

We outperform recent methods based on mobile call record data

Blumenstock et al. (2015) Predicting Poverty and Wealth from Mobile Phone Metadata, Science

Page 26: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Predicting Poverty from Space

26

Estimates from the model using about 500,000 images from Uganda:

Scalable and inexpensive approach to generate high resolution maps.

Most up-to-date map

Page 27: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Ongoing work: food security

• Mapping and estimating crop yields

• Est. 2 billions more people to feed by 2050: information

technologies will have to play a role to increase

productivity

27

Page 28: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Increasing productivity

Crop Challenge: which soybean varieties to plant to

maximize yield, given knowledge about soil and climate?

28

Stochastic

optimization

maximize expected yield

small variances.t.

Data

> 100 varieties

Uncertainty

model

Monte Carlo simulations

Yield model

Ensemble of ML techniques

1st prize

Page 29: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Yield mapping [Ongoing]

Computational Sustainability

Uncertainty

Materials discovery[UAI-16, AAAI-15, SAT-12]

Large scale Poverty mapping

[AAAI-16]

High dimensional spaces

Preferences and utilities

29

Migratory pastoralism[AAAI-15]

natural resources management [UAI-10, IJCAI-11]

Page 30: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Motivation: migratory pastoralism

30

8 million pastoralists in Ethiopia and 3 million in Kenya

depend on livestock to make a living, relying on the vast

arid and semi-arid rangelands of East Africa.

Page 31: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Motivation: migratory pastoralism

• Issues: droughts, environmental degradation, climate change

31

Understanding herding and grazing choices is critical to characterizing

the impact of policy interventions on pastoralists’ lives and on the

ecosystem of the region

Page 32: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Motivation: migratory pastoralism

32

Develop a generative model to capture

the decision making processes of pastoralists

Can use the model to predict what would happen if we provide

insurance, build new water points, climate changes, …

GPS Collar Data

5 min intervals

NDVI Precipitation

Land coverAgronomy

Environmental Remote Sensing Socio-Economic

Field Surveys

Herd size

Household size

Home location

Household

assets

Education

Age

Page 33: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Migratory pastoralism: Ethiopia data

33

Camp

Water point

Trajectories(color varies by household)

GPS collar traces

Anonymized for privacy

Page 34: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Markov Decision Process

• Markov Decision Process

• States S

• Actions A

• Reward function (immediate):

• Transition Probabilities: P(s’|s,a)

• Planning Problem/ Reinforcement learning: pick actions to

maximize (expected discounted) total reward

– Policy: sequence of decision rules that prescribe the action to be

taken (depending on the current state)

r :S®R

+5

+1

0

Page 35: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Estimation Problem

35

Inverse Reinforcement

Learning (IRL)

Optimal

policy p*

Environment

Model (MDP)

Expert’s

Trajectories

s0, s1, s2, …

Reward R that

explains expert

trajectories

Assumptions:- Agents (pastoralists) are following a policy- Rational: Policy is optimal with respect to (unknown) reward R- Goal: estimate R from the trajectories

Reinforcement learning

Page 36: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Inverse RL

Update reward

Run planning

Compare with

expert

Page 37: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Inverse RL

Update reward

Run planning

Compare with

expert

Gradient

step

Gradient

computation

Page 38: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Inverse RL

Expensive: have to solve a

planning problem in a learning loop

Update cost

Run planning

Compare with

expert

Page 39: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Simultenous Learning and planning

Expensive: have to solve a

planning problem in a learning loop

Update cost

Run planning

Compare with

expert

TT-1T-2…

Dynamic Programming Table:

we have an optimal policy for the last 4 steps

Can update the cost as we fill the DP table

Page 40: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Convergence Rate

40

Our algorithm converges much faster (20x).

Page 41: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Policy gradient approach

Expensive: have to solve a

planning problem in a learning loop

Update cost

Run planning

Compare with

expert

What if state space is too big for

Dynamic Programming?

Policy gradient (with TRPO)• Model free

• Fast

• Scales to high-dimensional,

raw observations

Model-Free Imitation Learning with Policy Optimization” ICML-16

Page 42: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Migratory pastoralism: Ethiopia data

42

Camp

Water point

Trajectories(color varies by household)

Features: distance between camps, greenness, dist. to water and village, distance to road, etc.

Anonymized for privacy

Page 43: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Evaluation

• 4 fold cross-validation: log-likelihood and number of

predicted (movements)

• Can fit well to the data

• The trained model recovers facts that are consistent

with our intuition, e.g. herders prefer short travel

distances

• Future work: compare what-if predictions with a

randomized trial43

Page 44: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford
Page 45: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Generative Adversarial Imitation Learning, 2016 on Arxiv

Ongoing work

Page 46: Machine Learning and Decision Making for …ermon/slides/ermon_ijcai_early.pdfMachine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford

Conclusions

46

• Growing concerns about the threats of AI to the future of

humanity

• Recent advances in AI also create enormous opportunities

for having deeply beneficial influences on society

(healthcare, education, sustainability, …)

• New opportunities for CS research

Computational Sciences

Sustainability

Sciences

Computational

Sustainability