Top Banner
Intelligent Perception S. M. Ali Eslami December 2016
60

Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Aug 19, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Intelligent Perception

S. M. Ali Eslami

December 2016

Page 2: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas
Page 3: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas
Page 4: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas
Page 5: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas
Page 6: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Underlying scene Observation

?

Page 7: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Underlying scene Observation

?

1. How should the scene be represented?

2. How should the representation be computed?

Page 8: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Learning paradigms

x

z

SupervisedLearning

yhorse

Page 9: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Computer

Horse

Cow

OutputInput

Prep

roce

ssin

g

Feat

ure

Extr

actio

n

Feat

ure

Sele

ctio

n

Lear

ned

Dis

crim

inat

ion

Calib

ratio

n

Algorithm

Page 10: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Computer

Horse

Cow

OutputInput

Stag

e 1

Stag

e 2

Stag

e 3

Stag

e 4

Stag

e 5

Algorithm

Page 11: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Introduction

● Optimize directly for the end loss

● End-to-end training, no engineered inputs

● With enough data, learn a big non-linear function

● Supervised labeling is often enough for transferrable representations

● Large labeled dataset + big / deep neural network + GPUs

Deep Supervised Learning

Page 12: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Clarifai (2014)

Page 13: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Introduction

Deep Supervised Learning

Zhang et al. (2015) Simonyan et al. (2014)

Text Classification Video Classification

Page 14: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Introduction

● Innovation continues○ Inception (Szegedy et al., 2015)○ Residual connections (He et al., 2015)○ Batchnorm (Ioffe et al., 2015)

● Performance is continuously improving

Deep Supervised Learning

Szegedy et al., (2015)

Page 15: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Where does the data come from?

What is the correct representation?

Page 16: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Learning paradigms

x

z

x

SupervisedLearning

Reinforcement Learning

y

z

ahorse left env

Page 17: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Human-level control in ATARI

End-to-end reinforcement learning

Mnih et al. (2015)

Page 18: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

How much experience do we really need?

Page 19: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Learning paradigms

Decoder

x

z

xx

z

x

SupervisedLearning

Reinforcement Learning

GenerativeModelling

y

z

a yhorse left env

Page 20: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Learning paradigms

Decoder

x

z

xx

z

x

SupervisedLearning

Reinforcement Learning

GenerativeModelling

y

z

a y

(2.3, -1, 0.5, 3)

not blinkinghorse left env

Page 21: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Highly structured

General Purpose Graphics Programming

Vikash Mansinghka, Tejas D. Kulkarni, Yura N. Perov, and Joshua B. Tenenbaum (2013)

Page 22: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Partially structured

A Stochastic Grammar of Images

Song-Chun Zhu and David Mumford (2007)

Page 23: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Partially structured

S. M. Ali Eslami and Christopher K. I. Williams (2012)

Page 24: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Fully unstructured

Geoffrey Hinton (2006) Antti Rasmus et al. (2016) Jeff Donahue et al. (2016)

Page 25: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Attend, Infer, Repeat: Fast Scene Understanding with Generative Models

S. M. Ali Eslami, Nicolas Heess, Theophane Weber, Yuval Tassa, Koray Kavukcuoglu, Geoffrey HintonNeural Information Processing Systems (NIPS), 2016

Page 26: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

To obtain object-based representationsTo learn from orders-of-magnitude less data

Motivation

Page 27: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

x

z

blue brick

Mod

elIm

age

Cau

se

Page 28: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

x

z

blue brick pile of bricks

x

z

Mod

elIm

age

Cau

se

not sufficient forgraspingcountingtransfergeneralisation

Page 29: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

x

z

x

z1 z2

Mod

elIm

age

Cau

se

blue brick red brickpile of bricks

Page 30: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

x

z

Mod

elIm

age

Cau

se

x

zwhat

y1

z1 zwherez1 zwhat

y2

z2 zwherez2

atty1

atty2

blue brick red brickpile of bricks blue brickabove

red brickbelow

x

z1 z2

Page 31: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Decoder

x y

z Decoder

x y

h1 h2 h3

z1 z2 z3

x

z

x

z1 z2 z3

Mod

elIn

fere

nce

Net

wor

k

Page 32: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Decoder

x y

h1 h2 h3

z1 z2 z3 Decoder

x y

h1 h2 h3

zpresz1 zpresz2 zpresz3zwhatz1 zwhatz2 zwhatz3zwherez1 zwherez2 zwherez3

x

zwhat

y1

z1 zwherez1 zwhat

y2

z2 zwherez2

atty1

atty2

Mod

elIn

fere

nce

Net

wor

k

x

z1 z2 z3

Page 33: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Decoder

x y

h1 h2 h3

z1 z2 z3 Decoder

x y

h1 h2 h3

zpresz1 zpresz2 zpresz3zwhatz1 zwhatz2 zwhatz3zwherez1 zwherez2 zwherez3

x

zwhat

y1

z1 zwherez1 zwhat

y2

z2 zwherez2

atty1

atty2

Mod

elIn

fere

nce

Net

wor

k

x

z1 z2 z3

focus on representation not reconstruction

output is a setorder? count?

Page 34: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

x y

zpres

zwhatxatt yatt

hi

zwhere...

VA

E

yi

i ii

i

i

... ...

Page 36: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Omniglot

Page 37: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

6

9

no

yes

Representational power

Sum? Increasing order?

Page 38: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Additional structure

x

z

distributed vector that correlates with blue brick

learned

Page 39: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

x

z

Additional structure

x

z

distributed vector that correlates with blue brick

class=brickcolour=blueposition=Protation=R

learned

specified

Page 40: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas
Page 41: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas
Page 42: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Decoder

x y

h1 h2 h3

z1 z2 z3

x

z1 z2 z3

Additional structure

specified

Page 43: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Inverse graphics

Page 44: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas
Page 45: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Policy learning

Tabl

e-to

pM

NIS

T

Page 46: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Unsupervised Learning of 3D Structure from Images

Danilo Rezende, S. M. Ali Eslami, Shakir Mohamed, Peter Battaglia, Max Jaderberg, Nicolas HeessNeural Information Processing Systems (NIPS), 2016

Page 47: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

To recover 3D structure from 2D imagesTo form stable representations, regardless of camera position

Motivation

Page 48: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

To recover 3D structure from 2D imagesTo form stable representations, regardless of camera position

● Inherently ill-posed○ All objects appear under self occlusion, infinite explanations○ Therefore build statistical models to know what’s likely and what’s not

● Even with models, inference is intractable○ Important to capture multi-modal explanations

● How are 3D scenes best represented?○ Meshes or voxels?

● Where is training data collected from?

Motivation

Page 49: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Unsupervised Learning of 3D Structure from Images

Page 50: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Unsupervised Learning of 3D Structure from Images

Page 51: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Projection operatorsUnsupervised Learning of 3D Structure from Images

Page 52: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Unsupervised Learning of 3D Structure from Images

Unconditional samples

Page 53: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Unsupervised Learning of 3D Structure from Images

Class-conditional samples

Page 54: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Unsupervised Learning of 3D Structure from Images

Class-conditional samples

Page 55: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Unsupervised Learning of 3D Structure from Images

Multi-modality of inference

Page 56: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Unsupervised Learning of 3D Structure from Images

3D structure from multiple 2D images

Page 57: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Unsupervised Learning of 3D Structure from Images

Inferring object meshes

Page 58: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

Unsupervised Learning of 3D Structure from Images

Inferring object meshes

Page 59: Intelligent Perception - ACMacm.ut.ac.ir/.../post_attachment/attachment/87/Intelligent_Perception… · Highly structured General Purpose Graphics Programming Vikash Mansinghka, Tejas

● Deep Supervised Learning

● Deep Reinforcement Learning

● Model-based Methods

● Structured / Unstructured Generative Models

Recap