Top Banner
by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, Pieter Abbeel UC Berkeley, Department of Electrical Engineering and Computer Sciences OpenAI
33

by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Aug 01, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, Pieter Abbeel

UC Berkeley, Department of Electrical Engineering and Computer Sciences

OpenAI

Page 2: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Unsupervised learning of disentangled representation

Usually, learned representation is entangled

(encoded in complicated manner)

When representation is disentangled,

it would be easier to apply to tasks

Page 3: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Disentangling information

man with glasses man without glasses woman

{ { {

Page 4: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Supervised Learning Unsupervised Learning

“to learn is

to recognize”

“to learn is to

replicate”

Page 5: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 6: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

max log𝐷(𝑥) + log(1 − 𝐷(𝐺(𝑧))max log𝐷(𝐺(𝑧)

Page 7: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

where

𝐷 𝑥 =)𝑃𝑑𝑎𝑡𝑎(𝑥

)𝑃𝑑𝑎𝑡𝑎(𝑥) + 𝑃𝐺(𝑥

Page 8: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 9: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 10: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 11: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 12: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 13: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 14: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Mutual information between latent code c and

generator distribution G(z, c) should be high

Page 15: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

measures the “amount of information” learned from knowledge

of random variable Y about the other random variable X

is the reduction of

uncertainty in X

when Y is observed

Page 16: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Given 𝑥 𝑃𝐺 𝑥𝑃𝐺 с|𝑥 should have

small entropyProblem!

𝐼(𝑐; 𝐺(𝑧, 𝑐) )is hard to minimize

directly because of access

to the posterior 𝑃(𝑐|𝑥).

Page 17: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

𝐼 𝑐; 𝐺 𝑧, 𝑐 = 𝐻 𝑐 − 𝐻 𝑐 𝐺 𝑧, 𝑐

= 𝜠𝑥𝐺 𝑧,𝑐 𝜠𝑐′𝑃 𝑐|𝑥 log 𝑃 𝑐′ 𝑥 + 𝐻(𝑐)

= 𝜠𝑥𝐺 𝑧,𝑐 [𝐷𝐾𝐿(𝑃( |𝑥))||𝑄 𝑥 ) + 𝜠𝑐′𝑃 𝑐|𝑥 log 𝑄 𝑐′ 𝑥 ] + 𝐻(𝑐)≥ 0

≥ 𝜠𝑥𝐺 𝑧,𝑐 [𝜠𝑐′𝑃 𝑐|𝑥 log𝑄 𝑐′ 𝑥 ] + 𝐻(𝑐)Treat as a

constant

Page 18: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 19: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Remind the lemma: 𝛦𝑥𝑋,𝑦𝑌|𝑥 𝑓 𝑥, 𝑦 = 𝛦𝑥𝑋,𝑦𝑌|𝑥,𝑥′𝑋|𝑦 𝑓(𝑥′, 𝑦)

𝐿𝐼 𝐺, 𝑄 = 𝛦𝑐𝑃(𝑐),𝑥𝐺(𝑧;𝑐) log𝑄(𝑐|𝑥) + 𝐻(𝑐)

= 𝛦𝑥𝐺(𝑧;𝑐) 𝛦𝑐′𝑃(𝑐|𝑥)[log𝑄 𝑐′ 𝑥 ] + 𝐻(𝑐)

≤ 𝐼 𝑐; 𝐺 𝑧, 𝑐

Page 20: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

In particular, 𝐿𝐼 can be maximized w.r.t. 𝑄 directly and w.r.t. 𝐺 via the

reparametrization trick.

for discrete latent codes, the bound becomes tight and

the maximal mutual information is achieved.

Page 21: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 22: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 23: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 24: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 25: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 26: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Manipulating latent codes on MNIST

(a) Digit type (b) No clear meaning

(c) Rotation (d) Width

Page 27: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

(a) Pose (angle) (b) Elevation

(c) Lighting (d) Width

Manipulating latent codes on 3D Faces

Page 28: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Manipulating latent codes on 3D Chairs

(a) Rotation (b) Width

Page 29: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation

Manipulating latent codes on SVHN

(a) Continuous code (b) Discrete code

Page 30: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 31: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 32: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation
Page 33: by Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya … · 2018-03-21 · Supervised Learning Unsupervised Learning “to learn is to recognize ... InfoGAN: Interpretable Representation