9.523/6.861: Aspects of a Computational Theory of Intelligence Shimon Ullman + Tomaso Poggio Gemma Roig + Chia-Jung Chang

Source: cbmm.mit.edu/sites/default/files/documents/Invariance_tp_2016 cop…

Apr 12, 2020

Page 1

9.523/6.861: Aspects of a Computational Theory of Intelligence

Shimon Ullman + Tomaso Poggio

Gemma Roig + Chia-Jung Chang

Page 2

9.523/6.861: Aspects of a Computational Theory of Intelligence

Class 4, Part A

Page 3

Vision: what is where

• Human Brain
  – $10^{10}$-$10^{11}$ neurons (~1 million flies)
  – $10^{14}$-$10^{15}$ synapses

• Ventral stream in rhesus monkey
  – ~$10^{9}$ neurons in the ventral stream ($350 \times 10^{6}$ in each hemisphere)
  – ~$15 \times 10^{6}$ neurons in AIT (Anterior InferoTemporal) cortex

• ~200M in V1, ~200M in V2, 50M in V4

Van Essen & Anderson, 1990

Page 4

The ventral stream

[Figure: ventral stream. Desimone & Ungerleider 1989]

Source: Lennie, Maunsell, Movshon

Page 5

Convolutional networks such as HMAX

[software available online] Riesenhuber & Poggio 1999, 2000; Serre Kouh Cadieu Knoblich Kreiman & Poggio 2005; Serre Oliva Poggio 2007

HMAX is in the family of “Hubel-Wiesel” models such as

Hubel & Wiesel, 1959; Fukushima, 1980; Oram & Perrett, 1993; Wallis & Rolls, 1997; Riesenhuber & Poggio, 1999; Thorpe, 2002; Ullman et al., 2002; Mel, 1997; Wersing and Koerner, 2003; LeCun et al., 1998; Serre et al., 2007; Freeman and Simoncelli, 2011….

Page 6

Hierarchical feedforward models of the ventral stream do “work”

Page 7

Computational Model based on Deep Learning

Figure 1: HCNNs as models of sensory cortex. From: Using goal-driven deep learning models to understand sensory cortex. Daniel Yamins & James DiCarlo. Nature Neuroscience (2016)

Page 8

A key aspect of hierarchical models is invariance to viewpoint

Page 9

Theorem: invariance can significantly reduce sample complexity

Theorem (translation case): Consider a space of images of dimensions $d \times d$ pixels which may appear in any position within a window of size $rd \times rd$ pixels. The usual image representation yields a sample complexity (of a linear classifier) of order $m_{image} = O(r^2 d^2)$; the oracle representation (invariant) yields (because of much smaller covering numbers) a much better sample complexity of order

$$m_{oracle} = O(d^2) = \frac{m_{image}}{r^2}$$

Poggio, Rosasco
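As a rough numeric illustration of the theorem, the sketch below evaluates the two orders of growth with all hidden constants set to 1. That is an assumption made only for illustration, since the theorem states orders, not exact sample counts. The function name and the example values d = 28, r = 8 are hypothetical.

```python
def sample_complexity_orders(d: int, r: int) -> dict:
    """Orders of growth from the theorem, with constants set to 1 (assumption)."""
    # Image representation: a d x d object anywhere in an rd x rd window.
    m_image = (r * d) ** 2
    # Oracle (translation-invariant) representation: only the d x d object matters.
    m_oracle = d ** 2
    return {"m_image": m_image, "m_oracle": m_oracle, "ratio": m_image // m_oracle}


orders = sample_complexity_orders(d=28, r=8)
print(orders)  # the ratio equals r**2
```

Under these assumptions the gain from invariance grows with the square of the window-to-object size ratio r, independently of d.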

Page 10

Empirical demonstration: invariant representation leads to lower sample complexity for a supervised classifier

Page 11

Thus a new hypothesis

A main computational goal of the feedforward ventral stream hierarchy — and of vision — is to compute a representation for each incoming image which is invariant to transformations previously experienced in the visual environment (in general, transformations of other objects).

Page 12

Old neural data: IT neurons can be selective and invariant

Page 13

Background: recording sites in Anterior IT

Logothetis, Pauls & Poggio 1995

…neurons tuned to faces are intermingled nearby….

Page 14

Neurons tuned to object views, as predicted by model!

Logothetis Pauls & Poggio 1995

Page 15

A very selective “view-tuned” cell in IT

[Figure: firing rate (scale bars: 60 spikes/sec, 800 msec) for target views at rotations between -168° and 168°, and for distractors]

Logothetis, Pauls & Poggio 1995

Page 16

View-tuned cells: scale invariance (one training view only) !!!

Logothetis Pauls & Poggio 1995

Page 17

How neurons may compute an invariant signature

Page 18

Algorithm that learns in an unsupervised way to compute invariant representations

[Figure: histogram $P(\nu)$ of the values $\nu$]

$$\mu^k_n(I) = \frac{1}{|G|} \sum_{i=1}^{|G|} \sigma\left(I \cdot g_i t^k + n\Delta\right)$$

Page 19

Our basic machine: a HW module (dot products and histograms/moments for image seen through RF)

• The cumulative histogram (empirical cdf) can be computed as

$$\mu^k_n(I) = \frac{1}{|G|} \sum_{i=1}^{|G|} \sigma\left(\langle I, g_i t^k \rangle + n\Delta\right)$$

• This maps directly into a set of simple cells with threshold

• …and a complex cell indexed by n and k summing the simple cells

The nonlinearity can be arbitrary for invariance, if optimal selectivity is not required
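The HW module above can be sketched in a few lines. This is a minimal illustration, not the course software. It assumes 1-D signals, takes the group G to be the cyclic translations, and uses a hard threshold for the nonlinearity σ. The function name `hw_signature` and all parameter values are hypothetical. Integer-valued signals are used so that the dot products are exact.

```python
import numpy as np


def hw_signature(image, template, n_values, delta):
    """Empirical CDF of <image, g_i template> over all cyclic shifts g_i.

    Assumptions (not from the slides): 1-D signals, G = cyclic translations,
    sigma = step function (1 if its argument is >= 0, else 0).
    """
    n_shifts = len(template)
    # "Simple cells": one dot product per transformed template g_i t^k.
    dots = np.array([np.dot(image, np.roll(template, i)) for i in range(n_shifts)])
    # "Complex cell" for each n: average of the thresholded simple cells.
    return np.array([np.mean(dots + n * delta >= 0) for n in n_values])


rng = np.random.default_rng(0)
template = rng.integers(-5, 6, size=32)   # stored template t^k (random, illustrative)
image = rng.integers(-5, 6, size=32)      # new image I
n_values = range(-10, 11)

sig = hw_signature(image, template, n_values, delta=10.0)
sig_shifted = hw_signature(np.roll(image, 7), template, n_values, delta=10.0)
print(np.array_equal(sig, sig_shifted))   # True: shifting I only permutes the dot products
```

Shifting the image permutes the set of dot products without changing its values, so any pooling that ignores order (here the thresholded average) gives the same signature.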

Page 20

Invariant signature from a single image of a new object

Page 21

Invariant signature from a single image of a new object

Page 22

Remark: biological motivation for image representation

• Highly simplified neuroscience suggests that a natural functional for a neuron to compute is a high-dimensional dot product between an “image patch” and another image patch (called template) stored in terms of synaptic weights (synapses per neuron $\sim 10^2 - 10^5$)

• Projections via dot products are natural for neurons: here simple cells

Neuroscience definition of dot product: $\langle x, t \rangle$

Page 23

Our basic machine: a HW module (dot products and histograms/moments for image seen through RF)

• The cumulative histogram (empirical cdf) can be computed as

$$\mu^k_n(I) = \frac{1}{|G|} \sum_{i=1}^{|G|} \sigma\left(\langle I, g_i t^k \rangle + n\Delta\right)$$

• This maps directly into a set of simple cells with threshold

• …and a complex cell indexed by n and k summing the simple cells

Page 24

Pooling

Page 25

Dendrites of a complex cell as simple cells…

Active properties in the dendrites of the complex cell

Page 26

Invariance explains a puzzle

• what is visual cortex computing?

• function and circuits of simple-complex cells

• how does the face network work?

• what is the computational reason for the eccentricity-dependent size of RFs in V1, V2, V4?

Poggio, Anselmi, Rosasco, Tacchetti, Leibo, Liao

Page 27

RF size depends on eccentricity in a special way

Page 28

Note: we focus on the sampling layout of the retinal ganglion cells (RGCs) - the outputs of the retina.

Page 29

The ventral stream

[Figure: ventral stream. Desimone & Ungerleider 1989]

Source: Lennie, Maunsell, Movshon

Page 30

Retinal sampling is nonuniform

• Thumbnail at arm’s length = 1 degree

• By +/- 1 degree, resolution has dropped by ½

• Most common explanation:

  – Full resolution everywhere would require an optic nerve the thickness of your neck, and visual cortex the size of a small car (*)

  – Solution: a small patch of high resolution that you can move around

• However: the particular sampling strategy the retina has chosen suggests there is more to the story

(*) calculations are approximate

Page 31

An application of i-theory: translation and scale invariance implies a specific model of eccentricity-dependent RFs in cortex

Hubel and Wiesel, 1971

Page 32

Scatter of receptive field sizes in V1

Schiller, P., Finlay, B., Volman, S. Quantitative Studies of Single-Cell Properties in Monkey Striate Cortex, 1976

Page 33

Explaining the puzzle

Page 34

Computational reason for eccentricity dependence of RFs size

[Figure: histogram $P(\nu)$ of the values $\nu$]

$$\mu^k_n(I) = \frac{1}{|G|} \sum_{i=1}^{|G|} \sigma\left(I \cdot g_i t^k + n\Delta\right)$$

Page 35

Recipe to compute invariant representation:

• memorize a set of images/objects called templates and for each template memorize observed transformations as images

• to generate an invariant signature
  - compute dot products of transformations with image
  - pool, e.g. compute histogram of the resulting values
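The recipe can be sketched with stored frames and histogram pooling. This is an illustrative sketch under stated assumptions, not the authors' implementation: the observed transformations are taken to be all cyclic 2-D shifts, images are small integer arrays so that dot products are exact, and the names `memorize_transformations` and `invariant_signature`, the number of templates, and the bin edges are all hypothetical.

```python
import numpy as np


def memorize_transformations(template):
    """Step 1: store every cyclic 2-D shift of a template as a flattened frame."""
    h, w = template.shape
    return np.stack([np.roll(template, (dy, dx), axis=(0, 1)).ravel()
                     for dy in range(h) for dx in range(w)])


def invariant_signature(image, memory, bins):
    """Step 2: dot products with all stored frames, pooled per template."""
    parts = []
    for frames in memory:                     # one stack of frames per template
        dots = frames @ image.ravel()         # dot products with the new image
        hist, _ = np.histogram(dots, bins=bins)
        parts.append(hist)                    # pooling: histogram of the values
    return np.concatenate(parts)


rng = np.random.default_rng(0)
templates = [rng.integers(-3, 4, size=(8, 8)) for _ in range(3)]
memory = [memorize_transformations(t) for t in templates]
bins = np.arange(-600, 601, 50)               # covers all possible dot products here

image = rng.integers(-3, 4, size=(8, 8))      # single view of a new object
shifted = np.roll(image, (2, 5), axis=(0, 1)) # the same object, translated
sig_a = invariant_signature(image, memory, bins)
sig_b = invariant_signature(shifted, memory, bins)
print(np.array_equal(sig_a, sig_b))           # True for shifts in the stored set
```

Note that the new object itself is seen only once. The invariance comes from the transformations memorized for the templates, which is the point of the recipe.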

Page 36

Geometry of scaling

Page 37

Sampling in the window

Page 38

Magic window in V1

5 degree! total 40x40 units

25’ !!! total 40x40 units

Page 39

Qualitative predictions

Page 40

Qualitative predictions

• Very small foveola ~25’

• In the center of fovea “full” scale invariance, little position invariance

• Position invariance proportional to spatial frequency

• Anstis

• Bouma’s law for peripheral crowding: $d = b\,x$ (role of V2; $b = 0.5$)

• Prediction: crowding in the fovea at less than d = 2’40”

Page 41

Anstis, 1974

“Prediction” of Anstis observation

Page 42

Computational model

Page 43

Eccentricity dependent model for quant predictions

V = 2 * arctan (S/(2D))

D = 50.39 cm

5 degrees - 4.4 cm - 224 px

0.63 degrees - 0.55 cm - 28 px

[Figure: templates at four resolutions (smaller, 2nd smaller, 2nd larger, larger), plotted as scale vs. eccentricity (deg), starting at 0]
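The visual-angle formula on this slide can be checked against the listed sizes. The short sketch below assumes V is the visual angle, S the stimulus size on the screen, and D the viewing distance, both in centimetres. The function name `visual_angle_deg` is hypothetical.

```python
import math


def visual_angle_deg(size_cm: float, distance_cm: float) -> float:
    """V = 2 * arctan(S / (2 D)), returned in degrees."""
    return math.degrees(2.0 * math.atan(size_cm / (2.0 * distance_cm)))


D = 50.39  # viewing distance in cm, as given on the slide
for size_cm, px in [(4.4, 224), (0.55, 28)]:
    print(f"{size_cm} cm ({px} px) -> {visual_angle_deg(size_cm, D):.2f} deg")
```

With D = 50.39 cm, 4.4 cm comes out at about 5 degrees and 0.55 cm at about 0.63 degrees, matching the slide. Both rows also share the same pixel density, since 224 / 4.4 = 28 / 0.55.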

Page 44

[Figure: original image and what the model “sees” (sampling photoreceptors), from smallest res to largest res]

Gemma Roig

Page 45

Smallest scale

largest scale

original image

what the model “sees” (sampling photoreceptors)

Gemma Roig

Page 46

1st layer - model with 4 scales

V = 2 * arctan (S/(2D))

D = 50.39 cm

5 degrees - 4.4 cm - 224 px

0.63 degrees - 0.55 cm - 28 px

[Figure: input image; input crops at 4 scales; what the model sees; each crop is convolved with the template at the matching scale (smaller, 2nd smaller, 2nd larger, larger)]
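The "input crops at 4 scales" step might look like the sketch below. The slide gives only the smallest (28 px) and largest (224 px) crop and the number of scales. The intermediate sizes 56 and 112, and the choice to resample every crop to 28 x 28 so that wider crops are seen at coarser resolution, are assumptions made here for illustration. All function names are hypothetical.

```python
import numpy as np


def center_crop(image, size):
    """Square crop of side `size` around the image center (the fixation point)."""
    h, w = image.shape
    top, left = (h - size) // 2, (w - size) // 2
    return image[top:top + size, left:left + size]


def downsample(image, factor):
    """Average-pool by an integer factor (mean over factor x factor blocks)."""
    h, w = image.shape
    return image.reshape(h // factor, factor, w // factor, factor).mean(axis=(1, 3))


def multiscale_crops(image, sizes=(28, 56, 112, 224), out_size=28):
    """Crops of growing extent, each resampled to out_size x out_size (assumption)."""
    return [downsample(center_crop(image, s), s // out_size) for s in sizes]


image = np.arange(224 * 224, dtype=float).reshape(224, 224)  # stand-in input image
crops = multiscale_crops(image)
print([c.shape for c in crops])
```

Under these assumptions the center of the image is sampled at full resolution and the periphery only coarsely, which is one simple way to mimic eccentricity-dependent sampling with a fixed number of units per scale.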

Page 47

Psychophysical experiments

Page 48

Experimental question: is the window of visibility the same as the window of invariance to scale and shift for novel, unfamiliar objects, never seen before (as predicted by i-theory)?

[Figure: window of visibility and window of invariance, each plotted as scale vs. eccentricity (from 0 deg)]

Gemma Roig, Yena Han

Page 49

Question: is the window of visibility…

Notice: published data (refs…) are inconclusive and inconsistent

Gemma Roig, Yena Han

Page 50

Examples

Page 51

Example


Page 53

Experiments: Phase 1

check parameters of the visual window

psychophysics experiments with very familiar letters:

recognition of letters at different eccentricities and sizes

we have seen letters in all positions: no need for training

sanity check

Gemma Roig, Yena Han

Page 54

Phase 1 experiments

recognize familiar letters of different sizes at different eccentricities

[Figure: the letter “A” shown at several sizes and eccentricities; visual window plotted as scale vs. eccentricity (from 0 deg)]

Gemma Roig, Yena Han

Page 55

Experiments: Phase 2

check position invariance with unfamiliar characters (Chinese letters)

psychophysics experiments with Chinese letters:

training phase: learn a few new letters at one eccentricity

testing phase: is the letter recognizable at other eccentricities? (same/different task)

is the visual window the same as the window of invariance?

Gemma Roig, Yena Han

Page 56

Phase 2 experiments

learn a novel character (never experienced before) at one eccentricity and scale, test recognition of the character at other eccentricities and/or scales

[Figure: train (show once) and test conditions, each plotted as scale vs. ecc. (deg.) from 0, alongside the visual window (scale vs. eccentricity from 0 deg)]

Page 57

Psychophysics Experiment

• Stimuli: Korean Letters. Should be unfamiliar to subjects.

• Same/Different Discrimination Task

• Scale Invariance: Present target letter and test either the target or a distractor letter at the center. The letters vary in size.

• Position Invariance: Present target letter at one eccentricity and test either the target or a distractor letter at another eccentricity

• Presentation time 33 ms

• Letter size 1 deg

[Figure: example letter pairs, 모 vs. 보 and 모 vs. 모]

Gemma Roig, Yena Han

Page 58

Position Invariance

Scale Invariance

Gemma Roig, Yena Han

Page 59

Next Question (experiments and simulations)

Which kind of pooling? In V1 and V2 and V4?

Page 60

Next Question (experiments and simulations)

Crowding predictions depending on pooling

Page 61

Psychophysics Experiment

Volunteers?

Page 62

http://cbmm.mit.edu/eit/join-us

Engineering Intelligence Teams

Sound engineering practices for cutting-edge Machine Learning research

Psychophysics

Deep Learning

Web Interfaces