Top Banner
Networks and Hierarchical Processing: Object Recognition in Human and Computer Vision Guest Lecture: Marius Cătălin Iordan CS 131 8 Computer Vision: Foundations and Applications 01 December 2014
43

Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

Aug 09, 2018

Download

Documents

trinhkien
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

Networks and Hierarchical Processing: Object Recognition in Human and Computer Vision

Guest&Lecture:&Marius&Cătălin&Iordan&&CS&131&8&Computer&Vision:&Foundations&and&Applications&

01&December&2014

Page 2: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:
Page 3: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

1. Processing Pathways in the Human Visual System• “what” and “where” pathways • building features in the ventral stream

2. Hierarchical Pattern Recognition Systems

• early stages: small scale neural network • injecting neuroscience knowledge into design

3. Third Age of the Neural Network: Modern Deep Nets

• extremely large scale ( data + computation ) • sudden, huge performance boost for recognition

2Roadmap 3

Page 4: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

Processing Pathways review 7

world

retina (compression)

LGN

visual cortex (expansion)

From Retina to Cortex

V1

Page 5: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Processing Pathways review 5

The Flow of Information

Weiner & Grill-Spector (2012)

ITIT

Page 6: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Processing Pathways review 6Van Essen (1991)Retina

V1

Page 7: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Processing Pathways “what” and “where” pathways 7

Specialization: “What” and “Where” Pathways

Mishkin & Ungerleider 1982

monkey lesion studies

Page 8: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Processing Pathways “what” and “where” pathways 7

Specialization: “What” and “Where” Pathways

Mishkin & Ungerleider 1982

monkey lesion studieslesion “where” pathway: difficulty in spatial reasoning lesion “what” pathway: difficulty in object recognition

Page 9: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Processing Pathways “what” and “where” pathways 7

Specialization: “What” and “Where” Pathways

Mishkin & Ungerleider 1982

monkey lesion studieslesion “where” pathway: difficulty in spatial reasoning lesion “what” pathway: difficulty in object recognition

Page 10: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Processing Pathways building features in the ventral stream 8

Object Recognition: The “What” Pathway

DiCarlo & Cox (2007)

Page 11: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

ITV1

3Processing Pathways building features in the ventral stream 9

Object Recognition: The “What” Pathway

Freeman & Simoncelli (2011), Tanaka (1997)

Page 12: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Processing Pathways key points 10

Object Recognition: Building Features and Invariance

DiCarlo & Cox (2007)

invariance is built gradually across many successive stepseach area performs a transformation on its inputs

visual processing is done in stages

Page 13: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation 11

2. Hierarchical Pattern Recognition Systems

neuroscience-inspired computer vision

Page 14: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation neocognitron 12

Neocognitron: Neural Network

Layer 1

Layer 2

Layer 3Cell

Page 15: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation neocognitron 13

Neocognitron: Neural Network

Fukushima (1988)

Page 16: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation neocognitron 13

Neocognitron: Neural Network

Fukushima (1988)

input layer

S-cell layer

C-cell layer

Page 17: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation neocognitron 14

Neocognitron: S-Cell

Fukushima (1988)

SUM THRESHOLD

C-cell layer / input layer C-cell layerS-cell layer

LEARNING

Page 18: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation neocognitron 15

Neocognitron: S-Cell

figure courtesy of A. Alahi

Page 19: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation neocognitron 16

Neocognitron: C-Cell “Pooling”

Fukushima (1988)

building position invariance

Page 20: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation neocognitron 17

Neocognitron: Network

Fukushima (1988)

input S-cell C-cell S-cell C-cell S-cell C-cell S-cell C-cell

Page 21: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation neocognitron 17

Neocognitron: Network

Fukushima (1988)

input S-cell C-cell S-cell C-cell S-cell C-cell S-cell C-cell

Page 22: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation neocognitron 17

Neocognitron: Network

Fukushima (1988)

input S-cell C-cell S-cell C-cell S-cell C-cell S-cell C-cell

Page 23: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation neocognitron 17

Neocognitron: Network

Fukushima (1988)

input S-cell C-cell S-cell C-cell S-cell C-cell S-cell C-cell

Page 24: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation neocognitron 18

Neocognitron: Robust Results

Fukushima (1988)

Page 25: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

input layer

S-cell layer

C-cell layer

3Hierarchical Computation neocognitron key points 19

Neocognitron

invariance is built gradually across many successive stepssimple neural network solves complicated non-linear problem

biologically inspired hierarchical processing pipeline

Fukushima (1988)

Page 26: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation feed-forward model 20

Feed-Forward Object Recognition Model

Hubel & Wiesel (1962), Riesenhuber & Poggio (1999), Serre et al. (2007)

Page 27: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation feed-forward model 20

Feed-Forward Object Recognition Model

Hubel & Wiesel (1962), Riesenhuber & Poggio (1999), Serre et al. (2007)

Page 28: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation feed-forward model 21

Feed-Forward Object Recognition Model

task: is there an animal in the picture?

Serre et al. (2007)

Page 29: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation feed-forward model 22

Feed-Forward Object Recognition Model

task: is there an animal in the picture?

Serre et al. (2007)

Page 30: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation feed-forward model 22

Feed-Forward Object Recognition Model

task: is there an animal in the picture?

Serre et al. (2007)

Page 31: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:
Page 32: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation feed-forward model 24

Feed-Forward Object Recognition Model

Serre et al. (2007)

task: is there an animal in the picture?

Page 33: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation feed-forward model key points 25

Feed-Forward Object Recognition Model

Serre et al. (2007)

biologically-inspired processing pipeline

patches — receptive fields building invariance

hierarchical processing

major drawback?no feedback

Page 34: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

Discussion

Page 35: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

Discussion

how closely should we aim to copy human vision?

is reverse-engineering human vision a self-imposed limitation?

perfect recognition vs. visual understanding?

Page 36: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Modern Neural Networks 27

3. Modern Neural Networks

extremely large scale data and computation

Page 37: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Modern Neural Networks deep CNN 28

Deep Convolutional Neural Network

Krizhevsky et al. (2013)

650,000 cells — 60,000,000 parameters

input convolution pooling fully-connectedconvolution pooling…

Page 38: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Modern Neural Networks deep CNN 28

Deep Convolutional Neural Network

Krizhevsky et al. (2013)

650,000 cells — 60,000,000 parameters

input convolution pooling fully-connectedconvolution pooling…

Page 39: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Modern Neural Networks CS 231N 29

Deep Convolutional Neural Network

Want to know more about state-of-the-art neural networks?

CS 231N http://vision.stanford.edu/cs231n/

Winter QT 2015 Fei-Fei Li & Andrej Karpathy

Page 40: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Processing Pathways key points 30

Object Recognition: Building Features and Invariance

DiCarlo & Cox (2007)

invariance is built gradually across many successive stepseach area performs a transformation on its inputs

visual processing is done in stages

Page 41: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Hierarchical Computation feed-forward model key points 31

Feed-Forward Object Recognition Model

Serre et al. (2007)

biologically-inspired processing pipeline

patches — receptive fields building invariance

hierarchical processing

major drawbacks?no feedback

much less complex than human vision

Page 42: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

3Modern Neural Networks deep CNN key points 32

Deep Convolutional Neural Network

Krizhevsky et al. (2013)

extremely large scale data and computation

sudden, huge performance boost for recognition

if you want to know more, take CS 231N in Winter QT

Page 43: Networks and Hierarchical Processing: Object Recognition in …vision.stanford.edu/teaching/cs131_fall1415/lectures/lec... · 2014-11-30 · Networks and Hierarchical Processing:

End-Quarter Feedback Forms