Top Banner
EECS 442 Computer Vision fall 2011 Instructor Silvio Savarese [email protected] Office: ECE Building, room: 4435 Office hour: Tues 4:30-5:30pm or under appoint. (after conversation hour) GSIs: Mohit Bagra ([email protected]) Murali Telaprolu ([email protected]) Class Time & Location Tu Th 3:00PM - 4:30PM -- G906 COOL Conversation hour [it‟s part of the course!] Wed 3:30PM - 4:30PM -- 1200 EECS http://www.eecs.umich.edu/vision/teaching/EECS442_2011/eecs442.html
68

EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Aug 25, 2018

Download

Documents

truongminh
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

EECS 442 – Computer Vision – fall 2011

• Instructor – Silvio Savarese

[email protected]

– Office: ECE Building, room: 4435

– Office hour: Tues 4:30-5:30pm or under appoint.

(after conversation hour)

• GSIs: – Mohit Bagra ([email protected])

– Murali Telaprolu ([email protected])

• Class Time & Location – Tu Th 3:00PM - 4:30PM -- G906 COOL

• Conversation hour [it‟s part of the course!]

– Wed 3:30PM - 4:30PM -- 1200 EECS

http://www.eecs.umich.edu/vision/teaching/EECS442_2011/eecs442.html

Page 2: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

• Text books:

– [FP] Computer Vision, A Modern

Approach, by D.A. Forsyth and J. Ponce,

Prentice Hall, 2003.

– [HZ] Multiple View Geometry in Computer

Vision, by R. Hartley and A. Zisserman,

Academic Press, 2002

– Also: Computer Vision: Algorithms and

Applications, R. Szeliski, Springer, 2011

If you plan to audit this class,

please signup your name on the mailing list

EECS 442 – Computer Vision – fall 2011

Page 3: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Agenda

• Administrative – Grading policy

– Project

• What is computer vision?

• Syllabus

Page 4: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Grading policy • Homeworks: 40%

– 5 homeworks

• Mid term exam: 10% [end of October]

• Course project: 45% – progress report 5%

– final report 30%

– presentation 10%

• Attendance and class participation: 5% – Questions, answers, remarks…

• Late policy home works: – If 1 day late, 50% off the grade for that homework

– Zero credits if more than one day.

• Late policy project: – If 1 day late, 25% off the grade for the project

– If 2 days late, 50% off the grade for the project

– Zero credits if more than 2 days

• Collaboration policy – Read the student code book, understand what is „collaboration‟ and what is „academic infraction‟.

– Discussing project assignment with each other is allowed, but coding must be done individually

– Home works or class project coding policy: using on line code or other students/researchers‟ code is not allowed in general. Exceptions can be made and individual cases will be discussed with the instructor.

Page 5: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Course Project

• Replicate an interesting paper

• Comparing different methods to a test bed

• A new approach to an existing problem

• Original research

• Write a 8-page paper summarizing your results

• Release the final code

• Give a presentation

• We will introduce projects in two weeks

• Important dates: look up class schedule

Page 6: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Course Project

• Form your team:

– 1-3 people

– the quality is judged regardless of the number of

people on the team

– be nice to your partner: do you plan to drop the

course?

• Evaluation – Quality of the project (including writing)

– Final ~15 minutes project presentation in class – students will

vote your presentation!

– For final code and paper due dates please consult webpage

Page 7: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Agenda

• Administrative – Grading policy

– Project

• What is computer vision?

• Syllabus

Page 8: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Illustration by Arthur Rackham

“The table was a large one, but

the three were all crowded

together at one corner of it …”

From “A Mad Tea-Party”

Alice's Adventures in Wonderland

by

Lewis Carroll

“There was a table set out under

a tree in front of the house,

and the March Hare and the

Hatter were having tea at it.”

Page 9: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

-semantic

Image/video

Computer vision

Object 1 Object N

- semantic

Page 10: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

-semantic

Image/video

Object 1 Object N

- semantic

-geometry -geometry

Computer vision

Page 11: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

-semantic

Image/video

Object 1 Object N

- semantic

-geometry -geometry

spatial & temporal relations

Computer vision

Page 12: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

-semantic

Image/video

Object 1 Object N

- semantic

-geometry -geometry

-Semantic

- geometry

Scene

spatial & temporal relations

Computer vision

Page 13: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Sensing device

•Extract

information

• Interpretation

Computer vision

• Scene

• Objects

• People

• Actions Information: visual cues, 3D structure, motion flows, etc…

Interpretation: recognize objects, scenes, actions, events

Computational

device

Computer vision studies the tools and theories that enable the design of

machines that can extract useful information from imagery data

(images and videos) toward the goal of interpreting the world

Page 14: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Physical attributes

Se

ma

nti

c

3D modeling

Categorization

Object Recognition

Have we reached

humans?

Actions, Events

3D scenes

… not yet – computer vision is still no match for human

perception

– but catching up, particularly in certain areas

Page 15: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Successful Applications Finger prints recognizer

Sources: L. Fei-Fei

Page 16: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Medical Imaging

Sources: L. Fei-Fei

Page 17: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Special effects movies - videogames

Sources: L. Fei-Fei

Page 18: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Consumer applications

Page 19: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view
Page 20: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Auto Stitch

Page 21: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Toy & Robots

Page 22: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Applications of computer vision

Driver assistance

(collision warning, lane departure

warning, rear object detection)

Factory inspection Surveillance

Autonomous driving,

robot navigation

Assistive technologies

Sources: K. Grauman, L. Fei-Fei, S. Laznebick

Security

Page 23: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Computer vision

•Automatic

control

•Robotics

•Signal processing

•Compression

Robot

vision

Non linear SP

Multi-variate SP

•Physics

• Imaging •Optics

•Smart

cameras

•Neurobiology

Biological vision

Visual Psychophysics

•Applied math

Statistics

Geometry

Optimization

•Machine learning

•Artificial intelligence

Visual pattern

recognition

Computer

graphics

Acquisition

methods

•Data

mining

Image

retrivial

Page 24: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

EECS 442 course overview

1. Geometry

2. Low & Mid-level vision

3. High level vision

Page 25: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

1. Geometry

2. Low & Mid-level vision

3. High level vision

Geometry: - How to extract 3d information?

- Which cues are useful?

- What are the mathematical tools?

EECS 442 course overview

Page 26: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Visual cues: texture shading contours shadows reflections

Page 27: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Visual cues: texture shading contours shadows reflections

Page 28: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Visual cues: texture shading contours shadows reflections

Page 29: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Vision techniques Visual cues: texture shading contours shadows reflections

Page 30: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Visual cues: texture shading contours shadows reflections

Page 31: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Number of observers: monocular multiple views

Vision techniques

camera

Visual cues: texture shading contours shadows reflections

Page 32: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Vision techniques

camera 1 camera 2 camera N

Number of observers: monocular multiple views

Vision techniques Visual cues: texture shading contours shadows reflections

Page 33: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Projective structure from motion:

Here be dragons!

Stereo

Structure from motion

Tomasi & Kanade (1993)

Epipolar geometry

Image sources: S. Laznebick

Page 34: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Structure from motion

Courtesy of Exford Visual Geometry Group

Page 35: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Active lighting: laser stripes structured lighting patterns

Vision techniques

camera Laser/projector/light

Number of observers: monocular multiple views

Visual cues: texture shading contours shadows reflections

Page 36: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

3D Scanning

Scanning Michelangelo‟s “The David” • The Digital Michelangelo Project

- http://graphics.stanford.edu/projects/mich/

• 2 BILLION polygons, accuracy to .29mm

Courtesy of Stanford computer graphics lab

Page 37: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

3D Digital models

Architecture

Entertainment

Medicine

Page 38: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

The „yellow line‟ in superbowls

Page 39: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

The „yellow line‟ in superbowls

Page 40: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Course overview

1. Geometry

2. Low & Mid-level vision

3. High level vision

Mid-level vision: - Extract useful building blocks

- Region segmentation

- Motion flows

Page 41: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Extract useful building blocks

Page 42: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Alignment Extract planar regions

Object segmentation

Mid-level vision

Extract useful building blocks

Page 43: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Image enhancement

Page 44: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Automatic Panorama Stitching

Sources: M. Brown

Page 45: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Automatic Panorama Stitching

Sources: M. Brown

Page 46: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Co

urt

esy o

f J

ea

n-Y

ve

s B

ou

gu

et

– V

isio

n L

ab

, C

ali

forn

ia In

sti

tute

of

Te

ch

no

log

y

Tracking and 3D modeling

Page 47: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Camera tracking and V.R.

insertions

Courtesy of Exford Visual Geometry Group

Page 48: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Course overview

1. Geometry

2. Low & Mid-level vision

3. High level vision

High level operations –Recognition of objects and people

–Places

–Actions & events

Page 49: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Building

clock

person car

Object recognition and categorization

Pedestrians crossing street

Downtown chicago

Page 50: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Challenges: viewpoint variation

Michelangelo 1475-1564 slide credit: Fei-Fei, Fergus & Torralba

Page 51: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Challenges: illumination

image credit: J. Koenderink

Page 52: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Challenges: scale

slide credit: Fei-Fei, Fergus & Torralba

Page 53: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Challenges: deformation

Page 54: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Challenges:

occlusion

Magritte, 1957 slide credit: Fei-Fei, Fergus & Torralba

Page 55: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Challenges: background clutter

Kilmeny Niland. 1995

Page 56: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Challenges: object intra-class variation

slide credit: Fei-Fei, Fergus & Torralba

Page 57: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view
Page 58: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Face recognition

Page 59: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Recognizing scenes

Page 60: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Personal photo albums

Surveillance and security

Movies, news, sports

Medical and scientific images

Credit slide: S. Lazebnik

Sorting out millions of images/videos

Page 61: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Detecting and tracking people C

hoi &

S

hah

id &

Sa

va

rese

WM

C 2

01

0

Ch

oi &

S

ava

rese

, E

CC

V 2

01

0

Page 62: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Recognizing human activities

Biking Walking Walking

Credit slide: Song & Perona

Page 63: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

“The table was a large one, but

the three were all crowded

together at one corner of it …”

From “A Mad Tea-Party”

Alice's Adventures in Wonderland

by

Lewis Carroll

“There was a table set out under

a tree in front of the house,

and the March Hare and the

Hatter were having tea at it.”

Page 65: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Next lecture

- Review of linear algebra for multi-view geometry

- Basic image transformations

Page 66: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Homework 0.1:

Who painted this?

Or… what is a smart way to search

an image on the web…?

Page 67: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view
Page 68: EECS 442 Computer Vision fall 2011 - University of … · EECS 442 – Computer Vision – fall 2011 •Instructor ... • Mid term exam: ... - Review of linear algebra for multi-view

Homework 0.1:

Who painted this?

Or… what is a smart way to search

an image on the web…?