
Computer Vision, CS766 - pages.cs.wisc.edupages.cs.wisc.edu/~lizhang/courses/cs766-2008f/syllabus/09-02... · of a moving face from six video streams. ... •1 final project –35%,

Apr 06, 2018

Page 1:

Computer Vision, CS766

Staff

Instructor: Li Zhang

[email protected]

TA: Jake Rosin

[email protected]

Page 2:

Today

Introduction

Administrative Stuff

Overview of the Course

Page 3:

About Me

• Li Zhang (张力)

– Last name pronounced as Jung

– www.cs.wisc.edu/~lizhang

• Research

– Computer Vision

– Computer Graphics

• Teaching

– CS766 Computer Vision

– CS559 Computer Graphics

Page 4:

Previous Research Focus

• 3D shape reconstruction

Scene

Depth Map

3D Model

Page 5:

Previous Research Focus

• 3D shape reconstruction

Four examples of recovered 3D shapes

of a moving face from six video streams

Page 6:

Previous Research Focus

• 3D shape reconstruction

• Application

Entertainment:

Games & Movies

Medical Practice:

Prosthetics

Page 7:

Previous Research Focus

• 3D shape reconstruction

• Application

Biology: genotype phenotype

Page 8:

Please tell me about you

Who are you?

Why are you taking this class?

What do you want to learn?

Page 9:

Prerequisites

• Prerequisites—these are essential!

– Data structures

– A good working knowledge of C++/Java programming

• (or willingness/time to pick it up quickly!)

– Linear algebra

– Vector calculus

• Course does not assume prior imaging experience

– no image processing, graphics, etc.

Page 10:

Administrative Stuff

• 1 written assignment – 5% (this week)

• 3 programming projects – 15%, 2-3 weeks each

• Paper presentation – 15%, over a month

• 1 final project – 35%, 5 weeks, open-ended topic of your choice, but it requires:

– project proposal after 1 week

– progress report after 3 weeks

– final presentation after 5 weeks

Page 11:

Administrative Stuff

• Computer account:

– Everyone registered in this class will get a Computer Systems Lab account to do project assignments.

• Email list:

– [email protected]

Page 12:

Questions?

Page 13:

Every picture tells a story

The goal of computer vision is to write computer programs that can interpret images

Page 14:

Can computers match human perception?

• Yes and no (but mostly no!)

– computers can be better at “easy” things

Page 15:

Can computers match human perception?

• Yes and no (but mostly no!)

– computers can be better at “easy” things

– humans are much better at “hard” things

Page 16:

Computer Vision vs Human Vision

• Can do amazing things like:

• Recognize people and objects

• Navigate through obstacles

• Understand mood in the scene

• Imagine stories

• But still is not perfect:

• Suffers from Illusions

• Ignores many details

• Doesn’t care about accuracy of world

Srinivasa Narasimhan’s slide

Page 17:

Computer vision vs Human Vision

What we see What a computer sees

Srinivasa Narasimhan’s slide

Page 18:

Components of a computer vision system

Lighting

Scene

Camera

Computer

Scene Interpretation

Srinivasa Narasimhan’s slide

Page 19:

Topics Covered

Page 20:

Cameras and their optics

Today’s Digital Cameras

The Camera Obscura

Srinivasa Narasimhan’s slide

Page 21:

Biological vision

Human Eye Mosquito Eye

Srinivasa Narasimhan’s slide

Page 22:

A tiny camera

PHOTO: FRAUNHOFER INSTITUTE FOR

BIOMEDICAL ENGINEERING

Page 23:

A tiny camera

PHOTO: FRAUNHOFER INSTITUTE FOR

BIOMEDICAL ENGINEERING

Page 24:

Project 1: High Dynamic Range Imaging

• Cameras have limited dynamic range

Short Exposure Long Exposure

Desired Image

Shree Nayar’s slide

Page 25:

Project 1: High Dynamic Range Imaging

Low Dynamic Range Exposures

+

Combination Yields High Dynamic Range

Shree Nayar’s slide
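
The idea of combining several low dynamic range exposures can be sketched as follows. This is only an illustrative sketch, not the method required for Project 1: it assumes a linear sensor (pixel value = radiance × exposure time, clipped to [0, 1]), whereas real cameras generally have a nonlinear response that would need to be calibrated first. The names `hat_weight` and `merge_exposures`, and the thresholds 0.05 and 0.95, are hypothetical choices made for this example.

```python
def hat_weight(z, z_min=0.05, z_max=0.95):
    """Trust mid-range pixels most; give zero weight to values
    that are nearly black or nearly saturated (assumed thresholds)."""
    if z <= z_min or z >= z_max:
        return 0.0
    mid = 0.5 * (z_min + z_max)
    return (z - z_min) if z <= mid else (z_max - z)


def merge_exposures(exposures, times):
    """Estimate relative radiance per pixel from several exposures.

    exposures: list of images, each a flat list of pixel values in [0, 1]
    times:     exposure time in seconds for each image
    Assumes a linear sensor: pixel = radiance * time, clipped to [0, 1].
    """
    n_pixels = len(exposures[0])
    radiance = []
    for i in range(n_pixels):
        num = 0.0
        den = 0.0
        for img, t in zip(exposures, times):
            w = hat_weight(img[i])
            num += w * img[i] / t  # each exposure votes for radiance = pixel / time
            den += w
        if den > 0:
            radiance.append(num / den)
        else:
            # Every sample was clipped: fall back to the shortest exposure.
            k = times.index(min(times))
            radiance.append(exposures[k][i] / times[k])
    return radiance
```

Each pixel's radiance is a weighted average of the per-exposure estimates, so a short exposure contributes where the long one is saturated, and a long exposure contributes where the short one is too dark.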

Page 26:

Image Processing

Fourier Transform

Sampling, Convolution

Image enhancement

Feature detection

Srinivasa Narasimhan’s slide

Page 27:

Camera Projection

Page 28:

Image Transformation

Steve Seitz and Chuck Dyer, View Morphing, SIGGRAPH 1996

Page 29:

Project 2: Panoramic Imaging

Input images:

Output Image:

Steve Seitz’s slide

Page 30:

Projective Geometry

Page 31:

Single View Metrology

• http://research.microsoft.com/vision/cambridge/3d/default.htm

Page 32:

Single View Metrology

• http://research.microsoft.com/vision/cambridge/3d/default.htm

Page 33:

Shading and Photometric Stereo

http://www.eecs.harvard.edu/~zickler/helmholtz.html

Page 34:

Project 3: photometric stereo

Page 35:

Texture Modeling

repeated stochastic

“Semi-stochastic” structures

radishes rocks yogurt

Alexei Efros’ slide

Page 36:

Texture Synthesis

Input

Output

Image Quilting, Efros and Freeman, SIGGRAPH 2001.

Page 37:

Texture Synthesis

Graphcut Textures, Kwatra et al., SIGGRAPH 2003.

Input images:

Output Image:

Page 38:

Multi-view Geometry

• Binocular Stereo (2 classes)

• Multiview Stereo (1 class)

• Structure from Motion (2 classes)

http://phototour.cs.washington.edu/

Page 39:

Applications

• http://photosynth.net/default.aspx

Page 40:

Face Detection and Recognition

Page 41:

Motion Estimation

Crouching Tiger, Hidden Dragon

Page 42:

Motion Estimation

Application

Andy Serkis, Gollum, Lord of the Rings

Page 43:

Segmentation

http://www.eecs.berkeley.edu/Research/Projects/CS/vision/bsds/

Page 44:

Segmentation

Application

Medical Image Processing

Page 45:

Matting

Input Matting Composition

Page 46:

Light, Color, and Reflection

Page 47:

Capturing Light Field

Camera Arrays, Graphics Lab, Stanford University

Page 48:

Capturing Light Field

Applications: synthetic aperture imaging

bike-sap.mov

crowd0-parallax.mov

Camera Arrays, Graphics Lab, Stanford University

Page 49:

Structured Light and Range Scanning

http://graphics.stanford.edu/projects/mich/

Page 50:

Structured Light and Range Scanning

http://graphics.stanford.edu/projects/mich/

Page 51:

Structured Light and Range Scanning

http://graphics.stanford.edu/projects/mich/

Page 52:

Novel Cameras and Displays

http://www1.cs.columbia.edu/CAVE/projects/cc.htm

Page 53:

Assignment 0, Imagination

• Due next Tuesday

• Give FIVE interesting things that you may wish to do with images

– Better Image Capture

– Making use of images

– Design imaging systems

Page 54:

Course Info

http://www.cs.wisc.edu/~cs766-1/