Top Banner
COMP 776: Computer Vision COMP 776: Computer Vision
60

COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

May 25, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

COMP 776: Computer VisionCOMP 776: Computer Vision

Page 2: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

TodayToday

I t d ti t t i i• Introduction to computer vision• Course overview• Course requirements• Course requirements

Page 3: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

The goal of computer visionThe goal of computer vision

T t t “ i ” f i l• To extract “meaning” from pixels

What we see What a computer seesSource: S. Narasimhan

Page 4: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

The goal of computer visionThe goal of computer vision

T t t “ i ” f i l• To extract “meaning” from pixels

Humans are remarkably good at this…

Source: “80 million tiny images” by Torralba et al.

Humans are remarkably good at this…

Page 5: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

What kind of information can be extracted What kind of information can be extracted f ?f ?from an image?from an image?

M t i 3D i f ti• Metric 3D information• Semantic information

Page 6: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Vision as measurement deviceVision as measurement device

Real-time stereo Structure from motionReconstruction from

Internet photo collections

NASA Mars RoverNASA Mars Rover

Pollefeys et al. Goesele et al.

Page 7: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Vision as a source of semantic information

slide credit: Fei-Fei, Fergus & Torralba

Page 8: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Object categorization

sky

building

flag

wallbannerface

bus busstreet lamp

cars slide credit: Fei-Fei, Fergus & Torralba

Page 9: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Scene and context categorizationtd• outdoor

• city• traffic• …

slide credit: Fei-Fei, Fergus & Torralba

Page 10: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Qualitative spatial information

slantedslanted

non-rigid moving objectobject

rigid moving

vertical

rigid movingrigid moving object

horizontal slide credit: Fei-Fei, Fergus & Torralba

rigid moving object

Page 11: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Why study computer vision?Why study computer vision?• Vision is useful: Images and video are everywhere!Vision is useful: Images and video are everywhere!

Personal photo albums Movies, news, sports

Surveillance and security Medical and scientific images

Page 12: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Why study computer vision?Why study computer vision?

• Vision is useful• Vision is interesting• Vision is difficult

– Half of primate cerebral cortex is devoted to visual processing– Achieving human-level visual perception is probably “AI-complete”Achieving human level visual perception is probably AI complete

Page 13: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Why is computer vision difficult?Why is computer vision difficult?

Page 14: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Challenges: viewpoint variation

Michelangelo 1475-1564 slide credit: Fei-Fei, Fergus & Torralba

Page 15: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Challenges: illumination

image credit: J. Koenderink

Page 16: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Challenges: scale

slide credit: Fei-Fei, Fergus & Torralba

Page 17: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Challenges: deformation

Xu, Beihong 1943

slide credit: Fei-Fei, Fergus & Torralba

Page 18: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Challenges: occlusion

Magritte, 1957 slide credit: Fei-Fei, Fergus & Torralba

Page 19: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Challenges: background clutter

Page 20: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Challenges: Motion

Page 21: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Challenges: object intra-class variationvariation

slide credit: Fei-Fei, Fergus & Torralba

Page 22: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Challenges: local ambiguity

slide credit: Fei-Fei, Fergus & Torralba

Page 23: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Challenges: local ambiguity

Source: Rob Fergus and Antonio Torralba

Page 24: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Challenges: local ambiguity

Source: Rob Fergus and Antonio Torralba

Page 25: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Challenges or opportunities?Challenges or opportunities?

I f i b t th l l th t t f• Images are confusing, but they also reveal the structure of the world through numerous cues

• Our job is to interpret the cues!Our job is to interpret the cues!

Image source: J. Koenderink

Page 26: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Depth cues: Linear perspectiveDepth cues: Linear perspective

Page 27: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Depth cues: Aerial perspectiveDepth cues: Aerial perspective

Page 28: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Depth ordering cues: OcclusionDepth ordering cues: Occlusion

Source: J. Koenderink

Page 29: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Shape cues: Texture gradientShape cues: Texture gradient

Page 30: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Shape and lighting cues: ShadingShape and lighting cues: Shading

Source: J. Koenderink

Page 31: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Position and lighting cues: Cast shadowsPosition and lighting cues: Cast shadows

Source: J. Koenderink

Page 32: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Grouping cues: Similarity (color, texture,Grouping cues: Similarity (color, texture,proximity)proximity)p y)p y)

Page 33: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Grouping cues: “Common fate”Grouping cues: “Common fate”

Image credit: Arthus-Bertrand (via F. Durand)

Page 34: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Inherent ambiguity of the problemInherent ambiguity of the problem

M diff t 3D ld h i i t• Many different 3D scenes could have given rise to a particular 2D picture

Page 35: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Inherent ambiguity of the problemInherent ambiguity of the problem

M diff t 3D ld h i i t• Many different 3D scenes could have given rise to a particular 2D picture

• Possible solutions– Bring in more constraints (more images)

Use prior knowledge about the structure of the world– Use prior knowledge about the structure of the world

• Need a combination of geometric and statistical methods

Page 36: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Connections to other disciplinesConnections to other disciplines

Artificial Intelligence

Machine LearningRobotics

Computer Vision

Cognitive scienceNeuroscience

Computer Graphics

Image Processing

Neuroscience

Page 37: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Origins of computer visionOrigins of computer vision

L. G. Roberts, Machine Perception of Three Dimensional Solids,Ph.D. thesis, MIT Department of pElectrical Engineering, 1963.

Page 38: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Successes of computer vision to date

Page 39: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Optical character recognition (OCR)

Digit recognition License plate readersg gyann.lecun.com

License plate readershttp://en.wikipedia.org/wiki/Automatic_number_plate_recognition

Sudoku grabberhttp://sudokugrab.blogspot.com/

Source: S. Seitz, N. SnavelyAutomatic check processing

Page 40: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Biometrics

Fingerprint scanners on many new laptops

Face recognition systems now beginning to appear more widely

htt // ibl i i /many new laptops, other devices

http://www.sensiblevision.com/

Source: S. Seitz

Page 41: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Biometrics

How the Afghan Girl was Identified by Her Iris Patterns

Source: S. Seitz

Page 42: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Mobile visual search: Google Goggles

Page 43: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Face detection

Many new digital cameras now detect faces• Canon, Sony, Fuji, …

Source: S. Seitz

Page 44: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Smile detection

Sony Cyber-shot® T70 Digital Still Camera Source: S. Seitz

Page 45: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Face recognition: Apple iPhoto software

http://www.apple.com/ilife/iphoto/

Page 46: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Automotive safety

Mobileye: Vision systems in high-end BMW, GM, Volvo models • Pedestrian collision warning• Forward collision warning• Forward collision warning• Lane departure warning• Headway monitoring and warning Source: A. Shashua, S. Seitz

Page 47: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Vision-based interaction: Xbox Kinect

http://electronics.howstuffworks.com/microsoft-kinect.htmhttp://blogs.howstuffworks.com/2010/11/05/how-microsoft-

http://www.xbox.com/en-US/Live/EngineeringBlog/122910-HowYouBecometheController

kinect-works-an-amazing-use-of-infrared-light/

http://www.ismashphone.com/2010/12/kinect-hacks-more-interesting-than-the-devices-original-intention.html

Page 48: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Special effects: shape and motion capture

Source: S. Seitz

Page 49: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

3D visualization: Microsoft Photosynth

http://photosynth.net Source: S. Seitz

Page 50: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Vision for robotics, space exploration

NASA'S Mars Exploration Rover Spirit captured this westward view from atop

Vision systems (JPL) used for several tasks

a low plateau where Spirit spent the closing months of 2007.

y ( )• Panorama stitching• 3D terrain modeling

Obstacle detection position tracking• Obstacle detection, position tracking• For more, read “Computer Vision on Mars” by Matthies et al.

Source: S. Seitz

Page 51: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

The computer vision industryThe computer vision industry

A li t f i h• A list of companies here:

http://www.cs.ubc.ca/spider/lowe/vision.htmlp p

Page 52: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Basic InfoBasic Info• Instructor: Svetlana Lazebnik (lazebnik@cs unc edu)• Instructor: Svetlana Lazebnik ([email protected])• Office hours: By appointment, FB 244• Class webpage: http://www.cs.unc.edu/~lazebnik/spring11Class webpage: http://www.cs.unc.edu/ lazebnik/spring11

• Textbooks (suggested): F h & P C t Vi i A M d A hForsyth & Ponce, Computer Vision: A Modern ApproachRichard Szeliski, Computer Vision: Algorithms and Applications (available online)pp ( )

Page 53: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Course requirementsCourse requirements• Philosophy: computer vision is best experienced hands-onPhilosophy: computer vision is best experienced hands on

• Programming assignments: 50%– About four assignments– Expect the first one in the next couple of classes– Brush up on your MATLAB skills (see web page for tutorial)

• Final assignment: 30% – Recognition competitionRecognition competition– Winner gets a prize!

• Participation: 20%• Participation: 20% – Come to class regularly– Ask questions

A ti– Answer questions

Page 54: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Collaboration policyCollaboration policy

F l f t di i t ith h th b t di• Feel free to discuss assignments with each other, but coding must be done individually

• Feel free to incorporate code or tips you find on the Web, provided this doesn’t make the assignment trivial and you explicitly acknowledge your sourcesexplicitly acknowledge your sources

• Remember: I can Google too (and I have the copies of g ( peverybody’s assignments from the last three years this class was offered)

Page 55: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

Course overviewCourse overview

I E l i i I f ti d iI. Early vision: Image formation and processingII. Mid-level vision: Grouping and fittingIII Multi view geometryIII. Multi-view geometryIV. RecognitionV. Advanced topicsd a ced top cs

Page 56: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

I. Early visionI. Early vision

B i i f ti d i• Basic image formation and processing

* =

Cameras and sensorsLight and color

Linear filteringEdge detection

Light and color

Feature extraction: corner and blob detection

Page 57: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

II. “MidII. “Mid--level vision”level vision”

Fitti d i• Fitting and grouping

Alignment

Fitting: Least squaresHough transformHough transform

RANSAC

Page 58: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

III. MultiIII. Multi--view geometryview geometry

Stereo Epipolar geometry

Tomasi & Kanade (1993)

Projective structure from motionAffine structure from motion

Page 59: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

IV. RecognitionIV. Recognition

Patch description and matching Clustering and visual vocabularies

Bag-of-features models ClassificationBag of features models

Sources: D. Lowe, L. Fei-Fei

Page 60: COMP 776: Computer Vision - Computer Sciencelazebnik/spring11/lec01_intro.pdfWhy study computer vision? • Vision is useful • Vision is interesting • Vision is difficult – Half

V. Advanced TopicsV. Advanced TopicsTi itti• Time permitting…

Segmentation Face detection

Articulated models Motion and tracking