Top Banner
CS332 Visual Processing Department of Computer Science Wellesley College Binocular Stereo Vision Properties of human stereo processing Marr-Poggio-Grimson multi-resolution stereo algorithm 1-2 Properties of human stereo processing Use features for stereo matching whose position and disparity can be measured very precisely Stereoacuity is only a few seconds of visual angle difference in depth 0.01 cm at a viewing distance of 30 cm
8

Binocular Stereo Vision - Wellesley CScs.wellesley.edu/~cs332/ppt/stereo3.pdf · CS332 Visual Processing Department of Computer Science Wellesley College Binocular Stereo Vision Properties

Aug 29, 2018

Download

Documents

lamtuong
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Binocular Stereo Vision - Wellesley CScs.wellesley.edu/~cs332/ppt/stereo3.pdf · CS332 Visual Processing Department of Computer Science Wellesley College Binocular Stereo Vision Properties

CS332 Visual ProcessingDepartment of Computer ScienceWellesley College

Binocular Stereo Vision

Properties of human stereo processingMarr-Poggio-Grimson multi-resolution

stereo algorithm

1-2

Properties of human stereo processing

Use features for stereo matching whose position and disparity can be measured very precisely

Stereoacuity is only a few seconds of visual angledifference in depth ≈ 0.01 cm at a viewing distance of 30 cm

Page 2: Binocular Stereo Vision - Wellesley CScs.wellesley.edu/~cs332/ppt/stereo3.pdf · CS332 Visual Processing Department of Computer Science Wellesley College Binocular Stereo Vision Properties

1-3

Properties of human stereo processing

Matching features must appear similarin the left and right images

For example, we can’t fuse a left stereo image with a negative of the right image…

1-4

Properties of human stereo processing

Only “fuse” objects within a limited range of depth around the fixation distanceVergence eye movementsare needed to fuse objects over larger range of depths

Page 3: Binocular Stereo Vision - Wellesley CScs.wellesley.edu/~cs332/ppt/stereo3.pdf · CS332 Visual Processing Department of Computer Science Wellesley College Binocular Stereo Vision Properties

1-5

Properties of human stereo visionWe can only tolerate small amounts of vertical disparity at a single eye position

Vertical eye movements are needed to handle large vertical disparities

1-6

Properties of human stereo processing

In the early stages of visual processing, the image is analyzed at multiple spatial scales…

Stereo information at multiple scales can be processed independently

Page 4: Binocular Stereo Vision - Wellesley CScs.wellesley.edu/~cs332/ppt/stereo3.pdf · CS332 Visual Processing Department of Computer Science Wellesley College Binocular Stereo Vision Properties

1-7

Neural mechanisms for stereo processing

G. Poggio & colleagues:

complex cells in area V1 of primate visual cortex are selective for stereo disparity

neurons that are selective for a larger disparity range have larger receptive fieldszero disparity: at fixation distance

near: in front of point of fixation far: behind point of fixation

1-8

In summary, some key points…

• Image features used for matching:simple, precise locations, multiple scales, similar between left/right images

• At single fixation position, match features over a limited range of horizontal & vertical disparity

• Eye movements used to match features over larger range of disparity

• Neural mechanisms selective for particular ranges of stereo disparity

Page 5: Binocular Stereo Vision - Wellesley CScs.wellesley.edu/~cs332/ppt/stereo3.pdf · CS332 Visual Processing Department of Computer Science Wellesley College Binocular Stereo Vision Properties

1-9

Matching features for the MPG stereo algorithmzero-crossings of convolutions with ∇2G operators of different size

L

M

S

rough disparities over large range

accurate disparities over small range

1-10

large w left

large w right

small w left

small w right

correct match outside search range at small scale

Page 6: Binocular Stereo Vision - Wellesley CScs.wellesley.edu/~cs332/ppt/stereo3.pdf · CS332 Visual Processing Department of Computer Science Wellesley College Binocular Stereo Vision Properties

1-11

large w left

right

small w left

right

correct match now inside search range at small scale

vergence eye movements!

1-12

Stereo images (Tsukuba, CMU)

Page 7: Binocular Stereo Vision - Wellesley CScs.wellesley.edu/~cs332/ppt/stereo3.pdf · CS332 Visual Processing Department of Computer Science Wellesley College Binocular Stereo Vision Properties

1-13

Zero-crossings for stereo matching

- +

… …

1-14

Simplified MPG algorithm, Part 1

To determine initial correspondence:(1) Find zero-crossings using a ∇2G operator with central

positive width w(2) For each horizontal slice:

(2.1) Find the nearest neighbors in the right image for each zero-crossing fragment in the left image(2.2) Fine the nearest neighbors in the left image for each zero-crossing fragment in the right image(2.3) For each pair of zero-crossing fragments that are closest neighbors of one another, let the right fragment be separated by δinitial from the left. Determine whether δinitial is within the matching tolerance, m. If so, consider the zero-crossing fragments matched with disparity δinitial

m = w/2

Page 8: Binocular Stereo Vision - Wellesley CScs.wellesley.edu/~cs332/ppt/stereo3.pdf · CS332 Visual Processing Department of Computer Science Wellesley College Binocular Stereo Vision Properties

1-15

Simplified MPG algorithm, Part 2To determine final correspondence:(1) Find zero-crossings using a ∇2G operator with reduced width w/2(2) For each horizontal slice:

(2.1) For each zero-crossing in the left image:(2.1.1) Determine the nearest zero-crossing fragment in the left image that matched when the ∇2G operator width was w(2.1.2) Offset the zero-crossing fragment by a distance δinitial, the disparity of the nearest matching zero-crossing fragment found at the lower resolution with operator width w

(2.2) Find the nearest neighbors in the right image for each zero-crossing fragment in the left image(2.3) Fine the nearest neighbors in the left image for each zero-crossing fragment in the right image(2.4) For each pair of zero-crossing fragments that are closest neighbors of one another, let the right fragment be separated byδnew from the left. Determine whether δnew is within the reduced matching tolerance, m/2. If so, consider the zero-crossing fragments matched with disparity δfinal = δnew + δinitial

1-16

Coarse-scale zero-crossings:

Use coarse-scale disparities to guide fine-scale matching:

Ignore coarse-scale disparities:

w = 8m = 4

w = 4m = 2

w = 4m = 2