Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky

Dec 22, 2015

Page 1

Accurate, Dense and Robust Multi-View Stereopsis

Yasutaka Furukawa and Jean Ponce

Presented by Rahul Garg and Ryan Kaminsky

Page 2

Agenda

• Problem Statement

• Multi-view Stereo Taxonomy

• Algorithm

• Results

• Comparison to other works

• Questions

Page 3

Problem Statement

• Multi-view Stereo

– Dense shape reconstruction from multiple views

[Figure: several input views (+ + +) = reconstructed 3D shape]

Page 4

Multi-View Stereo Taxonomy

• Scene Representation

• Photoconsistency Measure

• Visibility Model

• Shape Prior

• Reconstruction algorithm

• Initialization

Source: S. M. Seitz, B. Curless, J. Diebel, D. Scharstein, and R. Szeliski, "A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms," CVPR 2006

Page 5

Scene Representation

– Geometry on 3D grid
  • Voxels, level sets

– Polygon Mesh
  • Set of planar facets

– Depth Map
  • Image that stores depth per pixel

Page 6

Photoconsistency Measure

• Definition: Measures visual compatibility of reconstruction with input images

– Scene Space
  • Project part of reconstruction into images, measure closeness
  • Measures: variance, sum of squared differences, normalized cross-correlation

– Image Space
  • Use scene geometry to transform image to different view, measure error of predicted vs. actual (prediction error)
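A minimal sketch of two of the scene-space measures listed above, assuming each view contributes a short list of grayscale samples taken where the same piece of the reconstruction projects into that view. The function names and sample values are illustrative and do not come from the slides.

```python
from statistics import pvariance

def ssd(a, b):
    """Sum of squared differences between two equally sized sample lists."""
    if len(a) != len(b):
        raise ValueError("sample lists must have the same length")
    return sum((x - y) ** 2 for x, y in zip(a, b))

def color_variance(samples_per_view):
    """Mean per-sample variance across views; 0 means the views agree exactly."""
    columns = list(zip(*samples_per_view))  # one tuple per sample position
    return sum(pvariance(col) for col in columns) / len(columns)

# Hypothetical samples: views a and b agree closely, view c does not.
view_a = [10.0, 20.0, 30.0, 40.0]
view_b = [11.0, 19.0, 31.0, 39.0]
view_c = [40.0, 10.0, 35.0, 5.0]

print(ssd(view_a, view_b))               # low: photoconsistent pair
print(ssd(view_a, view_c))               # high: inconsistent pair
print(color_variance([view_a, view_b]))
```

Both scores are lower for more consistent views. Unlike normalized cross-correlation, neither is invariant to gain or offset changes between images.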

Page 7

Visibility Model

• Definition: Views to consider when evaluating photo consistency

– Geometric
  • Explicitly model geometry of the scene

– Quasi-Geometric
  • Approximate geometric reasoning

– Outlier based approaches
  • Treat occlusions as outliers

Page 8

Shape Prior

• Definition: Additional constraints or assumptions about reconstruction

– Minimal Surfaces
  • Level sets, Min-cut

– Maximal Surfaces
  • Voxel coloring, space carving

– Local Measures
  • Assume local smoothness on nearby pixels

Page 9

Reconstruction Algorithm

• Optimize cost function
  – Voxels, graph cut, level sets, meshes

• A set of consistent depth maps

• Feature extraction, matching, surface fitting

Page 10

Initialization

• Definition: Constraints on scene geometry

– Bounding box or volume

– Visual hull

– Range of disparity

Page 11

Overview of Algorithm

[Figure panels, left to right: input image; detected features; reconstructed patches after the initial matching; final patches after expansion and filtering; polygonal surface from reconstructed patches]

Page 12

Algorithm Block Diagram

[Block diagram labels: Initialization, Expansion, Filter, Feature Detection, Reconstruction, Patch Model]

Page 13

Init

• Detect features using the Harris corner detector and difference-of-Gaussians (DoG)

• Feature matching to generate sparse set of patches

Page 14

Patch Models

• R(p): Reference image, the image most closely associated with p

• S(p): Images where p should be visible

• T(p): Images where p is truly visible
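A minimal sketch of how a patch and its three image sets might be stored. The `Patch` class, its field names, and the rule that T(p) keeps the images of S(p) whose photometric score reaches a threshold `alpha` are assumptions made for illustration; the slide only names R(p), S(p) and T(p).

```python
from dataclasses import dataclass, field

@dataclass
class Patch:
    center: tuple                                   # c(p): 3D position
    normal: tuple                                   # n(p): unit normal
    reference: int                                  # R(p): index of the reference image
    should_see: set = field(default_factory=set)    # S(p): images where p should be visible
    truly_see: set = field(default_factory=set)     # T(p): images where p is truly visible

def update_truly_visible(patch, score_by_image, alpha):
    """Assumed rule: keep the images of S(p) whose score is at least alpha."""
    patch.truly_see = {
        i for i in patch.should_see
        if score_by_image.get(i, -1.0) >= alpha     # missing score counts as a failure
    }
    return patch.truly_see

# Hypothetical patch seen by images 0, 1 and 2; image 2 matches poorly.
p = Patch(center=(0.0, 0.0, 5.0), normal=(0.0, 0.0, -1.0), reference=0,
          should_see={0, 1, 2})
print(update_truly_visible(p, {0: 1.0, 1: 0.8, 2: 0.3}, alpha=0.7))
```

With this rule T(p) is always a subset of S(p), matching the distinction between "should be visible" and "truly visible".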

Page 15

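[Figure labels: β pixels; epipolar line]

c(p): Center of patch p, obtained by triangulation

n(p): Normal of patch p, initialized as the direction of the optical ray from c(p) to O (the optical center of the camera)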
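A minimal sketch of this initialization, assuming the two matched features are given as viewing rays (camera center plus direction) and using midpoint triangulation, which is only one of several triangulation methods. The helper names and camera values are hypothetical.

```python
import math

def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def scale(a, s):
    return tuple(x * s for x in a)

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def triangulate_midpoint(o1, d1, o2, d2):
    """Midpoint of the shortest segment between rays o1 + s*d1 and o2 + t*d2."""
    w = sub(o1, o2)
    a, b, c = dot(d1, d1), dot(d1, d2), dot(d2, d2)
    d, e = dot(d1, w), dot(d2, w)
    denom = a * c - b * b
    if abs(denom) < 1e-12:
        raise ValueError("rays are (nearly) parallel")
    s = (b * e - c * d) / denom
    t = (a * e - b * d) / denom
    p1 = add(o1, scale(d1, s))      # closest point on the first ray
    p2 = add(o2, scale(d2, t))      # closest point on the second ray
    return scale(add(p1, p2), 0.5)

def initial_normal(center, optical_center):
    """Unit vector pointing from c(p) toward the optical center O."""
    v = sub(optical_center, center)
    return scale(v, 1.0 / math.sqrt(dot(v, v)))

# Two hypothetical cameras whose rays meet at (1, 1, 5).
O1, O2 = (0.0, 0.0, 0.0), (2.0, 0.0, 0.0)
c = triangulate_midpoint(O1, (1.0, 1.0, 5.0), O2, (-1.0, 1.0, 5.0))
n = initial_normal(c, O1)
print(c)
print(n)
```

The normal obtained this way is only a starting guess that faces the reference camera.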

Page 16

Normalized Cross Correlation (NCC)

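\[
\gamma(u,v) = \frac{\sum_{x,y} \left[f(x,y) - \bar{f}_{u,v}\right]\left[t(x-u,\,y-v) - \bar{t}\right]}{\sqrt{\sum_{x,y} \left[f(x,y) - \bar{f}_{u,v}\right]^2 \, \sum_{x,y} \left[t(x-u,\,y-v) - \bar{t}\right]^2}}
\]

where \bar{t} is the mean of the feature and \bar{f}_{u,v} is the mean of f(x,y) in the region under the feature.

Optimization step: maximize the average NCC score over the patch center and normal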
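A minimal sketch of the NCC score for two already aligned, equally sized windows given as flat lists of intensities, plus the averaged score that the optimization step would maximize. Returning 0 for a constant window and the function names are assumptions for illustration.

```python
import math

def ncc(f, t):
    """Normalized cross-correlation of two equally sized sample lists, in [-1, 1]."""
    if len(f) != len(t) or not f:
        raise ValueError("windows must be non-empty and of equal size")
    mean_f = sum(f) / len(f)
    mean_t = sum(t) / len(t)
    num = sum((a - mean_f) * (b - mean_t) for a, b in zip(f, t))
    den = math.sqrt(sum((a - mean_f) ** 2 for a in f) *
                    sum((b - mean_t) ** 2 for b in t))
    if den == 0.0:
        return 0.0          # assumption: a constant window carries no signal
    return num / den

def average_ncc(reference, others):
    """Mean NCC between the reference window and each other window."""
    return sum(ncc(reference, o) for o in others) / len(others)

ref = [1.0, 2.0, 3.0, 4.0]
brighter = [2.0 * x + 5.0 for x in ref]     # gain and offset change only
flipped = list(reversed(ref))

print(ncc(ref, brighter))                   # close to +1
print(ncc(ref, flipped))                    # close to -1
print(average_ncc(ref, [brighter, flipped]))
```

Because the means are subtracted and the result is normalized, the score is unchanged by per-image gain and offset, which is why it suits images taken under different exposures.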

Page 17

Patch Expansion

• Expand patches along tangential planes into empty areas.

• Optimize for normal and center and add if photometric constraints are satisfied in at least k images.
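A minimal sketch of one expansion step under two assumptions that the slide does not spell out: a candidate center is found by intersecting a viewing ray through a neighboring empty image cell with the tangent plane of the existing patch, and "photometric constraints are satisfied" means a per-image score reaching a threshold. All names and numbers are hypothetical.

```python
def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def scale(a, s):
    return tuple(x * s for x in a)

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def intersect_ray_plane(origin, direction, plane_point, plane_normal):
    """Point where the ray meets the plane, or None if there is no hit in front."""
    denom = dot(direction, plane_normal)
    if abs(denom) < 1e-12:
        return None                         # ray is parallel to the plane
    s = dot(sub(plane_point, origin), plane_normal) / denom
    if s <= 0.0:
        return None                         # intersection lies behind the camera
    return add(origin, scale(direction, s))

def accept_patch(scores, threshold, k):
    """Keep the candidate if at least k images have a score >= threshold."""
    return sum(1 for s in scores if s >= threshold) >= k

# Existing patch at (0, 0, 5) with normal (0, 0, 1); ray through a nearby cell.
candidate = intersect_ray_plane((0.0, 0.0, 0.0), (0.1, 0.0, 1.0),
                                (0.0, 0.0, 5.0), (0.0, 0.0, 1.0))
print(candidate)
print(accept_patch([0.9, 0.8, 0.4, 0.75], threshold=0.7, k=3))
```

In a full pipeline the candidate's center and normal would be refined before the acceptance test; that refinement is omitted here.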

Page 18

Filtering

• Analyzing visibility consistency

Page 19

Filtering (Contd.)

• Local smoothness constraint: Remove a patch if the proportion of its neighboring patches whose tangent planes are “nearly” parallel to its own is less than ε
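A minimal sketch of this filter, assuming that "nearly parallel" means the angle between consistently oriented normals is below a chosen limit and that a patch with no neighbors is rejected. The angle limit, the ε value and the function names are illustrative choices, not values from the slides.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def nearly_parallel(n1, n2, max_angle_deg):
    """True if the angle between the two normals is at most max_angle_deg."""
    cos_angle = dot(n1, n2) / math.sqrt(dot(n1, n1) * dot(n2, n2))
    return cos_angle >= math.cos(math.radians(max_angle_deg))

def passes_smoothness(patch_normal, neighbor_normals, epsilon, max_angle_deg=15.0):
    """Keep the patch if the agreeing share of its neighbors is at least epsilon."""
    if not neighbor_normals:
        return False                        # assumption: isolated patches are removed
    agree = sum(1 for n in neighbor_normals
                if nearly_parallel(patch_normal, n, max_angle_deg))
    return agree / len(neighbor_normals) >= epsilon

# Two of the four hypothetical neighbors roughly agree with the patch normal.
neighbors = [(0.0, 0.0, 1.0), (0.0, 0.1, 1.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
print(passes_smoothness((0.0, 0.0, 1.0), neighbors, epsilon=0.25))
print(passes_smoothness((0.0, 0.0, 1.0), neighbors, epsilon=0.75))
```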

Page 20

Polygonal Surface Reconstruction

• Initialize using convex hull of patches

• Iteratively deform/snap to the patch model using two kinds of forces
  – Smoothness term
  – Photometric consistency term

S: Current surface

S*: True surface

n(v): Normal at v

Π(v): Set of patches compatible with v

d(v): Distance between S and S*
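A minimal sketch of one deformation step for a single vertex, under assumptions the slide does not state: the smoothness force pulls v toward the centroid of its mesh neighbors, d(v) is estimated as the unweighted mean signed offset along n(v) to the centers of the patches in Π(v), and the two weights are made-up values.

```python
def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def scale(a, s):
    return tuple(x * s for x in a)

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def centroid(points):
    total = (0.0, 0.0, 0.0)
    for p in points:
        total = add(total, p)
    return scale(total, 1.0 / len(points))

def deform_vertex(v, normal, neighbors, patch_centers, w_smooth=0.5, w_photo=0.5):
    """One update of v: smoothness pull plus a move along the unit normal n(v)."""
    smooth = sub(centroid(neighbors), v)
    if patch_centers:
        # d(v): mean signed distance from v to the compatible patches along n(v)
        d = sum(dot(normal, sub(c, v)) for c in patch_centers) / len(patch_centers)
    else:
        d = 0.0                             # no compatible patches: smoothness only
    photo = scale(normal, d)
    return add(v, add(scale(smooth, w_smooth), scale(photo, w_photo)))

# Flat neighborhood around v; compatible patches sit 2 units above the surface.
ring = [(1.0, 0.0, 0.0), (-1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, -1.0, 0.0)]
patches = [(0.0, 0.0, 2.0), (0.1, 0.0, 2.0)]
print(deform_vertex((0.0, 0.0, 0.0), (0.0, 0.0, 1.0), ring, patches))
```

Repeating the update for every vertex moves the mesh toward the patches while the smoothness pull keeps it regular.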

Page 21

Algorithm Taxonomy Categorization

• Scene Representation
  – Depth Map + Mesh

• Photoconsistency Measure
  – NCC

• Shape Prior
  – Assume local smoothness

• Reconstruction
  – Feature extraction, depth maps, optimization over patches

• Initialization
  – None

Page 22

Results

[Figure panels: Patch Model; Polygonal Surface Model]

Page 23

Results (Contd.)

Page 24

Results (Contd.)

• Evaluation on vision.middlebury.edu

Accuracy in mm; number of views in parentheses

Method | Temple Full (312) | Temple Ring (47) | Temple Sparse (16) | Dino Full (312) | Dino Ring (47) | Dino Sparse (16)
This paper | 0.54 | 0.55 | 0.62 | 0.32 | 0.33 | 0.42
Goesele et al. | 0.42 | 0.61* | 0.87* | 0.46 | 0.46* | 0.56*
Hernandez et al. | 0.36 | 0.52 | 0.75 | 0.49 | 0.45 | 0.60

Accuracy measure: the distance d (in mm) such that 90% of the reconstruction lies within d of the ground truth

* Old results
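A minimal sketch of the accuracy measure, assuming the distance from each reconstructed point to the ground-truth surface has already been computed. The function name and the sample distances are invented for illustration and are not benchmark data.

```python
def accuracy_at(distances, percent=90):
    """Smallest d such that at least `percent` % of the distances are <= d."""
    if not distances:
        raise ValueError("need at least one distance")
    ordered = sorted(distances)
    # Number of points that must lie within d (integer ceiling of percent * n / 100).
    k = max(1, (percent * len(ordered) + 99) // 100)
    return ordered[k - 1]

# Hypothetical point-to-ground-truth distances in mm, with one outlier.
distances_mm = [0.05, 0.10, 0.12, 0.20, 0.25, 0.30, 0.31, 0.40, 0.55, 2.00]
print(accuracy_at(distances_mm))        # 9 of the 10 points lie within this distance
```

Using a 90% threshold instead of the maximum keeps a few outliers, such as the 2.00 mm point here, from dominating the score.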

Page 25

Results (Contd.)

• Handles occlusions/obstacles

Page 26

Similar Approaches

• Setup similar to Goesele et al. (ICCV’07) – initialize patches, expand and optimize for position and normal

This Paper | Goesele et al.
Initialize patches using triangulated points | Initialize using Structure from Motion features
Explicit occlusion handling | Occlusion handling through outlier removal and view selection; prioritize patch candidates for expansion

Page 27

Questions

• Pose the problem as an optimization problem simultaneously accounting for local smoothness, photo consistency, occlusion

• Convergence of Expand/Filter – do more iterations lead to better reconstructions?

• Occlusion/Outlier handling – results on more datasets

• Advantages of patch model – Adaptive Resolution, generalizes to large number of object classes