YOU ARE DOWNLOADING DOCUMENT

Please tick the box to continue:

Transcript
Page 1: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

gIn

teg

rati

ng

Recog

nit

oin

an

d R

econ

str

ucti

on

Integrating Recognition and Reconstruction for Cognitive Scene Interpretation

Bastian Leibe, Nico Cornelis, Kurt Cornelis, Luc Van GoolComputer Vision LaboratoryETH Zurich

Sicily Workshop, Syracusa, 22.09.2006

VISICSKU Leuven

&

CVPR’06 Video Proceedings

DAGM’06

Page 2: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

2

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Motivation

• Urban traffic scene analysis from a moving vehicle Detect objects in the image Localize them in 3D Build up a metric scene model

• Applications e.g. in driver assistance systems

Page 3: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

3

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Challenges

Brightly-lit areasMotion blur

Lense flaring

+ Intra-category variability, multiple viewpoints, partial occlusion, ...

Dark shadows

Page 4: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

4

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Cognitive Loop with 3D Geometry

• Connect recognition and reconstruction• Reconstruction pathway delivers scene geometry

Greatly improves recognition performance

• Recognition detects objects that disturb reconstruction More accurate geometry estimate

Page 5: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

5

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Outline

• Hardware setup

• Reconstruction pathway Real-time Structure-from-Motion Real-time dense reconstruction

• Recognition pathway Local-feature based object detection Incorporation of scene geometry Temporal integration in world coordinate frame Feedback to reconstruction

• Results and Conclusion

Page 6: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

6

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Hardware Setup

• Stereo camera rig mounted on top of the vehicle• Calibrated w.r.t. wheel base points• Video streams captured at 25 fps, 360288

resolution

Page 7: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

7

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Real-Time Structure-from-Motion

• Basis: very fast feature matching Simple features Optimized for urban environment Only computed on green channel of a single camera

• Rest: standard SfM pipeline[Cornelis et al., CVPR’06]

Page 8: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

8

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

• Dense reconstruction on rectified images Ruled surface assumption to speed-up dense reconstruction Correlation measure: Sum of per-pixel SSDs along vertical

lines Line-sweep algorithm with ordering constraints (DP) Fast computation on GPU

• Errors introduced by pixels not belonging to facades!

Real-Time Dense Reconstruction

[Cornelis et al., CVPR’06]

Page 9: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

9

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

• Merge dense reconstructions using known camera poses.• “Voted polygon carving” on 2D projection

Real-Time Dense Reconstruction (2)

Page 10: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

10

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

• Merge dense reconstructions using known camera poses.• “Voted polygon carving” on 2D projection• Surfaces registered on world map using GPS

Real-Time Dense Reconstruction (2)

Page 11: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

11

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

• Run-times SfM + Bundle adjustment: 26-30 fps on CPU Dense reconstruction: 26 fps on GPU

Textured 3D Model

Original 3D Reconstruction

Page 12: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

12

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Information Flow into Recognition

• For each frame, 3D reconstruction delivers External camera calibration Ground plane estimate Used for improving recognition of the next frame.

Page 13: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

13

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Appearance-Based Car Detection

• Bank of 5 single-view ISM detectors• Each based on 3 local cues

Harris-Laplace, Hessian-Laplace, and DoG interest regions Local Shape Context descriptors

• Semi-profile detectors additionally mirrored• Not real-time yet…

[Leibe, Mikolajczyk, Schiele,06]

Page 14: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

14

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Implicit Shape Model - Representation

• Learn appearance codebook Extract patches at interest points Agglomerative clustering codebook

• Learn spatial distributions Match codebook to training images Record matching positions on object

training images(+reference segmentation)

Appearance codebook…

………

Spatial occurrence distributionsx

y

s

x

y

sx

y

s

x

y

s

Page 15: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

15

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Implicit Shape Model - Recognition

BackprojectedHypotheses

Interest PointsMatched Codebook Entries

Probabilistic Voting

3D Voting Space(continuous)

x

y

s

Backprojection

of Maxima[Leibe & Schiele,04]

Page 16: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

16

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Implicit Shape Model - Recognition

BackprojectedHypotheses

Interest PointsMatched Codebook Entries

Probabilistic Voting

Segmentation

3D Voting Space(continuous)

x

y

s

Backprojection

of Maxima

p(figure)Probabilities

[Leibe & Schiele,04]

Page 17: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

17

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

2D/3D Interactions

• Likelihood of 3D hypothesis H given image I and 2D detections h:

• 2D recognition score Expressed in terms of per-pixel p(figure) probabilities

recognitionscore (2D)

Page 18: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

18

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

2D/3D Interactions

• Likelihood of 3D hypothesis H given image I and 2D detections h:

• 3D prior Distance prior (uniform range) Size prior (Gaussian) Significantly reduced search space

3D prior

x

s

y Search corridor

Page 19: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

19

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

2D/3D Interactions

• Likelihood of 3D hypothesis H given image I and 2D detections h:

• 2D/3D transfer Two image-plane detections are consistent if they

correspond to the same 3D object

Multi-viewpoint integration Multi-camera integration

2D/3D transfer

Page 20: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

20

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Detections Using Ground Plane Constraints

left camera 1175 frames

Page 21: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

21

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Quantitative Results

• Detection performance on first 600 frames All cars annotated that were >50% visible Ground plane constraint significantly improves precision Performance: 0.2 fp/image at 50% recall

Page 22: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

22

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Temporal Integration

• Temporal integration in world coordinate frame Using external camera calibration from SfM. Each detection transfers to a 3D observation H. Find superset of 3D hypotheses . Estimate orientation using cluster shape & detected

viewpoints. Select set of 3D hypotheses that best explain the

observations.

Page 23: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

23

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Hypothesis Selection for 3D Detections

• Quadratic Boolean Optimization Problem (from MDL)

• Individual scores (diagonal terms)

• Interaction costs (off-diagonal terms)

temporaldecay

likelihood ofmembership to

hypothesis

penalty forphysicaloverlap

[Leonardis et al,95]

Page 24: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

24

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Result of Temporal Integration

Page 25: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

25

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Online 3D Car Location Estimates

Page 26: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

26

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

3D Estimates After Convergence

Page 27: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

27

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Feedback into 3D Reconstruction

• Feedback of detections & segmentation maps Used to discard features on cars for SfM Used to mask out cars in dense reconstruction More accurate 3D estimates in the next frame.

Page 28: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

28

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Another Application: 3D City Modeling

Original 3D Reconstruction

Enhancing your driving experience…

Page 29: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

29

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Conclusion

• System for traffic scene analysis integrating Structure-from-Motion Dense 3D Reconstruction Object detection and localization in 2D and 3D Temporal integration in world coordinate frame

• Cognitive Loop between 2D and 3D processing Reconstruction delivers camera calibration, ground plane 3D context tremendously improves recognition

performance Car detection, segmentation makes 3D estimation more

accurate

• System applied to challenging real-world task Real-time 3D reconstruction (26-30 fps) Accurate object detection & 3D pose estimation results

Page 30: Perceptual and Sensory Augmented Computing Integrating Recognitoin and Reconstruction Integrating Recognition and Reconstruction for Cognitive Scene Interpretation.

30

Perc

ep

tual an

d S

en

sory

Au

gm

en

ted

Com

pu

tin

g

B. Leibe, N. Cornelis, K. Cornelis, L. Van Gool

Inte

gra

tin

g R

ecog

nit

ion

an

d R

econ

str

ucti

on

Thank you very much for your attention!


Related Documents