Top Banner
Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015
37

Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Dec 25, 2015

Download

Documents

Walter Gaines
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Survey of Object Classificationin 3D Range ScansALLAN ZELENER

THE GRADUATE CENTER, CUNY

JANUARY 8 T H 2015

Page 2: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Overview1. Introduction

◦ Problem definition: Object Recognition, Object Classification, and Semantic Segmentation◦ Problem domains: LiDAR scanners for outdoor scenes and RGB-D sensors for indoor scenes

2. Urban object classification◦ Case study: Vehicle object detection and classification

3. Indoor object classification◦ Cluttered scenes with large variety of objects

4. Related Works

5. Comparison and Conclusions◦ Criteria for evaluation: Classification accuracy, range of classes, use of data◦ Context through structured prediction and learned 3D feature representations

Page 3: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Object Recognition

Lai and Fox (IJRR 08)

Matches (Decreasing score order)Query Model

Scene

Mian, Bennamoun, Owens (IJCV 09)

Page 4: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Object Classificationo Segmentation or sliding template used to find candidate regions for classification

o Feature based classification may be invariant to pose and intra-class variation

o More compressed representation than entire database of object models

o Detection and recognition may still work better in practice for controlled applications

Golovinskiy, Kim, and Funkhouser (ICCV 2009)

Page 5: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Semantic Segmentationo Every point in the scene is labeled, including both objects of interest and background

o Typically a joint optimization of segmentation and classification

o Formally utilizes context in MRF/CRF model, where by context we mean nearby regions

Wu, Lenz, and Saxena (RSS 2014)

Page 6: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

LiDAR Scans for Outdoor/Urban Sceneso Long range sensors for outdoor scenes

o Fast scans at low resolution or slow scans at high resolution, depending on number of individual sensors

o Moving sensors and registration from multiple scans result in unstructured point cloud data with no adjacency grid

o RGB imagery tends to be low quality, challenging to align, or simply unavailable

Page 7: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

RGB-D Images for Indoor Sceneso Short range sensors for indoor scenes

o Real-time 30 FPS depth maps based on structured light or time of flight in infrared

o Integrated RGB camera is better aligned and provides better quality under indoor conditions than LiDAR systems

o RGB-D image grid makes it well suited for traditional 2D computer vision techniques on image frames from a single view

Page 8: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Patterson et al.• Object Detection from Large-Scale 3D datasets Using Bottom-up and Top-down Descriptors. Patterson, Mordohai, and Daniilidis. (ECCV 2008)

Spin Image

Extended Gaussian Images

Page 9: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Patterson et al.1. Compute normals for all points and spin images for a subset of sampled points

2. Classify spin image features as either positive (object) or negative (background) points using nearest neighbor classifier.

3. Greedy region growing of positively classified points gives object hypothesis

4. Compute EGI and constellation EGI for object hypothesis and compute alignment and similarity with database model objects.• Rotation hypothesis based on angles subtended by pairs of points• Translation based on maximum frequency of Fourier transform of best rotation hypothesis• Similarity based on fraction of inliers defined as query points that are nearby model points with small

cosine similarity between normals after alignment

5. If similarity is above a threshold then the object is positively detected and points that overlap with the database model after alignment are labeled to obtain segmentation.

Page 10: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Patterson et al. o Precision 0.92 and Recall 0.74 for chosen inlier threshold parameter

o Computation and comparison of EGIs is slow due to alignment

o Cost of object detection grows linearly in the size of the database

Recall

Precision

Page 11: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Huber et al.• Parts-based 3D Object Classification. Huber, Kapuria, Donamukkala, and Hebert. (CVPR 2004)

Page 12: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Huber et al.o Vehicles are segmented into front/middle/back parts and part classes are generated as follows:

oFor each part , the distance between spin image features in and is computed to produce where the event denotes a nearest neighbor match from a feature of part to a feature in part .

oA symmetric similarity matrix is computed as the average of the matching probabilities between all pairs of parts. Part classes are determined by agglomerative clustering and the features for each part class are clustered by k-means to produce a class representation.

Page 13: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Huber et al.o Relationship between object class and part class is determined by Bayes’ theorem,

o is determined empirically from the training data and is assumed uniform

o Object class is determined by maximizing likelihood over all parts

o Here is determined by matching features between the query part and the set of part classes as described during the part class generation stage.

Page 14: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Huber et al.o Excellent accuracy on simulated scans but lacks

experiments for real data.o Consistent part segmentation requires recovery of

vehicle pose.o Improvement over classifier without using parts

Solid Line: Parts-basedDashed Line: Object-based

Page 15: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Golovinskiy et al.• Shape-based Recognition of 3D Point Clouds. Golovinskiy, Kim, and Funkhouser. (ICCV 2009)

Page 16: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Golovinskiy et al.o Localization and segmentation are based on K-NN graph weighted by point distances

o Localization performed by agglomerative clustering

o Segmentation performed by min-cut using virtual background vertex and background radius parameter.

o Contextual features use geolocation alignment with street map and occupancy grid of neighboring objects.

o Relatively poor classification performance, perhaps due to a lack of local features

Page 17: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Stamos et al.• Online Algorithms for Classification of Urban Objects in 3D Point Clouds. Stamos, Hadjiliadis, Zhang, and Flynn. (3DIMPVT 2012)

o Online classification of scan lines using HMMs and CUSUM hypothesis testing

o is the likelihood of observation under the null hypothesis HMM

o Change detected at large value of

Page 18: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Stamos et al.o Simple features between pointso Signed angles: o Line angles: Consistent for collinear points

o Sequence of online classifications performed to refine from coarse to fine classes

o Each additional classifier incorporates more prior knowledge about the target class. E.g., cars should be on the street at a certain height

Page 19: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Xiong et al.• 3D Scene Analysis via Sequenced Predictions over Points and Regions. Xiong, Munoz, Bagnell, Hebert. (ICRA 2011)

Context accumulated from neighbor segments

Context from segment sent down to individual points

Context from points averaged and sent up to segment

Page 20: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Xiong et al.o Multi-Round Stacking generates contextual features by using a sequence of weak classifiers to predict class labels of neighbors

o Two-level hierarchy of regions: segments and points. MRS is run on one level of the hierarchy and then the results are passed on to the other level.

o Sensitive to quality of labeling in training, particularly if there is a “misc” class

Contextual features for tree-trunk class

Page 21: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Silberman and Fergus• Indoor Scene Segmentation Using a Structured Light Sensor. Silberman and Fergus. (ICCV 2011)

Page 22: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Silberman and Ferguso Conditional Random Field

o - Color/depth features and location prior

o if , otherwise

o - Spatial transition based on gradient

o Location prior improves performance for classes in consistent configurations with respect to camera but decreases otherwiseo E.g., bookshelves in office vs library

3DLocation

Priors

Page 23: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Couprie et al.• Indoor Semantic Segmentation Using Depth Information. Couprie, Farabet, Najman, LeCun. (ICLR 2013)

Page 24: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Couprie et al.o Simple application of CNN framework improves accuracy on classes at consistent depths such as walls and floors but performance for objects of interest degrades

o Depth gradients alone are not informative and depth information must be normalized or interpreted to be invariant to variations

Page 25: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Anand et al.• Contextually Guided Semantic Labeling and Search for 3D Point Clouds. Anand, Koppula, Joachims, and Saxena. (IJRR 2012)

Page 26: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Anand et al.o MRF trained by structured SVM.

o - Unary features

o - Pairwise features, may be associative or non-associative depending on o Associative – Feature between neighboring segments of same class, only has self loopso Object non-associative –Features between related class labels of neighboring segments

Page 27: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Anand et al.

(𝑟 𝑖−𝑟 𝑗 )𝑇 �̂�𝑖≥0

(𝑟 𝑗−𝑟 𝑖 )𝑇 �̂� 𝑗≥0

Page 28: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Anand et al.o Object part categories better exploit relationships than object categories alone

o Registered 3D scenes provide more coverage and context than single view scenes

o Common errors include objects that lie on top of other objects, e.g. a book on a table.o Either the result of poor segmentation or smoothing effect from pairwise potentials

Page 29: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Related Works• Unsupervised Feature Learning for RGB-D Based Object Detection. Bo, Ren, and Fox (ISER 2012)

Page 30: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Related Works• Convolutional-Recursive Deep Learning for 3D Object Classification.Socher, Huval, Bhat, Manning and Ng. (NIPS 2012)

Page 31: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Related Works Kahler and Reid. (ICCV 2013) Müller and Behnke. (ICRA 2014)

Page 32: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Related Works Sliding Shapes for 3D Object Detection in Depth Images. Song and Xiao. (ECCV 2014)

Page 33: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Related Works• Instance Segmentation of Indoor Scenes Using a Coverage Loss. Silberman, Sontag, and Fergus. (ECCV 2014)

Input PerfectSemantic Segmentation

Naïve Region Growing CorrectInstance Segmentation

Page 34: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Related Works• Hierarchical Semantic Labeling for Task-Relevant RGB-D Perception. Wu, Lenz, and Saxena. (RSS 2014)

NO-CT: Non-overlapping constraintsHR-CT: Hierarchical relation constraints

Page 35: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Related Works• Classification of Vehicle Parts in Unstructured 3D Point Clouds. Zelener, Mordohai, and Stamos. (3DV 2014)

• Unsupervised segmentation of parts by RANSAC plane fitting

• Structured prediction over parts and object class by HMM and structured perceptron

• Does not require pose estimation, experiments performed using real data

p1

p2

pn⋯

x1 x2 xn

cx1 x2 … xn

Page 36: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Comparisono Fine tuned object recognition methods still appear to work best for specific taskso E.g. for car detection in urban sceneso Indoor scenes have many potential objects of interest, difficult to scale number of classes

oObject classification requires 3D shape features that are discriminativeo Simple accumulators like the spin image are still competitive choices for featureso Learned representations may do better, but how to construct them is a challengeo Differences in representations between point clouds and RGB-D imageso Errors in segmentation may propagate to classification

o Semantic segmentation jointly optimizes segmentation and classificationo Structured prediction provides useful context-based relationships, but can lead to false assumptionso Context relationships are also often fixed and manually engineered

Page 37: Survey of Object Classification in 3D Range Scans ALLAN ZELENER THE GRADUATE CENTER, CUNY JANUARY 8 TH 2015.

Conclusionso 3D shape and context based features provide consistent improvements to classification systems

o Learned 3D representations that are aware of the unique properties of 3D shape features may see improvement over simple application of 2D techniques

o Structured prediction to model relationships between objects, their parts, and their environment also improves performance

o Sparse or hierarchical structured relationships are desirable for computational efficiency