Top Banner
Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of Technology
30

Object class recognition using unsupervised scale ...

Feb 02, 2022

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Object class recognition using unsupervised scale ...

Object class recognition using unsupervised scale-invariant learning

Rob FergusPietro Perona

Andrew Zisserman

Oxford UniversityCalifornia Institute of Technology

Page 2: Object class recognition using unsupervised scale ...

Goal

• Recognition of object categories

• Unassisted learning

Page 3: Object class recognition using unsupervised scale ...

Some object categories

Learn from examples

Difficulties:

• Size variation• Background clutter• Occlusion• Intra-class variation

Page 4: Object class recognition using unsupervised scale ...

Model: Constellation of Parts

Fischler & Elschlager 1973Yuille ‘91Brunelli & Poggio ‘93Lades, v.d. Malsburg et al. ‘93Cootes, Lanitis, Taylor et al. ‘95Amit & Geman ‘95, ‘99 Perona et al. ‘95, ‘96, ’98, ’00Agarwal & Roth ‘02

Main issues:

• measuring the similarity of parts

• representing the configuration of parts

Page 5: Object class recognition using unsupervised scale ...

• Representation

• Recognition

• Learning

Overview of talk

Page 6: Object class recognition using unsupervised scale ...

Foreground model

Gaussian shape pdf

Poission pdf on # detections

Uniform shape pdf

Gaussian part appearance pdf

Generative probabilistic model

Clutter modelGaussian background

appearance pdf

Gaussian relative scale pdf

log(scale)

Prob. of detection

0.8 0.75 0.9

Uniformrelative scale pdf

log(scale)

Page 7: Object class recognition using unsupervised scale ...

Recognition

Page 8: Object class recognition using unsupervised scale ...

Detection & Representation of regions

Appearance

Location

Scale

(x,y) coords. of region centre

Radius of region (pixels)

11x11 patchNormalizeProjection onto

PCA basis

c1

c2

c15

……

…..

Gives representation of appearance in low-dimensional vector space

• Find regions within image

• Use salient region operator(Kadir & Brady 01)

Page 9: Object class recognition using unsupervised scale ...

MotorbikesSamples from appearance model

Page 10: Object class recognition using unsupervised scale ...

Detected regions

Page 11: Object class recognition using unsupervised scale ...

Recognized Motorbikes

Page 12: Object class recognition using unsupervised scale ...

Background images evaluated with motorbike model

Page 13: Object class recognition using unsupervised scale ...

Learning

Page 14: Object class recognition using unsupervised scale ...

Learning procedure

E-step: Compute assignments for which regions are foreground / background

M-step: Update model parameters

• Find regions & their location, scale & appearanceover all training

• Initialize model parameters

• Use EM and iterate to convergence:

• Trying to maximize likelihood – consistency in shape & appearance

Page 15: Object class recognition using unsupervised scale ...

Experiments

Page 16: Object class recognition using unsupervised scale ...

Experimental procedureTwo series of experiments:• Fixed-scale model - Objects the same size (manual normalization)• Scale-invariant model - Objects between 100 and 550 pixels in width

Datasets

Training• 50% images• No identifcation of

object within image

Testing• 50% images• Simple object

present/absent test

Motorbikes Airplanes Frontal Faces

Cars (Side) Cars (Rear) Spotted cats

Between 200 and 800 images in each dataset

Page 17: Object class recognition using unsupervised scale ...

Frontal faces

Page 18: Object class recognition using unsupervised scale ...

Airplanes

Page 19: Object class recognition using unsupervised scale ...

Spotted cats

Page 20: Object class recognition using unsupervised scale ...

Cars from rear- Scale invariant

Page 21: Object class recognition using unsupervised scale ...

Summary of results

10.010.0Spotted cats

9.715.2Cars (Rear)

7.09.8Airplanes

4.64.6Faces

6.77.5Motorbikes

Scale invariant experiment

Fixed scale experimentDataset

% equal error rate

Note: Within each series, same settings used for all datasets

Page 22: Object class recognition using unsupervised scale ...

Comparison to other methods

AgarwalRoth [ECCV

’02]21.011.5Cars (Side)

Weber32.09.8Airplanes

Weber6.04.6Faces

Weber et al. [ECCV ‘00]16.07.5Motorbikes

OthersOursDataset

% equal error rate

Page 23: Object class recognition using unsupervised scale ...

Robustness of Algorithm

Page 24: Object class recognition using unsupervised scale ...

Sampling from models

Faces Motorbikes

Page 25: Object class recognition using unsupervised scale ...

Extending the ModelTwo types of parts:• Appearance patch - scale invariant region operator• Curve segment - similarity invariant detection and representation

• Canny edge detection – gives edgel chains• Detect bitangent points• Similarity transform curve segment • Represent:

- curve position (x,y coords. of centroid)

- curve scale (distance btw. bitangent points)

- curve shape by 10-vector of y values

0 1

y

x

Page 26: Object class recognition using unsupervised scale ...

Example curves

Page 27: Object class recognition using unsupervised scale ...

Fitting the extended model• Learn models with different combinations of patches and curves

• Choose between models using a validation set

• For the experiments the image datasets are divided into the ratio:

• 5/12 training

• 1/6 validation

• 5/12 testing

Example datasets

Camels Bottles Zebras

Page 28: Object class recognition using unsupervised scale ...

Camels

Page 29: Object class recognition using unsupervised scale ...

Bottles using patches and curves

Page 30: Object class recognition using unsupervised scale ...

Summary

Future work

• Comprehensive probabilistic model for object classes

• Learn appearance, shape, relative scale, occlusion etc. simultaneously in scale and translation invariant manner

• Same algorithm gives <= 10% error across 5 diverse datasets with identical settings

• Invariance to (affine) viewpoint changes

• Extend to 100’s of object categories

• Reduce training requirements - fewer imagesUse Bayesian methods – ICCV ’03 paper