Top Banner
Machine learning & category recognition Cordelia Schmid Jakob Verbeek
37
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Machine learning &

category recognition

Cordelia Schmid

Jakob Verbeek

Page 2: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

This class

• Part 1: Visual object recognition

• Part 2 : Machine learning

Page 3: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Visual recognition - Objectives

• Particular objects and scenes, large databases

Page 4: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Finding the object despite possibly large changes inscale, viewpoint, lighting and partial occlusion

requires invariant description

ViewpointScale

Lighting Occlusion

Difficulties

Page 5: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Difficulties

• Very large images collection need for efficient indexing

– Flickr has 2 billion photographs, more than 1 million added daily

– Facebook has 15 billion images (~27 million added daily)

– Large personal collections

– Video collections, i.e., YouTube

Page 6: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Search photos on the web for particular places

Find these landmarks ...in these images and 1M more

Applications

Page 7: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Applications

• Take a picture of a product or advertisement

find relevant information on the web

[Pixee – Milpix]

Page 8: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Applications

• Finding stolen/missing objects in a large collection…

Page 9: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Applications

• Copy detection for images and videos

Search in 200h of videoQuery video

Page 10: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

10K. Grauman, B. Leibe

• Sony Aibo – Robotics– Recognize docking station– Communicate with visual cards– Place recognition– Loop closure in SLAM

Slide credit: David Lowe

Applications

Page 11: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Instance-level recognition: Approach

• Extraction of invariant image descriptors

• Matching descriptors between images- Matching of the query images to all images of a database- Speed-up by efficient indexing structures

• Geometric verification– Verification of spatial consistency for a short list

Page 12: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

This class

• Lecture 2: Local invariant features – Student presentation: scale and affine invariant interest point

detectors

Page 13: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

This class

• Lecture 3: Instance-level recognition: efficient search– Student presentation: scalable recognition with a vocabulary tree

Page 14: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Visual recognition - Objectives

• Object classes and categories (intra-class variability)

Page 15: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

• Image classification: assigning label to the image

Tasks

Car: presentCow: presentBike: not presentHorse: not present…

• Object localization: define the location and the category

Car CowLocatio

n

Category

Visual object recognitionVisual recognition - Objectives

Page 16: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Difficulties: within object variations

Variability: Camera position, Illumination,Internal parameters

Within-object variations

Page 17: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Difficulties: within-class variations

Page 18: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Visual category recognition

• Robust image description – Appropriate descriptors for objects and categories

• Statistical modeling and machine learning for vision– Selection and adaptation of existing techniques

Page 19: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Why machine learning?

• Early approaches: simple features + handcrafted models• Can handle only few images, simples tasks

L. G. Roberts, Machine Perception of Three Dimensional Solids,

Ph.D. thesis, MIT Department of Electrical Engineering, 1963.

Page 20: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Why machine learning?

• Early approaches: manual programming of rules• Tedious, limited and does not take into accout the data

Y. Ohta, T. Kanade, and T. Sakai, “An Analysis System for Scenes Containing objects with Substructures,” International Joint Conference on Pattern Recognition, 1978.

Page 21: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Why machine learning?

• Today lots of data, complex tasks

Internet images, personal photo albums

Movies, news, sports

Page 22: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Why machine learning?

• Today lots of data, complex tasks

Surveillance and security Medical and scientific images

Page 23: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Why machine learning?

• Today: Lots of data, complex tasks

• Instead of trying to encode rules directly, learn them from examples of inputs and desired outputs

Page 24: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Types of learning problems

• Supervised– Classification– Regression

• Unsupervised• Semi-supervised• Reinforcement learning• Active learning• ….

Page 25: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Image classification : Approach

• Excellent results in the presence of background clutter

bikes books building cars people phones trees

Bag-of-features for image classification

Page 26: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Bag-of-features for image classification

Classification

SVM

Extract regions Compute descriptors

Find clusters and frequencies

Compute distance matrix

Page 27: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Spatial pyramids: perform matching in 2D image space

This class

• Lecture 4: Bag-of-features models for image classification– Student presentation: beyond bags of features: spatial pyramids

Page 28: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Object category localization: examples

Car

Sofa

Bicycle

Horse

Page 29: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Object category localization• Method with sliding windows (Each window is classified as

containing or not the targeted object)

• Learn a classifier by providing positive and negative examples

Page 30: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Localization approach

Histogram of oriented image gradients as image descriptor

SVM as classifier, importance weighted descriptors

Page 31: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Localization of “shape” categories

Window descriptor + SVM Horse localization

Page 32: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Localization based on shape

Page 33: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

This class

• Lecutre 5: Category-level object localization – Student presentation: object detection with discriminatively trained

part based models

Page 34: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

This class - schedule

• Session 1, October 1 2010– Cordelia Schmid: Introduction – Jakob Verbeek: Introduction Machine Learning

• Session 2, December 3 2010– Jakob Verbeek: Clustering with k-means, mixture of Gaussians – Cordelia Schmid: Local invariant features – Student presentation 1 : Scale and affine invariant interest point detectors,

Mikolajczyk and Schmid, IJCV 2004.

• Session 3, December 10 2010– Cordelia Schmid: Instance-level recognition: efficient search– Student presentation 2: Scalable recognition with a vocabulary tree, Nister and

Stewenisus, CVPR 2006.

Page 35: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Plan for the course

• Session 4, December 17 2010– Jakob Verbeek: Mixture of Gaussians, EM algo.,Fisher Vector image representation

– Cordelia Schmid: Bag-of-features models for category-level classification – Student presentation2: Beyond bags of features: spatial pyramid matching for recognizing natural

scene categories, Lazebnik, Schmid and Ponce, CVPR 2006.

• Session 5, January 7 2011– Jakob Verbeek: Classification 1: generative and non-parameteric methods – Student presentation 4: Large-scale image retrieval with compressed Fisher vectors, Perronnin,

Liu, Sanchez and Poirier, CVPR 2010.

– Cordelia Schmid: Category level localization: Sliding window and shape model – Student presentation 5: Object detection with discriminatively trained part based methods,

McAllester and Ramanan, PAMI 2010.

.

This class - schedule

Page 36: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

Plan for the course

• Session 6, January 14 2011– Jakob Verbeek: Classification 2: discriminative models

– Student presentation 6:TagProp: discriminative metric learning in nearest neighbor models for image auto-annotation, Guillaumin, Mensink, Verbeek and Schmid, ICCV 2009.

– Student presentation 7: IMG2GPS: estimating geographic information from a single image, Hays and Efros, CVPR 2008.

This class - schedule

Page 37: Machine learning & category recognition Cordelia Schmid Jakob Verbeek.

This class

• Class web page at – http://lear.inrialpes.fr/people/verbeek/MLCR.10.11– Slides available after class

• Student presentations– 20 minutes oral presentation with slides, 5 minutes questions– Two students present together one paper

• Grades– 50% final exam– 25% presentation– 25% short quiz after each presentation