ICVSS2008: Randomized Decision Forests

Randomized Decision Forestsfor

Segmentation and Recognition

Jamie Shotton

ICVSS 2008, Sicily, Italy

http://jamie.shotton.org/work/presentations/ICVSS2008.zip

Randomized Decision Forests

• Very fast– for classification– for clustering

• Generalization through random training

• Inherently multi-class– automatic feature sharing [Torralba et al. 07]

• Simple training / testing algorithms

“Randomized Decision Forests” = “Randomized Forests” = “Random ForestsTM”

detailedreferences atend of slides

(Amongothers...)

Randomized Forests in Vision

[Lepetit et al., 06]keypoint recognition

[Amit & Geman, 97]digit recognition

[Moosmann et al., 06]visual word clustering

[Shotton et al., 08]object segmentation

boatchair

Live Demo [Shotton et al. 08]

• Real-time object segmentationusing randomized decision forests– trained on MSRC 21-category database:

• Segment image and label segments:

airplane bicycle bird boat body bookbuilding car cat chair cow dog face flower

grass road sheep sign sky tree water

Winner CVPR 2008 Best Demo Award!

Outline

• Tutorial on Randomized Decision Forests

• Applications to Vision– keypoint recognition [Lepetit et al. 06]–object segmentation [Shotton et al. 08]

please ask questions as we go!

The Basics: Is The Grass Wet?

world state

is it raining?

is the sprinkler on?P(wet)= 0.95

P(wet)= 0.9

P(wet)= 0.1

The Basics: Binary Decision Trees

category c

split nodesleaf nodes

10 11 12 13

14 15 16 17

• feature vector v• split functions

fn(v)• thresholds tn

• ClassificationsPn(c)

Decision Tree Pseudo-Code

double[] ClassifyDT(node, v)if node.IsSplitNode then

if node.f(v) >= node.t thenreturn

ClassifyDT(node.right, v)else

return ClassifyDT(node.left, v)

endelse

return node.Pend

Toy Learning Example

• feature vectors are x, y coordinates: v = [x, y]T

• split functions are lines with parameters a, b: fn(v) = ax + by• threshold determines intercepts: tn

• four classes: purple, blue, red, green

• Try several lines, chosen at random

• Keep line that best separates data– information gain

• Recurse

• Recursive algorithm– set In of training examples that reach node n is split:

• Features f and thresholds t chosen at random

• At leaf node n, Pn(c) is histogram of examples In

Randomized Learning

left split

right split thresholdfunction ofexample i’s

feature vector

• Features f(v) chosen from feature pool f 2 F

• Thresholds t chosen in range

• Choose f and t to maximize gain in information

More Randomized Learning

left split

right split

Implementation Details

• How many features and thresholds to try?– just one = “extremely randomized” [Geurts et al. 06]– few -> fast training, may under-fit– many -> slower training, may over-fit

• When to stop?– maximum depth– minimum entropy gain– delta class distribution– pruning?

• Unsupervised training– information gain -> most balanced split

Randomized Learning Pseudo CodeTreeNode LearnDT(I)

repeat featureTests timeslet f = RndFeature()

repeat threshTests timeslet t = RndThreshold(I, f)let (I_l, I_r) = Split(I, f, t)let gain = InfoGain(I_l, I_r)if gain is best then remember f, t, I_l, I_r

endend

if best gain is sufficient return SplitNode(f, t, LearnDT(I_l),

LearnDT(I_r))else

return LeafNode(HistogramExamples(I))end

• Forest is ensemble ofseveral decision trees

– classification is

A Forest of Trees

……tree t1 tree tT

category ccategory c

split nodesleaf nodes

[Amit & Geman 97][Breiman 01][Lepetit et al. 06]

Decision Forests Pseudo-Code

double[] ClassifyDF(forest, v)// allocate memorylet P = double[forest.CountClasses]

// loop over trees in forestfor t = 1 to forest.CountTrees

let P’ = ClassifyDT(forest.Tree[t], v) P = P + P’ // sum distributions

// normaliseP = P / forest.CountTrees

Learning a Forest

• Divide training examples into T subsets It µ I– improves generalization– reduces memory requirements & training time

• Train each decision tree t on subset It– same decision tree learning as before

• Multi-core friendly

• Subsets can be chosen at random or hand-picked• Subsets can have overlap (and usually do)• Could also divide the feature pool into subsets

Learning a Forest Pseudo Code

Forest LearnDF(countTrees, I)// allocate memorylet forest = Forest(countTrees)

// loop over trees in forestfor t = 1 to countTrees

let I_t = RandomSplit(I)forest[t] = LearnDT(I_t)

// return forest objectreturn forest

Toy Forest Classification Demo

ToyClassification

Randomized Forests for Clustering [Moosmann et al. 06]

• Visual words good for e.g. matching, recognitionbut k-means clustering very slow

• Randomized forests for clustering descriptors– e.g. SIFT, texton filter-banks, etc.

• Leaf nodes in forest are clusters– concatenate histograms from trees in forest

8 96 7

42 61 3

……

tree t1 tree tT

[Sivic et al. 03][Csurka et al. 04]

Randomized Forests for Clustering [Moosmann et al. 06]

8 96 7

42 61 3

……

tree t1 tree tT

node index

we’ll see later how to use whole tree

hierarchy!

“bag of words”

Relation to Cascades [Viola & Jones 04]

• Cascades– very unbalanced tree– good for unbalanced binary problems

e.g. sliding window object detection

• Randomized forests– less deep, fairly balanced– ensemble of trees gives robustness– good for multi-class problems

Random Ferns

• Naïve Bayes classifier over random sets of features

• Can be good alternativeto randomized forests

[Özuysal et al. 07] [Bosch et al. 07]

set of features

“random ferns”

individual features

“naïve Bayes”

Bayes’ rule

Short Pause

Any QuestionsSo Far?

Outline

• Tutorial on Randomized Decision Forests

• Applications to Vision Problems– keypoint recognition [Lepetit et al. 06]–object segmentation [Shotton et al. 08]

Fast Keypoint Recognition [Lepetit et al. 06]

• Wide-baseline matchingas classification problem

• Extract prominent key-points in training images

• Forest to classifies:– patches -> keypoints

• Features– pixel comparisons

• Augmented training set– gives robustness to patch scaling, translation, rotation

Fast Keypoint Recognition [Lepetit et al. 06]

• Example videos– from http://cvlab.epfl.ch/research/augm/detect.php

Real-Time Object Segmentation [Shotton et al. 2008]

• Aim – a better visual vocabulary– image categorization

• does this image contain cows, trees, etc.?– object segmentation

• draw and label the outlines of the cow, grass, etc.

• Design goals– fast and accurate– use learned

semantic information

Object Recognition Pipeline

extract features

SIFT, filter bank

clustering

k-means

unsupervisedhand-crafted

classification algorithm

SVM, decision forest, boostingsupervised

assignment

nearest neighbour

clustering‘semantic textons’

local classification

Semantic Texton Forest (STF)• decision forest for both

clustering & classification• tree nodes have learned

object category associations

classification algorithm

SVM, decision forest, boostingsupervised

clustering‘semantic textons’

building

object segmentation

test image

buildingdogroad

image categorization

Support Vector Machine (SVM)• pyramid match kernel

in learned tree hierarchies

Segmentation Forest (SF)• second decision forest• features use layout & context• semantic context

semantic textons(clustering)

building

object segmentation

test image

buildingdogroad

Segmentation Forest (SF)• second decision forest• features use layout & context

extract features

Textons & Visual Words

• Textons [Julesz 81]– computed densely e.g. references– clustered filter-bank responses [Malik 01] [Varma 05]– used for object recognition [Winn 05] [Shotton

07]• Visual words

– usually computed sparsely [Mikolacjzyk 04]– clustered descriptors [Lowe 04]– used for object recognition [Sivic 03] [Csurka 04]

localdescriptors

e.g. [Lowe 04]

filterbank clustering

k-means

assignment

nearest neighbourExpensive!

Semantic Texton Forests (STF)

• A STF is– a decision forest applied at each image pixel– simple pixel-based features

• How is this new?– no descriptors or filter-banks– decision forest

• fast clustering & assignment• local classification Very Fast

learned semantic information

Image Patch Features

f(p) learnedthreshold>

Pixel i gives patch p(21x21 pixels in experiments)

tree split function

Example Semantic Texton Forest

Input Image Ground Truth

A[r] + B[r] > 363A[b] > 98

A[g] - B[b] > 28

A[g] - B[b] > 13A[b] + B[b] > 284

|A[r] - B[b]| > 21|A[b] - B[g]| > 37

Leaf Node Visualization

• Average of all training patches at each leaf node

tree 1

tree 2tree 3

tree 4tree 5

STF Training Examples

• Supervised training

• Regular grid• Random transformations

– learn invariances [Lepetit et al. 06]

(GT colors categories)

Different Levels of Supervision

• STF can be trained with:– no supervision (just the images)

• clustering only – no local classification

– weak supervision (image labels)• trained as if all image labels at pixels

– full supervision (pixel labels)

treebenchgrass

Balancing the Training Set

• Datasets often unbalanced– poor average class accuracy

• Weight training examplesby inverse class frequency

Building

CowShee

SkyAir-plan

FaceCarBike

Flower

SignBird BookChair

CatDog

Proportion of pixels by class(MSRC dataset)

Semantic Textons & Local Classification

test image

ground truth(for reference)

semantic textons(color leaf node index)

local classification(color most likely category)

comparable

Live Demo

MSRC Naïve Segmentation Baseline

• Use only local classification P(c|l) from STF

globalaccuracy

averageaccuracy

supervised 49.7% 34.5% weakly supervised 14.8% 24.1%

Bags of Semantic Textons (BoSTs)semantic textons

(colors leaf node indices)local classification(colors categories)image

semantic texton histogram

tree t1 tree tT

2 3 4 52 3 4 5depth:

region prior

object category

region r

node index

all trees

split nodeleaf node

Choice of Regions for BoSTs

• Image categorization– region r = whole image

• Object segmentation– many image regions r

r2 ir3

i ....

Other Clustering Methods

• Efficient codebooks [Jurie et al. 05]

• Hyper-grid clustering [Tuytelaars et al. 07]

• Hierarchical k-means [Nister & Stewénius 06]

• Discriminant embedding [Hua et al. 07]

• Randomized clustering forests [Moosmann et al. 06]– tree hierarchy not used– ignores classification of forest– uses expensive local descriptors

building

object segmentation

test image

buildingdogroad

Image Categorization

• SVM with learned Pyramid Match Kernel (PMK)– descriptor space [Grauman et al.

05]– image location space [Lazebnik et al. 06]

• New PMK acts on semantic texton histogram– matches P and Q in learned hierarchical histogram space– deeper node matches are more important

norm. depth weight

increased similarity at depth d

Categorization Experiments on MSRC

• Learned PMK vs. radial basis function (RBF)Mean AP

RBF 49.9 learned PMK 76.3

Number of Trees T

NB mean average precision tougher than EER or AuC

building

object segmentation

test image

buildingdogroad

Segmentation Forest

• Object segmentation

• Adapt TextonBoost [Shotton et al. 06]– boosted classifier → randomized decision forest

textons → semantic textons + region priors

– no conditional random field

bicycle

building

Features in Segmentation Forest

offset rectangle r

node index

semantictexton bin

tyobject category

regionprior bin

bincount

learnedthreshold>

?tree split function

How the Features Work

• Rectangles pair with semantic textons can capture– appearance, layout, textural context [Shotton et al. 07]

input image feature1 = (r1, t1)

feature2 = (r2, t2)semantic texton map

feature1 responses

feature2 response

Features in Segmentation Forest

• Learning the randomized forest– regular grid (10x10 pixels)– discriminative pairs of region r and BoST bin

• Region prior allows semantic context– “sheep tend to stand on grass”

• Efficient calculation– compute bins only as required– use integral images [Viola & Jones 04]– sub-sample integral images Live Demo

• Combine– image categorization (SVM with learned PMK)– object segmentation (decision forest)

Image-Level Prior (ILP)

imagecategorization

posterior

prior forsegmentation

SF ILP

weighting

ICVSS2008: Randomized Decision Forests

pyramid match

support vector

allocate memory

semantic texton

classification

learned tree

semantic texton

wise segmentation

Documents

Deep Neural Decision Forests - cv-foundation.org · Deep...

3D Hand Pose Estimation Using Randomized Decision Forest...

Randomized Distributed Decision

Oriented Edge Forests for Boundary Detection · Randomized....

Accelerating Random Forests in Scikit-Learn · First sketch...

ICS 273A Intro Machine Learning decision trees, random...

Randomized Clustering Forests for Image Classification

Mondrian Forests: Efficient Online Random Forests ·...

Semantic Texton Forests for Image Categorization and...

Spatial Decision Forests for MS Lesion Segmentation in...

Decision Trees and Forests Data Analysis and Machine …...

Decision Trees & Random Forests x Deep Neural...

Decision Trees & Random Forests - GitHub PagesDecision Trees...

The Chest Pain Choice Decision Aid: a Randomized Trial

Encoding Atlases by Randomized Classi cation …...Encoding....

Atlas Encoding by Randomized Forests for Label · Atlas...