Top Banner
Fast and Compact Retrieval Methods in Computer Vision Rahul Garg Xiao Ling
50

Fast and Compact Retrieval Methods in Computer Vision

Feb 25, 2016

Download

Documents

Fast and Compact Retrieval Methods in Computer Vision. Rahul Garg Xiao Ling. Objective. phrase. Given a Query image, find all instances of that object in an image database. On the web. Objective. phrase. Given a Query image, find all instances of that object in an image database. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Fast and Compact Retrieval Methods in Computer Vision

Fast and Compact Retrieval Methods in Computer Vision

Rahul GargXiao Ling

Page 2: Fast and Compact Retrieval Methods in Computer Vision

Objective

• Given a Query image, find all instances of that object in an image database

phrase

On the web

Page 3: Fast and Compact Retrieval Methods in Computer Vision

Objective

• Given a Query image, find all instances of that object in an image database

phrase

On the web

World Wide Web

“search this”

Page 4: Fast and Compact Retrieval Methods in Computer Vision

Text Search Overview

Page 5: Fast and Compact Retrieval Methods in Computer Vision

Document Representation

Parse into words vocabulary

Vector of frequencies of

words

(0,3,4,0,0,5,6,0,……….,1)

Page 6: Fast and Compact Retrieval Methods in Computer Vision

Document Representation

(0,3,4,0,0,5,6,0,……….,1)K dimensional vectorK : number of words in vocabulary

Page 7: Fast and Compact Retrieval Methods in Computer Vision

Document Representation: Example

The quick brown fox jumps over the brown dog

QuickBrownFoxJumpOverDogCat

Vocabulary

1 2 1 1 1 1 0

Quick Brown Fox Jump Over Dog Cat

Page 8: Fast and Compact Retrieval Methods in Computer Vision

Document Representation: Weighted Frequencies

• Uncommon words are given more weight

Weight( wordi) α log(1/number of occurrences of wordi in the whole database)

1 2 1 1 1 1 0

Quick Brown Fox Jump Over Dog Cat

Term frequency – inverse document frequency (tf-idf)

Page 9: Fast and Compact Retrieval Methods in Computer Vision

Querying

• Reduce query to vector form

• Find “nearest” document vectors

quick brown fox (1,1,1,0,0,0,0)

Page 10: Fast and Compact Retrieval Methods in Computer Vision

Text Retrieval Applied to Object Retrieval in Images

• Video Google, Sivic et. al. ICCV 2003

Page 11: Fast and Compact Retrieval Methods in Computer Vision

Text Retrieval vs Object Retrieval

Documents Images

Words

Text Retrieval Object Retrieval

Page 12: Fast and Compact Retrieval Methods in Computer Vision

“Visual Words”

• Idea: Regions of image which are easy to detect and match

Build descriptor(representation)of the region

Page 13: Fast and Compact Retrieval Methods in Computer Vision

Feature Descriptors

• Issues– Illumination changes– Pose changes– Scale….

• Video Google uses SIFT

( 130,129,….,101)

Page 14: Fast and Compact Retrieval Methods in Computer Vision

Problem: Visual Words are noisy

Descriptors may turn out to be slightly different

Solution: Quantize!

Page 15: Fast and Compact Retrieval Methods in Computer Vision

Building Vocabulary of Visual Words

Throw in all descriptorsfrom the database

Cluster Into K Visual words usingK means

Vocabulary of K visual words

Page 16: Fast and Compact Retrieval Methods in Computer Vision

Image RepresentationAnalogous to Document Representation

Find Descriptors

MapDescriptors

To nearest visual wordsFrequencyVector

(0,3,4,0,0,5,6,0,……….,1)

Page 17: Fast and Compact Retrieval Methods in Computer Vision

Querying

(0,3,4,0,0,5,6,0,……….,1)

Find similar vectors (images) in the database

Page 18: Fast and Compact Retrieval Methods in Computer Vision

Finding Similar Vectors

• Problem: Number of Vectors is large

• Vectors are sparse: index using words

Word 1 List of images containing word1

Word 2 …..

Inverted Index Files

Page 19: Fast and Compact Retrieval Methods in Computer Vision

Stop Lists

• Text Retrieval: Remove common words from vocabulary

• Analogy: Remove common visual words from vocabulary

is, are, the, that,…

Page 20: Fast and Compact Retrieval Methods in Computer Vision

Spatial Consistency Ranking

• Text Retrieval: Increase ranking of results where search words appear close

The quick brown fox jumps over the lazy dog

Fox news: How to make brownbrownies quickly>

Search: quick brown fox

Page 21: Fast and Compact Retrieval Methods in Computer Vision

Spatial Consistency Ranking

• More relevant in case of images: visual words need to be in same configuration

T

Page 22: Fast and Compact Retrieval Methods in Computer Vision

Spatial Consistency Ranking in Video Google

Page 23: Fast and Compact Retrieval Methods in Computer Vision

Performance Evaluation Metrics

K results returned by queryc: correct resultsN: total number of correct results in the database

• Precision: c/K• Recall: c/N

Increase K gradually to generate (Precision, Recall) pairs till K = N (Recall = 1.0)

Page 24: Fast and Compact Retrieval Methods in Computer Vision

Performance Evaluation Metrics

0.1 0.2 0.3 0.4 0.5

0.600000000000001

0.700000000000001 0.8 0.9 10

0.20.40.60.8

11.2

Precision-Recall Curve

Precision

RecallArea Under Curve:Average Precision (AP)

Page 25: Fast and Compact Retrieval Methods in Computer Vision

Video Google: SummaryFind Descriptors

Learn Vocabulary

Assign descriptors to words

Find Similar vectors

Rank Results

SIFT

K MeansO(Nk)

Linear SearchO(k)

Inverted Files

Loose Spatial Consistency

VocabBuilding

QueryStage

We need MORE words!

K = ~6K – 10K

Page 26: Fast and Compact Retrieval Methods in Computer Vision

Tree Structure

• Tree structure for– searching – indexing

Incorporate inner nodes for scoring

iii wnq

Term Frequency

Inverted Document Frequency

Page 27: Fast and Compact Retrieval Methods in Computer Vision

Hierarchical K-Means [Nistér et al, CVPR’06]

• K – branching factor• Time complexity:– O(N log (# of leaves)) for construction– O(log (# of leaves)) for searching

• Cons:– Wrong nearest neighbors assignment– Suffer from bad initial clusters

Page 28: Fast and Compact Retrieval Methods in Computer Vision

Hierarchical K-Means: SummaryFind Descriptors

Learn Vocabulary

Assign descriptors to words

Find Similar vectors

Rank Results

MSER. SIFT

Hierarchical K MeansO(N log(# of leaves))

Search along the pathO(k log(# of leaves))

Each node has a Inverted File list

No Spatial Consistency

VocabBuilding

QueryStage

# of leaves = 1M

SIFT

K MeansO(Nk)

Linear SearchO(k)

Inverted Files

Loose Spatial Consistency

Page 29: Fast and Compact Retrieval Methods in Computer Vision

Approximate k-means [Philbin et al, CVPR’07]

• HKM: 1. not the best NN 2. error propagation• Go back to flat vocabulary, but much faster• Nearest neighbor search is the bottleneck• Use kd-tree to speed up

Page 30: Fast and Compact Retrieval Methods in Computer Vision

Kd tree

• k-d tree hierarchically decomposes the descriptor space

Page 31: Fast and Compact Retrieval Methods in Computer Vision

Approximate k-means cont.

• Best bin first Search: O(log k)

dist

dist

Priority queue by dist

Page 32: Fast and Compact Retrieval Methods in Computer Vision

AKM cont.

• Use multiple (L) randomized kd trees for Approximate NN search, in both construction and assignment phase

• Searching complexity: O(L*log K + C)

• Approximate K-means complexity: O(N log K)

Share one priority queue!

Page 33: Fast and Compact Retrieval Methods in Computer Vision

Approximate k-means cont.

• Close to exact k-means, though much faster

• Superior to HKM empirically

Mean AP

Page 34: Fast and Compact Retrieval Methods in Computer Vision

Approximate K-Means: SummaryFind Descriptors

Learn Vocabulary

Assign descriptors to words

Find Similar vectors

Rank Results

SIFT

Approximate K MeansO(N log(# of leaves))

SearchO(log(# of leaves))

Inverted File list

Transformation based Spatial Verification

VocabBuilding

QueryStage

# of leaves = 1M

Page 35: Fast and Compact Retrieval Methods in Computer Vision

Low Recall

• Feature detection and quantization– Even for the same object,

different visual words!– Query region may not

contain enough features• Two possible solutions– Query expansion [Chum et al, ICCV’07]

– Soft Assignment [Philbin et al, CVPR’08]

NOISY!

Page 36: Fast and Compact Retrieval Methods in Computer Vision

Query Expansion

• Text retrieval– Dimension is too high!

Query: violin ……

results Results about fiddle

……Search engine

Expanded query

Search engine

Page 37: Fast and Compact Retrieval Methods in Computer Vision

Query expansion [Chum et al, ICCV’07]

• Basic idea: augment the query with visual words from initial matching region

Query initial result list

expanded query by new resultsaveraging the results

• What if the initial results are poor?– Filter by spatial constraints

Page 38: Fast and Compact Retrieval Methods in Computer Vision

Query expansion cont.

• Results

Query Initial results Results by expansion

Page 39: Fast and Compact Retrieval Methods in Computer Vision

Query expansion cont.

• Increasing recall without loss of precision

before after

Each line: each query image for a certain landmark

Precision-recall curves

Page 40: Fast and Compact Retrieval Methods in Computer Vision

Query Expansion: SummaryFind Descriptors

Learn Vocabulary

Assign descriptors to words

Find Similar vectors

Rank Results

SIFT

Approximate K Means

Search

Inverted File list

Spatial Verification to find inliers for expansion

VocabBuilding

QueryStage

Page 41: Fast and Compact Retrieval Methods in Computer Vision

Soft Assignment [Philbin et al, CVPR’08]

• Try to capture the information for the near-boundary features by associating one feature to several words

• Intuition: includes “semantically” similar variants in the context of text retrieval

• with denser image representation (thus more storage)

• Can be applied to existing methodstradeoff!

Page 42: Fast and Compact Retrieval Methods in Computer Vision

Soft Assignment

• Associate a single descriptor with r nearby cluster centers instead of its single nearest-neighbor clusterweight

• Modified tfidf scheme– tf: use real values for frequency– idf: counting occurrence as one, empirically best

• Modified spatial verification– weighted score instead of occurrence to rank hypothesis

)2

exp( 2

2

d

Page 43: Fast and Compact Retrieval Methods in Computer Vision

MatchingResults

Page 44: Fast and Compact Retrieval Methods in Computer Vision

Soft assignment cont.

• Improvements

Page 45: Fast and Compact Retrieval Methods in Computer Vision

Soft Assignments: SummaryFind Descriptors

Learn Vocabulary

Assign descriptors to words

Find Similar vectors

Rank Results

SIFT

Approximate K Means

Soft Assignment

Inverted File list

Spatial Verification reranking + query expansion

VocabBuilding

QueryStage

3 times storage when using 3-NN soft assignment

10% mAP gain

Tradeoff!

Page 46: Fast and Compact Retrieval Methods in Computer Vision

Spatial Information Lost

• Quantization is information loss process– From 2d (pixel) structure to (feature)vector

• How to model the geometry?

Page 47: Fast and Compact Retrieval Methods in Computer Vision

Spatial Consistency Constraints [Chum et al, ICCV’07]

T (3 dof)

T’ (6 dof)

scale1 scale2

Page 48: Fast and Compact Retrieval Methods in Computer Vision

Conclusion

• Borrow text retrieval methods to conduct fast image retrieval, e.g. tf-idf weight, query expansion

• Quantization, searching and indexing are the core problems

Find Descriptors

Learn Vocabulary

Assign descriptors to words

Find Similar vectors

Rank Results

Page 49: Fast and Compact Retrieval Methods in Computer Vision

Future Work

• Goal: Web-scale retrieval system

• Vocabulary to span the space of all images?

• Spatial information in Indexing instead of Ranking

Page 50: Fast and Compact Retrieval Methods in Computer Vision

QA and comments?