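About me
• Second-year master's student, Graduate School of Information Science and Technology, The University of Tokyo
• Research topics: object retrieval, information retrieval, etc.
• Hobbies: swimming, Go
• Blog: https://imsmarxen68.tumblr.com/
NGUYEN ANH TUAN, The University of Tokyo, Information Science and Technology, second-year master's student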
Outline
• Semantic Visual Search
• A visual search framework
Pipeline: Feature extraction → Feature aggregation → Feature matching → Preliminary results → Re-ranking → Final results
Image credits: http://ai.stanford.edu/~jkrause/cars/car_dataset.html
Visual search
Image credits: http://ai.stanford.edu/~jkrause/cars/car_dataset.html
Image credits: http://google.com
What’s the problem?
• Semantic difficulties: fine-grained differences
Image credits: http://ai.stanford.edu/~jkrause/cars/car_dataset.html
But what about the search problem?
Image credits: http://ai.stanford.edu/~jkrause/cars/car_dataset.html
Query | Database (scores: 0.1, 0.5, 0.2)
A ranking problem over a variety of fine-grained changes
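One way to read the ranking view above is to score every database image against the query and sort by score. The following is a minimal sketch, assuming each image is already represented by a descriptor vector and that a higher cosine similarity means a better match. The descriptor values and the image names (`car_a`, `car_b`, `car_c`) are hypothetical and are not taken from the slides.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def rank_database(query, database):
    """Return (name, score) pairs sorted from best to worst match."""
    scored = [(name, cosine_similarity(query, vec))
              for name, vec in database.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

# Hypothetical descriptors, for illustration only.
query = [1.0, 0.0, 1.0]
database = {
    "car_a": [0.0, 1.0, 0.0],
    "car_b": [1.0, 0.1, 0.9],
    "car_c": [1.0, 1.0, 0.0],
}
ranking = rank_database(query, database)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

With fine-grained categories the scores of the top candidates tend to be close to each other, which is why the quality of the descriptor matters so much for the final order.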
Goal: find visual representations that capture all the fine-grained local information in images.
Robust feature extraction
• Robust to
– Scale changes
– Rotation and affine changes
– Blur, sharpening, …
Image credits: http://ai.stanford.edu/~jkrause/cars/car_dataset.html
A picture is worth a thousand words
Statistical kernels
• Bag-of-Features (BoF)
• Fisher kernel (GMM) [1]
• VLAD (K-means) [2]
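To make the aggregation step concrete, here is a minimal sketch of VLAD: each local descriptor is assigned to its nearest centroid, the residuals (descriptor minus centroid) are summed per centroid, and the concatenated result is L2-normalized. The centroids are assumed to be given (in practice they would come from K-means), and the toy 2-D descriptors are hypothetical. A Bag-of-Features histogram would count the assignments instead of summing residuals.

```python
import math

def nearest_centroid(x, centroids):
    """Index of the centroid closest to x (squared Euclidean distance)."""
    def sq_dist(c):
        return sum((xi - ci) ** 2 for xi, ci in zip(x, c))
    return min(range(len(centroids)), key=lambda k: sq_dist(centroids[k]))

def vlad(descriptors, centroids):
    """Aggregate local descriptors into one L2-normalized VLAD vector."""
    k, d = len(centroids), len(centroids[0])
    residuals = [[0.0] * d for _ in range(k)]
    for x in descriptors:
        j = nearest_centroid(x, centroids)
        for i in range(d):
            residuals[j][i] += x[i] - centroids[j][i]
    # Concatenate the per-centroid residual sums into a k*d vector.
    flat = [v for row in residuals for v in row]
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat

# Toy data (assumption): two centroids, three 2-D local descriptors.
centroids = [[0.0, 0.0], [10.0, 10.0]]
descriptors = [[1.0, 0.0], [0.0, 1.0], [9.0, 10.0]]
code = vlad(descriptors, centroids)
print(code)
```

The output length is fixed at (number of centroids) × (descriptor dimension), regardless of how many local descriptors the image has, which is what makes the vector usable for nearest neighbor search.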
Image credits: http://www.mathworks.com/matlabcentral/
[1] F. Perronnin, C. Dance, “Fisher Kernels on Visual Vocabularies for Image Categorization,” in Proc. CVPR, IEEE, 2007.
[2] H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez, C. Schmid, “Aggregating Local Image Descriptors into Compact Codes,” IEEE Trans. Pattern Anal. Mach. Intell. 34 (2012) 1704–1716.
Image matching = Feature matching
• Feature matching → Nearest Neighbor Search
– Search with inverted indices
– Compressed data for better memory usage [3]
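The inverted-index idea can be sketched as follows, assuming every image has already been reduced to a list of visual word IDs (quantized local features). The index maps each word to the images that contain it, so a query only touches images sharing at least one word with it. The image names and word IDs are hypothetical, and the compression step of [3] is not shown here.

```python
from collections import Counter, defaultdict

def build_inverted_index(image_words):
    """Map each visual word ID to the set of images containing it."""
    index = defaultdict(set)
    for image_id, words in image_words.items():
        for w in words:
            index[w].add(image_id)
    return index

def search(index, query_words, top_k=3):
    """Rank images by the number of distinct visual words shared with the query."""
    votes = Counter()
    for w in set(query_words):
        for image_id in index.get(w, ()):
            votes[image_id] += 1
    return votes.most_common(top_k)

# Hypothetical database: image name -> visual word IDs.
image_words = {
    "img1": [3, 7, 7, 42],
    "img2": [7, 8],
    "img3": [1, 2],
}
index = build_inverted_index(image_words)
results = search(index, [3, 7, 42, 99])
print(results)
```

Images with no word in common with the query (`img3` here) are never visited, which is where the speed-up over a linear scan comes from.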
(Figure: data compression)
[3] H. Jégou, M. Douze, C. Schmid, “Product Quantization for Nearest Neighbor Search,” IEEE Trans. Pattern Anal. Mach. Intell. 33 (2011) 117–128.
Verification
• Geometric verification
– RANSAC methods [4]
– Keep only the good inliers, reducing the number of matches
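A minimal sketch of RANSAC-style verification follows. As a simplifying assumption, the geometric model is a pure 2-D translation, so a single sampled match is enough to form a hypothesis. Real systems usually fit an affine transform or a homography instead. The function name, threshold, and match coordinates are hypothetical.

```python
import random

def ransac_translation(matches, threshold=2.0, iterations=100, seed=0):
    """Estimate a 2-D translation from point matches with RANSAC.

    matches: list of ((x1, y1), (x2, y2)) correspondences.
    Returns (best_translation, indices_of_inlier_matches).
    """
    rng = random.Random(seed)
    best_t, best_inliers = None, []
    for _ in range(iterations):
        # Hypothesis: the translation implied by one randomly sampled match.
        (x1, y1), (x2, y2) = rng.choice(matches)
        tx, ty = x2 - x1, y2 - y1
        # Consensus: matches that agree with this translation within the threshold.
        inliers = [
            i for i, ((ax, ay), (bx, by)) in enumerate(matches)
            if (ax + tx - bx) ** 2 + (ay + ty - by) ** 2 <= threshold ** 2
        ]
        if len(inliers) > len(best_inliers):
            best_t, best_inliers = (tx, ty), inliers
    return best_t, best_inliers

# Hypothetical matches: four consistent with a shift of (5, 3), two wrong.
matches = [
    ((0, 0), (5, 3)),
    ((10, 2), (15, 5)),
    ((4, 8), (9, 11)),
    ((7, 7), (12, 10)),
    ((1, 1), (40, -20)),
    ((3, 9), (0, 0)),
]
translation, inliers = ransac_translation(matches)
print(translation, inliers)
```

One common use in re-ranking, stated here as an assumption rather than as the method of the slides, is to reorder the preliminary results by the number of inliers each candidate image retains.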
Image credits: http://ai.stanford.edu/~jkrause/cars/car_dataset.html
[4] M. A. Fischler, R. C. Bolles, “Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography,” Commun. ACM 24 (1981) 381–395.