Top Banner
Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards
14

Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

Dec 21, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision

A Hand-Held “Scanner” forLarge-Format Images

COMP 256

Adrian Ilie

Steps Towards

Page 2: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision Previous Work: Panoramas

• Feature extracting: use SIFT, since they are scale-invariant and partially invariant to affine illumination changes - done

• Feature matching: approximate nearest neighbor - done

• Image matching: probabilistic model using RANSAC inliers/outliers - N/A

• Bundle adjustment: add images one by one and iterate using Levenberg-Marquardt - N/A

• Blending: multi-band - not done

Page 3: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision Extracting the Features

• Use SIFT features– Location: peaks in DoG pyramids– Descriptors: gradient orientation

histograms

Page 4: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision Matching Features

• Look for closest 2 descriptors in a k-d tree (logarithmic speed)

• If distance(descriptor, 1st closest) < 0.36*distance(descriptor, 2nd closest), descriptor is a good match

Page 5: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision Computing the Homography

• MLESAC

Page 6: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision Warping the Images

• Use bilinear interpolation

Page 7: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision Algorithm

• Take an “overview” image• Extract its features and build a k-d tree• Take N “detail” images• For each image i

Extract the features Match the features against the ones in

the k-d tree Use MLESAC to compute the homography Warp the image Blend the image into the current estimate Update the k-d tree

Page 8: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision Blending the Warped Images

• Detail image has higher resolution!• Resample the current estimate so that

the area corresponding to the warped image is equal to the area of the unwarped image

• Can blend using some weights, or just use the detail image pixel (since it is of higher quality)

Page 9: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision Results

Page 10: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision Scanning My Favorite Poster ;)

Page 11: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision Scanning My Favorite Poster ;)

Page 12: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision Scanning My Favorite Poster ;)

Page 13: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision Scanning My Favorite Poster ;)

Page 14: Computer Vision A Hand-Held “Scanner” for Large-Format Images COMP 256 Adrian Ilie Steps Towards.

ComputerVision Issues and Future Work

• Issues– Radial distortion in the “overview” image– Numerical instability of the homography

computation– Illumination changes across images

• Future work– Super-resolution would be nice to have– It would be nice to have a nice viewer

that would take images and homographies as input, then blend and render them at the appropriate level of detail, depending on the zoom level