Deep Convolutional Neural Networks – Beyond Classification Abhishek Sharma

Mar 31, 2018



CNNs so far

• Handwritten digit classification, 1995ish

• State-of-the-art recognition accuracy for handwritten digits [0-9], used in automatic check deposit and postal applications

• Plain CNN structure with 3 convolutional stages + 2 fully connected layers

Courtesy – Yann LeCun et al.


CNNs so far

• ImageNet challenge, 2012

• State-of-the-art top-5 accuracy on 1K object categories is now near 95% (AlexNet reached about 85% in 2012)

• AlexNet – 5 convolutional layers + 3 fully connected layers

• VGG – 16-layer network with large filters decomposed into stacks of 3×3 filters

Courtesy – AlexNet, NIPS 2012
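
To make the "decomposed filters" point concrete, here is a small arithmetic sketch. It compares one 7×7 convolution with a stack of three 3×3 convolutions covering the same receptive field. The channel count of 256 and the helper names are assumptions chosen for illustration, not values from the slides.

```python
def conv_params(kernel, channels_in, channels_out):
    """Number of weights in one convolutional layer (biases ignored)."""
    return kernel * kernel * channels_in * channels_out


def stacked_receptive_field(kernel, num_layers):
    """Receptive field of num_layers stride-1 convolutions with the same kernel size."""
    return 1 + num_layers * (kernel - 1)


C = 256  # hypothetical channel count, the same for input and output

single_7x7 = conv_params(7, C, C)
three_3x3 = 3 * conv_params(3, C, C)

print(stacked_receptive_field(3, 3))  # 7: three 3x3 layers see a 7x7 window
print(single_7x7, three_3x3)          # 3211264 1769472
```

Under these assumptions the stacked version needs about 55% of the weights of the single large filter and adds two extra non-linearities between the layers.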


CNNs so far

• DeepFace, 2014

• Close to human accuracy on face verification (same or different)

Courtesy – DeepFace, CVPR 2014


What is really important?

• The convolutional layers are the most important part

• A pre-trained network for ImageNet classification can be used for many different vision tasks.

• Detection

Courtesy – R-CNN, CVPR 2014


What next?

• Segmentation

• Depth Image

• Image Quality Estimation

• Stereo?

• 3-D reconstruction?

• Imagine and grab the glory


Semantic segmentation

[Figure: an input image I and its per-pixel label map Y]


Can a CNN be used?

• Yes

• The state of the art is indeed a CNN-based model.

• How?

• Use a CNN to extract per-pixel features.

• Treat each output location (i,j) in a CNN cube as a feature for an image patch.

• Map each output CNN location back to the pixels it corresponds to.

• Use the pixel features to classify each pixel.

• But really, how?
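
One way to see which image patch an output location (i,j) describes is to track the receptive field through the layers. The sketch below does this for a stack of convolution or pooling layers, each given as a (kernel, stride, padding) tuple. The function names and the two-layer example network are hypothetical, and pixel coordinates are taken as 0-indexed pixel centres.

```python
def receptive_field(layers):
    """Return (size, jump, start) for a stack of (kernel, stride, padding) layers.

    size  : side length, in input pixels, of the patch seen by one output unit
    jump  : distance, in input pixels, between neighbouring output units
    start : input coordinate of the centre of output unit 0
    """
    size, jump, start = 1, 1, 0.0
    for kernel, stride, padding in layers:
        start += ((kernel - 1) / 2 - padding) * jump
        size += (kernel - 1) * jump
        jump *= stride
    return size, jump, start


def patch_for_output(layers, i, j):
    """Input row and column ranges (inclusive) covered by output location (i, j)."""
    size, jump, start = receptive_field(layers)
    half = (size - 1) / 2
    rows = (start + i * jump - half, start + i * jump + half)
    cols = (start + j * jump - half, start + j * jump + half)
    return rows, cols


# Hypothetical network: a 3x3 convolution followed by 2x2 max-pooling with stride 2.
layers = [(3, 1, 0), (2, 2, 0)]
print(receptive_field(layers))         # (4, 2, 1.5)
print(patch_for_output(layers, 1, 0))  # ((2.0, 5.0), (0.0, 3.0))
```

With padding, the computed ranges can extend past the image border; those positions correspond to padded values rather than real pixels.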


Understanding CNN cubes


Multi-scale CNN for better accuracy

Courtesy – Farabet and LeCun, TPAMI
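
The multi-scale idea can be sketched as follows: run the same feature extractor on several down-sampled copies of the image, up-sample each feature cube back to full resolution, and concatenate them per pixel. This is only an illustration. `toy_features` is a hypothetical stand-in for a shared-weight CNN, the pyramid uses plain 2×2 averaging, and the image sides are assumed to be divisible by 2**(num_scales - 1).

```python
import numpy as np


def downsample2(img):
    """Halve the resolution by averaging non-overlapping 2x2 blocks."""
    h, w = img.shape[0] // 2 * 2, img.shape[1] // 2 * 2
    return img[:h, :w].reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))


def toy_features(img):
    """Hypothetical stand-in for a CNN: intensity plus absolute gradients."""
    gy, gx = np.gradient(img)
    return np.stack([img, np.abs(gx), np.abs(gy)])


def multiscale_features(img, num_scales=3):
    """Apply the same extractor at each scale and concatenate per-pixel features."""
    feats, current = [], img
    for s in range(num_scales):
        f = toy_features(current)
        factor = 2 ** s
        # Nearest-neighbour up-sampling back to the original resolution.
        feats.append(f.repeat(factor, axis=1).repeat(factor, axis=2))
        current = downsample2(current)
    return np.concatenate(feats, axis=0)


image = np.random.default_rng(0).random((32, 32))
features = multiscale_features(image)
print(features.shape)  # (9, 32, 32): 3 features at each of 3 scales
```

Each pixel ends up with a feature vector that mixes fine detail from the full-resolution scale with wider context from the coarser scales.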


Pre-training is still better

• Using a pre-trained VGG model and simply up-sampling its per-pixel classification gives better accuracy than all previous models.
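
A minimal sketch of "up-sample the per-pixel classification", assuming the network has already produced a coarse cube of class scores with shape (classes, h, w). The random scores stand in for real network output, nearest-neighbour up-sampling is an assumption (the slides do not say which up-sampling is used), and the factor of 32 is only an example.

```python
import numpy as np


def upsample_nearest(scores, factor):
    """Repeat each coarse score `factor` times along height and width."""
    return scores.repeat(factor, axis=1).repeat(factor, axis=2)


def per_pixel_labels(coarse_scores, factor):
    """Up-sample class scores to image resolution and pick the best class per pixel."""
    dense = upsample_nearest(coarse_scores, factor)
    return dense.argmax(axis=0)


rng = np.random.default_rng(0)
coarse_scores = rng.normal(size=(3, 4, 4))  # stand-in for network output: 3 classes on a 4x4 grid
labels = per_pixel_labels(coarse_scores, factor=32)
print(labels.shape)  # (128, 128)
```

Smoother interpolation of the scores before the argmax would give less blocky label maps; nearest-neighbour is used here only because it is the shortest to write.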


Discussion and informal thoughts

• A CNN is a very powerful feature-learning paradigm and has the potential to be used almost everywhere, owing to its trainability and cross-application generalization.

• Think of it as extracting features from a patch and then doing something useful depending on the application.