Top Banner
Deep Neural Networks Nathan Sprague CS444
10

Deep Neural Networks - w3.cs.jmu.edu · GPGPU Maxout RMSProp Dropout Rectified Linear Units Cluster Computing Kaggle Street View House Numbers ... Neurovision: Neural bases of …

Jul 19, 2018

Download

Documents

tranquynh
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Deep Neural Networks - w3.cs.jmu.edu · GPGPU Maxout RMSProp Dropout Rectified Linear Units Cluster Computing Kaggle Street View House Numbers ... Neurovision: Neural bases of …

Deep Neural Networks

Nathan SpragueCS444

Page 2: Deep Neural Networks - w3.cs.jmu.edu · GPGPU Maxout RMSProp Dropout Rectified Linear Units Cluster Computing Kaggle Street View House Numbers ... Neurovision: Neural bases of …

The Deep Learning “Revolution”

● Geoff Hinton introduced a simple idea in 2006● Greedy, Layer-Wise, Unsupervised Pre-Training

– Train the first hidden layer to re-represent the input.– Train the second hidden layer to re-represent the first

hidden layer– ...– Fine-tune the entire network using backpropagation on

labeled data

G. E. Hinton, S. Osindero, and Y. Teh, “A fast learning algorithm for deepbelief nets,” Neural Computation, vol. 18, pp. 1527–1554, 2006.

Page 3: Deep Neural Networks - w3.cs.jmu.edu · GPGPU Maxout RMSProp Dropout Rectified Linear Units Cluster Computing Kaggle Street View House Numbers ... Neurovision: Neural bases of …

The Flood Gates Open

● Better Hardware

● Massive Data Sets

● Better Training Algorithms

● New Architectures

GPGPU

Maxout

DropoutRMSProp

Rectified Linear Units

Cluster Computing

KaggleStreet View House Numbers

Batch Normalization

ImageNet KITTI

Adam Adadelta

Resnets

Page 4: Deep Neural Networks - w3.cs.jmu.edu · GPGPU Maxout RMSProp Dropout Rectified Linear Units Cluster Computing Kaggle Street View House Numbers ... Neurovision: Neural bases of …

Human Visual System

Urbanski, Marika, Olivier A. Coubard, and Clémence Bourlon. "Visualizing the blind brain: brain imaging of visual field defects from early recovery to rehabilitation techniques." Neurovision: Neural bases of binocular vision and coordination and their implications in visual training programs (2014).

Page 5: Deep Neural Networks - w3.cs.jmu.edu · GPGPU Maxout RMSProp Dropout Rectified Linear Units Cluster Computing Kaggle Street View House Numbers ... Neurovision: Neural bases of …

Convolutional Neural Networks

● Convolutional neural networks use the same trick of learning layers of localized features…

● CNN’s were actually being used by Yann Lecun at Bell Labs around 1990

● (He would probably argue that “deep learning” is not so new)

Page 6: Deep Neural Networks - w3.cs.jmu.edu · GPGPU Maxout RMSProp Dropout Rectified Linear Units Cluster Computing Kaggle Street View House Numbers ... Neurovision: Neural bases of …

Convolutions

Grayscale Image1 convolutional filter

http://upload.wikimedia.org/wikipedia/commons/4/4f/3D_Convolution_Animation.gifBy Michael Plotke [CC BY-SA 3.0 (http://creativecommons.org/licenses/by-sa/3.0)

Page 7: Deep Neural Networks - w3.cs.jmu.edu · GPGPU Maxout RMSProp Dropout Rectified Linear Units Cluster Computing Kaggle Street View House Numbers ... Neurovision: Neural bases of …

Convolutions

Grayscale Image1 convolutional filter

Color Image5 convolutional filters

http://upload.wikimedia.org/wikipedia/commons/4/4f/3D_Convolution_Animation.gifBy Michael Plotke [CC BY-SA 3.0 (http://creativecommons.org/licenses/by-sa/3.0)

http://cs231n.github.io/convolutional-networks/The MIT License (MIT)Copyright (c) 2015 Andrej Karpathy

Page 8: Deep Neural Networks - w3.cs.jmu.edu · GPGPU Maxout RMSProp Dropout Rectified Linear Units Cluster Computing Kaggle Street View House Numbers ... Neurovision: Neural bases of …

Pooling Layers

● Pooling layers down-sample the filter outputs to– Reduce dimensionality and computational requirements– Increase the spatial extent of subsequent filters

http://cs231n.github.io/convolutional-networks/The MIT License (MIT)Copyright (c) 2015 Andrej Karpathy

Page 9: Deep Neural Networks - w3.cs.jmu.edu · GPGPU Maxout RMSProp Dropout Rectified Linear Units Cluster Computing Kaggle Street View House Numbers ... Neurovision: Neural bases of …

Complete Network

● A “traditional” CNN is composed of convolutional layers, each followed by non-linearities, followed by pooling layers, with a dense (non-convolutional) layer at the end:

Chen, Xianjie, and Alan L. Yuille. "Articulated pose estimation by a graphical model with image dependent pairwise relations." Advances in Neural Information Processing Systems. 2014.

Page 10: Deep Neural Networks - w3.cs.jmu.edu · GPGPU Maxout RMSProp Dropout Rectified Linear Units Cluster Computing Kaggle Street View House Numbers ... Neurovision: Neural bases of …

Current State of The Art

● Current best-performing networks have somewhat more complicated architectures.

● GoogleNet for example:

Szegedy, Christian, et al. "Rethinking the inception architecture for computer vision." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016.