Top Banner
Proprietary and confidential. Do not distribute. using neon for object detection and localization Yinyin Liu, PhD March 3 2016
18

Object Detection and Recognition

Jan 22, 2018

Download

Technology

Intel Nervana
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Object Detection and Recognition

Proprietary and confidential. Do not distribute.

using neon for object detection and localization

Yinyin Liu, PhD March 3 2016

Page 2: Object Detection and Recognition

Outline

2

Intro to Deep Learning• From a user’s perspective, how to use neon to solve your

problem

• Use object localization problem as an example to

• understand and utilize neon architecture

• implement a new model

Page 3: Object Detection and Recognition

Object detection and localization

3

Page 4: Object Detection and Recognition

Object detection and localization

4

Page 5: Object Detection and Recognition

Fast region-based CNN (RCNN)

5

Fast R-CNN [Girshick 2015] http://arxiv.org/abs/1504.08083

• Pre-trained the ConvNets

• ROI pooling

• Branch architecture

• cost function to consider both

functions

Page 6: Object Detection and Recognition

neon architecture

6

dataset

model

Optimizer

Cost

callbacks

Layers/container

Metric

Architecture

Page 7: Object Detection and Recognition

Building fast R-CNN network

7

model

Optimizer

smooth L1 cost

callbacks

loading trained VGG layers

Branch architecture

Pooling

ROI pooling

Dropout

Affine

Conv

data iterator

PASCAL VOC

Object detection metric

• Input • Image • ROIs

• Target • class label • box regression • box regression mask

new components that were not in neon

Page 8: Object Detection and Recognition

PASCAL VOC in a dataset container

8

PASCAL VOC

• Input • Image • ROIs

• Target • class label • box regression • box regression mask

Page 9: Object Detection and Recognition

ROI pooling layer

9

• ROI pooling layer combines feature map from a

layer, and ROIs from the dataset

• Make a ROI pooling a container

• contains the ConvNet layers

• interface with dataset directly

Page 10: Object Detection and Recognition

ROI pooling layer

10

• Any new layer or container needs to:

• work as part of the model’s forward and

backward propagation process

• have fprop and bprop functions

• start from an python implementation

• write backend (GPU) support for speed

Page 11: Object Detection and Recognition

Cost and metric

11

• new type of cost need to be derived from Cost

class

• new type of metric need to be derived from

Metric class

Page 12: Object Detection and Recognition

Fast R-CNN model

12

Page 13: Object Detection and Recognition

Fast R-CNN model

13

Page 14: Object Detection and Recognition

Trained and tested in neon

14

Page 15: Object Detection and Recognition

Trained and tested in neon

15

Page 16: Object Detection and Recognition

Trained and tested in neon

16

Page 18: Object Detection and Recognition

\