Top Banner
Handwritten Character Recognition by Alternately Trained Relaxation Convolutional Neural Network Chunpeng Wu, Wei Fan, Yuan He, Jun Sun, Satoshi Naoi Fujitsu R&D Center, Co., Ltd. Sep 1st, 2014 Copyright 2014 FUJITSU R&D CENTER CO., LTD.
16

Handwritten Character Recognition by Alternately Trained ...

Feb 13, 2017

Download

Documents

trinhnhu
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Handwritten Character Recognition by Alternately Trained ...

Handwritten Character Recognition by

Alternately Trained Relaxation

Convolutional Neural Network

Chunpeng Wu, Wei Fan, Yuan He, Jun Sun, Satoshi Naoi

Fujitsu R&D Center, Co., Ltd.

Sep 1st, 2014

Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Page 2: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Outline

Introduction to Convolutional Neural Network (CNN)

Proposed Method

R-CNN: Relaxation CNN

ATR-CNN: Alternately Trained R-CNN

Experiments

Handwriting Digits - MNIST

Handwriting Chinese - ICDAR’13 Competition Dataset

Conclusions

1 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Page 3: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Traditional Handwriting Recognition Methods

Handcrafted features + Classifiers

Recent Deep Convolutional Neural Networks (CNN)

Learned features + Classifiers

Introduction

2 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Page 4: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Introduction

Success of CNN relies on

High performance computing (GPUs)

Flexible structure of neural networks

Availability of larger datasets

Effective learning algorithms

Challenges of CNN Based Methods

Slow convergence

• CNN structure vs the scale of training dataset

Over-fitting

• Typical stochastic regularizing techniques

• Dropout

• Drop-connect

• Make spatial-pooling a stochastic process

3 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Page 5: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Proposed Method

R-CNN: Relaxation CNN

Neurons within a feature map do not share the same kernel

Endow CNN with more expressive power

ATR-CNN: Alternately Trained R-CNN

Randomly stop one layer from learning at one epoch

Regularize R-CNN

4 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Page 6: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Proposed Method

R-CNN

Enhance the learning ability of CNN

5 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

CNN:

Neurons n1 and n2 share

the same weight matrix

W1 (or W2)

R-CNN:

Neurons n1 and n2 use

different weight matrices

W1 and W2

Page 7: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Proposed Method

ATR-CNN

Randomly fix a learning rate to zero at one epoch

Regularization

6 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Page 8: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Proposed Method

ATR-CNN

7 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Each layer has a

learning rate ηi

Randomly fix a ηi to

zero at one epoch

Revert ηi to its original

value after this epoch

Page 9: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Experiments – Handwriting Digits

MNIST (Training: 60000 Testing: 10000)

Our ATR-CNN

In-32Conv5-32MaxP2-64Conv3-64MaxP2-64RX3-64RX3-Out

NVIDIA GTX 690, 64GB RAM

8 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Page 10: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Experiments – Handwriting Digits

MNIST

Misclassified samples (ground-truth -> prediction)

9 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Page 11: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Experiments – Handwriting Chinese

Testing Set

ICDAR’13 Competition Dataset (224,419 samples, 3755 classes)

Our ATR-CNN In-64Conv5-64MaxP2-128Conv3-128MaxP2-128RX3-128MaxP2-256RX3-256Full1-Out

Narrow the gap between machine and human

10 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Page 12: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Experiments – Handwriting Chinese

Misclassified Samples

Top 10 errors

Ground-truth -> Prediction

Difficulties

Cursive writing

Touching strokes

Confusion in shapes

11 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Page 13: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Experiments – Handwriting Chinese

Contributions

Relaxation (Blue curve), Alternate Training (Red curve)

Both contribute to the improvement of recognition accuracy

12 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Page 14: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Conclusions

R-CNN

Neurons within a feature map do not share the same kernel

Endow CNN with more expressive power

ATR-CNN

Randomly stop one layer from learning at one epoch

Regularize R-CNN

Experiments

Both contribute to the improvement of recognition accuracy

13 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Page 15: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY

Questions?

14 Copyright 2014 FUJITSU R&D CENTER CO., LTD.

Page 16: Handwritten Character Recognition by Alternately Trained ...

INTERNAL USE ONLYINTERNAL USE ONLY 15 Copyright 2014 FUJITSU R&D CENTER CO., LTD.