Top Banner
Text detection in product images 10/26/2013 Naoki Chiba, Lead Scientist Rakuten Institute of Technology Rakuten Inc. http://rit.rakuten.co.jp/
36

[RakutenTechConf2013] [C4-1] Text detection in product images

Nov 07, 2014

Download

Technology

Rakuten, Inc

Rakuten Technology Conference 2013
"Text detection in product images"
Naoki Chiba (Rakuten)
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: [RakutenTechConf2013] [C4-1] Text detection in product images

Text detection in product images

10/26/2013

Naoki Chiba, Lead Scientist

Rakuten Institute of TechnologyRakuten Inc.http://rit.rakuten.co.jp/

Page 2: [RakutenTechConf2013] [C4-1] Text detection in product images

2

Product images

Sales pitches in images

Applications:• Content retrieval/filtering• Recognition• Translation

Page 3: [RakutenTechConf2013] [C4-1] Text detection in product images

3

RIT Text Detector

Far more accurate Works like magic

Page 4: [RakutenTechConf2013] [C4-1] Text detection in product images

4

Outline

1 Text detection overview

2 Current methods

3 RIT’s approach

Page 5: [RakutenTechConf2013] [C4-1] Text detection in product images

5

Outline

1 Text detection overview

2 Current methods

3 RIT’s approach

Page 6: [RakutenTechConf2013] [C4-1] Text detection in product images

6

Academic Research

Natural scene OCR ≠ traditional scanned OCRCamera capturedIllumination variationsPerspective distortionShort text

Source: ICDAR Text locating competition

Digital-born text Natural-scene text

Page 7: [RakutenTechConf2013] [C4-1] Text detection in product images

7

Product Images - Two Purposes

1. Sales pitches

2. Product list

Text’s role is different

Page 8: [RakutenTechConf2013] [C4-1] Text detection in product images

8

Product list

Sales pitch (Merchant’s names, Price, Shipping)

Page 9: [RakutenTechConf2013] [C4-1] Text detection in product images

9

“Now Printing” images

Showing image unavailability, but..

NotUpdated

Page 10: [RakutenTechConf2013] [C4-1] Text detection in product images

10

Text detection for product images

More accurate

Much Faster

Page 11: [RakutenTechConf2013] [C4-1] Text detection in product images

11

Outline

1 Text detection overview

2 Current methods

3 RIT’s approach

Page 12: [RakutenTechConf2013] [C4-1] Text detection in product images

12

Current methods

1. Texture based (Classifier-based)2. Region based (Connected components)3. Hybrids

Page 13: [RakutenTechConf2013] [C4-1] Text detection in product images

13

1. Texture-based method

Special texture ScanClassifier (SVM, AdaBoost or Neural network)

Problems:

• Scale/Rotation variant

• High computation

Page 14: [RakutenTechConf2013] [C4-1] Text detection in product images

14

2. Region-based method

Local features (edges or color clustering)

Connected component analysisText lines and word separation

Problem:

• False candidates

Output of Stroke width transform

Page 15: [RakutenTechConf2013] [C4-1] Text detection in product images

15

3. Hybrid method

Region based Edge (Stroke Width Transform) Color clustering

B

Classifier SVM Random Forrest

AdaBoost

Page 16: [RakutenTechConf2013] [C4-1] Text detection in product images

16

Problems

1. Character/word annotationTime-consuming task

2. Transparent textHard to detect

Page 17: [RakutenTechConf2013] [C4-1] Text detection in product images

17

Problem 1: Character/word annotation

Time consuming for many images

Page 18: [RakutenTechConf2013] [C4-1] Text detection in product images

18

Problem 2: Transparent text

?• Weak edges (difficult to detect)

Page 19: [RakutenTechConf2013] [C4-1] Text detection in product images

19

Outline

1 Text detection overview

2 Current methods

3 RIT’s approach

Page 20: [RakutenTechConf2013] [C4-1] Text detection in product images

20

RIT’s Approach

1. Character/word annotationTime-consuming task

Text image classifier using image-wise annotation

2. Transparent textHard to detect

Transparent text detection and background recovery

Page 21: [RakutenTechConf2013] [C4-1] Text detection in product images

21

1. Text image classifier using image-wise annotation

• Text image detection (not char/word)– Image-wise annotation (less time)– Clustering detected regions

(measure text likeliness)

Page 22: [RakutenTechConf2013] [C4-1] Text detection in product images

22

Image-wise Annotation

Draw rectangles

送料無料

Image-wiseClassify text/non-text

text non-text

Character-wise

Page 23: [RakutenTechConf2013] [C4-1] Text detection in product images

23

Clustering detected regions

f1

f2

C1

C2

C3

x

x

xx

x

Region in text imagesRegion in non-text images

x Cluster center

C 4

C 5

P(C1) = 3/4

P(C4) = 0/3

Page 24: [RakutenTechConf2013] [C4-1] Text detection in product images

24

Comparison

• Rakuten 500 images• Compared w/a traditional region-based method

Current Proposed0.0%

10.0%

20.0%

30.0%

40.0%

50.0%

60.0%

70.0%

80.0%

90.0%

Accuracy

Better than a typical method

Page 25: [RakutenTechConf2013] [C4-1] Text detection in product images

25

RIT’s Approach

1. Character/word annotationTime-consuming task

Text image classifier using image-wise annotation

2. Transparent textHard to detect

Transparent text detection and background recovery

Page 26: [RakutenTechConf2013] [C4-1] Text detection in product images

26

2. Transparent text detection and background recovery

• Edge Detection with adaptive threshold– Image content analysis

• Background recovery– Text color/opacity estimation

Page 27: [RakutenTechConf2013] [C4-1] Text detection in product images

27

Edge detection with adaptive thresholds

Less noise

Weak edges are better preserved

Page 28: [RakutenTechConf2013] [C4-1] Text detection in product images

28

Texture strength

Measuring image complexity

Direction and energy: eigenvectors and eigenvalues[1]

Image patches:

Texture strength:

[1] Xiang Zhu and Peyman Milanfar, “Automatic parameter selection for denoising algorithms using a no-reference measure of image content,” IEEE transactions on image processing, pp. 3116–32, 2010.

Page 29: [RakutenTechConf2013] [C4-1] Text detection in product images

29

Proposed text detection

1. Texture based (Classifier based)

SVM/Random Forest/AdaBoost2. Region based (Connected components)

Edge/Color Clustering3. Hybrids

Region (Edge Stroke Width) + Texture (AdaBoost)

Page 30: [RakutenTechConf2013] [C4-1] Text detection in product images

30

System flow

Components Analysis

Detected text

Stroke width transform and Connected componentInput image Adaptive Edge

detection

Page 31: [RakutenTechConf2013] [C4-1] Text detection in product images

31

Detection result

(a) constant threshold (b) proposed

Page 32: [RakutenTechConf2013] [C4-1] Text detection in product images

32

System flow

Components Analysis

Detected text

Stroke width transform and Connected componentInput image

Backgroundrecovery

Adaptive Edge detection

Page 33: [RakutenTechConf2013] [C4-1] Text detection in product images

33

Transparent Text

T I: observed pixel value

O: original pixel value

I

O

• 2 >= equations• Least squares solution• 2 unknown

text coloropacity

Page 34: [RakutenTechConf2013] [C4-1] Text detection in product images

34

Extraction result

(b) recovered(a) original

Page 35: [RakutenTechConf2013] [C4-1] Text detection in product images

35

Comparison with InPainting

Original

InPainting Rakuten

Magic

Patented!

Page 36: [RakutenTechConf2013] [C4-1] Text detection in product images

36

Thank you!

Details: ACPR 2013