Top Banner
1 Automating Tactile Graphics Translation Computer Vision CSE 455 2009 Richard Ladner University of Washington
65

Automating Tactile Graphics Translation - University of Washington

Feb 03, 2022

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Automating Tactile Graphics Translation - University of Washington

1

Automating Tactile Graphics Translation

Computer VisionCSE 455

2009

Richard Ladner University of Washington

Page 2: Automating Tactile Graphics Translation - University of Washington

2

Blind Scientists and Engineers

Kent Cullers, Ph.D.

PhysicsCary SupaloGrad Student

Chemistry

Geerat Vermeij, Ph.D.Evolutionary Biologist

Page 3: Automating Tactile Graphics Translation - University of Washington

3

Blind Scientists and Engineers

Bill GerreyElectrical Engineering

Inventor

Imke Durre, Ph.D.

Atmospheric Science

William Skawinski

Professor, Chemistry

Page 4: Automating Tactile Graphics Translation - University of Washington

4

Blind Scientists and Engineers

TV Raman

Computer Science

Google

Victor Wong

EE Grad Student

H. David WohlersProfessor, Chemistry

Page 5: Automating Tactile Graphics Translation - University of Washington

5

Blind Scientists and Engineers

Chieko AsakawaComputer ScientistIBM

Hideji NagaokaComputer ScientistTsukuba U. of Tech

Katsuhito YamaguchiPhysicsNihon University

Page 6: Automating Tactile Graphics Translation - University of Washington

65

Sangyun Hahn

Ph.D. StudentCSE

Zach Lattin

Math Major

UWStudents

Page 7: Automating Tactile Graphics Translation - University of Washington

7

The Problem

text

math

graphics

Page 8: Automating Tactile Graphics Translation - University of Washington

8

Outline

• Tactual Perception

• Text

• Math

• Graphics

• Problems

• Thanks

• Demo

Page 9: Automating Tactile Graphics Translation - University of Washington

9

Tactile Perception

• Resolution of human fingertip: 25 dpi

• Tactual field of perception is no bigger than the size of the fingertips of two hands

• Color information is replaced by texture information

• Visual bandwidth is 1,000,000 bits per second, tactile is 100 bits per second

Page 10: Automating Tactile Graphics Translation - University of Washington

10

Braille

• System to read text by feeling raised dots on

paper (or on electronic displays). Invented in

1820s by Louis Braille, a French blind man.

a b c z

and the with mother

th ghch

Critical fact:Fixed height and width

Z 3 Mode characters: cap and num.

Page 11: Automating Tactile Graphics Translation - University of Washington

11

Tiger Embosser

• 20 dpi (raised dots per inch)

• 7 height levels (only 3 or 4 are distinguishable)

• Prints Braille text and

graphics

• Prints dot patterns for

texture

• Invented by a blind man,

John Gardner

Page 12: Automating Tactile Graphics Translation - University of Washington

12

Outline

• Tactual Perception

• Text

• Math

• Graphics

• Problems

• Thanks

• Demo

Page 13: Automating Tactile Graphics Translation - University of Washington

13

Text

Page 14: Automating Tactile Graphics Translation - University of Washington

14

Text Translation

The constraints do not define a region with any points in common in Quadrant I. When the constraints of a linear programming problem cannot be satisfied simultaneously, then infeasibility is said to occur. This may mean that the constraints have been formulated incorrectly, certain requirements need to be changed, or that additional resources are required before the problem can be solved.

,! 3/ra9ts d n def9e a region ) any po9ts 9 -mon 9 ,quadrant

,i4 ,:5 ! 3/ra9ts (a l9e> programm+ pro#m _c 2 satisfi$

simultane\sly1 !n 9f1sibil;y is sd 6o3ur4 ,? may m1n t !

3/ra9ts h be5 =mulat$ 9correctly1 c]ta9 require;ts ne$ 6be

*ang$1 or t a4i;nal res\rces >e requir$ 2f ! pro#m c 2

solv$4

Text Image

Text

Braille

Optical Character Recognition (OCR)

Braille Translation (Duxbury) Speech Synthesis (Jaws)

Speech

Page 15: Automating Tactile Graphics Translation - University of Washington

15

Outline

• Tactual Perception

• Text

• Math

• Graphics

• Problems

• Thanks

• Demo

Page 16: Automating Tactile Graphics Translation - University of Washington

16

Math

Page 17: Automating Tactile Graphics Translation - University of Washington

17

Math Translation

\begin{eqnarray*}P(0,0) = 396(0) + 270(0) = 0\\P(15,0) = 396(15) + 270(0) = 5940\\P(15,5) = 396(15) + 270(5) = 7290\\P(0,20) = 396(0) + 270(20) = 5400\end{eqnarray*}

;,p(0,0) .k #396(0) + #270(0) .k #0

;,p(15,0) .k #396(15) + #270(0) .k #5940

;,p(15,5) .k #396(15) + #270(5) .k #7290

;,p(0,20) .k #396(0) + #270(20) .k #5400

Math Image

Latex

Nemeth Code

Math OCR (Infty Reader)

Braille Translation (Duxbury)

Page 18: Automating Tactile Graphics Translation - University of Washington

18

Math Translation Examples

xx

i

i

−=∑

= 1

1

0

\sum_{i=0}^\infty x^i = \frac{1}{1-x}

.,s;i ;.k #0^,="x^i .k ?1/1-x#

\frac{-b \pm \sqrt{b^2 - 4ac}}{2a}

a

acbb

2

42 −±−

?-b+->b^2"-4ac]/2a#

Page 19: Automating Tactile Graphics Translation - University of Washington

19

Outline

• Tactual Perception

• Text

• Math

• Graphics

• Problems

• Thanks

• Demo

Page 20: Automating Tactile Graphics Translation - University of Washington

20

Graphics

Page 21: Automating Tactile Graphics Translation - University of Washington

21

Graphic Translation<LocationInformation>

<NumLabels>16</NumLabels><Resolution>100.000000</Resolution>

<ScaleX>1.923077</ScaleX><ScaleY>1.953125</ScaleY>

-<Label>

<x1>121</x1><y1>45</y1><x2>140</x2>

<y2>69</y2><Alignment>0</Alignment>

<Angle>3.141593</Angle>

</Label>

preprocesstext extract

clean

image

original

scannedimage

pure

graphic

text

image

location

file

Page 22: Automating Tactile Graphics Translation - University of Washington

22

Graphic Translation<LocationInformation>

<NumLabels>16</NumLabels><Resolution>100.000000</Resolution>

<ScaleX>1.923077</ScaleX><ScaleY>1.953125</ScaleY>

-<Label>

<x1>121</x1><y1>45</y1><x2>140</x2>

<y2>69</y2><Alignment>0</Alignment>

<Angle>3.141593</Angle>

</Label>

pure

graphic

text

image

location

file

y(0,20)x=1515105Ox510152020x+y=20(15,0)(15,5)

y

(#0,#20)

x.k#15

#15

#10

#5

O

x

#5

#10

#15

#20

#20

x+y.k#20

(#15,#0)

(#15,#5)

text Braille

Page 23: Automating Tactile Graphics Translation - University of Washington

23

Finding Text

• Why not just use standard optical character recognition (OCR)?

– OCR is not effective for graphical images.

ABBYY FineReader 7.0

Professional Edition

Page 24: Automating Tactile Graphics Translation - University of Washington

24

More OCR

ScanSoft OmniPage Pro 14.0

Page 25: Automating Tactile Graphics Translation - University of Washington

25

Find Text Letters

• Uses the following principles

– Text in an image is usually in one font

– Fonts are designed to have a uniform density

at a distance.

– In the absence of noise an individual letter

tends to be connected component of one

color. Exceptions are i and j.

• Use machine learning to determine which connected components are letters.

Page 26: Automating Tactile Graphics Translation - University of Washington

26

Features

Century Gothic

CW = width of bounding boxH = height of bounding boxA = area of bounding boxRi = i-th radial slice density

C

W

HA = W • H

Ri = number of blackpixels in i-th slice wherea slice is an angle of360/n. The total numberof slices is n.

0

1

3

2

Center is center ofmass of blackpixels4

5 6

7

Page 27: Automating Tactile Graphics Translation - University of Washington

27

Machine Learning

• Training:

– Sample the connected components and

compute their features.

– Use these features to train a Support Vector

Machine (SVM).

• Finding:

– For a new connected component compute its

features.

– Feed these features into the SVM.

Page 28: Automating Tactile Graphics Translation - University of Washington

28

Example

Trained on a different images from the same book.About 200 letters in the training set.

Page 29: Automating Tactile Graphics Translation - University of Washington

29

Find Text Blocks

Page 30: Automating Tactile Graphics Translation - University of Washington

30

Group characters logically

• Extracting a set of isolated characters from an image is insufficient

– Need groups of Braille characters for easier

placement

• Challenges

– Text can be at many angles

– Individual characters may be aligned along

multiple axes

Page 31: Automating Tactile Graphics Translation - University of Washington

31

Our approach

• Step 1: User provides training set

– Software examines defining features

• Step 2: Automatically find similar groups in remaining images

A. Minimum spanning tree

B. Discard useless edges

C. Discard inconsistent edges

D. Create merged groups

Page 32: Automating Tactile Graphics Translation - University of Washington

32

Minimum spanning tree (1)

Treat the centroid of each connected component as a node

Page 33: Automating Tactile Graphics Translation - University of Washington

33

Discard useless edges (2)

Page 34: Automating Tactile Graphics Translation - University of Washington

34

Discard inconsistent edges (3)

Page 35: Automating Tactile Graphics Translation - University of Washington

35

Final merge step (4)

Merge only if the resultant group is consistent

Page 36: Automating Tactile Graphics Translation - University of Washington

36

Image oftext boxes

OCR

Text

14.0

12.010.08.0 6.0 4.0 2.0 0Performancerelative to AMDElan SC520

AutomotiveOfficeTelecomm© 2003 Elsevier Science (USA). All rights reserved. AMD EIanSC520AMD K6-2E+IBM PowerPC 750CXNEC VR 5432NEC VR 4122

OCR on Text Image

Page 37: Automating Tactile Graphics Translation - University of Washington

37

Braille Placement

• Text boxes of Braille will be of different size than the original text boxes– Mode characters

– Contractions

– Braille is fixed width

Example

,example

Example

,example

Example

,example

Left justified Centered Right justified

Page 38: Automating Tactile Graphics Translation - University of Washington

38

Example Plane Sweep

3L

Page 39: Automating Tactile Graphics Translation - University of Washington

39

Example Plane Sweep

3L

Page 40: Automating Tactile Graphics Translation - University of Washington

40

Example Plane Sweep

4L

Page 41: Automating Tactile Graphics Translation - University of Washington

41

Example Plane Sweep

8R

Page 42: Automating Tactile Graphics Translation - University of Washington

42

Page 43: Automating Tactile Graphics Translation - University of Washington

43

Subtask Pattern

Batch Process

Individual Edit

• TGA batch process

• Photoshop and Illustrator scripts

• Omnipage batch manager

• Duxbury command line

Page 44: Automating Tactile Graphics Translation - University of Washington

44

Tactile Graphics Assistant

Batch Process

Individual Edit

TGA

Training Data

Page 45: Automating Tactile Graphics Translation - University of Washington

45

Available Books• Computer Architecture: A Quantitative Approach, 3rd EditionHennessy and Patterson 2002 Elsevier 25 minutes per figure • Advanced Mathematical Concepts, Precalculus with ApplicationsGordon-Holliday, et al. 1999 Glencoe/McGraw-Hill 6.3 minutes per figure• An Intoduction to Modern AstrophysicsCarroll and Ostlie1996 Addison-Wesley10.2 minutes per figure • Discrete Mathematical StructuresKolman, Busby and Ross 2003 Prentice Hall 8.8 minutes per figure

Page 46: Automating Tactile Graphics Translation - University of Washington

46

10.2min/fig6.3min/fig8.8min/fig

598num figs1080num figs467num figs

100.0%6075100.0%6765100.0%4124Total

3.5%21022.8%15458.5%350Workflow

30.4%184519.7%133518.7%770Illustrator

7.4%4509.3%6305.5%225Duxbury

10.9%66014.4%97519.4%800Photoshop

15.6%9459.8%66017.3%714Omnipage

9.6%5858.4%57014.4%595TGA

4.4%2705.8%3905.9%245Classification

18.3%11109.8%66010.3%425SetUp

MinMinMin

AstronomyPrecalculusDiscrete Math

Time per Figure

Ave 7.9 min/figure

Page 47: Automating Tactile Graphics Translation - University of Washington

47

Work Balance

0.00%

5.00%

10.00%

15.00%

20.00%

25.00%

Set

Up

Cla

ssifi

catio

n

TGA

Om

nipa

ge

Pho

tosh

op

Duxb

ury

Illust

rato

r

Work

flow

Page 48: Automating Tactile Graphics Translation - University of Washington

48

TGA Workflow

• Advantages– Much faster production

– Batch processing instead of one figure at a time

– Much tedious work is avoided

• Disadvantages– May be of lower quality than custom

translation

– A lot of technology needs to be mastered

Page 49: Automating Tactile Graphics Translation - University of Washington

49

One-offs vs. Mass Production

1916 WoodsDual Power

Model T1906 Reo

Page 50: Automating Tactile Graphics Translation - University of Washington

50

Outline

• Text

• Math

• Graphics

• Workflow

• Problems

• Thanks

• Demo

Page 51: Automating Tactile Graphics Translation - University of Washington

51

Problem solving

• Each book present a set of unique problems.

• We consider a few today

– Classification of figures

– Legends and colors

– Text at an angle

– Math in figures

– Grids

Page 52: Automating Tactile Graphics Translation - University of Washington

52

Clean area 83

Clean lines 648

Complex62

Grid clean15

Grid overlap113

No text41 Overlapped text

94Radial

53

Classes

Page 53: Automating Tactile Graphics Translation - University of Washington

53

Legends and Colors

• Legends may have to be enlarged.

• Colors may have to be replaced with textures.

Page 54: Automating Tactile Graphics Translation - University of Washington

54

Angled Text

• Braille should be printed horizontally.

Page 55: Automating Tactile Graphics Translation - University of Washington

55

Math – Infty Reader

Extracted Math Image

Page 56: Automating Tactile Graphics Translation - University of Washington

56

Grids

• Grids may not work well in tactile form.

Page 57: Automating Tactile Graphics Translation - University of Washington

57

TGA Technology

• Tactile Graphic Assistant

– C++

– Machine Learning (Support Vector Machine)

• Learns features of text from positive and negative

examples.

– Computational Geometry

• Text justification

– Free executable

– Licensable source code

Page 58: Automating Tactile Graphics Translation - University of Washington

58

Technologies in the Future

• Include Audio with Touchpads

• Digital Pen and Paper

• Electro-rheological fluid displays

Page 59: Automating Tactile Graphics Translation - University of Washington

59

Outline

• Text

• Math

• Graphics

• Workflow

• Problems

• Thanks

• Demo

Page 60: Automating Tactile Graphics Translation - University of Washington

60

20052005200420042004

CSE Undergraduate Students

2004

20082005 2005 2006 20082007

Page 61: Automating Tactile Graphics Translation - University of Washington

61

Current Undergraduate Student

Page 62: Automating Tactile Graphics Translation - University of Washington

62

CSE Graduate Students

Page 63: Automating Tactile Graphics Translation - University of Washington

63

Thanks To

• Dan Comden

• Sheryl Burgstahler

• Raj Rao

• Melody Ivory

• Ethan Katz-Basset

• Zach Lattin

• Stuart Olsen

• Many others

Page 64: Automating Tactile Graphics Translation - University of Washington

64

Thanks To

Page 65: Automating Tactile Graphics Translation - University of Washington

65

DEMO