Top Banner
0
32

Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

Jul 15, 2018

Download

Documents

doanthuan
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

0

Page 2: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

1

Sign Language Recognition and Translation Based on Kinect

Xilin Chen

Institute of Computing Technology, Chinese Academy of Sciences

Page 3: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

2

Acknowledgement

This is a joint work with Guang Li, Yushun Lin, Zhihao Xu, Yili Tang, Jialu Zhu, Xiujuan

Chai from ICT, CAS

Hanjing Li from Beijing Union University

Xin Tong, Zhuowen Tu, Jian Sun, Ning Xu, Guobin Wu, Ming Zhou from MSRA

Zhengyou Zhang from MSR

Thanks for those students who make big contribution on data collection from BUU, especially thanks for Hui Liu , and Dandan Yin

Page 4: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

3

Disabled People in China

Unit: 10K

Source: 2nd census of disabled people in China, 2006

Health, 122556, 93.7%

Physical Dis., 2412, 1.8%

Mental handicapped, 554, 0.4%

Mental Retardation, 614, 0.5%

Multiple Dis., 1352, 1.0%

Vis. Impaired, 1233, 0.9%

Hearing Impaired, 2004,

1.5%

Speech Disorder, 127, 0.1%

Dis., 8296, 6.3%

Page 5: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

4

Sign Language

100 million people use sign language in China and 200 million people in the world

Sign language is recognized as a natural language in many countries

Language barrier between deaf-mute and health people Human sign language translator is a hot job

Automatic sign language translator Automatic sign language recognition and generation

Page 6: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

5

Alphabets in American / Chinese SL

Page 7: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

6

Some words in ASL / CSL

好 (Good) 来(Come) 请(Please)

是 (be/is/are/was/were)

能(Can) 不(No)你(You) 我(Me)

ASL

CSL

Be

Are

Was

Page 8: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

7

Challenges in SL Translation

A large vocabulary set for recognition 5000+ words in Chinese Sign Language

Page 9: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

8

Challenges in SL Translation

A large vocabulary set for recognition

Motion and posture in different scale Some words with only one posture

Some words only with fingers motion, e.g. 谢谢(thanks)

Some words with significant hand / arm motion, e.g. 大家(everyone)

谢谢(thanks)五(Five) 大家(everyone)

Page 10: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

9

Challenges in SL Translation

A large vocabulary set for recognition

Motion and posture in different scale

Vocabulary set is relatively smaller than spoken language Thousands words vs. 100+ thousands ones

Many to one mapping Sit / Chair same gesture

Page 11: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

10

Challenges in SL Translation

A large vocabulary set for recognition

Motion and posture in different scale

Vocabulary set is relatively smaller than spoken language

Grammar is different English: I like to fly small planes.

Sign: SMALL PLANES — FLY — LIKE ME

Page 12: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

11

Lessons from Previous Works

SL recognition with video camera Only works on a small vocabulary set

Segmentation is a big challenge

Sensitive to lighting change

Page 13: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

12

Lessons from Previous Works

SL recognition with video camera

Data-glove based sign language recognition Input: Data-glove + Location Sensor

Recognition Model: HMM

Merits Stable Input

Supportable to large vocabulary set (5000+ words)

Page 14: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

13

CSL Recognition with Data-glove

Page 15: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

14

Lessons from Previous Works

SL recognition with video camera

Data-glove based sign language recognition Input: Data-glove + Location Sensor Recognition Model: HMM Merits

Stable Input

Supportable to large vocabulary set (5000+ words)

Demerits Too expensive

Extra accessories

Easy damaged

Page 16: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

15

Kinect – an opportunity for SL Recognition

Depth provides additional robust information Body segmentation / tracking

Balance between data-glove and pure visual camera Cost

Robustness

Understandable to raw data

Shotton et al. CVPR11

Page 17: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

16

Page 18: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

17

An Example from Kinect

Page 19: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

18

Basic Idea

SL = Hand Motion + (Face expression)

Hand Motion = Trajectory + Key postures

Basic idea from SL dictionary Postures + a few trajectories

Page 20: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

19

Basic Idea

SL = Hand Motion + (Face expression)

Hand Motion = Trajectory + Key postures

Basic idea from SL dictionary Postures + a few trajectories

水果 (Fruit)

Postures are basic

elements in SLSome clips of the

trajectory are essential

elements in SL

Even some clips aren’t

essential elements in SL,

they still encode

important context

Page 21: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

20

Recognizing SL from trajectory

Basic task 𝐷 = 𝑓 (𝑐1, 𝑐2), where 𝑐1 and 𝑐2 are two curves in 3D

space

Manifolds matching and distance measuring

People play SL in different cases Speed (duration) to play a sign

Height of the signer

Slightly different in pose

Page 22: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

21

Alignment of Trajectories

A essential step to deal with various distortions Speed (duration) to play a sign Height of the signer Slightly different in pose

Noise remove to improve robustness

Trajectory interpolation Improve the performance on different speed

Trajectory length normalization Improve the performance between different signers

(height)

Calculation principle direction Independent with pose

Page 23: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

22

Examples of Aligned Trajectories

Black line: principle direction of blue curve

Red line: principle direction of green curve

On purpose (故意)Everyone(大家)

*All trajectories above from right hand

Page 24: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

23

Matching Same Word Trajectories

On purpose(故意) (d=212)Reserve(保留) (d=400)

Reach(到) (d= 162)Everyone (大家) (d = 561)

Page 25: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

24

Matching Different Word TrajectoriesEveryone(blue) Reach (Blue)

On Purpose

(Green)

d=1,079 d=380

Reserve

(Green)

d= 41,149 d=40,508

Page 26: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

25

Trajectory-based Recognition Result

rank count rate

1 180 75.3%

5 225 94.1%

10 232 97.1%

20 235 98.3%

50 237 99.2%

Vocabulary set size: 239

Page 27: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

26

Posture Recognition

Key posture detection

Key posture recognition

Page 28: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

27

Posture Recognition

Key posture detection Intersection-union ratio

Page 29: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

28

Posture Recognition

Key posture detection Intersection-union ratio

Key posture recognition PCA used for orientation normalization

Normalize hand size to 64*64

HOG feature block size(8*8)

cell size(8*8)

9 bins

LDA use for recognition

Page 30: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

29

Demo

Page 31: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

30

Thank you!

Page 32: Sign Language Recognition and file1 Sign Language Recognition and Translation Based on Kinect Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

31