Top Banner
eech Classification Speech Lab Spring 2009 February 17, February 17, 09 09 1 Montgomery Montgomery College College Speech Classification Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences
20

Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Dec 17, 2015

Download

Documents

Kerry Harvey
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 0911

Montgomery Montgomery CollegeCollege

Speech ClassificationSpeech Classification

Uche O. AbanuloPhysics, Engineering

And Geosciences

Page 2: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 0922

Montgomery Montgomery CollegeCollege

Detailed ClassificationDetailed Classification

PhonemesPhonemesPhonemesPhonemes

VowelsVowelsVowelsVowels SemivowelsSemivowelsSemivowelsSemivowels ConsonantsConsonantsConsonantsConsonants TransitionalsTransitionalsTransitionalsTransitionals

GlidesGlidesLiquidsLiquids

FricativesFricativesNasalsNasals

PlosivesPlosivesWhispersWhispers

AffricatesAffricatesDiphthongsDiphthongs

Page 3: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 0933

Montgomery Montgomery CollegeCollege

Speech Production SystemSpeech Production System

Page 4: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 0944

Montgomery Montgomery CollegeCollege

VowelsVowels

• Source– Quasi-periodic puffs of airflow through the vocal folds vibrating at

a certain fundamental frequency, which is the pitch

• System– The formation of each vowel depends on the vocal tract

configuration, which depends on the following factors:

• The tongue – position and degree of constriction

• Lips – degree of openness, roundedness

• Jaws

• Examples– /a/ in father – tongue is raised at the front, low degree of

constriction of tongue against palate

– /i/ in eve - tongue is raised at the front, high degree of constriction of tongue against palate

Page 5: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 0955

Montgomery Montgomery CollegeCollege

VowelsVowels

Vowel Example

i eve

I it

e Hate

E Met

@ at

a Father

c all

o obey

U foot

u boot

A up

R bird

Page 6: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 0966

Montgomery Montgomery CollegeCollege

FricativesFricatives

• Source– Noise generated buy some turbulent airflow along the oral tract

• System– The formation of each fricative depends on

• If the vocal tract is vibrating or not (for voice and unvoiced fricatives)

• the tongue

• Lips

• Teeth

– There’s usually a voiced/unvoiced pair

• Examples– /z/ in Zebra – vocal folds vibrate along with noise production

– /s/ in sweet – No vibration along with noise production

Page 7: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 0977

Montgomery Montgomery CollegeCollege

VowelsVowels

Voiced Fricative Example Unvoiced Fricative

Example

v vote f for

D then T thin

Z azure S she

z zoo s see

Page 8: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 0988

Montgomery Montgomery CollegeCollege

NasalsNasals

• Source– Quasi-periodic airflow from vibrating vocal folds

• System– Air flows through the nasal cavity, sound is radiated at the

nostrils

– Sound is determined by the position of the tongue’s constriction

• Examples– /m/ in Man – tongue’s constriction is at the lips

– /n/ in No – tongue’s constriction is at the gum ridge

– /G/ in sing – tongue’s constriction is near the velum

Page 9: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 0999

Montgomery Montgomery CollegeCollege

PlosivesPlosives

• Source/System– Burst generated as a result of pressure build-up behind an oral

tract contriction

• Oral tract is initially closed (with vibration of vocal cords for voiced plosives and no vibration for unvoiced plosives)

• Release of air pressure and generation of turbulence over a very short duration

• Examples– /g/ in go – vibration occurs along with constriction

– /k/ in baker – no vibration occurs along with constriction

Page 10: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 091010

Montgomery Montgomery CollegeCollege

WhispersWhispers

• Source– Noise generated buy some turbulent airflow along the oral tract

• System– No constriction of oral tract but no vibration either

– aspiration

• Examples– /h/ in he

Page 11: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 091111

Montgomery Montgomery CollegeCollege

TransitionalsTransitionals

• Diphthongs– Vocal tract moves between two vowel sounds

– e.g /Y/ in hide; /W/ in out; /O/ in boy; /JU/ in new

• Affricates– Formed by the transition from plosive to fricative

– Example

• /ts/ in chew – transition from /t/ to /S/;

• /J/ in just – transition from /d/ to /Z/

Page 12: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 091212

Montgomery Montgomery CollegeCollege

Semi-vowelsSemi-vowels

• Voice-like in nature • Glides

– Dynamic and transitional

– Very similar to diphthongs

– Usually occur between two vowels or before a vowel

– More constriction of oral tract during transition than diphthongs

– Greater speed of oral tract movement

– e.g: /w/ in away and /y/ in you

• Liquids– Similar to glides except for the tongue constriction

– e.g. /l/ in liquid, /r/ in right

Page 13: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 091313

Montgomery Montgomery CollegeCollege

Why Speech ClassificationWhy Speech Classification

• Artificial speech production• Speech correction• Speech recognition

Page 14: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 091414

Montgomery Montgomery CollegeCollege

Generalized speech classificationGeneralized speech classification

• VoicedVoiced

Quasi-periodic excitationQuasi-periodic excitation

Modulation by vocal tractModulation by vocal tract

Production of vowels, voicedProduction of vowels, voiced fricatives & plosivesfricatives & plosives

• UnvoicedUnvoiced

No periodic vibration of vocal No periodic vibration of vocal chordschords

Noise-like natureNoise-like nature

Production of unvoiced Production of unvoiced fricatives and plosivesfricatives and plosives

Page 15: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 091515

Montgomery Montgomery CollegeCollege

Generalized speech classificationGeneralized speech classification

-5000

0

5000

10000

-5000

0

5000

10000-5000

0

5000

10000

Embedded Voiced Speech

-2000

0

2000

-2000

-1000

0

1000

2000-2000

-1000

0

1000

2000

Embedded Unvoiced Speech

Page 16: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 091616

Montgomery Montgomery CollegeCollege

Generalized speech classificationGeneralized speech classification

0 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1-1

-0.5

0

0.5

1

Time (seconds)

Am

plitu

deA Voiced Segment from a Male Speech Signal

0 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1-1

-0.5

0

0.5

1

Time (seconds)

Am

plitu

de

An Unvoiced Segment from a Male Speech Signal

Page 17: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 091717

Montgomery Montgomery CollegeCollege

Lab – simple observationLab – simple observation

1. Record yourself saying /S/ as in shout2. Read and plot your file3. Repeat 1 and 2 above with the

phoneme /a/ as in father.

Now let’s compare the two figures

Page 18: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 091818

Montgomery Montgomery CollegeCollege

Lab – more detailed Lab – more detailed

1. Record yourself saying shout

Now we’ll work together to create a voiced unvoiced detection system using only energy.

Page 19: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 091919

Montgomery Montgomery CollegeCollege

Group ProjectGroup Project

1. Record yourself saying supercalifragilisticexpialidocious

2. Manually label your recording as voiced/unvoiced3. Automatically label your recording using the speech

classification system we just created4. Compare your manual and automatic labels

• Figures will be awesome

5. Discuss all the different phonemes contained in the recorded word including

1. What kind of phoneme are they?2. How are they produced?3. Are they voiced or unvoiced?4. more???

6. One-two page report due next week.

Page 20: Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.

Speech Classification Speech Lab Spring 2009

February 17, 09February 17, 092020

Montgomery Montgomery CollegeCollege

ResearchResearch

1. Research on different ways for classifying speech into phonemes or as voiced/unvoiced other than energy.

2. Each group will give a 10 minute presentation at the next meeting