SyRG 28 Oct 2004 reading J. Flanagan & K. Ishizaka, “Automatic Generation of Voiceless Excitation in a Vocal Cord-Vocal Tract Speech Synthesizer” SyRG, 28 Oct 2004 Kornel Laskowski

SyRG 28 Oct 2004

reading

J. Flanagan & K. Ishizaka, “Automatic Generation of Voiceless Excitation in a Vocal Cord-Vocal Tract Speech Synthesizer”

SyRG, 28 Oct 2004

Kornel Laskowski

(My) Motivations

• Where does this come from? Why?

• How can I use it?

Lots of (circa 1970) synthesis work opens with or culminates in diagrams like this one:

Outline

Starting Point: Hard Wall Tube Models

• Lumped-parameter modeling of the vocal tract

• Lossless tube model: consider mass and compliance of air only

• Lossy tube model: same as lossless tube model, but also consider viscous and thermal losses
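A minimal sketch of the lossless lumped-parameter idea, assuming the textbook acoustic-mass and acoustic-compliance formulas for a short hard-walled section (the slides do not give formulas). The constants and the function names `section_elements` and `uniform_tube_formants` are illustrative assumptions. A lossy version would add a series resistance (viscous loss) and a shunt conductance (thermal loss) to each section, which is not computed here.

```python
import math

# Assumed physical constants in CGS units (illustrative values, not from the slides).
RHO = 1.14e-3     # air density, g/cm^3
C_SOUND = 3.5e4   # speed of sound, cm/s


def section_elements(area_cm2, length_cm):
    """Return (acoustic mass, acoustic compliance) of one short lossless section."""
    mass = RHO * length_cm / area_cm2                          # inertia of the air plug
    compliance = area_cm2 * length_cm / (RHO * C_SOUND ** 2)   # springiness of the air volume
    return mass, compliance


def uniform_tube_formants(length_cm, n=3):
    """Quarter-wavelength resonances of a uniform lossless tube,
    closed at the glottis and open at the lips."""
    return [(2 * k - 1) * C_SOUND / (4.0 * length_cm) for k in range(1, n + 1)]


if __name__ == "__main__":
    mass, compliance = section_elements(area_cm2=5.0, length_cm=1.0)
    print("acoustic mass:", mass, "acoustic compliance:", compliance)
    print("formants of a 17.5 cm tube (Hz):", uniform_tube_formants(17.5))
```

Note that the product of mass and compliance for one section depends only on its length and the speed of sound, not on its area; the area sets the ratio between the two.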

Soft Wall Tube Models

• What if tube walls are not hard, but yielding?

• Some of the acoustic energy previously propagated to the next acoustic subsystem section is now transduced to a mechanical subsystem

• It’s either
  – Stored in the inertia/compliance of the vocal tract walls, or
  – Dissipated in some form of resistance of the vocal tract walls
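The storage/dissipation split can be sketched by treating the wall as a series mass, resistance, and stiffness, which is an assumption about the form of the mechanical subsystem. The numeric values below are hypothetical placeholders, not measurements from the paper, and `wall_impedance` is an illustrative name.

```python
import math


def wall_impedance(omega, mass, resistance, stiffness):
    """Mechanical impedance of a series mass-resistance-stiffness wall element.

    The real part is where energy is dissipated; the imaginary part
    (inertia minus stiffness term) is where energy is stored.
    """
    return complex(resistance, omega * mass - stiffness / omega)


# Hypothetical per-unit-area wall parameters (placeholders only).
WALL_MASS = 1.5
WALL_RESISTANCE = 1000.0
WALL_STIFFNESS = 3.0e5

if __name__ == "__main__":
    for freq_hz in (50.0, 200.0, 1000.0):
        z = wall_impedance(2 * math.pi * freq_hz, WALL_MASS, WALL_RESISTANCE, WALL_STIFFNESS)
        print(freq_hz, "Hz: dissipative part", z.real, "reactive part", z.imag)
```

At the wall's mechanical resonance the reactive part vanishes and only the dissipative part remains; well above it, the mass term dominates.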

Radiation from Tube Walls

• Vibrating walls are actually causing the air outside to vibrate too (unless in a vacuum)

• Energy is once again transduced from the mechanical subsystem to the outside

• Total sound radiated along the length of the entire vocal tract is the sum of this wall radiation from each subsystem section
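The summation in the last bullet can be sketched as a sample-by-sample sum over sections. This assumes each section's wall contribution is already available as a list of samples on a common time base, and it ignores any propagation delay or directivity between sections; `total_wall_radiation` is an illustrative name.

```python
def total_wall_radiation(section_signals):
    """Sum the wall-radiated signal of every tract section, sample by sample.

    section_signals: list of equal-length sample lists, one per section.
    """
    return [sum(samples) for samples in zip(*section_signals)]


if __name__ == "__main__":
    # Three sections, two samples each (made-up numbers).
    print(total_wall_radiation([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]))
```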

Voiceless Excitation

Putting It All Together

Example: Aspiration

• Demonstrates the ability of the model to produce, based on “articulatory controls”, naturally occurring aspiration following unvoiced stops

Example: Wall Vibration

• Demonstrates “voice bar”

• Even though mouth is closed, low frequencies associated with voiced stops are present

Coupling the Model with a Coker and Umeda machine

• It appears that the model could produce speech, provided articulatory controls are available

• Said machine performs an orthographic to articulatory mapping for speech synthesis

• Have never heard of this machine. Alan?

Frication in the Glottis

• /h/ is produced by also placing a copy of the fricative noise source in the glottal model

• Experiment: turn off the ability to generate turbulent excitation somewhere in the tract model, then look at the spectrogram
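The on/off experiment can be sketched with a noise source whose amplitude is gated by the Reynolds number at a constriction. The slides do not give the formula, so this is an assumption: the amplitude is taken to grow with the excess of the squared Reynolds number over a squared critical value, one formulation found in the articulatory synthesis literature. `RE_CRIT` and `GAIN` are hypothetical placeholders, and the function names are illustrative.

```python
import math

NU = 0.15          # kinematic viscosity of air, cm^2/s (approximate)
RE_CRIT = 1800.0   # critical Reynolds number (placeholder assumption)
GAIN = 1.0e-6      # arbitrary scale factor (hypothetical)


def reynolds_number(volume_velocity, area):
    """Reynolds number for flow through a constriction of the given area (CGS units)."""
    if area <= 0:
        return 0.0
    diameter = math.sqrt(4.0 * area / math.pi)     # effective diameter of the opening
    particle_velocity = abs(volume_velocity) / area
    return particle_velocity * diameter / NU


def noise_amplitude(volume_velocity, area, enabled=True):
    """Amplitude of the turbulent noise source at one location.

    Passing enabled=False mimics switching off turbulent excitation
    at that location, as in the experiment on the slide.
    """
    if not enabled:
        return 0.0
    re = reynolds_number(volume_velocity, area)
    return GAIN * max(0.0, re ** 2 - RE_CRIT ** 2)


if __name__ == "__main__":
    print("narrow constriction:", noise_amplitude(500.0, 0.1))
    print("same, source disabled:", noise_amplitude(500.0, 0.1, enabled=False))
    print("wide opening:", noise_amplitude(50.0, 2.0))
```

Applying the same function to the glottal opening as well as to tract constrictions corresponds to the duplicated noise source described for /h/.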

The End

Thanks!