Top Banner
LING 001 Introduction to LING 001 Introduction to Linguistics Linguistics Fall 2010 Fall 2010 Sound Structure I: Sound Structure I: Phonetics Phonetics Acoustic phonetics Jan. 27
24

Sound Structure I: Phonetics

Feb 25, 2016

Download

Documents

lot

Sound Structure I: Phonetics. Acoustic phonetics Jan. 27. Make vowel sounds from household materials. Buzzing sound + Resonators = Vowels. +. = [a]. = [i]. +. From Mark Huckvale: http://www.phon.ucl.ac.uk/home/mark/vowels. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Sound Structure I: Phonetics

LING 001 Introduction to LinguisticsLING 001 Introduction to LinguisticsFall 2010Fall 2010

Sound Structure I: Sound Structure I: PhoneticsPhonetics

Acoustic phonetics

Jan. 27

Page 2: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

2Make vowel sounds from household materials

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Buzzing sound + Resonators = Vowels

+

+

= [a]

= [i]

From Mark Huckvale: http://www.phon.ucl.ac.uk/home/mark/vowels

Page 3: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

3

The source in speech production

• The production of speech consists of two kinds of operations: (1) the generation of sound sources (the raw material), and (2) the processing/resonating of these sources by the vocal tract (the resonator).

• Vocal fold vibration generates buzzing sound, which is the raw material (source) for producing vowel sounds.

Page 4: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

4

The vibration of the vocal folds

• The aerodynamics of vocal fold vibration:With the glottis (the space between the vocal folds) closed and an airstream issuing from the lungs, the excess pressure below the glottis (sub-glottal pressure) forces the vocal folds apart; the air passing between the folds generates a Bernoulli force that quickly closes the glottis. The sub-glottal pressure builds up again, forcing the vocal folds apart again.

• The vibration of the vocal folds are periodic. The frequency of the vibration is determined by the air pressure in the lungs and by the vocal folds mechanical properties.

Page 5: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

5

Periodic and frequency

• Vocal fold vibration is a complex periodic motion, it can be decomposed into many Simple Harmonic Motions.

• Simple Harmonic Motion is the simplest periodic motion, in which a body oscillates about an equilibrium position (rest position) in a sinusoidal pattern. Tuning forks are such an example.

Page 6: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

6

Simple Harmonic Motion

• Properties of Simple Harmonic Motion:displacement: momentary distance from the rest pointcycle: one complete oscillationamplitude: maximum displacementfrequency: number of cycles per second (Hz)period: number of seconds per cycle

• What is the difference?

Page 7: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

7

Spectrum

• A complex periodic motion can be represented as a spectrum showing:• Frequency of each simple harmonic component on the x-

axis• Amplitude of each simple harmonic component on on the

y-axis

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Page 8: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

8The source spectrum in speech production

• As the vocal folds open and close, puffs of air flow through the glottal opening. The frequency of these pulses determines the fundamental frequency (F0) and contributes to the perceived pitch of the produced sound.

• Besides F0, the glottal air flow also contains a large number of harmonics (overtones), whose frequencies are integer multiples of F0. The amplitude of the harmonics decreases gradually.

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Page 9: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

9

Vocal tract is a resonator

• The vocal tract is a resonator. The resonance frequencies of the vocal tract, called formants, depend on its shape. The first resonance frequency is F1, the second resonance frequency is F2, and so on.

• The first two or three formants (their frequencies, not amplitudes) are most important in speech communication.

Page 10: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

10

Final product: vowels

• The presence of the formats disrupts the uniformly sloping envelop of the source spectrum, imposing peaks at the formant frequencies. Formant frequencies determine vowel quality (/i/ vs. /a/ vs. /u/ etc.)

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Vowel spectrum

Vocal tract resonance

Source spectrum

Page 11: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

11

Acoustic vowel space

Page 12: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

12

Vowel formants

• Vowel height is closely related to the first formant frequency:

low vowels -> higher F1

• Vowel frontness/backness is closely related to the second formant frequency:

front vowels -> higher F2

• Formant frequencies of American English vowels (average):

Male Female/i/ F1 270Hz F2 2290Hz F1 310Hz F2 2790Hz /a/ F1 730Hz F2 1090Hz F1 850Hz F2 1220Hz/u/ F1 300Hz F2 870Hz F1 370Hz F2 950Hz

Page 13: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

13

Measuring vowel formants

• Formants can be displayed by computing a spectrogram, showing the energy in the signal at different frequencies.

• Spectrograms are a plot of the intensity of the frequency content of a signal as time progresses. The y-axis is frequency, the x-axis is time, and the intensity is shown on a grey-scale.

• Praat: doing phonetics by computer. http://www.fon.hum.uva.nl/praat/

Page 14: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

14

The acoustics of consonants

• The sources of consonant sounds are either turbulent airflow (fricatives) or release of a compressed air behind a closure (stops). The sources are aperiodic, noises.

Note: White noise contains an equal distribution of energy spread over all frequencies, just as white light is composed of all of the colors of the spectrum of visible light.

• The resonator is the front cavity, i.e., the part of the oral cavity that is in front of the constriction place. The nasal cavity is used as a resonator in producing nasal sounds.

Page 15: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

15

Fricatives

Time (s)0 4.477

-1

1

0

Time (s)0 4.477

0

6000

F TH S SH

• Fricatives have relatively long durations of noise, and it is this lengthy interval of aperiodic energy that distinguishes fricatives as a sound class. [s, ʃ] have more energy than [f, θ].

Page 16: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

16

Stops

• Stops are dynamic, it has a series of ‘articulatory postures’: Gap: during closure, the only possible source is voicing (shown as a voice bar).

Burst: a transient noise produced when the oral closure is released

Aspiration: a diffuse noise generated at the larynx and possibly the lower pharynx. Its spectrum resembles that for the fricative [h].

• [tie], [dye], [style].

Time (s)0 2.963

0

5000

0

1000

2000

3000

4000

5000

Voice bar

GapAspiration

Burst

Page 17: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

17VOT

• To further distinguish voiced/voiceless and aspirated/unaspirated, we can use Voice Onset Time (VOT).

• VOT is the duration of the period of time between the release of a plosive/stop and the beginning of vocal fold vibration. This period is usually measured in milliseconds (ms).

• It is useful to distinguish at least three types of VOT which are shown in the schematic diagram below:

Page 18: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

18

Speech prosody

• Segments characterize speech sounds. Suprasegmental features (prosody) superimpose on segments or segmental sequences.

• Acoustic cues of prosody: Pitch, determined by fundamental frequency Length, determined by duration Loudness, determined by amplitude

• Stress, tone, intonation, rhythm, etc.

Page 19: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

19

Tones

• Tone languages use pitch to make lexical contrasts (to distinguish words).

• contour tones (predominating in Chinese and the languages of southeast Asia): The tones are made up of dynamic pitch changes.

• Register tone (predominating in African languages). The tones are made up of a few pitch registers: high, low, and maybe mid.

Page 20: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

20

Intonation in English

A: What types of foods are a good source of vitamins?B1: Legumes are a good source of vitamins.B2: Legumes are a good source of vitamins.

A: I’d like to fly to Davenport, Iowa on TWA.B: TWA doesn’t fly there ...

B1: They fly to Des Moines. B2: They fly to Des Moines.

A1: I met Mary and Elena’s mother at the mall yesterday.A2: I met Mary and Elena’s mother at the mall yesterday.

Page 21: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

21

Intonation in English

50

100

150

200

250

300

350

400

LEGUMES are a good source of vitamins

50

100

150

200

250

300

350

400

legumes are a good source of VITAMINS

Page 22: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

22

Intonation in Mandarin Chinese

• Interaction between tone and intonation

Page 23: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

23Rhythm

• Rhythm refers to the temporal organization of speech elements. It depends on many factors.

• At the segmental level, for example, vowels are longer before voiced stops than before voiceless ones. The effect is very large in English (bad vs. bat), but smaller in many other languages such as French.

• At the level of syllables, languages have different phonotactic constraints on syllable shape. For example, many dialects of Chinese allow only alveolar and velar nasals in the coda position.

Page 24: Sound Structure I: Phonetics

LING 001 Introduction to Linguistics, Fall 2010

24Rhythm

• In natural speech, over 70% of French and Spanish syllables were open (e . g ., CV and V) , while more than 50% of English syllables were closed (e . g ., CVC and VC).

• In Hawaiian there is never more than a single consonant between any two vowels, but in English there can be seven, as in texts spread. In Polish there are even more: