Slide 2 CNBH, PDN, University of Cambridge Roy Patterson Centre
for the Neural Basis of Hearing Department of Physiology,
Development and Neuroscience University of Cambridge Part II: Lent
Term 2015: ( 2 of 4) email [email protected] Central Auditory
Processing http://www.pdn.cam.ac.uk/groups/cnbh/teaching/lectures/
Lecture slides, sounds and background papers on Lecture slides on
CamTools https://camtools.cam.ac.uk/portal.html Slide 3 CNBH, PDN,
University of Cambridge Act III: the processing of communication
sounds in the auditory system (signal processing) Act I: the
information in communication sounds (animal calls, speech, musical
notes) Contents/Progress Act IV: the processing of communication
sounds (anatomy, physiology, brain imaging) Act II: the perception
of communication sounds and the robustness of perception to changes
in acoustic scale Slide 4 CNBH, PDN, University of Cambridge
Decreasing VTL Increasing GPR Kawahara and Irino (2004). Principles
of speech manipulation system STRAIGHT. In Speech separation by
humans and machines, P. Divenyi (Ed.), Kluwer Academic, 167-179. (
S s ) (1/S f ) Auditory perception is robust to changes in S s and
S f Slide 5 CNBH, PDN, University of Cambridge Low High Pitch Long
Short VTL Time [Patterson, Smith, van Dinther and Walters (2008)]
(S f ) ( S s ) Slide 6 CNBH, PDN, University of Cambridge Spectra
on a linear frequency axis Low High Pitch Long Short VTL (S f ) ( S
s ) Slide 7 CNBH, PDN, University of Cambridge Recognition of
Scaled Vowels Smith, Patterson, Turner, Kawahara and Irino JASA
(2005) /a/ /u/ /i/ /e/ /o/
pdn.cam.ac.uk/groups/cnbh/teaching/lectures/SPTKIjasa05.pdf mean
Slide 8 CNBH, PDN, University of Cambridge I: Humans can extract
the content of the communication without being confused by size
differences Psychophysical experiments confirm: The Perception of
Communication Sounds: Summary
http://www.pdn.cam.ac.uk/groups/cnbh/teaching/lectures/SPTKIjasa05.pdf
Slide 9 CNBH, PDN, University of Cambridge Speaker Size estimates
for vowels varying in GPR and VTL log(VTL) log(GPR) Size Smith and
Patterson (2005) JASA
pdn.cam.ac.uk/groups/cnbh/teaching/lectures/SPjasa05.pdf Slide 10
CNBH, PDN, University of Cambridge II: Humans can extract the size
information without being confused by differences in the content I:
Humans can extract the content of the communication without being
confused by size differences Psychophysical experiments also
confirm: The Perception of Communication Sounds: Summary
http://www.pdn.cam.ac.uk/groups/cnbh/teaching/lectures/SPjasa05.pdf
http://www.pdn.cam.ac.uk/groups/cnbh/teaching/lectures/SPTKIjasa05.pdf
Slide 11 CNBH, PDN, University of Cambridge S f : the
discrimination of S f is more tricky S s : the semitones on the
keyboard differ by 5.9% Discrimination of S s and S f Experiments
with vowels show that people can reliably discriminate a 2%
difference in S s
http://www.pdn.cam.ac.uk/groups/cnbh/teaching/lectures/ISPjasa05.pdf
Present two vowels and ask: Which vowel came from the larger
speaker? Slide 12 CNBH, PDN, University of Cambridge waveform and
spectrum of a childs /a/ Frequency on a logarithmic axis (octaves)
SsSs SfSf Slide 13 CNBH, PDN, University of Cambridge Syllable
database ma na la ra wa ya ba da ga pa ta ka sa fa va za xa ha me
ne le re we ye be de ge pe te ke se fe ve ze xe he mi ni li ri wi
yi bi di gi pi ti ki si fi vi zi xi hi mo no lo ro wo yo bo do go
po to ko so fo vo zo xo ho mu nu lu ru wu yu bu du gu pu tu ku su
fu vu zu xu hu am an al ar aw ay ab ad ag ap at ak as af av az ax
ah em en el er ew ey eb ed eg ep et ek es ef ev ez ex eh im in il
ir iw iy ib id ig ip it ik is if iv iz ix ih om on ol or ow oy ob
od og op ot ok os of ov oz ox oh um un ul ur uw uy ub ud ug up ut
uk us uf uv uz ux uh aa ee ii oo uu CVs VCs vowels Sonorants
StopsFricatives mi en ka it so us Ives, Smith and Patterson (2005)
JASA Kawahara and Irino (2004). The vocoder STRAIGHT. Kluwer
Academic large (voiced) small (voiced) Slide 14 CNBH, PDN,
University of Cambridge Speaker-size discrimination with syllables
interval 1 interval 2 pitch /wa/ /et/ /am/ /ku/ /ma/ /te//om/ /se/
Present two intervals of syllables and ask: Which is the larger
speaker? VTL = xVTL = x + x The only consistent cue is a difference
in VTL ( S f ) The overall level is varied randomly between the
intervals. The pitch contours are different. The syllables are
randomly chosen for the intervals. Ives, Smith and Patterson (2005)
JASA
http://www.pdn.cam.ac.uk/groups/cnbh/teaching/lectures/ISPjasa05.pdf
Slide 15 CNBH, PDN, University of Cambridge Experiment: S f
discrimination thresholds for five different people Glottal pulse
rate / Hz 80 160 320 SER 0.92 1.22 1.65 small male large male small
childdwarf counter tenor Ives, Smith and Patterson, JASA (2005) ( S
s ) VTL/cm 14 10 19 SfSf Slide 16 CNBH, PDN, University of
Cambridge Results: all subjects, all syllables DWARF SMALL CHILD
SMALL MALE LARGE MALE COUNTER TENOR Trials test as smaller Ives,
Smith and Patterson (2005) JASA % reported larger Slide 17 CNBH,
PDN, University of Cambridge Results: all subjects, all syllables
DWARF SMALL CHILD SMALL MALE LARGE MALE COUNTER TENOR grand average
JND for the experiment average JND across syllable category for
specific speaker type. basically 5%, independent of acoustic scale
http://www.pdn.cam.ac.uk/groups/cnbh/teaching/lectures/ISPjasa05.pdf
Slide 18 CNBH, PDN, University of Cambridge Concurrent Speech and
the cocktail party Colin Cherry (1952) Slide 19 CNBH, PDN,
University of Cambridge Syllable database ma na la ra wa ya ba da
ga pa ta ka sa fa va za xa ha me ne le re we ye be de ge pe te ke
se fe ve ze xe he mi ni li ri wi yi bi di gi pi ti ki si fi vi zi
xi hi mo no lo ro wo yo bo do go po to ko so fo vo zo xo ho mu nu
lu ru wu yu bu du gu pu tu ku su fu vu zu xu hu am an al ar aw ay
ab ad ag ap at ak as af av az ax ah em en el er ew ey eb ed eg ep
et ek es ef ev ez ex eh im in il ir iw iy ib id ig ip it ik is if
iv iz ix ih om on ol or ow oy ob od og op ot ok os of ov oz ox oh
um un ul ur uw uy ub ud ug up ut uk us uf uv uz ux uh aa ee ii oo
uu CVs VCs vowels Sonorants StopsFricatives Slide 20 CNBH, PDN,
University of Cambridge Slide 21 concurrent-speech: experimental
paradigm Identify the syllable in the interval that stays lit wu
osh Vestergaard et al (2009) JASA Slide 22 CNBH, PDN, University of
Cambridge Sonorants (semivowels)Stops (plosives)Fricatives de osh
ez ki lu Target triplet: de mi osh Masker triplet: ki lu ez
Concurrently at 0 dB SNR Pre-cursor, 0 dB SNR Concurrent-speech
paradigm Vestergaard et al (2009) JASA mi
600400200060040020006004002000 Target Distracter ms Slide 23 CNBH,
PDN, University of Cambridge Vestergaard, Fyson and Patterson,
JASA, 2009 SfSf SsSs Slide 24 CNBH, PDN, University of Cambridge
II: Humans can extract the size information without being confused
by the content of the communication III: Auditory perception is
amazingly robust to changes in acoustic scale (S s and/or S f ) in
communication sounds I: Humans can extract the content of the
communication without being confused by the size information
Psychophysical experiments confirm: The acoustic scale values in
communication sounds tell us which individual, within a population,
is speaking or which instrument, within a family, is playing The
Perception of Communication Sounds: Summary Slide 25 CNBH, PDN,
University of Cambridge End of Act II Thank you Smith, D. R. R. and
Patterson, R. D. (2005). "The interaction of glottal-pulse rate and
vocal-tract length in judgements of speaker size, sex and age," J.
Acoust. Soc. Am. 118,3177-3186. Ives, D. T., Smith, D. R. R. and
Patterson, R. D. (2005). "Discrimination of speaker size from
syllable phrases," J. Acoust. Soc. Am. 118 (6), 3816-3822. Smith,
D. R. R., Patterson, R. D., Turner, R., Kawahara, H., and Irino, T.
(2005). "The processing and perception of size information in
speech sounds," J. Acoust. Soc. Am.117, 305-318.
http://www.pdn.cam.ac.uk/groups/cnbh/teaching/lectures/SPTKIjasa05.pdf
http://www.pdn.cam.ac.uk/groups/cnbh/teaching/lectures/SPjasa05.pdf
http://www.pdn.cam.ac.uk/groups/cnbh/teaching/lectures/ISPjasa05.pdf