Top Banner
What Audio Engineers Should Know About Human Sound Perception Part 2. Binaural Effects and Spatial Hearing AES 112 th Convention, Munich AES 113 th Convention, Los Angeles Durand R. Begault Human Factors Research & Technology Division NASA Ames Research Center Moffett Field, California
39

What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Jul 21, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

What Audio Engineers Should Know About Human Sound Perception

Part 2. Binaural Effects and Spatial Hearing

AES 112th Convention, MunichAES 113th Convention, Los Angeles

Durand R. Begault

Human Factors Research & Technology DivisionNASA Ames Research CenterMoffett Field, California

Page 2: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Overview

• ILD, ITD differences and lateralization

• HRTF spectral changes for 3D imagery

• Binaural versus monaural influence of echoes

• Effects of reverberation on perception of the environmental context

• Cues to auditory distance

• Cognitive and multisensory cues

Page 3: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Sound source(s),interaction withroom acoustics

SOURCE MEDIUM RECEIVER

Communication chain for acoustic events

FrequencyAmplitudeSpectrumLocation

Page 4: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Sound source(s),interaction withroom acoustics

SOURCE MEDIUM RECEIVER

Communication chain for acoustic events

FrequencyAmplitudeSpectrumLocation

Recording & playback: acoustical-electrical- acoustical transformation

Page 5: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Sound source(s),interaction withroom acoustics

SOURCE MEDIUM RECEIVER

Communication chain for acoustic events

FrequencyAmplitudeSpectrumLocation

Recording & playback: acoustical-electrical- acoustical transformation

Hearing: perception, cognition, multi-sensory interaction

PitchLoudnessTimbreLocalization

Page 6: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Sound source(s),interaction withroom acoustics

SOURCE MEDIUM RECEIVER

Communication chain for acoustic events

Recording & playback: acoustical-electrical- acoustical transformation

Hearing: perception, cognition, multi-sensory interaction

Mismatch between prescribed & perceived spatial events

Page 7: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Model of the binaural hearing systemA

cous

tic s

igna

l-driv

en

Binaural hearing (localization; signal separation & detection):

forming spatial auditory events from acoustical (bottom-up) and psychological (top-down) inputs

Psychologically-driven

Figure adapted from JensBlauert, “Spatial Hearing.The Pychophysics of Human Sound Localization.Revised Edition. 1983, MIT Press.

Page 8: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Model of the binaural hearing system

Filtering of acoustic signalby pinnae, ear canal

Binaural hearing (localization; signal separation & detection)

Page 9: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Model of the binaural hearing system

Filtering of acoustic signalby pinnae, ear canal

Filtering by inner ear; frequency-specific neuronfirings

Binaural hearing (localization; signal separation & detection)

Page 10: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Model of the binaural hearing system

Filtering of acoustic signalby pinnae, ear canal

Filtering by inner ear; frequency-specific neuronfirings

Physiological evaluationof interaural timing andlevel differences

Binaural hearing (localization; signal separation & detection)

Page 11: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Model of the binaural hearing system

Filtering of acoustic signalby pinnae, ear canal.

Filtering by inner ear; frequency-specific neuronfirings

Physiological evaluationof interaural timing andlevel differences

Aco

ustic

sig

nal-d

riven

Binaural hearing (localization; signal separation & detection)

Multi-sensory information; cognition

Aco

ustic

sig

nal-d

riven

Psychologically-driven

Page 12: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Two important functions of the binaural hearing system

for recording engineers:

• Localization

(lateral and 3-dimensional)

• Binaural masking:

Echo supression, room perception

Page 13: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

• ILD (interaural level difference)• ITD (interaural time difference)

“Duplex” theory of localization

Lateral localization of auditory images

Page 14: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

• ILD (interaural level difference) caused by head shadow of wavelengths > 1.5 kHz

Lateral spatial image shiftLe

vel d

iffer

ence

(dB

)Le

vel d

iffer

ence

(dB

)

Page 15: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Perceptual decoding of spatial cuesin a cross-coincident microphonerecording is based on ILDs

rotation

Page 16: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

• ITD (interaural time difference)

Lateral image shift

Page 17: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Lateralization demo. A simple time or level difference can make headphone images move from side to side inside the head.

Interaural level difference (dB)0 4 8 12

. 5 1 1 . 5

0 (center)

4

2

(max) 5

3

1

8

Interaural time difference (msec)0 .5 1.51

Adapted from Toole & Sayers, 1965 and Blauert, 1983: click stimuli

Adapted from Blauert, 1983: broadband noise

Late

ral s

hift

from

cen

ter

of th

e he

ad

2. ITD DEMO:0.00 ms0.25 ms0.50 ms0.75 ms1.00 ms1.50 ms

1. ILD DEMO:2 dB4 dB6 dB8 dB

12 dB

Page 18: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Elevation and front-back discrimination: HRTF, pinnae cues

Page 19: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Exte rnalize d pe rc ept ion

Lis t ene r

Source Le f t 3 0 °

Source Le f t 1 5 0 °

The cone of confusion causes reversals for virtual sources with identical or near-identical ITD or ILD

Page 20: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Frequency

Log

Mag

nitu

de (

dB)

2000 4000 6000 8000 10000 12000 14000100 16000

-40

-30

-20

-10

0

-50

10Right 30°, elevated

Right 90°, ear level

Right 120, below

Head-related transfer function cues (HRTFs) providecues for front-back discrimination and elevation

45°, 0°

135°, 0°

3. audio example:HRTF “clock positions”

Page 21: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Variation in HRTF magnitude with elevation at one azimuth

4. Audio example:

120 degreeazimuth: at

+36,

0 ,

-36 degrees

elevation

Graphic by William L. Martens,University of Aizu

Page 22: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

TARGET POSITIONREVERSAL ERROR WITH LOCALIZATION ERROR (ELEVATION)

TARGET POSITION

REVERSAL ERROR

REGION OF LOCALIZATION ERROR (AZIMUTH)

INTRACRANIAL LOCATION (DISTANCE ERROR)

Perceptual errors with headphone 3-D sound include inside-the-head localization (solution: reverberation cues) and reversals (solution: head tracking)

Page 23: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

0

5

10

15

20

25

30

anechoic early reflections full auralization

reverberation treatment

Uns

igne

d az

imut

h er

ror

(deg

rees

)

Localization error for headphone stimuli (azimuth)

AnechoicSpeech:Individualdifferences

Mean values for different reverberation conditions

Page 24: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Echoes, reverberation and background sound: perception of the environmental

context

Page 25: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Spatial hearing fundamentally involves perception of the location of a sound source at a point in space (azimuth, elevation, distance).

But a sound source simultaneously reveals information about its environmental context.

-reverberation-image size & extent

Distance

Elevation

Image Size

Azimuth

Environmental Context

Listener

Page 26: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Effect of delay time for a single echo

0 0.6 1.5 40 10Approximate delay time to left channel (msec)

image shift image broadening echo

Sound examples: 5. stereo echo- 6. monaural echoRelative to the reference condition,spatially separated echoes create spatial percepts;non-spatially separated echoes create timbral effects

Page 27: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Direct sound

Early reflections

Late reflections (dense reverberation)

Time

Early and late reverberant sound fieldsR

elat

ive

ampl

itude

7. Audio examples:-direct sound-direct w/ 1st, 2nd order ERs-direct with full auralization

R2

D

R1

Page 28: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Direct sound

Early reflections

Late reflections (dense reverberation)

Time

Early and late reverberant sound fields

0 . 5 1 1 . 5 2 2 . 5 3 3 . 5 4 4 . 5 5 5 . 5

x 1 04

- 4 0

- 3 5

- 3 0

- 2 5

- 2 0

- 1 5

- 1 0

- 5

Rel

ativ

e am

plitu

deR

elat

ive

ampl

itude

8. audio examples: normal and 0.25 speed impulse response

Page 29: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Echo thresholds• Sensitivity can increase as much as 10 dB

if echoes occur at different locations• Late reverberation can decrease sensitivity• Sensitivity increases with increasing time delay

Page 30: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Although thresholds for reverberation are relatively low, background noise(e.g., NC 35) can mask the reverberant decay.

Noise Criteria (NC) curves

0

10

20

30

40

50

60

70

80

One-Third Octave Band Center Frequency, Hz

reverberation threshold

NC 65

NC 60

NC 55

NC 50

NC 45

NC 40

NC35NC 30

NC 25

NC 20

NC 15Approximate Threshold of Hearing for Continuous Noise

NC 10NC 5

31.5 63 125 250 500 1k 2k 4k 8k-40

-35

-30

-25

-20

-15

-10

250

500

1000

2000 fb

w

250

500

1000

2000 fb

w

250

500

1000

2000 fb

w

Small Medium Large

Octave-Band Center Frequency (fbw=full bandwidth)

Rev

erbe

ratio

n th

resh

old

(spe

ech)

re 6

0 dB

SP

L

speech

Page 31: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Distance perception: amplitude cues

• The inverse square law states that sound decays 6 decibels per doubling of distance in a reflection-free environment.

2

4

8

85

1

79 73dB SPL

''

''

67

9. sound example

Page 32: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Distance perception: amplitude cues

However, “half-as-loud” corresponds to a 10 dB reduction in level with distance

2

4

8

85

1

75 65dB SPL

''

''

55

10. sound example

Page 33: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Distance perception: reverberant ratio cues

108642070

73

76

79

82

85

88

91

94

Anechoic

w/ ER

w/ ER + LR

distance (feet)

An increase in reverberant level indicatesmovement into the diffuse sound field

Page 34: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Concert Hall reverberation physical-perceptual parameters

• Reverberance (reverberation time, strength)

• Apparent source width (ASW) (interaural cross-correlation)

• Envelopment (spatial diffusion of reflections from all around)

• Clarity (ratio of first 50-80 ms of early sound to late sound)

• Warmth (ratio of bass frequency RT to mid-band RT)

Page 35: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Cognitive cues; multisensory cues

Page 36: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Cognitive cues to distance perception

Shouting

Whispering

Page 37: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Auditory localization can be influenced or biased bycognitive mapping

Page 38: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Influence of visual, vibratory cues

Explosions & crashes

Helicopter fly-overs

Page 39: What Audio Engineers Should Know About Human Sound Perception · broadband noise Lateral shift from center of the head 2. ITD DEMO: 0.00 ms 0.25 ms 0.50 ms 0.75 ms 1.00 ms 1.50 ms

Summary

• ILD, ITD differences and lateralization

• HRTF spectral changes for 3D imagery

• Binaural versus monaural influence of echoes

• Effects of reverberation on perception of the environmental context

• Cues to auditory distance

• Cognitive and multisensory cues