Top Banner
What do we hear for ? Seeing is knowing what is where by looking ( David Marr ) Seeing is predicting what is where, verified by looking, in order to drink that cup of coffee ( Reza Shadmehr )
81

What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Dec 21, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

What do we hear for?

Seeing is knowing what is where by looking

(David Marr)

Seeing is predicting what is where, verified by looking, in order to drink that cup of

coffee

(Reza Shadmehr)

Page 2: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.
Page 3: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

What do we hear for?

Seeing is knowing what is where by looking

(David Marr)

Seeing is predicting what is where, verified by looking, in order to drink that cup of coffee

(Reza Shadmehr)

Hearing is predicting what will happen next, verified by listening, in order to know as much as

possible about what’s out there

(Eli Nelken)

Page 4: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.
Page 5: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Even simple sounds tell stories

Page 6: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

A stupid story

Page 7: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

The calm of the sea

Vox balaenae )Voice of the whale(For flute, cello and piano )cello and piano playing(George Crumb

Page 8: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

A shout of despair

Wozzeck, orchestral transition between scenes 2 and 3 of act 3Alban Berg

Page 9: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Auditory worlds

• What are sounds?

• What do we hear?

• How do we hear?

Page 10: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Sound As a Pressure Wave

Vibrations of objects set up pressure waves in the surrounding air.The “elastic” property of air allows these pressure waves to propagate (spread).

Vibrations of objects set up pressure waves in the surrounding air.The “elastic” property of air allows these pressure waves to propagate (spread).

Page 11: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Structure of sounds

Page 12: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

What happens without structure?

Page 13: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Introducing structure

Page 14: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

The bird and Chopin

©Gabriel J. Arsante

Page 15: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Structure of sounds

©Gabriel J. Arsante

Page 16: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

What are sounds?

• Structure at a lot of time scales

• Perceptual correlates:– Melodies )1 s(– Notes )0.1 s(– Pitch )much faster than 0.01 s(

Page 17: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Peripheral processing of sounds

Page 18: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Inner Ear

Middle Ear

Outer Ear

Page 19: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Inner Ear

Middle Ear

Outer Ear

Page 20: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Inner Ear

Middle Ear

Outer Ear

Page 21: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Inner Ear

Middle Ear

Outer Ear

Page 22: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Cross Section of Cochlea

Page 23: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

“Travelling Wave” Along the Basilar Membrane

Von Békésy

Page 24: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Travelling Wave Peaks at Different Locations As the Frequency Changes

Page 25: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.
Page 26: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Outer Hair Cells

Inner Hair Cells

Page 27: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

A simple neuron in the auditory system

BF

Page 28: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

The auditory pathways

Page 29: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Responses of simple neurons to complex sounds

Page 30: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Orig Slow

A set of complex sounds

Page 31: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

In consequence…

Page 32: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

The neurogram

Page 33: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

We get a very rich and precise representation of the incoming

sound at the level of the auditory nerve

Page 34: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

The sound and its components

full

337

600

2000

Brahms, Geistlisches WiegenliedOp. 91 no. 2Kathleen Ferrier, Phyllis Spurr,Max Gilbert

Page 35: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Is that enough?

(do we hear the spectrogram)?

Page 36: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

What are the perceptual qualities of sounds?

“The basic elements of any sound are loudness, pitch, contour, duration )or rhythm(, tempo, timbre, spatial location, and reverberation.”

)D.J. Levitin, This is Your Brain on Music: The Science of a Human Obsession, p.14(

Page 37: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

The Long Road from Spectrogram to Perception

• How do we go from the ‘neurogram’ to ‘loudness, pitch, contour, duration )or rhythm(, tempo, timbre, spatial location, and reverberation’?

Page 38: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Relationships with low-level features…

• Loudness with sound intensity– Encoded by some population-averaged activity

• Pitch with periodicity

Page 39: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

pure

Pur

e to

nes

Time

Filtered clicks

Fil

tere

d cl

icks

Iterated ripple noise

IRN

AM (3 kHz)

SA

M

Pitch: examples

Page 40: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Relationships with low-level features…

• Loudness with sound intensity– Encoded by some population-averaged activity

• Pitch with periodicity– Periodicity IS NOT frequency!

• Contour with slow amplitude modulations– Encoded in the range of 1-10 Hz very clearly at the level of A1

)e.g. Shamma and collaborators(– But not slower than that )probably(

• Duration/rhythm with ???• Tempo with ???• Timbre with spatial activation patterns )e.g. in A1(• Spatial location with ITD/ILD/spectral activation patterns

– Low-level information available at the CN/SOC– But requires integration

• Reverberation with ???????

Page 41: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

The Long Road from Spectrogram to Perception

• Pitch, timbre, phonemic identity, and so on are ‘separable’ – they are independent of each other

• They represent high-level generalizations– Many different sounds have the same pitch )violin and

trumpet(, same timbre )trumpet on two different tones(, same phonemic identity )two different people talking(

– The neurograms of these pairs of sounds are very different from each other

• The generalizations should be derivable from the neurogram, but are not explicitly represented at that level

Page 42: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

The Long Road from Spectrogram to Perception

Problem no. 1: we do not hear the physics of sounds, but rather their derived properties

)Reverse hierarchies – we perceive high representation levels unless we make

serious efforts to go down into the details(

Page 43: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

The Long Road from Spectrogram to Perception

Page 44: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

The Long Road from Spectrogram to Perception

Problem no. 2: In natural conditions, sounds rarely occur by themselves

We have to group and segregate ‘bits of sounds’ in order to form representations of

‘auditory objects’

Page 45: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

What comes first, the sound or its properties?

• We may need to start by forming objects )solve problem no. 2( and only later assign properties to them )solve problem no. 1(

Page 46: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Hypothesis: the early auditory system )presumably up to the

level of primary auditory cortex( deals with the formation of

auditory objects

Page 47: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Evidence A:Object representation in primary

auditory cortex

Page 48: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

The auditory pathways

Page 49: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Primary auditory cortex is a higher brain area!

Visual system:

Photoreceptors

Bipolar cells

Retinal ganglion cells

LGN

V1

IT

Face cells

Auditory system:

Hair cells

Auditory nerve fibers

Cochlear nucleus

Superior Olive

Inferior Colliculus

MGB

Auditory cortex

Frequency

Soun

d le

vel

Localization and binaural detection

Species-specific calls?

Auditory scene analysis?

Page 50: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

The auditory pathways

Page 51: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

A1 Neurons have a large variety of frequency response areas )FRAs(

98 98

Page 52: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Memory in primary auditory cortex

Page 53: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Neurons in auditory cortex represent the weak components

of sounds(evidence for the representation of

auditory objects in primary auditory cortex)

Page 54: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Strong effects of weak backgrounds…

0.1 40kHz

100

10

dB A

ttn

0 100ms

0 100ms

0 100ms

Page 55: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Some cortical neurons respond to weak noise in mixture with high-level tones

Page 56: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Tones in modulated and unmodulated background

Page 57: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Noise (bandwidth: BF, 10 Hz trapezoidal envelope)Noise (bandwidth: BF, 10 Hz trapezoidal envelope)Tone (BF)Noise (bandwidth: BF, 10 Hz trapezoidal envelope)Tone (BF)Tone+Noise

Weak tones in strong noise

Las et al. 2005

Page 58: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Responses to high-level tones in silence and to low-level tones in

noise are similar

Page 59: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Evidence B: coding of surprising events in

primary auditory cortex

Page 60: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Time

Low Freq.

High Freq.

Time

Low Freq.

High Freq.

Time

High Freq.

Low Freq.

95% 50% 5%

Page 61: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Low Freq. Low Freq.

Low Freq.

Page 62: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Low Freq. High Freq.

Deviant

Standard

SSA =

0.34 0.32

0.23

Page 63: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.
Page 64: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

…Also with spikes…

Page 65: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Evidence C: Perceptual qualities such as pitch

are coded outside primary auditory cortex

Page 66: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Activation of auditory cortex by noise and pitched stimuli

Page 67: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Activation by intelligible speech

Page 68: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Take-home messages

• Auditory perception is far removed from the ‘physical’, low-level representation of sounds

• A major problem of early processing is the definition of the ‘objects’ to which properties will be assigned

• There is evidence that objects are defined first, properties are assigned in higher brain areas

Page 69: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Reverse Hierarchy Theory

• The hierarchical trade offs that dictate the relations between processing and perception

• We perceive the high-order constructs rather than the low-level physics

Page 70: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Interactions between high- and low-level representations

Page 71: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Interactions between high- and low-level representations

Page 72: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Interactions between high- and low-level representations

Page 73: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

From Hochstein and Ahissar 2002

Page 74: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Change blindness

Page 75: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Name the color of the letters

Page 76: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

נשר

Page 77: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

אדום

Page 78: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

כחול

Page 79: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Visual Reverse Hierarchy Theory )RHT(

(Ahissar & Hochstein, 1997; Hochstein & Ahissar, 2002)

Page 80: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

Feedback re

verse hierarch

yFeed-fo

rward hierarch

y

Low levels are sensitive to fine temporal cues,

in a μs resolution

Phonological/semantic level

……

day bay

nightdream

Initial perception is based on high-levels,

which represent phonological entities

Page 81: What do we hear for? Seeing is knowing what is where by looking (David Marr) Seeing is predicting what is where, verified by looking, in order to drink.

See: Nahum, Nelken and Ahissar, PLoS 2008

We can either hear the sounds or understand the words, but not

both at the same time