Top Banner
Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely
25

Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Dec 21, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Nearfield Spherical Microphone Arrays

for speech enhancement and dereverberation

Etan Fisher

Supervisor:

Dr. Boaz Rafaely

Page 2: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Microphone Arrays Spatial sound acquisition Sound enhancement Applications:

reverberation parameter estimation dereverberation video conferencing

Page 3: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

SpheresThe sphere as a symmetrical, natural entity.

Spherical symmetry

Facilitates direct sound field analysis:Spherical Fourier transformSpherical harmonics

Photo by Aaron Logan

Page 4: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Nearfield Spherical Microphone Array Generally, the farfield, plane wave assumption is made

(Rafaely, Meyer & Elko). In the nearfield, the spherical wave-front must be

accounted for.

Examples: Close-talk microphone Nearfield music recording Multiple speaker / video conferencing

Page 5: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Sound Pressure - Spherical Wave

Sound pressure on sphere r due to point source rp (spherical wave):

Spherical harmonics:

imm

nmn eP

mn

mnnY )(cos

)!(

)!(

4

)12(),(

0

||

),(),()()()(||

),,(*

n

n

nm

mnpp

mnpnn

p

rrik

YYkrhkrbkikarr

ekrp

p

From the solution to the wave equation (spherical coordinates):

Page 6: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Sound Pressure - Spherical Wave

Sound pressure on sphere r due to point source rp :

Spherical harmonics:

The spherical harmonics

are orthogonal and complete.

immn

mn eP

mn

mnnY )(cos

)!(

)!(

4

)12(),(

0

||

),(),()()()(||

),,(*

n

n

nm

mnpp

mnpnn

p

rrik

YYkrhkrbkikarr

ekrp

p

From the solution to the wave equation (spherical coordinates):

Page 7: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Sound Pressure - Spherical Wave Sound pressure on sphere r due to point source rp:

is the spherical Hankel function.

is the modal frequency function (Bessel):

ra radius of sphere Rigid

sphereOpen

))()('

)(')((4

)(4)(

krhkah

kajkrj

krjkrb

nn

nn

n

n

0

),(),()()()(),,(*

n

n

nm

mnpp

mnpnn YYkrhkrbkikakrp

)(krhn

)(krbn

Page 8: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Spherical Spectrum Functions)(krbn)(krhn

Page 9: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Spherical Spectrum Functions)()( krhkrb nn

Page 10: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Point Source Decomposition Sound pressure on sphere r due to point source rp:

Spherical Fourier transform:

Spatial filter – cancel spherical wave-front, yielding unit amplitude at rp=r0.

)()(

)()(

)()(

)()(

*

00p

mn

n

pn

nn

nmnm Y

krh

krhka

krhkrikb

krpkrw

)()()()()(),()(**

pmnpnn

mnnm YkrhkrbkikadYkrpkrp

0

),(),()()()(),,(*

n

n

nm

mnpp

mnpnn YYkrhkrbkikakrp

Page 11: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Point Source Decomposition Amplitude density:

Using the identity:

where Θ is the angle between Ω and Ωp,

0

*

0

)()()(

)()(),(n

n

nm

mnp

mn

n

pn YYkrh

krhkakw

)(cos4

12)()(

*

n

n

nm

mnp

mn P

nYY

0 0

)(cos4

12

)(

)()(),(n

nn

pn Pn

krh

krhkakw

Page 12: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Nearfield Criteria

N Order of array

k Wave number

rA

Array

radius

rs

Source

distance

Page 13: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

N = 4; rA (array) = 0.1m; k = kmax

kmax = N/rA = 40

kmax = 2πfmax /343

fmax = 2184 Hz

r0 – Desired source location

rp – Interference location

Radial Attenuation

Page 14: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

N = 4; rA (array) = 0.1m; k = kmax/4

kmax = N/rA = 40

kmax = 2πfmax /343

fmax = 2184 Hz

r0 – Desired source location

rp – Interference location

Radial Attenuation

Page 15: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

N = 4; rA (array) = 0.1m; k = kmax/10

kmax = N/rA = 40

kmax = 2πfmax /343

fmax = 2184 Hz

r0 – Desired source location

rp – Interference location

Radial Attenuation

Page 16: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

N = 2; rA (array) = 0.05 m; k = kmax

kmax = N/rA = 40

kmax = 2πfmax /343

fmax = 2184 Hz

r0 – Desired source location

rp – Interference location

Radial Attenuation – “Close Talk”

Page 17: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

N = 2; rA (array) = 0.05 m; k = kmax /4

kmax = N/rA = 40

kmax = 2πfmax /343

fmax = 2184 Hz

r0 – Desired source location

rp – Interference location

Radial Attenuation – “Close Talk”

Page 18: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

N = 12; rA (array) = 0.3 m; k = kmax /4

kmax = N/rA = 40

kmax = 2πfmax /343

fmax = 2184 Hz

r0 – Desired source location

rp – Interference location

Radial Attenuation – Large Array

Page 19: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

N = 4; rA (array) = 0.1m; k = kmax

kmax = N/rA = 40

kmax = 2πfmax /343

fmax = 2184 Hz

The natural radial attenuation has been cancelled by multiplying the array output by the distance.

Normalized Beampattern

Page 20: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

N = 4; rA (array) = 0.1m; k = kmax /4

kmax = N/rA = 40

kmax = 2πfmax /343

fmax = 2184 Hz

The natural radial attenuation has been cancelled by multiplying the array output by the distance.

Normalized Beampattern

Page 21: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

N = 4; rA (array) = 0.1m; k = kmax /10

kmax = N/rA = 40

kmax = 2πfmax /343

fmax = 2184 Hz

The natural radial attenuation has been cancelled by multiplying the array output by the distance.

Normalized Beampattern

Page 22: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Directional Impulse Response

Amplitude density:

Impulse response at direction Ω0:

where is the ordinary inverse Fourier transform.

0 0

)(cos4

12

)(

)()(),(n

nn

pn Pn

krh

krhkakw

)},({)( 01 kwtw

1

Page 23: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Speech Dereverberation

Room IR Directional IR

{4 X 3 X 2}

N = 4

r = 0.1 m

r0 = 0.2 m

“Dry”

“Rev.”

“Derev.”

Page 24: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Music Dereverberation Room IR Directional IR

{ 8 X 6 X 3 }

N = 4

r = 0.1 m

r0 = 1.9 m

“Dry”

“Rev.”

“Derev.”

Page 25: Nearfield Spherical Microphone Arrays for speech enhancement and dereverberation Etan Fisher Supervisor: Dr. Boaz Rafaely.

Conclusions Spherical wave pressure on a spherical microphone

array in spherical coordinates. Point source decomposition achieves radial

attenuation as well as angular attenuation. Directional impulse response (IR) vs. room IR. Speech and music dereverberation. Further work:

Develop optimal beamformer Experimental study of array