Physical and perceptual Physical and perceptual evaluation of the Interaural evaluation of the Interaural Wiener Filter algorithm Wiener Filter algorithm Simon Doclo 1 , Thomas J. Klasen 1 , Tim van den Bogaert 2 , Marc Moonen 1 , Jan Wouters 2 1 Dept. of Electrical Engineering (ESAT-SCD), KU Leuven, Belgium 2 Laboratory for Exp. ORL, KU Leuven, Belgium IHCON, Aug 19 2006 Slides available at http://homes.esat.kuleuven.be/~doclo/presentations.html
31
Embed
Physical and perceptual evaluation of the Interaural Wiener Filter algorithm Simon Doclo 1, Thomas J. Klasen 1, Tim van den Bogaert 2, Marc Moonen 1,
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Physical and perceptual evaluation of the Physical and perceptual evaluation of the
Interaural Wiener Filter algorithmInteraural Wiener Filter algorithm
Simon Doclo1, Thomas J. Klasen1, Tim van den Bogaert2,
Marc Moonen1, Jan Wouters2
1Dept. of Electrical Engineering (ESAT-SCD), KU Leuven, Belgium2Laboratory for Exp. ORL, KU Leuven, Belgium
IHCON, Aug 19 2006
Slides available at http://homes.esat.kuleuven.be/~doclo/presentations.html
o MMSE estimate of speech component in microphone signal at both ears
[Doclo, Klasen, Wouters, Moonen]
speech cues are preserved, no assumptions about position of speech source and microphones
noise cues may be distorted
Extension of MWF :
preservation of binaural speech and noise cues without substantially compromising noise
reduction performance
Problem statement
Binaural noise reduction
Multi-channel Wiener filter
Preservation of binaural cues
Experimental results
Audio demo
Conclusions
1212
Design of hearing aid SP algorithm
requires some mathematics
but perceptual evaluation in a couple of minutes…
1313
Configuration and signalsConfiguration and signals
• Configuration: microphone array with M microphones at left and right hearing aid, communication between hearing aids
• Vector notation: ( ) ( ) ( ) Y X V
noise component
0, 0, 0, 0( ) = ( ) , 0) =( 1m m mVY X m M 00, 0,, 0( ) = ( )( 0) , = 1mm mY V m MX
speech component
Problem statement
Binaural noise reduction
Multi-channel Wiener filter
Preservation of binaural cues
Experimental results
Audio demo
Conclusions
0 0 1 1( ) = ( ) ( ), ( ) = ( ) ( )H HZ Z W Y W Y
• Use all microphone signals to compute output signal at both ears
1414
Overview of cost functionsOverview of cost functions
Multi-channel Wiener filter (MWF): MMSE estimate of speech component in microphone signal at both ears
trade-off noise reduction and speech
distortion
Speech-distortion weighted multi-channel Wiener filter
(SDW-MWF)[Doclo 2002, Spriet 2004]binaural cue
preservation of speech + noise
Partial estimation of noise component
[Klasen 2005]
Extension with ITD-ILD or Interaural Transfer
Function (ITF)
[Doclo 2005, Klasen 2006]
Problem statement
Binaural noise reduction
Multi-channel Wiener filter
Preservation of binaural cues
Experimental results
Audio demo
Conclusions
1515
• Binaural SDW-MWF: estimate of speech component in microphone signal at both ears (usually front microphone) + trade-off between noise reduction and speech distortion
Binaural multi-channel Wiener Binaural multi-channel Wiener filterfilter
0
1
= , = ,x v M xx y v
M x v x
R R 0 rR r R R R
0 R R r
0
1
2 2
0, 0 0
11, 1
( )H H
r
HHr
XJ E
X
W X W VW
W VW X1=SDWW R r
trade-off parameter
Problem statement
Binaural noise reduction
Multi-channel Wiener filter
Preservation of binaural cues
Experimental results
Audio demo
Conclusions
estimate
o Depends on second-order statistics of speech and noise
o Estimate Ry during speech-dominated time-frequency segments, estimate Rv during noise-dominated segments, requiring robust voice activity detection (VAD) mechanism
o No assumptions about positions of microphones and sources
1616
Binaural multi-channel Wiener Binaural multi-channel Wiener filterfilter
• Binaural cues (ITD-ILD) :
Perfectly preserves binaural cues of speech component
Binaural cues of noise component speech component !!(cf. physical and perceptual evaluation)
Problem statement
Binaural noise reduction
Multi-channel Wiener filter
Preservation of binaural cues
Experimental results
Audio demo
Conclusions
• Extension of SDW-MWF with binaural cues
o Add term related to binaural cues of noise (and speech) component to SDW cost function
o Possible cues: ITD, ILD, Interaural Transfer Function (ITF)
o Weight factors and can be frequency-dependent
( ) = ( ) ( ) ( )x vtot SDW cue cueJ J J J W W W W
1717
Interaural Wiener FilterInteraural Wiener Filter
Problem statement
Binaural noise reduction
Multi-channel Wiener filter
Preservation of binaural cues
Experimental results
Audio demo
Conclusions
• Preserve binaural cues between input and output
o ITD: phase of cross-correlation
o ILD: power ratio
o ITF: Interaural transfer function (incorporates ITD and ILD)
0 0
1 1
Hv vout H
v
ZITF
Z
W V
W V
0 10
1 1 1
*0, 1,0, 0 1
*1, 1 11, 1,
( , )
( , )
r rrv vin
r vr r
E V VV r rITF
V r rE V V
R
Re.g.
0
1
2 2
0, 0 0
11, 1
2 2
0 1 0 1
( ) =H H
r
tot HHr
H x H H v Hin in
XJ E
X
E ITF E ITF
W X W VW
W VW X
W X W X W V W V
ITF preservation speech ITF preservation noise
o Closed form expression!
o large changes direction of speech component to noise component increase weight (cf. physical and perceptual evaluation)
1818
Overview of batch algorithmOverview of batch algorithmLeft input
signalsRight input
signals
( ) ( ) ( ) Y X VFFT FFT
0 0 0x vZ Z Z 1 1 1x vZ Z Z Left output Right output
IFFT IFFT
Frequency-domain filtering
Off-line computation of
statistics
VAD
( ), ( )v x R R
Calculate binaural input cues and
filter 0
1
( )( ) =
( )
WW
W, ,
Problem statement
Binaural noise reduction
Multi-channel Wiener filter
Preservation of binaural cues
Experimental results
Audio demo
Conclusions
1919
Experimental resultsExperimental results
• Identification of HRTFs:
o Binaural recordings on CORTEX MK2 artificial head
o 2 omni-directional microphones on each hearing aid (d=1cm)
o LS = -90:15:90, 90:30:270, 1m from head
o Conditions: T60=140 ms, fs=16 kHz, L=1366 taps
Problem statement
Binaural noise reduction
Multi-channel Wiener filter
Preservation of binaural cues
Experimental results- Physical- Perceptual
Audio demo
Conclusions
2020
Experimental resultsExperimental results
• Speech and noise material:
o Dutch sentences (VU list)
o Stationary speech-weighted noise with same long-term spectrum as speech material spatial aspects
o S0N60 , SNR=0 dB
o fs=16 kHz, FFT-size N=256, =1
• Physical evaluation:
o Speech intelligibility: SNR
o Localisation: ITD / ILD
• Perceptual evaluation:
o Preliminary study
o Speech intelligibility: SRT
o Localisation: localise S and N
Problem statement
Binaural noise reduction
Multi-channel Wiener filter
Preservation of binaural cues
Experimental results- Physical- Perceptual
Audio demo
Conclusions
2121
Physical evaluationPhysical evaluation
• Performance measures:
o Intelligibility weighted SNR improvement (left/right)
o ILD error (speech/noise component) power ratio
x xx out i in i
i
ILD ILD ILD
o ITD error (speech/noise component) phase of cross-correlation
x i x ii
ITD I ITD 1
* *0, 1, 0 10
{ } { }x i r r x xITD E X X E Z Z
L i L ii
SNR I SNR
importance of i-th frequencyfor speech intelligibility