Top Banner
Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles in Haystacks Scott C. Douglas Department of Electrical Engineering Southern Methodist University [email protected]
40

Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Mar 27, 2015

Download

Documents

Jesus Stone
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Click to edit Master text stylesSecond levelThird levelFourth levelFifth level

Click to edit Master title style

Blind Source Separation: Finding Needles in Haystacks

Scott C. Douglas

Department of Electrical Engineering

Southern Methodist University

[email protected]

Page 2: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Signal Mixtures are Everywhere

• Cell Phones• Radio Astronomy• Brain Activity• Speech/Music

How do we make

sense of it all?

Page 3: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Example: Speech Enhancement

Page 4: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Example: Wireless Signal Separation

Page 5: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Example: Wireless Signal Separation

Page 6: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Example: Wireless Signal Separation

Page 7: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Example: Wireless Signal Separation

Page 8: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Outline of Talk

• Blind Source Separation General concepts and approaches

• Convolutive Blind Source Separation Application to multi-microphone speech

recordings

• Complex Blind Source Separation What differentiates the complex-valued case

• Conclusions

Page 9: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Blind Source Separation (BSS) -A Simple Math Example

• Let s1(k), s2(k),…, sm(k) be signals of interest• Measurements: For 1 ≤ i ≤ m,

xi(k) = ai1 s1(k) + ai2 s2(k) + … + aim sm(k)• Sensor noise is neglected• Dispersion (echo/reverberation) is absent

A Bs(k) x(k) y(k)

Page 10: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Blind Source Separation Example (continued)

A Bs(k) x(k) y(k)

• Can Show: The si(k)’s can be recovered as

yi(k) = bi1 x1(k) + bi2 x2(k) + … + bim xm(k)

up to permutation and scaling factors (the

matrix B “is like” the inverse of matrix A)

Problem: How do you find the demixing bij’s

when you don’t know the mixing aij’s or sj(k)’s?

Page 11: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Why Blind Source Separation?(Why not Traditional Beamforming?)

• BSS requires no knowledge of sensor geometry. The system can be uncalibrated, with unmatched sensors.

• BSS does not need knowledge of source positions relative to the sensor array.

• BSS requires little to no knowledge of signal types - can push decisions/ detections to the end of the processing chain.

Page 12: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

What Properties Are Necessary for BSS to Work?

Separation can be achieved when (# sensors) ≥ (# of sources) • The talker signals {sj(t)} are statistically-independent

of each other and are non-Gaussian in amplitude

OR have spectra that differ from each other

OR are non-stationary

• Statistical independence is the critical assumption.

Page 13: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Entropy is the Key to Source SeparationEntropy: A measure of regularity

In BSS, separated signals are demixed and, have “more order” as a group.

First used in 1996 for speech separation.

- In physics, entropy increases (less order)

- In biology, entropy decreases (more order)

Page 14: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Convolutive Blind Source Separation

• Mixing system is dispersive:

• Separation System B(z) is a multichannel filter

Page 15: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Goal of Convolutive BSS

• Key idea: For convolutive BSS, sources are arbitrarily filtered and arbitrarily shuffled

Page 16: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Non-Gaussian-Based Blind Source Separation

• Basic Goal: Make the output signals look non-Gaussian, because mixtures look “more Gaussian” (from the Central Limit Theorem)

• Criteria Based On This Goal: Density Modeling Contrast Functions Property Restoral [e.g. (Non-)Constant Modulus

Algorithm]

• Implications: Separating capability of the criteria will be similar Implementation details (e.g. optimization strategy)

will yield performance differences

Page 17: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

BSS for Convolutive Mixtures• Idea: Translate separation task into

frequency domain and apply multiple independent instantaneous BSS procedures Does not work due to permutation problems

• A Better Idea: Reformulate separation tasks in the context of multichannel filtering Separation criterion “stays” in the time

domain – no implied permutation problem Can still employ fast convolution methods

for efficient implementation

Page 18: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Natural Gradient Convolutive BSS Alg. [Amari/Douglas/Cichocki/Yang 1997]

where f(y) is a simple vector-valued nonlinearity.Criterion: Density-based (Maximum Likelihood)Complexity: about four multiply/adds per tap

=

Page 19: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Blind Source Separation Toolbox

• A MATLAB toolbox of robust source separation algorithms for noisy convolutive mixtures (developed under govt. contract)

• Allows us to evaluate relationships and tradeoffs between different approaches easily and rapidly

• Used to determine when a particular algorithm or approach is appropriate for a particular (acoustic) measurement scenario

Page 20: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Speech Enhancement Methods

• Classic (frequency selective) linear filtering Only useful for the simplest of situations

• Single-microphone spectral subtraction: Only useful if the signal is reasonably well-

separated to begin with ( > 5dB SINR ) Tends to introduce “musical” artifacts

• Research Focus: How to leverage multiple microphones to achieve robust signal enhancement with minimal knowledge.

Page 21: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Novel Techniques for Speech Enhancement

• Blind Source Separation: Find all the talker signals in the room - loud and soft, high and low-pitched, near and far away … without knowledge of any of these characteristics.

• Multi-Microphone Signal Enhancement: Using only the knowledge of “target present” or “target absent” labels on the data, pull out the target signal from the noisy background.

Page 22: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

SMU Multimedia Systems LabAcoustic Facility

•Room (Nominal Configuration)Acoustically-treatedRT = 300 msNon-parallel walls to prevent flutter echo

•SourcesLoudspeakers playing Recordings as well as “live” talkers.Distance to mics: 50 cmAngles: -30

o, 0

o, 27.5

o

•SensorsOmnidirectional Micro- phones (AT803b)Linear array (4cm spacing)

• Data collection and processing entirely within MATLAB. • Allows for careful characterization, fast evaluation, and experimentation with artificial and human talkers.

Page 23: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Performance improvement: Between 10 dB and 15 dB for “equal-level” mixtures, and even higher for

unequal-level ones.

Blind Source Separation Example

Convolutive Mixing (Room)

Separation System (Code)

Talker 1

(MG)

Talker 2(SCD)

Page 24: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Unequal Power Scenario ResultsUnequal Power Scenario Results

Time-domain CBSS Time-domain CBSS methods provide methods provide the greatest SIR the greatest SIR improvements for improvements for weak sources; no weak sources; no significant significant improvement in SIR improvement in SIR if the initial SIR is if the initial SIR is already largealready large

Page 25: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Noise Source

Noise Source

Speech Source

Linear Processing

AdaptiveAlgorithm

Multi-Microphone Speech Enhancement

Contains most speech

Contains most noise

y1

y2

y3

yn

z1

z2

z3

zn

Page 26: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Speech Enhancement via Iterative Multichannel Filtering

• System output at time k: a linear adaptive filter

• is a sequence of (n x n) matrices at iteration k.

• Goal: Adapt , over time such that the multichannel output contains signals with maximum speech energy in the first output.

Page 27: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Multichannel Speech Enhancement Algorithm

• A novel* technique for enhancing target speech in noise using two or more microphones via joint decorrelation

• Requires rough target identifier (i.e. when talker speech is present)

• Is adaptive to changing noise characteristics• Knowledge of source locations, microphone

positions, other characteristics not needed.• Details in [Gupta and Douglas, IEEE Trans.

Audio, Speech, Lang. Proc., May 2009] *Patent

pending

Page 28: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

28

Performance Evaluations

• Room– Acoustically-treated, RT = 300 ms– Non-parallel walls to prevent flutter echo

• Sources– Loudspeakers playing BBC Recordings

(Fs = 8kHz), 1 male/1-2 noise sources– Distance to mics: 1.3 m– Angles: -30

o, 0

o, 27.5

o

• Sensors– Linear array adjustable (4cm spacing)

• Room– Ordinary conference room (RT=600ms)

• Sources– Loudspeakers playing BBC Recordings

(Fs = 8kHz), 1 male/1-2 noise sources– Angles: -15

o, 15

o, 30

o

• Sensors– Omnidirectional Microphones (AT803b)– Linear array adjustable (4cm nominal

spacing)

6 7

867

8

Page 29: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Audio Examples

• Acoustic Lab: Initial SIR = -10dB, 3-Mic System

Before: After:• Acoustic Lab: Initial SIR = 0dB, 2-Mic System

Before: After:• Conference Room: Initial SIR = -10dB, 3-Mic System

Before: After:• Conference Room: Initial SIR = 5dB, 2-Mic System

Before: After:

Page 30: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Effect of Noise Segment Length on Overall Performance

Page 31: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

31

Diffuse Noise Source Example

• Noise Source: SMU Campus-Wide Air Handling System

• Data was recorded using a simple two-channel portable M-Audio recorder (16-bit, 48kHz) with it associated “T”-shaped omnidirectional stereo array at arm’s length, then downsampled to 8kHz.

Page 32: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

32

Air Handler Data Processing

• Step 1: Spatio-Temporal GEVD Processing on a frame-by-frame basis with L = 256, where Rv(k) = Ry(k-1); that is, data was whitened to the previous frame.

• Step 2: Least-squares multichannel linear prediction was used to remove tones.

• Step 3: Log-STSA spectral subtraction was applied to the first output channel.

Page 33: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Complex Blind Source Separation

A Bs(k) x(k) y(k)

• Signal Model: x(k) = A s(k)

• Both the si(k)’s in s(k) and the elements of A are complex-valued.

• Separating matrix B is complex-valued as well.

• It appears that there is little difference from the real-valued case…

Page 34: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Complex Circular vs. Complex Non-Circular Sources

• (Second-Order) Circular Source: The energies of the real and imaginary parts of si(k) are the same.

• (Second-Order) Non-Circular Source: The energies of the real and imaginary parts of si(k) are not the same.

Non-CircularCircular Circular

Page 35: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Why Complex Circularity Matters in Blind Source Separation

• Fact #1: It is possible to separate non-circular sources by decorrelation alone if their non-circularities differ [Eriksson and Koivunen, IEEE Trans. IT, 2006]

• Fact #2: The strong-uncorrelating transform is a unique linear transformation for identifying non-circular source subspaces using only covariance matrices.

• Fact #3: Knowledge of source non-circularity is required to obtain the best performance of a complex BSS procedure.

Page 36: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Complex Fixed Point Algorithm [Douglas 2007]

NOTE: The MATLAB code involves both transposes and Hermitian transposes… and no, those aren’t mistakes!

Page 37: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Performance Comparisons

Page 38: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Complex BSS ExampleOriginal Sources

SensorSignals

16-elem ULA, /4Spacing 3000 Snapshots SINRs/elem: -17,-12,-5,-12,-12 (dB) . DOAs(o): -45,20,-15,49,35

CFPA1Outputs

Output SINRs (dB):7,24,18,15,23

Complexity: ~3500 FLOPSper output sample

Page 39: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

Conclusions• Blind Source Separation provides unique

capabilities for extracting useful signals from multiple sensor measurements corrupted by noise.

• Little to no knowledge of the sensor array geometry, the source positions, or the source statistics or characteristics is required.

• Algorithm design can be tricky. • Opportunities for applications in speech

enhancement, wireless communications, other areas.

Page 40: Click to edit Master text styles Second level Third level Fourth level Fifth level Click to edit Master title style Blind Source Separation: Finding Needles.

For Further Reading

My publications page at SMU:

http://lyle.smu.edu/~douglas/puball.html

• It has available for download • 82% of my published journal papers• 75% of my published conference papers