Top Banner
Multivariate density estimation and its applications Wing Hung Wong Stanford University June 2014, Madison, Wisconsin. Conference in honor of the 80 th birthday of Professor Grace Wahba
41

Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Apr 24, 2018

Download

Documents

vudat
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Multivariate density estimation and its applications

Wing Hung WongStanford University

June 2014, Madison, Wisconsin. Conference in honor of the 80th birthday of Professor Grace Wahba

Page 2: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Estimate the function…Put prior on function space…Use RKHS, splines, approximation theory….

Lessons I learned in graduate school:

Page 3: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

A 3‐dimensional density function from flow cytometry

Page 4: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Mass cytometry: replace fluorescent labels with elemental labels

Mass-spectra

Holmium164.93031 amu

Page 5: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

The Bayesian nonparametric problem

• x1 , x2 , …xn are independent r.v. on a space Ω• Their distribution Q is unknown but assumed to be drawn from a prior distribution π.

• Our tasks: Prior construction, posterior computation

• Want this to work well when Ω is of moderately high dimension, e.g. 5‐50

π(Q) Q(X) → Pr( Q, X) → Pr (Q | X ) → Pr (g(Q))

Page 6: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Ferguson’s criteria (1973)• Support of the prior should be large• The posterior should be tractable

Dirichlet process prior satisfies Ferguson’s conditions.

However, under this prior the random distribution Q does not possess a density

Page 7: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Density is useful in many applications

• Anomaly detection• Classification

• Compression• Probabilistic networks• Image analysisand more

Page 8: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

We want to define a prior on the space of simple density functions:

f(x) = ∑ ci IAi(x)

To reduce complexity of this space, assume that Ω=A1 U A2…. U Am is a recursive partition

Page 9: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

A

A12

A11

A1221

A1222

A21

A22

In general, Ajk = kth part of the jth way to partition A

Recursive partitions:

Level 1

Level 2

Page 10: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Q is uniform within A

Recursive definition of random simple density function:Suppose Q(A) is known, define how Q( ) is distributed within A

Q(A)

Q(A)θ11 Q(A)θ1

2 Q(A)θ21

Q(A)θ21

S ~ Ber(ρ)

J ~ Multinomial(d, λ)

θj ~ Dirichlet (αj)

Page 11: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Density on partitions of finite depth

• Suppose we have drawn Q(k) supported on a partition composing of regions up to level k

• For each region not yet stopped, repeat the partitioning process

• This gives a random distribution Q(k+1) with a density q(k+1) that is piecewise constant on a partition with regions up to level k+1

Page 12: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Q is said to have an OPT distribution with parameters ρ (stopping rule) λ (selection probabilities) α (probability assignment weights)

P ( ∫ |q(k)–q| dx → 0 for some density q ) = 1

Optional Polya Tree (OPT) (Wong & Li, 2010)

Theorem: If the stopping probabilities are uniformly positive, then Q(k) converges almost surely in variationaldistance to an absolutely continuous distribution Q.

Page 13: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

2) π( Q | x1, ..xn ) is also OPT with parametersρ (x1, ..xn ), λ (x1, ..xn ), α (x1, ..xn ),

computable in finite time

OPT prior satisfies Ferguson’s criteria

1) Any L1 ball has positive prior probability

Page 14: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Q(A)

Q(A)θ11 Q(A)θ1

2 Q(A)θ21

Q(A)θ21

n1=(n11, n12) n2=(n21, n22)

φ1=(φ11, φ1

2) φ2=(φ21, φ2

2)

depends on n and ϕ below

Page 15: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Computation of Φ(A) by recursion

where

Recursion ends when A has either 0 or 1 data points.

Page 16: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Example

Page 17: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Example (continued)

Page 18: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

2nd approach: build up partition sequentially

v

Given t, want to define a posterior score for the partition directly

… …

t=2

t=3

t=4

Page 19: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Bayesian score of a partition X of size t:

Assume Dirichlet (α) allocation of probabilities given the partition, then

Page 20: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

log(π(xt*))

Kullback Leibler

xt*= best scoring partition of size t

t

Asymptotically, the score tracks distance from true density

Page 21: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Sequential importance sampling

v

Cut 1 Cut 2 Cut 3

v

Partition 1

Partition 2

vPartition 3

vPartition 4 v

v v

Partition Sample

Generate cuts randomly

M

w1 w1

w2

w3

w4

w2

w3

w4

Page 22: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

t

k

n

k

ttt

k

ADnnDey

121

21

21

121 1

,,,,)ˆ(

How to choose the proposal density?

)|().....|()()( 1121,.....2,1 ttttttt yyyyyyyy

)|().....|()()( 112211,.....2,1 ttttt yyyyyyyyq

feasibleinfeasible

Page 23: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Data structure for region counts

Page 24: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Counting can be accelerated by hardware

• Intel(R) Xeon(R) CPU E5640 @ 2.67GHz

• GeForce GTX 680– 1536 CUDA cores (8*192) @ 1.08GHz– 512KB L2– 2G RAM– Memory clock rate: 3GHz– Memory bus width: 256‐bit– Bandwidth: 150GB/s

Page 25: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Experiment result (CountEngine)

• Partition = 300, Cut = 1000

Dim_# of data CPU GPU Speedup

32_10^5 33375.40 18.56 1798.54

32_10^6 316794.00 28.82 10992.20

64_10^5 113327.00 24.59 4609.34

64_10^6 1086660.00 39.38 27593.51

128_10^5 438960.00 34.19 12837.45

128_10^6 NA 57.50 NA

Page 26: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Resampling:

Page 27: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Comparison with Kernel Density Estimate

Sample n points from a D-dim distribution:

Results:

Page 28: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Estimate of conditional density in 7 dimension

Page 29: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Case 1:

Case 2:

Case 3:

Case 4:

Page 30: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Density estimation is a building block for other statistical methods.

Page 31: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Classification: 1) Estimate class density fi(x) for classes 1, …k

2) Use Bayes classifier: p(i|x) ~ αi fi(x)

MAGIC data: 10 dimension, 12,000 cases, 7,000 controls

Letter data: 16 dimension, 26 classes, n=16,000 within class

Page 32: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

A sequencer yield 1 billion reads in about 1 day.

Quality scores associated with the base calls take up too much disk space

Test: n=1,940,271 quality score vectors (100 dimensional, divided into 20 sub-vectors) Result:Our method uses 206 bits per read for lossless compressionIn comparison, 7-zip uses 213 bits per read

Data Compression: the estimated density can guide the design of optimal compression scheme

Page 33: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Contour plot of the energy function of a 2D density with seven modes

Sub-level tree of energy(log-density)

Visualization of information

Page 34: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

density CD11b

CD4TCR-b

B220

CD8

Sublevel tree of bone marrow data (n=2,000,000)

Page 35: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Image segmentation

Page 36: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Image enhancement

Page 37: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Convergence rates (sieve MLE case)1. Class of simple functions on BPs of size I :

ΘI = f(·): f(·)=∑βi IAi (·) , βi ≥0, ∑βi μ(Ai)=1, and Ai , i=1,…I form a binary partition

2. Log-likelihood of f :

Ln(f)= ∑j=1,…n logf(yj) = ∑i=1,…I ni log(βi)3. MLE based on ΘI :

4. Sieve MLE based on ΘI, I=1, 2, …:

Page 38: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Classical result (Stone 1980): rate ~ n-α , α= p/(2p+d)

p= # of bounded derivatives of f0d= dimension of Ω

The key is to remove dependency of α on d.

Page 39: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

δI be the approximation rate of ΘI to f0,

Let HI be the bracketing Hellinger entropy of ΘI, and

Then (with ρ denoting the Hellinger distance),

A relevant result was given in Wong & Shen (1995):

However δI was required to be much stronger than ρ

Page 40: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Result (Linxi Liu & WW)

With r>1/2 and I(n) chosen to be

our sieve MLE has a rate upper bounded by

This result can be used to establish spatial adaptation and variable selection

Page 41: Multivariate density estimation and its applications · Multivariate density estimation and its applications ... If the stopping probabilities are uniformly ... Sublevel tree of bone

Acknowledgments

OPT: Li MaBSP: Luo Lu

Clustering & image: TY Wu, CY Tseng

Flow-cytometry: Michael Yang

Compression: Luo Lu, John Mu

---------------------------------------------------------------------------------------

Convergence rate: Linxi Liu