University of Joensuu Dept. of Computer Science P.O. Box 111 FIN- 80101 Joensuu Tel. +358 13 251 7959 fax +358 13 251 7955 Gaussian Mixture.

University of JoensuuDept. of Computer ScienceP.O. Box 111FIN- 80101 Joensuu

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Gaussian Mixture Models

Speech and Image Processing UnitDepartment of Computer Science

University of Joensuu, FINLAND

Ville Hautamäki

Clustering Methods: Part 8

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Preliminaries

• We assume that the dataset X has been generated by a parametric distribution p(X).

• Estimation of the parameters of p is known as density estimation.

• We consider Gaussian distribution.

http://research.microsoft.com/~cmbishop/PRML/Figures taken from:

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Typical parameters (1)

• Mean (μ): average value of p(X), also called expectation.

• Variance (σ): provides a measure of variability in p(X) around the mean.

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Typical parameters (2)

• Covariance: measures how much two

variables vary together.

• Covariance matrix: collection of covariances between all dimensions.

– Diagonal of the covariance matrix

contains the variances of each attribute.

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

One-dimensional Gaussian

• Parameters to be estimated are the mean (μ) and variance (σ)

1 1Normal( | , ) exp

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Multivariate Gaussian (1)

• In multivariate case we have covariance matrix instead of variance

2 1/ 2

1 1 1Normal( | , ) exp

(2 ) det( ) 2

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Multivariate Gaussian (2)

Full covarianceDiagonalSingle

2 211 12

2 212 22

ln ( ) ln Normal( | , )N

Complete data log likelihood:

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Maximum Likelihood (ML) parameter estimation

• Maximize the log likelihood formulation

• Setting the gradient of the complete data log

likelihood to zero we can find the closed form

solution.

– Which in the case of mean, is the sample average.

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

When one Gaussian is not enough

• Real world datasets are rarely unimodal!

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Mixtures of Gaussians

( ) Normal( | , )M

k k kk

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Mixtures of Gaussians (2)

• In addition to mean and covariance parameters (now M times), we have mixing coefficients πk.

Following properties hold for the mixing coefficients:

It can be seen as the prior probability of the component k

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Responsibilities (1)

• Component labels (red, green and blue)

cannot be observed.

• We have to calculate approximations

(responsibilities).

Complete data Incomplete data Responsibilities

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

• Responsibility describes, how

probably observation vector x is from

component k.

• In clustering, responsibilities take

values 0 and 1, and thus, it defines the

hard partitioning.

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

We can express the marginal density p(x) as:

( ) ( ) ( | )M

p p k p k

( ) ( | )

Normal( | , )

l l ll

p l p l

From this, we can find the responsibility of the kth component of x using Bayesian theorem:

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Expectation Maximization (EM)

• Goal: Maximize the log likelihood of the whole data

• When responsibilities are calculated, we can maximize individually for the means, covariances and the mixing coefficients!

ln ( | , , ) ln Normal( | , )N M

k n k kn k

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Exact update equations

New mean estimates:

Covariance estimates

Mixing coefficient estimates

k k n nnkN

k k nn

1( )( )( )

k k nnkN

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

EM Algorithm

• Initialize parameters

• while not converged

– E step: Calculate responsibilities.

– M step: Estimate new parameters

– Calculate log likelihood of the new

parameters

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Example of EM

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Computational complexity

• Hard clustering with MSE criterion is NP-complete.

• Can we find optimal GMM in polynomial time?

• Finding optimal GMM is in class NP

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Some insights

• In GMM we need to estimate the parameters, which all are real numbers– Number of parameters:

M+M(D) + M(D(D-1)/2)

• Hard clustering has no parameters, just set partitioning (remember optimality criteria!)

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Some further insights (2)

• Both optimization functions are mathematically rigorous!

• Solutions minimizing MSE are always meaningful

• Maximization of log likelihood might lead to singularity!

Tel. +358 13 251 7959fax +358 13 251 7955

www.cs.joensuu.fi

Example of singularity

University of Joensuu Dept. of Computer Science P.O. Box 111 FIN- 80101 Joensuu Tel. +358 13 251 7959 fax +358 13 251 7955 Gaussian Mixture.

variance slide

covariance parameters

covariance estimates

covariance matrix

new parameters

typical parameters

multivariate gaussian

log likelihood formulation

Documents

Joensuu 2013 på svenska

Joensuu 2014-2015

EUROPROFILE DIN CYLINDERS...8807743 01/2012 3000 kpl, Suomen...

Joensuu 2013 - auf Deutsch

University of Eastern Finland School of Computing P.O. Box.....

Joensuun yliopisto PL 111 80101 Joensuu puh. (013) 251 111.....

University of Joensuu Dept. of Computer Science P.O. Box 111...

Joensuu 2016 FRA

Joensuu 2012 English

Joensuu 2012 Deutsch

Joensuu 2016 RUS

KARELIA-AMMATTIKORKEAKOULU Metsätalouden...

University of Joensuu Dept. of Computer Science P.O. Box 111...

Uutta virtaa vapaaehtoistoimintaan Joensuu

THE ROLE OF A WEBSITE IN COMPANY MARKETING ......THESIS...

Litteraturhusen Joensuu 2012