Daphne Koller Parameter Estimation Maximum Likelihood Estimation Probabilistic Graphical Models Learning.

Daphne Koller

Parameter Estimation

MaximumLikelihoodEstimation

ProbabilisticGraphicalModels

Learning

Daphne Koller

Biased Coin Example

• Tosses are independent of each other• Tosses are sampled from the same

distribution (identically distributed)

P is a Bernoulli distribution: P(X=1) = , P(X=0) = 1-

sampled IID from P

Daphne Koller

IID as a PGM

XData m X[1] X[M]

][)|][(

xmxmxP

Daphne Koller

Maximum Likelihood Estimation

• Goal: find [0,1] that predicts D well• Prediction quality = likelihood of D given

mmxPDPDL

1)|][()|():(

HHTTHL ,,,,:

0 0.2 0.4 0.6 0.8 1

Daphne Koller

Maximum Likelihood Estimator

• Observations: MH heads and MT tails

• Find maximizing likelihood

• Equivalent to maximizing log-likelihood

• Differentiating the log-likelihood and solving for :

TH MMTH MML )1(),:(

)1log(log),:( THTH MMMMl

Daphne Koller

Sufficient Statistics

• For computing in the coin toss example, we only needed MH and MT since

• MH and MT are sufficient statistics

TH MMDL )1():(

Daphne Koller

Sufficient Statistics• A function s(D) is a sufficient statistic from

instances to a vector in k if for any two datasets D and D’ and any we have

)':():(])[(])[('][][

DLDLixsixsDixDix

Datasets

Statistics

Daphne Koller

Sufficient Statistic for Multinomial

• For a dataset D over variable X with k values, the sufficient statistics are counts <M1,...,Mk> where Mi is the # of times that X[m]=xi in D

• Sufficient statistic s(x) is a tuple of dimension k– s(xi)=(0,...0,1,0,...,0)

Daphne Koller

Sufficient Statistic for Gaussian

• Gaussian distribution:

• Rewrite as

• Sufficient statistics for Gaussian: s(x)=<1,x,x2>

1)(),(~)(

eXpNXP if

Daphne Koller

Maximum Likelihood Estimation

• MLE Principle: Choose to maximize L(D:)

• Multinomial MLE:

• Gaussian MLE: m

2)ˆ][(1

Daphne Koller

Summary

• Maximum likelihood estimation is a simple principle for parameter selection given D

• Likelihood function uniquely determined by sufficient statistics that summarize D

• MLE has closed form solution for many parametric distributions

Daphne Koller

END END END

Daphne Koller Parameter Estimation Maximum Likelihood Estimation Probabilistic Graphical Models Learning.

likelihood of d

d sufficient statistic

likelihood equivalent

dataset d

datasets d

d wellprediction quality

gaussian mle

mh heads

Documents

Structured Models for Multi-Agent Interactions Daphne Koller...

Efficient Solution Algorithms for Factored MDPs by Carlos...

PROBABILISTIC GRAPHICAL MODELS -...

Structured Models for Decision Making Daphne Koller Stanford...

Daphne Koller Decision Making Utility Functions...

Daphne Koller,engineer/gedc2013/bio-pdfs/koller.pdf ·...

Daphne Koller Overview Maximum a posteriori (MAP)...

Multi-modal robotic perception Stephen Gould, Paul...

Projection Methods (Symbolic tools we have used to do…)...

A Probabilistic Model for Component-Based Shape Synthesis...

Coursera Presentation - Dr. Daphne Koller

Analyzing Patient Interactions within Cancer Support Groups....

Graphical' Models' Structure' Learning'€¦ · Learning'.....

Multi-Class Segmentation with Relative Location...

. Learning Bayesian Networks from Data Nir Friedman Daphne.....

Learning Probabilistic Relational Models Daphne Koller...