Top Banner
Unified Maximum Likelihood Estimation of Symmetric Distribution Properties Jayadev Acharya Hirakendu Das Alon Orlitsky Ananda Suresh Cornell Yahoo UCSD Google Frontiers in Distribution Testing Workshop FOCS 2017
42

Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Sep 01, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

UnifiedMaximumLikelihoodEstimation

ofSymmetricDistributionProperties

Jayadev AcharyaHirakendu DasAlon Orlitsky AnandaSuresh

Cornell Yahoo UCSD Google

FrontiersinDistributionTestingWorkshopFOCS2017

Page 2: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

SpecialThankstoโ€ฆ

Idonโ€™tsmile Ionlysmile

Page 3: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Symmetricproperties

๐’ซ ={(๐’‘๐Ÿ, โ€ฆ๐’‘๐’Œ) }- collectionofdistributionsover{1,โ€ฆ,k}

Distributionproperty:๐’‡:๐’ซโ†’โ„

๐’‡ issymmetricifunchangedunderinputpermutations

Entropy๐‘ฏ ๐’‘ โ‰œ โˆ‘๐’‘๐’Š๐’๐’๐’ˆ๐Ÿ๐’‘๐’Š

๏ฟฝ๏ฟฝ

Supportsize๐‘บ ๐’‘ โ‰œ โˆ‘ ๐•€ ๐’‘๐’Š5๐ŸŽ๏ฟฝ๐’Š

Rรฉnyi entropy,supportcoverage,distancetouniformity,โ€ฆ

Determinedbytheprobabilitymultiset{๐’‘๐Ÿ, ๐’‘๐Ÿ, โ€ฆ , ๐’‘๐’Œ }

Page 4: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Symmetricproperties

๐’‘ = (๐’‘๐Ÿ, โ€ฆ๐’‘๐’Œ), (๐’Œ finiteorinfinite)- discretedistribution

๐’‡ ๐’‘ , apropertyof๐’‘

๐’‡ ๐’‘ issymmetricifunchangedunderinputpermutations

Entropy ๐‘ฏ ๐’‘ โ‰œ โˆ‘๐’‘๐’Š๐’๐’๐’ˆ๐Ÿ๐’‘๐’Š

๏ฟฝ๏ฟฝ

Supportsize๐‘บ ๐’‘ โ‰œ โˆ‘ ๐•€ ๐’‘๐’Š5๐ŸŽ๏ฟฝ๐’Š

Rรฉnyi entropy,supportcoverage,distancetouniformity,โ€ฆ

Determinedbytheprobabilitymultiset{๐’‘๐Ÿ, ๐’‘๐Ÿ, โ€ฆ , ๐’‘๐’Œ }

Page 5: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Propertyestimation

๐’‘ unknowndistributionin๐’ซ

Givenindependentsamples๐—๐Ÿ, ๐—๐Ÿ, โ€ฆ , ๐—๐ง โˆผ ๐ฉ

Estimate๐’‡ ๐’‘

Samplecomplexity ๐‘บ ๐’‡, ๐’Œ, ๐œบ, ๐œน

Minimum๐‘› necessaryto

Estimate๐’‡ ๐’‘ ยฑ ๐œบ

Witherrorprobability< ๐œน

Page 6: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Plug-inestimation

Use๐‘ฟ๐Ÿ, ๐‘ฟ๐Ÿ, โ€ฆ , ๐‘ฟ๐’ โˆผ ๐’‘tofindanestimate๐’‘Fof๐’‘

Estimate๐’‡ ๐’‘ by๐’‡ ๐’‘F

Howtoestimate๐’‘?

Page 7: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

SequenceMaximumLikelihood(SML)

๐’‘๐ฌ๐ฆ๐ฅ = ๐š๐ซ๐ ๐ฆ๐š๐ฑ๐’‘ ๐’‘(๐’™๐’ )= ๐š๐ซ๐ ๐ฆ๐š๐ฑ

๐’‘(๐’™) ๐šท๐’Š๐’‘(๐’™๐’Š )

๐‘ฅQ= โ„Ž, โ„Ž, ๐‘ก๐’‘๐’‰,๐’‰,๐’•๐’”๐’Ž๐’ = ๐š๐ซ๐ ๐ฆ๐š๐ฑ๐’‘๐Ÿ ๐’‰ ยท ๐’‘(๐’•)

๐’‘๐’‰,๐’‰,๐’•๐’”๐’Ž๐’ (h)=2/3 ๐’‘๐’‰,๐’‰,๐’•๐’”๐’Ž๐’ (t)=1/3

Sameasempirical-frequencydistribution

Multiplicity๐‘ต๐’™ - #times๐‘ฅ appearsin๐’™๐’

๐’‘๐ฌ๐ฆ๐ฅ ๐’™ =๐‘ต๐’™๐’

Page 8: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

PriorWork

Differentestimatorforeachproperty

Usesophisticatedapproximationtheoryresults

A Unified Maximum Likelihood Approach for Estimating Symmetric Properties of Discrete Distributions

Property Notation SML OptimalEntropy H(p) k

"

?Support size S(p)

k

k log 1

"

?Support coverage S

m

(p)

m

m ?Distance to u kp๏ฟฝ uk

1

k

"

2 ?

Table 2. Estimation complexity for various properties, up to a constant factor. For all properties shown, PML achieves the best knownresults. Citations are for specialized techniques, PML results are shown in this paper. Support and support coverage results have beennormalized for consistency with existing literature.

Property Notation SML Optimal ReferencesEntropy H(p) k

"

k

log k

1

"

(Valiant & Valiant, 2011a; Wu &Yang, 2016; Jiao et al., 2015)

Support size S(p)

k

k log 1

"

k

log k

log

2

1

"

(Wu & Yang, 2015)Support coverage S

m

(p)

m

m m

logm

log

1

"

(Orlitsky et al., 2016)Distance to u kp๏ฟฝ uk

1

k

"

2k

log k

1

"

2 (Valiant & Valiant, 2011b; Jiaoet al., 2016)

Table 3. Estimation complexity for various properties, up to a constant factor. For all properties shown, PML achieves the best knownresults. Citations are for specialized techniques, PML results are shown in this paper. Support and support coverage results have beennormalized for consistency with existing literature.

Forseveralimportantproperties

Empirical-frequencypluginrequiresฮ˜ ๐’Œ samples

Newcomplex(non-plugin)estimatorsneedฮ˜ ๐‘˜log ๐‘˜

samples

Page 9: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Entropyestimation

SMLestimateofentropy= โˆ‘ _`alog a

_`๏ฟฝb

Samplecomplexity:ฮ˜ ๐‘˜/๐œ€Variouscorrectionsproposed:Miller-Maddow,Jackknifedestimator,Coverageadjusted,โ€ฆSamplecomplexity:ฮฉ(๐‘˜) foralltheaboveestimators

Page 10: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Entropyestimation[Paninskiโ€™03]:๐‘œ(๐‘˜) samplecomplexity(existential)

[ValiantValiantโ€™11a]:ConstructiveLPbasedmethods:ฮ˜gh

ijk h[ValiantValiant11b,WuYangโ€™14,HanJiaoVenkatWeissmanโ€™14]:

Simplifiedalgorithms,andgrowthrate:ฮ˜ hgijk h

Page 11: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

New(asofAugust)Results

Unified,simple,sample-optimalapproachforallaboveproblems

Plug-inestimator,replacesequencemaximumlikelihood

withprofilemaximumlikelihood

Page 12: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Profiles๐’‰, ๐’‰, ๐’• or ๐’‰, ๐’•, ๐’‰or๐’•, ๐’‰, ๐’•โžž sameestimate

Oneelementappearedonce,onappearedtwice

Profile:Multi-setofmultiplicities:๐šฝ ๐‘ฟ๐Ÿ๐’ = {๐‘ต๐’™: ๐’™ โˆˆ ๐‘ฟ๐Ÿ๐’}

๐šฝ(๐’‰, ๐’‰, ๐’•) = ๐šฝ(๐’•, ๐’‰, ๐’•) = {๐Ÿ, ๐Ÿ}

๐šฝ(๐œถ, ๐œธ, ๐œท, ๐œธ) = {๐Ÿ, ๐Ÿ, ๐Ÿ}

Sufficientstatisticforsymmetricproperties

Page 13: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Profilemaximumlikelihood[+SVZโ€™04]Profile probability

๐‘ ฮฆ = u ๐‘(๐‘ฅva)๏ฟฝ

w bxy zw

Maximizetheprofileprobability

๐‘w{|} = argmax

{๐‘(ฮฆ ๐‘‹va )

Seeโ€œOnestimatingtheprobabilitymultisetโ€,Orlitsky,Santhanam,Viswanathan,Zhangforadetailedtreatment,andanargumentforcompetitivedistributionestimation.

Page 14: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Profilemaximumlikelihood(PML)[+SVZโ€™04]

Profile probability

๐‘ ฮฆ = u ๐‘(๐‘ฅa)๏ฟฝ

by:w bxy zw

DistributionMaximizingtheprofileprobability

๐‘w{|} = argmax

{๐‘(ฮฆ)

PMLcompetitivefordistributionestimation

Page 15: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

PMLexample๐‘‹Q = h, h, t

๐’‘๐’‰,๐’‰,๐’•๐’”๐’Ž๐’ (h)=2/3๐’‘๐’‰,๐’‰,๐’•๐’”๐’Ž๐’ (t)=1/3

ฮฆ h, h, t = 1,2๐‘ ฮฆ = 1,2 = ๐‘ ๐‘ , ๐‘ , ๐‘‘ + ๐‘ ๐‘ , ๐‘‘, ๐‘  + ๐‘ ๐‘‘, ๐‘ , ๐‘ 

=3p(s,s,d)

= Qv โˆ‘ ๐‘๏ฟฝ ๐‘ฅ ๐‘(๐‘ฆ)๏ฟฝ

b๏ฟฝ๏ฟฝ

๐‘๏ฟฝ|} 1, 2 =31

23

๏ฟฝ 13 +

13

๏ฟฝ 23 =

1827 =

23

Page 16: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

PMLof{1,2}

P({1,2})=p(s,s,d)+p(s,d,s)+p(d,s,s)=3p(s,s,d)

p s, s, d = ฮฃ๏ฟฝ๏ฟฝ๏ฟฝ๐‘๏ฟฝ ๐‘ฅ ๐‘ ๐‘ฆ= ฮฃ๏ฟฝ๐‘๏ฟฝ ๐‘ฅ 1 โˆ’ ๐‘ ๐‘ฅ

โ‰คยผฮฃ๏ฟฝ๐‘ ๐‘ฅ = v๏ฟฝ

๐‘{|}(s,s,d)=ยผ

(1/2,1/2)โžž p(s,s,d)=1/8+1/8=1/4

๐‘{|}({1,2})=ยพ Recall:๐‘๏ฟฝ|} 1, 2 =2/3

Page 17: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

PML({1,1,2})

ฮฆ(๐›ผ, ๐›พ, ๐›ฝ, ๐›พ) = {1,1, 2}

๐‘{|} 1,1,2 = ๐‘ˆ[5]

PMLcanpredictexistenceofnewsymbols

Page 18: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

ProfilemaximumlikelihoodPMLof{1,2}is{ยฝ,ยฝ}

๐‘{|} 1,2 =31

12

๏ฟฝ 12 +

12

๏ฟฝ 12 =

34 >

1827

ฮฃ๏ฟฝ๏ฟฝ๏ฟฝ๐‘๏ฟฝ ๐‘ฅ ๐‘ ๐‘ฆ = ฮฃ๏ฟฝ๐‘๏ฟฝ ๐‘ฅ 1 โˆ’ ๐‘ ๐‘ฅ โ‰คยผฮฃ๏ฟฝ๐‘ ๐‘ฅ = v๏ฟฝ

๐‘‹a = ๐›ผ, ๐›พ, ๐›ฝ, ๐›พ,ฮฆ ๐‘‹a = {1,1, 2}

๐‘{|} 1,1,2 = ๐‘ˆ[5]

PMLcanpredictexistenceofnewsymbols

Page 19: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

PMLPlug-inToestimateasymmetricproperty๐‘“

Find๐‘{|} ฮฆ(๐‘‹a)Output๐‘“(๐‘{|})

Simple

UnifiedNotuningparameters

Someexperimentalresults(c.2009)

Page 20: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Uniform500symbols350samples

2x6,3x4,13x3,63x2,161x1242appeared, 258didnot

Page 21: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

U[500],350x,12experiments

Page 22: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Uniform500symbols350samples

2x6,3x4,13x3,63x2,161x1248appeared,258didnot

700samples

Page 23: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

U[500],700x,12experiments

Page 24: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Staircase

15Kelements,5steps,~3x30KsamplesObserve8,882elts6,118missing

Page 25: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Zipf

Underliesmanynaturalphenomenapi=C/i,i=100โ€ฆ15,00030,000samplesObserve9,047elts5,953missing

Page 26: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

1990Census- LastnamesSMITH 1.006 1.006 1JOHNSON 0.810 1.816 2WILLIAMS 0.699 2.515 3JONES 0.621 3.136 4BROWN 0.621 3.757 5DAVIS 0.480 4.237 6MILLER 0.424 4.660 7WILSON 0.339 5.000 8MOORE 0.312 5.312 9TAYLOR 0.311 5.623 10

AMEND 0.001 77.478 18835ALPHIN 0.001 77.478 18836ALLBRIGHT 0.001 77.479 18837AIKIN 0.001 77.479 18838ACRES 0.001 77.480 18839ZUPAN 0.000 77.480 18840ZUCHOWSKI 0.000 77.481 18841ZEOLLA 0.000 77.481 18842

18,839 names77.48% population~230 million

Page 27: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

1990Census- Lastnames18,839 lastnamesbasedon~230 million35,000 samples,observed9,813 names

Page 28: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Coverage(#newsymbols)Zipfdistributionover15Kelements

Sample30KtimesEstimate:#newsymbolsinsampleofsizeฮป*30K

Good-Toulmin:

EstimatePML&predictExtendstoฮป>1Appliestootherproperties

ฮป<1

ฮป>1

Page 29: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

ฮป>1

Page 30: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

FindingthePMLdistribution

EMalgorithm[+ Pan,Sajama,Santhanam,Viswanathan,Zhangโ€™05- โ€™08]

ApproximatePMLviaBethePermanents[Vontobel]

ExtensionsofMarkovChains[Vatedka,Vontobel]

Noprovablealgorithmsknown

MotivatedValiant&Valiant

Page 31: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

MaximumLikelihoodEstimationPlugin

Generalpropertyestimationtechnique

๐’ซ- collectionofdistributionsoverdomain๐’ต

๐‘“:๐’ซ โ†’ โ„ anyproperty(sayentropy)

MLEestimator

Given๐‘ง โˆˆ ๐’ต

Determine๐‘ยชยซยฌยญ โ‰œ argmax{โˆˆ๐’ซ

๐‘ ๐‘ง

Output๐‘“(๐‘ยชยซยฌยญ)

HowgoodisMLE?

Page 32: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

CompetitivenessofMLEplugin

๐’ซ - collectionofdistributionsoverdomain๐’ต

๐‘“ยฎ: ๐’ต โ†’ โ„ any estimaorsuchthatโˆ€๐‘ โˆˆ ๐’ซ, ๐‘ โˆผ ๐‘

Pr ๐‘“ ๐‘ โˆ’ ๐‘“ยฎ ๐‘ > ๐œ€ < ๐›ฟ

MLEpluginerrorboundedby

Pr ๐‘“ ๐‘ โˆ’ ๐‘“(๐‘ยชยซยฌยญ) > 2 โ‹… ๐œ€ < ๐›ฟ โ‹… |๐’ต|

Simple,universal,competitivewithany๐‘“ยฎ

Page 33: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Quiz:Probabilityofunlikelyoutcomes6-sideddie,p=(๐‘v, ๐‘๏ฟฝ, โ€ฆ , ๐‘ยต)

๐‘ยถ โ‰ฅ 0,andฮฃ๐‘ยถ = 1,otherwisearbitrary

Z๏ฝžp

๐‘ƒยบ ๐‘ยช โ‰ค 1/6

(1,0,โ€ฆ,0)โžž Pr(๐‘ยชโ‰ค1/6)=0

(1/6,โ€ฆ,1/6)โžž Pr(๐‘ยช โ‰ค1/6)=1

๐‘ƒยผ ๐‘ยช โ‰ค 0.01

๐‘ƒยผ ๐‘ยช โ‰ค 0.01 = ฮฃยถ:{ยพยฟร€.ร€v ๐‘ยถ โ‰ค 6 โ‹… 0.01 = 0.06

Canbeanything

โ‰ค0.06

Page 34: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

CompetitivenessofMLEplugin- proof๐‘“ยฎ: ๐’ต โ†’ โ„: โˆ€๐‘ โˆˆ ๐’ซ, ๐‘ โˆผ ๐‘โžž Pr ๐‘“ ๐‘ โˆ’ ๐‘“ยฎ ๐‘ > ๐œ€ < ๐›ฟ

thenPr ๐‘“ ๐‘ โˆ’ ๐‘“(๐‘ยชยซยฌยญ) > 2 โ‹… ๐œ€ < ๐›ฟ โ‹… |๐’ต|

Forall๐‘งsuchthat๐‘ ๐‘ง โ‰ฅ ๐›ฟ: 1) ๐‘“ ๐‘ โˆ’ ๐‘“ยฎ ๐‘ง โ‰ค ๐œ€

2)๐‘ยชยซยฌยญ ๐‘ง โ‰ฅ ๐‘ ๐‘ง > ๐›ฟ,hence ๐‘“(๐‘ยชยซยฌยญ) โˆ’ ๐‘“ยฎ ๐‘ง โ‰ค ๐œ€

Triangleinequality: ๐‘“(๐‘ยชยซยฌยญ) โˆ’ ๐‘“ ๐‘ โ‰ค 2๐œ€

If ๐‘“(๐‘ยชยซยฌยญ) โˆ’ ๐‘“ ๐‘ > 2๐œ€then๐‘ ๐‘ง < ๐›ฟ,

Pr ๐‘“ ๐‘ยชยซยฌยญ โˆ’ ๐‘“ ๐‘ > 2๐œ€ โ‰ค Pr ๐‘ ๐‘ < ๐›ฟ โ‰ค u ๐‘(๐‘ง)๏ฟฝ

{ ยช ร†ร‡

โ‰ค ๐›ฟ โ‹… |๐’ต|

Page 35: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

PMLperformancebound

If๐‘› = ๐‘† ๐‘“, ๐‘˜, ๐œ€, ๐›ฟ ,then๐‘†{|} ๐‘“, ๐‘˜, 2 โ‹… ๐œ€, ฮฆa โ‹… ๐›ฟ โ‰ค ๐‘›

|ฮฆa|:numberofprofilesoflength๐‘›

Profileoflengthn:partitionofn

{3},{1,2},{1,1,1}โžž 3,2+1,1+1+1ฮฆa = ๐‘๐‘Ž๐‘Ÿ๐‘ก๐‘–๐‘ก๐‘–๐‘œ๐‘›#๐‘œ๐‘“๐‘›

Hardy-Ramanujan:|ฮฆa| < ๐‘’Q a๏ฟฝ

Easy:๐‘’ijk aโ‹… a๏ฟฝ

If ๐’ = ๐‘บ ๐’‡, ๐’Œ, ๐œบ, ๐’†ร๐Ÿ’ ๐’๏ฟฝ ,then ๐‘บ๐’‘๐’Ž๐’ ๐’‡, ๐’Œ, ๐Ÿ๐œบ, ๐’†ร ๐’๏ฟฝ โ‰ค ๐’

Page 36: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

SummarySymmetricpropertyestimationPMLplug-inapproach

SimpleUniversalSampleoptimalforknownsublinearproperties

FuturedirectionsProvablyefficientalgorithmsIndependentprooftechnique

ThankYou!

Page 37: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

PMLforsymmetricfForany symmetricproperty๐‘“,

if๐‘› = ๐‘† ๐‘“, ๐‘˜, ๐œ€, 0.1 ,then๐‘†{|} ๐‘“, ๐‘˜, 2 โ‹… ๐œ€, 0.1 = ๐‘‚(๐‘›๏ฟฝ).

Proof.Bymediantrick,๐‘† ๐‘“, ๐‘˜, ๐œ€, ๐‘’ร| = ๐‘‚ ๐‘› โ‹… ๐‘š .Therefore,

๐‘†{|} ๐‘“, ๐‘˜, 2 โ‹… ๐œ€, ๐‘’Q aโ‹…|๏ฟฝ ร| = ๐‘‚(๐‘› โ‹… ๐‘š),Pluggingin๐‘š = ๐ถ โ‹… ๐‘›,givesthedesiredresult.

Page 38: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

Bettererrorprobabilitiesโ€“ warmup

Estimatingadistribution๐‘ over[๐‘˜] toL1distance๐œ€ w.p.>0.9 requiresฮ˜(๐‘˜/๐œ€๏ฟฝ) samples.Proof:Exercise.Estimatingadistribution๐‘ over[๐‘˜] toL1distance๐œ€ w.p.1 โˆ’ ๐‘’รh requiresฮ˜(๐‘˜/๐œ€๏ฟฝ) samples.Proof:โ€ข Empiricalestimator๏ฟฝฬ‚๏ฟฝโ€ข |๏ฟฝฬ‚๏ฟฝ โˆ’ ๐‘| hasboundeddifferenceconstant(b.d.c.)2/๐‘›โ€ข ApplyMcDiarmidโ€™s inequality

Page 39: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

BettererrorprobabilitiesRecall

๐‘† ๐ป, ๐œ€, ๐‘˜, 2/3 = ฮ˜๐‘˜

๐œ€ โ‹… log ๐‘˜โ€ข Existingoptimalestimators:highb.d.c.โ€ข Modifythemtohavesmallb.d.c.,andstillbeoptimalโ€ข Inparticular,cangetb.d.c.=๐‘›รร€.ร–ร— (exponentcloseto1)

โ€ข Withtwicethesampleserrordropssuper-fast

๐‘† ๐ป, ๐œ€, ๐‘˜, ๐‘’รaร˜.ร™ = ฮ˜ hgโ‹…ijk h

Similarresultsforotherproperties

Page 40: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

EvenapproximatePMLworks!โ€ข Perhaps findingexactPMLishard.โ€ข EvenapproximatePMLworks.

Findadistribution๐‘ž suchthat

๐‘ž ฮฆ ๐‘‹va โ‰ฅ ๐‘’รa.ร› โ‹… ๐‘{|} ฮฆ(๐‘‹va)

Eventhisisoptimal(forlarge๐‘˜)

Page 41: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

InFisherโ€™swordsโ€ฆ

OfcoursenobodyhasbeenabletoprovethatMLEisbestunderallcircumstances.MLEcomputedwithalltheinformationavailablemayturnouttobeinconsistent.Throwingawayasubstantialpartoftheinformationmayrenderthemconsistent.

R.A.Fisher

Page 42: Unified Maximum Likelihood Estimation of Symmetric ...ccanonne/workshop-focs2017/files/...Unified Maximum Likelihood Estimation of Symmetric Distribution Properties JayadevAcharya

ProofofPMLperformanceIf๐’ = ๐‘บ ๐’‡, ๐’Œ, ๐œบ, ๐œน ,then๐‘บ๐’‘๐’Ž๐’ ๐’‡, ๐’Œ, ๐Ÿ โ‹… ๐œบ, ๐œฑ๐’ โ‹… ๐œน โ‰ค ๐’

๐‘† ๐‘“, ๐‘˜, ๐œ€, ๐›ฟ ,achievedbyanestimator๐‘“ยฎ(ฮฆ(๐‘‹a))

โ€ข Profilesฮฆ ๐‘‹a suchthat๐‘ ฮฆ ๐‘‹a > ๐›ฟ,

๐‘รรžรŸ ฮฆ โ‰ฅ ๐‘ ฮฆ > ๐›ฟ๐‘“ ๐‘wรรžรŸ โˆ’ ๐‘“ ๐‘ โ‰ค ๐‘“ ๐‘wรรžรŸ โˆ’ ๐‘“ยฎ ฮฆ + ๐‘“ยฎ ฮฆ โˆ’ ๐‘“ ๐‘ < 2๐œ€

โ€ข Profileswith๐‘ ฮฆ ๐‘‹a < ๐›ฟ,

๐‘(๐‘ ฮฆ ๐‘‹a < ๐›ฟ < ๐›ฟ โ‹… |ฮฆa|