Top Banner
Goodness of Fit G. Jogesh Babu Center for Astrostatistics http://astrostatistics.psu.edu
36

Goodness of Fit - Center for Astrostatistics

Feb 03, 2022

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Goodness of Fit - Center for Astrostatistics

Goodness of Fit

G. Jogesh BabuCenter for Astrostatistics

http://astrostatistics.psu.edu

Page 2: Goodness of Fit - Center for Astrostatistics

Astrophysical Inference from Data

Fitting Astrophysical data • Non-linear regression• Density (shape) estimation• Parametric modeling• Goodness of fit

Page 3: Goodness of Fit - Center for Astrostatistics

Chandra X-ray Observatory ACIS dataCOUP source # 410 in Orion Nebula with 468 photons

Fitting to binned data using χ2 (XSPEC package)Thermal model with absorption, A_V~1 mag

Page 4: Goodness of Fit - Center for Astrostatistics

Fitting to unbinned EDF Maximum likelihood (C-statistic)Thermal model with absorption

Page 5: Goodness of Fit - Center for Astrostatistics

Empirical Distribution Function

Page 6: Goodness of Fit - Center for Astrostatistics

Correct model family, incorrect parameter valueThermal model with absorption set at A_V~10 mag

Question : What is the 99% confidence interval for A_V?

Page 7: Goodness of Fit - Center for Astrostatistics

Incorrect model family Power law model, absorption A_V~1 mag

Question : Can a power law model be excluded with 99% confidence?

Page 8: Goodness of Fit - Center for Astrostatistics

K-S Confidence bands

Page 9: Goodness of Fit - Center for Astrostatistics

Model fittingAim: To provide most parsimonious `best’ fitTo answer:• Is the underlying nature of a stellar spectrum a

non-thermal power law or a thermal gas with absorption?

• Are the fluctuations in the cosmic microwave background best fit by Big Bang models with dark energy or with quintessence?

• Are there interesting correlations among the properties of objects in any given class (e.g. the Fundamental Plane of elliptical galaxies), and what are the optimal analytical expressions of such correlations?

Page 10: Goodness of Fit - Center for Astrostatistics

• These issues arise when data are used to repudiate or support astrophysical theories but the underlying processes generating the data are not confidently known.

• We have developed nonparametric resampling methods for inference, when the data come from an unknown distribution which may or may not belong to a specified family

Page 11: Goodness of Fit - Center for Astrostatistics

Statistics Based on EDFKolmogrov-Smirnov: Supx |Fn(x) - F(x)|,

Supx (Fn(x) - F(x))+, Supx (Fn(x) - F(x))-

Cramer - van Mises:

Anderson - Darling:

All of these statistics are distribution free

dF(x)F(x))(x)(F 2n −∫

dF(x) F(x))F(x)(1

F(x))(x)(F 2n∫ −

Page 12: Goodness of Fit - Center for Astrostatistics
Page 13: Goodness of Fit - Center for Astrostatistics

Statistics Based on EDFKolmogrov-Smirnov: Supx |Fn(x) - F(x)|

Cramer - van Mises:

Anderson - Darling:

All of these statistics are no longer distribution freeif parameters are estimated or the data ismultivariate.

dF(x)F(x))(x)(F 2n −∫

dF(x) F(x))F(x)(1

F(x))(x)(F 2n∫ −

Page 14: Goodness of Fit - Center for Astrostatistics

Multivariate CaseWarning: K-S does not work in multidimensions

Example – Paul B. Simpson (1951)

F(x,y) = ax2 y + (1 – a) y2 x, 0 < x, y < 1

(X1, Y1) data from F, F1 EDF of (X1, Y1)

P(| F1(x,y) - F(x,y)| < 0.72, for all x, y) is> 0.065 if a = 0, (F(x,y) = y2 x)< 0.058 if a = 0.5, (F(x,y) = xy(x+y)/2)

Page 15: Goodness of Fit - Center for Astrostatistics

Processes with estimated Parameters

{F(.; θ): θ ε Θ} - a family of distributionsX1, …, Xn sample from F

Kolmogorov-Smirnov, Cramer-von Mises etc.,when θ is estimated from the data, areContinuous functionals of the empirical process

Yn (x; θn) = (Fn (x) – F(x; θn))n

Page 16: Goodness of Fit - Center for Astrostatistics

In the Gaussian case,

θ = (µ,σ2) and )s,X(θ 2nn =

∑=

=n

1iiX

n1X

∑=

−=n

1i

2i

2n )X(X

n1s

Page 17: Goodness of Fit - Center for Astrostatistics

• Darling, D. A. (1955). The Cramér-Smirnov test in the parametric case. Ann. Math. Statist., 26, 1–20.

• Kac, M., Kiefer, J., and Wolfowitz, J. (1955). On tests of normality and other tests of goodness of fit based on distance methods. Ann. Math. Statist., 26, 189–211.

• Durbin, J. (1973). Weak convergence of the sample distribution function when parameters are estimated. Ann. of Statist., 1, 279–290.

Page 18: Goodness of Fit - Center for Astrostatistics

Bootstrap

Page 19: Goodness of Fit - Center for Astrostatistics

Parametric BootstrapX1

*, …, Xn* sample generated from F(.; θn).

In Gaussian case .

Both supx |Fn (x) – F(x; θn)| and

supx |Fn* (x) – F(x; θn

*)|have the same limiting distribution

(In the XSPEC packages, the parametric bootstrap is command FAKEIT, which makes Monte Carlo simulation of specified spectral model)

)s,X(θ *2n

*n

*n =

n

n

Page 20: Goodness of Fit - Center for Astrostatistics
Page 21: Goodness of Fit - Center for Astrostatistics

Nonparametric BootstrapX1

*, …, Xn* i.i.d. from Fn.

A bias correctionBn(x) = Fn (x) – F(x; θn) is needed.

supx |Fn (x) – F(x; θn)| and

supx |Fn* (x) – F(x; θn

*) - Bn (x) |have the same limiting distribution

(XSPEC does not provide a nonparametric bootstrap capability)

n

n

Page 22: Goodness of Fit - Center for Astrostatistics
Page 23: Goodness of Fit - Center for Astrostatistics

• Chi-Square type statistics – (Babu, 1984, Statistics with linear combinations of chi-squares as weak limit. Sankhya, Series A, 46, 85-93.)

• U-statistics – (Arcones and Giné, 1992, On the bootstrap of U and V statistics. Ann. of Statist., 20, 655–674.) Resampling methods for semi-parametricgoodness-of-fit tests – p. 12/28

Page 24: Goodness of Fit - Center for Astrostatistics

Confidence limits under misspecification of model family

X1, …, Xn data from unknown H.H may or may not belong to the family {F(.; θ): θ ε Θ}.

H is closest to F(.; θ0), in Kullback - Leibler information

h(x) log (h(x)/f(x; θ)) dν(x) 0

h(x) |log (h(x)| dν(x) <

h(x) log f(x; θ0) dν(x) = maxθ h(x) log f(x; θ) dν(x)

∫ ∫

Page 25: Goodness of Fit - Center for Astrostatistics

• White, H. (1982). Maximum likelihood estimation of misspecified models. Econometrica, 50, 1–25.

• Nishii, R. (1988). Maximum likelihood principle and model selection when the true model is unspecified. J. Multi. Analysis., 27, 392–403.

• Foutz, R. V., and Srivastava, R. C. (1977). The performance of the likelihood ratio test when the model is incorrect. Ann. Statist., 5, 1183–1194.

Page 26: Goodness of Fit - Center for Astrostatistics
Page 27: Goodness of Fit - Center for Astrostatistics

For any 0 < α < 1,P( supx |Fn (x) – F(x; θn) – (H(x) – F(x; θ0)) | <Cα

*) α

Cα* is the α-th quantile of

supx |Fn* (x) – F(x; θn

*) – (Fn (x) – F(x; θn)) |

This provide an estimate of the distance between the true distribution and the family of distributions under consideration.

n

n

Page 28: Goodness of Fit - Center for Astrostatistics

Similar conclusions can be drawn forvon Mises-type statistics

(Fn (x) – F(x; θn) – (H(x) – F(x; θ0)))2 d F(x; θ0)∫

Page 29: Goodness of Fit - Center for Astrostatistics
Page 30: Goodness of Fit - Center for Astrostatistics
Page 31: Goodness of Fit - Center for Astrostatistics

The Covariance Function

Page 32: Goodness of Fit - Center for Astrostatistics
Page 33: Goodness of Fit - Center for Astrostatistics
Page 34: Goodness of Fit - Center for Astrostatistics
Page 35: Goodness of Fit - Center for Astrostatistics

References• Bootstrap Methodology

G. J. Babu and C. R. Rao (1993), Handbook of Statistics, Vol 9, Chapter 19.

• Bootstrap Techniques for Signal ProcessingAbdelhak M. Zoubir and D. Robert Iskander, Cambridge, U.K.: Cambridge University Press, 2004.

This book serves as a handbook on bootstrap for engineers, to analyze complicated data with little or no model assumptions. Bootstrap has found many applications in engineering field including, artificial neural networks, biomedical engineering, environmental engineering, image processing, and Radar and sonar signal processing. Majority of the applications are taken from signal processing literature.

Page 36: Goodness of Fit - Center for Astrostatistics

Summary

• MLE and Chi-square fits are often OK, but it is hard to evaluate goodness of fit

• KS probabilities often wrong (multivariate, estimated parameters)

• Use instead parametric bootstrap to get parametric confidence bands and evaluate different model families