Top Banner
All Models are Right Thaddeus Tarpey Introduction Parameters Model Underspeci- fication Coefficient Interpreta- tion Probability Models Conclusions All Models are Right ... most are useless Thaddeus Tarpey Wright State University [email protected]
103

All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University [email protected]. All Models are Right Thaddeus

Feb 07, 2019

Download

Documents

trinhdan
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

All Models are Right... most are useless

Thaddeus Tarpey

Wright State University

[email protected]

Page 2: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

George Box’s Quote

“All Models are Wrong, some areuseful”

This quote is useful ... but wrong.

Page 3: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

George Box’s Quote

“All Models are Wrong, some areuseful”

This quote is useful ...

but wrong.

Page 4: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

George Box’s Quote

“All Models are Wrong, some areuseful”

This quote is useful ... but wrong.

Page 5: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Here is an extended quote:

... The fact that the polynomial is an

approximation does not necessarily detract

from its usefulness because all models are

approximations. Essentially, all models are

wrong, but some are useful. However, the

approximate nature of the model must always

be borne in mind.”

From the book: Empirical Model-Building and Response Surfaces (1987, p 424), by Box

and Draper.

Page 6: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Models are Approximations – Canapproximations be Wrong?

π = 3.14

This is WRONG

π ≈ 3.14 is not wrong

Page 7: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Models are Approximations – Canapproximations be Wrong?

π = 3.14 This is WRONG

π ≈ 3.14 is not wrong

Page 8: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Models are Approximations – Canapproximations be Wrong?

π = 3.14 This is WRONG

π ≈ 3.14 is not wrong

Page 9: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Stay Positive

When teaching, why focus on the negativeaspect of Box’s quote:

“Ok class, today I will introduceregression models.

Oh, and by the way,all these models are wrong.”

Instead:“Ok class, today we will introduceregression models which can be veryuseful approximations to the truth.”

Page 10: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Stay Positive

When teaching, why focus on the negativeaspect of Box’s quote:

“Ok class, today I will introduceregression models. Oh, and by the way,all these models are wrong.”

Instead:“Ok class, today we will introduceregression models which can be veryuseful approximations to the truth.”

Page 11: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Stay Positive

When teaching, why focus on the negativeaspect of Box’s quote:

“Ok class, today I will introduceregression models. Oh, and by the way,all these models are wrong.”

Instead:“Ok class, today we will introduceregression models which can be veryuseful approximations to the truth.”

Page 12: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Fallacy of Reification

Fallacy of Reification: When an abstraction(the model) is treated as if it were a realconcrete entity.

The fallacy of reification is committed overand over, even by statisticians, who believea particular model represents the truth ...instead of an approximation.

The model is not wrong but treating themodel as the absolute truth (i.e.reification) is wrong.

Page 13: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Fallacy of Reification

Fallacy of Reification: When an abstraction(the model) is treated as if it were a realconcrete entity.

The fallacy of reification is committed overand over, even by statisticians, who believea particular model represents the truth ...instead of an approximation.

The model is not wrong but treating themodel as the absolute truth (i.e.reification) is wrong.

Page 14: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Fallacy of Reification

Fallacy of Reification: When an abstraction(the model) is treated as if it were a realconcrete entity.

The fallacy of reification is committed overand over, even by statisticians, who believea particular model represents the truth ...instead of an approximation.

The model is not wrong but treating themodel as the absolute truth (i.e.reification) is wrong.

Page 15: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

A Dress, A suit, A Model

If a dress or suit fits nicely, it is useful...

If the model fits the data nicely, it can be auseful approximation to the truth.

“Does this model make me look fat?”

“No dear”

Page 16: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

A Dress, A suit, A Model

If a dress or suit fits nicely, it is useful...

If the model fits the data nicely, it can be auseful approximation to the truth.

“Does this model make me look fat?”

“No dear”

Page 17: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

A Dress, A suit, A Model

If a dress or suit fits nicely, it is useful...

If the model fits the data nicely, it can be auseful approximation to the truth.

“Does this model make me look fat?”

“No dear”

Page 18: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

A Dress, A suit, A Model

If a dress or suit fits nicely, it is useful...

If the model fits the data nicely, it can be auseful approximation to the truth.

“Does this model make me look fat?”

“No dear”

Page 19: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

If we just tweak the language a bit

In the simple linear regression model,

y = β0 + β1x+ ε, .

Saying:“Assume ε is normal” is almost always wrong.

Saying:“Assume ε is approximately normal” will oftenbe accurate.

Page 20: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

If we just tweak the language a bit

In the simple linear regression model,

y = β0 + β1x+ ε, .

Saying:“Assume ε is normal” is almost always wrong.

Saying:“Assume ε is approximately normal” will oftenbe accurate.

Page 21: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

A Quote

Paul Velleman writes:“A model for data, no matter howelegant or correctly derived, must bediscarded or revised if it does not fit thedata or when new or better data arefound and it fails to fit them.”

From “Truth, Damn Truth, and Statistics” in the Journal of Statistical Education, 2008.

Page 22: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Velleman’s quote is useful ... but not always

Newton’s 2nd Law of Motion F = ma hasn’tbeen discarded...

... even though it has been revised due toEinstein’s special theory of relativity

F =d{mv}dt

.

F = ma is still a useful approximation...aslong as you don’t go too fast.

Page 23: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Velleman’s quote is useful ... but not always

Newton’s 2nd Law of Motion F = ma hasn’tbeen discarded...... even though it has been revised due toEinstein’s special theory of relativity

F =d{mv}dt

.

F = ma is still a useful approximation...aslong as you don’t go too fast.

Page 24: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Velleman’s quote is useful ... but not always

Newton’s 2nd Law of Motion F = ma hasn’tbeen discarded...... even though it has been revised due toEinstein’s special theory of relativity

F =d{mv}dt

.

F = ma is still a useful approximation...

aslong as you don’t go too fast.

Page 25: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Velleman’s quote is useful ... but not always

Newton’s 2nd Law of Motion F = ma hasn’tbeen discarded...... even though it has been revised due toEinstein’s special theory of relativity

F =d{mv}dt

.

F = ma is still a useful approximation...aslong as you don’t go too fast.

Page 26: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Cylinder-Shaped Soda Can ExampleModel soda volume as a function of height of soda in can

Volume = β0 + β1height + ε.

Then β0 = 0 and β1 = πr2.

Thad: I’m going to use a reduced model:

Volume = β0 + ε.

Fellow Statistician: “Hey Tarpey, your reducedmodel is wrong.”Thad: “No, it is correct.”

Page 27: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Cylinder-Shaped Soda Can ExampleModel soda volume as a function of height of soda in can

Volume = β0 + β1height + ε.

Then β0 = 0 and β1 = πr2.

Thad: I’m going to use a reduced model:

Volume = β0 + ε.

Fellow Statistician: “Hey Tarpey, your reducedmodel is wrong.”Thad: “No, it is correct.”

Page 28: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Cylinder-Shaped Soda Can ExampleModel soda volume as a function of height of soda in can

Volume = β0 + β1height + ε.

Then β0 = 0 and β1 = πr2.

Thad: I’m going to use a reduced model:

Volume = β0 + ε.

Fellow Statistician: “Hey Tarpey, your reducedmodel is wrong.”

Thad: “No, it is correct.”

Page 29: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Cylinder-Shaped Soda Can ExampleModel soda volume as a function of height of soda in can

Volume = β0 + β1height + ε.

Then β0 = 0 and β1 = πr2.

Thad: I’m going to use a reduced model:

Volume = β0 + ε.

Fellow Statistician: “Hey Tarpey, your reducedmodel is wrong.”Thad: “No, it is correct.”

Page 30: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Soda Cans continued ...

Models used in practice are conditional onavailable information (i.e. variables).

The full model Volume = β0 + β1height + ε isuseless if height of the soda in the can was notmeasured.

The reduced model y = β0 + ε is equivalent toy = µ+ ε ... which is a correct model.

Page 31: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Soda Cans continued ...

Models used in practice are conditional onavailable information (i.e. variables).

The full model Volume = β0 + β1height + ε isuseless if height of the soda in the can was notmeasured.

The reduced model y = β0 + ε is equivalent toy = µ+ ε ... which is a correct model.

Page 32: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Soda Cans continued ...

Models used in practice are conditional onavailable information (i.e. variables).

The full model Volume = β0 + β1height + ε isuseless if height of the soda in the can was notmeasured.

The reduced model y = β0 + ε is equivalent toy = µ+ ε

... which is a correct model.

Page 33: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Soda Cans continued ...

Models used in practice are conditional onavailable information (i.e. variables).

The full model Volume = β0 + β1height + ε isuseless if height of the soda in the can was notmeasured.

The reduced model y = β0 + ε is equivalent toy = µ+ ε ... which is a correct model.

Page 34: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Parameters – A Source of Confusion

In the soda can example, the same symbol β0

is being used to represent two differentparameters.

Question: What is a parameter?

Page 35: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

True Model, Approximation Model

The Truth: Let f(y; θ) denote the density forthe true model; let θ∗ denote the true value ofθ

An Approximation: Let h(y; α) denote aproposed approximation model.

Hopefully α∗ → α∗ as n→∞.

Question: But what is α∗?

α∗ = arg maxα

∫f(y; θ∗) log(h(y; α))dy.

Page 36: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

True Model, Approximation Model

The Truth: Let f(y; θ) denote the density forthe true model; let θ∗ denote the true value ofθ

An Approximation: Let h(y; α) denote aproposed approximation model.

Hopefully α∗ → α∗ as n→∞.

Question: But what is α∗?

α∗ = arg maxα

∫f(y; θ∗) log(h(y; α))dy.

Page 37: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

True Model, Approximation Model

The Truth: Let f(y; θ) denote the density forthe true model; let θ∗ denote the true value ofθ

An Approximation: Let h(y; α) denote aproposed approximation model.

Hopefully α∗ → α∗ as n→∞.

Question: But what is α∗?

α∗ = arg maxα

∫f(y; θ∗) log(h(y; α))dy.

Page 38: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

True Model, Approximation Model

The Truth: Let f(y; θ) denote the density forthe true model; let θ∗ denote the true value ofθ

An Approximation: Let h(y; α) denote aproposed approximation model.

Hopefully α∗ → α∗ as n→∞.

Question: But what is α∗?

α∗ = arg maxα

∫f(y; θ∗) log(h(y; α))dy.

Page 39: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Example: External Fixator to hold a brokenbone in place.

Thad: I’m going to use the slope of a straightline to estimate the stiffness of an externalfixator.

0 1 2 3 4 5

050

100

150

200

250

300

External Fixator Data

Extension (mm)

For

ce (

in N

ewto

ns) Stiffness = Force/Extension

Page 40: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Example: External Fixator to hold a brokenbone in place.

Thad: I’m going to use the slope of a straightline to estimate the stiffness of an externalfixator.

0 1 2 3 4 5

050

100

150

200

250

300

External Fixator Data

Extension (mm)

For

ce (

in N

ewto

ns) Stiffness = Force/Extension

Page 41: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Regression

Fellow Statistician: “Surely, if you fit astraight line to data with a nonlinear trend,then the straight line model is wrong.”

Thad: “No, it is not wrong and quit calling meShirley.”

Page 42: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Regression

Fellow Statistician: “Surely, if you fit astraight line to data with a nonlinear trend,then the straight line model is wrong.”

Thad: “No, it is not wrong and quit calling meShirley.”

Page 43: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Least Squares

Suppose

E[y|x] = f(x; θ) (True Model),

for some unknown function f .

Propose an approximation, f(x; α).Then

α∗ = arg minα

∫(f(x; θ∗)− f(x; α))2dFx.

Page 44: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Least Squares

Suppose

E[y|x] = f(x; θ) (True Model),

for some unknown function f .

Propose an approximation, f(x; α).

Then

α∗ = arg minα

∫(f(x; θ∗)− f(x; α))2dFx.

Page 45: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Least Squares

Suppose

E[y|x] = f(x; θ) (True Model),

for some unknown function f .

Propose an approximation, f(x; α).Then

α∗ = arg minα

∫(f(x; θ∗)− f(x; α))2dFx.

Page 46: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Illustration: A Straight-Line Approximation

True Model: E[y|x] =∑∞

j=0 θjxj.

Extract the Linear Trend: E[y|x] ≈ α0 + α1x.

The least-squares criterion (for x ∼ U(0, 1))gives

α0 = µy − α1µx, and α1 =∞∑j=0

6jθj(j + 2)(j + 1)

.

Page 47: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Illustration: A Straight-Line Approximation

True Model: E[y|x] =∑∞

j=0 θjxj.

Extract the Linear Trend: E[y|x] ≈ α0 + α1x.

The least-squares criterion (for x ∼ U(0, 1))gives

α0 = µy − α1µx, and α1 =∞∑j=0

6jθj(j + 2)(j + 1)

.

Page 48: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Illustration: A Straight-Line Approximation

True Model: E[y|x] =∑∞

j=0 θjxj.

Extract the Linear Trend: E[y|x] ≈ α0 + α1x.

The least-squares criterion (for x ∼ U(0, 1))gives

α0 = µy − α1µx, and α1 =∞∑j=0

6jθj(j + 2)(j + 1)

.

Page 49: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Predict Body Fat % Using Regression(data from Johnson 1996, JSE)

Model body fat percentage as a function ofweight

150 200 250 300

010

2030

40

Body Fat % versus Weight

Weight

Bod

y F

at %

Page 50: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Predict Body Fat % Using Regression(data from Johnson 1996, JSE)

Model body fat percentage as a function ofweight: y = −9.99515 + 0.162Wt.

150 200 250 300

010

2030

40

Body Fat % versus Weight

Weight

Bod

y F

at %

Page 51: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Body Fat % continued ...

Thad: “I just fit a line to the body fatpercentage (y) versus weight data.”

Fellow Statistician: “Tarpey, your model iswrong...under-specified – there are othervariables that also predict body fat percentage;your estimated slope will be biased. You needmore predictors”

Page 52: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Body Fat % continued ...

Thad: “I just fit a line to the body fatpercentage (y) versus weight data.”Fellow Statistician: “Tarpey, your model iswrong...

under-specified – there are othervariables that also predict body fat percentage;your estimated slope will be biased. You needmore predictors”

Page 53: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Body Fat % continued ...

Thad: “I just fit a line to the body fatpercentage (y) versus weight data.”Fellow Statistician: “Tarpey, your model iswrong...under-specified – there are othervariables that also predict body fat percentage;

your estimated slope will be biased. You needmore predictors”

Page 54: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Body Fat % continued ...

Thad: “I just fit a line to the body fatpercentage (y) versus weight data.”Fellow Statistician: “Tarpey, your model iswrong...under-specified – there are othervariables that also predict body fat percentage;your estimated slope will be biased. You needmore predictors”

Page 55: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Multiple Regression

In multiple regression with two predictors x1

and x2 correlated to each other and to y:

Full Model : y = β0 + β1x1 + β2x2 + ε.

Reduced Model : y = β0 + β1x1 + ε.

We may drop x2 for the sake of modelparsimony or because x2 does not appearsignificant.

Question: What is wrong with what I havewritten here?

Page 56: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Multiple Regression

In multiple regression with two predictors x1

and x2 correlated to each other and to y:

Full Model : y = β0 + β1x1 + β2x2 + ε.

Reduced Model : y = β0 + β1x1 + ε.

We may drop x2 for the sake of modelparsimony or because x2 does not appearsignificant.

Question: What is wrong with what I havewritten here?

Page 57: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Full and Reduced Models

The coefficient β1 in the full model is neverthe same as β1 in the reduced model unlessβ2 = 0.

β2 in the full model equals zero if and onlyif

cor(x1, x2) =cor(x2, y)

cor(x1, y).

Hence, β2 cannot be zero ifcor(x2, y) > cor(x1, y).

Page 58: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Full and Reduced Models

The coefficient β1 in the full model is neverthe same as β1 in the reduced model unlessβ2 = 0.

β2 in the full model equals zero if and onlyif

cor(x1, x2) =cor(x2, y)

cor(x1, y).

Hence, β2 cannot be zero ifcor(x2, y) > cor(x1, y).

Page 59: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Full and Reduced Models

The coefficient β1 in the full model is neverthe same as β1 in the reduced model unlessβ2 = 0.

β2 in the full model equals zero if and onlyif

cor(x1, x2) =cor(x2, y)

cor(x1, y).

Hence, β2 cannot be zero ifcor(x2, y) > cor(x1, y).

Page 60: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Model Under-specification

If β2 6= 0, then β1 in the reduced model is adifferent parameter than in the full model.

In the model under-specification literature,β1 in the reduced model is called biased.

According to this logic, β1 in a simplelinear regression is always biased if thereexists any other predictor more highlycorrelated with the response.

Page 61: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Model Under-specification

If β2 6= 0, then β1 in the reduced model is adifferent parameter than in the full model.

In the model under-specification literature,β1 in the reduced model is called biased.

According to this logic, β1 in a simplelinear regression is always biased if thereexists any other predictor more highlycorrelated with the response.

Page 62: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Model Under-specification

If β2 6= 0, then β1 in the reduced model is adifferent parameter than in the full model.

In the model under-specification literature,β1 in the reduced model is called biased.

According to this logic, β1 in a simplelinear regression is always biased if thereexists any other predictor more highlycorrelated with the response.

Page 63: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Back to Body Fat

Thad: Ok friend, to minimize theunder-specification problem, I’ll add thepredictor abdomen circumference to my model:

y = −41.35 + 0.92(abdomen)− 0.14(weight).

Are you happy now?Fellow Statistician: Ah-ha! The coefficient ofweight has the wrong sign now. Your model isclearly wrong. In your face Tarpey!

Thad: No, the model is clearly right.

Page 64: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Back to Body Fat

Thad: Ok friend, to minimize theunder-specification problem, I’ll add thepredictor abdomen circumference to my model:

y = −41.35 + 0.92(abdomen)− 0.14(weight).

Are you happy now?Fellow Statistician: Ah-ha! The coefficient ofweight has the wrong sign now. Your model isclearly wrong. In your face Tarpey!

Thad: No, the model is clearly right.

Page 65: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Back to Body Fat

Thad: Ok friend, to minimize theunder-specification problem, I’ll add thepredictor abdomen circumference to my model:

y = −41.35 + 0.92(abdomen)− 0.14(weight).

Are you happy now?

Fellow Statistician: Ah-ha! The coefficient ofweight has the wrong sign now. Your model isclearly wrong. In your face Tarpey!

Thad: No, the model is clearly right.

Page 66: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Back to Body Fat

Thad: Ok friend, to minimize theunder-specification problem, I’ll add thepredictor abdomen circumference to my model:

y = −41.35 + 0.92(abdomen)− 0.14(weight).

Are you happy now?Fellow Statistician: Ah-ha! The coefficient ofweight has the wrong sign now. Your model isclearly wrong. In your face Tarpey!

Thad: No, the model is clearly right.

Page 67: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Back to Body Fat

Thad: Ok friend, to minimize theunder-specification problem, I’ll add thepredictor abdomen circumference to my model:

y = −41.35 + 0.92(abdomen)− 0.14(weight).

Are you happy now?Fellow Statistician: Ah-ha! The coefficient ofweight has the wrong sign now. Your model isclearly wrong. In your face Tarpey!

Thad: No, the model is clearly right.

Page 68: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Coefficient Interpretation in Multiple Regression

The usual interpretation of a coefficient, say βjof a predictor xj, is that

βj represents the mean change in theresponse for a unit change in xjprovided all other predictors are heldconstant.

Page 69: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

y = −41.35 + 0.92(abdomen)− 0.14(weight).

Estimated coefficient of weight in the fullmodel: −0.14.

Consider the population of men with somefixed abdomen circumference value.

What happens to body fat percentage asthe weights of men in this group increase?

Body fat % will go down ... hence thenegative coefficient.

Page 70: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

y = −41.35 + 0.92(abdomen)− 0.14(weight).

Estimated coefficient of weight in the fullmodel: −0.14.

Consider the population of men with somefixed abdomen circumference value.

What happens to body fat percentage asthe weights of men in this group increase?

Body fat % will go down ... hence thenegative coefficient.

Page 71: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

y = −41.35 + 0.92(abdomen)− 0.14(weight).

Estimated coefficient of weight in the fullmodel: −0.14.

Consider the population of men with somefixed abdomen circumference value.

What happens to body fat percentage asthe weights of men in this group increase?

Body fat % will go down ... hence thenegative coefficient.

Page 72: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

y = −41.35 + 0.92(abdomen)− 0.14(weight).

Estimated coefficient of weight in the fullmodel: −0.14.

Consider the population of men with somefixed abdomen circumference value.

What happens to body fat percentage asthe weights of men in this group increase?

Body fat % will go down ...

hence thenegative coefficient.

Page 73: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

y = −41.35 + 0.92(abdomen)− 0.14(weight).

Estimated coefficient of weight in the fullmodel: −0.14.

Consider the population of men with somefixed abdomen circumference value.

What happens to body fat percentage asthe weights of men in this group increase?

Body fat % will go down ... hence thenegative coefficient.

Page 74: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Probability Models

In life, all probability is conditional.

Basically, “...randomness is fundamentallyincomplete information (Taleb, Black Swan, p198).

Page 75: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Probability Models

In life, all probability is conditional.

Basically, “...randomness is fundamentallyincomplete information (Taleb, Black Swan, p198).

Page 76: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Card ... any card

Thad: “Ok my statistical friend, pick a card,any card.”

Unbeknownst to me, my friend picked an Ace.Question: What is P (Ace)?Answer:Fellow Statistician: 1

Thad: 4/52 (I haven’t seen the card).

Page 77: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Card ... any card

Thad: “Ok my statistical friend, pick a card,any card.”

Unbeknownst to me, my friend picked an Ace.

Question: What is P (Ace)?Answer:Fellow Statistician: 1

Thad: 4/52 (I haven’t seen the card).

Page 78: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Card ... any card

Thad: “Ok my statistical friend, pick a card,any card.”

Unbeknownst to me, my friend picked an Ace.Question: What is P (Ace)?

Answer:Fellow Statistician: 1

Thad: 4/52 (I haven’t seen the card).

Page 79: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Card ... any card

Thad: “Ok my statistical friend, pick a card,any card.”

Unbeknownst to me, my friend picked an Ace.Question: What is P (Ace)?Answer:Fellow Statistician: 1

Thad: 4/52 (I haven’t seen the card).

Page 80: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Card ... any card

Thad: “Ok my statistical friend, pick a card,any card.”

Unbeknownst to me, my friend picked an Ace.Question: What is P (Ace)?Answer:Fellow Statistician: 1

Thad: 4/52 (I haven’t seen the card).

Page 81: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Confidence Intervals

A quote from Devore and Peck’s book Statistics: The Exploration

and Analysis of Data (2005, p 373) regarding the 95% confidence

interval for a proportion π:

“... it is tempting to say there is a‘probability’ of .95 that π is between.499 and .561. Do not yield to thistemptation!...Any specific interval ...either includes π or it does not...Wecannot make a chance statementconcerning this particular interval.”

Page 82: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Sample ... any sample

Compute a 95% confidence interval for µ.

Chance selects a random sample of size n...

95% of all possible confidence intervals containµ ... Chance has picked one of them forus...similar to picking a card from the deck.

Page 83: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Sample ... any sample

Compute a 95% confidence interval for µ.

Chance selects a random sample of size n...

95% of all possible confidence intervals containµ ... Chance has picked one of them forus...similar to picking a card from the deck.

Page 84: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Sample ... any sample

Compute a 95% confidence interval for µ.

Chance selects a random sample of size n...

95% of all possible confidence intervals containµ ... Chance has picked one of them forus...similar to picking a card from the deck.

Page 85: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Sample ... any sample

Question: What is the probability that myinterval contains µ?

Answer:For the Omniscient: 0 or 1

For me: 0.95

Fellow Statistician: Didn’t your read yourDevore and Peck book?

Thad: If I don’t know if µ is in the confidenceinterval... from my perspective, there isuncertainty; the probability cannot be 0 or 1.The correct probability model is conditional.

Page 86: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Sample ... any sample

Question: What is the probability that myinterval contains µ?

Answer:For the Omniscient: 0 or 1

For me: 0.95

Fellow Statistician: Didn’t your read yourDevore and Peck book?

Thad: If I don’t know if µ is in the confidenceinterval... from my perspective, there isuncertainty; the probability cannot be 0 or 1.The correct probability model is conditional.

Page 87: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Sample ... any sample

Question: What is the probability that myinterval contains µ?

Answer:For the Omniscient: 0 or 1

For me: 0.95

Fellow Statistician: Didn’t your read yourDevore and Peck book?

Thad: If I don’t know if µ is in the confidenceinterval... from my perspective, there isuncertainty; the probability cannot be 0 or 1.The correct probability model is conditional.

Page 88: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Sample ... any sample

Question: What is the probability that myinterval contains µ?

Answer:For the Omniscient: 0 or 1

For me: 0.95

Fellow Statistician: Didn’t your read yourDevore and Peck book?

Thad: If I don’t know if µ is in the confidenceinterval... from my perspective, there isuncertainty; the probability cannot be 0 or 1.The correct probability model is conditional.

Page 89: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Sample ... any sample

Question: What is the probability that myinterval contains µ?

Answer:For the Omniscient: 0 or 1

For me: 0.95

Fellow Statistician: Didn’t your read yourDevore and Peck book?

Thad: If I don’t know if µ is in the confidenceinterval...

from my perspective, there isuncertainty; the probability cannot be 0 or 1.The correct probability model is conditional.

Page 90: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Sample ... any sample

Question: What is the probability that myinterval contains µ?

Answer:For the Omniscient: 0 or 1

For me: 0.95

Fellow Statistician: Didn’t your read yourDevore and Peck book?

Thad: If I don’t know if µ is in the confidenceinterval... from my perspective, there isuncertainty; the probability cannot be 0 or 1.

The correct probability model is conditional.

Page 91: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Pick a Sample ... any sample

Question: What is the probability that myinterval contains µ?

Answer:For the Omniscient: 0 or 1

For me: 0.95

Fellow Statistician: Didn’t your read yourDevore and Peck book?

Thad: If I don’t know if µ is in the confidenceinterval... from my perspective, there isuncertainty; the probability cannot be 0 or 1.The correct probability model is conditional.

Page 92: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Conclusions

Calling a model right or wrong is just a matterof perspective.

With enough data, any imperfection in amodel can be detected.

The temptation then is to say all models arewrong.

However, if we regard models asapproximations to the truth, we could just aseasily call all models right.

Page 93: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Conclusions

Calling a model right or wrong is just a matterof perspective.

With enough data, any imperfection in amodel can be detected.

The temptation then is to say all models arewrong.

However, if we regard models asapproximations to the truth, we could just aseasily call all models right.

Page 94: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Conclusions

Calling a model right or wrong is just a matterof perspective.

With enough data, any imperfection in amodel can be detected.

The temptation then is to say all models arewrong.

However, if we regard models asapproximations to the truth, we could just aseasily call all models right.

Page 95: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Conclusions

Calling a model right or wrong is just a matterof perspective.

With enough data, any imperfection in amodel can be detected.

The temptation then is to say all models arewrong.

However, if we regard models asapproximations to the truth, we could just aseasily call all models right.

Page 96: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Conclusions

In any given data analysis situation, amultitude of models can be proposed.

Most of these will be useless ...

and perhaps a few will be useful.

Page 97: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Conclusions

In any given data analysis situation, amultitude of models can be proposed.

Most of these will be useless ...

and perhaps a few will be useful.

Page 98: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Conclusions

In any given data analysis situation, amultitude of models can be proposed.

Most of these will be useless ...

and perhaps a few will be useful.

Page 99: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Conclusions

Models have served us very well ...

and also, at times, quite poorly.

Page 100: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Conclusions

Models have served us very well ...

and also, at times, quite poorly.

Page 101: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Some quotes: Breiman 2001 Statistical Science

“...as data becomes more complex, thedata models become more cumbersomeand are losing the advantage ofpresenting a simple and clear picture ofnature’s mechanism (p 204)...

Unfortunately, our field has a vestedinterest in data models, come hell orhigh water (p 214).”

Page 102: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

Some quotes: Taleb, Black Swan

“...the gains in our ability to model(and predict) the world may be dwarfedthe increases in its complexity (p 136)”

Page 103: All Models are Right - most are useless · All Models are Right... most are useless Thaddeus Tarpey Wright State University thaddeus.tarpey@wright.edu. All Models are Right Thaddeus

All Modelsare Right

ThaddeusTarpey

Introduction

Parameters

ModelUnderspeci-fication

CoefficientInterpreta-tion

ProbabilityModels

Conclusions

A Final Quote:

Or, as Peter Norvig, Google’s researchdirector, says

Let’s stop expecting to find a simpletheory, and instead embrace complexity,and use as much data as well as we canto help define (or estimate) the complexmodels we need for these complexdomains.

∗ From http://norvig.com/fact-check.html. Note, Norvig was misquoted using avariation of the Box quote in: The End of Theory: The Data Deluge Makes the ScientificMethod Obsolete” by Chris Anderson in Wired Magazine, 2008.