Top Banner
G89.2247 Lect 2 1 G89.2247 Lecture 2 • Regression as paths and covariance structure • Alternative “saturated” path models • Using matrix notation to write linear models • Multivariate Expectations • Mediation
24

G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

Dec 31, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 1

G89.2247Lecture 2

• Regression as paths and covariance structure

• Alternative “saturated” path models

• Using matrix notation to write linear models

• Multivariate Expectations

• Mediation

Page 2: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 2

Question: Does exposure to childhood foster care (X) lead to adverse outcomes (Y) ?

• Example of purported "causal model" X Y

Y = B0 + B1X + e• Regression approach

B0 and B1 can be estimated using OLS Estimates depend on sample standard deviations of Y and X, sample means,

and covariance between Y and X• B1 = SXY/S2

X

• B0 = MY -B1MX

Correlation, rXY = SXY/SXSY, can be used to estimate the variance of the residual, e, V(e).

• S2e = S2

Y(1-r2XY) = S2

Y - S2XY/S2

X

eB1

Page 3: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 3

A Covariance Structure Approach

• If we have data on Y and X we can compute a covariance matrix

• This estimates the population covariance structure,

Y can itself be expressed as B2

12X + 2

e

Three statistics in the sample covariance matrix are available to estimate three population parameters

2

2

XXY

YXY

SS

SSS

221

2

2`

2

XX

Y

XXY

YXY

B

Page 4: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 4

Covariance Structure Approach, Continued

• A structural model that has the same number of parameters as unique elements in the covariance matrix is "saturated".

• Saturated models always fit the sample covariance matrix.

221

21

2221

XX

XeX

B

BB

Page 5: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 5

Another saturated model: Two explanatory variables

• The first model is likely not to yield an unbiased estimate of foster care because of selection factors (Isolation failure).

• Suppose we have a measure of family disorganization (Z) that is known to have an independent effect on Y and also to be related to who is assigned to foster care (X)

YX

Z

e

XZ

Page 6: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 6

Covariance Structure Expression

• The model: Y=b0+b1X+b2Z+e If we assume E(X)=E(Z)=E(Y)=0 and V(X) = V(Z) = V(Y) = 1 then b0=0 and 's are standardized

• The parameters can be expressed

When sample correlations are substituted, these expressions give the OLS estimates of the regression coefficients.

211 XZ

XZYZYX

22

1 XZ

XZYXYZ

Page 7: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 7

Covariance Structure: 2 Explanatory Variables

• In the standardized case the covariance structure is:

• Each correlation is accounted by two components, one direct and one indirect

• There are three regression parameters and three covariances.

1

1

1

1

1

1

12

21

XZXZ

XZ

XZZY

XZXY

ZYXY

Page 8: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 8

The more general covariance matrix for two IV multiple regression

• If we do not assume variances of unity the regression model implies

212

221

2

12

21

1

1

1

ZYXZZYXZZ

XYXZX

Y

XZXZ

XZ DD

Z

X

Y

V

Page 9: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 9

More Math Review for SEM

• Matrix notation is useful

Z

X

Y

XZXZ

XZXZ

XZXZ

Z

X

Y

Z

X

Y

V

00

00

00

1

1

1

00

00

00

12

21

1221

212

221

2

ZYXZZYXZZ

XYXZX

Y

Page 10: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 10

A Matrix Derivation of OLS Regression

• OLS regression estimates make the sum of squared residuals as small as possible.If Model is

Then we choose B so that e'e is minimized.

• The minimum will occur when the residual vector is orthogonal to the regression planeIn that case, X'e = 0

eBXY

Page 11: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 11

When will X'e = 0? When e is the residual from an OLS fit.

BXXYXBXYXeX ˆ''ˆ'0'

YXXXB

YXXXBXXXX

YXBXX

''ˆ

''ˆ''

'ˆ'

1

11

Page 12: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 12

Multivariate Expectations

• There are simple multivariate generalizations of the expectation facts:E(X+k) = E(X)+k = x+kE(k*X) = k*E(X) = k*x

V(X+k) = V(X) = x2

V(k*X) = k2*V(X) = k2*x2

• Let XT=[X1 X2 X3 X4], T=[] and let k be scalar valueE(k*X) = k*E(X) = k*E(X+k* 1) = {E(X) + k* 1} = + k*1

Page 13: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 13

Multivariate Expectations

• In the multivariate case Var(X) is a matrixV(X)=E[(X-) (X-)T]

24434241

34233231

24232221

14131221

Page 14: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 14

Multivariate Expectations

• The multivariate generalizations of V(X+k) = V(X) = x

2

V(k*X) = k2*V(X) = k2*x2

• Are: Var(X + k*1) = Var(k* X) = k2

• Let cT = [c1 c2 c3 c4]; cT X is a linear combination of the X's. Var(cT X) = cT c

• This is a scalar value• If this positive for all values of c then is positive

definite

Page 15: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 15

Semi Partial Regression Adjustment

• The multiple regression coefficients are estimated taking all variables into accountThe model assumes that for fixed X, Z has an effect

of magnitude Z.Sometimes people say "controlling for X"

• The model explicitly notes that Z has two kinds of association with YA direct association through Z (X fixed)An indirect association through X (magnitude

XXZ)

Page 16: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 16

Pondering Model 1:Simple Multiple Regression

• The semi-partial regression coefficients are often different from the bivariate correlationsAdjustment effectsSuppression effects

• Randomization makes XZ = 0 in probability.

YX

Z

e

XZ

Page 17: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 17

Mathematically Equivalent Saturated Models

• Two variations of the first model suggest that the correlation between X and Z can itself be represented structurally.

YX

Z

eY

eZ

YX

Z

eY

eX

Page 18: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 18

Representation of Covariance Matrix

• Both models imply the same correlation structure

• The interpretation, however, is very different.

1

1

1

1

1

1

3132

231

XZZY

XZXY

ZYXY

Page 19: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 19

Model 2:X leads to Z and Y

• X is assumed to be causally prior to Z.The association between X and Z is due to X effects.

• Z partially mediates the overall effect of X on YX has a direct effect 1 on YX has an indirect effect on Y through ZPart of the bivariate association between Z and Y is spurious (due

to common cause X)

YX

Z

eY

eZ

Page 20: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 20

Model 3:Z leads to X and Y

• Z is assumed to be causally prior to X.The association between X and Z is due to Z effects.

• X partially mediates the overall effect of Z on YZ has a direct effect 2 on YZ has an indirect effect on Y through XPart of the bivariate association between X and Y is spurious (due

to common cause Z)

YX

Z

eY

eX

Page 21: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 21

Choosing between models

• Often authors claim a model is good because it fits to data (sample covariance matrix)All of these models fit the same (perfectly!)

• Logic and theory must establish causal order• There are other possibilities besides 2 and 3

In some instances, X and Z are dynamic variables that are simultaneously affecting each other

In other instances both X and Z are outcomes of an additional variable, not shown.

Page 22: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 22

Mediation: A theory approach

• Sometimes it is possible to argue on theoretical grounds thatZ is prior to X and YX is prior to YThe effect of Z on Y is completely accounted for by the

indirect path through X.

• This is an example of total mediation• If is fixed to zero, then Model 3 is no longer

saturated.Question of fit becomes informativeTotal mediation requires strong theory

Page 23: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 23

A Flawed Example

• Someone might try to argue for total mediation of family disorganization on low self-esteem through placement in foster care

• Baron and Kenny(1986) criteria might be metZ is significantly related to YZ is significantly related to XWhen Y is regressed on Z and X, is significant but is not

significant.

• Statistical significance is a function of sample size.• Logic suggests that children not assigned to foster care

who live in a disorganized family may suffer directly.

Page 24: G89.2247 Lect 21 G89.2247 Lecture 2 Regression as paths and covariance structure Alternative “saturated” path models Using matrix notation to write linear.

G89.2247 Lect 2 24

A More Compelling Example of Complete Mediation

• If Z is an experimentally manipulated variable such as a prime

• X is a measured process variable• Y is an outcome logically subsequent to X

It should make sense that X affects Y for all levels of Z

E.g. Chen and Bargh (1997)• Are participants who have been subliminally primed

with negative stereotype words more likely to have partners who interact with them in a hostile manner?