Structural modelling: Causality, exogeneity and unit roots Andrew P. Blake CCBS/HKMA May 2004.

Post on 14-Dec-2015

212 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

Transcript

Structural modelling: Causality, exogeneity and unit roots

Andrew P. Blake

CCBS/HKMA May 2004

What do we need to do with our data?

• Estimate structural equations (i.e. understand what’s happening now)

• Forecast (i.e. say something about what’s likely to happen in the future)

• Conduct scenario analysis (i.e. perform simulations) to inform policy

What do we need to know?

• Inter-relationships between variables– Causality in the Granger sense– Exogeneity

• Concepts

– Unit roots• Spurious regression

• Role of pre-testing

• Appropriate single equation methods

-0.08

-0.04

0.00

0.04

0.08

94 95 96 97 98 99 00 01

X Y

Period t Period t+1

xt

yt yt+1

xt+1

Inter-relationships between variables

How best to estimate an equation?

• Single equation structural model (estimated by OLS)

• Single equation reduced form (IV/OLS)

• Structural system (estimated by TSLS, 3SLS or by a system method - SUR, FIML)

• Unrestricted VAR (OLS)

• VECM (FIML)

xt is autoregressive

Period t Period t+1

xt

yt yt+1

xt+1

xt has an autoregressive representation

Period t Period t+1

xt

yt yt+1

xt+1

xt has an ARMA representation

11

11

111

1

,

1

ttttt

tttt

ttt

ttt

ttt

xxso

xy

xy

yy

yx

Structural system

Reduced form

}

Period t Period t+1

xt

yt yt+1

xt+1

Granger Causality

Period t Period t+1

xt

yt yt+1

xt+1

Vector autoregressions (VARs)

Needs to be modelled to have

a structural interpretation

Granger causality

• If past values of y help to explain x, then y Granger causes x

• Statistical concept

• A lack of Granger causality does not imply no causal relationship

GC tested by an unrestricted VAR

• Definition of Granger Causality:– y does not Granger cause x if a12=b12=...=0– x does not Granger cause y if a21=b21=...=0

• NB. x and y could still affect each other in the same period or via unmeasured common shocks to the error terms.

tttttt

tttttt

ybxbyaxay

ybxbyaxax

...

...

222221122121

212211112111

Eviews Granger causality test resultNull Hypothesis F-Statistic Probability

x does not Granger Cause y F1 P1

y does not Granger Cause x F2 P2

• The closer P1 is to zero, the less the likelihood of accepting the null that x does not Granger cause y.

• (P1<0.10 : at least 90% confident that s1 Granger causes s2).

• P1 should be less than 0.10 for us to be reasonably confident that x Granger causes y.

y is a leading indicator of x if

• y Granger causes x;

• x does not Granger cause y;

• and y is weakly exogenous.

Leading indicators

Long term trends of money and prices in UK

0.0

5.0

10.0

15.0

20.0

25.0

30.0

% o

n y

ear

earl

ier,

sm

oo

thed

, p

rices l

ag

ged

6

qu

art

ers

Broad Money Prices

Criticisms of Granger causality

• Granger causality can be assessed using an unrestricted VAR - not tied to any particular theory

• How would you explain to your governor when it goes wrong?

• It depends on the choice of lags, data frequency and variables in VAR

Exogeneity

• Engle et al. (1983)– Separate parameters into two groups– Those that matter, those that don’t

• These are endogenous and weakly exogenous variables

• In practice a bit more complicated than that

Exogeneity (cont.)

• Correct assumptions of exogeneity simplify modeling, reduce computational expense and aid interpretation

• But incorrect assumptions may lead to inefficient or inconsistent estimates and misleading forecasts

Exogeneity (cont.)

• A variable is exogenous if it can be taken as given without losing information for the purpose at hand

• This varies with the situation

• We do not want the independent variables to be correlated with the regressors

• If they are, the estimates will be biased

Period t Period t+1

xt

yt yt+1

xt+1

Relationships between variables

• We do not want the black arrows

• We need to understand the red arrows

Both demand and supply shocks

0

2

4

6

8

10

12

14

0 1 2 3 4 5 6

P

Q

OLS is unable to identify either the demand or supply curve

Only supply shocks

0

2

4

6

8

10

12

14

0 1 2 3 4 5 6

P

Q

We can identify the demand schedule using OLS

Weak exogeneity

• Is y weakly exogenous with respect to x?• Do values of current x affect current y?• Are x and y both affected by a common

unmeasured third variable?• Does the range of possible values for the

parameters in the process that determines x affect the possible values of those that determine y

Weak exogeneity: example 1

• Money demand function:

• Would you estimate this as a single equation using OLS?

• Very unlikely that money does not affect real output or the nominal interest rate

ttt rym

Weak exogeneity: example 2

• Uncovered interest parity:

• Tests of UIP have performed very poorly, but ...

• No risk premia and monetary policy might react to exchange rate changes

*1tE ttt rre

Interest rate differentials

Exchange rate change

Question: how would you test for exogeneity in UIP?

Weak exogeneity: example 3

• In UK consumption had been forecast using single-equation ECM

• But relationship broke down in late 1980s

• Problem was that possibility that wealth reactions to disequilibrium had been ignored

11

11

...

...

tt

tttt

xy

xxyy

Single Equation ECM

Dynamic terms

Long run

Vector ECMS

Halfway between structural VARs and unrestricted VARs

ECMyxxy

ECMyxyx

tttt

tttt

21221212

11121111

...

...

Strong exogeneity

• Necessary for forecasting

• Is y strongly exogenous to x?– Is y weakly exogenous to x– Does x Granger cause y?

• Need the answers to be yes and no respectively

Strong exogeneity: example

First order VAR, ‘core’ and non-‘core’ inflation:

Given a forecast of {yt} can we forecast {xt}?

• If y is not strongly exogenous to x, feedback problems

', ,1 ttttt-t yxzAzz

Super exogeneity

Necessary for policy/scenario analysis. Is y super exogenous to x?

• Is y weakly exogenous to x?

• Is the relationship between x and y invariant?

Need the answers to be yes to both

Invariance

• The process driving a variable does not change in the face of shocks

• Linked to ‘deep parameters’

• Example: the Lucas critique

Testing for weak exogeneity: orthogonality test

• Estimate a reduced form (marginal model) for x, regress x on any exogenous variables of the system

• Take residuals from this reduced form and put them into the structural equation for y

• If they are significant then x is not weakly exogenous with respect to the estimation of c10

Testing for weak exogeneity with respect to c(lr)

• Estimate a reduced form (marginal model) for x: regress x on exogenous variables of system, including lagged ECM term involving x and y

• Test if coefficient of ECM term is significant• If it is, then x is not weakly exogenous with

respect to the estimation of long-run coeff, c(lr)• Consequence is that estimate is inefficient

Stationarity

• Why should we test whether series are stationary?• A non-stationary time series implies that shocks

never die out• The mean, variance and higher moments depend

on time• Standard statistics do not have standard

distributions• Problem of spurious regression

Non-stationarity

• Start with the following expression

yt = + yt-1 + ut u, 2• Substitute recursively:

yt = n + n yt-n + n-1jut-j

• The variable will be non-stationary if =E(y)=t

Var(y) = Var(n-1ut-j - t) = t 2

• Displays time dependency

Non-stationarity (cont.)

t is a stochastic trend• The series drifts upwards or downwards

depending on sign of ; increases if positive• Stationary series tend to return to its mean value

and fluctuate around it within a more-or-less constant range

• Non-stationary series has a different mean at different points in time and its variance increases with the sample size

Non-stationarity (cont.)

• Mean and variance increase with time

• yt = n + n yt-n +n-1jut-j

• If = then shocks never die out

• If | |<1 as n, then y is like a finite MA

• What do non-stationary series look like?

• Could show made-up series (with and without drift)

Difference vs trend stationarity• Compare previous equation with

yt = a + b t + ut

E(y) = a + b t

var(y) = 2

• b t - deterministic trend

• But stationary around a trend

E(y - b t) = a

Difference vs trend stationarity (2)

• Compare two generated series

• Stationary around trend

• Difference stationary are non-constant around a trend

• But can be difficult to tell apart

• Also difficult to tell series with AR coefficients 1 and 0.95

Difference vs trend stationary

0

100

200

300

400

500

-20

0

20

40

60

80

00 10 20 30 40 50 60 70 80 90 00

X Z

Difference vs trend stationarity

• Can you tell the difference?

xt = 1 + xt-1 + 0.6 ut

zt = 1 + 0.15 t + 0.8 et

• Can you tell the difference with a near-unit root?

Unit root vs near-unit root

0

10

20

30

40

50

00 01 02 03 04 05 06 07 08 09 10

X W

0

100

200

300

400

500

00 10 20 30 40 50 60 70 80 90 00

X W

Testing for unit roots

• Dickey-Fuller test

• Write

yt = yt-1 + et

as

yt - yt-1 = (-1)yt-1 + et

Null: Coefficient on lagged value 0, vs < 0

Dickey-Fuller tests

• Test akin to t-test but distributions not standard• Depends if series contains constant and/or trends• Must incorporate this into DF test• Augmented DF test - use lags of dependent

variable to remove serial correlation• All of these must be checked against relevant DF

statistic• But introducing extra variables reduces power

Unit versus near-unit roots

• Thus difficult to tell the difference between two series over small samples

• Low power of ADF tests (sample of 400)

x: ADF statistic -0.77048 p-value 0.8258

w: ADF statistic -6.90130 p-value 0.0000

• Small sample (40 observations)

x: ADF statistic 0.39323 p-value 0.9804

w: ADF statistic -0.49216 p-value 0.8828

Stationarity in non-stationary time series

• A variable is integrated of order d - I(d) - if it musto be differenced d times for stationarity

• The required number of differences depends on the number of unit roots a series has

• For example, an I(1) variable needs to be differenced once to achieve stationarity: it has only one unit root

Spurious regressions• Trends in data can lead to spurious correlation

between variables: there appears to be meaningful relationships

• What is present are uncorrelated trends

• Time trend in a trend-stationary variable can be removed by regressing variable on time

• Regression model then operates with stationary series with constant means and variances (standard t and F test inferences)

Spurious regressions

• Regressing a non-stationary variable on a time trend generally does not yield a stationary variable (it must be differenced) i.e. taking trend away does not lead to stationarity

• Using standard regression techniques with non-stationary data can lead to the problem of spurious regression involving invalid inference based on usual t and F tests

Spurious regressions• Consider the following DGP:

yt = yt-1 + ut u , 1

xt = xt-1 + et e , 1• y and x are uncorrelated, but estimating

yt = a + b xt + vt

we find that we can reject b = 0.

• Why? Non-stationary data => v non-stationary gives problems with t and F stats

• Also find high R2 and low DW (G&N 1974)

Spurious RegressionsDependent Variable: YMethod: Least SquaresDate: 03/31/03 Time: 18:28Sample: 1900:1 2003:4Included observations: 416

Variable Coefficient Std. Error t-Statistic Prob.

X 0.964478 0.001112 867.6800 0.0000

R-squared 0.997879 Mean dependent var 202.9399Adjusted R-squared 0.997879 S.D. dependent var 120.3730S.E. of regression 5.543177 Akaike info criterion 6.265414Sum squared resid 12751.63 Schwarz criterion 6.275103Log likelihood -1302.206 Durbin-Watson stat 0.023766

Spurious regression

• Why do we find significant coefficients?

• What will happen if we estimate a spurious regression with the variables in first differences?

• What ‘economic problem’ do we encounter if we only use differenced variables in economics?

• We lose information about the long-run

Spurious RegressionDependent Variable: DYMethod: Least SquaresDate: 03/31/03 Time: 18:36Sample(adjusted): 1900:2 2003:4Included observations: 415 after adjusting endpoints

Variable Coefficient Std. Error t-Statistic Prob.

C 0.989704 0.016085 61.52980 0.0000DX -0.005194 0.012185 -0.426235 0.6702

R-squared 0.000440 Mean dependent var 0.984475Adjusted R-squared -0.001981 S.D. dependent var 0.211713S.E. of regression 0.211922 Akaike info criterion -0.260386Sum squared resid 18.54827 Schwarz criterion -0.240973Log likelihood 56.03014 F-statistic 0.181676Durbin-Watson stat 1.752192 Prob(F-statistic) 0.670159

Cointegration (definition)

• In general, regressing two I(d) variables, d>0, leads to the problem of spurious regression

• Assume two I(d) variables and estimate:

• If is a vector such that t is I(d-b) then we say that y and x are co-integrated of order CI(d,b)

ttt xy

What is cointegration?

• If two (or more) series have an equilibrium relationship in the long run even though the series contain stochastic trends they move together such that a (linear) combination of them is stationary

• Cointegration resembles a long-run equilibrium and differences from the relationship are akin to disequilibrium

• Trivially, a stationary model must be cointegrated but may not co-break

Modelling the short-run

• Are we ever in the long run?

• How do we model the short run?

• Problem of using only differenced data and the loss of long-run information

• Assume

• In steady state has little meaning for the long run

ttt xy 0 tt xy

Modelling short run

• Assume

yt = xt + yt-1 + xt-1 + t, , 2

• If a LR relationship exists

yt = + xt

• We can write

yt = xt - (1- )(yt-1 - - xt-1 ) + t

• (1- ) is speed of adjustment

• Implications for the sign of ECM

Modelling the short-run• There are some issues about the estimation

of • Stock (1987) shows that OLS is fine, is

super-consistent; the estimator converges to its true value at a faster rate when a series is I(1) than when it is I(0)

• However, there is significant of bias in small samples

Testing strategies• Perron’s suggestion:

– start with regression with constant and trend

– proceed trying to reduce unnecessary paramaters

– if we fail to reject parameters continue testing until we are able to reject the hypothesis of a unit root

• In the end we should use common sense and economics– If there should not be a unit root - probably a

break

Cointegration and single equations

• When looking at single equations it is easy to test for cointegration– Engle and Granger two-step procedure– Engle-Granger-Yoo three-step approach

• What if there is more than a single cointerating relationship?– Need a system approach– VECMs

Modelling strategies• Understand the data

– Do whatever tests necessary to be sure of using appropriate models

• Understand the limitations of individual methods– By not taking limitations into account a rejection does not

necessarily imply that the hypothesis is false

• Use appropriate methods for different problems

EXOGENEITY• Banerjee, A, D.F. Hendry and G.E. Mizon (1996) “The econometric analysis of economic policy”, Oxford Bulletin of

Economics and Statistics 58(4), 573-600

• Ericsson, N.R. and J.S. Irons (eds) (1994) Testing Exogeneity. Advanced Texts in Econometrics. Oxford University Press.

• Lindé, J. (2001) “Testing for the Lucas Critique: A quantitative investigation”, American Economic Review 91(4), 986-1005.

• Monfort, A and R. Rabemananjara (1990) “From a VAR model to a structural model, with an application to the wage-price spiral”, Journal of Applied Econometrics 5, 203-227

• Urbain, J.P. (1995) “Partial versus full system modelling of cointegrated systems: An empirical illustration”, Journal of Econometrics 69(1), 177-210.

• Boswijk, P. and J.P. Urbain (1997) “Lagrange Multiplier tests for weak exogeneity: A synthesis”, Econometric Reviews 16(1), 21-38.

• Charezma, W.W and D.F. Deadman, (1997) New Directions in Econometric Practice, Edward Elgar, Second Edition.

• Urbain, J.P. (1992) “On weak exogeneity in error correction models”, Oxford Bulletin of Economics and Statistics 54(2), 187-207.

MODELLING AND FORECASTING SHORT-TERM DATA

• Jondeau, É., H. Le Bihan and F. Sédillot (1999) Modelling and Forecasting the French Consumer Price Index Components, Banque de France Working paper 68.

• Clements, M. P. and D.F. Hendry (1999) Forecasting non-stationary economic time series. MIT Press.

• Bardsen, G and P.G. Fisher (1996) On the roles of economic theory and equilibria in estimating dynamic econometric models-with an application to wages and prices in the United Kingdom, Essays in Honour of Ragnar Frisch.

VARS

• Levtchenkova, S., A.R. Pagan and J.C. Robertson (1998) “Shocking stories”, Journal of Economic Surveys 12(5), 507-532.

top related