Top Banner
Limited Dependent Variables Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement Frequently it is even dichotomous.
23

Limited Dependent Variables Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Dec 16, 2015

Download

Documents

Homer Fowler
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Limited Dependent Variables

Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Frequently it is even dichotomous.

Page 2: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Examples

War(1) vs. no War(0) Vote vs. no vote Regime change vs. no change

Page 3: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

These are often Probability Models

E.g. Power disparity leads to war:

Where Yt is the occurrence (or not) of war, and Xt

is a measure of power disparity

We call this a Linear Probability Model

ttt eXBaY 1

Page 4: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Problems with LPM Regression

OLS in this case is called the Linear Probability Model

Running regression produces some problems Errors are not distributed normally Errors are heteroskedastic Predicted Ys can be outside the 0.0-1. bounds

required for probability

Page 5: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Logistic Model

We need a model that produces true probabilities The Logit, or cumulative logistic distribution offers one

approach.

This produces a sigmoid curve. Look at equation under 2 conditions:

Xi = +∞ Xi = -∞

)( 211

1iXBBi e

Y

Page 7: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Probability Ratio

Note that

Where

Z

Z

ZXBBi e

e

eeP

ii

11

1

1

1)_( 21

ii XBBZ 21

Page 8: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Log Odds Ratio

The logit is the log of the odds ratio, and is given by:

This model gives us a coefficient that may be interpreted as a change in the weighted odds of the dependent variable

iii

ii XBBZ

P

PL 211

ln

Page 9: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Estimation of Model

We estimate this with maximum likelihood The significance tests are z statistics We can generate a Pseudo R2 which is an attempt to

measure the percent of variation of the underlying logit function explained by the independent variables

We test the full model with the Likelihood Ratio test (LR), which has a χ2 distribution with k degrees of freedom

Page 10: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Neural Networks

The alternate formulation is representative of a single-layer perceptron in an artificial neural network.

Page 11: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Probit

If we can assume that the dependent variable is actually the result of an underlying (and immeasurable) propensity or utility, we can use the cumulative normal probability function to estimate a Probit model

Also, more appropriate if the categories (or their propensities) are likely to be normally distributed

It looks just like a logit model in practice

Page 12: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

The Cumulative Normal Density Function

The normal distribution is given by:

The Cumulative Normal Density Function is:

2

2

2

)(

22

1)(

X

eXf

0 2

2

2

)(

22

1)(

XX

eXF

Page 13: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

The Standard Normal CDF

We assume that there is an underlying threshold value (Ii) that if the case exceeds will be a 1, and 0 otherwise.

We can standardize and estimate this as

iXBB zi dzeIF

21 2 2/

2

1)(

Page 14: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Probit estimates

Again, maximum likelihood estimation Again, a Pseudo R2 Again, a LR ratio with k degrees of freedom

Page 15: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Assumptions of Models

All Y’s are in {0,1} set They are statistically independent No multicollinearity The P(Yi=1) is normal density for probit, and

logistic function for logit

Page 16: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Ordered Probit

If the dependent variable can take on ordinal levels, we can extend the dichotomous Probit model to an n-chotomous, or ordered, Probit model

It simply has several threshold values estimated

Ordered logit works much the same way

Page 17: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Multinomial Logit

If our dependent variable takes on different values, but they are nominal, this is a multinomial logit model

Page 18: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Some additional info

The Modal category is good benchmark Present % correctly predicted

This can be calculated and presented. This, when compared to the modal category,

gives us a good indication of fit.

Page 19: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Stata

Use Leadership Change data (1992 cross section) 1992-Stata

Page 20: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Test different models

Dependent variable Leadership change Examine distribution

tables ledchan1 Independent variables

Try differentTry corr and then (pwcorr)

Page 21: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Try the following

regress ledchan1 grwthgdp hlthexp illit_f polity2

logit ledchan1 grwthgdp hlthexp illit_f polity2

logistic ledchan1 grwthgdp hlthexp illit_f polity2

probit ledchan1 grwthgdp hlthexp illit_f polity2

ologit ledchan1 grwthgdp hlthexp illit_f polity2

oprobit ledchan1 grwthgdp hlthexp illit_f polity2

mlogit ledchan1 grwthgdp hlthexp illit_f polity2

tobit ledchan1 grwthgdp hlthexp illit_f polity2, ul ll

Page 22: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Tobit

Assumes a 0 value, and then a scale E.g., the decision to incarcerate

0 or 1 (Imprison or not)

If Imprison, than for how many years?

Page 23: Limited Dependent Variables  Often there are occasions where we are interested in explaining a dependent variable that has only limited measurement

Other models

This leads to many other models Count models & Poisson regression Duration/Survival/hazard models Censoring and truncation models Selection bias models