Top Banner
Multivariate Regression and Data Collection 11/21/2013
50
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Multivariate Regression and Data Collection 11/21/2013.

Multivariate Regression and Data Collection

11/21/2013

Page 2: Multivariate Regression and Data Collection 11/21/2013.

Readings

• Chapter 8 (pp 187-206)

• Chapter 9 Dummy Variables and Interaction Effects (Pollock Workbook)

Page 3: Multivariate Regression and Data Collection 11/21/2013.

OPPORTUNITIES TO DISCUSS COURSE CONTENT

Page 4: Multivariate Regression and Data Collection 11/21/2013.

Office Hours For the Week

• When– Friday 7:00AM-3:00PM– Monday 7:00AM-1:00 PM – Tuesday 8-12– And by appointment

Page 5: Multivariate Regression and Data Collection 11/21/2013.

Course Learning Objectives

1. Students will be able to interpret and explain empirical data.

2. Students will achieve competency in conducting statistical data analysis using the SPSS software program.

3. Third, students will learn the basics of polling and be able to analyze and explain polling and survey data.

Page 6: Multivariate Regression and Data Collection 11/21/2013.

MULTIPLE REGRESSION

Page 7: Multivariate Regression and Data Collection 11/21/2013.

What we can do with it

• Test the significance, strength and direction of more than one independent variable on the dependent variable, while controlling for the other independent variables.

• We can compare the strength of each independent variable against each other

• We can examine an entire model at one time!

Page 8: Multivariate Regression and Data Collection 11/21/2013.

This allows us to model additive relationships

Page 9: Multivariate Regression and Data Collection 11/21/2013.

Regression Outputs

• These have 3 parts1. The Model Summary 2. ANOVA3. The Variables/Model

Page 10: Multivariate Regression and Data Collection 11/21/2013.

THINGS THAT BEGIN WITH “R”Part I

Page 11: Multivariate Regression and Data Collection 11/21/2013.

With So Many, How do we know?

• There are many R's out there: – lower case "r" for correlation – upper case "R" for regression

Page 12: Multivariate Regression and Data Collection 11/21/2013.

What the R’s look like

The R Square

Adj R-Square, the preferred measure

Page 13: Multivariate Regression and Data Collection 11/21/2013.

THE ANALYSIS OF VARIANCE (ANOVA)

Part II

Page 14: Multivariate Regression and Data Collection 11/21/2013.

What The F-Score tells us• It is like a chi-square for

Regression. The F-score tells us if we have a significant regression model

• If the F-Score is not significant, we accept the null hypothesis (no relationship).

• It usually tells us at least one of our variables is significant.

• It is a way of examining the entire regression.

Page 15: Multivariate Regression and Data Collection 11/21/2013.

The F-Score

• We look at the Sig value and use the p<.05 measurement

• In the model above, our p value is .001 – We Reject the null hypothesis – At least one variable is significant

Page 16: Multivariate Regression and Data Collection 11/21/2013.

THE MODELPart III

Page 17: Multivariate Regression and Data Collection 11/21/2013.

The Model

• What it tells us– Variable relationships and direction– Variable significance– Variable Strength

Page 18: Multivariate Regression and Data Collection 11/21/2013.

Old Friends

Beta Values• Measure the change in the

dependent variable

• Show the direction of the relationship

T-Tests• Test the significance of each

independent variable on the dependent variable

• Accept or reject the null for that variable

Page 19: Multivariate Regression and Data Collection 11/21/2013.

Standardized Beta Coefficients

• They show us the variables which have the greatest influence.

• These are measured in absolute value

• The larger the standardized beta, the more influence it has on the dependent variable.

Page 20: Multivariate Regression and Data Collection 11/21/2013.

Looking at our Model

Beta Values

T-Score- Significance

Page 21: Multivariate Regression and Data Collection 11/21/2013.

TRYING IT OUT

Page 22: Multivariate Regression and Data Collection 11/21/2013.

Another One

• D.V. Palin_therm-post (Feeling thermometer for Palin 0-100)

• IV's– enviro_jobs (Environment vs.

jobs tradeoff) 0=envir, 1=middle, 2=jobs

– educ_r- education in years – Gunown- do you own a gun

(1=yes, 5=no) – relig_bible_word (Is Bible

actual word of God?) 1=yes, 0=No

Page 23: Multivariate Regression and Data Collection 11/21/2013.

Another one from the states

• Gay Rights involves many concepts. The Lax-Phillips index uses content validity to address this issue at the state level. It examines the support for the following issues– Adoption – Hate Crimes legislation – Health Benefits – Housing Discrimination – Job Discrimination – Marriage Laws – Sodomy Laws – Civil Unions – It then averages these to get a statewide level

Page 24: Multivariate Regression and Data Collection 11/21/2013.

State Example

• Dependent Variable- gay_support (higher is more supportive on Lax-Phillips)

• Independent Variables – relig_import (% of people in

state that say religion provides a great deal of guidance)

– south (1=south, 0= NonSouth – abortlaw (restrictions on

abortion)

Page 25: Multivariate Regression and Data Collection 11/21/2013.

Tautology

• it is tempting to use independent variables that are actually components of the dependent variable.

• How you will notice this: – if the dependent variables seem to be measures of

each other (human development vs. education) they probably are, (female literacy and literacy rate)

– High Adj. R-Squares (above .900)

Page 26: Multivariate Regression and Data Collection 11/21/2013.

Multicollinearity

• Your independent variables should not only be independent of the d.v. (non tautological) but they should be independent of each other!

• Picking independent variables that are very closely related, or are actually part of the same measure What can happen here is these variables will negate the influence of each other on the dependent variable.

Page 27: Multivariate Regression and Data Collection 11/21/2013.

Symptoms of Multicollinearity

• the multiple regression equation is statistically significant (big R values, even a significant ANOVA), but none of the t-ratios are statistically significant

• the addition of the collinear independent variable radically changes the values of the standardized beta coefficients (they go from positive to negative, or weak to strong), without a corresponding change in the ADJ R-square.

• Variables, that you would swear on a stack of bibles should be related, are not

Page 28: Multivariate Regression and Data Collection 11/21/2013.

Solving Tautology and Multicolinearity

• Solving tautology- Drop the independent variable

• What to do About Multicollinearity – run bivariate correlations on each of your

variables. If the r-square value is >.60. – You will want to drop one of the variables, or

combine them into a single measure.

Page 29: Multivariate Regression and Data Collection 11/21/2013.

Data collection

Page 30: Multivariate Regression and Data Collection 11/21/2013.

Collecting Primary Data

• Document Analysis

• Direct Observation

• Interview Data

Page 31: Multivariate Regression and Data Collection 11/21/2013.

DOCUMENT ANALYSIS

Page 32: Multivariate Regression and Data Collection 11/21/2013.

Document Analysis (The Written Record)

• What is it

• When to use it

Page 33: Multivariate Regression and Data Collection 11/21/2013.

Types of Document Analysis

• The Episodic Record

• The Running Record

Page 34: Multivariate Regression and Data Collection 11/21/2013.

Limitations and Advantages

Page 35: Multivariate Regression and Data Collection 11/21/2013.

OBSERVATION

Page 36: Multivariate Regression and Data Collection 11/21/2013.

Observation

• What is it

• Types of Observation

Page 37: Multivariate Regression and Data Collection 11/21/2013.

Problems of Observation

• Reactivity

• Ethics

Page 38: Multivariate Regression and Data Collection 11/21/2013.

Which Method to use?

Page 39: Multivariate Regression and Data Collection 11/21/2013.

THE LITERATURE REVIEW

Page 40: Multivariate Regression and Data Collection 11/21/2013.

What it Should Contain

• Bring the reader up to speed on the status of the research (what has been done)

• Establish face validity (why I am using these variables)

• Point out potential problems with previous research

Page 41: Multivariate Regression and Data Collection 11/21/2013.

What it should Contain

• what are the main texts in this area

• what are the general theories in this area

• how has the question been measured in the past

Page 42: Multivariate Regression and Data Collection 11/21/2013.

QUESTIONNAIRE CONSTRUCTION

Page 43: Multivariate Regression and Data Collection 11/21/2013.

Question Style

• Open Ended (advantages & disadvantages)

• Closed Ended (advantages & disadvantages)

Page 44: Multivariate Regression and Data Collection 11/21/2013.

Demographic Questions

• Who are you?

• These tend to be overrated

• Don’t get too personal!

Page 45: Multivariate Regression and Data Collection 11/21/2013.

Behavior Questions

• What do you do, and how often?

• Knowing behavior is a good dependent and independent variable

Page 46: Multivariate Regression and Data Collection 11/21/2013.

Opinion and Attitude Questions

• What do you think?

• Easy to Answer

Page 47: Multivariate Regression and Data Collection 11/21/2013.

Knowledge/Factual Questions

• Use sparingly

Page 48: Multivariate Regression and Data Collection 11/21/2013.

Question Order is Key

• Intro and Filter

• First Questions

• Major Questions

• Final Questions- demographics

Page 49: Multivariate Regression and Data Collection 11/21/2013.

How you should Phrase Questions• Language (be clear)

• One question 1 concept

Page 50: Multivariate Regression and Data Collection 11/21/2013.

Information Level