Business Research Methods William G. Zikmund Chapter 23 Bivariate Analysis: Measures of Associations

Nov 10, 2014

TAHMID SHAWROVE

Page 1: Research Methods William G. Zikmund, Ch23

Business Research Methods

William G. Zikmund

Chapter 23

Bivariate Analysis: Measures of Associations

Page 2: Research Methods William G. Zikmund, Ch23

Measures of Association

• A general term that refers to a number of bivariate statistical techniques used to measure the strength of a relationship between two variables.

Page 3: Research Methods William G. Zikmund, Ch23

Relationships Among Variables

• Correlation analysis

• Bivariate regression analysis

Page 4: Research Methods William G. Zikmund, Ch23

Type of Measurement: Interval and Ratio Scales

Measure of Association: Correlation Coefficient; Bivariate Regression

Page 5: Research Methods William G. Zikmund, Ch23

Type of Measurement: Ordinal Scales

Measure of Association: Chi-Square; Rank Correlation

Page 6: Research Methods William G. Zikmund, Ch23

Type of Measurement: Nominal

Measure of Association: Chi-Square; Phi Coefficient; Contingency Coefficient

Page 7: Research Methods William G. Zikmund, Ch23

Correlation Coefficient

• A statistical measure of the covariation, or association, between two variables.

• Are dollar sales associated with advertising dollar expenditures?

Page 8: Research Methods William G. Zikmund, Ch23

The correlation coefficient for two variables, X and Y, is $r_{xy}$.

Page 9: Research Methods William G. Zikmund, Ch23

Correlation Coefficient

• r

• r ranges from +1 to -1

• r = +1 a perfect positive linear relationship

• r = -1 a perfect negative linear relationship

• r = 0 indicates no correlation

Page 10: Research Methods William G. Zikmund, Ch23

Simple Correlation Coefficient

$$r_{xy} = r_{yx} = \frac{\sum (X_i - \bar{X})(Y_i - \bar{Y})}{\sqrt{\sum (X_i - \bar{X})^2 \, \sum (Y_i - \bar{Y})^2}}$$
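The deviation form of the simple correlation coefficient can be sketched in a few lines of Python. This is a minimal illustration, not code from the chapter: the function name `pearson_r` and the advertising and sales figures are hypothetical.

```python
import math

def pearson_r(xs, ys):
    """Simple correlation coefficient computed from deviations about the means."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length samples with at least 2 points")
    x_bar = sum(xs) / len(xs)
    y_bar = sum(ys) / len(ys)
    # Numerator: sum of cross-products of the deviations.
    cross = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    # Denominator: square root of the product of the two sums of squared deviations.
    # (A variable with no variation makes this zero, and r is then undefined.)
    ss_x = sum((x - x_bar) ** 2 for x in xs)
    ss_y = sum((y - y_bar) ** 2 for y in ys)
    return cross / math.sqrt(ss_x * ss_y)

# Hypothetical data: advertising expenditures and dollar sales.
advertising = [1, 2, 3, 4, 5]
sales = [2, 4, 5, 4, 5]
r = pearson_r(advertising, sales)
print(f"r = {r:.3f}")
```

Swapping the two arguments returns the same value, which is why the slide writes r with both subscript orders.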

Page 11: Research Methods William G. Zikmund, Ch23

Simple Correlation Coefficient

$$r_{xy} = r_{yx} = \frac{\sigma_{xy}}{\sqrt{\sigma_x^2 \, \sigma_y^2}}$$

Page 12: Research Methods William G. Zikmund, Ch23

Simple Correlation Coefficient: Alternative Method

$\sigma_x^2$ = Variance of X

$\sigma_y^2$ = Variance of Y

$\sigma_{xy}$ = Covariance of X and Y
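The alternative method divides the covariance by the square root of the product of the two variances. The sketch below assumes sample (n - 1) divisors; because the same divisor appears in the numerator and the denominator, it cancels, and the result matches the deviation formula. The helper names and the data are hypothetical.

```python
import math

def variance(values):
    """Sample variance (n - 1 divisor)."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / (len(values) - 1)

def covariance(xs, ys):
    """Sample covariance of X and Y (n - 1 divisor)."""
    x_bar = sum(xs) / len(xs)
    y_bar = sum(ys) / len(ys)
    return sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / (len(xs) - 1)

def r_from_covariance(xs, ys):
    """Correlation coefficient as covariance over the root of the variance product."""
    return covariance(xs, ys) / math.sqrt(variance(xs) * variance(ys))

# Hypothetical data.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
r_alt = r_from_covariance(xs, ys)
print(f"covariance = {covariance(xs, ys):.2f}, r = {r_alt:.3f}")
```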

Page 13: Research Methods William G. Zikmund, Ch23

Correlation Patterns

[Figure: scatter diagram of Y against X showing no correlation]

Page 14: Research Methods William G. Zikmund, Ch23

Correlation Patterns

[Figure: scatter diagram of Y against X showing a perfect negative correlation, r = -1.0]

Page 15: Research Methods William G. Zikmund, Ch23

Correlation Patterns

[Figure: scatter diagram of Y against X showing a high positive correlation, r = +.98]

Page 16: Research Methods William G. Zikmund, Ch23

Calculation of r (Pg 629)

$$r = \frac{-6.3389}{\sqrt{(17.837)(5.589)}} = \frac{-6.3389}{\sqrt{99.712}} = -.635$$

Page 17: Research Methods William G. Zikmund, Ch23

Coefficient of Determination

$$r^2 = \frac{\text{Explained variance}}{\text{Total variance}}$$

Page 18: Research Methods William G. Zikmund, Ch23

Correlation Does Not Mean Causation

• High correlation

• Rooster’s crow and the rising of the sun
  – The rooster does not cause the sun to rise.

• Teachers’ salaries and the consumption of liquor
  – They covary because both are influenced by a third variable.

Page 19: Research Methods William G. Zikmund, Ch23

Correlation Matrix

• The standard form for reporting correlational results.

Page 20: Research Methods William G. Zikmund, Ch23

Correlation Matrix

        Var1    Var2    Var3
Var1    1.0     0.45    0.31
Var2    0.45    1.0     0.10
Var3    0.31    0.10    1.0
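A correlation matrix like this one is every pairwise r arranged in a square table. The following sketch builds one with plain Python; the three data columns are hypothetical and will not reproduce the values shown on the slide.

```python
import math

def pearson_r(xs, ys):
    """Simple correlation coefficient from deviations about the means."""
    x_bar = sum(xs) / len(xs)
    y_bar = sum(ys) / len(ys)
    cross = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    ss_x = sum((x - x_bar) ** 2 for x in xs)
    ss_y = sum((y - y_bar) ** 2 for y in ys)
    return cross / math.sqrt(ss_x * ss_y)

def correlation_matrix(columns):
    """Return {row name: {column name: r}} for every pair of variables."""
    names = list(columns)
    return {a: {b: pearson_r(columns[a], columns[b]) for b in names} for a in names}

# Hypothetical observations for three variables.
data = {
    "Var1": [3, 5, 6, 8, 9],
    "Var2": [2, 1, 4, 3, 5],
    "Var3": [7, 6, 9, 5, 8],
}
matrix = correlation_matrix(data)

print("      " + "  ".join(f"{name:>5}" for name in data))
for row in data:
    print(f"{row:<6}" + "  ".join(f"{matrix[row][col]:5.2f}" for col in data))
```

The diagonal is always 1.0 because each variable correlates perfectly with itself, and the matrix is symmetric because r is the same in both directions.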

Page 21: Research Methods William G. Zikmund, Ch23

Walkup’s First Laws of Statistics

• Law No. 1
  – Everything correlates with everything, especially when the same individual defines the variables to be correlated.

• Law No. 2
  – It won’t help very much to find a good correlation between the variable you are interested in and some other variable that you don’t understand any better.

Page 22: Research Methods William G. Zikmund, Ch23

Walkup’s First Laws of Statistics

• Law No. 3
  – Unless you can think of a logical reason why two variables should be connected as cause and effect, it doesn’t help much to find a correlation between them. In Columbus, Ohio, the mean monthly rainfall correlates very nicely with the number of letters in the names of the months!

Page 23: Research Methods William G. Zikmund, Ch23

Regression

• Dictionary definition: going or moving backward

• Going back to previous conditions

• Tall men’s sons

Page 24: Research Methods William G. Zikmund, Ch23

Bivariate Regression

• A measure of linear association that investigates a straight-line relationship

• Useful in forecasting

Page 25: Research Methods William G. Zikmund, Ch23

Bivariate Linear Regression

• A measure of linear association that investigates a straight-line relationship

• Y = a + bX

• where

• Y is the dependent variable

• X is the independent variable

• a and b are two constants to be estimated

Page 26: Research Methods William G. Zikmund, Ch23

Y intercept

• a

• An intercepted segment of a line

• The point at which a regression line intercepts the Y-axis

Page 27: Research Methods William G. Zikmund, Ch23

Slope

• b

• The inclination of a regression line as compared to a base line

• Rise over run

• Δ - notation for “a change in”

Page 28: Research Methods William G. Zikmund, Ch23

Scatter Diagram and Eyeball Forecast

[Figure: scatter diagram of Y (80 to 160) against X (70 to 190) with two eyeballed forecast lines, labeled “My line” and “Your line”]

Page 29: Research Methods William G. Zikmund, Ch23

Regression Line and Slope

[Figure: regression line $\hat{Y} = \hat{a} + \hat{\beta}X$ plotted on Y (80 to 130) against X (80 to 190), with the slope marked as $\Delta Y / \Delta X$]

Page 30: Research Methods William G. Zikmund, Ch23

Least-Squares Regression Line

[Figure: least-squares regression line on Y (80 to 160) against X (70 to 190), marking the actual Y and the predicted Y (“Y hat”) for Dealer 3 and for Dealer 7]

Page 31: Research Methods William G. Zikmund, Ch23

Scatter Diagram of Explained and Unexplained Variation

[Figure: regression line on Y (80 to 130) against X (80 to 190), showing for one observation the total deviation split into the deviation explained by the regression and the deviation not explained]

Page 32: Research Methods William G. Zikmund, Ch23

The Least-Squares Method

• Uses the criterion of attempting to make the least amount of total error in prediction of Y from X. More technically, the procedure used in the least-squares method generates a straight line that minimizes the sum of squared deviations of the actual values from this predicted regression line.

Page 33: Research Methods William G. Zikmund, Ch23

The Least-Squares Method

• A relatively simple mathematical technique that ensures that the straight line will most closely represent the relationship between X and Y.

Page 34: Research Methods William G. Zikmund, Ch23

Regression - Least-Squares Method

$$\sum_{i=1}^{n} e_i^2 \text{ is minimum}$$

Page 35: Research Methods William G. Zikmund, Ch23

$e_i = Y_i - \hat{Y}_i$ (the “residual”)

$Y_i$ = actual value of the dependent variable

$\hat{Y}_i$ = estimated value of the dependent variable (Y hat)

n = number of observations

i = number of the observation
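To make the least-squares criterion concrete, the sketch below sums the squared residuals for two candidate lines drawn through the same points. The data, the "eyeball" line, and the function name are hypothetical; the second line uses the least-squares intercept and slope for this data set (2.2 and 0.6), and it produces the smaller total.

```python
def sum_squared_residuals(xs, ys, intercept, slope):
    """Sum of squared residuals e_i = Y_i - Y_hat_i for the line Y_hat = intercept + slope * X."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))

# Hypothetical observations.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

# An eyeballed line versus the least-squares line for this data.
sse_eyeball = sum_squared_residuals(xs, ys, intercept=1.5, slope=0.8)
sse_least_squares = sum_squared_residuals(xs, ys, intercept=2.2, slope=0.6)

print(f"eyeball line:       {sse_eyeball:.2f}")
print(f"least-squares line: {sse_least_squares:.2f}")
```

Any other straight line through these points gives a sum at least as large as the least-squares line's, which is the sense in which that line makes the least total error.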

Page 36: Research Methods William G. Zikmund, Ch23

The Logic behind the Least-Squares Technique

• No straight line can completely represent every dot in the scatter diagram

• There will be a discrepancy between most of the actual scores (each dot) and the predicted score

• Uses the criterion of attempting to make the least amount of total error in prediction of Y from X

Page 37: Research Methods William G. Zikmund, Ch23

Bivariate Regression

$$\hat{a} = \bar{Y} - \hat{\beta}\bar{X}$$

Page 38: Research Methods William G. Zikmund, Ch23

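Bivariate Regression

$$\hat{\beta} = \frac{n\left(\sum XY\right) - \left(\sum X\right)\left(\sum Y\right)}{n\left(\sum X^2\right) - \left(\sum X\right)^2}$$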

Page 39: Research Methods William G. Zikmund, Ch23

$\hat{\beta}$ = estimated slope of the line (the “regression coefficient”)

$\hat{a}$ = estimated intercept of the Y axis

Y = dependent variable

$\bar{Y}$ = mean of the dependent variable

X = independent variable

$\bar{X}$ = mean of the independent variable

n = number of observations
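The slope and intercept formulas translate directly into code that accumulates the four sums they need. This is a minimal sketch with a hypothetical function name and hypothetical data; it assumes the X values are not all identical, since the slope denominator would otherwise be zero.

```python
def fit_line(xs, ys):
    """Least-squares intercept and slope for Y_hat = intercept + slope * X."""
    n = len(xs)
    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)
    # Slope: [n * sum(XY) - sum(X) * sum(Y)] / [n * sum(X^2) - (sum(X))^2]
    slope = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
    # Intercept: mean of Y minus slope times mean of X.
    intercept = sum_y / n - slope * (sum_x / n)
    return intercept, slope

# Hypothetical observations.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
intercept, slope = fit_line(xs, ys)
print(f"Y_hat = {intercept:.1f} + {slope:.1f} X")
```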

Page 40: Research Methods William G. Zikmund, Ch23

$$\hat{\beta} = \frac{15(193{,}345) - 2{,}806{,}875}{15(245{,}759) - 3{,}515{,}625} = \frac{2{,}900{,}175 - 2{,}806{,}875}{3{,}686{,}385 - 3{,}515{,}625} = \frac{93{,}300}{170{,}760} = .54638$$

Page 41: Research Methods William G. Zikmund, Ch23

$$\hat{a} = 99.8 - .54638(125) = 99.8 - 68.3 = 31.5$$
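The arithmetic of the worked example can be checked by plugging the slide's totals into the slope and intercept formulas. The figures below are read from the slides (n = 15, a sum of XY of 193,345, a product of the X and Y sums of 2,806,875, a sum of X squared of 245,759, a squared sum of X of 3,515,625, and means of 99.8 and 125); the variable names are illustrative.

```python
# Totals taken from the worked example on the slides.
n = 15
sum_xy = 193_345
sum_x_times_sum_y = 2_806_875
sum_x_squared_terms = 245_759      # sum of X^2
square_of_sum_x = 3_515_625        # (sum of X)^2
y_bar = 99.8
x_bar = 125

# Slope: [n * sum(XY) - sum(X) * sum(Y)] / [n * sum(X^2) - (sum(X))^2]
numerator = n * sum_xy - sum_x_times_sum_y
denominator = n * sum_x_squared_terms - square_of_sum_x
slope = numerator / denominator

# Intercept: mean of Y minus slope times mean of X.
intercept = y_bar - slope * x_bar

print(f"slope     = {numerator:,} / {denominator:,} = {slope:.5f}")
print(f"intercept = {intercept:.1f}")
```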

Page 43: Research Methods William G. Zikmund, Ch23

$$\hat{Y} = 31.5 + .546(X) = 31.5 + .546(89) = 31.5 + 48.6 = 80.1$$

Page 45: Research Methods William G. Zikmund, Ch23

Dealer 7 (Actual Y value = 129)

$$\hat{Y}_7 = 31.5 + .546(165) = 121.6$$

Dealer 3 (Actual Y value = 80)

$$\hat{Y}_3 = 31.5 + .546(95) = 83.4$$

Page 46: Research Methods William G. Zikmund, Ch23

$$e_9 = Y_9 - \hat{Y}_9 = 97 - 96.5 = 0.5$$
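The predictions and residuals for the individual dealers follow from the fitted equation, Y hat = 31.5 + .546X. The sketch below uses the X and actual Y values shown on the slides for Dealers 7, 3, and 9 (165 and 129, 95 and 80, 119 and 97); the function and variable names are illustrative.

```python
INTERCEPT = 31.5
SLOPE = 0.546

def predict(x):
    """Predicted Y (Y hat) from the fitted regression equation."""
    return INTERCEPT + SLOPE * x

# dealer number -> (X, actual Y), as given on the slides.
dealers = {7: (165, 129), 3: (95, 80), 9: (119, 97)}

for dealer, (x, actual_y) in dealers.items():
    y_hat = predict(x)
    residual = actual_y - y_hat  # e_i = Y_i - Y_hat_i
    print(f"Dealer {dealer}: Y_hat = {y_hat:.1f}, residual = {residual:.1f}")
```

A positive residual means the dealer's actual Y lies above the regression line; a negative residual means it lies below.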

Page 49: Research Methods William G. Zikmund, Ch23

$$\hat{Y}_9 = 31.5 + .546(119)$$

Page 50: Research Methods William G. Zikmund, Ch23

F-Test (Regression)

• A procedure to determine whether there is more variability explained by the regression or unexplained by the regression.

• Analysis of variance summary table

Page 51: Research Methods William G. Zikmund, Ch23

Total Deviation can be Partitioned into Two Parts

• Total deviation equals

• Deviation explained by the regression plus

• Deviation unexplained by the regression

Page 52: Research Methods William G. Zikmund, Ch23

“We are always acting on what has just finished happening. It happened at least 1/30th of a second ago. We think we’re in the present, but we aren’t. The present we know is only a movie of the past.”

Tom Wolfe in The Electric Kool-Aid Acid Test

Page 53: Research Methods William G. Zikmund, Ch23

Partitioning the Variance

$$(Y_i - \bar{Y}) = (\hat{Y}_i - \bar{Y}) + (Y_i - \hat{Y}_i)$$

Total deviation = Deviation explained by the regression + Deviation unexplained by the regression (residual error)

Page 54: Research Methods William G. Zikmund, Ch23

$\bar{Y}$ = Mean of the total group

$\hat{Y}_i$ = Value predicted with regression equation

$Y_i$ = Actual value

Page 55: Research Methods William G. Zikmund, Ch23

$$\sum (Y_i - \bar{Y})^2 = \sum (\hat{Y}_i - \bar{Y})^2 + \sum (Y_i - \hat{Y}_i)^2$$

Total variation = Explained variation + Unexplained variation (residual)

Page 56: Research Methods William G. Zikmund, Ch23

Sum of Squares

SSt = SSr + SSe
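The partition of the total sum of squares can be verified numerically. The sketch below fits a least-squares line to hypothetical data and then computes SSt, SSr, and SSe separately; the identity SSt = SSr + SSe holds only when the line is the least-squares fit. All names and data are illustrative.

```python
def fit_line(xs, ys):
    """Least-squares intercept and slope."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)
    slope = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
    return sum_y / n - slope * sum_x / n, slope

# Hypothetical observations.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

intercept, slope = fit_line(xs, ys)
y_bar = sum(ys) / len(ys)
y_hat = [intercept + slope * x for x in xs]

ss_total = sum((y - y_bar) ** 2 for y in ys)                 # SSt
ss_regression = sum((yh - y_bar) ** 2 for yh in y_hat)       # SSr (explained)
ss_error = sum((y - yh) ** 2 for y, yh in zip(ys, y_hat))    # SSe (unexplained)

print(f"SSt = {ss_total:.2f}, SSr = {ss_regression:.2f}, SSe = {ss_error:.2f}")
```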

Page 57: Research Methods William G. Zikmund, Ch23

Coefficient of Determination r²

• The proportion of variance in Y that is explained by X (or vice versa)

• A measure obtained by squaring the correlation coefficient; that proportion of the total variance of a variable that is accounted for by knowing the value of another variable

Page 58: Research Methods William G. Zikmund, Ch23

Coefficient of Determination r²

$$r^2 = \frac{SSr}{SSt} = 1 - \frac{SSe}{SSt}$$

Page 59: Research Methods William G. Zikmund, Ch23

Source of Variation

• Explained by Regression

• Degrees of Freedom
  – k - 1, where k = number of estimated constants (variables)

• Sum of Squares
  – SSr

• Mean Squared
  – SSr/(k - 1)

Page 60: Research Methods William G. Zikmund, Ch23

Source of Variation

• Unexplained by Regression

• Degrees of Freedom
  – n - k, where n = number of observations

• Sum of Squares
  – SSe

• Mean Squared
  – SSe/(n - k)

Page 61: Research Methods William G. Zikmund, Ch23

r² in the Example

$$r^2 = \frac{3{,}398.49}{3{,}882.4} = .875$$
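The same two sums of squares also feed the F-test from the analysis of variance summary table. The sketch below assumes n = 15 observations (the n used in the slope calculation) and k = 2 estimated constants (the intercept and the slope); both are assumptions of this illustration, and the resulting F ratio is computed here rather than quoted from the slides.

```python
# Sums of squares from the example.
ss_regression = 3398.49   # SSr, explained by the regression
ss_total = 3882.4         # SSt

# Assumptions for this sketch: 15 dealers, 2 estimated constants.
n = 15
k = 2

ss_error = ss_total - ss_regression            # SSe, unexplained
r_squared = ss_regression / ss_total

ms_regression = ss_regression / (k - 1)        # mean square, explained
ms_error = ss_error / (n - k)                  # mean square, unexplained
f_ratio = ms_regression / ms_error

print(f"r^2 = {r_squared:.3f}")
print(f"F({k - 1}, {n - k}) = {f_ratio:.2f}")
```

A large F ratio indicates that far more variability is explained by the regression than is left unexplained; it is then compared with the critical F value for (k - 1, n - k) degrees of freedom.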

Page 62: Research Methods William G. Zikmund, Ch23

Multiple Regression

• Extension of Bivariate Regression

• Multidimensional when three or more variables are involved

• Simultaneously investigates the effect of two or more variables on a single dependent variable

• Discussed in Chapter 24

Page 65: Research Methods William G. Zikmund, Ch23

Correlation: Player Salary and Ticket Price

[Figure: change in ticket price and change in player salary by year, 1995 to 2001, on a vertical axis from -20 to 30; correlation coefficient r = .75]
