Top Banner
29

Correlation Coefficients Pearson’s Product Moment Correlation Coefficient interval or ratio data only What about ordinal data?

Dec 22, 2015

Download

Documents

Primrose Bates
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?
Page 2: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

Correlation Coefficients

• Pearson’s Product Moment Correlation

Coefficient

interval or ratio data only

• What about ordinal data?

Page 3: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

Spearman’s Rank Correlation Coefficient

rs = 1 - di

2i=1

i=n

n3 - n

6

Page 4: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

http://www.mnstate.edu/wasson/ed602spearcorr.htm

Spearman’s Rank Correlation Coefficient: Example

Page 5: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

Spearman’s Rank Correlation Coefficient: Example

Page 6: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

A Significance Test for rs

SErs =

1

n -1

ttest = rs

SErs

= rs n -1

df = n - 1

Page 7: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

http://www.mnstate.edu/wasson/ed602spearcorr.htm

Spearman’s Rank Correlation Coefficient: Example

Page 8: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

Pearson’s r - Assumptions

1. Interval or ratio scale data

2. Selected randomly

3. Linear

4. Joint bivariate normal distribution

S-Plus (qqnorm)

Page 9: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

Spearman’s Rank Correlation Coefficient

• Ordinal data

• already in a ranked form

• Interval or ratio data

• convert it to rankings

Page 10: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

Spearman’s Rank Correlation Coefficient

TVDI (x)0.2740.5420.4190.2860.3740.4890.6230.5060.7680.725

Rank (x)17423586

109

Theta (y)0.4140.3590.3960.4580.3500.3570.2550.1890.1710.119

Rank (y)978

10564321

Difference (di)

-80-4-8-2-14388

Page 11: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

A Significance Test for rs

Page 12: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

S-Plus

http://www.mnstate.edu/wasson/ed602spearcorr.htm

Page 13: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

TVDI (x)0.2740.5420.4190.2860.3740.4890.6230.5060.7680.725

Theta (y)0.4140.3590.3960.4580.3500.3570.2550.1890.1710.119

Page 14: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

• Correlation Direction & Strength

• We might wish to go a little further

• Rate of change

• Predictability

Correlation Regression

Page 15: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

• Deterministic

perfect knowledge

• Probabilistic

estimate

not with absolute accuracy

(or certainty)

Two Sorts of Bivariate Relationships

Page 16: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

• Travel at a constant speed

• Deterministic time spent driving vs. distance traveled

A Deterministic Relationship

s = s0 + vt

s: distance traveleds0: initial distancev: speedt: time traveledtime (t)

distance (s)

slope (v)

intercept (s0)

• Truly deterministic rare

Page 17: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

• More often probabilistic

• e.g., ages vs. heights (2 – 20 yrs)

A Probabilistic Relationship

age (years)

height (meters)

• Good relationship

• Unpredictability or error

Page 18: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

Sampling and Regression

• Our expectation (less than perfect)

• Collecting data measurement errors

height

• Other factors (not accounted for in the model)

plant growth vs. T

Page 19: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

Simple vs. Multiple Regression

• Simple linear regression

y

x

• Multiple linear regression

y

x1, x2, … xn

Page 20: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

• Model

y = a + bx + e

Simple Linear Regression

x: independent variable

y: dependent variable

b: slope

a: intercepte: error term

x (independent)

y (dependent)

b

a

error:

Page 21: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

• Scatterplot fitting a line

Fitting a Line to a Set of Points

x (independent)

y (dependent)

• Least squares method

• Minimize the error term e

Page 22: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

Sampling and Regression

• Sampled data model

y = a + bx + e

• Attempt to estimate a “true” regression line

y = + x +

• Multiple samples several similar regression

lines the population regression line

Page 23: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

• Minimize the error term e

• The line of best fit

ŷ = a + b

Least Squares Method

y

ŷ = a + bxŷ

(y - ŷ)

Page 24: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

Estimates and Residuals

• Errors

e = y – ŷ

• Residuals

Underestimate

Overestimate

Page 25: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

• Errors (residuals)

e = (y - ŷ)

• Overall error

Simply sum these error terms 0

Square the differences and then sum them up to

create a useful estimate

Minimizing the Error Term

SSE = (y - ŷ)2

i = 1

n

Page 26: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

Minimizing the SSE

(y - ŷ)2

i = 1

n

mina,b

n

(yi - a - bxi)2

i = 1

mina,b

=

Page 27: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

• Least squares method

Finding Regression Coefficients

(xi - x) (yi - y)i = 1

n

b =

(xi - x)2

i = 1

n

a = y - bx

Page 28: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

Interpreting Slope (b)

• Slope of the line (b the change in y due to a unit change in x

b > 0 b < 0

Page 29: Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?

Regression Slope and Correlation

(xi - x)(yi - y)i=1

i=n

(n - 1) sXsY

r =(xi - x) (yi - y)i = 1

n

b =

(xi - x)2

i = 1

n

b = rsy

sx