Top Banner
1 Analysis of Variance
56
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Analysis of variance ppt @ bec doms

1

Analysis of Variance

Page 2: Analysis of variance ppt @ bec doms

2

Chapter Goals

After completing this chapter, you should be able to:

Recognize situations in which to use analysis of variance

Understand different analysis of variance designs

Perform a single-factor hypothesis test and interpret results

Conduct and interpret post-analysis of variance pairwise comparisons procedures

Set up and perform randomized blocks analysis

Analyze two-factor analysis of variance test with replications results

Page 3: Analysis of variance ppt @ bec doms

3

Chapter Overview

Analysis of Variance (ANOVA)

F-testF-test

Tukey-Kramer

testFisher’s Least

SignificantDifference test

One-Way ANOVA

Randomized Complete

Block ANOVA

Two-factor ANOVA

with replication

Page 4: Analysis of variance ppt @ bec doms

4

General ANOVA Setting Investigator controls one or more independent

variables Called factors (or treatment variables) Each factor contains two or more levels (or

categories/classifications)

Observe effects on dependent variable Response to levels of independent variable

Experimental design: the plan used to test hypothesis

Page 5: Analysis of variance ppt @ bec doms

5

One-Way Analysis of Variance Evaluate the difference among the means of three or

more populations

Examples: Accident rates for 1st, 2nd, and 3rd shift

Expected mileage for five brands of tires

Assumptions Populations are normally distributed Populations have equal variances Samples are randomly and independently drawn

Page 6: Analysis of variance ppt @ bec doms

6

Completely Randomized Design Experimental units (subjects) are assigned

randomly to treatments Only one factor or independent variable

With two or more treatment levels

Analyzed by One-factor analysis of variance (one-way ANOVA)

Called a Balanced Design if all factor levels have equal sample size

Page 7: Analysis of variance ppt @ bec doms

7

Hypotheses of One-Way ANOVA

All population means are equal

i.e., no treatment effect (no variation in means among groups)

At least one population mean is different

i.e., there is a treatment effect

Does not mean that all population means are different (some pairs may be the same)

k3210 μμμμ:H

same the are means population the of all Not:HA

Page 8: Analysis of variance ppt @ bec doms

8

One-Factor ANOVA

All Means are the same:The Null Hypothesis is True

(No Treatment Effect)

k3210 μμμμ:H

same the are μ all Not:H iA

321 μμμ

Page 9: Analysis of variance ppt @ bec doms

9

One-Factor ANOVA

At least one mean is different:The Null Hypothesis is NOT true

(Treatment Effect is present)

k3210 μμμμ:H

same the are μ all Not:H iA

321 μμμ 321 μμμ

or

(continued)

Page 10: Analysis of variance ppt @ bec doms

10

Partitioning the Variation Total variation can be split into two parts:

SST = Total Sum of SquaresSSB = Sum of Squares BetweenSSW = Sum of Squares Within

SST = SSB + SSW

Page 11: Analysis of variance ppt @ bec doms

11

Partitioning the Variation

Total Variation = the aggregate dispersion of the individual data values across the various factor levels (SST)

Within-Sample Variation = dispersion that exists among the data values within a particular factor level (SSW)

Between-Sample Variation = dispersion among the factor sample means (SSB)

SST = SSB + SSW

(continued)

Page 12: Analysis of variance ppt @ bec doms

12

Partition of Total Variation

Variation Due to Factor (SSB)

Variation Due to Random Sampling (SSW)

Total Variation (SST)Commonly referred to as: Sum of Squares Within Sum of Squares Error Sum of Squares Unexplained Within Groups Variation

Commonly referred to as: Sum of Squares Between Sum of Squares Among Sum of Squares Explained Among Groups Variation

= +

Page 13: Analysis of variance ppt @ bec doms

13

Total Sum of Squares

k

i

n

jij

i

)xx(SST1 1

2

Where:

SST = Total sum of squares

k = number of populations (levels or treatments)

ni = sample size from population i

xij = jth measurement from population i

x = grand mean (mean of all data values)

SST = SSB + SSW

Page 14: Analysis of variance ppt @ bec doms

14

Total Variation(continued)

Group 1 Group 2 Group 3

Response, X

X

2212

211 )xx(...)xx()xx(SST

kkn

Page 15: Analysis of variance ppt @ bec doms

15

Sum of Squares Between

Where:

SSB = Sum of squares between

k = number of populations

ni = sample size from population i

xi = sample mean from population i

x = grand mean (mean of all data values)

2

1

)xx(nSSB i

k

ii

SST = SSB + SSW

Page 16: Analysis of variance ppt @ bec doms

16

Between-Group Variation

Variation Due to Differences Among Groups

i j

2

1

)xx(nSSB i

k

ii

1

k

SSBMSB

Mean Square Between =

SSB/degrees of freedom

Page 17: Analysis of variance ppt @ bec doms

17

Between-Group Variation(continued)

Group 1 Group 2 Group 3

Response, X

X1X 2X

3X

2222

211 )xx(n...)xx(n)xx(nSSB kk

Page 18: Analysis of variance ppt @ bec doms

18

Sum of Squares Within

Where:

SSW = Sum of squares within

k = number of populations

ni = sample size from population i

xi = sample mean from population i

xij = jth measurement from population i

2

11

)xx(SSW iij

n

j

k

i

j

SST = SSB + SSW

Page 19: Analysis of variance ppt @ bec doms

19

Within-Group Variation

Summing the variation within each group and then adding over all groups

i

kN

SSWMSW

Mean Square Within =

SSW/degrees of freedom

2

11

)xx(SSW iij

n

j

k

i

j

Page 20: Analysis of variance ppt @ bec doms

20

Within-Group Variation(continued)

Group 1 Group 2 Group 3

Response, X

1X 2X3X

22212

2111 )xx(...)xx()xx(SSW kknk

Page 21: Analysis of variance ppt @ bec doms

21

One-Way ANOVA Table

Source of Variation

dfSS MS

Between Samples

SSB MSB =

Within Samples

N - kSSW MSW =

Total N - 1SST =SSB+SSW

k - 1 MSB

MSW

F ratio

k = number of populationsN = sum of the sample sizes from all populationsdf = degrees of freedom

SSB

k - 1

SSW

N - k

F =

Page 22: Analysis of variance ppt @ bec doms

22

One-Factor ANOVAF Test Statistic

Test statistic

MSB is mean squares between variances

MSW is mean squares within variances

Degrees of freedom df1 = k – 1 (k = number of populations)

df2 = N – k (N = sum of sample sizes from all populations)

MSW

MSBF

H0: μ1= μ2 = … = μ k

HA: At least two population means are different

Page 23: Analysis of variance ppt @ bec doms

23

Interpreting One-Factor ANOVA F Statistic

The F statistic is the ratio of the between estimate of variance and the within estimate of variance The ratio must always be positive df1 = k -1 will typically be small df2 = N - k will typically be large

The ratio should be close to 1 if H0: μ1= μ2 = … = μk is true

The ratio will be larger than 1 if H0: μ1= μ2 = … = μk is false

Page 24: Analysis of variance ppt @ bec doms

24

One-Factor ANOVA F Test Example

You want to see if three different golf clubs yield different distances. You randomly select five measurements from trials on an automated driving machine for each club. At the .05 significance level, is there a difference in mean distance?

Club 1 Club 2 Club 3254 234 200263 218 222241 235 197237 227 206251 216 204

Page 25: Analysis of variance ppt @ bec doms

25

••••

One-Factor ANOVA Example: Scatter Diagram

270

260

250

240

230

220

210

200

190

••

•••

•••••

Distance

1X

2X

3X

X

227.0 x

205.8 x 226.0x 249.2x 321

Club 1 Club 2 Club 3254 234 200263 218 222241 235 197237 227 206251 216 204

Club1 2 3

Page 26: Analysis of variance ppt @ bec doms

26

One-Factor ANOVA Example Computations

Club 1 Club 2 Club 3254 234 200263 218 222241 235 197237 227 206251 216 204

x1 = 249.2

x2 = 226.0

x3 = 205.8

x = 227.0

n1 = 5

n2 = 5

n3 = 5

N = 15

k = 3

SSB = 5 [ (249.2 – 227)2 + (226 – 227)2 + (205.8 – 227)2 ] = 4716.4

SSW = (254 – 249.2)2 + (263 – 249.2)2 +…+ (204 – 205.8)2 = 1119.6

MSB = 4716.4 / (3-1) = 2358.2

MSW = 1119.6 / (15-3) = 93.325.275

93.3

2358.2F

Page 27: Analysis of variance ppt @ bec doms

27 F = 25.275

One-Factor ANOVA Example Solution

H0: μ1 = μ2 = μ3

HA: μi not all equal

= .05

df1= 2 df2 = 12

Test Statistic:

Decision:

Conclusion:

Reject H0 at = 0.05

There is evidence that at least one μi differs from the rest

0

= .05

F.05 = 3.885Reject H0Do not

reject H0

25.27593.3

2358.2

MSW

MSBF

Critical Value:

F = 3.885

Page 28: Analysis of variance ppt @ bec doms

28

SUMMARY

Groups Count Sum Average Variance

Club 1 5 1246 249.2 108.2

Club 2 5 1130 226 77.5

Club 3 5 1029 205.8 94.2

ANOVA

Source of Variation

SS df MS F P-value F crit

Between Groups

4716.4 2 2358.2 25.275 4.99E-05 3.885

Within Groups

1119.6 12 93.3

Total 5836.0 14        

ANOVA -- Single Factor:Excel Output

EXCEL: tools | data analysis | ANOVA: single factor

Page 29: Analysis of variance ppt @ bec doms

29

The Tukey-Kramer Procedure Tells which population means are

significantly different e.g.: μ1 = μ2 μ3

Done after rejection of equal means in ANOVA

Allows pair-wise comparisons Compare absolute mean differences with critical

range

xμ1 = μ

3

Page 30: Analysis of variance ppt @ bec doms

30

Tukey-Kramer Critical Range

where:q = Value from standardized range table with k

and N - k degrees of freedom for the desired level of MSW = Mean Square Within ni and nj = Sample sizes from populations (levels) i and j

ji n

1

n

1

2

MSWqRange Critical

Page 31: Analysis of variance ppt @ bec doms

31

The Tukey-Kramer Procedure: Example

1. Compute absolute mean differences:Club 1 Club 2 Club 3254 234 200263 218 222241 235 197237 227 206251 216 204 20.2205.8226.0xx

43.4205.8249.2xx

23.2226.0249.2xx

32

31

21

2. Find the q value from the table in appendix J with k and N - k degrees of freedom for the desired level of

3.77qα

Page 32: Analysis of variance ppt @ bec doms

32

The Tukey-Kramer Procedure: Example

5. All of the absolute mean differences are greater than critical range. Therefore there is a significant difference between each pair of means at 5% level of significance.

16.2855

1

5

1

2

93.33.77

n

1

n

1

2

MSWqRange Critical

jiα

3. Compute Critical Range:

20.2xx

43.4xx

23.2xx

32

31

21

4. Compare:

Page 33: Analysis of variance ppt @ bec doms

33

Tukey-Kramer in PHStat

Page 34: Analysis of variance ppt @ bec doms

34

Randomized Complete Block ANOVA Like One-Way ANOVA, we test for equal population means

(for different factor levels, for example)...

...but we want to control for possible variation from a second factor (with two or more levels)

Used when more than one factor may influence the value of the dependent variable, but only one is of key interest

Levels of the secondary factor are called blocks

Page 35: Analysis of variance ppt @ bec doms

35

Partitioning the Variation Total variation can now be split into three

parts:

SST = Total sum of squaresSSB = Sum of squares between factor levelsSSBL = Sum of squares between blocksSSW = Sum of squares within levels

SST = SSB + SSBL + SSW

Page 36: Analysis of variance ppt @ bec doms

36

Sum of Squares for Blocking

Where:

k = number of levels for this factor

b = number of blocks

xj = sample mean from the jth block

x = grand mean (mean of all data values)

2

1

)xx(kSSBL j

b

j

SST = SSB + SSBL + SSW

Page 37: Analysis of variance ppt @ bec doms

37

Partitioning the Variation Total variation can now be split into three

parts:

SST and SSB are computed as they were in One-Way ANOVA

SST = SSB + SSBL + SSW

SSW = SST – (SSB + SSBL)

Page 38: Analysis of variance ppt @ bec doms

38

Mean Squares

1

k

SSBbetween square MeanMSB

1

b

SSBLblocking square MeanMSBL

)b)(k(

SSW withinsquare MeanMSW

11

Page 39: Analysis of variance ppt @ bec doms

39

Randomized Block ANOVA Table

Source of Variation

dfSS MS

Between Samples

SSB MSB

Within Samples

(k–1)(b-1)SSW MSW

Total N - 1SST

k - 1

MSBL

MSW

F ratio

k = number of populations N = sum of the sample sizes from all populationsb = number of blocks df = degrees of freedom

Between Blocks

SSBL b - 1 MSBL

MSB

MSW

Page 40: Analysis of variance ppt @ bec doms

40

Blocking Test Blocking test: df1 = b - 1

df2 = (k – 1)(b – 1)

MSBL

MSW

...μμμ:H b3b2b10

equal are means block all Not:HA

F =

Reject H0 if F > F

Page 41: Analysis of variance ppt @ bec doms

41

Main Factor test: df1 = k - 1

df2 = (k – 1)(b – 1)

MSB

MSW

k3210 μ...μμμ:H

equal are means population all Not:HA

F =

Reject H0 if F > F

Main Factor Test

Page 42: Analysis of variance ppt @ bec doms

42

Fisher’s Least Significant Difference Test

To test which population means are significantly different e.g.: μ1 = μ2 ≠ μ3

Done after rejection of equal means in randomized block ANOVA design

Allows pair-wise comparisons Compare absolute mean differences with critical

range

x = 1 2 3

Page 43: Analysis of variance ppt @ bec doms

43

Fisher’s Least Significant Difference (LSD) Test

where: t/2 = Upper-tailed value from Student’s t-distribution

for /2 and (k -1)(n - 1) degrees of freedom MSW = Mean square within from ANOVA table

b = number of blocks k = number of levels of the main factor

b

2MSWtLSD /2

Page 44: Analysis of variance ppt @ bec doms

44

...etc

xx

xx

xx

32

31

21

Fisher’s Least Significant Difference (LSD) Test

(continued)

b

2MSWtLSD /2

If the absolute mean difference is greater than LSD then there is a significant difference between that pair of means at the chosen level of significance.

Compare:

?LSDxxIs ji

Page 45: Analysis of variance ppt @ bec doms

45

Two-Way ANOVA Examines the effect of

Two or more factors of interest on the dependent variable e.g.: Percent carbonation and line speed on soft drink

bottling process

Interaction between the different levels of these two factors e.g.: Does the effect of one particular percentage of

carbonation depend on which level the line speed is set?

Page 46: Analysis of variance ppt @ bec doms

46

Two-Way ANOVA Assumptions

Populations are normally distributed

Populations have equal variances

Independent random samples are drawn

(continued)

Page 47: Analysis of variance ppt @ bec doms

47

Two-Way ANOVA Sources of Variation

Two Factors of interest: A and B

a = number of levels of factor A

b = number of levels of factor B

N = total number of observations in all cells

Page 48: Analysis of variance ppt @ bec doms

48

Two-Way ANOVA Sources of Variation

SSTTotal Variation

SSA

Variation due to factor A

SSB

Variation due to factor B

SSAB

Variation due to interaction between A and B

SSEInherent variation (Error)

Degrees of Freedom:

a – 1

b – 1

(a – 1)(b – 1)

N – ab

N - 1

SST = SSA + SSB + SSAB + SSE

(continued)

Page 49: Analysis of variance ppt @ bec doms

49

Two Factor ANOVA Equations

a

i

b

j

n

kijk )xx(SST

1 1 1

2

2

1

)xx(nbSSa

iiA

2

1

)xx(naSSb

jjB

Total Sum of Squares:

Sum of Squares Factor A:

Sum of Squares Factor B:

Page 50: Analysis of variance ppt @ bec doms

50

Two Factor ANOVA Equations

2

1 1

)xxxx(nSSa

i

b

jjiijAB

a

i

b

j

n

kijijk )xx(SSE

1 1 1

2

Sum of Squares Interaction Between A and B:

Sum of Squares Error:

(continued)

Page 51: Analysis of variance ppt @ bec doms

51

Two Factor ANOVA Equations

where:Mean Grand

nab

x

x

a

i

b

j

n

kijk

1 1 1

Afactor of level each of Meannb

x

x

b

j

n

kijk

i

1 1

B factor of level each of Meanna

xx

a

i

n

kijk

j

1 1

cell each of Meann

xx

n

k

ijkij

1

a = number of levels of factor A

b = number of levels of factor B

n’ = number of replications in each cell

(continued)

Page 52: Analysis of variance ppt @ bec doms

52

Mean Square Calculations

1

a

SS Afactor square MeanMS A

A

1

b

SSB factor square MeanMS B

B

)b)(a(

SSninteractio square MeanMS AB

AB 11

abN

SSEerror square MeanMSE

Page 53: Analysis of variance ppt @ bec doms

53

Two-Way ANOVA:The F Test Statistic

F Test for Factor B Main Effect

F Test for Interaction Effect

H0: μA1 = μA2 = μA3 = • • •

HA: Not all μAi are equal

H0: factors A and B do not interact to affect the mean response

HA: factors A and B do interact

F Test for Factor A Main Effect

H0: μB1 = μB2 = μB3 = • • •

HA: Not all μBi are equal

Reject H0

if F > FMSE

MSF A

MSE

MSF B

MSE

MSF AB

Reject H0

if F > F

Reject H0

if F > F

Page 54: Analysis of variance ppt @ bec doms

54

Two-Way ANOVASummary Table

Source ofVariation

Sum ofSquares

Degrees of Freedom

Mean Squares

FStatistic

Factor A SSA a – 1MSA

= SSA /(a – 1)

MSA

MSE

Factor B SSB b – 1MSB

= SSB /(b – 1)

MSB

MSE

AB(Interaction)

SSAB (a – 1)(b – 1)MSAB

= SSAB / [(a – 1)(b – 1)]

MSAB

MSE

Error SSE N – abMSE =

SSE/(N – ab)

Total SST N – 1

Page 55: Analysis of variance ppt @ bec doms

55

Features of Two-Way ANOVA F Test

Degrees of freedom always add up

N-1 = (N-ab) + (a-1) + (b-1) + (a-1)(b-1)

Total = error + factor A + factor B + interaction

The denominator of the F Test is always the same but the numerator is different

The sums of squares always add up

SST = SSE + SSA + SSB + SSAB

Total = error + factor A + factor B + interaction

Page 56: Analysis of variance ppt @ bec doms

56

Examples:Interaction vs. No Interaction

No interaction:

1 2

Factor B Level 1

Factor B Level 3

Factor B Level 2

Factor A Levels 1 2

Factor B Level 1

Factor B Level 3

Factor B Level 2

Factor A Levels

Mea

n R

espo

nse

Mea

n R

espo

nse

Interaction is present: