Transcript

Introduction to applied statistics

& applied statistical methods

Prof. Dr. Chang Zhu 1

Overview

• Independent ANOVA

• Repeated measures ANOVA

• MANOVA

Analysis of Variance

• Enormously useful

• T-test compare two sets of scores or two

groups of participants

• ANOVA can be used for analysing more than

two groups or more than two conditions

Conditions before conducting

ANOVA

• The dependent variables should be interval or

ratio data

• Normal distribution

• Variances are equal

Analysis of Variance

One way ANOVA

One Independent

Variable

Between

subjects

Repeated

measures /

Within

subjects

Different

participants

Same

participants

Group

A

Group

B

Group

C

5 4 3

6 2 7

9 4 3

2 5 4

9 3 2

Time

A

Time

B

Time

C

1 2 4

5 5 9

7 8 6

3 5 8

2 2 4

Between Subjects ANOVA

Data points in each group are

unrelated

Repeated Measures ANOVA

Data points in each group are

related

One-way ANOVA

E.g.

• Are there differences of computer use skills

among participant groups in different study

domains?

One-way ANOVA

• F-ratio

F= Variance due to manipulation of IV/Error

variance

The larger the F-ratio, the greater the effect of

the IV compared to the error variance

• F (df)

• p <.05 • p<.01 • p<.001

(the means of the groups are different)

Post Hoc Analysis

• What ANOVA tells us:

– Rejection of the H0 tells you that there is a high

PROBABILITY that AT LEAST ONE difference

exists among the groups

• What ANOVA doesn’t tell us:

– Where the differences lie

• Post hoc analysis is needed to determine which

mean(s) is(are) different

One-way ANOVA post-hoc

analysis

• ANOVA determined that differences exist

among the means.

• Post hoc tests determine which means differ.

or

One-way ANOVA in SPSS

Compare Means > One-way ANOVA General Linear Model > Univariate

The ANOVA analysis results

• Brief report:

e.g.

• The ANOVA results show that there were

significant differences of xxxx (eg. the

powerpoint use) among the groups of

participants (F(df)=…., p<.05)

Results: brief example report

•Post-hoc analyses show that group x was

different to group y (mean difference=xx,

p<xx) and group z (mean difference=xx,

p<xx) ….

Effect size

• In experiential research, effect size is a useful measure.

• Effect size is the magnitude of the difference between groups

• For ANOVAs, the effect size can be calculated by:

r (or η: eta) , ω (omega) : effect

size

SSM: between-group effect

SST: total amount of variance in the

data

MSR: within-subject effect

dfM: degree of freedom, which is

the number of the groups minus 1 (these values are in the SPSS output)

Practice

Practice 1: independent ANOVA

H1: reward will lead to better exam

results than either punish or

indifferent.

H2: indifferent will lead to better

exam results than punish.

1

2

3

Practice 1: independent ANOVA

Carry out a one-way ANOVA and use planned

comparisons to test the hypotheses that

H1: reward results in better exam results than

either punishment or indifferent; and

H2: indifferent will lead to significantly better

exam results than punishment.

Analyze > Compare Means > One-way ANOVAs

The data file is teach.sav.

• Rule 1: We should be careful in pair selection as if we

exclude any group in one comparison, it will be excluded

in subsequent comparison as well.

• Rule 2: Groups coded with positive weights will be

compared against groups coded with negative weights.

• Rule 3: The sum of weights for a comparison should be

zero.

• Rule 4: If a group is not involved in a comparison,

automatically assign it a weight of 0.

• Rule 5: For a given contrast, the weights assigned to the

group(s) not included in the contrast should be equal to

the number of groups included in the pair comparison.

(Field, 2009)

Practice 1: independent ANOVA

(rules for contrast weights

Practice 1: independent ANOVA

H2: indifferent will lead

to better exam results

than punish.

H1: reward will lead to

better exam results

than either punish or

indifferent.

contrast 1 condition contrast 2

1 punish (1) 1

1 indifferent (2) -1

-2 reward (3) 0

Practice 1: independent ANOVA

(Post Hoc)

• Equal variances assumed: R-E-G-W-Q, Tukey, Dunnnett

• Equal variances not assumed: Games-Howell

Practice 1: independent ANOVA

(SPSS output)

ANOVA

Exam Mark

Sum of Squares df

Mean

Square F Sig.

Between

Groups

(Combined) 1205.067 (SSM) 2 (dfM) 602.533 21.008 .000

Linear Term Contrast 1185.800 1 1185.800 41.344 .000

Deviation 19.267 1 19.267 .672 .420

Quadratic Term Contrast 19.267 1 19.267 .672 .420

Within Groups 774.400 27

28.681

(MSR)

Total 1979.467 (SST) 29

There is a significant difference in exam marks among

different teaching conditions.

Practice 1: independent ANOVA

(SPSS output)

Contrast Tests

Contrast

Value of

Contrast SE t df

Sig. (2-

tailed)

Exam

Mark

Assume equal

variances 1 -24.8000 4.14836 -5.978 27 .000

2 -6.0000 2.39506 -2.505 27 .019

H1: reward will lead to better exam results than either

punish or indifferent.

H2: indifferent will lead to better exam results than punish.

Practice 1: independent ANOVA

(report)

There was a significant effect of teaching conditions on exam

marks, F (2, 27) = 21.01, p < .001, ω = .76. Planned

contrasts revealed that reward produced significantly better

exam grades than punishment and indifference, t (27) = -

5.978, p < .01, r = .75 and that punishment produced

significantly lower exam marks than indifference, t (27) = -

2.51, p < .05, r = .43.

Independent vs. Repeated measures

ANOVA

• There are two possible scenarios when

obtaining various sets of data for comparison:

– Independent samples: The data in the first sample

is completely independent from the data in the

other samples.

– Dependent/Related samples: The sets of data are

dependent on one another. There is a relationship

between/among the sets of data.

• Three or more data sets?

– If three or more sets of data are

independent of one another Independent

(ANOVA)

– If three or more sets of data are dependent

on one another Repeated Measures

ANOVA

Independent vs. Repeated measures

ANOVA

Post hoc testing

• Significant F value

– At least one condition mean is significantly different from

the others

• But which one?

• Post hoc tests

– Bonferroni

– Tukey

– Sidak

– ….

Practice 2: repeated measures ANOVA

Tutors Essays

1. Dr Field 8

2. Dr Smith 8

3. Dr. Scrote 8

4. Dr. Deadth 8

Are there significant differences in the essay marking

among the tutors?

Analyze > General Linear Model > Repeated Measures

The data file is TutorMarks.sav.

Practice 2: repeated measures ANOVA

(SPSS output)

Tests of Within-Subjects Effects

Measure: MEASURE_1

Source Type III Sum of Squares df Mean Square F Sig.

tutor Sphericity Assumed 554.125 (SSM) 3 184.708 (MSM) 3.700 .028

Greenhouse-Geisser 554.125 1.673 331.245 3.700 .063

Huynh-Feldt 554.125 2.137 259.329 3.700 .047

Lower-bound 554.125 1.000 554.125 3.700 .096

Error(tutor) Sphericity Assumed 1048.375 (SSR) 21 49.923 (MSR)

Greenhouse-Geisser 1048.375 11.710 89.528

Huynh-Feldt 1048.375 14.957 70.091

Lower-bound 1048.375 7.000 149.768

Practice 2: repeated measures ANOVA

(SPSS output)

Tests of Within-Subjects Contrasts

Measure: MEASURE_1

Source tutor

Type III

Sum of

Squares df

Mean

Square F Sig.

Partial

Eta

Squared

tutor Level 1 vs. Level 2

(Dr. Field and Dr. Smith) 171.125 1 171.125 18.184 .004 .722

Level 2 vs. Level 3 8.000 1 8.000 .152 .708 .021

Level 3 vs. Level 4 496.125 1 496.125 3.436 .106 .329

tutora

Measure: MEASURE_1

Dependent Variable

tutor

Level 1 vs. Level 2 Level 2 vs. Level 3 Level 3 vs. Level 4

Dr. Field (1) 1 0 0

Dr. Smith (2) -1 1 0

Dr. Scrote (3) 0 -1 1

Dr. Death (4) 0 0 -1

Practice 2: repeated measure ANOVA

(report)

Mauchly’s test indicated that the assumption of sphericity had

been violated, χ² (5) = 11.63, p < .05, therefore degrees of

freedom were corrected using Greenhouse-Geisser

estimates of sphericity (ε = .556). The results show that there

were no significant differences in essay marking among the

tutors, F (1.67, 11.71) = 3.7, p > .05.

MANOVA

• One-Way Multivariate Analysis of Variance

– Multivariate analysis of variance (MANOVA) is a multivariate extension of analysis of variance.

– As with ANOVA, the independent variables for a MANOVA are factors, and each factor has two or more levels.

– Unlike ANOVA, MANOVA includes multiple dependent variables rather than a single dependent variable.

– MANOVA evaluates whether the population means on a set of dependent variables vary across levels of a factor or factors.

MANOVA

• Understanding One-Way MANOVA – A one-way MANOVA tests the

hypothesis that the population means for the dependent variables are the same (or not) for all levels of the factor, that is, across all groups.

MANOVA

– If a one-way MANOVA is significant, follow-up analyses can assess whether there are differences among groups on the population means on certain dependent variables and on particular linear combinations of dependent variables.

– The most popular follow-up approach is to conduct multiple ANOVAs, one for each dependent variable.

ANOVA vs. MANOVA

• In all cases ANOVAs have only 1 dependent variable (they are univariate tests)

• When you have more than 1 related dependent variables you need to conduct a MANOVA

– 2 or more DVs (interval / ratio)

– 1 or more categorical IVs

• MANOVA can be one-way, two-way, between-groups, repeated measures and mixed

ANOVA vs. MANOVA

• Why not multiple ANOVAs?

• ANOVAs run separately cannot take into

account the pattern of covariation among the

dependent measures – It may be possible that multiple ANOVAs may show no

differences while the MANOVA brings them out.

– MANOVA is sensitive not only to mean differences but

also to the direction and size of correlations among the

dependent variables.

MANOVA

• an extension of ANOVA in which main effects

and interactions are assessed on a combination

of DVs.

• MANOVA tests whether mean differences

among groups on a combination of DVs is

likely to occur (by chance or not).

MANOVA

– SPSS reports a number of statistics to

evaluate the MANOVA hypothesis, labeled

Wilks’ Lambda, Pillai’s Trace, Hotelling’s

Trace, and Roy’s Largest Root.

• Each statistic evaluates a multivariate

hypothesis that the population means are equal.

• We will use Wilks’ lambda (Λ) because it is

frequently reported in social science and

business literatures.

• Pillai’s trace (V) is a reasonable alternative to

Wilks’ lambda.

Interpretation of the output

2 important tables:

• Multivariate tests

– Wilks’ Lambda (most commonly used)

– Pillai’s Trace (most robust)

(see Tabachnick & Fidell, 2007)

• Tests of between-subjects effects (ANOVAs)

– Use a Bonferroni Adjustment

– Check Sig. column

Interpretation of the output

• Effect size

– Partial Eta Squared: the proportion of the variance in the DV that can be explained by the IV (see Cohen, 1988)

• Comparing group means

– Estimated marginal means

• Follow-up analyses

(see Hair et al., 1998; Weinfurt, 1995)

Weinfurt, K. P. (1995). Multivariate analysis of variance.

In L. G. Grimm, & P. R. Yarnold (Eds.), Reading and understanding multivariate statistics. Washington, DC: APA. [QA278 .R43 1995]

Post-hoc analysis

• If the multivariate test chosen is significant,

you’ll want to continue your analysis to discern

the nature of the differences.

• A first step would be to check the plots of mean

group differences for each DV.

• Graphical display will enhance interpretability

and understanding of what might be going on

(however it is still in ‘univariate’ mode).

• A discriminant analysis following a MANOVA

is also recommended.

Practice 3: MANOVA

Five knowledge tests

1.Exper (experimental psychology

such as cognitive and

neuropsychology etc.)

2.Stats (statistics);

3.Social (social psychology);

4.Develop (developmental

psychology);

5.Person (personality).

Three cohorts:

•First year

•Second year

•Third year

Are there are overall group differences along these five

measures?

The data file is

psychology.sav.

Practice 3: MANOVA

Five knowledge tests

1.Exper (experimental psychology

such as cognitive and

neuropsychology etc.)

2.Stats (statistics);

3.Social (social psychology);

4.Develop (developmental

psychology);

5.Person (personality).

Three cohorts:

•First year

•Second year

•Third year

Are there are overall group differences along these five

measures?

The data file is

psychology.sav.

Analyze > General Linear Model > Multivariate

Practice 3: MANOVA

(report)

Using Pillai’s trace, there was a significant difference in the

scores on the five knowledge tests among the first, second,

and third year students, V = .51, F (10, 68) = 2.33, p < .05.

Assignment 8

• Detail:

Lecture 8_practical guidelines_assignment

(p. 17)

Deadline: December 24, 2014

• Questions?

45

top related