Sociology 5811: T-Tests for Difference in Means Wes Longhofer, pinch-hitting for Evan Schofer.

Sociology 5811:T-Tests for Difference in Means

Wes Longhofer, pinch-hitting for Evan Schofer

Strategy for Mean Difference

• We never know true population means• So, we never know true value of difference in means

• So, we don’t know if groups really differ

• If we can figure out the sampling distribution of the difference in means…

• We can guess the range in which it typically falls

• If it is improbable for the sampling distribution to overlap with zero, then the population means probably differ

• An extension of the Central Limit Theorem provides information necessary to do calculations!

Sampling Distribution for Difference in Means

• The mean (Y-bar) is a variable that changes depending on the particular sample we took

• Similarly, the differences in means for two groups varies, depending on which two samples we chose

• The distribution of all possible estimates of the difference in means is a sampling distribution!

• The “sampling distribution of differences in means”

• It reflects the full range of possible estimates of the difference in means.

Mean Differences for Small Samples

• Sample Size: rule of thumb

• Total N (of both groups) > 100 can safely be treated as “large” in most cases

• Total N (of both groups) < 100 is possibly problematic

• Total N (of both groups) < 60 is considered “small” in most cases

• If N is small, the sampling distribution of mean difference cannot be assumed to be normal

• Again, we turn to the T-distribution.


• To use T-tests for small samples, the following criteria must be met:

• 1. Both samples are randomly drawn from normally distributed populations

• 2. Both samples have roughly the same variance (and thus same standard deviation)

• To the extent that these assumptions are violated, the T-test will become less accurate

• Check histogram to verify!

• But, in practice, T-tests are fairly robust.


• For small samples, the estimator of the Standard Error is derived from the variance of both groups (i.e. it is “pooled”)

• Formulas:

2

))(1())(1(s

21

222

211

)Y-Y( 21

NN

sNsN

Probabilities for Mean Difference

• A T-value may be calculated:

21

212

1121

21

NNs

)YY(t

)YY(

)N(N

• Where (N1 + N2 – 2) refers to the number of degrees of freedom– Recall, t is a “family” of distributions– Look up t-dist for “N1 + N2 -2” degrees of freedom.

T-test for Mean Difference

• Back to the example: 20 boys & 20 girls

• Boys: Y-bar = 72.75, s = 8.80

• Girls: Y-bar = 78.20, s = 9.55

• Let’s do a hypothesis test to see if the means differ:

• Use -level of .05

• H0: Means are the same (boys = girls)

• H1: Means differ (boys ≠ girls).


21

212

1121

21

NNs

)YY(t

)YY(

)N(N

201

201

45.5

21

38

)YY(

)(

s

)(t

• Calculate t-value:

T-Test for Mean Difference

• We need to calculate the Standard Error of the difference in means:

2

))(1())(1(s

21

222

211

)Y-Y( 21

NN

sNsN

38

)55.9)(19()80.8)(19(s

22

)Y-Y( 21


• We also need to calculate the Standard Error of the difference in means:

38

)85.1732()36.1471(s )Y-Y( 21

18.932.84s )Y-Y( 21


21

212

1121

21

NNs

)YY(t

)YY(

)N(N

201

201

)18.9(

45.538

)(t )(

• Plugging in Values:


)316)(.18.9(

45.538

)(t )(

88.1)90.2(

45.538

)(t )(


• Question: What is the critical value for =.05, two-tailed T-test, 38 degrees of freedom (df)?

• Answer: Critical Value = approx. 2.03

• Observed T-value = 1.88

• Can we reject the null hypothesis (H0)?

• Answer: No! Not quite!• We reject when t > critical value


• The two-tailed test hypotheses were:

GirlsBoys μμ :H1

GirlsBoys μμ :H0

• Question: What hypotheses would we use for the one-tailed test?

GirlsBoys μμ :H1

GirlsBoys μμ :H0


• Question: What is the critical value for =.05, one-tailed T-test, 38 degrees of freedom (df)?

• Answer: Around 1.684 (40 df)

• One-tailed test: T =1.88 > 1.684• We can reject the null hypothesis!!!

• Moral of the story:• If you have strong directional suspicions ahead of time, use

a one-tailed test. It increases your chances of rejecting H0.

• But, it wouldn’t have made a difference at =.01

Another Example• Question: Do the mean batting averages for American

League and National League teams differ?• Use a random sample of teams over time

• American League: Y-bar = .2677, s = .0068, N=14• National League: Y-bar = .2615, s = .0063, N=16• Let’s do a hypothesis test to see if the means differ:

• Use -level of .05

• H0: Means are the same (American = National)

• H1: Means differ (American ≠ National)


21

212

1121

21

NNs

)YY(t

)YY(

)N(N

161

141

0062.

21

28

)YY(

)(

s

)(t

• Calculate t-value:


• We need to calculate the Standard Error of the difference in means:

2

))(1())(1(s

21

222

211

)Y-Y( 21

NN

sNsN

28

)0063)(.15()0068)(.13(s

22

)Y-Y( 21


• We also need to calculate the Standard Error of the difference in means:

28

)0006(.)0006(.s )Y-Y( 21

0065.s )Y-Y( 21


21

212

1121

21

NNs

)YY(t

)YY(

)N(N

161

141

)0065(.

0062.)28(

)(t

• Plugging in Values:


)366)(.0065(.

0062.28

)(t )(

58.2)0024(.

0062.28

)(t )(


• Question: What is the critical value for =.05, two-tailed T-test, 28 degrees of freedom (df)?

• Answer: Critical Value = approx. 2.05

• Observed T-value = 2.58

• Can we reject the null hypothesis (H0)?

• Answer: Yes• We reject when t > critical value

• What if we used an -level of .01?– Critical value=2.76

T-Test for Mean Difference• Question: What if you wanted to compare 3 or

more groups, instead of just two?• Example: Test scores for students in different educational

tracks: honors, regular, remedial

• Can you use T-tests for 3+ groups?• Answer: Sort of… You can do a T-test for every

combination of groups• e.g., honors & reg, honors & remedial, reg & remedial

• But, the possibility of a Type I error proliferates… 5% for each test

• With 5 groups, chance of error reaches 50%• Solution: ANOVA.

Sociology 5811: T-Tests for Difference in Means Wes Longhofer, pinch-hitting for Evan Schofer.

Documents

Sociology 5811: T-Tests for Difference in Means Wes Longhofer, pinch-hitting for Evan Schofer.