Top Banner
Questions of Goodness of Fit
79
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Questions of goodness of fit

Questions of Goodness of Fit

Page 2: Questions of goodness of fit

Is the problem you are working on a question of Goodness of Fit?

Page 3: Questions of goodness of fit

Is the problem you are working on a question of Goodness of Fit?

Page 4: Questions of goodness of fit

Questions of Goodness of Fit have become increasingly important in modern statistics.

Page 5: Questions of goodness of fit

Goodness of fit is a method used to determine how close a hypothesized pattern fits an observed pattern.

Page 6: Questions of goodness of fit

Goodness of fit is a method used to determine how close a hypothesized pattern fits an observed pattern.

Hypothesized Pattern- the way you think

things are.

Page 7: Questions of goodness of fit

Goodness of fit is a method used to determine how close a hypothesized pattern fits an observed pattern.

fits

Hypothesized Pattern- the way you think

things are.

Page 8: Questions of goodness of fit

Goodness of fit is a method used to determine how close a hypothesized pattern fits an observed pattern.

Observed Pattern – the way things

actually are.

Hypothesized Pattern- the way you think

things are.

fits

Page 9: Questions of goodness of fit

For example, let’s say we hypothesize that there are an equal number of females as there are males in the town of Solvang, California.

Page 10: Questions of goodness of fit

So, in a sample of 200 Solvangans we would hypothesize that 100 would be female.

Page 11: Questions of goodness of fit

So, in a sample of 200 Solvangans we would hypothesize that 100 would be female.

The hypothesized number

of females in a sample of 200 is

100

Page 12: Questions of goodness of fit

So, in a sample of 200 Solvangans we would hypothesize that 100 would be female.

The hypothesized number

of females in a sample of 200 is

100

That is because we assume that an equal number will be males and an equal number

will be females

Page 13: Questions of goodness of fit

We then take a sample of 200 and find that there are actually 84.

Page 14: Questions of goodness of fit

Once again, our hypothesized number of females from a sample of 200 is 100.

Page 15: Questions of goodness of fit

Once again, our hypothesized number of females from a sample of 200 is 100.

The hypothesized number

of females in a sample of 200 is

100

Page 16: Questions of goodness of fit

But, our actual number of females from a sample of 100 is 84.

Page 17: Questions of goodness of fit

Is the difference between 100 and 84 statistically significant?

Page 18: Questions of goodness of fit

Is the difference between 100 and 84 statistically significant?

Note - Even though we are using the word difference here, in this

case we are referring to how well the data FITS the hypothesis.

Page 19: Questions of goodness of fit

Is the difference between 100 and 84 statistically significant?

The HYPOTHESIZED number

of females in a sample of 200 is

100

Page 20: Questions of goodness of fit

Is the difference between 100 and 84 statistically significant?

The ACTUAL number

of females in a sample of 200 is

84

The HYPOTHESIZED number

of females in a sample of 200 is

100

Page 21: Questions of goodness of fit

Is the difference between 100 and 84 statistically significant?

The ACTUAL number

of females in a sample of 200 is

84 16

The HYPOTHESIZED number

of females in a sample of 200 is

100

Page 22: Questions of goodness of fit

Is the difference between 100 and 84 statistically significant?

The ACTUAL number

of females in a sample of 200 is

84 16

The HYPOTHESIZED number

of females in a sample of 200 is

100

This value can

be tested

statistically to

determine its

Goodness of

Fit

Page 23: Questions of goodness of fit

If it is significantly different, then we may need to collect a new sample that is more representative of the hypothesized population.

Page 24: Questions of goodness of fit

If it is significantly different, then we may need to collect a new sample that is more representative of the hypothesized population.

Page 25: Questions of goodness of fit

Here is an equation that we will use as a guide to identify goodness of fit questions.

Page 26: Questions of goodness of fit

Here is an equation that we will use as a guide to identify goodness of fit questions.

Hypothesized Number

fit theActual

NumberDoes the ?

Page 27: Questions of goodness of fit

Examples of Goodness of Fit Tests

Page 28: Questions of goodness of fit

Example #1

Page 29: Questions of goodness of fit

Consider a standard package of milk chocolate M&Ms.

Page 30: Questions of goodness of fit

Consider a standard package of milk chocolate M&Ms. There are six different colors: red, orange, yellow, green, blue and brown.

Page 31: Questions of goodness of fit

Consider a standard package of milk chocolate M&Ms. There are six different colors: red, orange, yellow, green, blue and brown. Suppose that we are curious about the distribution of these colors and ask, do all six colors occur equally?

Page 32: Questions of goodness of fit

Consider a standard package of milk chocolate M&Ms. There are six different colors: red, orange, yellow, green, blue and brown. Suppose that we are curious about the distribution of these colors and ask, do all six colors occur equally? You collect 24 M&Ms with 4 reds, 4 oranges, 3 yellows, 5 greens, 2 blues, & 6 browns.

Page 33: Questions of goodness of fit

Consider a standard package of milk chocolate M&Ms. There are six different colors: red, orange, yellow, green, blue and brown. Suppose that we are curious about the distribution of these colors and ask, do all six colors occur equally? You collect 24 M&Ms with 4 reds, 4 oranges, 3 yellows, 5 greens, 2 blues, & 6 browns. Are these differences statistically significant?

Page 34: Questions of goodness of fit

Consider a standard package of milk chocolate M&Ms. There are six different colors: red, orange, yellow, green, blue and brown. Suppose that we are curious about the distribution of these colors and ask, do all six colors occur equally? You collect 24 M&Ms with 4 reds, 4 oranges, 3 yellows, 5 greens, 2 blues, & 6 browns. Are these differences statistically significant?

Hypothesized Number

fit theActual

NumberDoes the ?

Page 35: Questions of goodness of fit

Consider a standard package of milk chocolate M&Ms. There are six different colors: red, orange, yellow, green, blue and brown. Suppose that we are curious about the distribution of these colors and ask, do all six colors occur equally? You collect 24 M&Ms with 4 reds, 4 oranges, 3 yellows, 5 greens, 2 blues, & 6 browns. Are these differences statistically significant?

Hypothesized Number

fit theActual

NumberDoes the ?

Page 36: Questions of goodness of fit

Consider a standard package of milk chocolate M&Ms. There are six different colors: red, orange, yellow, green, blue and brown. Suppose that we are curious about the distribution of these colors and ask, do all six colors occur equally? You collect 24 M&Ms with 4 reds, 4 oranges, 3 yellows, 5 greens, 2 blues, & 6 browns. Are these differences statistically significant?

fit theActual

NumberDoes the

Hypothesized Number =

4 red, 4 orange4 yellow, 4 green4 blue, 4 brown

?

Page 37: Questions of goodness of fit

Consider a standard package of milk chocolate M&Ms. There are six different colors: red, orange, yellow, green, blue and brown. Suppose that we are curious about the distribution of these colors and ask, do all six colors occur equally? You collect 24 M&Ms with 4 reds, 4 oranges, 3 yellows, 5 greens, 2 blues, & 6 browns. Are these differences statistically significant?

fit theActual

NumberDoes the

Hypothesized Number =

4 red, 4 orange4 yellow, 4 green4 blue, 4 brown

?

Page 38: Questions of goodness of fit

Consider a standard package of milk chocolate M&Ms. There are six different colors: red, orange, yellow, green, blue and brown. Suppose that we are curious about the distribution of these colors and ask, do all six colors occur equally? You collect 24 M&Ms with 4 reds, 4 oranges, 3 yellows, 5 greens, 2 blues, & 6 browns. Are these differences statistically significant?

fit theDoes the

Hypothesized Number =

4 red, 4 orange4 yellow, 4 green4 blue, 4 brown

Actual Number =

4 red, 4 orange3 yellow, 5 green2 blue, 6 brown

?

Page 39: Questions of goodness of fit

Example #2

Page 40: Questions of goodness of fit

Absenteeism of college students from math classes is a major concern to math instructors because missing class appears to increase the drop rate.

Page 41: Questions of goodness of fit

Absenteeism of college students from math classes is a major concern to math instructors because missing class appears to increase the drop rate. Suppose that a study was done to determine if the actual student absenteeism follows faculty perception.

Page 42: Questions of goodness of fit

Absenteeism of college students from math classes is a major concern to math instructors because missing class appears to increase the drop rate. Suppose that a study was done to determine if the actual student absenteeism follows faculty perception. The faculty expected that a group of 100 students would miss class according to the following chart.

Page 43: Questions of goodness of fit

Absenteeism of college students from math classes is a major concern to math instructors because missing class appears to increase the drop rate. Suppose that a study was done to determine if the actual student absenteeism follows faculty perception. The faculty expected that a group of 100 students would miss class according to the following chart.

# of absences EXPECTED # of Students

0-2 50

3-5 30

6-8 12

9-11 6

12+ 2

Page 44: Questions of goodness of fit

Absenteeism of college students from math classes is a major concern to math instructors because missing class appears to increase the drop rate. Suppose that a study was done to determine if the actual student absenteeism follows faculty perception. The faculty expected that a group of 100 students would miss class according to the following chart. Here were actual results of a random sample.

# of absences EXPECTED # of Students

0-2 50

3-5 30

6-8 12

9-11 6

12+ 2

Page 45: Questions of goodness of fit

Absenteeism of college students from math classes is a major concern to math instructors because missing class appears to increase the drop rate. Suppose that a study was done to determine if the actual student absenteeism follows faculty perception. The faculty expected that a group of 100 students would miss class according to the following chart. Here were actual results of a random sample.

# of absences EXPECTED # of Students

0-2 50

3-5 30

6-8 12

9-11 6

12+ 2

# of absences ACTUAL # of Students

0-2 35

3-5 40

6-8 20

9-11 1

12+ 4

Page 46: Questions of goodness of fit

Absenteeism of college students from math classes is a major concern to math instructors because missing class appears to increase the drop rate. Suppose that a study was done to determine if the actual student absenteeism follows faculty perception. The faculty expected that a group of 100 students would miss class according to the following chart. Here were actual results of a random sample. Did the faculty perception fit the reality?

Hypothesized Number

fit theActual

NumberDoes the ?

Page 47: Questions of goodness of fit

Absenteeism of college students from math classes is a major concern to math instructors because missing class appears to increase the drop rate. Suppose that a study was done to determine if the actual student absenteeism follows faculty perception. The faculty expected that a group of 100 students would miss class according to the following chart. Here were actual results of a random sample. Did the faculty perception fit the reality?

Hypothesized Number

fit theActual

NumberDoes the ?

Page 48: Questions of goodness of fit

Absenteeism of college students from math classes is a major concern to math instructors because missing class appears to increase the drop rate. Suppose that a study was done to determine if the actual student absenteeism follows faculty perception. The faculty expected that a group of 100 students would miss class according to the following chart. Here were actual results of a random sample. Did the faculty perception fit the reality?

Faculty Perceptions of Student

Absenteeism

fit theActual

NumberDoes the ?

Page 49: Questions of goodness of fit

Absenteeism of college students from math classes is a major concern to math instructors because missing class appears to increase the drop rate. Suppose that a study was done to determine if the actual student absenteeism follows faculty perception. The faculty expected that a group of 100 students would miss class according to the following chart. Here were actual results of a random sample. Did the faculty perception fit the reality?

Faculty Perceptions of Student

Absenteeism

fit theActual

NumberDoes the ?

Page 50: Questions of goodness of fit

Absenteeism of college students from math classes is a major concern to math instructors because missing class appears to increase the drop rate. Suppose that a study was done to determine if the actual student absenteeism follows faculty perception. The faculty expected that a group of 100 students would miss class according to the following chart. Here were actual results of a random sample. Did the faculty perception fit the reality?

Actual Student

Absenteeism

Faculty Perceptions of Student

Absenteeism

fit theDoes the ?

Page 51: Questions of goodness of fit

An exception to the rule

Page 52: Questions of goodness of fit

An exception to the rule

As was just shown, if you are comparing an observed count with a hypothesized count, then you will use goodness of fit statistical methods.

Page 53: Questions of goodness of fit

An exception to the rule

As was just shown, if you are comparing an observed count with a hypothesized count, then you will use goodness of fit statistical methods.

Hypothesized Count = 100

Actual Count = 84

Page 54: Questions of goodness of fit

An exception to the rule

As was just shown, if you are comparing an observed count with a hypothesized count, then you will use goodness of fit statistical methods.

Hypothesized Count = 100

Actual Count = 84

Page 55: Questions of goodness of fit

An exception to the rule

However,

Page 56: Questions of goodness of fit

An exception to the rule

However, if you are comparing a hypothesized proportion (5 out of 10) or percentage (50%)

Page 57: Questions of goodness of fit

An exception to the rule

However, if you are comparing a hypothesized proportion (5 out of 10) or percentage (50%) with an actual proportion or percentage, then you will use a “Difference” method.

Page 58: Questions of goodness of fit

An exception to the rule

However, if you are comparing a hypothesized proportion (5 out of 10) or percentage (50%) with an actual proportion or percentage, then you will use a “Difference” method.

Hypothesized Percentage =

50%

Actual Percentage =

42%

Page 59: Questions of goodness of fit

An exception to the rule

However, if you are comparing a hypothesized proportion (5 out of 10) or percentage (50%) with an actual proportion or percentage, then you will use a “Difference” method.

Hypothesized Percentage =

50%

Actual Percentage =

42%

Page 60: Questions of goodness of fit

Here are the two classifications with their equations:

Page 61: Questions of goodness of fit

Question of Goodness of Fit:

Page 62: Questions of goodness of fit

Question of Goodness of Fit:

Hypothesized Number

Actual Number

fit theDoes the ?

Page 63: Questions of goodness of fit

Question of Goodness of Fit:

Question of Difference:

Hypothesized Number

Actual Number

fit theDoes the ?

Page 64: Questions of goodness of fit

Question of Goodness of Fit:

Question of Difference:

Hypothesized Number

Actual Number

Hypothesized Percentage or

Proportion

differActual

Percentage or Proportion

Does the

fit theDoes the ?

?

Page 65: Questions of goodness of fit

Let’s see an example:

Page 66: Questions of goodness of fit

You have been asked to determine if a sample is representative of the general population in terms of gender.

Page 67: Questions of goodness of fit

You have been asked to determine if a sample is representative of the general population in terms of gender. Since there are roughly equal numbers of men and women in the population, your sample of 500 should have 250 females.

Page 68: Questions of goodness of fit

You have been asked to determine if a sample is representative of the general population in terms of gender. Since there are roughly equal numbers of men and women in the population, your sample of 500 should have 250 females. However, in your sample there are 325 females.

Page 69: Questions of goodness of fit

You have been asked to determine if a sample is representative of the general population in terms of gender. Since there are roughly equal numbers of men and women in the population, your sample of 500 should have 250 females. However, in your sample there are 325 females. How well does your sample of 325 fit this hypothesized expectation statistically?

Page 70: Questions of goodness of fit

You have been asked to determine if a sample is representative of the general population in terms of gender. Since there are roughly equal numbers of men and women in the population, your sample of 500 should have 250 females. However, in your sample there are 325 females. How well does your sample of 325 fit this hypothesized expectation statistically? Since this question is dealing with number

counts, it will be classified as a Goodness of Fit Question

Page 71: Questions of goodness of fit

Now let’s see the same question but as a “difference” question.

Page 72: Questions of goodness of fit

You have been asked to determine if a sample is representative of the general population in terms of gender.

Page 73: Questions of goodness of fit

You have been asked to determine if a sample is representative of the general population in terms of gender. Since there are roughly equal numbers of men and women in the population, your sample should have 50% females.

Page 74: Questions of goodness of fit

You have been asked to determine if a sample is representative of the general population in terms of gender. Since there are roughly equal numbers of men and women in the population, your sample should have 50% females. However, in your sample there are 65% females.

Page 75: Questions of goodness of fit

You have been asked to determine if a sample is representative of the general population in terms of gender. Since there are roughly equal numbers of men and women in the population, your sample should have 50% females. However, in your sample there are 65% females. How much does your sample of 65% differ from the hypothesized expectation of 50% statistically?

Page 76: Questions of goodness of fit

You have been asked to determine if a sample is representative of the general population in terms of gender. Since there are roughly equal numbers of men and women in the population, your sample should have 50% females. However, in your sample there are 65% females. How much does your sample of 65% differ from the hypothesized expectation of 50% statistically?

Since this question is dealing with percentages or proportions, it will be

classified as a Difference Question

Page 77: Questions of goodness of fit

Examine the question or problem you are working on.

Page 78: Questions of goodness of fit

Is it a question of goodness of fit?

Page 79: Questions of goodness of fit

If so, select GOODNESS OF FIT