Top Banner
Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data
22

Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Jan 17, 2016

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 1

Dr. Ka-fu Wong

ECON1003Analysis of Economic Data

Page 2: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 2l

GOALS

1. List the characteristics of the Chi-square distribution.

2. Conduct a test of hypothesis comparing an observed set of frequencies to an expected set of frequencies.

3. Conduct a test of hypothesis for normality using the chi-square distribution.

4. Conduct a hypothesis test to determine whether two classification criteria are related.

Chapter FifteenNonparametric Methods: Chi-Square Nonparametric Methods: Chi-Square ApplicationsApplications

Page 3: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 3

Characteristics of the Chi-Square Distribution

The major characteristics of the chi-square distribution are: It is positively skewed. It is non-negative. It is based on degrees of freedom.When the degrees of freedom change a

new distribution is created.

Chi-square distribution is characterized by only one degree of freedom. F distribution is characterized by two degree of freedom.

Similar to F distri-bution

Page 4: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 4

df = 3

df = 5

df = 10

Page 5: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 5

Goodness-of-Fit Test: Equal Expected Frequencies

Let f0 and fe be the observed and expected frequencies respectively.

H0: There is no difference between the observed and expected frequencies.

H1: There is a difference between the observed and the expected frequencies.

Page 6: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 6

Goodness-of-fit Test: Equal Expected Frequencies

The test statistic is:

The critical value is a chi-square value with (k-1) degrees of freedom, where k is the number of categories

e

eo

f

ff 22

Page 7: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 7

EXAMPLE 1

The following information shows the number of employees absent by day of the week at a large a manufacturing plant. At the .05 level of significance, is there a difference in the absence rate by day of the week?

Day Frequency

Monday 120

Tuesday 45

Wednesday 60

Thursday 90

Friday 130

Total 445

Page 8: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 8

EXAMPLE 1 continued

Assume equal expected frequency:

(120+45+60+90+130)/5=89.

The degrees of freedom is (5-1)=4.

The critical value is 9.488. Use Appendix I in the textbook.

Page 9: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 9

Example 1 continued

Day Frequency Expected (f0-fe)2/fe

Monday 120 89 10.80

Tuesday 45 89 21.75

Wednesday 60 89 9.45

Thursday 90 89 0.01

Friday 130 89 18.89

Total 445 89 60.90

Because the computed value of chi-square is greater than the critical value (9.488), H0 is rejected.

We conclude that there is a difference in the number of workers absent by day of the week.

Page 10: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 10

EXAMPLE 2

The U.S. Bureau of the Census indicated that 63.9% of the population is married, 7.7% widowed, 6.9% divorced (and not re-married), and 21.5% single (never been married). A sample of 500 adults from the Philadelphia area showed that 310 were married, 40 widowed, 30 divorced, and 120 single. At the .05 significance level can we conclude that the Philadelphia area is different from the U.S. as a whole?

Page 11: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 11

EXAMPLE 2 continued

Status f0 fe (f0-fe)2/ fe

Married 310 319.5 .2825

Widowed 40 38.5 .0584

Divorced 30 34.5 .5870

Single 120 107.5 1.4535

Total 500 2.3814

Page 12: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 12

EXAMPLE 2 continued

Step 1: H0: The distribution has not changed

H1: The distribution has changed.

Step 2: H0 is rejected if 2 >7.815, df=3, = .05

Step 3: 2 = 2.3814 Step 4: The null hypothesis is rejected. The

distribution regarding marital status in Philadelphia is different from the rest of the United States.

Status f0 fe (f0-fe)2/ fe

Married 310 319.5 .2825

Widowed 40 38.5 .0584

Divorced 30 34.5 .5870

Single 120 107.5 1.4535

Total 500 2.3814

Page 13: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 13

Goodness-of-Fit Test for Normality

This test investigates if the observed frequencies in a frequency distribution match the theoretical normal distribution.

The procedure is to determine the mean and standard deviation of the frequency distribution. Compute the z-value for the lower class

limit and the upper class limit for each class.

Determine fe for each category Use the chi-square goodness-of-fit test to

determine if fo coincides with fe .

Page 14: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 14

EXAMPLE 3

A sample of 500 donations to the Arthritis Foundation is reported in the following frequency distribution. Is it reasonable to conclude that the distribution is normally distributed with a mean of $10 and a standard deviation of $2? Use the .05 significance level.

Amount spent F0

<$6 20

$6 up to $8 60

$8 up to $10 140

$10 up to $12 120

$12 up to $14 90

> $14 70

Total 500

Page 15: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 15

Example 3 continued

To compute fe for the first class, first determine the z-value.

Find the probability of a z-value less than –2.00

00.22

106

X

z

0228.4772.5000.0)00.2( zP

The expected frequency is the probability of a z-value less that –2.00 times the samples size.

fe = (.0228)(500) = 11.4

The other expected frequencies are computed similarly.

Page 16: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 16

EXAMPLE 3 continued

Amount spent F0 Area fe (f0-fe)2/fe

<$6 20 .02 11.40 6.49

$6 up to $8 60 .14 67.95 .93

$8 up to $10 140 .34 170.65 5.50

$10 up to $12 120 .34 170.65 15.03

$12 up to $14 90 .14 67.95 7.16

> $14 70 .02 11.40 301.22

Total 500 500 336.33

Page 17: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 17

EXAMPLE 3 continued

Step 1: H0: The observations follow the normal distribution.H1: The observations do not follow a normal distribution.

Step 2: H0 is rejected if 2 is greater than 7.815. There are 6 degrees of freedom and is .05.

Step 3: The computed value of 2 is 336.33. Step 4: H0 is rejected . The observations do not follow the

normal distribution.

Amount spent F0 Area fe (f0-fe)2/fe

<$6 20 .02 11.40 6.49

$6 up to $8 60 .14 67.95 .93

$8 up to $10 140 .34 170.65 5.50

$10 up to $12 120 .34 170.65 15.03

$12 up to $14 90 .14 67.95 7.16

> $14 70 .02 11.40 301.22

Total 500 500 336.33

Page 18: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 18

Contingency Table Analysis

A contingency table is used to investigate whether two traits or characteristics are related.

Each observation is classified according to two criteria.

We use the usual hypothesis testing procedure. The degrees of freedom is equal to: (number of

rows-1)(number of columns-1). The expected frequency is computed as:

Expected Frequency = (row total)(column total)/grand total

Page 19: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 19

EXAMPLE 4

Is there a relationship between the location of an accident and the gender of the person involved in the accident? A sample of 150 accidents reported to the police were classified by type and gender. At the .05 level of significance, can we conclude that gender and the location of the accident are related?

Sex Work Home Other Total

Male 60 20 10 90

Female 20 30 10 60

Total 80 50 20 150

Page 20: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 20

EXAMPLE 4 continued

Sex Work Home Other Total

Male 60 20 10 90

Female 20 30 10 60

Total 80 50 20 150

The expected relative frequency for work is 80/150. The expected relative frequency for male is 90/150. The expected relative frequency for the work-male

intersection under the hypothesis that there is no relationship between work and male is (90/150)(80/150).

The expected relative frequency for the work-male intersection under the hypothesis that there is no relationship between work and male is (90/150)(80/150)*150 = 48.

Similarly, we can compute the expected frequencies for the other cells.

Page 21: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 21

EXAMPLE 4 continued

Step 1: H0: Gender and location are not related.

H1: Gender and location are related.

Step 2: H0 is rejected if the computed value of 2 is greater than 5.991. There are (3- 1)(2-1) = 2 degrees of freedom.

Step 3: Find the value of 2.2=(60-48)2/48 + … + (10-8)2/8 = 16.667

Step 4: H0 is rejected. Gender and location are related.

Sex Work Home Other Total

Male 60 (48) 20 (30) 10 (12) 90

Female 20 (32) 30 (20) 10 (8) 60

Total 80 50 20 150

Expected frequency in parentheses

Page 22: Ka-fu Wong © 2003 Chap 15- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.

Ka-fu Wong © 2003 Chap 15- 22

- END -

Chapter FifteenNonparametric Methods: Chi-Square Nonparametric Methods: Chi-Square ApplicationsApplications