Top Banner
Chapter 8 Sampling Variability and Sampling Distributions
25

Chapter 8 Sampling Variability and Sampling Distributions.

Jan 04, 2016

Download

Documents

Arabella Moore
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Chapter 8 Sampling Variability and Sampling Distributions.

Chapter 8

Sampling Variability and Sampling Distributions

Page 2: Chapter 8 Sampling Variability and Sampling Distributions.

2 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Any quantity computed from values in a sample is called a statistic.

The observed value of a statistic depends on the particular sample selected from the population; typically, it varies from sample to sample. This variability is called sampling variability.

Basic Terms

Page 3: Chapter 8 Sampling Variability and Sampling Distributions.

3 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Sampling Distribution

The distribution of a statistic is called its sampling distribution.

Page 4: Chapter 8 Sampling Variability and Sampling Distributions.

4 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

ExampleConsider a population that consists of the numbers 1, 2, 3, 4 and 5 generated in a manner that the probability of each of those values is 0.2 no matter what the previous selections were. This population could be described as the outcome associated with a spinner such as given below with the distribution next to it.

x p(x)1 0.22 0.23 0.24 0.25 0.2

Page 5: Chapter 8 Sampling Variability and Sampling Distributions.

5 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

ExampleIf the sampling distribution for the means of samples of size two is analyzed, it looks like

Sample Sample

1, 1 1 3, 4 3.51, 2 1.5 3, 5 41, 3 2 4, 1 2.51, 4 2.5 4, 2 31, 5 3 4, 3 3.52, 1 1.5 4, 4 42, 2 2 4, 5 4.52, 3 2.5 5, 1 32, 4 3 5, 2 3.52, 5 3.5 5, 3 43, 1 2 5, 4 4.53, 2 2.5 5, 5 53, 3 3

frequency p(x)

1 1 0.041.5 2 0.082 3 0.12

2.5 4 0.163 5 0.20

3.5 4 0.164 3 0.12

4.5 2 0.085 1 0.04

25

Page 6: Chapter 8 Sampling Variability and Sampling Distributions.

6 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

ExampleThe original distribution and the sampling distribution of means of samples with n=2 are given below.

54321 54321

Original distribution

Sampling distribution

n = 2

Page 7: Chapter 8 Sampling Variability and Sampling Distributions.

7 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

ExampleSampling distributions for n=3 and n=4 were calculated and are illustrated below.

54321

54321Original distribution Sampling distribution n = 2

Sampling distribution n = 3 Sampling distribution n = 454321 54321

Page 8: Chapter 8 Sampling Variability and Sampling Distributions.

8 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Simulations

432Means (n=120)

432Means (n=60)

432Means (n=30)

To illustrate the general behavior of samples of fixed size n, 10000 samples each of size 30, 60 and 120 were generated from this uniform distribution and the means calculated. Probability histograms were created for each of these (simulated) sampling distributions.

Notice all three of these look to be essentially normally distributed. Further, note that the variability decreases as the sample size increases.

Page 9: Chapter 8 Sampling Variability and Sampling Distributions.

9 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Simulations

Skewed distribution

To further illustrate the general behavior of samples of fixed size n, 10000 samples each of size 4, 16 and 30 were generated from the positively skewed distribution pictured below.

Notice that these sampling distributions all all skewed, but as n increased the sampling distributions became more symmetric and eventually appeared to be almost normally distributed.

Page 10: Chapter 8 Sampling Variability and Sampling Distributions.

10 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

TerminologyLet denote the mean of the observations in a random sample of size n from a population having mean and standard deviation . Denote the mean value of the distribution by and the standard deviation of the distribution by (called the standard error of the mean), then the rules on the next two slides hold.

xx

xx

x

Page 11: Chapter 8 Sampling Variability and Sampling Distributions.

11 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Properties of the Sampling Distribution of the Sample Mean.Rule 1:

Rule 2: This rule is approximately correct as long as no more than 5% of the population is included in the sample.

Rule 3: When the population distribution is normal, the sampling distribution of is also normal for any sample size n.

x

xn

x

Rule 1:

Rule 2: This rule is approximately correct as long as no more than 5% of the population is included in the sample.

Rule 3: When the population distribution is normal, the sampling distribution of is also normal for any sample size n.

x

xn

Rule 1:

Rule 2: This rule is approximately correct as long as no more than 5% of the population is included in the sample.

Rule 3: When the population distribution is normal, the sampling distribution of is also normal for any sample size n.

x Rule 1:

Rule 2: This rule is approximately correct as long as no more than 5% of the population is included in the sample.

Rule 3: When the population distribution is normal, the sampling distribution of is also normal for any sample size n.

x

xn

x

Page 12: Chapter 8 Sampling Variability and Sampling Distributions.

12 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Central Limit Theorem.

Rule 4: When n is sufficiently large, the sampling distribution of is approximately normally distributed, even when the population distribution is not itself normal.

x

Page 13: Chapter 8 Sampling Variability and Sampling Distributions.

13 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Illustrations of Sampling Distributions

Symmetric normal like population

Populationn = 4n = 9n = 25

Page 14: Chapter 8 Sampling Variability and Sampling Distributions.

14 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Illustrations of Sampling Distributions

Skewed population

Populationn=4n=10n=30

Page 15: Chapter 8 Sampling Variability and Sampling Distributions.

15 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

More about the Central Limit Theorem.

The Central Limit Theorem can safely be applied when n exceeds 30.

If n is large or the population distribution is normal, the standardized variable

has (approximately) a standard normal (z) distribution.

X

X

x xz

n

Page 16: Chapter 8 Sampling Variability and Sampling Distributions.

16 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

ExampleA food company sells “18 ounce” boxes of cereal. Let x denote the actual amount of cereal in a box of cereal. Suppose that x is normally distributed with = 18.03 ounces and = 0.05.

a) What proportion of the boxes will contain less than 18 ounces?

18 18.03P(x 18) P z

0.05

P(z 0.60) 0.2743

Page 17: Chapter 8 Sampling Variability and Sampling Distributions.

17 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Example - continuedb) A case consists of 24 boxes of cereal.

What is the probability that the mean amount of cereal (per box in a case) is less than 18 ounces?

18 18.03P(x 18) P z

0.05 24

P(z 2.94) 0.0016

The central limit theorem states that the distribution of is normally distributed sox

Page 18: Chapter 8 Sampling Variability and Sampling Distributions.

18 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Some proportion distributions where p = 0.2

0.2

n = 10

0.2

n = 50

0.2

n = 20

0.2

n = 100

Let p be the proportion of successes in a random sample of size n from a population whose proportion of S’s (successes) is .

Page 19: Chapter 8 Sampling Variability and Sampling Distributions.

19 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Properties of the Sampling Distribution of p

Let p be the proportion of successes in a random sample of size n from a population whose proportion of S’s (successes) is . Denote the mean of p by p and the standard deviation by p. Then the following rules hold

Page 20: Chapter 8 Sampling Variability and Sampling Distributions.

20 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Properties of the Sampling Distribution of p

p Rule 1:

p

(1 )n

Rule 2:

Rule 3: When n is large and is not too near 0 or 1, the sampling distribution of p is approximately normal.

Page 21: Chapter 8 Sampling Variability and Sampling Distributions.

21 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Condition for Use

The further the value of is from 0.5, the larger n must be for a normal approximation to the sampling distribution of p to be accurate.

Rule of Thumb

If both n 10 and n(1-) 10, then it is safe to use a normal approximation.

Page 22: Chapter 8 Sampling Variability and Sampling Distributions.

22 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Example

If the true proportion of defectives produced by a certain manufacturing process is 0.08 and a sample of 400 is chosen, what is the probability that the proportion of defectives in the sample is greater than 0.10?

Since n400(0.08)10 and n(1-) = 400(0.92) = 368 > 10,

it’s reasonable to use the normal approximation.

Page 23: Chapter 8 Sampling Variability and Sampling Distributions.

23 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Example (continued)

p

p

0.08

(1 ) 0.08(1 0.08)0.013565

n 400

p

p

p 0.10 0.08z 1.47

0.013565

P(p 0.1) P(z 1.47)

1 0.9292 0.0708

Page 24: Chapter 8 Sampling Variability and Sampling Distributions.

24 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

ExampleSuppose 3% of the people contacted by phone are receptive to a certain sales pitch and buy your product. If your sales staff contacts 2000 people, what is the probability that more than 100 of the people contacted will purchase your product?

Clearly = 0.03 and p = 100/2000 = 0.05 so

0.05 0.03P(p 0.05) P z

(0.03)(0.97)2000

0.05 0.03P z P(z 5.24) 0

0.0038145

Page 25: Chapter 8 Sampling Variability and Sampling Distributions.

25 Copyright (c) 2001 Brooks/Cole, a division of Thomson Learning, Inc.

Example - continuedIf your sales staff contacts 2000 people, what is the probability that less than 50 of the people contacted will purchase your product?

Now = 0.03 and p = 50/2000 = 0.025 so

0.025 0.03P(p 0.025) P z

(0.03)(0.97)2000

0.025 0.03P z P(z 1.31) 0.0951

0.0038145