Top Banner
Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences
23

Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

Dec 17, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

Copyright © Cengage Learning. All rights reserved.

8 Introduction to Statistical Inferences

Page 2: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

Copyright © Cengage Learning. All rights reserved.

8.1 The Nature of Estimation

Page 3: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

3

Are We Taller or Shorter Today?

Page 4: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

4

The central limit theorem gave us some very important information about the sampling distribution of sample means (SDSM).

Specifically, it stated that in many realistic cases (when the random sample is large enough) a distribution of sample means is normally or approximately normally distributed about the mean of the population.

With this information we were able to make probability statements about the likelihood of certain sample mean values occurring when samples are drawn from a population with a known mean and a known standard deviation.

Are We Taller or Shorter Today?

Page 5: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

5

We are now ready to turn this situation around to the case in which the population mean is not known.

We will draw one sample, calculate its mean value, and then make an inference about the value of the population mean based on the sample’s mean value.

The objective of inferential statistics is to use the information contained in the sample data to increase our knowledge of the sampled population.

We will learn about making two types of inferences:(1) estimating the value of a population parameter and(2) testing a hypothesis.

Are We Taller or Shorter Today?

Page 6: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

6

Are We Taller or Shorter Today?

The sampling distribution of sample means (SDSM) is the key to making these inferences as shown in Figure 8.1.

Where the Sampling Distribution Fits into the Statistical ProcessFigure 8.1

Page 7: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

7

Are We Taller or Shorter Today?

In this chapter, we deal with questions about the population mean using two methods that assume the value of the population standard deviation is a known quantity.

This assumption is seldom realized in real-life problems, but it will make our first look at the techniques of inference much simpler.

Page 8: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

8

Are We Taller or Shorter Today?

Starting with the concept of estimation, let’s consider a company that manufactures rivets for use in building aircraft.

One characteristic of extreme importance is the “shearing strength” of each rivet.

The company’s engineers must monitor production to be certain that the shearing strength of the rivets meets the required specs.

To accomplish this, they take a sample and determine the mean shearing strength of the sample.

Page 9: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

9

Are We Taller or Shorter Today?

Based on this sample information, the company canestimate the mean shearing strength for all the rivets it ismanufacturing.

A random sample of 36 rivets is selected, and each rivet is tested for shearing strength.

The resulting sample mean is = 924.23 lb. Based on this sample, we say, “We believe the mean shearing strength of all such rivets is 924.23 lb.”

Page 10: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

10

Are We Taller or Shorter Today?

Notes

1. Shearing strength is the force required to break a material in a “cutting” action. Obviously, the manufacturer is not going to test all rivets because the test destroys each rivet tested.

Therefore, samples are tested and the information about each sample must be used to make inferences about the

population of all such rivets.

Page 11: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

11

Are We Taller or Shorter Today?

2. Throughout Chapter 8 we will treat the standard deviation, , as a known, or given, quantity and concentrate on learning the procedures for making statistical inferences about the population mean, .

Therefore, to continue the explanation of statistical inferences, we will assume = 18 for the specific rivets described in our example.

Page 12: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

12

Are We Taller or Shorter Today?

Point estimate for a parameter A single number designed to estimate a quantitative parameter of a population, usually the value of the corresponding sample statistic.

That is, the sample mean, , is the point estimate (single-number value) for the mean, , of the sampled population.

Unbiased statistic A sample statistic whose sampling distribution has a mean value equal to the value of the population parameter being estimated. A statistic that is not unbiased is a biased statistic.

Page 13: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

13

Are We Taller or Shorter Today?

Figure 8.2 illustrates the concept of being unbiased and the effect of variability on the point estimate.

Effects and Variability and BiasFigure 8.2

Page 14: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

14

Are We Taller or Shorter Today?

The value A is the parameter being estimated, and the dots represent possible sample statistic values from the sampling distribution of the statistic.

If A represents the true population mean, , then the dots represent possible sample means from the sampling distribution.

Figure 8.2(a), (c), (d), and (f) show biased statistics; (a) and (d) show sampling distributions whose mean values are less than the value of the parameter, whereas (c) and (f) show sampling distributions whose mean values are greater than the parameter.

Page 15: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

15

Are We Taller or Shorter Today?

Figure 8.2(b) and (e) show sampling distributions thatappear to have a mean value equal to the value of the parameter; therefore, they are unbiased.

Effects and Variability and BiasFigure 8.2

Page 16: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

16

Are We Taller or Shorter Today?

Figure 8.2(a), (b), and (c) show more variability, whereas (d), (e), and (f) show less variability in the sampling distributions. Diagram (e) represents the best situation, an estimator that is unbiased (on-target) and has low variability (all values close to the target).

The sample mean, , is an unbiased statistic because the mean value of the sampling distribution of sample means, , is equal to the population mean, .

Page 17: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

17

Sample means vary in value and form a sampling distribution in which not all samples result in values equal to the population mean. Therefore, we should not expect this sample of 36 rivets to produce a point estimate (sample mean) that is exactly equal to the mean of the sampled population.

We should, however, expect the point estimate to be fairlyclose in value to the population mean.

Are We Taller or Shorter Today?

Page 18: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

18

Are We Taller or Shorter Today?

The sampling distribution of sample means (SDSM) and the central limit theorem (CLT) provide the information needed to describe how close the point estimate, , is expected to be to the population mean, .

The sampling distribution of sample means (SDSM) and the central limit theorem (CLT) provide the information needed to describe how close the point estimate, is expected to be to the population mean, .

Page 19: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

19

Are We Taller or Shorter Today?

Therefore, we should anticipate that 95% of all random samples selected from a population with unknown mean and standard deviation = 18 will have means between

Page 20: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

20

This suggests that 95% of all random samples of size 36 selected from the population of rivets should have a mean between – 6 and + 6.

Figure 8.3 shows the middle 95% of the distribution, the bounds of the interval covering the 95%, and the mean .

Figure 8.3

Sampling Distribution of Unknown

Are We Taller or Shorter Today?

Page 21: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

21

Are We Taller or Shorter Today?

Now let’s put all of this information together in the form of a confidence interval.

Interval estimate An interval bounded by two values and used to estimate the value of a population parameter. The values that bound this interval are statistics calculated from the sample that is being used as the basis for the estimation.

Level of confidence 1 – The portion of all interval estimates that include the parameter being estimated.

Confidence interval An interval estimate with a specified level of confidence.

Page 22: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

22

Applied Example 1 – Yellowstone Park’s Old Faithful

Welcome to the Old Faithful WebCam. Predictions for the time of the next eruption of Old Faithful are made by the rangers using a formula that takes into account the length of the previous eruption.

The formula used has proved to be accurate, plus or minus 10 minutes, 90% of the time.

At 3:05 p.m. on August 14, 2009, the posted prediction time of the next eruption was:

Next Prediction:

3:19 p.m. 10 min.

Page 23: Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.

23

Note the time at which the picture was recorded:3:25:19 p.m. Right on time!

Source: http://www.nps.gov/yell/oldfaithfulcam.htm

cont’dApplied Example 1 – Yellowstone Park’s Old Faithful