Top Banner
1 Slide Cengage Learning. All Rights Reserved. May not be scanned, copied duplicated, or posted to a publicly accessible website, in whole or in part. John Loucks St. Edward’s University . . . . . . . . . . . SLIDES . BY
26

John Loucks St . Edward’s University

Feb 25, 2016

Download

Documents

Ed Sippel

SLIDES . BY. . . . . . . . . . . . John Loucks St . Edward’s University. Chapter 14, Part B Simple Linear Regression. Using the Estimated Regression Equation for Estimation and Prediction. Computer Solution. Residual Analysis: Validating Model Assumptions. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: John Loucks St . Edward’s University

1 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

John LoucksSt. Edward’sUniversity

...........

SLIDES . BY

Page 2: John Loucks St . Edward’s University

2 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Chapter 14, Part B Simple Linear Regression

Using the Estimated Regression Equation for Estimation and Prediction

Residual Analysis: Validating Model Assumptions Residual Analysis: Outliers and Influential

Observations

Computer Solution

Page 3: John Loucks St . Edward’s University

3 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Using the Estimated Regression Equationfor Estimation and Prediction

The margin of error is larger for a prediction interval.

A prediction interval is used whenever we want to predict an individual value of y for a new observation corresponding to a given value of x.

A confidence interval is an interval estimate of the mean value of y for a given value of x.

Page 4: John Loucks St . Edward’s University

4 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Using the Estimated Regression Equationfor Estimation and Prediction

**

/ 2 ˆˆyy t s

where:confidence coefficient is 1 - andt/2 is based on a t distributionwith n - 2 degrees of freedom

*/ 2ˆ predy t s

Confidence Interval Estimate of E(y*)

Prediction Interval Estimate of y*

Page 5: John Loucks St . Edward’s University

5 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

If 3 TV ads are run prior to a sale, we expectthe mean number of cars sold to be:

Point Estimation

^y = 10 + 5(3) = 25 cars

Page 6: John Loucks St . Edward’s University

6 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

*

* 2

2ˆ( )1( )y

i

x xs sn x x

Estimate of the Standard Deviation of *̂y

Confidence Interval for E(y*)

*

2

2 2 2 2 2ˆ(3 2)12.16025 5 (1 2) (3 2) (2 2) (1 2) (3 2)ys

*ˆ1 12.16025 1.44915 4ys

Page 7: John Loucks St . Edward’s University

7 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

The 95% confidence interval estimate of the mean number of cars sold when 3 TV ads are run is:

Confidence Interval for E(y*)

25 + 4.61

**

/ 2 ˆˆyy t s

25 + 3.1824(1.4491)

20.39 to 29.61 cars

Page 8: John Loucks St . Edward’s University

8 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

* 2

2( )11 ( )pred

i

x xs sn x x

Estimate of the Standard Deviation

of an Individual Value of y*

1 12.16025 1 5 4preds

2.16025(1.20416) 2.6013preds

Prediction Interval for y*

Page 9: John Loucks St . Edward’s University

9 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

The 95% prediction interval estimate of the number of cars sold in one particular week when 3 TV ads are run is:

Prediction Interval for y*

25 + 8.2825 + 3.1824(2.6013)

*/ 2ˆ predy t s

16.72 to 33.28 cars

Page 10: John Loucks St . Edward’s University

10 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Computer Solution

Recall that the independent variable was named Ads and the dependent variable was named Cars in the example.

On the next slide we show Minitab output for the Reed Auto Sales example.

Performing the regression analysis computations without the help of a computer can be quite time consuming.

Page 11: John Loucks St . Edward’s University

11 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

The regression equation isCars = 10.0 + 5.00 Ads

Predictor Coef SE Coef T pConstant 10.000 2.366 4.23 0.024Ads 5.0000 1.080 4.63 0.019

S = 2.16025 R-sq = 87.7% R-sq(adj) = 83.6%

Analysis of Variance

SOURCE DF SS MS F pRegression 1 100 100 21.43 0.019Residual Err. 3 14 4.667Total 4 114

Predicted Values for New Observations

NewObs Fit SE Fit 95% C.I. 95% P.I.1 25 1.45 (20.39, 29.61) (16.72, 33.28)

Computer Solution

Minitab Output

EstimatedRegressionEquation

ANOVATable

IntervalEstimate

s

Page 12: John Loucks St . Edward’s University

12 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Minitab Output

Minitab prints the standard error of the estimate, s, as well as information about the goodness of fit. .

For each of the coefficients b0 and b1, the output shows its value, standard deviation, t value, and p-value.

Minitab prints the estimated regression equation as Cars = 10.0 + 5.00 Ads.

The standard ANOVA table is printed. Also provided are the 95% confidence interval estimate of the expected number of cars sold and the 95% prediction interval estimate of the number of cars sold for an individual weekend with 3 ads.

Page 13: John Loucks St . Edward’s University

13 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Residual Analysis

ˆi iy y

Much of the residual analysis is based on an examination of graphical plots.

Residual for Observation i The residuals provide the best information about e .

If the assumptions about the error term e appear questionable, the hypothesis tests about the significance of the regression relationship and the interval estimation results may not be valid.

Page 14: John Loucks St . Edward’s University

14 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Residual Plot Against x

If the assumption that the variance of e is the same for all values of x is valid, and the assumed regression model is an adequate representation of the relationship between the variables, then

The residual plot should give an overall impression of a horizontal band of points

Page 15: John Loucks St . Edward’s University

15 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

x

ˆy y

0

Good PatternRe

sidua

l

Residual Plot Against x

Page 16: John Loucks St . Edward’s University

16 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Residual Plot Against x

x

ˆy y

0

Resid

ual

Nonconstant Variance

Page 17: John Loucks St . Edward’s University

17 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Residual Plot Against x

x

ˆy y

0

Resid

ual

Model Form Not Adequate

Page 18: John Loucks St . Edward’s University

18 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Residuals

Residual Plot Against x

Observation Predicted Cars Sold Residuals

1 15 -1

2 25 -1

3 20 -2

4 15 2

5 25 2

Page 19: John Loucks St . Edward’s University

19 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Residual Plot Against x

TV Ads Residual Plot

-3

-2

-1

0

1

2

3

0 1 2 3 4TV Ads

Resi

dual

s

Page 20: John Loucks St . Edward’s University

20 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Standardized Residual for Observation i

Standardized Residuals

ˆ

ˆi i

i i

y y

y ys

ˆ 1i i iy ys s h

2

2( )1( )i

ii

x xhn x x

where:

Page 21: John Loucks St . Edward’s University

21 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Standardized Residual Plot

The standardized residual plot can provide insight about the assumption that the error term e has a normal distribution.

If this assumption is satisfied, the distribution of the standardized residuals should appear to come from a standard normal probability distribution.

Page 22: John Loucks St . Edward’s University

22 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Observation Predicted y ResidualStandardized

Residual1 15 -1 -0.53452 25 -1 -0.53453 20 -2 -1.06904 15 2 1.06905 25 2 1.0690

Standardized Residuals

Standardized Residual Plot

Page 23: John Loucks St . Edward’s University

23 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Standardized Residual Plot

Standardized Residual Plot

A B C D2829 RESIDUAL OUTPUT3031 Observation Predicted Y ResidualsStandard Residuals32 1 15 -1 -0.53452233 2 25 -1 -0.53452234 3 20 -2 -1.06904535 4 15 2 1.06904536 5 25 2 1.06904537

-1.5

-1

-0.5

0

0.5

1

1.5

0 10 20 30

Cars Sold

Stan

dard

Res

idua

ls

Page 24: John Loucks St . Edward’s University

24 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Standardized Residual Plot

All of the standardized residuals are between –1.5 and +1.5 indicating that there is no reason to question the assumption that e has a normal distribution.

Page 25: John Loucks St . Edward’s University

25 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

Outliers and Influential Observations

Detecting Outliers

• Minitab classifies an observation as an outlier if its standardized residual value is < -2 or > +2.• This standardized residual rule sometimes fails to identify an unusually large observation as being an outlier.

• This rule’s shortcoming can be circumvented by using studentized deleted residuals.• The |i th studentized deleted residual| will be larger than the |i th standardized residual|.

• An outlier is an observation that is unusual in comparison with the other data.

Page 26: John Loucks St . Edward’s University

26 Slide© 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied

or duplicated, or posted to a publicly accessible website, in whole or in part.

End of Chapter 14, Part B