Unit 6: Simple Linear Regression Lecture 3: Confidence and ...tjl13/s101/slides/unit6lec3.pdf · Announcements Announcements Notes from HW: remember to check conditions and interpret
Post on 29-Aug-2020
5 Views
Preview:
Transcript
Unit 6: Simple Linear RegressionLecture 3: Confidence and prediction intervals for
SLR
Statistics 101
Thomas Leininger
June 19, 2013
Announcements
1 Announcements
2 Confidence intervals for average values
3 Prediction intervals for specific predicted values
4 Recap - CI vs. PI
Statistics 101
U6 - L3: Confidence and prediction intervals for SLR Thomas Leininger
Announcements
Announcements
Notes from HW: remember to check conditions and interpretfindings in context when doing a CI/HT.
Notes on project: link on schedule has example projects
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 2 / 17
Announcements
Visualization of the Day
−120 −110 −100 −90 −80 −70 −60
2530
3540
4550
2007 4th Max O3 Prediction
Longitude
Latit
ude
40
60
80
100
120
140
http:// stat.duke.edu/∼tjl13/ s101/ DailyO3 2007 160.gif
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 3 / 17
Confidence intervals for average values
1 Announcements
2 Confidence intervals for average values
3 Prediction intervals for specific predicted values
4 Recap - CI vs. PI
Statistics 101
U6 - L3: Confidence and prediction intervals for SLR Thomas Leininger
Confidence intervals for average values
Can we make CIs for predicting a foster twin’s IQ?
Two type of intervals available:
Confidence interval for the average foster twin’s IQ
Prediction interval for a single foster twin’s IQ
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 4 / 17
Confidence intervals for average values
Can we make CIs for predicting a foster twin’s IQ?
Two type of intervals available:Confidence interval for the average foster twin’s IQ
Prediction interval for a single foster twin’s IQ
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 4 / 17
Confidence intervals for average values
Can we make CIs for predicting a foster twin’s IQ?
Two type of intervals available:Confidence interval for the average foster twin’s IQ
Prediction interval for a single foster twin’s IQ
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 4 / 17
Confidence intervals for average values
Confidence intervals for average values
A confidence interval for E(y | x?), the average (expected) value of y fora given x?, is
y ± t?n−2 sy
√1n+
(x? − x)2
(n − 1)s2x
where sy is the standard deviation of the residuals, calculated as
sy =
√∑(yi − yi)2
n − 2.
sy is called residual standard error in R regression output.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 5 / 17
Confidence intervals for average values
Confidence intervals for average values
A confidence interval for E(y | x?), the average (expected) value of y fora given x?, is
y ± t?n−2 sy
√1n+
(x? − x)2
(n − 1)s2x
where sy is the standard deviation of the residuals, calculated as
sy =
√∑(yi − yi)2
n − 2.
sy is called residual standard error in R regression output.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 5 / 17
Confidence intervals for average values
Calculate a 95% confidence interval for the average IQ score of fostertwins whose biological twins have IQ scores of 100 points. Note thatthe average IQ score of 27 biological twins in the sample is 95.3 points,with a standard deviation is 15.74 points.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 9.20760 9.29990 0.990 0.332
bioIQ 0.90144 0.09633 9.358 1.2e-09
Residual standard error: 7.729 on 25 degrees of freedom
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
y = 9.2076 + 0.90144 × 100 ≈ 99.35df = n − 2 = 25 t? = 2.06
ME = 2.06 × 7.729 ×
√1
27+
(100 − 95.3)2
26 × 15.742
≈ 3.2CI = 99.35 ± 3.2
= (96.15, 102.55)
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 6 / 17
Confidence intervals for average values
Calculate a 95% confidence interval for the average IQ score of fostertwins whose biological twins have IQ scores of 100 points. Note thatthe average IQ score of 27 biological twins in the sample is 95.3 points,with a standard deviation is 15.74 points.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 9.20760 9.29990 0.990 0.332
bioIQ 0.90144 0.09633 9.358 1.2e-09
Residual standard error: 7.729 on 25 degrees of freedom
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
y = 9.2076 + 0.90144 × 100 ≈ 99.35
df = n − 2 = 25 t? = 2.06
ME = 2.06 × 7.729 ×
√1
27+
(100 − 95.3)2
26 × 15.742
≈ 3.2CI = 99.35 ± 3.2
= (96.15, 102.55)
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 6 / 17
Confidence intervals for average values
Calculate a 95% confidence interval for the average IQ score of fostertwins whose biological twins have IQ scores of 100 points. Note thatthe average IQ score of 27 biological twins in the sample is 95.3 points,with a standard deviation is 15.74 points.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 9.20760 9.29990 0.990 0.332
bioIQ 0.90144 0.09633 9.358 1.2e-09
Residual standard error: 7.729 on 25 degrees of freedom
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
y = 9.2076 + 0.90144 × 100 ≈ 99.35df = n − 2 = 25
t? = 2.06
ME = 2.06 × 7.729 ×
√1
27+
(100 − 95.3)2
26 × 15.742
≈ 3.2CI = 99.35 ± 3.2
= (96.15, 102.55)
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 6 / 17
Confidence intervals for average values
Calculate a 95% confidence interval for the average IQ score of fostertwins whose biological twins have IQ scores of 100 points. Note thatthe average IQ score of 27 biological twins in the sample is 95.3 points,with a standard deviation is 15.74 points.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 9.20760 9.29990 0.990 0.332
bioIQ 0.90144 0.09633 9.358 1.2e-09
Residual standard error: 7.729 on 25 degrees of freedom
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
y = 9.2076 + 0.90144 × 100 ≈ 99.35df = n − 2 = 25 t? = 2.06
ME = 2.06 × 7.729 ×
√1
27+
(100 − 95.3)2
26 × 15.742
≈ 3.2CI = 99.35 ± 3.2
= (96.15, 102.55)
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 6 / 17
Confidence intervals for average values
Calculate a 95% confidence interval for the average IQ score of fostertwins whose biological twins have IQ scores of 100 points. Note thatthe average IQ score of 27 biological twins in the sample is 95.3 points,with a standard deviation is 15.74 points.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 9.20760 9.29990 0.990 0.332
bioIQ 0.90144 0.09633 9.358 1.2e-09
Residual standard error: 7.729 on 25 degrees of freedom
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
y = 9.2076 + 0.90144 × 100 ≈ 99.35df = n − 2 = 25 t? = 2.06
ME = 2.06 × 7.729 ×
√1
27+
(100 − 95.3)2
26 × 15.742
≈ 3.2CI = 99.35 ± 3.2
= (96.15, 102.55)
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 6 / 17
Confidence intervals for average values
Calculate a 95% confidence interval for the average IQ score of fostertwins whose biological twins have IQ scores of 100 points. Note thatthe average IQ score of 27 biological twins in the sample is 95.3 points,with a standard deviation is 15.74 points.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 9.20760 9.29990 0.990 0.332
bioIQ 0.90144 0.09633 9.358 1.2e-09
Residual standard error: 7.729 on 25 degrees of freedom
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
y = 9.2076 + 0.90144 × 100 ≈ 99.35df = n − 2 = 25 t? = 2.06
ME = 2.06 × 7.729 ×
√1
27+
(100 − 95.3)2
26 × 15.742
≈ 3.2
CI = 99.35 ± 3.2= (96.15, 102.55)
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 6 / 17
Confidence intervals for average values
Calculate a 95% confidence interval for the average IQ score of fostertwins whose biological twins have IQ scores of 100 points. Note thatthe average IQ score of 27 biological twins in the sample is 95.3 points,with a standard deviation is 15.74 points.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 9.20760 9.29990 0.990 0.332
bioIQ 0.90144 0.09633 9.358 1.2e-09
Residual standard error: 7.729 on 25 degrees of freedom
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
y = 9.2076 + 0.90144 × 100 ≈ 99.35df = n − 2 = 25 t? = 2.06
ME = 2.06 × 7.729 ×
√1
27+
(100 − 95.3)2
26 × 15.742
≈ 3.2CI = 99.35 ± 3.2
= (96.15, 102.55)
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 6 / 17
Confidence intervals for average values
Calculate a 95% confidence interval for the average IQ score of fostertwins whose biological twins have IQ scores of 100 points. Note thatthe average IQ score of 27 biological twins in the sample is 95.3 points,with a standard deviation is 15.74 points.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 9.20760 9.29990 0.990 0.332
bioIQ 0.90144 0.09633 9.358 1.2e-09
Residual standard error: 7.729 on 25 degrees of freedom
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
y = 9.2076 + 0.90144 × 100 ≈ 99.35df = n − 2 = 25 t? = 2.06
ME = 2.06 × 7.729 ×
√1
27+
(100 − 95.3)2
26 × 15.742
≈ 3.2CI = 99.35 ± 3.2
= (96.15, 102.55)
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 6 / 17
Confidence intervals for average values
Question
How would you expect the width of the 95% confidence interval for theaverage IQ score of foster twins whose biological twins have IQ scoresof 130 points (x? = 130) to compare to the previous confidence interval(where x? = 100)?
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
y ± t?n−2 sy
√1n+
(x? − x)2
(n − 1)s2x
(a) wider
(b) narrower
(c) same width
(d) cannot tell
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 7 / 17
Confidence intervals for average values
Question
How would you expect the width of the 95% confidence interval for theaverage IQ score of foster twins whose biological twins have IQ scoresof 130 points (x? = 130) to compare to the previous confidence interval(where x? = 100)?
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
y ± t?n−2 sy
√1n+
(x? − x)2
(n − 1)s2x
(a) wider
(b) narrower
(c) same width
(d) cannot tell
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 7 / 17
Confidence intervals for average values
How do the confidence intervals where x? = 100 and x? = 130 com-pare in terms of their widths?
x? = 100
x? = 130
ME100 = 2.06 × 7.729 ×
√1
27+
(100 − 95.3)2
26 × 15.742 = 3.2
ME130 = 2.06 × 7.729 ×
√127+
(130 − 95.3)2
26 × 15.742 = 7.53
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 8 / 17
Confidence intervals for average values
How do the confidence intervals where x? = 100 and x? = 130 com-pare in terms of their widths?
x? = 100
x? = 130
ME100 = 2.06 × 7.729 ×
√1
27+
(100 − 95.3)2
26 × 15.742 = 3.2
ME130 = 2.06 × 7.729 ×
√127+
(130 − 95.3)2
26 × 15.742 = 7.53
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 8 / 17
Confidence intervals for average values
How do the confidence intervals where x? = 100 and x? = 130 com-pare in terms of their widths?
x? = 100
x? = 130
ME100 = 2.06 × 7.729 ×
√1
27+
(100 − 95.3)2
26 × 15.742 = 3.2
ME130 = 2.06 × 7.729 ×
√127+
(130 − 95.3)2
26 × 15.742 =
7.53
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 8 / 17
Confidence intervals for average values
How do the confidence intervals where x? = 100 and x? = 130 com-pare in terms of their widths?
x? = 100
x? = 130
ME100 = 2.06 × 7.729 ×
√1
27+
(100 − 95.3)2
26 × 15.742 = 3.2
ME130 = 2.06 × 7.729 ×
√127+
(130 − 95.3)2
26 × 15.742 = 7.53
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 8 / 17
Confidence intervals for average values
How do the confidence intervals where x? = 100 and x? = 130 com-pare in terms of their widths?
x? = 100
x? = 130
ME100 = 2.06 × 7.729 ×
√1
27+
(100 − 95.3)2
26 × 15.742 = 3.2
ME130 = 2.06 × 7.729 ×
√127+
(130 − 95.3)2
26 × 15.742 = 7.53
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 8 / 17
Confidence intervals for average values
Recap
The width of the confidence interval for E(y) increases as x? movesaway from the center.
Conceptually: We are much more certain of our predictions atthe center of the data than at the edges (and our level ofcertainty decreases even further when predicting outside therange of the data – extrapolation).Mathematically: As (x? − x)2 term increases, the margin of errorof the confidence interval increases as well.
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 9 / 17
Confidence intervals for average values
Recap
The width of the confidence interval for E(y) increases as x? movesaway from the center.
Conceptually: We are much more certain of our predictions atthe center of the data than at the edges (and our level ofcertainty decreases even further when predicting outside therange of the data – extrapolation).
Mathematically: As (x? − x)2 term increases, the margin of errorof the confidence interval increases as well.
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 9 / 17
Confidence intervals for average values
Recap
The width of the confidence interval for E(y) increases as x? movesaway from the center.
Conceptually: We are much more certain of our predictions atthe center of the data than at the edges (and our level ofcertainty decreases even further when predicting outside therange of the data – extrapolation).Mathematically: As (x? − x)2 term increases, the margin of errorof the confidence interval increases as well.
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 9 / 17
Prediction intervals for specific predicted values
1 Announcements
2 Confidence intervals for average values
3 Prediction intervals for specific predicted values
4 Recap - CI vs. PI
Statistics 101
U6 - L3: Confidence and prediction intervals for SLR Thomas Leininger
Prediction intervals for specific predicted values
Question
Earlier we learned how to calculate a confidence interval for averagey, E(y), for a given x?.
Suppose we’re not interested in the average, but instead we want topredict a future value of y for a given x?.
Would you expect there to be more uncertainty around an average ora specific predicted value?
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 10 / 17
Prediction intervals for specific predicted values
Prediction intervals for specific predicted values
A prediction interval for y for a given x? is
y ± t?n−2 sy
√1 +
1n+
(x? − x)2
(n − 1)s2x
The formula is very similar, except the variability is higher sincethere is an added 1 in the formula.
Prediction level: If we repeat the study of obtaining a regressiondata set many times, each time forming a XX% predictioninterval at x?, and wait to see what the future value of y is at x?,then roughly XX% of the prediction intervals will contain thecorresponding actual value of y.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 11 / 17
Prediction intervals for specific predicted values
Prediction intervals for specific predicted values
A prediction interval for y for a given x? is
y ± t?n−2 sy
√1 +
1n+
(x? − x)2
(n − 1)s2x
The formula is very similar, except the variability is higher sincethere is an added 1 in the formula.
Prediction level: If we repeat the study of obtaining a regressiondata set many times, each time forming a XX% predictioninterval at x?, and wait to see what the future value of y is at x?,then roughly XX% of the prediction intervals will contain thecorresponding actual value of y.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 11 / 17
Prediction intervals for specific predicted values
Prediction intervals for specific predicted values
A prediction interval for y for a given x? is
y ± t?n−2 sy
√1 +
1n+
(x? − x)2
(n − 1)s2x
The formula is very similar, except the variability is higher sincethere is an added 1 in the formula.
Prediction level: If we repeat the study of obtaining a regressiondata set many times, each time forming a XX% predictioninterval at x?, and wait to see what the future value of y is at x?,then roughly XX% of the prediction intervals will contain thecorresponding actual value of y.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 11 / 17
Prediction intervals for specific predicted values
Application exercise:Prediction intervalCalculate a 95% prediction interval for the average IQ score of foster twinswhose biological twins have IQ scores of 100 points. Note that the averageIQ score of 27 biological twins in the sample is 95.3 points, with a standarddeviation is 15.74 points.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 9.20760 9.29990 0.990 0.332
bioIQ 0.90144 0.09633 9.358 1.2e-09
Residual standard error: 7.729 on 25 degrees of freedom
We already found that y ≈ 99.35 and t?25 = 2.06.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 12 / 17
Prediction intervals for specific predicted values
Application exercise:Prediction intervalCalculate a 95% prediction interval for the average IQ score of foster twinswhose biological twins have IQ scores of 100 points. Note that the averageIQ score of 27 biological twins in the sample is 95.3 points, with a standarddeviation is 15.74 points.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 9.20760 9.29990 0.990 0.332
bioIQ 0.90144 0.09633 9.358 1.2e-09
Residual standard error: 7.729 on 25 degrees of freedom
We already found that y ≈ 99.35 and t?25 = 2.06.
y ± t?n−2 sy
√1 +
1n+
(x? − x)2
(n − 1)s2x
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 12 / 17
Prediction intervals for specific predicted values
Application exercise:Prediction intervalCalculate a 95% prediction interval for the average IQ score of foster twinswhose biological twins have IQ scores of 100 points. Note that the averageIQ score of 27 biological twins in the sample is 95.3 points, with a standarddeviation is 15.74 points.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 9.20760 9.29990 0.990 0.332
bioIQ 0.90144 0.09633 9.358 1.2e-09
Residual standard error: 7.729 on 25 degrees of freedom
We already found that y ≈ 99.35 and t?25 = 2.06.
ME = 2.06 × 7.729 ×
√1 +
127+
(100 − 95.3)2
26 × 15.742 ≈ 16.24
CI = 99.35 ± 16.24
= (83.11, 115.59)
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 12 / 17
Prediction intervals for specific predicted values
Application exercise:Prediction intervalCalculate a 95% prediction interval for the average IQ score of foster twinswhose biological twins have IQ scores of 100 points. Note that the averageIQ score of 27 biological twins in the sample is 95.3 points, with a standarddeviation is 15.74 points.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 9.20760 9.29990 0.990 0.332
bioIQ 0.90144 0.09633 9.358 1.2e-09
Residual standard error: 7.729 on 25 degrees of freedom
We already found that y ≈ 99.35 and t?25 = 2.06.
ME = 2.06 × 7.729 ×
√1 +
127+
(100 − 95.3)2
26 × 15.742 ≈ 16.24
CI = 99.35 ± 16.24
= (83.11, 115.59)
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 12 / 17
Prediction intervals for specific predicted values
Application exercise:Prediction intervalCalculate a 95% prediction interval for the average IQ score of foster twinswhose biological twins have IQ scores of 100 points. Note that the averageIQ score of 27 biological twins in the sample is 95.3 points, with a standarddeviation is 15.74 points.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 9.20760 9.29990 0.990 0.332
bioIQ 0.90144 0.09633 9.358 1.2e-09
Residual standard error: 7.729 on 25 degrees of freedom
We already found that y ≈ 99.35 and t?25 = 2.06.
ME = 2.06 × 7.729 ×
√1 +
127+
(100 − 95.3)2
26 × 15.742 ≈ 16.24
CI = 99.35 ± 16.24
= (83.11, 115.59)
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 12 / 17
Recap - CI vs. PI
1 Announcements
2 Confidence intervals for average values
3 Prediction intervals for specific predicted values
4 Recap - CI vs. PI
Statistics 101
U6 - L3: Confidence and prediction intervals for SLR Thomas Leininger
Recap - CI vs. PI
CI for E(y) vs. PI for y (1)
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
confidenceprediction
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 13 / 17
Recap - CI vs. PI
CI for E(y) vs. PI for y (2)
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●●
●
●
●
●●
●●
● ●
●
70 80 90 100 110 120 130
60
80
100
120
140
biological IQ
fost
er IQ
confidenceprediction
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 14 / 17
Recap - CI vs. PI
CI for E(y) vs. PI for y - differences
A prediction interval is similar in spirit to a confidence interval,except that
the prediction interval is designed to cover a “moving target”,the random future value of y, whilethe confidence interval is designed to cover the “fixed target”,the average (expected) value of y, E(y),
for a given x?.Although both are centered at y, the prediction interval is widerthan the confidence interval, for a given x? and confidence level.This makes sense, since
the prediction interval must take account of the tendency of y tofluctuate from its mean value, whilethe confidence interval simply needs to account for theuncertainty in estimating the mean value.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 15 / 17
Recap - CI vs. PI
CI for E(y) vs. PI for y - differences
A prediction interval is similar in spirit to a confidence interval,except that
the prediction interval is designed to cover a “moving target”,the random future value of y, while
the confidence interval is designed to cover the “fixed target”,the average (expected) value of y, E(y),
for a given x?.Although both are centered at y, the prediction interval is widerthan the confidence interval, for a given x? and confidence level.This makes sense, since
the prediction interval must take account of the tendency of y tofluctuate from its mean value, whilethe confidence interval simply needs to account for theuncertainty in estimating the mean value.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 15 / 17
Recap - CI vs. PI
CI for E(y) vs. PI for y - differences
A prediction interval is similar in spirit to a confidence interval,except that
the prediction interval is designed to cover a “moving target”,the random future value of y, whilethe confidence interval is designed to cover the “fixed target”,the average (expected) value of y, E(y),
for a given x?.
Although both are centered at y, the prediction interval is widerthan the confidence interval, for a given x? and confidence level.This makes sense, since
the prediction interval must take account of the tendency of y tofluctuate from its mean value, whilethe confidence interval simply needs to account for theuncertainty in estimating the mean value.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 15 / 17
Recap - CI vs. PI
CI for E(y) vs. PI for y - differences
A prediction interval is similar in spirit to a confidence interval,except that
the prediction interval is designed to cover a “moving target”,the random future value of y, whilethe confidence interval is designed to cover the “fixed target”,the average (expected) value of y, E(y),
for a given x?.Although both are centered at y, the prediction interval is widerthan the confidence interval, for a given x? and confidence level.This makes sense, since
the prediction interval must take account of the tendency of y tofluctuate from its mean value, whilethe confidence interval simply needs to account for theuncertainty in estimating the mean value.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 15 / 17
Recap - CI vs. PI
CI for E(y) vs. PI for y - differences
A prediction interval is similar in spirit to a confidence interval,except that
the prediction interval is designed to cover a “moving target”,the random future value of y, whilethe confidence interval is designed to cover the “fixed target”,the average (expected) value of y, E(y),
for a given x?.Although both are centered at y, the prediction interval is widerthan the confidence interval, for a given x? and confidence level.This makes sense, since
the prediction interval must take account of the tendency of y tofluctuate from its mean value, while
the confidence interval simply needs to account for theuncertainty in estimating the mean value.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 15 / 17
Recap - CI vs. PI
CI for E(y) vs. PI for y - differences
A prediction interval is similar in spirit to a confidence interval,except that
the prediction interval is designed to cover a “moving target”,the random future value of y, whilethe confidence interval is designed to cover the “fixed target”,the average (expected) value of y, E(y),
for a given x?.Although both are centered at y, the prediction interval is widerthan the confidence interval, for a given x? and confidence level.This makes sense, since
the prediction interval must take account of the tendency of y tofluctuate from its mean value, whilethe confidence interval simply needs to account for theuncertainty in estimating the mean value.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 15 / 17
Recap - CI vs. PI
CI for E(y) vs. PI for y - similarities
For a given data set, the error in estimating E(y) and y grows asx? moves away from x. Thus, the further x? is from x, the widerthe confidence and prediction intervals will be.
If any of the conditions underlying the model are violated, thenthe confidence intervals and prediction intervals may be invalidas well. This is why it’s so important to check the conditions byexamining the residuals, etc.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 16 / 17
Recap - CI vs. PI
CI for E(y) vs. PI for y - similarities
For a given data set, the error in estimating E(y) and y grows asx? moves away from x. Thus, the further x? is from x, the widerthe confidence and prediction intervals will be.
If any of the conditions underlying the model are violated, thenthe confidence intervals and prediction intervals may be invalidas well. This is why it’s so important to check the conditions byexamining the residuals, etc.
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 16 / 17
Recap - CI vs. PI
For further discussion of confidence intervals and predictionsintervals for y given a specific level of x, see the video below:
http:// www.youtube.com/ watch?feature=player embedded&v=qVCQi0KPR0s
Statistics 101 (Thomas Leininger) U6 - L3: Confidence and prediction intervals for SLR June 19, 2013 17 / 17
top related