Selected Text Problems with Answers from Chapters 1−4 for THE PRACTICE OF BUSINESS STATISTICS DAVID S. MOORE GEORGE P. MCCABE Purdue University WILLIAM DUCKWORTH Iowa State University STANLEY SCLOVE University of Illinois at Chicago W. H. Freeman and Company New York
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Selected Text Problems with Answersfrom Chapters 1−4 for
No part of this book may be reproduced by any mechanical, photographic, or electronicprocess, or in the form of a phonographic recording, nor may it be stored in a retrievalsystem, transmitted, or otherwise copied for public or private use, without writtenpermission from the publisher.
Printed in the United States of America
First printing 2001
Selected Text Problems with Answersfrom Chapters 1−4 for
THE PRACTICE OF BUSINESS STATISTICSTable of Contents
Chapter 1 Examining Distributions
Exercise 1.4 Occupational deaths. 1Exercise 1.45 Education and income. 2Exercise 1.53 A hot stock? 3Exercise 1.54 Initial public offerings. 5Exercise 1.69 GMAT scores. 6Exercise 1.95 Grading managers. 7
Chapter 2 Examining Relationships
Exercise 2.10 Health and wealth. 8Exercise 2.15 Business starts and failures. 10Exercise 2.18 Where the stores are. 12Exercise 2.25 Mutual fund performance. 14Exercise 2.29 CEO compensation and stock market performance. 15Exercise 2.45 Moving in step? 16Exercise 2.46 Interpreting correlation. 17Exercise 2.48 Cash or credit? 18Exercise 2.55 What's my grade? 20Exercise 2.75 Education and income. 21Exercise 2.100 Are high interest rates bad for stocks? 22Exercise 2.108 Size and selling price of houses. 24
Chapter 3 Producing Data
Exercise 3.11 Sampling by accountants. 26Exercise 3.13 Ring-no-answer. 27Exercise 3.19 Quality control sampling. 28Exercise 3.40 Does charting help investors? 29
Chapter 4 Probability and Sampling Distributions
Exercise 4.24 Stock price movements. 30Exercise 4.26 Car colors. 31Exercise 4.39 How many cars? 32Exercise 4.53 How many rooms. 33Exercise 4.61 Time and motion studies. 34Exercise 4.63 Time and motion studies. 35Exercise 4.70 Diversification. 36Exercise 4.87 Flaws in carpets. 37
Table B Random Digits Table 38
Answers 39
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 1
EXERCISE 1.4 Occupational deaths.
In 1999 there were 6023 job-related deaths in the United States. Among these were 807deaths in agricultural-related jobs (including forestry and fishing), 121 in mining, 1190in construction, 719 in manufacturing, 1006 in transportation and public utilities, 237 inwholesale trade, 507 in retail trade, 105 in finance-related jobs (including insurance andreal estate), 732 in service-related jobs, and 562 in government jobs.
(a) Find the percent of occupational deaths for each of these job categories, rounded to thenearest percent. What percent of job-related deaths were in categories not listed above?
(b) Make a well-labeled bar graph of the distribution of occupational deaths. Be sure toinclude an “other occupations” bar.
(c) Make a well-labeled Pareto chart of these data. What percent of all occupational deathsare accounted for by the first 3 categories in your Pareto chart?
(d) Would it also be correct to use a pie chart to display these data? Explain your answer.
2 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 1.45 Education and income.
Each March, the Bureau of Labor Statistics (BLS) records the incomes of all adults in asample of 50,000 American households. We are interested in how income varies with thehighest education level a person has reached. Computer software applied to the data fromthe March 2000 survey gives the following results for people aged 25 or over:
It is common to make boxplots of large data sets using the 5% and 95% points in placeof the minimum and maximum. The highest income among the 31,970 people with onlya high school education, for example, is $425,510. It is more informative to see that 95%of this group earned less than $56,294. The 5% and 95% points contain between them themiddle 90% of the observations.
(a) Use this output to make boxplots that compare the income distributions for the fiveeducation groups.
(b) Write a brief summary of the relationship between education and income. For example,do people who start college but don’t get a degree do much better than people with only ahigh school education?
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 3
EXERCISE 1.53 A hot stock?
We saw in Example 1.14 that it is usual in the study of investments to use the mean andstandard deviation to summarize and compare investment returns. On the following page,Table 1.10 gives the monthly returns on Philip Morris stock for the period from August1990 to August 2001. (The return on an investment consists of the change in its price plusany cash payments made, given here as a percent of its price at the start of each month.)
(a) Make either a histogram or a stemplot of these data. How did you decide which graphto make?
(b) There are two clear outliers. What are the values of these observations? (The mostextreme observation is explained by news of action against smoking, which depressed thistobacco company stock.) Describe the shape, center, and spread of the data after you omitthe two outliers.
(c) Find the mean monthly return and the standard deviation of the returns. If you invested$100 in this stock at the beginning of a month and got the mean return, how much wouldyou have at the end of the month?
(d) The distribution can be described as “symmetric and single-peaked, with two low out-liers.” If you invested $100 in this stock at the beginning of the worst month in the data(the most extreme outlier), how much would you have at the end of the month? Find themean and standard deviation again, this time leaving out the two low outliers. How muchdid these two observations affect the summary measures? Would leaving out these twoobservations substantially change the median? The quartiles? How do you know, withoutactual calculation? (Returns over longer periods of time, or returns on portfolios containingseveral investments, tend to follow a Normal distribution more closely than these monthlyreturns. So use of the mean and standard deviation is better justified for such data.)
4 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
Table 1.10Monthly percent returns on Phillip Morris stock
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 5
EXERCISE 1.54 Initial public offerings.
During the stockmarket boom of the 1990s, initial public offerings (IPOs) of the stock ofnew companies often produced enormous gains for people who bought the stocks when theyfirst became available. At least that’s what legend says. A study of all 4567 companies thatwent public in the years 1990 to 2000 (excluding very small IPOs) found that on the averagetheir stock prices had either risen 111% or declined 31% by the end of the year 2000.One ofthese numbers is the mean change in price and one is the median change. Which is which,and how can you tell?
6 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 1.69 GMAT scores.
Most graduate schools of business require applicants for admission to take the GraduateManagement Admission Council’s GMAT examination.Total scores on the GMAT for themore than 500,000 people who took the exam between April 1997 and March 2000 areroughly Normally distributed with mean µ = 527 and standard deviation σ = 112.
(a) What percent of test-takers have scores above 500?
(b) What GMAT scores fall in the lowest 25% of the distribution?
(c) How high a GMAT score is needed to be in the highest 5%?
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 7
EXERCISE 1.95 Grading managers.
Some companies “grade on a bell curve” to compare the performance of their managersand professional workers. This forces the use of some low performance ratings, so thatnot all workers are graded “above average.” Until the threat of lawsuits forced a change,Ford Motor Company’s “performance management process” assigned 10% A grades, 80%B grades, and 10% C grades to the company’s 18,000 managers. It isn’t clear that the“bell curve” of ratings is really a Normal distribution. Nonetheless, suppose that Ford’sperformance scores are Normally distributed. This year, managers with scores less than 25received C’s and those with scores above 475 received A’s. What are the mean and standarddeviation of the scores?
8 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 2.10 Health and wealth.
On the following page, Figure 2.7 is a scatterplot of data from the World Bank. Theindividuals are all the world’s nations for which data are available. The explanatory variableis a measure of how rich a country is, the gross domestic product (GDP) per person. GDPis the total value of the goods and services produced in a country, converted into dollars.The response variable is life expectancy at birth.We expect people in richer countries tolive longer. Describe the form, direction and shape of the overall pattern. Does the graphconfirm our expectation? The three African nations marked on the graph are outliers;a detailed study would ask what special factors explain the low life expectancy in thesecountries.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 9
Figure 2.7
o
o
o
oo
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
oo
o
o
o
o
oo
oo
o
o
o
o
o
o
o
o
o
o
oo
o
o
o
o
o
o
o
o
o
o
o
oo
o
o
o
o
o
oo
oo
o
oo
oo
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o
o o
o
o
o
o
o
o
o
o
oo
o
o
o
o
o
o
o
0 5000 10000 15000 20000 25000 30000
4050
6070
80
Gross domestic product per person
Life
exp
ecta
ncy,
yea
rs
Botswana
Gabon
Namibia
Costa Rica
USA
10 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 2.15 Business starts and failures.
On the following page, Table 2.4 gives data for number of businesses started and number ofbusinesses failed by state for 1998. We might expect an association to exist between theseeconomic measures.
(a) Make a scatterplot of business starts against business failures. Take business starts asthe explanatory variable.
(b) The plot shows a positive association between the two variables. Why do we say thatthe association is positive?
(c) Find the point for Florida in the scatterplot and circle it.
(d) There is an outlier at the upper right of the plot. Which state is this?
(e) We wonder about clusters and gaps in the data display. There is a relatively clear clusterof states at the lower left of the plot. Four states are outside this cluster. Which states arethese? Are they mainly from one part of the country?
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 11
12 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 2.18 Where the stores are.
Target and Wal-Mart are two of the largest retail chains in the United States. Target has977 stores and Wal-Mart has 2624 stores. The file ex02-18.dat has the number of Targetstores and Wal-Mart stores listed by state. (The contents of this file are displayed on thefollowing page.)
(a) Use software to create a scatterplot of the number of Target stores versus the numberof Wal-Mart stores for each of the fifty states.
(b) Identify any unusual observations in your scatterplot by labeling the point with thestate abbreviation. Describe specifically what about the observation makes it stand out inthe scatterplot.
(c) Comment on the form, direction, and strength of the relationship between the numberof Target stores and the number of Wal-Mart stores per state.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 13
14 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 2.25 Mutual fund performance.
Many mutual funds compare their performance with that of a benchmark, an index of thereturns on all securities of the kind that the fund buys. The Vanguard International GrowthFund, for example, takes as its benchmark the Morgan Stanley EAFE (Europe, Australasia,Far East) index of overseas stock market performance. Here are the percent returns for thefund and for the EAFE from 1982 (the first full year of the fund’s existence) to 2000.
Make a scatterplot suitable for predicting fund returns from EAFE returns. Is there a clearstraight-line pattern? How strong is this pattern? (Give a numerical measure.) Are thereany extreme outliers from the straight-line pattern?
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 15
EXERCISE 2.29 CEO compensation and stock market performance.
An academic study says that, “The evidence indicates that the correlation between thecompensation of corporate CEOs and the performance of their company’s stock is close tozero.” A business magazine reports this as “A new study shows that companies that paytheir CEOs highly tend to perform poorly in the stock market, and vice versa.” Explainwhy the magazine’s report is wrong. Write a statement in plain language (don’t use theword “correlation”) to explain the study’s conclusion.
16 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 2.45 Moving in step?
One reason to invest abroad is that markets in different countries don’t move in step. WhenAmerican stocks go down, foreign stocks may go up. So an investor who holds both bears lessrisk. That’s the theory. Now we read that “The correlation between changes in Americanand European share prices has risen from 0.4 in the mid-1990s to 0.8 in 2000.”Explain to aninvestor who knows no statistics why this fact reduces the protection provided by buyingEuropean stocks.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 17
EXERCISE 2.46 Interpreting correlation.
The same article mentioned in Exercise 2.45 that claims that the correlation betweenchanges in stock prices in Europe and the United States was 0.8 in 2000 goes on to saythat “Crudely, that means that movements on Wall Street can explain 80% of price move-ments in Europe.” Is this true? What is the correct percent explained if r = 0.8?
18 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 2.48 Cash or credit?
We might expect credit card purchases to differ from cash purchases at the same store.Let’s compare regressions for credit card and cash purchases using the consignment shopdata in Table 2.1 (shown on the following page).
(a) Make a scatterplot of daily gross sales y versus items sold x for credit card purchases.Using a separate plot symbol or color, add daily gross sales and items sold for cash. It issomewhat difficult to compare the two patterns by eye.
(b) Regression can help. Find the equations of the two least-squares regression lines ofgross sales on items sold, for credit card sales and for cash sales. Draw these lines on yourplot. What are the slopes of the two regression lines? Explain carefully what comparingthe slopes says about credit card purchases versus cash purchases.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 19
Table 2.1 Gross Sales and Number of Items Sold
20 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 2.55 What’s my grade?
In Professor Friedman’s economics course, the correlation between the students’ total scoresbefore the final examination and their final examination scores is r = 0.6. The pre-examtotals for all students in the course have mean 280 and standard deviation 30. The finalexam scores have mean 75 and standard deviation 8. Professor Friedman has lost Julie’sfinal exam but knows that her total before the exam was 300. He decides to predict herfinal exam score from her pre-exam total.
(a) What is the slope of the least-squares regression line of final exam scores on pre-examtotal scores in this course? What is the intercept?
(b) Use the regression line to predict Julie’s final exam score.
(c) Julie doesn’t think this method accurately predicts how well she did on the final exam.Calculate r2 and use the value you get to argue that her actual score could have been muchhigher (or much lower) than the predicted value.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 21
EXERCISE 2.75 Education and income.
There is a strong positive correlation between years of schooling completed x and lifetimeearnings y for American men. One possible reason for this association is causation: moreeducation leads to higher-paying jobs. But lurking variables may explain some of the cor-relation. Suggest some lurking variables that would explain why men with more educationearn more.
22 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 2.100 Are high interest rates bad for stocks?
The scatterplot in Figure 2.25 (shown on the following page) suggests that returns oncommon stocks may be somewhat lower in years with high interest rates. Here is part ofthe Excel output for the regression of stock returns on the bill returns for the same years:
(a) What is the equation of the least-squares line? Use this line to predict the percentreturn on stocks in a year when Treasury bills return 5%.
(b) Explain what the slope of the regression line tells us. Does the slope confirm that highinterest rates are in general bad for stocks?
(c) If you knew the return on Treasury bills for next year, do you think you could predictthe return on stocks quite accurately? Use both the scatterplot in Figure 2.25 and theregression output to justify your answer.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 23
Figure 2.25
o
o
o
o
o
o
o
o
o
o
o
o
o
o
oo
o
o
o
o
o
oo
o
o
o
o
o
o
o
o
o
oo
o
o
o
o
o
o
o
o
oo
o
o
o
o
oo
o
0 5 10 15 20
-40
-20
020
4060
Treasury bill return (percent)
Sto
ck r
etur
n (p
erce
nt)
24 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 2.108 Size and selling price of houses.
On the following page, Table 2.13 provides information on a random sample of 50 housessold in Ames, Iowa, in the year 2000.
(a) Describe the distribution of selling price with a graph and a numerical summary. Whatare the main features of this distribution?
(b) Make a scatterplot of selling price versus square feet and describe the relationshipbetween these two variables.
(c) Calculate the least-squares regression line for these data. On average, how much doeseach square foot add to the selling price of a house?
(d) What would you expect the selling price of a 1600 square foot house in Ames to be?
(e) What percentage of the variability in these fifty selling prices can be attributed todifferences in square footage?
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 25
26 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 3.11 Sampling by accountants.
Accountants use stratified samples during audits to verify a company’s records of suchthings as accounts receivable. The stratification is based on the dollar amount of the itemand often includes 100% sampling of the largest items. One company reports 5000 accountsreceivable. Of these, 100 are in amounts over $50,000; 500 are in amounts between $1000and $50,000; and the remaining 4400 are in amounts under $1000. Using these groups asstrata, you decide to verify all of the largest accounts and to sample 5% of the midsizeaccounts and 1% of the small accounts. How would you label the two strata from whichyou will sample? Use Table B, starting at line 115, to select only the first 5 accounts fromeach of these strata.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 27
EXERCISE 3.13 Ring-no-answer.
A common form of nonresponse in telephone surveys is “ring-no-answer.” That is, a call ismade to an active number, but no one answers. The Italian National Statistical Institutelooked at nonresponse to a government survey of households in Italy during two periods,January 1 to Easter and July 1 to August 31. All calls were made between 7 and 10 p.m.,but 21.4% gave “ring-no-answer” in one period versus 41.5% “ring-no-answer” in the otherperiod.Which period do you think had the higher rate of no answers? Why? Explain whya high rate of nonresponse makes sample results less reliable.
28 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 3.19 Quality control sampling.
A manufacturer of chemicals chooses 3 containers from each lot of 25 containers of a reagentto test for purity and potency. Below are the control numbers stamped on the bottles inthe current lot. Use Table B at line 111 to choose an SRS of 3 of these bottles.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 29
EXERCISE 3.40 Does charting help investors?
Some investment advisors believe that charts of past trends in the prices of securities canhelp predict future prices. Most economists disagree. In an experiment to examine theeffects of using charts, business students trade (hypothetically) a foreign currency at com-puter screens. There are 20 student subjects available, named for convenience A, B, C,. . . , T. Their goal is to make as much money as possible, and the best performances arerewarded with small prizes. The student traders have the price history of the foreign cur-rency in dollars in their computers. They may or may not also have software that highlightstrends. Describe two designs for this experiment, a completely randomized design and amatched pairs design in which each student serves as his or her own control. In both cases,carry out the randomization required by the design.
30 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 4.24 Stock price movements.
You watch the price of Cisco Systems stock for four days. Give a sample space for each ofthese random phenomena:
(a) You record the sequence of up days and down days.
(b) You record the number of up days.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 31
EXERCISE 4.26 Car colors.
Choose a new car or light truck at random and note its color. Here are the probabilities ofthe most popular colors for cars made in North America in 2000:
Color Silver White Black Dark Green Dark Blue Medium RedProbability 0.176 0.172 0.113 0.089 0.088 0.067
What is the probability that the car you choose has any color other than the six listed?What is the probability that a randomly chosen car is either silver or white?
32 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 4.39 How many cars?
Choose an American household at random and let the random variable X be the number ofcars (including SUVs and light trucks) they own. Here is the probability model if we ignorethe few households that own more than 5 cars:
Number of cars X 0 1 2 3 4 5Probability 0.09 0.36 0.35 0.13 0.05 0.02
(a) Verify that this is a legitimate discrete distribution. Display the distribution in aprobability histogram.
(b) Say in words what the event {X ≥ 1} is. Find P (X ≥ 1).
(c) Your company builds houses with two-car garages. What percent of households havemore cars than the garage can hold?
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 33
EXERCISE 4.53 How many rooms?
Furniture makers and others are interested in how many rooms housing units have, becausemore rooms can generate more sales. Here are the distributions of the number of rooms forowner-occupied units and renter-occupied units in San Jose, California:
(a) Make probability histograms of these two distributions, using the same scales. Whatare the most important differences between the distributions for owner-occupied and rentedhousing units?
(b) Find the mean number of rooms for both types of housing unit. How do the meansreflect the differences you found in (a)?
34 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
EXERCISE 4.61 Time and motion studies.
A time and motion study measures the time required for an assembly line worker to performa repetitive task. The data show that the time required to bring a part from a bin to itsposition on an automobile chassis varies from car to car with mean 11 seconds and standarddeviation 2 seconds. The time required to attach the part to the chassis varies with mean20 seconds and standard deviation 4 seconds.
(a) What is the mean time required for the entire operation of positioning and attachingthe part?
(b) If the variation in the worker’s performance is reduced by better training, the standarddeviations will decrease. Will this decrease change the mean you found in (a) if the meantimes for the two steps remain as before?
(c) The study finds that the times required for the two steps are independent. A part thattakes a long time to position, for example, does not take more or less time to attach thanother parts. How would your answer in (a) change if the two variables were dependent withcorrelation 0.8? With correlation 0.3?
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 35
EXERCISE 4.63 Time and motion studies.
Find the standard deviation of the time required for the two-step assembly operation studiedin Exercise 4.61, assuming that the study shows the two times to be independent. Redothe calculation assuming that the two times are dependent, with correlation 0.3. Can youexplain in nontechnical language why positive correlation increases the variability of thetotal time?
36 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
Portfolio analysis
Here are the means, standard deviations, and correlations for the monthly returns fromthree Fidelity mutual funds for the 36 months ending in December 2000. Because there arenow three random variables, there are three correlations. We use subscripts to show whichpair of random variables a correlation refers to.
W = monthly return on Magellan Fund µW = 1.14% σW = 4.64%X = monthly return on Real Estate Fund µX = 0.16% σX = 3.61%Y = monthly return on Japan Fund µY = 1.59% σY = 6.75%
CorrelationsρWX = 0.19 ρWY = 0.54 ρXY = −0.17
EXERCISE 4.70 Diversification.
Many advisors recommend using roughly 20% foreign stocks to diversify portfolios of U.S.stocks. Michael owns Fidelity Magellan Fund, which concentrates on stocks of large Ameri-can companies. He decides to move to a portfolio of 80% Magellan and 20% Fidelity JapanFund. Show that (based on historical data) this portfolio has both a higher mean returnand less volatility than Magellan alone. This illustrates the beneficial effects of diversifyingamong investments.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 37
EXERCISE 4.87 Flaws in carpets.
The number of flaws per square yard in a type of carpet material varies with mean 1.6 flawsper square yard and standard deviation 1.2 flaws per square yard. The population distri-bution cannot be Normal, because a count takes only whole-number values. An inspectorsamples 200 square yards of the material, records the number of flaws found in each squareyard, and calculates x, the mean number of flaws per square yard inspected. Use the centrallimit theorem to find the approximate probability that the mean number of flaws exceeds2 per square yard.
38 Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics42
(c) See plot below. 50%.
Const. Trans.&
Utils.
Ag. Svc. Manuf. Govt. Retail Whole-sale
Mines Finance Other
05
1015
20
Pareto chart of occupational deathsPe
rcen
t
(d) Yes it would because each death can be assigned to exactly one category.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 43
EXERCISE 1.45 Education and income.(a) See plot below.
Inco
me
050
000
1500
00
High school Some college Bachelor's Master's Professional
(b) Median income in March of 2000 appeared to increase with education level, although therewas more income variability (as evidenced by the IQR) in each progressively higher educationgroup. On average those with a higher level of education earn more, but there are plenty ofexceptions to this pattern. For example, the top 25% of earners with only a high school diplomaearn more than the bottom 25% with a Master's degree.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics44
EXERCISE 1.53 A hot stock?(a) The sample size is large enough that a stemplot is probably more difficult to read than ahistogram. Further, plotting these data in a stemplot requires the values to be rounded, whicheliminates one common advantage of this plot. See plot below.
-30 -20 -10 0 10 20
010
2030
40
Monthly percent return on Phillip Morris stock (8/90-8/01)
Percent return
Num
ber o
f mon
ths
(b) The outliers are -26.6 and -22.9. After the outliers have been omitted, the distribution ofmonthly returns is unimodal and approximately symmetric with only some left skew. The centerof the returns appears to be just above zero. The returns range from about -20 to about 25, withwell over half of the returns between -10 and 15.(c) The mean is 1.63, the standard deviation is 8.29. An investment of $100 would be worth$101.63.(d) $73.4. Without the outliers, the mean is 2.04 and the standard deviation is 7.65. The meanincreases by about .41 percentage points, while the standard deviation drops by 0.63 percentagepoints. Since the median and quartiles are robust or resistant to outliers, they would not changevery much if the outliers were omitted.
EXERCISE 1.54 Initial public offeringsThe median is -31% and the mean is +111%. Some very successful companies will inflate themean, but the median is more resistant to extreme values. It was not true that half of allcompanies increased their stock price more than 111% and half by less.
EXERCISE 1.69 GMAT Scores(a) 59.53%(b) 451 and below.(c) Above 711.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 45
EXERCISE 1.95 Grading managersMean is 250; standard deviation is 175.57.
EXERCISE 2.10 Health and wealthThere appears to be a positive relationship between GDP per person and life expectancy at birth.The relationship is not linear but rather curved and, judging by the scatter about the c-shape,moderately strong. The graph supports our expectation of a positive relationship.
EXERCISE 2.15 Business starts and failures(a) See plot below.
Business failures by business starts in 1998 (50 states and DC)
Business starts
Busi
ness
failu
res
0 5000 10000 15000 20000
050
0010
000
1500
0
FL
CA
NY
TX
(b) Because, on average, as the explanatory variable increases, so does the response or dependentvariable. Further, the straight line that might be used to describe these data has a positive slope.(c) See labeled point on plot above.(d) California.(e) The states are California, Florida, New York, and Texas. They are not from one region, butare the four most populous states.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics46
EXERCISE 2.18 Where the stores are.(a) See plot below.
Target stores vs. Wal-Mart stores in 50 states
Wal-Mart stores
Targ
et s
tore
s
0 50 100 150 200 250
050
100
150
TX
CA
(b) Two points stand out: California and Texas. California has a very high number of Targetstores, and a relatively high number of Wal-Marts. Texas has an extremely high number of Wal-Mart stores and more Targets than every state but California.(c) The relationship between Wal-Mart stores and Target stores is positive and approximatelylinear. The strength of the relationship is moderate to weak since the points are not tightlyclustered about a line.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 47
EXERCISE 2.25 Mutual fund performanceSee plot below. A straight line appears to describe the data adequately; the correlation isestimated to be 0.85. There are no extreme outliers, although 1983 and 1986 are relatively farfrom the line.
EAFE return (percent)
Vang
uard
retu
rn (p
erce
nt)
0 20 40 60
010
2030
4050
EXERCISE 2.29 CEO compensation and stock market performance.The magazine is incorrectly interpreting a correlation of zero to mean there is a negativerelationship between compensation and performance. We might explain the study's conclusionsby saying that knowing a company's stock performance provides almost no information aboutwhat the CEO will earn, and knowing a CEO's compensation provides little to no informationabout the company's stock performance.
EXERCISE 2.45 Moving in step?The correlation between American and European stock prices is positive and has become fairlystrong. Therefore stocks in the US and Europe are now to some degree marching in lockstep. Ashift in the prices of shares in the American market is likely to be met with a shift in the samedirection and of a similar magnitude in the European market. Therefore the European marketdoesn't counterbalance moves in the American market.
EXERCISE 2.46 Interpreting correlation.No this is not true. About 64% of the variability in one market is explained by changes in theother market.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics48
EXERCISE 2.48 Cash or credit?(a) See plot below.
Gross sales vs. items sold for credit and cash
Items sold
Gro
ss s
ales
($)
0 10 20 30 40 50
200
400
600
800 = credit
= cash
(b) Gross sales = 240.88 + 10.34(Item count credit card); Gross sales = 64.58 + 12.46(Item countcash); One additional cash purchase is expected to increase the daily gross sales by an estimated$12.46, while an additional credit card purchase is expected to increase the gross sales by anestimated $10.34.
EXERCISE 2.55 What's my grade?(a) The slope is 0.16. The intercept is 30.2. (b) 78.2(c) r2 = 0.36. Only 36% of the variation in final exam scores is explained by the students' pre-exam totals. Other factors (such as time spent preparing for the exam) also influenced final examscores. Students could have done much worse or much better than predicted based on these otherfactors.
EXERCISE 2.75 Education and income.Lurking variables might include health and socioeconomic background. Men from families withmore resources tend to be better educated, and men who are in better health will live longer andearn more.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 49
EXERCISE 2.100 Are high interest rates bad for stocks?(a) Stock return = 16.2352 - 0.5727(Tbill return). 13.37%.(b) A negative slope suggests that as treasury bill returns increase, average stock returns woulddecrease. No because we cannot conclude the true slope is different from zero and therefore wecannot conclude there is a linear relationship between T-bills and stock returns.(c) Neither the scatterplot nor the regression output provide evidence that there is a linearrelationship between these two variables. Knowing the T-bill return for next year would probablynot help us predict stock returns.
EXERCISE 2.108 Size and selling price of houses. (a) The distribution is unimodal with a median of $127,100 but it is heavily skewed right. Theselling prices ranged from $53,600 to $340,000. About one quarter of all homes sold for less than$110,500 and about one quarter for more than $169,100. See plot below.
50000 100000 150000 200000 250000 300000 350000
05
1015
20
Selling price of Ames, IA homes
Selling price ($)
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics50
(b) There appears to be a moderate to strong positive linear relationship between the squarefootage of an Ames-area home and its selling price. See plot below.
Selling price vs. square feet of Ames, IA homes
Square feet
Pric
e ($
)
500 1000 1500 2000 2500
5000
015
0000
2500
0035
0000
(c) The estimated intercept is 4786.46 and the estimated slope is 92.82. On average, an additionalsquare foot will add an estimated $92.82 to the selling price of a home.(d) $153,300.10.(e) 69.64%.
EXERCISE 3.11 Sampling by accountants.From the medium-debt accounts strata, sample accounts 417, 494, 322, 247, and 097. From thesmall-debt accounts strata, sample accounts 3698, 1452, 2605, 2480, and 3716.
EXERCISE 3.13 Ring-no-answer.Most likely the higher "ring-no-answer" rate is from the July 1 to August 31 period, sincehouseholds are more likely to be away on vacation during the summer months. A high rate ofnonresponse increases the chances that certain groups are systematically underrepresented in thesample; the sample does not represent the population of interest so much as it represents thosepeople who tend to be at home and are willing to answer the survey questions.
EXERCISE 3.19 Quality control sampling.Bottles 12, 4, and 11. (B0986, A1101, A2220.)
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 51
EXERCISE 3.40 Does charting help investors?Ten of the 20 subjects are selected at random and given computers with the software thathighlights trends. The other ten subjects are given computers without the software. After a fixedperiod of trading, compare average profits for the two groups. One possible randomization is toassign students A, B, C, D, F, G, I, M, R, and S to the computers with the trend software andstudents E, H, J, K, L, N, O, P, Q, and T to the computers without the trend software.
For the matched pairs design, ten subject are initially assigned computers with the trendhighlighting software while the other ten are given standard computers. After a fixed tradingperiod, students switch computer types and trade for the same amount of time. One possiblerandomization is to initially assign students C, D, F, H, I, J, L, M, N, and S to the computers withthe trend software and students A, B, E, G, K, O, P, Q, R, and T to the computers without it.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics52
EXERCISE 4.39 How many cars?(a) The probabilities are all between zero and one and sum to 1. See plot below.
0 1 2 3 4 5
0.0
0.1
0.2
0.3
0.4
Probability histogram for number of cars owned
Cars
Prob
abilit
y
(b) A randomly chosen household has one or more cars. The probability of this event is 0.91.(c) 0.20.
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics 53
EXERCISE 4.53 How many rooms?(a) The owner-occupied distribution is mounded with a center near 6 while the renter-occupieddistribution is more peaked with a centered near 4. See plots below and on following page.
1 2 3 4 5 6 7 8 9 10
0.0
0.1
0.2
0.3
0.4
Rooms
Prob
abilit
y
Moore/McCabe/Duckworth/Sclove: The Practice of Business Statistics54
1 2 3 4 5 6 7 8 9 10
0.0
0.1
0.2
0.3
0.4
Probability histogram of rooms in rented units
Rooms
Prob
abilit
y
(b) The mean for owner-occupied units is 6.284, the mean for renter-occupied units is 4.187. Thedifference in means is evident in the plots.
EXERCISE 4.61 Time and motion studies.(a) Mean time is 31 seconds.(b) No because a reduction in variation about the mean is not the same as a reduction in averagetime.(c) The overall mean does not depend on either standard deviation or the correlation and wouldnot change.
EXERCISE 4.63 Time and motion studies.4.47; 4.73; Longer times in the first step are associated with longer times in the second step, andshorter times in the first step are associated with shorter times in the second step. Therefore, if thefirst step takes longer than average, the second one is likely to as well, and the two steps couldend up taking much longer than average. The positive correlation causes more spread about themean. If the correlation were negative, the delays or advances would tend to cancel each otherout and there would be less spread about the mean.
EXERCISE 4.70 Diversification.The mean return of the diversified portfolio is 1.23%. The standard deviation is 4.58%.