chapter 3

,{cuepuel I?4uec Jo seJnseeru eeJql---€poru pue 'uutpeur 'IIseIu eq} lnoqe ,(lleunogut'ir--1e1 ere ,fuqi ..'enlerr luenberg lsoru,, eql Jo ..enlel elppFu,, eql Jo ..onI€A o?erotre,, u€ lnoq€qn aldoed ueqry1 '1urod le4uec € prmoJu dnor8 ol ,{cuepuel lcuqslp e

^\oqs elep Jo sles lsol tr

A)N3CN3r lVUrNff, JO $UnSV3l l L'E

'{ees

,,;,:1 sremsue eql ecr^Jes sJno sI ecroqJ eql Jo sJeluolsnc eql entS plnon soJns€eu eseql

i;s1 'selqeu€A lecrJerunu o,4q uee ueq uoll€Icoss€ erllJo qfuerls etll eJns€eu dleq qclqm-r'rlleleJJoc Jo luelclJJeoc eql pu? ecu?IJeôc eql lnoqe uJ€el osle IIP\ no 'olqslJe^ B Joil:?r{s pu€ ouorleuel tcuepuel l?4uec eql eJns€elu uec no,( sfe,l sossncsry reldeqc sq;

lseqsq fiB o:*,* ge,,,ol arfl uro{I sotq' Jo uopnqpsgp agtJo uronqd.r, , *::fft

ffi:::::ffi;fid-:, r€rluec eqr reprsuoc ol peeu no .z;?ff#j#"#:H::#"ff#'J;J:l#'Hl':ffiffi'#rEql aJoru op ol poou no,{ 'selqeuel I?cuelunu Surqucsep puu Eurzuetutuns uell\\ 'solqe

T- -E-\ I?cuerunu lnoq€ suo4senb 3ur1se eJ€ ou€uecs scllspels Sutsn eql uI sJeruolsno eqr-l-l

espunJ I€nlntu eql elsnleê re]]eq plnoc ,(eqt wql, s suorlsenb eseql ol sJe/y\sue le8 sreruolsnc eq] dleq nor( plnoc llroH

esenlen eErel pu€ Ileurs Jo roqrunu mU

-;IJS € oJor{Jere Jo'ssJel ecrn Jo'seuo e8rel A\oJ epue senlul IleulsJo }olx aJ3ql eJV esenl€n eErel .ften pu? Ilsrus ̂ (ren epnlcur ,(eql op Jo 'lelnuts,i e rrleler senle^ egl 1le erv 'urnlor Jo elar.leer(-eerq] eql ul ,QIIlqeIrsA

;-{tJo }ue}xe eq}Jo eopl ou e^Bq osl€ {tql'seuo8eler req}oJo senle^ IecI-a.rr eql o1 sereduroc enlerr lecrdhrcqtaog A\oDI ,(eql op rou'sputg {slr- ruo{ se qcns 'spung Ien}nru go froflepc relncq rcd e JoJ eq plno/K uJn}eJ

:,r, eleJ rue.(-eerql lecrd.Q e teLI/K €epr ou eABr{ f,eqt ?etnqlJ}slp eJ€ urn}er

:,r seler ree.(-eoJr{} gEg orl} A\oq A\ou4 r(eql ellrl16'ecueru.ro;red punJ

?ntnu olenlene o1 SurfJ] pe]eJlsn4 etuocoq eneq sJeruolsnc 'lenezwoll

eJrAJes sJno1 sI ecroqC er{} Jo sJoruo}snc eq} o} InJesn penord ser{

rpunJ I€nlntu B€g go eldrues oql roJ perederd nor( sueqc pue selq€} eqJ

q#ru*rhpuffi

ll ued 'srno^ sl of,lor.{l @ s)llsllvls DNlsnsernseel4l enqdrroseq lecrrorunN E1IUHJ UEIdVHC 96

i+iH$ffffil$ffi

3. 1 : Measures of Central Tendencv 97

The Mean

The arithmetic mean (typically referred to as the mean) is the most common measure of cen-tral tendency. The mean is the only common measure in which all the values play an equal role.The mean serves as a "balance point" in a set of data (like the fulcrum on a seesaw). you calcu-late the mean by adding together all the values in a data set and then dividing that sum by thenumber of values in the data set.

The symbol X, called,X-bar,isused to represent the mean of a sample. For a sample con-taining n values, the equation for the mean of a sample is written as

X=Sum of the values

Number of values

Using the seies Xr, Xr, . . . , Xnto represent the set of n values and n torepresent the number ofvalues, the equation becomes:

Xr+Xr+.. .+Xn

By using summation notation (discussed fully in Appendix B), you replace the numerator

Xr+ X2* ' ' ' * Xnby the term f *r,which means sum all theX, values from the firstXvalue,l= l

Xt, to the lastXvalue, Xn,to form Equation (3.1), a formal definition of the sample mean.

x=

SAMPLE MEAN

The sample rneaR,: ;. u of the va ues on,*ro uv *re nu;-, "r*ur:

ffii=i"n, ,. : '

..t..i ".t'.,;t..;, '.,r.1.t,,,..,.t,l.f

;,il.,t t1,'. ;,..,f ..;;:.,

','',t. ..t'..i ...ttil . .. :lt t.",: ,.

(3.1)

ff,.,:.*: r:11rl rl;::;::::1;: i,::,t,::::

where;ilir+

ffJ,tii.,i,:lii.:

.r',r:. i,:,i:,:tt,tt::,fi

ffiff#

a *u*nt.

' ' . .in the sample

$ian$!#:si:s{ifi$q*,is&fitfi:l*ii*si*u€!l*!*tifr$i:'a:S$i$:Sfe;$!*lii:}S$!S-*Sp[#_ilqf*i*S,ii#ii i

Because all the values play an equal role, a mean is greatly affected by any value that isgreatly different from the others in the data set. When you have such extreme values, youshould avoid using the mean.

The mean can suggest a typical or central value for a data set. For example, if you knew thetypical time it takes you to get ready in the morning, you might be able to better plan yourmorning andminimize any excessive lateness (or earliness) going to your destination. Supposeyou define the time to get ready as the time (rounded to the nearest minute) from when you getout of bed to when you leave your home. You collect the times shown below for 10 consecutivework days (stored in the data file@S$:

:€ueluc oseql il€ leelu seru€dluoc uaês 3uyto11o3 eql qslJ raol oq o1 pe,\Iecred ere.s,glcsfqo rllrtorE

" al?q osetu€dtuoc dec-1eurs ut Suzqerceds S" peIJISS€Ic oJ€ tr"gl spurU

ry e12col pu€ €lep puru l?n1ntll eql uos no,( 'snq; lsF A{oI qll^{ spuru esoql fpo e1e3r5e'ru1

pe,tr nof ore,roero141 '1ei1ue1od {gor3 Jo }ol 3 qlrm seruedruoc fierus ut pelseJelu '!repc4ere no1'(qafq p"n '.3nr.* llof sprng l€n1nur eqlJo IeôI >lslr eql pue '(snle'r ro qu*rorE)

eql ,(d;; eEiel pue ,dec prur 'dec 11erus) .,fto3e1ec eql ol Eulprocce peIJISs?1c ere (96

ees) ou?uecs scqsllp,ls Bursn eqt3lo ged ere l€ql(EE4E@) spury lenlruu 8€8 eqr

)sru Mol HllM scNnJ lvnrnwHLV\OUg dvf,-]]Vl S UOJ NUnIlu caznvnNNv UVSA-3:UHI NVSN 3HI

'r(cuepuel

UscJo emseeru rood e a\ou sI uebur eqt'enlerr eule4xe eqlJo esneceg 'soru4 '{peer-3uq1e8

utuo 6 ueql releer3 sI ueelu ^l.ou

eql '(serurl Jer{lo S oq} u€ql ssel pu€ seuq fpeer-8ur11e8

fo S "nqf :rlqat? 'sl 1eql) ,,eIPPIU,, oql ur s"1d 1eIP u€elu leurErro eql ol lseJluoc uI 'ssln

g'W 019'6€ uro{J 'o 0I rjrlql eroru ,(q ueelu eql pos?eJcul s?q onI?A elue4xe euo eql

OIf iV =

WV

senl€A Jo reqrunN

senl€A orll Jo urns

:slv\olloJ s? 'selnunu 9'rV of esu ol u€eur eql sosnec onI?A elue4xe

'selnuru ZSJo p€elsut selnuru Z0I q 7 feq uo onl€^ eql qclqa uI es€c 3 JeplsuoJ'sonlel e8rel ro leus

rqdecxe ,{ue ureluoc lou seop les €1ep eql esneceq es€c slql uI fcuepuel lequecJo emseelu

E sr u€oru eq; .sEuru.rour mo,{ Suuueld JoJ eIru pooS e eq plnoa\ ,{peer 1e3 01 selnullu

yroqe Eurgolle oselnurur 9.6€ enl? eql peq.(11enpe eldures eql ur ,{zP euo ou q8noql ue,rg

OI9'68 =

g6E=

OI

9E + VV +IE + 0V + vv + 6E + 7,9+ EV + 67,+ 6E

senl€A Jo roqrunN

sonl€A oID Jo rrrns1/

-A

:S.ry\olloJ se polnduloc 'selnutul 9'68 sl elull u€elu eql

6t :(selnulLu) ourll

:[eq

u

-rt17'xsu

L' t 31d tNVXl l

X

X

Xu

-T-I - r ry.A\

u

6EI€ av vv 7,9 EV 6Z

sernseew eÎldlrcsoc I€cIroIunN lIituHI u[IIdvHJ 86

3. 1 : Measures of Central Tendency 99

FundCategory Objective

Three-YearReturn

t{

R+sX

Baron GrowthColumbia Acorn ZFBR Small CapPerritt Micro Cap OpportunitiesSchroder Capital US Opportunities InvValue Line Emerging OpportunitiesWells Fargo Advtg Small Cap Opp Adm

Small CapSmall CapSmall CapSmall CapSmall CapSmall CapSmall Cap

Growth LowGrowth LowGrowth LowGrowth LowGrowth LowGrowth LowGrowth Low

20.826.024.929.922.319.022.4

Compute the mean three-year annualized return for the small-cap growth funds with low risk.

SOLUTION The mean three-year annualized return for the small-cap growth funds with lowrisk is 23.61. calculated as follows:

Sum of the values

Number of values

n

)x,,LJ '_ i-|

n

- 16s'3 - 23.6143

7

The ordere d array for the seven small- cap growth funds with low risk is:

19.0 20.8 223 22.4 24.9 26.0 29.9

Four of these returns are below the mean of 23.61. and three of them are above the mean.

The MedianThe median is the middle value in a set of data that has been ranked from smallest to largest.Half the values are smaller than or equal to the median, and half the values are larger than orequal to the median.The median is not affected by extreme values, so you can use the medianwhen extreme values are present.

To calculate the median for a set of data, you first rank the values from smallest to largestand then use Equation (3.2) to compute the rank of the value that is the median.

Mi'b Ar.r

t3#}:,.:.:.:.l;tllit;l::it,:tt: i::

You compute-the median value by following one of two rules:

, Rule 1 If there are an odd nttnrber of values in the data set, the median is the middle-rankedvalue.

; Rule 2If there ate arl even number of values in the data set, then the median isthe averageof the two middle ranked values.

To compute the median for the sample of 10 times to get ready in the morning, you rank thedailv times as follows:

'xtt.rr1 smcco senl€A eseqlJo gc?e osruceq 'selnutul W pw selnurrrr 6E 'Sepotu o $ eJe eJeql

zswwEv0v6E6E9tIg6Z

:^\oleq rl\{oqs ?lep fp€eJ-1e3-ol-eru4 eIil Jeplsuoc 'eldtuexe Jod '?tr€pJo los 3 uI sepou

eJ€ eJeql Jo epo{u ou sr eJeg} 'ueUO 'opou eql lceJ? lou op senl€A elue4xe lnelu eqlpu? uerpoul eql o{lT ,tpuenbe.g tsolu sr€edd€ l€ql?}?pJo }es e uI enIBA erl} sI epou ogJ,

aFsfftfi eq_il

'solnurur 9'6€ Jo ,(puer leE ol

q rreeru eql ol esolc ,(re,r sr selnuru 9'6€ Jo ,(peer pE 01 ourp IIsIpoIu eq1 oesec slql q 'seln

F S.6E o1 lenbe ro u€ql reteerS sr ,tpeer le3 ot eu[l eqrKfep eqlJlsrl roJ prrB 'selnutur 9'65pnbe ro u?gl ssel sl ,(peer te8 ot eurp eql 's,{ep eq} ipq roJ t€ql sueelu 9'6€ Jo Irelperu ogJ,

6t sr uerperu aql'srogereql'0t pue 69 'sen1e,r p$tlraqlxls pu€ qUIJ sIIl e8ere,re pu? Z eFU

lsnurno,( 'g1goeyftuessrqlroJ g'S:T, l0 +gI)st Z[9, |+zSurpvnp3ol lnseroqlesn€ceg

g'6t _ uBIpeIN

J

:s)ueu

6Z

:sonle^ Po)ueu

seJnseelN e^lldrJcsoc IecIJeIunN

'v'zz eôqe ro o1 I€nbe or3 surnlor$unleJ pozllenuve ree.(-ee-lql eq] JI?H 'V'27, sI uJnloJ

erllJI€r{ pu€ 'v'zz,/Koleq ro o} I€nbepez\enuue J€o^-oerql u€rpeu eql

u€rpoIN

J

v

:s)ueu

6'62 0'92 6'VZ V'ZZ E'ZT, 8'OT, 0'6I

:sonleA Polueu

fseEr?I oql ol lsolletus eql uro{ pe{u?J ere (66 e8ed ees) {slr n\ol qlu!\ spurg W \orE decletus

eqt roJ srunleJ pezquruIu? reef-eerql eqJ'enle peTl?r IFmoJ oql sI u€1peur eq1'1 epa Sursn

go eydues sFt roJ V: ZIC + ,) q Z Kgt + z Srnpnpgo llnser eW esn€ceg NO[nlOs

IsrJ 1t\ol rlllm spurg qy(oJ8 declleurs eql JoJ ILmleJ pezq"nuue reed-eeJgl uerPeruapduro3 '(qfig pue oeflete,t€ 'rvrof sprng I€nlnu eqlJo leêl {slr eql pue '(en1en ro qUnorS)

eqt '(dec e8rey pue 'dec pnu 'deq llerus) ,{roEelec eql ol Eurprocc€ pelJlssep en (96

ees) oueu6cs scllsq€ls Sursg eql;o wdere 1elil (@@ft@) spurU len1ruu 8€8 eql

:rldWVS CIZIS-GA6 NV NOUI NVlCSt l SHr gNEndNOJ z' t l ld l lvx3

vv EV 0v 6E 6E SE I€

EIUHI UEIdVHJ 00I

3.3

3.4

rlnnrnmfiiieil,-" and O?

-i5,:- 50th,il@rflEBr: ies,

E:gli-eftons

mrd 3"4) can

i' ;€,êra lly intrn,c esl'c'enti/es;

merce"ntr/e -

'llllll'imrr6 e,3 rra/ue.

3.1 : Measures of Central Tendency 1 0 I

COMPUTING THE MODE

A systems manager in charge of a company's network keeps track of the number of server fail-ures that occur in a day. Compute the mode for the follow,ing data, which represents the num-ber of server failures in a day for the past two weeks:

130326274023363

SOLUTION The ordered array for these data is

001223333346726

Because 3 appears five times, more times than any other value, the mode is 3. Thus, the systemsmar:riger can say that the most common occurrence is having three server failures in a day. For thisdata set, the median is also equal to 3, and the mean is equal to 4.5. The extreme value 26 is an out-lier. For these data,the median and the mode better measure central tendencv than the mean.

A set of data has no mode if none of the values is "most typical." Example 3.4 presents a dataset with no mode.

DATA WITH NO MODE

Compute the mode for the three-year annualized return for the small-cap growth funds([@[@@) with low risk (see page 99).

SOLUTION The ordered arrav for these data is

19.0 20.8 22.3 22.4 24.9 26.0 29.9

These data have no mode. None of the values is most typical because each value appeaxs once.

Cluartiles

Quartiles split a set of data into four equal parts-the first quartileo 01, divides the smallest25.0% of the values from the other 75.0% that are larger. The second quartile, Q2, is themedian-50 .0o/o of the values are smaller than the median and, 50.0o/o are larger. The thirdquartile, Q, divides the smallest 75.0% of the values from the largest 25.0%. Equations (3.3)and (3.4) define the first and third quartiles.l

:s)ueu

6'62 0'92 6'VT, v'zz E ZZ 8'02 0'6I

:anleA Pe)ueu

:eJB (66 aftedees) >1su lrol t{ll1v\ sprng qprorE dec-11erus

eql roJ srlmler pezq?nuu€ reed-eerql eql 'lse8r€I ol lsellslrrs urog pa{u€U NO[n]Os'{slr ,/Kol Il}lzn spunJ

dec-11eurs erp roJ Irmler pezq?nuue ree,{-eerql (80) eprenb prlq} pu€ (I@) eplrenb prg

epdruo3 '(qEH pue 's8ele,re atol) spurg len1ilu eqtrJo leêl {slr eql pue '(enlerr ro qplorfl)

aql ,(dee e8rel pue 'dec p1ur 'dec 1leurs) ,fto8e1ec eqt ol Eul.procc€ polJlssslc ere (96

eas) oueuecs scrlsF?ls Eursn eqlgo ged ere r"ql (E$EE@) sptry I€nlruu 8€8 eIIr

sSlrruvno 3Hr 9Nllndwof 9'g f ld lNVXl

'solnuru py ollenbe Jo u"q1 retreer8 sl ,(peer treB ot eur4 eql's,{ep eqllo V,SZpuu 'selnwur y7 o1 lenbe Jo II?gl ssel st fpeer pB ol eurrl eql 's,fup e\t Io oAS L uo 'snql 'se1n

tt $ enle po{u€r rlfiflIe eql 'enl€A pe>Iu€r rllq8le aq} otr ulllop sryl ptmor nod 'selqrenb

€ e1nu Sursn 'en1er' pe{u"r gz'B: nKt + 0I)€ : tlj + u)E aqt sr eypenb prlrlr eqJ,'solnurur 9E 01 Isnbe Jo

nleet| s ,(peer leE ol eruq eq1 's,fup oLlt Jo yoSL uo pIIs 'selnutul gg o1 lenbe Jo II€ql ssel

l dpeer 1e3 o1 eurrl eq1 's,(ep eql Jo yoSZ uo }€ql ueelu ol gg 3o el4renb 1s4J oIIl lerd:e1ut noa

urur Sg sI €lsp .(peer-1eE-o1-eluq oql JoJ enlel pe>Iuer p4ql eql 'enl€A po{usr pJlql eql ol

punor no,{ 't elnU Eursn 'en1e,r pe.{u€r I ;Z: t(t + Ot) : n(t + z) eql sr o1p:enb }srIJ er{I

:s)ueu

vv vv EV 0v 6E 6E 9€ IE 6Z

:sanleA Pe)ueu

:1se8re1 ol Nell€ws uro{ €lsp 8upto1erll )Tu?J 'e1ep KpeerleE-ol-orurl erp JoJ sepgenb oql Jo uollelnftuoc eql el€4snIII oJ

'onl?A pe>lu?r prrql egl

pue € 01 S;Z prmo1 'enl€ pe{uet g;Z: ilt + Ot) eqt ot lanbe st't@'ell4mnb 1srt3,0I : u ezrs eldures eql;r 'eldurexe Jod 'onl€A pe>[u"J 1€ql lcoles pue re8elul trsoJeeu eql

qnsor eril punor nod Jleq leuoqce4l e rou Jequmu eloqa\ € reqtleu sI lpser er1pt JI t aPV'enl€ pa>Tlrer pJlql eql pu€ enl€ ps{u€r

s eqt ueeA$eq ,ftirg1eq 'en1€,t pe>Iu?r 9'Z : VIO + 6) eql o1 lenbe st 'I@ 'eppenbg eql'O : u ezrs eydrues eql;l'eldurexe Jod 'sonl€A po{ueJ Eutpuodserroc eq13o efie

:sA? eql o1 lenbe sl epgenb eql ueql '('clc'9'v '9'z) JVq l€uo4c€.u 3 sI llnser ew JI z apv'onl?A peTreJ

f,Eoces : nKt + 4) eqt ot lenbe sr 'rQ 'ey4;ranb lsrrJ eIIl o 2: u ezrs eldures sqlgr 'eldurexe

.q{ 'enl? pe{u€r 1egl ol lenbe st eygenb egl ueq} tequmu elogl\ € sI llnser eqt 5 7 a1iry

:selruenb oql elslncl€c ol selru EutznolloJ or{} esn

6'8 19S

sornseolN e^rldrrcsec IecrrorunN EiruHJ uEIdvHJ Z0l

3. 1 : Measures of Central Tendencv I 03

For these data

^ (n+l)()r-- ' rankedvalue

4

7 +1- i---ranked value = 2nd ranked value

4

Therefore, using Rule l, Qris the second ranked value. Because the second ranked value is20.8, the first quartile , Qp is 20.8.

To find the third quariile, Qr:

1/r + l \O, - -''" -' ranked value

43(7 + l)= --)---------11s11ked value = 6th ranked value

4

Therefore, using Rule l, Qris the sixth ranked value. Because the sixth ranked value is 26.0,Q3is26.

The first quartile of 20.8 indicates that25Yo of the returns are below or equal to 20.8 and75o/o are greater than or equal to 20.8. The third quartile of 26.0 indicates thatT5Yo of thereturns are below or equal to 26.0 and25o/o are greater than or equal to 26.0.

The Geometric MsanThe geometric mean measures the rate of change of a variable over time. Equation (3.5)

.defines the geometric mean.

The geometric mean rate of return measures the average percentage return of an invest-ment over time. Equation (3.6) defines the geometric mean rate of return.

To illustrate these measures, consider an investment of $100.000 that declined to a value of$50,000 at the end ofYear 1 and then rebounded back to its orisinal $100.000 value at the end

YoEZ'II sr sJ?ef o1lrl eql roJ xepq 0002 ilessnu eql uI umlerJo el€r uselu cl4eluoeS eql

EZI I '0 = I - ET,I I ' I =

I - ,^ltr LtZ'Il =

/ I - r,rlGgvo' t) x (sEsI'I) l =

r - ,,rl(Gsto'o) + t) x ((sE8I'o) + t)l -

I -, *l(zv + t) x (Iu + I)l = "Y.

sr sJ?eA o u eqlJoJ xepul

flessnU eq] ur umterJo sler u€elu rlrleruoa8 eql'(9'g) uotlenbg Sutsn NO;1nlOS

'urn{er;o etu cuteruos8 eql elnduroC 'SOOdut o ggt+pue V00Z q %€€'8I+ s€A\oc geuls 000'ZJo secud 1co1s eqlJo xepq 0002 lessn1 eql ut e8ueqc e3eluecred eql

NUnISU lO SrVU NV3ht f,tuJ.3l lol9 3HI DN[ndWO) 9'€ 31d1AlVX3

'u?oru c4eruqlu? oql seop ueqt pouad ree,{-orrr1 eq} JoJ luerqselur eqlJosql ur e8ueqc (orez) erp sltegeJ flelemcce eJolu umleJJo elal ueelu culeuroeE eql'snq;

0=I- I=

I - z^lo' Il :

r - ,,rl(o'd x (os'o)l =

I - .,,[((o'I) + I) x ((os'o-) + t)] =

I - u5l(7v + I) x (Iu + I)l =

sr sreer( ong eqlroJ urnleJJo eleJ ueolu clrleruoe8 eql'(9'€) uollenbg Eursp

0v

%oor Io 'oo'I = [

000'09

) : zv

s! T, J€eI roJ urnler Jo eler eql pu€

000'0s - 000'00I

%09 - JO '09'0 - = 000'00I00r - 000000'

_)'og )

J€oI

=Iu

S}I roJ urqer Jo aler eql esn€ceq

o gzro ,gz '0 _ (oo'r) + (os'o-)L.

-L

sr luerulseûl slqlJo urnpJJo setr€JeqlJo rreeru crpurqllJs eq1 te,remo11 'pe3ueqcrm sI lueuqselul eqtJo enye,r Surpue pues eril esn€ceq g sr pousd reaf-ortl eIIl JoJ lueu4selul slql JoJ ILmFJJo eleJ oqJ 'Z ree1Jo

serns€o141 en4drrcsoq lecrrerunN gf,UHJ UEIdVHJ V1l

3.2: Yaiationand Shaoe 105

3.2 VARIATION AND SHAPEIn addition to central tendency, every data set can be characterizedby its variation and shape.Variation measures the spread, or dispersion, of values in a data set. One simple measure ofvariation is the range, the difference between the largest and smallest values. More commonlyused in statistics are the standard deviation and variance, two measures explained later in thissection. The shape of a data set represents a pattern ofall the values, from the lowest to highestvalue. As you will learn later in this section, many data sets have apaltemthat looks approxi-mately like a bell, with a peak of values somewhere in the middle.

The RangeThe range is the simplest numerical descriptive measure of variation in a set of data.

fiThe rtnge is equal to,the largest value,minus,the

Range = Xlurg"rt

smallest va1ue.' , , '

'Xi**llest(3.7)

3.7

To determine the range of the times to get ready in the morning, you rank the data from small-est to largest:

29 3L 35 39 39 40 43 44 44 s2

Using Equation (3.7), the range is 52 - 29 : 23 minutes. The range of 23 minutes indicatesthat the largest difference between any two days in the time to get ready in the morning is 23minutes.

COMPUTING THE RANGE IN THE THREE-YEAR ANNUALIZED RETURNSFOR SMALL.CAP GROWTH MUTUAL FUNDS WITH LOW RISK

The 838 mutual funds (E!!@@ that are part of the Using Statistics scenario (see page96) are classified according to the category (small cap, mid cap, and large cap), the type(growth or value), and the risk level of the mutual funds (low, average, and high). Compute therange of the three-year arcnalized returns for the small-cap growth funds with low risk (seepage 99).

SOLUTION Ranked from smallest to largest, the three-year annualized returns for the sevensmall-cap growth funds with low risk are

19.0 20.8 22.3 22.4 24.9 26.0 29.9

Therefore, using Equation (3.7),the range - 29.9 - 19.0 - 10.9.The largest difference between any two returns is 10.9.

The range measures the total spread inthe set of data. Although the range is a simple mea-sure ofthe total variation in the data, it does not take into account how the data are distributedbetween the smallest and largest values. In other words, the range does not indicate whether thevalues are evenly distributed throughout the data set, clustered near the middle, or clusterednear one or both extremes. Thus, using the range as a measure of variation when at least onevalue is an extreme value is misleadine.

i{s

iJ

T ,rloleq o$ql4slp senIBA JeII?us ^\oq

pu€ 1l eôq" elentrcng senl€A re8rel al,oq-ueeru eqlJa$BOs ,,e?etete,, eql ems"our scllsqels eseql 'uopcllop pJspusls eql pu? oJUUIJBA

pe$qr4srp ere z]P-p eql uI senle^ eql il? ^rog

ltmocc? olul o{3} leq} uoIlBu€AJo semseelu

{uouruoc oA[ 'serue4xe eql ueoaleq Je]snlc Jo olnql4slp sonl€A eql Moq uol]BJeplsuocte:pl tou op,{eqt ouorlerre,rgo sems?etu eJe eAu€J eygenbrelul eql pu? eEuer eql q8noqlly

usf&egAeffi p"!ep$rs&s sL{& puB ssrrslrffiA &q&

ru luslslsa.r pell"c oJB 'sen1% erue4xe ,(q pecuengur eq louuec gcrqrrr 'e8uer elqrenbrelul

'Ea'lo 'uerpeul eqtr s€ qcns soms?elrr d.reurumS'-senlsl eluo4xe fq pepege eq ]oullsc 1rre3rel ro I@ ueqt relleus en1e,r,(ue rsplsuoc lou soop eEuer eplrenbrelul aql asneceg

]r ol s€ r€Arer* es'sernunu 6 sr.(peer leE ol eu'r "*f!{r#:;ff";'#i'T::jffJ:$;selnuru 6 : gE - W : e8uer eplrenbrelul

:W :EO pu€ S€ :rA'Z0I a8ed uo stlnser rsllrse er{r pue (g'g) uoaenbg esn no,(

z9 vv vv Ev 0v 6E 6E s€ IE 6Z

,{peer 1eB ol sorml eq};o o8uer epgenbrelul eq} eulluJe}ep oI 'sonl€ eluo4xo ,(q pecuengutsr 1r 'ero;ereq L'rtrr4p eql Jo %0S e1ppFu eql ut peerds eql sems€etu e8uer epgenbrelq eql

'?lepJoleseur sapLnnbpue W!.fl oqt uee $eq ocuereJllp eql sr (peerdsppr pellsc osle) eiuu.r eglrenbralul eql

e6ueg allpenbratul aql

'z's $ umFJ pezrlsrurus ree,{-eerql eql ur sSuer elgenbrelul eql'eroJereql

Z'g : g'02 - 0'gZ: o8uer epgenbrelul

:0'97,:tO p* g'02 =rO 'gg1 e8ed uo stlnser rellree eqt pu€ (3'g) uoqenbg Sursn

6'62 0'92 6'nZ n'ZZ t'ZZ 8'02 0'6I

er€ {su {\oI qll^{ spuq q4or8 dec-11euseql rqr'sumler pezq"ilIlls ree,{-eerql eql '1se3re1 o} }sell"Ius uro{ pe>Iu€u No[n-los

s8ed ees) 4su x\ol WI1!\ spurg q1mo.fideclerus eIR JoJ sILmleJ pezqunrlrlr- ne,{-eerq1 eql;o o8uanur eql elnduroc '(qatq pve's?ente mof spwg pnlruu er0Jo Ie eI >Islr ew pue'(en1en

qmor8) ed,(l ern'(dec eSrel pue'dec p1ul'd"c leus) d;o8etec egl ol Srnprocce peIJIs$Ic ergaEed ess) ousuots scpspsls Sqsn eqlgo ged sre 1eql ($$@[@) spurU Ien$ur 8€8 eql

)slu Mol HilAsoNnj tvnJ.nru Hlv\ouD dvf,-]lvl ls uol sNunrSu cSznvnNNV

UV3A.]3UI{I3HI UOI 3DNVU 3]IIUVNOUSINI 3HI DNUNdWOf, 8' t 31d l lVX3

sernseelN e^rldlrcsec lecrrerunN aiIuHJ uEIdvHJ 90I

3.2: Variation and Shape 107

A simple measure of variation around the mean might take the difference between eachvalue and the mean and then sum these difflerences. However, if you did that, you would findthat because the mean is the balance point in a set of data, for every set of data, these differ-ences would sum to zero. One measure of variation that differs from data set to data set squaresthe difference between each value and the mean and then sums these squared differences. In,statistics, this quantity is called a sum of squares (or .S^S). This sum is then divided by the num-ber of values minus I (for sample data) to get the sample variance (S2;. fne square root of th6sample variance is the sample standard deviation (^9).

Because the sum of squares is a sum of squared differences that by the rules of arithmetic,will always be nonnegative, neither the variance nor the standard deviation cqn ever be nega-tive. For virtually all sets of data, the variance and standard deviation will be a positive value,although both of these statistics will be zero if there is no variation at all in a set of data andeach value in the sample is the same.

For a-sample containing n values, xp x2, X3, . . . , Xn, rhe sample variance (given by thesymbol 52) is

q2L)

(x, - X)2 + (x, - X)2 + ... + (x, - X)2

Equation (3.9) expresses the sample variance;-:rn-ation notation, and Equation (3.10)expresses the sample standard deviation.

. 'u,til',

,,'(X'-',

If the denominator were n instead of n - l, Equation (3.9) [and the inner term in Equation(3.10)] would calculate the average of the squared differences around the mean. However, n - 1is used because of certain desirable mathematical properties possessed by the statistic 52 that

i i!,,itiit i,

:i:rt:::,:i:l;:::;l

z8'91 :

6v'zw

I - OI

,G'68 - sE) + " ' + r@'68 - ad+ rG'68-ot)I - " (1

ZD

, lX -

; ) uorlenbE uI sluJel eql JoJ senlen Supnlpsqns f,q ecu€IJen e{} elelncl€c osls u€c no^

Z8'SV 0v'T,rv

I=!

,flK

:(t - u) f,qap1,r1q:V dalg

grrc9t '6r96' tL9 t '09E'619E'0g L't;r99' t IgE T, l l9€'0

:Iuns: g da4g

09'v-0v'v09'8-07'00v'v09'0-OV'T,I0v't09'0I-09'0-

9€vvIE0vvv6EZ9w6Z6E

,(x - !x)" ida$

&-!x)71 dwS

(X)OIuII

g'6t = X

soLulI{peeg-6u!}leD eL{} io

of, uerren oL1} 6ultnduo3

'(7 dels) eou€Irel eql elnduroc ol 6 : I - Ot fq popl^lp uoql sI Islol, -I

.1.€ eIq"JJo ruo'oq rqiin *oqt sr (g dels) secuoreglp perenbs eqlJo Iuns eq1'7 dels

rqs I'€ elqelJo Ilumloc prlql eqJ'1 dels s1v\oqs I'€ elqelJo Irunloc puoces eql ('ueeur eqt

rt-rrlulncl€c oqt roJ 96 e8ad ""S)

:q'Og o1 lenbe (; ) ueeul e qlyv\ etr€p serurl-fpeer-3urge? eq1

r _ lor|€rAep pJ€pu€ls pug ecu€u€A eql 3ur1elnclec ro3 sdels moJ lsrlJ eql s^roqs I'€ eIq€J

.uoq€rlep pr€puels eldrues eql le8 ol ecueu€A eldrues eql Jo looJ erenbs eql e{?I 'S dars

'ecIIsuBA eldures eql te8 ot I - ufq lelol sql eplÎq 't dars

'secuerelllp perenbs eql ppv 't dets

'eouoreJ;Ip qcee erenbg 'Z dels

'u€eru oql pue enIBA gc€e ueenqoq ocuoJoJllp eql etrndruo3 'I dels

:g ouorlerrrep pJ?puels aldrues eql pu? o 75

oectJerlBtr eldures oql olslncl€c-pu?g oI.Buuelsnlc

"rn ,".r1nn e1gp eqlJo ,{fuohul oq} N€el }e eroq^\ eulJop sdleq flensn uoq

-, ...rp pJspuels eql pug u€eru eqlJo e?pel.norq'ero;ereq1 'useru eql '4Aoleq pu€ eôq? uollelêp

-.-:lrrgls euo snutru pue snydgo I€AJolut ue uq1ril\ eII senlel pelJosqo eq1;o '{luofeur eqt'e}ep3o

: r.s llu lsorul€ Jod .u€etu slr prmoJ€ selnqr4srp Jo sJetrsnlc €lepJo les € A\oq ^\orq

o1 nof sdleq uo4

-: r:p pJ"pue$ eql'etep eldures 1eu6uo eql S3 strm elu€s eql uI sI l€ql Jequmu e sferrqe s] uoF

-: .,ap pr€puerr rw &tr*r,6 perenbs "

sr qorq^\'ecu?u€A eydures eql e{llun 'l(ot'E) uoqenbg ur

'';:rJap] uoq"rJ?AJo ernseeur rno^( se uollsllep pJepu€}s eldrues eql esn '{1e111 poru [Ina no,\

,:.,eurs pue Jell€rus seruoceq r - u f,qpue u tlqSqpp,yp uoe qeq eoueJeJJlp eql'seseercur ezrs

, :;ues .ql tV-(f reideq3 uI pessnoslp sr qcq,t) ecuoJeJul l€cqsll€ls ro; eleudordde il eleur

L 'g ! l18 vr

serns€e141 errrldlrssec leslrerunN EEUHI UEIdVHJ 80I

3.2: Variation and, Shape I 09

Because the variance is in squared units (in squared minutes, for these data), to compute

the standard deviation, you take the square root ofthe variance. Using Equation (3.10) on page

l07,the sample standard deviation, S, is

= 6.77

This indicates that the getting-ready times in this sample arerclustering within 6.77 minutes

around the mean of 39.6 minutes (i.e., clustering between X - lS:32.83 and X + 1S:

46.37).ln fact, 7 out of l0 getting-ready times lie within this interval.Using the second column of Table 3.1, you can also calculate the sum of the differences

between each value and the mean to be zerc. For any set of data, this sum will always be zero:

- X) = 0 for all sets of data

This property is one of the reasons that the mean is used as the most cofilmon measure of cen-

ffal tendency.

COMPUTING THE VARIANCE AND STANDARD DEVIATION OF THETHREE-YEAR ANNUALIZED RETURNS FOR SMALL-CAP GROWTH MUTUAL

FUNDS WITH LOW RISK

The 838 mutual fu"ds (E@tf,@El@ that are part of the Using Statistics scenario (see page

96) are classified according to the category (small cap, mid cap, and large cap), the type

(gro6h or value), and the risk level of the mutual funds (low, average, and high). Compute the

variance and standard deviation of the three-year annualized returns for the small-cap growth

funds with low risk (see page 99).

SOLUTION Table 3.2 illustrates the computation of the variance and standard deviation for

the three-year annualizedreturns for the small-cap growth funds with low risk.

X - 23.6143

,S =^tr-

nsr

L/ l ,a

^L\t ' ii :1

u, E 3.9

3"2

_::* = Thrree--: o Retu rn s

Three-\'earAnnualized

Return

Sten 2:(xi : x)2

Step(Xi -

I:Xt

.{r;1f1T;.;' 2 n.Jf ve t i . /

r[ii1{|ir*1r- 3 FundS 7.92025.69161.653 I

39.5102r.7273

2r.2916r.47 45

Step 4:Divide by (tr - 1):

20.826024.929.922.319.022.4

-2.81432.3857r,285i6.2857

--1.3143-4.6143.-I.2143

Step 3:Sum:

45.82

79.2686 T3.2TT4

ueeru eldules : Xuor]Brlop pJ€puets eldulus - S

or0qA\

['f] ;;;{f)= t13

usou eql dq Tp':rp uorl' *"l plBpu?ls n# - ffitffiffidJ*f, ##

'ueeru eql ol e^rlelo t elep eql ur JolJqr semseoru '13 pquf,s eqt ,{q pelouep ouorleue,rgo luercrJJooc oqJ '"1€p reyncrged eqt

erpJo suilel ur rreqt JorDeJ e8eluecred e se pesserdxe sfelqe sr l€ql uoq€lJ? LJo a"msoawa/ e sr uoIlaIJEa Jo luolr.rJJaoc aq1 peluessrd uo4euerr 3o semsseru snor,rard eql e{llun

uoltelrB^ *o lu€pll*sof, orll

'enrpSeu eq,Dâ uec (ecueuenTorlerlep pJ?pueF 'e8uer epgenbrelur 'e8uer eq1) uorleuenJo soms€erll eqlJo euoN r

'orez lenbe il" IIII!\ uorlerlep pJ€puels pue ,ecueu?,r ,e8uer ep1agur oe8u€r eql'(ewp oql ul uoq€rJ€A ou sr eJeql 1eq1 os) e{u"s eql ile er€ senlel oqlJl r

'uo4erêp prcpuels pu? 'ocueuett ,eauet

:gnbrelur 'e8uur eql Jollerus eqtr 'snoeueSoruoq Jo pep4uecuoc en ewp eql eJour eqr r'uorl€rêp pJepu€ls pue .ecu€rJ€A

eyprenbralur oe8uer eql re8re1 eql pesredsrp Jo lno peerds era etep eql oJorrr eqr r

:uo4erêp pJ€puels pu?Fel 'eEuer eplrenbrelur oe?uet eqlJo scrlsrJelceJ?rlc oql sezu"uuns Bul,t.ollog eq1

'le ralur sq] uqlrzll. erT srrrnlor pezrlenuu€ reef-eerql eqlgo (1go no S) rAV.tt'Gvz'tz:SI + X plur-9L6'6r:^gI -x uee \ leqSuFelsnyc ' 'a' I)r9'EzJou€erueqlgE9'€ uflll^r Surr-1snyc or? sumler sqr leql selecrpur SE9.€ Jo uorlerlep pr?pu€ls eql

9t9'E =

q S 'uotletêp prepuels eldules eql ' L1I e8ed uo (0I '€) uorlenbE 3urs61

VTIZ.TI =

9

= 'S[=S:_J

I=!,DK

9892'61

I_ L-

I-uT1

ZJ

Z(X _

:LU e8ed uo (6'E) uollenbE Eursq

serns?e14 errrldrrcsoc IecrrerunNl EEuHI UIIJdVHJ OI I

3.2: Yaiationand Shape I I I

For the sample of l0 getting-ready times, because X : 39.6 and. S : 6.77, the coefficient ofvariation is

= (q)roo%= r7.ro%[3e.6 )

cT/ -( 3 'g \r/w = lfr )r00%

_ 15.0%

For volume, the coefficient of variation is

#

cv = [+)'oo%

For the getting-ready times, the standard deviation is I7.l% of the size of the mean.The coefficient of variation is very useful when comparing two or more sets of data that

are measured in different units, as Example 3.10 illustrates.

3.10 COMPARING TWO COEFFICIENTS OF VARhilON WHEN TWO VARIABLESHAVE DIFFERENT UNITS OF MEASUREMENT

The operations manager of a package delivery service is deciding whether to purchase a new fleetof trucks. When packages are stored in the trucks in preparation for delivery you need to considertwo major constraints-the weight (in pounds) and the volume (in cubic feet) for each item.

The operations manager samples 200 packages and finds that the mean weight is 26.0pounds, with a standard deviation of 3.9 pounds, and the mean volume is 8.8 cubic ieet, with astandard deviation of 2.2 cubic feet. How can the operations manager compare the variation ofthe weight and the volume?

SOLUTION Because the measurement units differ for the weight and volume constqaints, theoperations manager should compare the relative variability in the two types of measurements.

For weight, the coefficient of variation is

Cvr - 25.0%

Thus, relative to the mean, the package volume is much more variable than the package weight.

Z ScorasAn extreme value or outlier is a value located far away frorn the mean. Z scores are useful inidentifying outliers' The larger the Z score,the greater the distance from the value to the mean.The Zscore is the difference between the value and the mean, divided by the standard deviation.

= ('-r-\roo%\ .8.8 i

, .2,,*,

, : : : : I , , ' , '

z scoRES

Q,l2)

For the time-to-get-ready data, the mean is 39.6 minutes, and the standard deviation is 6.77minutes' The time to get ready on the first day is 39.0 minutes. You compute the Z scorcfor Day1 by using Equation (3.12):

Z=X-X

^S39.0 - 39.6

6.77= - 0.09

.sonlel q8rq ro senlel /rclJo ecuel€qrul ue uI sllnsoJ sseu,&\e>ls sIqJ 'ueew eql plmoJ"

:Luru/ft lou eJ€ senl€A oql <uorlnqulsry paaa{s € uI 'lno Jeqlo gc€e ecuel"q senl€A qAIq

,.-r-r{ a{l 'esec sril uI 'u€eu eql eôq€ sonle^ eql se fllssxe pelnql4srp eJ? u?olu eql ,!\oleq

'r,. Jql 'uotlnqulstp IBJIJla1urufs € q 'pe.4ae>Is Jo lucl4elurufs reqlle sI uoqnqutslp V'sen::il IIeJo e8uer er4ue eql tnoq8norql senlsl slepJo uoqnql4slp oqlJo ureil€d eql sI ed"qs

&deqS

€€'0-LZ'Y9€'0-EL'TI E'099'0LL'O_

E9'EIg.EZV'ZZ0'6IE'ZZ6'626'VZ0'gz8'02

uollulâo prBpuBlsuuOIAI

soJoJS

Z

uJnlourBa^-aarqI

)s!u ̂ ôl 91!nn sPunllenlnlA LllMorD def

-l leuls oL{} ro+ suln}oupoz! | e n u uv I ea^-ool LII

oL.ll +o sero)s z

v' t 318Vr

'€+ ueql releer8 Jo €- u"ql ssel oJ€ seJocs Z aLA Jo euou esmceq €l€p osoql uI sJeII

: :ue;edde ou eJ€ eJeqJ '0'6I Jo uJnloJ pezlTenuu€ ue JoJ ' LZ'Y sI oJocs Z lsemol eql'6'62

urual pez{"nuue ve :r.irS'g2'1sI eJocs 7 1se?r 1eql qslJ 1v\ol qlyv\ sprng qlmor8 dec-11eurs

;oJ surnler pezllenuu€ rte,(-eerqt eqlJo sorocs z aql sel€4snlll t'€ elqsl Nolln-Ios

n* rSed ees) >1su naol qlp\ sprng qyvror8 declyeus eql roJ srlmleJ pezq€nuu€ ree,(-eerqt eqtJo

:rs Z eql elndruo3'(q8lq pue 'e8erem .u'o1) spury Isnlntu eqlJo Is el )slr eql puu '(enle^ ro

,,r3) ed^ft eqt'(dec e8rey pue 'dec ptur'dec leuls),fuo8elec eql ol Sutprocm pelJlss€lc an (96

:aas)oueue'sScI lsp€lS3ulsneq13owdarcteq}(W)spuryIeqnu8€8oqJ

)slu A 01 Hrn scNnJ lvnlnN Hr/v\ou9 dvf,-]]vlls uolsNunl3u oSzllvnNNv uVSA-33UHI 3Ht lo sluoSs z 3HL 9N|Indno)

E

iI

L L' t 11d hlvxl I6

fi'."'"'.,"."-,".[

89'0-s9'0LT'T-90'099'060'0-g8'r0s'0LS'I_60'0-

LL'99'68

9EvvTE0vvv6E7,9EV676E

uopulôp prspuulsuuat\l

arcJs, Z W) outtr

'srerTlno peJeprsuoc eq ol uolrsllJc lBql lelu seu4 eqlJo euo\l 'Q'[1 u€ql rel€eJts ro 0'€-illnr;: ssal sr lr Jr Jelllno ue peJeprsuoc sr eJoos 7 e'elnt yereue8 € sV 'selnulru 6Z selll. r(peer 1sB

),rj ; -rrrl oql qcr+!\ uo 'Z teO.JoJ ,g'I- s€,l\ eJocs Z lso1rcl oqJ, 'selnulru ZS sal{peer 1eB ol erul}

;iilrr: rJrq.ry\ uo '7 ,ftq JoJ €g'I sr erocs Z lse8rel eql 's,fup 0I II€ roJ seJocs Z eql s1l\or{s €'€ elqeJ,

soul l f{peey-6u!}}eD 0 L

oL.ll lo+ sorDs z

t 'g : l18VI

sernsseN e^rldlrcsec leclrelunN EEUHJ UEIdVHJ ZII

3.2: Yaiationand Shape

Shape influences the relationship of the mean to the median in the following ways:Mean < median: negative, or left-skewedMean: median: symmetric, or zero skewnessMean > median: positive, or right-skewed

Figure 3.1 depicts three data sets, each with a different shape.

113

ffi

ffi

ffi

3"1

fri;-'l:,: n of th ree15 ] -e i l ' tng In

Panel ANegative, or left-skewed

Panel BSymmetrical

Panel CPositive, or right-skewed

The data in Panel A are negative, or left-skewed. In this panel, most of the values are in theupper portion of the distribution. A long tail and distortion to the left is caused by someextremely small values. These extremely small values pull the mean downward so that themean is less than the median.

The data in Panel B are symmetrical. Each half of the curve is a mirror image of the otherhalf of the curve. The low and high values on the scale balance, and the mean equals the median.

The data in Panel C are positive, or right-skewed. In this panel, most of the values are in thelower portion of the distribution. A long tail on the right is caused by some extremely large values.These extremely large values pull the mean upward so that the mean is greater than the median.

AL EXPLORATIONS Exploring Descriptive Statistics

iiitoiinuu -;i61 use the Visual Explorations Descriptiveaililillliltmic$ trocedure to see the effect of changing datavalues

diagram for the sample of 10 getting-ready times usedthroughout this chapter.

Experiment by entering an extreme value such as 10minutes into one of the tinted cells of column A. Whichmeasures are affected by this change? Which ones are not?You can flip between the "before" and "aftef' dtagrams byrepeatedly pressing Crtl + Z (undo) followed by Crtl + Y(redo) to help see the changes the extreme value caused inthe diagram.

i;res of central tendency, variation, and shape. Openadd-in workbook (see Appendix D)

lrffirrvr: \isualExplorations t Descriptive Statistics: --1003) or Add-ins + Visual Explorations t

h e Statistics (Excel 2007) from the Microsoft"mreriu bar. Read the instructions in the pop-up box

il;ur*mation below) and click oK to examine a dot-scale

sod go ecueprlo eruos pe./Koqs slelel >lslJ er{} Jo r{cBE 'sdnor8 eeJr{}

uuls eq] q ecueroJJrp elllll frerr se,/y\ eregrspunJ 4su-eEete\e plp

; req8F{ ,(l}qBIIs e peqspunJ 4srr-q8F{ puB >lsIr-./Ko'I 'sleêl >lslr oerq}

*uB reef-eel{} eq} ur secuereJJlp }qBIIs eQ of rcedde orer{} 's11nser e{}

.sseu^/y\o{s

erll Jo suoll€Iêpue{} u€Ipelu pu€

oql roJ uJnler pozr

Sututurexe uI

. { f f i . *#*t l f f i t t *sg

iu;:" ' litg-

'$, - ' t ffiffi*--*"*i#$$;[ i$s##-* l*m*ff.* i #**s$r*H

iffi#ffi ffi&e--*i*;Yffi Hy*ggg fff,*giH -L i*gve'r

- iffiSs i----**oiltu*rh*ffi"ff,' ffip-r'ttts

iw.In"itffi""itffi$---,-.ffi-um*J$-$*mp*ro**

fistg q#tFJ,*Ss"*sir, *9J" --*t. : T.Y,,,,,.":..",.,.x.-:'r

d,'[ffihstFilffi$ffir$$#$#ffi

#$F'ft"r

ffit

#' wffi#

.slgl

eJeu) ol L'93 uollfas aas

loôl )su uo PeseqsuJnle) Pazt lenuue

.ree{-eerq} oL1} }ostr rls !]els enlld lJlse P

lorxf }+osolf l[N

z'g lunDll

('teeqs euo

-tel€pqosuoc oJe { steoq$lJorrr eleredes eeJlp uo pereedde 1€q} sllnseJ eql pu€ 'seru4 eeJIIl

* ,n^ rr.p"cord eql) .sprmJ I?nlruu lsp-q3u pue '1su-eSelel€ '4su-tltol JoJ sllnsoJ etr€J

-,+m;,s e1epc1e, o1 "*p"*ra

mtl.ttnts e,rqducseq eqt Eursn.;fo slFSeJ oql s^\oqs 7'g ern81g'uoIlnqIDSIp pedeqslleq e ueql 4eed

.llm:lEr.ls € qll^{ uo4nqulsrp "

selecrpur 3nI?A e^qrsod y'uorlnqulsrp pedeqsleq 3 u€ql JeDsU sI

mtr' uollnqulslp € selsclpur enls^ e,uleSeu v'uollnql4slp pedeqs-1sq e seletlpul oJezJo enle^

$ssrun)l v .EIel oql qryv\ pereduroc se 'uotlnqplsp eql Jo Jeluec oqtr uI senl€A Jo uollsquecuoc

fiidl€f3J oql sems€eru sffo4ny'sserl \e{S Uel se}eclpur OnI?A e,rqe8eu € ellqa sseu,le>1s lqBF,ffir[aJrpur anl€A elr]rsod y 'uorlnqUsrp leculeururfs ? sel"clpul oJoz Jo onl€A sseul\e>ls V 'e]€p

"nm: ui-,trteruru,{s Jo {o€I eql seJns"ou ssauways 'ezts eldrues eqt Jo looJ erenbs eqt fQ pepFlp

ffi':lernep pJ€puels eqt si 'lretdeqJ uI pessnc Stp "totta

pnpuDts eql 'uollces stql ut flsnor'rerd

@snJsrp lou scrlsrl€ts eerql'sseurvre>1s pu€ 'slsolm>l eql torre pJepu"ts eqi sfeldstp Pu? solelno

-*. ,r.,p""ord ?qt 'uourppe q 'teeq$IJoa\ ^\eu

€ uo scqsllsls eseqt sfeldsrp pue (ezls sldures)

fimf--DJ pue 'UInUIIXeUJ 'UInUIIUIUI 'gEUet 'OOU€Uert 'UOrler'rep pJ€pU€lS 'epotU 'U?Ipetu 'UBoIU

pr. salnduroc (1'gg uorlceg ees) q-ppe {edlool egtrJo empecord scqsqelS e'r4drrcse61 eqJ

silnsog stlls!1et5 e4ldprsa6 ls3x3 ltosortlp\l

serns€o141 enrldlrcseq l€clrerunN Ef,UHI UAIdVHJ VIT

tlNe Basics

following is a set of data from a sample

7 49 8 2

fu mean, median, and mode.:ffie ftmge, interquartile range, varrance, stan-

mmion- and coefficient of variation.ffis Z scores. Are there any outliers?

shape of the data set.

followitrg is a set of data from a sample

7 497 3 12

tffis mean, me dian, and mode.ffis range, interquartile tange, variance, stan-

and coefficient of variation.M Z scores. Are there any outliers?ffiE shape of the data set.

following set of data is from a sample

[27 49 0 7 3

lilffie mean, median, and mode.ffie range, interquartile range, variance, stan-

ilmim' and coefficient of variation.ffie Z scores. Are there any outliers?

shape of the data set.

The following is a set of data from a samplem: 5:

7-5-879

ffis mean, median, andmode.ffis range, interquartile ran5e, variance, stan-

tffiilm, and coefficient of variation.Z gcores. Are there any outliers?shape of the data set.

Suppose the rate of return for a particularduring the past two years was l0% and

Compute the geometric mean rate of return.of return of 1 0o/o is recorded as 0.10, and affi3'0f/o is recorded as 0.30.)

Concepts

The operations manager of a plant thats tires wants to compare the actual

diameters of two grades of tires, each of

3.2: Variation and Shape 115

the results represent-ranked from smallest

Grade Y

568 s70 s75 578 584 573 s74 575 577 578

a. For each of the two grades of tires, compute the mean,median, and standard deviation.

b. Which grade of tire is providing better quality? Explain.c. What would be the effect on your answers in (a) and (b)

if the last value for grade )'were 588 instead of 5 7g?Explain.

3.7 The datain the file @ contain the pricefor two tickets with online service charges, large popcorn,and two medium soft drinks at a sample of six theatrechains:'

$36. 1 s $3 I .00 $3 5.0s $40.2s $33.75 $43.00

source: Extractedfrom K. Kelly, "The Multiplex (Inder siege," TheWall Street Journal , December 24-25, 200i, pp. pI, p5.

a. Compute the mean, median, first quartile, and thirdquartile.

b. Compute the va:nance, standard deviation, range,interquartile range, and coefficient of variation.

c. Are the data skewed? If so, how?d. Based on the results of (a) through (c), what conclusions

canyou reach concerning the cost of going to the movies?

3.8 A total of 92,000 new single-family homes were soldin the united States during February 2006. The medianprice of the homes was $230,400, a decrease of 2.9% fromFebruary 2005 (U.S. Census Bureau, www.census.gov).Why do you think the Census Bureau refers to the medianprice instead of the mean price?

3.9 The data in the file @ contain the bouncedcheck fees, in dollars, for a su*pt. of 23 banks for direct-deposit customers who marntain a$100 balance:

26 28 20 20 2t 22 25 25 18 2s 15 20

18 20 2s 2s 22 30 30 30 ls 20 29

Source: Extractedfrom "Tlte l{ew Face of Bankiftg," June 2000.Copyright @ 2000 by Consumers (Jnion of (J.5., Inc.,yonkers, Nyr0703-r057.

L. Compute the mean, median, first quartile, and thirdquartile.

b. Compute the variance, standard deviation, range,interquartile rarrge, coefficient of variation, and Z scores.

c. Are the data skewed? If so, how?d. Based on the results of (a) through (c), what conclusions

can you reach concerning the bounced check fees?

tires of each grade was selecte{ anding the inner diameters of the tires,to largest, are as follows:

Grade X

tf f i

to be 57 5 millimeters. A sample of five

6L'E 6r '9 9V'9 Zy S 8€'0 0I '9 js 'v

ff i i l i lE V9'E VE Z LL'V E I '9 Z0'E S9'S rZ'V

:,ry\oloq petsll ere pTrelfi@fit ellJ ewq

$ peuleluoc eJ€ sllnseJ oIII '>loo./y\ euo Jo poped 3 JoAo

sr rnoq s1ql Suunp sreluolsnc g I Jo eldures 3 JoLY\ Jellel oql seI{ceeJ el{s Jo oI{ uoq,&\ ol ewl oq} sJe}ue

eq] erull oql se peulJep) solnuru uI 'eulp Eultlezlt

?olred qcunl 'ru'd 00:I-ol-uoou eql Eupnp sroruolffmnres JoJ ssecord penordult ue pedololep seq ,qlc

nsFlslp l€Icreruruoc E rrI pelecol qcu€rq >lueq v I L'g

'sllnsoJ eql uI ocue

eq] uo tuolu-ruoJ 'enlB^ srql Sutsn '(c) qEnorqt (e)'ggJo peelsul 86 s€^/y\ enl€A lsrlJ eql }€q1 esoddnS 'p

'slo>lcl1 ,(ep-euo

ud uorssnupe Surlrels eql EuluJecuoc l{Jeer nort

rsnlcuoc let!!r. '(q) pu€ (e) lo sllnser eIP uo pes?g

pBIAop pJ€pu€ls pu€ 'ocrJeuen'eEu€J eql elndulo3'eppenb

pue 'eppenb lsJlJ 'uerpeur 'u€eur eql elndulo3 'v'hd 'Id 'dd

"900e 'g fS t U'tdY'leuJnof lee4s II€16 egl,'ronoi illrqJ aql

'!qpY,, 'I,tDttilau'r,g 'fl puo uosY)Df 2 wo'{papoL-txg :ac'mos

0v0vEvz9 0s 6zzvwE9 89

:solels pollun eql uI sFed

0I ol slo>lcr1 ,(ep-euo roJ ($ ur) ecrrd uotssrup€ Eul

eqlur€luoc@ellJ or{}q EwP eqJ, VVe

'ureldxg eporoJJo spletf

Trl eql uI uoIleIJeA eJolu oAeII SCIJ ree^(-euo Jo

J3 }e>lr€ur ,(euoru op '(e) Jo sllnser eql uo peseg 'q'uollel IELfo lueIOIJJeoc pu€ 'eEuer eppenbJelul

'uorlellop pJ€pu?1s 'ocu€uul oql elnduroc .(1e1er

'sC[J ree.(-euo pue s]unocc€ ]e{tgTrr,(euour JoC 'v

' g00e' 27 ronnunf1aocarufiluog ruo'tt papo4xg : a)'mos

F 98' t S8'? 06'V V6'V 8E V 8E'? \V'V \s'V sg'v

cJ ruo^-ouo slunoJcY lo{rBHI,teuo6

:900(,'VZ Kmnu€f

su SOJ ree,(-euo pue Slunocce 1o>lJeul ̂(euolu ro3 sp1er,(

H opr./v\uorl?u or{} s}uoserder ffi ol}J erlt

rqsp eql equ€q uee/yqoq sluotulsenul 3o sed,fi lueroJJlp

splelf, orll Jo uor]err€^ egl ut ocuoroJJlp 3 ororll sI t L'g

tsloqs ul) oJIIesereru€ c 1eyfip lox1d eerql

,ftepeq eqt EuluJecuoc qc€er nof rrel

'(c) qEnorql (e) Jo sllnser eql uo poseg 'p

e/Koq 'or JI epe./Ke>ls ?l€p eql eJV 'J'ureldxg esrorllno ftm erelll erY

Z pTffi'uo4e1 relJo luelclJJeoc 'e8uer eppenbrolul

J 'uorlerlop pJepuels 'ocu€u€zt or{} elndulo3 'q'e14renb

pue 'eppenb IsJIJ 'uetpeur 'ueeur eql elndruo3 'v

0v7, 0I I 07r 08€ 9E 092

09v 08E \Lr 98 08I 00€

:sereruec lelr?Ip Iex1d eerql roJ (s1oqs ul) oJII ,("re1-wq,eqt slueserder@ellJ otll q ewp eqJ, ZVt

aseqcl^\-pu€s ue>lclqc Jo 1€J lelol oql Suturecuoc qc€eJ no.( uelsuorsnlcuoc ler{â '(c) qEnorq} (€) Jo s}lnser eq} uo pes€g 'p

el\oq 'or JI ape,/Ke>ls elep oql eJY 'c'ureldxg esrerllno tlue ororp oJV

'seJocs Zpue 'uo4epenJo luoTclJJooc 'e?uet elpenb;olul'e1uet 'uorlerlop pJepuels 'ecu€uen oI{} e}ndulo3 'q

'eppenb

pJlrll pue 'eltpenb lsrlJ 'uerpeur 'ueeul eql elndulo3 'e

'I fge 'dd 'h002 raqulatdas 'sgodeS rarunsuoJ

,,'nua74J aql of tppag Sutpp7:pool 1sDl,, u'ro'r{papuqxfl :a)'mos

9S 0V 0€ 0€ 6Z 67, 61 9Z 0E EZ

OE 6IVZOZOZgI 9 V 8 L

:s^\olloJ s3 sI e$P eql 'suleqc pooJ-ls€J ruoJJ

seqcr/y\pu?s ue>lcq J 0z;o eldrues € JoJ 'EutAJes red surerS

uI'wJ Ielol oql uleluoc[[$s[[lollJ orll ur ewp eql LL't

e$lcnqr€ls pu€ slnuocl (uPlunc ]3 $lulrp oeJJoc

pocl ul leJ pu€ solJolec eq] Euturecuoc r{ceer no^( uac

suorsnlcuoc l€I{â '(c) q8norqt (e).lo s}lnser elp uo pes?g 'p

e^\or{ 'os JI epe/Ke>ls e}€p eq} erv 'J'ureldxg esJerllno kse eJoril erv 'soJocs Z pIffi'uotleuen;io

luelclgeoc'e8uel elprenbrelq' e?rtet'uotletnop pJepu€1s

'elveuan eg elnduloc '(leJ pue seuolsc) elqeu€A qcee Jo.{ 'q'eppenb prlql pue 'elnrenb lsrlJ 'uerpeur

'u€oru eql elnduoc '(teJ pue sorJolec) elqulJuA r{oee Jo.f 'e

" '6 'd 'f002 aunl"spodeA JorunsuoJ ,,'V)nq'mgpuo sruuoclutyun1[ 7n [puo3 sn aa[o3,, u,ro,tf papD"tlxfl :a)'mos

.J

'q

0'61 OES

O'ZT, OI9

0'9I \zv

(ueerc peddrqm) eurqrJ Pepuelgouecndderg elelocoq3 s{onqre}S

(ureerc peddgm) eegoc Pepuelqouccndderg ellrîy\oJg ol€locoqJ $lcnqJ€ls

(rueerc peddrqm) ee;;oc

Icuoc l€rllA

popuelq owccndderg €qcolN $lcnqrs]S

0'07, ggt (ueerc peddrqm pue {llul oloqm)

osserdxg €rlcohl eogo3 pecl qcnqre]S

0'ZT, gSE (uluerc) epelooJ oeJJoJ slnuocl 6uPIunC

9'E 0g7, oaJJoc

pepuelq ouccndderg eoJJoJ $lcnqr€]S

0'8 \VZ (lttur eloqzn)

onel lrl,lrs ?I{co}\] pecl s}nuo(l 6ul4un(

lB.{ soIroIBJ lrnpord

:s>lcnqrelspu? slnuocl 6ul>luncl lE s>lulrp eeJJoc poclecuno-9IJo(sulerEuI)}eJpu3SeIJoI€cer{}}uoSff i-erder @ allJ or{} w etep eql ol.g lslrslr I

sernseol4l enrldlrcseq IeolrerunNl EitUHI UEJdVHJ 9II

-e mrean, median, first quartile, and third

the r-ariance, standard deviation, tange,range, coefficient of vanattofl, and Z

fiiMG thEre any outliers? Explain.skewed? If so, how?

rvalks into the branch office during theshe asks the branch manager how long she

tm u-ait. The branch manager replies, 'Almostfiqxi's than five minutes." On the basis of the( a p through (c), evaluate the accuracy of this

that another branch. located in a residen-rurilsn concerned with the noon-to- 1 p.m. lunch

w,ururing time, in minutes (defined as the timeqnters the line to when he or she reaches the). of a sample of 15 customers during this

over a period of one week. The results

5 ar) 8.02 5.79 9.73 3.82 9.01 9.35

ffi"ffi8 5.64 4.09 6.t7 g.gI 5.47

tfue mean, median, first quartile, and third

the variance, standard deviation, range,.of variation. Arerange, and coefficient

mwC .trwm finance.yahoo.com, April I 7, 2006).

rhe geometric mean rate of increase for the

S 1,000 of GE stock at the start of 2004.mmn ralue at the end of 2005?ttttrne result of (b) to that of Problem 3.18 (b).

lnternational, Inc., develops, manufac-su,[,[,s nonlethal self-defense devices known as

3.2: Variation and Shape ll7

tions institutions, and the military, TASER's popularityhas enjoyed a roller-coaster ride. The stock price in 2004increased 36r .4%, but in 2005, it decreased 78.0%(Source: Extracted from finance.yahoo.com, April I 7, 2 006) .a. Compute the geometric mean rate of increase for fhe

two-year period 2004-2005. (Hint: Denote an increaseof 3 61.4% as Rl : 3 .614.)

b. If you purchased $ 1,000 of TASER stock at the start of2004, what was its value at the end of 20A5?

c. compare the result of (b) to that of problem 3.17 (b).

3.19 In 2002, all the major stock market indexesdecreased dramatically as the attacks on glll drove stockprices spiraling downward. Stocks soon rebounde4 butwhat type of mean return did investors experience over thefour-year period from 2002 to 2005? The data in the fol-1owingtab1e(containedinthedataf i1eEBI@)repre-sent the total rate of return (in percentage) for the DowJones Industrtal Average (DJIA), the Standard & Poor's500 (s&P 500), and the technology-heavy NASDAeComposite (Nasdaq).

Year DJIA s&P 500 NASDAQ

2005200412001200t/

-0._63.4

30.0* 16.8

2.9g.L

26.4-24.2

r .48.6

50.0-3 1.5

ruruthers? Explain.skewed? If so" how?

walks into the branch office during thehe asks the branch manager how long he can

tM, wrait. The branch manager replies, 'Almostiless than five minutes." On the basis of the(iat nhrough (c), evaluate the accuracy of this

Electric (GE) is one of the world's largestm fuelops, manufactures, and markets a wide

. including medical diagnostic imagingcngines, lighting products, and chemicals.

ilare, NBC Universal, GE produces and deliv- Year Platinum'ierision and motion pictures. In 2004., GE's20.6%, but in2005, the price dropped 1.4% 12.3

5.736.024.6

Mcriod 2004-2005. (Hint' Denote an increase;msrRt:0.206.)

source: Extracted from finance.yahoo.com, April I 4, 2006.

a. Calculate the geometric mean rateof return for the DJIA,S&P 500, and Nasdaq.

b. What conclusions can you reach concerning the geomet-nc rates of return of the three market indexes?

c. Compare the results of (b) to those of Problem 3.20 (b).

3.20 In 2002-2005 precious metals changed rapidly invalue. The data in the following table (contained in the dataf i1e@@representthetota|rcteofreturn(inpercent.age) for platinum, gold, and silver:

Gold Silver

200520042003i2402

17.84.6

19.92,5.6

29.514,:C77 i&'."3'3

Source: Extracted from www.kitco.com , April 14, 2006.

a. Calculate the geometric mean ruteof refurn for platinum,go14 and silver.

b. What conclusions can you reach concerning the geomet-ric rates of return of the three precious metals?

c. Compare the results of (b) to those of Problem 3.19 (b).ing primarily to law enforcement, correc-

'Eg'z q spury puoq eseql roJ urnler eSeluecrod ueeru eq] 6snrlJ

t9 'Z =sI 'EI 99' t + g8'Z + 9Z'Z + Z9' I + VL'Z

=d

:(gt'E) uorlenbgelqer ut uenrE spung puoqJo uorlelndod eql JoJ urnler ree.(-euo u?oru eql etnduroc o1

11

-r'*K/1

uoltelndod oql ur sonlen 'X llnJo uorlururunr - ,X K

x eIqErrBA erllJo onls^ rll! _!f 11

4{{l[I 'f)

ueeru uor]Plndod - d

SJeI{,/K

NV3kld Ntrttv'xndod

ffiiffi#&&$ tu$#ff8,ffi#ffidffiffi# ffia-$a

'9V 'd '900e 'g [.,ncn"tqai 'Ierrrnof ]eer]S II€A eq7 wo"{ papzrtx7 ;artnos

.41 ,- - I__: l=n

'*K

:ortelndod eqt dq pep1,r1p uopelndod.Ur,r, ,.nr* eqlJo rrms eql sr u€oru ""*"tu#

3'Uf

99'E88'ZSZ'ZZ9' IV L'7,

ylcul CI VJ urpluerCnu1ly61l\t9 pren8uel

ylpuog spunC uecrrorrryrulipg loJ prenEuellsullul6 lelol:ocrurd

sPu nl PuoB1se6re1 e^!J eLl] +o

6ur lsrsuo3 uor le;ndo6oLl] roj urnlou leo^-ouo

9'g ! l lgvturnlou r8o^-auo punc puog

( ffiffiolIJ eqt ur poureluoc er€mm :q1) '9002'lt,ftenue1go se (slesse l€rolJo sruel ur) spurg puoq lse8rel e^rJ eql roJ runler.;m;i-auo eql sur€luoc qclq \'g't elquJ,trerleJ lsrr;'sreleure;ed eseql eleJlsnllr dleq o1

'uorl€rlep pJspuels uorleyndod pue ,ecueuel uorlepdod ,ueeur uorl

--,lod eql :sJelelu€red uotlelndod err.tlducsep eeJql lnoqe rrJ€el ilLr\ nof 'uorlcss sql uI .uor]"fl'-.'lod € JoJ seJns?our freuuns 'sta1awo"md lerdrelur pu€ e1elnclec ol peeu notl,,uogn1ndodililrLra ue JoJ slueluoms"elu leclJerunu slueserder les elep mof g1 'a1dwos e JoJ uorl"lJe1 pue,':r-:puel lg4uecJo sergedord eql paqrJcsop teqt ffillsrlqrs snou?A lueserd z'tpw I.€ suorlces

NOUVlndOd V UOI StUnSVSru 3^trdtutrs]q lv)tuSwnN E'g

sernseel4i enrldrrcsoq lecrreunN EEUHJ UEIdVHJ g I I

3.3: Numerical Descriptive Measures for a population 119

The Population Variance and Standard DeviationThe population variance and the population standard deviation measure variation in a popu-lation. Like the related sample statistics, the population standard deviation is the square rool ofthe population variance. The symbol o2,the Greek lowerc aselerter sigma squared, represents thepopulation variance, and the symbol o, the Greek lowercase lelter sigma,represents the popula-tion standard deviation. Equations (3.14) and (3.15) define these parameters. The denominatorsfor the right-side terms in these equations use N and not the (n - l) term that is used in the equa-tions for the sample variance and standard deviation [see Equations (3.9) and (3.10) on page tOZ1.

*'TP"on mean

o2= (3;14)

ffiere

POPULATION STANDARD

l"r : population mean

O=

1r xi:

Lr ' , - ,u)r -i=!

ith value of the var rrble X

summation of all the squared differences between the

:Ji;,il''nr

I t t i -p)ri=7

ff(3.15)

To compute the population variance for the data of Table 3.5, you use Equation (3.14):

o2=

_(2.74 -2.63)2 + (r .62 -2.63)2 + (2.25 -2$)2 + e.88 - 2.$)2+ (3 .66 - 2.$)25

0.0121 + 1.0201 + 0. 1444 + 0.0625 + 1.0609

)10=:::: = 0.46

5

Thus, the variance ofthe one-year returns is 0.46 squared percentage return. The squaredunits make the variance hard to interpret. You should use the standard deviation that isexpressed in the original units ofthe data (percentage return). From Equation (3.15),

,A/

Zrri- ' ,)2i= l

t/

o= ̂ F_I

= 40.46 = 0.68

o/oo0rx(tt i l -)

IIBetu oq] Iuo{ suollelAep pJ€puBls 4 Jo secu€lslp uilIll/!\lseel ]B eq

punoJ ere Wql senle^ Jo e8eluecocuereJer) alnr aeqsriqeqJ eql

afimH A$r,tsAq$q3 aq&

sql 'edeqs Jo sselpreEer '1os ewp Kue roJ l€rll selels ( t

'elnJ lecurdrua

Jo peelsur pegdde oq plnoqs naoleq pessncslp elnr neqsfqeqJ eql'uos€er raqlo fue roggeq Eurreedde 1ou esoql Jo 's1es elep peA\e{s ,Q1,reeq Jog 'sJelllno poJeplsuoc s,{ern1eeJg o€ a r{ lerrrelur eql ut prmoJ lou senlel 'erogereq; 'useru eql ruo{ suoq€Iêp pJ€p

eerql puo,fuq eq ilI1!\ 000'I uI € lnoq" dpo leql segdur osl" eIru eql'srelllno 1el1ue1od sea rl lerrrelur egl ur punoJ lou senl€A Jeprsuot uec no,{ 'e1ru lereue8 s sV'uoqceJlp Jaqlle uI

eql ruoJJ suorlsrêp pJ?pu?ls orrq puo,teq eq rur!\ senl?A OzJo }no I lnoqe dpo 'suoqnqp pedeqsleq roJ leql segdun e1ru lecFrdrue egl 'sJelltno ,!4uept no,{ dleq u?c pu? useruaoleq pue eloqe elnqrqsrp senIBA elp

^{oq eJns€elu nof sdleq elnr lecrrtdure sq;

'lreetrr eql ruo.U suo4?Iêp prepu€ls g+Jo ecIIBlsIp B ulq{1( en yo;16,feleurxorddy'ueeru eql

suorlerlop pr€puels Z+ Jo ecu?lsrp e ulqll^{ eJB senl€A eq} Jo %S6 ,(lepurxorddy.u€eru eql

uorlerêp pJ"pusls I+ Jo ecuslslp 3 ulqlll|/\ eJ€ senl?A oLll Jo o 8g,(leleunxorddy

:suo4nqlrlslp

JIeq q dillqeuurr eql eurruexe ol olnr 1uc;.4dure eql esn u€c no1'uollnql4slp pedeqse Eurcnpord 'useur pue u€rperu eql prmoJ€ Jelsnlc ol puel uego senl€A eq1 'eures eql

s?etu pu€ uerperu oql eJaqy\ 's1es elep leculeurul(s uI 'ueeru eql u€gl releerS en1e,r e 1e 'sru€eur erpJo tqEp eqt otr Jelsnlc ol puel senl€A eql 'sles elep pe l.e{s-gel uI 'u€eul eql uerll

enlel € 1e 's1 pq1-ueeu eql Jo SeI egl ol smcco 8uFe1sn1c srql 'sles epp perrc1s-1q8F'rrerperu oql Jeeu lerl,&\eruos Jetrsnlc ot puel sonl"A oql Jo uoluod eErel € osles €1zp lsou uI

alng le)!r;du1 aql

flleerE ragroop leql sllnseJ ocnpord sputu puoq s8rel eseqtr 1eql sNsSSns uoq?u€AJo 1tmolue lleurs sql'g flepunxordde ,Q $'Z Jo rr€eru eql uIoU sreJIIp umlar eEetuecred lectddl eql 'erogereq;

'secrmo zI ueql ssel ursluoc ilrx\rpq1,!e1ypmf,UBUst1r'erogereql'sectmo ZIZIpw 00'ZI uee qequlsluoclll1'l.oA'66

xordde pu? 'secrmo 0I'T,l pue Z0'Zl ueell.leq uleluoc ll4 %56,(leleurxordde osectmo

Vn rgU ueea$eq ursluoc ilur su€c eqlJo yo1g,(leleurxordde 'e1ru yecutdure eql Euls61

(T,r'T,r '00'TD - (20'0)g + 90'zI : o€ + tJ

(ot'zt 'zo'z) - (zo'o)z + go'zt : ez r rf

(so'zt 'vo'zl) : zo'oT 90'zr :o T t ' f NO|Inl0S

{,?loc Jo secrmo zI veql ssel uleluoc ilF& u€c e pq1 ,!e>11 dre,r 1t s1 's1q8tem-1llJ Jo uo4

ry eql eqrrcseq 'pedeqs ileq eq otr urtou>I st uo4eyndod eql'20'0Jo uoll€Ilep prupuets €s'ouno g0'ZI Jo lqElemlg u€eru € effiq ol ux\otDl sI elocJo su€c ectmo-Z1go uo4epdod y

31nU ]V3rutdru3 tHr 9NrSn zL' t 31dl lvx]

serns€ohl e^4drrcsec lecrrerunN EituHJ uiIIdvHJ 0zI

3.3: Numerical Descriptive Measures for a Population l2I

You can use this rule for any value of ft greater than 1. Consider k : 2. The Chebyshev rulestates that at least 1t - 1ttZ121x 100%io:75% of the values must be found within +2 standarddeviations of the mean.

The Chebyshev rule is very general and applies to any type of distribution. The rule indi-cates at least what percentage of the values fall within a given distance from the mean.However, if the data set is approximately bell shaped, the empirical rule will more accuratelyreflect the greater concentration of data close to the mean. Table 3.6 compares the Chebyshevand empirical rules.

l r i l l l l i l r

ililillll\\\h*

illllliltffililll

" iilllllililll'

rrrrrliiillllli'''

ililrfilulltll]r

hlllrtttttillllrir

m||m|tfill[l|rr

'"i,rililliittlllll[[,,-,S: 3 6

i i r r r t l l l i ' i , ' . r , ,11111, l i t : : " -" . , ; , , ' - , +fCUnd

o/o afValues Found in Intervals Around the Mean

IntervalChebyshev

(an distribution)Empirical Rule

(bell-shaped distribution)

(p-o,Fr+o)

0r-26,p+2o)(p-3o,p+3o)

At least 0%At least75%At least 88 .89%

Approximately 68%Approximately 95%Approximately 99.7%

,'J. E 3.1 3 USING THE CHEBYSHEV RULE

As in Example 3.12, a population of 12-ounce cans of cola is known to have a mean fill-weightof 12.06 ounces and a standard deviation of 0.02. However, the shape of the population isunknown, and you cannot assume that it is bell shaped. Describe the distribution of fill-weights. Is it very likely that a can will contain less than 12 ounces of cola?

SOLUTION p + o - 12.06+ 0.02 : (12.04, 12.08)

p + 2o - 12.06 + 2(0.02) : (12.02, t2.10)

p + 30 - 12.06 + 3(0.02) - (12.00, 12.12)

Because the distribution may be skewed, you cannot use the empirical rule. Using theChebyshev rule, you cannot say anything about the percentage of cans containing betweent2.04 and 12.08 ounces. You can state that at least 75Yo of the cans will contain between 12.02arrd 12.10 onnces and at least 88.89% will contain between 12.00 and 12.12 ounces. Therefore ,between 0 and l l.Il% of the cans will contain less than 12 ounces.

You can use these two rules for understanding how data are distributed around the meanwhen you have sample data. In each case, you use the value you calculate d for X in place of trrand the value you calculated for ̂ S in place of o. The results you compute using the sample sta-tistics are approximations because you used sample statistics ( X, ^9) and not population param-eters (p, o).

the Basics

3"21 The following is a set of data for a popula-. I \ \ i rh i / -10:

8 3 62 98

::e population mean.:re population standard deviation.

3.22 The following is a set of data for a popula-tion with l/: 10:

7 5 6664 8 6931t

a. Compute the population mean.b. Compute the population standard deviation.

'(V prur- g secuere;er) 1o1d rolsqm.-pu?-xoq oqt pu€ Ar€luums

ulrluiJlfln-eArJ eql sepnlcut pqt srs,(leue elep ,fto1ero1dxe qEnorqt sl "tep I€cIJoIrmu Eutqucsep

** JaqiolrV.edeqs pue 'uorlepe,r',{cuepuel l€4uecJo sems"alu ssncslp €'€-I'g suo4ces

slSAlVNV VIVO AUOTVUOIdXS v't

'(e) q pelelnclec sreletu€ rcd eql lord're1u1 'q

"soru€duloo 0E Jo uollelndod srql roJ uolpz\erydec

eglJouoq€IêppJepu€}spu€ueolueq}o}€In3I33.v'g00e 'f lldY 'uroJ'uuc',teuotu wo'{paPurtxq :acJnos

l[l![![lollJ eIPq Poprocor sI sen

nouBZI leydec le>lJ€ru Jo uollelndod erque eqI 'uo[[q

t$ s.[qoN-uoxxg 01 uoqilq s9'E$ scpre>lc€d-$el^\eH

paEuer setuedluoc oseql Jo uolqezrlelrdec 1e>lJ€Iu

Z'V IIrdV uO '>lcols Jo orer{s e Jo ecud oq} {q pe{d

ser€r{s >lco}s Jo roqlunu e{} 3ur4e1 f,q pttndtuoc st

'uor1ez11e11dec lo>lJ€* sll esn 01 sl ,(ueduloc e Jo ezIS

oru o1 poqleut uolutuoc ouo esolueduloc oseql oJe

q lsnf 'VI1CI eql eslrdruoc seruedruoc 'Qrrq; LZ'g

lpe8ueqc sllnseJ orll eêq /Y\oH 'peôruer

IoJ Jo lcrrlslc oql qll^\ (c) qsnorl{} (e) leedeu 'p

e(O ur sllnsor eIIl 1e Pesudno,{ eJV 'elnr lecmdure er{} uo pos€q pe}cedxe eq

l€qlA SnSJeA sEurpur; .mo.( 1ffi4uoc pue e;edruo3 'J

eu€eIu eql Jo suop€lêp pr€pu?}s E+ ulqll^\*ueerrr eril Jo Suollellop pJ€pu€ls Z+ Il\Llllle 'u€elu

uoll€llop pJ€pu€ls I+ ull11li!\ uolldrunsuoc ^(E-reue

red e1err'te seq se131s esoql Jo uo4rodord teg^[ 'q'uopelndod eqt

mrlernep pJBpu€1S pue 'elveluel 'u€eul oql olndruo3 'v

:ree.,( luecer e Eur.rnp €IqrunloJ Jo lcl4slg oql pu€ sol€ls

Jo tlc€e JoJ 'smoq il?^\oll>l q 'uoqdrunsuoc '(Ereuered oql sureluoc I[[[[@ollJ otlt q e$P oqJ, gZ't

'ureldxg espunJ

oser{} Jo s}ess€ oq} ur ,QqIqeIJ€A Jo }ol € erol{} sI 'J

sreleu€;ed esoql 1e;d;elul'uoll€1ndod

roJ uollelêp pJspu€ls pue ecu€IJ€A eql elnduo3 'q

'roleru€ ted s1ql 1erd.re1q 'sputlJ >lcols

I e^lJ erpJo uoq€lndod sFIl roJ u€eru eql elndruo3 'e

eslunolus o $ l€qle ueo/Yueq

SuJnleJ l€1ol ;ee^(-euo ol€r{ ol pelcedxe eJe SpunJ oseql

Jo %g L'86lseel 13 'e1nr neqs,(qeqC eql o1 Eurproccv 'p

auBeru eril Jo suopelêp pr€p

-u€1s €+ ro 'Z+ 'IT ulqllrrt eQ ol pelcedxe ere spunJ eseql

go e8eluecJod leqm 'e1nr ,reqs..(qel{3 eq} o1 EurpJocov 'J

eueotu eqlJo suopelêp prepuels z+ ulqll^\ 'q

eueelu erpJo uo4€Iêp pJ€pu€ls IT ullill1!\ 'E

oQ o] pelcedxo sl spuru eseql;o eEeluec;ed

ter{Â oelru

lecuduro eqt o1 Eurproccv '(O g'0I pu€ (r0) s's

ere selpmnb oql ]elp pu? f U ol yZ- IuoU sI surnlor Ielol

ree^(-euo oql ut eEueJ erll 1€q1 peunuJelop en€q no'( esod

-dns 'uoplpp€ uI '9 ;Z q 'uo4e1êp pr€pue1s e$ 'o leW pu€

0Z'8 sr 'spury er{} II" ,(q peôqce urnqt e?e}uecred 1e1o1

.ree^(-euo ueoul e{} 'rl 1eq} pouruJe}ep eneq no1'sorueduroc eErel q pelse^q ,(1peu4ld leqt spuqJ

Ienlnul VZy'I Jo uoll€lndod e roplsuoJ VZ'e

a(O ur sllnsor arp 1e Pesud

-Jns nor( eJV'elnJ lecutdure eq1Jo slseq oql uo pelcedxe

eq plno,&\ ]€ql!\ qly!\ sEurpurg rno^( lse4uoc pu€ eredruo3 'J

euseru eql Jo suo4

-€rAOp pJepuels €+ lo 'Z+ 'IT ulqllzlr sldtecerxal seles

^(pepenb el€q sosseulsnq esel{1 Jo uolgodord }eI{16 'q'uoqelndod sFil

JoJ uoll€lnop pJ€pu€ls pue 'eoueuen 'u€eul oql olndulo3 'e

9'8 6 '8 I '0I 9 ' L S'0I

8 ' L 9 ' I I IU S' I I E '6

9'0I tz l t 'OI E'6 9 ' ( ' r

8 '7,1 0 '0I z '6 6 'Z1 0 '0I

9 'L s'9 9'ZI I 'S I 9 ' I I

I ' I I Z 'OI I ' I I

L '8 8 ' I I 0 '8

E'L Z' IT O'€I

0 ' I I L '9 0 '€ I

9 '6 I ' I I € 'OI

:elsJol

l€ql rn sluoulqslqelse sseusnq 0g II€ f,q 9OOZqrml I Eul

-pue poged erp roJ e{eT rl€C 3io eEelll1 oII} Jo rollo4duloc

eql ol poilFuqns (sre11op Jo spu€snoql ur) sldlaoer xel sel€s

,(1repenbeql}uesoJder@eIIJeI{}vI .e lepeqIez.e

sldasuoS eql Fu;{ddY

1141 enqdlrcseq I€cIreIunN 1I1IUHI UEIdVHJ ZZI

pury€4uoJ ,,qllepld

V:r{s16 spund u€clretuv

V:VOI spun{ u€clreurv

^q ixopul 00S P.ren8ue.6

y iorg spunC u€clroruv

8'6 6'6s'6 9 '0I9 'Zr E'S

€'0I t '8s'vl 0'6

r '09v'290'Lgi'eBg',tL

s lseErel

slessv punf,

'spury

o^lJ or{} Jo 'sre11op Jo suoIIIIq uI 'slessu

@ollJ or{} ur ewp oqJ, g?'tlueseJder

, I

; illilril

Wffiruw ffiHqdw-ffi wmhwr Swffisffiffiryr

A five-number summarv that consists

Xsm.allest

provides a way to determine the shapeships among the "five numbers" allows

3.4: Exploratory DataAnalysis 123

of

Qt Median Qt Xrurg"rt

of a distribution. Table 3.7 explains how the relation-you to reco gnrze the shape of a data set.

i i i[ruiriifl l|,i"! 3.7 ?elationships Among the Five-Number Summary and the Type of Distribution

Tlpe of DistributionifillmN$llll

ilillitltttttun',

,ttlitliililulrmmnmutufftlnlr [l ffir

lil]Iltllh, iillnlululii*rnu,. Irom Xsmallest

Trru"-I;:ll \-efSUS

l; :rOm the

r; * ftom Xsmallest,(ms*is the

i: in O. to4-J

: ,e from Qltor,iinm ", ensus the:: nn the

i t !u

IPLE 3.14

Left-Skewed

The distance from

Right*Skewed

Q3 to xlurn.rt'

Both distancesare the same.



The distance fromXsmalt"st, to the medianis less than thedistance from themedian to Xtarg..t.

The distance fromXsmattert to Q, is lessthan the distance from

Symmetric

For the sample of 10 getting-ready times, the smallest valu e ts 29value is 52 minutes (see page 100). Calculations done in Section 3.139.5, Qt - 35, and Qt: 4L.Therefore, the five-number summary is

Xsmall.st to the median isgreater than the distancefrom the median to1/Xlargest'

The distance fromXsmaltest to Ql ts greaterthan the distance frorn

Q3 toxlurg.rt'

The distance from Ql tothe median is greaterthan the distance fromthe median to Qs.

The distanc e from Q,to the median is lessthan the distance fromthe median to Qz.

minutes and the largestshow that the median -

29 35 39.5 44 52

The distance fromX.-uu"., to the median (39.5 - 29: 10.5) is slightly less than the distancefrom the median to Xl.g"rt 62 - 39.5 : 12.5). The distance from Xr.uur, ,to et e5 - 29 : 6) isslightly less than the disiance from Qrto Xl*n.rt 62 - 44: 8). Therefoii,-ihe getting-ready timesare slightly right-skewed

COMPUTING THE FIVE.NUMBER SUMMARY OF THE THREE.YEAR ANNUALIZEDRETURNS FOR SMALL-CAP GROWTH MUTUAL FUNDS WITH LOW RISK

The 838 mutual funds ([!E[[@ED that are part of the Using Statistics scenario (see page96) are classified according to the category (small cap, mid cap,and large cap), the type(growth or value), and the risk level of the mutual firnds (low, average, and high). Compute thefive-number summary of the three-year annualized returns for the small-cap growth flrnds withlow risk (see page 99).

SOLUTION From previous computations for the three-year annualized returns for the small-cap growth funds with low risk (see pages 100, 102, and 103), the median : 22.4, e1: 20.8,*fi, Qz: 26.0.In addition, the smallest value in the data set is 19.0, and the largest value is29.9.Therefore. the five-number summary is

19.0 20.8 22.4 26.0 29.9

're{slg \ reddn 3uo1 eq} ol enp pe,le4slq8u er€ spurTJo tsd'q aerq} II!

rru-* r-su-e321e € eql JoJu€ql spwg >1su-q8lq pu€ >Isu-./t\ol eql-JoJ req31q f,tfq8tts ere sepgenb

ril r l: ',unleJ u€Ipeu er{I 'spurg l"n1ruu {slJ-qEH pue '1su-e8ere't€ '>1su-'t'ol JoJ umleJ pezl

;ea.{-eerq1 eqi lo rotd re>lslq \-pue-xoq Iocxg Uosorcltr^{ e q V't em8tg NO111IOS

,tiiiili]iiii,l;1,,.:, i'{sIJ-1\Nol JoJ suJn}eJ poz\enurr?

llilillilliiii'i;; tLIe 'e?ete,te llot) spunJ l€nlnru

; I -S SI

ililrIl$ ue^

'spunJ Ienlnru >1su-q8g Pu€ '{sF

tee,l-ee1p er{} Jo }old Je>lsq./v\-pu€-xoq oq} }cnJ}suo,

er{t Jo loêl >lslr er{} o} Surprocre poIJISSeIT ere (ge

.;edera$Tqtffi) spury I€nlntu 898 er{r::s ) olJeuecs scllsll€1s Sutsg orpJo

soNnl lvnrnN )slu'H9lH ONV'-39VU3^V'-MOl lO

,3nI3U C3ZnVnNruv uvr,r.-::luHl3HI JO SIOId U3)SIHM'ONV-XO8 3Hl

,,,11,il'F'!Ilq !\

re{snll\ uol eql ueql re8uol fpq311s

lqarr eqt .os1y .uerpeur egl pu? enI"A lse./rol egl uea qeq ocuslslp eqt wql releerS

enlen lseq8tq eql pus uelperu oql ue€aleq ecu€lslp eql esn€oeq sseurrrels-1q8r'r

,r1nr1p,r1"Ejg "t'ratg

ur selug ,{pzer-Surge8 eqt 3o 1o1d re>1stqm-pu3-xoq eqJ' .6"a'drx- o1 xoq aql Jo epts lqEp eql Su4ceuuoc re>lsrqrrr e fq

u,e;erdar ercegp eqlJo yog1reddn eql',(pegu4g '1se1pusX'<enl€A lsoll€rus eqlJo uollsJol eql

urc. :{1Jo epls uol "w aotl""oooc Qa4slt4rtt'e ''o'l) eu1e.'{q p"ty:t?ltl

?!-Y--lYJ!"!:f1:' :qysenp^ eqlJo %0g elppltu eql sululuoc xoq eql 'tttqJ '€O Jo uollerol eql sluasarder

,qrgo .pr. lqap eql rn *tiinttlrel oI{} pun-'r7Jo uoq€col "qr

t@1]93i-Y^"Jtî:'pF

;qr le ouq Iecrue^ eqyu€Ipeu eql slueserdeJ xoq eql ulqll^{ ulreJp eu{ I"cqJeA eqr

s L ' t ! l1d l^ lvxS

t ' t SunDll

9g09(solnulur) oul l l

0v 00 OZ

uelpeN

{peer }oO ot orl!} oL1} }o]old ro)sl t{M-Pue-xog

r;:r-3u111e3 eql

; eq] uo pos€qroJ lold ro>lslqA\-pu€-xoq

elep oql Jo uoll€]ueserder

'serul]

eql se]€rlsnlll E E ornBIC 'r(reuurns roqlunu

I€cqdErE e sepllord lold ro{slqa '-puu-xoq V

em$d #wp#sgq&&-pffiw*H#ffi ffiq&

'uo4nql4slp Pem,ols-1q8tr e elec

*mr:: suosneduroc eorqr nv'G'e : v'zz - o'gdtfi o1';rztpau eql uo'g ""1tt:Jl#111i:YT::ltl

rLih, - =g'02 -v'zz)u€rperu "rltotrOuro{ocu?1slqe5-r-'(o'€:0'92-6'6d ';;;#--o1-'9-::{

n*lr"rsrp eqr u€qr ssel sr (g'1 : 0'6I - g'oz) IA o1Nelleursx.'ou ecu?trsrp :11- -:^l{:l,HiPfS

nuLrro{I G't:v'Zz- e'Adecu€lslp eqtru€ql sse1 sr (7'g : 0'6I - V'zOu€Ipeu eqlol " x

[&j :tr eJrr€lsrp eql 'sseua\e{s o}€nlele ol pesn ole L'E eIq€I ur pe}slT suosu€dluoc eeJql oql

sernssel I e^4dlrcso0 l€clrelunl\{ Ef,UHI UEIdVHJ VZI

3.4: Exploratory DataAnalysis I25

i i ipE 3.4

:-cel box-' , , : - c lots of the

'" : - :^ ' tual ized" I *û'-r iSk,

-" : . and high-' . . : t - lds

Tfires'Year Annudized Return By Risk

r i ' , ' , l i l l

i , , l i l l l l l l l l l l r

l t l l

illlllllilililrr,

" Average

ffi-

- i?5"3

: i : "" '- s<er plotsiililli, tl'l*':'*":;1 : I C i n g

l l t ' ' t : i " : - -CUf

Figure 3.5 demonstrates the relationship between the box-and-whisker plot and the poly-gon for four different types of distributions. (Note: The area under each polygon is split intoquartiles corresponding to the five-number sunmary for the box-and-whisker plot.)

h-- ffi --jPanel A

Bel l -sha ped distr i but ion

L** **_r**T*L**tf L**L*J E

Panel BLeft-skewed d istri bution

Panel DRectangu la r d istr ibution

h--ffiPanel C

Rig ht-skewed distr i bution

Panels A and D of Figure 3.5 are symmetrical. In these distributions, the mean and medianare equal. In addition, the length of the left whisker is equal to the length of the right whisker,and the median line divides the box in half.

Panel B of Figure 3.5 is left-skewed. The few small values distort the mean toward the left tail.For this left-skewed distribution, the skernness indicates that there is a healy clustering of values atthe high end of the scale (i.e., the right side);75% of aJl values are found between the left edge ofthe box (01) and tne end of the right whisker (Xl.*.J.Therefore, the long left whisker contains thesmallest 25%o of Ihe values, demonstrating the distortion from slzmmetry in this data set.

Panel C of Figure 3.5 is right-skewed. The concentration of values is on the low end of thescale (i.e., the left side of the box-and-whisker plot). Here, 75Yo of all data values are foundbetween the beginning of the left whisker (Xr.urr"rt) and the right edge of the box (Q), andtheremaining 25%o of the values are dispersed along the long right whisker at the upper end of thescale.

eAJ ree.(-ent; e PTffi'CO-:ro 'lunocc€ 1o>lJeur .(euoul erpJo plelf eqt JoJ Suo4

i,; *slp eql ur oJoID eJ? SocuoJeJJIp pu€ solllJslluIls ]€II1v\ 'J.CI3

t;i,, -: \lJ B pue 'CIC ree^(-euO 'lunOCCe 1O>lJ€Iu ,(euOUr

r:t . plelr( oql roJ lold re>lsry1Y\-pue-xoq .3 lcnrlsuoC 'q

of teeG-e lJ € pus 'cc ree.(-euo 'lunocce }o>lJelu:;;lrrl --r a{}Jo plolf eqt rog freluluns Jeqlunu-ellJ eq} }slT 'E

'(SOOZ 602 leqtiloceq

luuurrrli - JleDIu€g ruou polceJlxe) 9002 '02 Jeqruecoc Jo,ilur' r: -rolJ r{}nos w $lu€ q 0V roJ CJ teaK-eAIJ € pu€ '(CC)

;]p Jo ol€cIJIUec reer(-euo e 'lunocc€ 1e>lJelu ̂(euour

,iuiiii- sp1er,( oqt er€ EffiSEUUEE ellJ eI{} vr Etep eqr St'g

LWJ roJ pu? selrol€c roJ suopnq

;;I E{} UI AJEIP EJE SOCUEJEJJIP PU€ SOI}IJEIIUIS }€I{^(\ 'J

-tr'{ pue SolJolec JoJ }old Je>lsn{.&\-pu€-xoq € }cn4suoJ 'q

freururns

urTu-eAIJ eID 1s11 '(WJ pu€ selrol€c) elqelr€A l{c€o roC 'E

'6 'd 'f00e aunl"spodeg rorunsuo3 ,is4cnq'mry,Jrr:ttt|n SJT1tOQ,UlYUnO 70 r(puo3 sn affi3,, U'to"ttpapCI'ltx4 :a)'mos

'edeqs

oql oqlrcsop pu€ ]old re>lslqA\-pu€-xoq e lcnrlsuo3 'q

{reururns Jeqrunu-onlJ erl} }slT 'E

'BI-rt 'dd '500e ,tlnf 'sgodeg rerunsuoJ

,,'x!w aw 4 salnrcai alow :snntttCI),, u,to'tf papo"tlx7 :aJ'mos

\vz 0I I 0u 088 S€ 092

09v 088 }Lr s8 08I 008

:seroruec leyltp lex1d eerql rog (s1oqs ul) aJII f,rel

-teqeql luoserder @ ollJ eql q elep oql Zgt'edeqs

oql eqlJcsep pu€ lold ro>lslq/Y\-pu€-xoq e lcnJlsuoC 'q

^(;eururns Jeqrunu-ollJ oql lslT 'e

'I €-BZ 'dd 'f00e nqwafias 'spodeg rerunsuo3

,,'nuary aqt of aflpag SutppY :pool |so,{,, ruo'{papo'tlxff :actnos

9S OV OE O€ 6Z 6Z 61 9Z OT EZ

0€ 6lvz0z0z9l s v 8 L

:s/!\olloJ se ore epp oql 'sul€qc pool-lseJ luoJJ

Soqcl.,rr\pu€s uo>lcq ) 0Z3o eldules € JoJ 'Eurnres red surerE

ur 'rEJ l€lo] eq] sur€]uoc EIE$EEffi ellJ ewp oq'L Lg't

sldaeuo) aqt 6uylddY

'ssncsl(I '911 eEud uo (P)

V'E Trlelqord tuo4 Wql qllÂ (q) q rolrrrsu€ ;nor( e;edulo3 'r'edeqs

oql oqlrcsop pue lold re>lslq,{\-pue-xoq e }cnr}suoJ 'q

^(reununs Jogrunu-ellJ erll lsIT 'e

6 L 8- 9- L

rg : u Jo eldues u IuoU EWpJo los € sl SurznolloJ aIIJ 0t't

'ssncsl6l '911 e8ed uo (P)

E'E ulolqord IuoU l€I{} WIzn (q) ul ro/Y\su€ rno.'( eredulo3 'J'edeqs

eql eqlrcsep pu€ lold re>lslql\-pu€-xoq e lcnrlsuoJ 'q

^(reuruns Joqrunu-enlJ eql lsIT 'e

E L O 6V LZI

tL: u3o e1d

-rues € ruou eqp Jo les € sl SurmolloJ eql 5Z't

'ssncslq 'S 1 1 e8ed uo (P)

Z'E ulelgord luor; 1€q} q}l^\ (q) q re^\su€ rnor( e;eduro3 'c'edeqs

er{l eqlJcsep pu€ }old re>lslq/!\-pu€-xoq E }cnJ}suo3 'q

.(reururns Jegrunu-ellJ erp lslT 'E

UEL6VL

i9:upe1d

-rues € ruo+ E'WpJo los € sl EurzlrolloJ oql g7'e

5r!se8 alll 6u1u'leal

0gs

0I9

Uzv

lr slo>lcr1 ,(ep-ouou0 ercp sulsluoc

(uleerc peddrqzn) aulqrJ popuelg

oulccndderg olelocorlS $lcnqrsls

(ueerc Peddrqzn) eegoc Popuolqouccndderg orI/Y\oJg elelocol{J s4cnqrs}s

(ureerc Peddrqtr) ee;;oc

pepuolq oulccndderg er{colN s>lrnqre}S

gSE (ureerc pedd1l^ puu il1ur eloqm)

osserdxg €IIcoN ooJJoJ pocl $lonqre]S

0gg (ueerc) epelooC ooJJoJ slnuoq 6uDIunC

0g7 ae#or

popuelq oulscndde;g eogo3 $lrnqrels

ov|' (>tttul eloqzn)oll3l lrl^\s eqcoN pocl slnuocl 6uDlun0

--soIroIBJ lrnpord

:s>lcnqr€}s

slnuocl 6ur>lun0 te s>lulrp eeJJoc pocl

io-9I Jo (suler8 u1) teJ pu€ selrol€c oql luos

r'ar ellJ or{} urewp oqr ?t't

vtep eI{} Jo edeqs

pqlJcsep pu€ ]old Jo>lslq./K-pue-xoq e lcnJlsuoJ 'q

^(reururns Jeqrunu-enlJ orp lsIT 'e

'rd 'Id 'dd

'g00e 'g fS I lUdV1€urnof leels IIeA\. e{1. ,,1o1cng UbryJ aql

ffiuruwrqlay,, 1tot'tuau'ro7 'g puo uos4)Df '3 tuo"$ papollxl :a1mos

0v0vEvz9 09 6ZZVWE9 8s

:sol€}s pollun eql q uFed elueqlroJ (S ul) ecud uolsslrupe Sutlr€ls

ollJ Ewp oql tt't

sernseol4 enqdlrcseq I€cIreIunN E;IUHJ UEIdVHJ 9T,I

i l l r l l l l t l l i

,ll, ilti

i , r:-inch located rn a commercial district of a:: : :r3d an improved process for serving cus-

-_ : -e noon-to-1:00 p.m. lunch per iod. The":; - :ninutes (defined as the time the customer

ilnri " : n hen he or she reaches the teller window),imllllt : - 5 customers during this hour is recordedilrlrir:r"" - : rrfle week. The results are contained in the

iril'lii 'irrh:, and are listed below:

: : 3.02 5.13 4.77 2.34 3.54 3.20

0 3 8 5.12 6.46 6.19 3.79

located Ln a residential are1 is also con-noon-to-1 p.m. lunch hour. The waiting

3.5: The Covariance and the Coefficient of Correlation I27

time, in minutes (defined as the time the customer entersthe line to when he or she reaches the teller window), of asample of 15 customers during this hour is recorded over aperiod of one week. The results are contained in the datafile ffi and are listed below:

9.66 5.90 8.02 5.79 8.73 3.82 8.01 8.35

10.49 6.68 5 .64 4.08 6.n 9 .9T s .47

a. List the five-number summaries of the waiting times atthe two bank branches.

b. Construct box-and-whisker plots and describe the shapeof the distribution of each for the two bank branches.

c. What similarities and differences are there in the distrib-utions of the waiting time at the two bank branches?

3.5 THE COVARIANCE AND THE COEFFICIENT OF CORRELATIONIn Section 2.5, yott used scatter plots to visually examine the relationship between two numeri-cal variables. This section presents lnryo numerical measures that examine the relationshipbetween two numerical variables: the covariance and the coefficient of correlation.

The Covariance

The covariance measures the strength of the linear relationship between two numerical variables(X and Y). Equation (3.16) defines the sample covariance, and Example 3.16 illustrates its use.

THE SAMPLE COVARIANCE

- x)v, -v1cov(X,Y)

/1";"'1(3.16)

; : : / l : , .

Xt:i,:,i+r

t E 3.15

'ar tt{ tfl

' - t , , ' lU

COMPUTING THE SAMPLE COVARIANCE

In Section 2.5 on page 58, you examined the relationship between the cost of a fast-food ham-burger meal and the cost of two movie tickets in 10 cities around the world (extracted from K.Spors, "KeepingUp with . . . Yourself," TheWall StreetJournal,Apilll,2005,p.R4).The datafileEltEflffifficontains the complete data set. Compute the sample covariance.

SOLUTION Table 3.8 provides the cost of a fast-food hamburger meal and the cost of twomovie tickets in l0 cities around the world.

City Hamburger Nlovie Tickets

5.997.62s.7 54.4s4.995.294.393.704.622.99

TokyoLondonNew YorkSydneyChicaeo

tr)

San FranciscoBostonAtlantaTorontoRio de Janeiro

32.6628.4r20.0020.7118.0019.5018.0016.001 8.059.90

g loued

, , I . i .a.{ . l . j I j r r i . i i t } r r r r i r j . ; r j l r i l r r r t r r t l l t t t t t I I r : ! . . , r , , . . , . , i . . . , . . .dLaa*

!*,***-4a66l@*1**Yryq4sa$::44qry44"-fsaTYw4,.ry*"TT*"Y***T*ry*1TT*T*.*a1*-T*^:-***-T

( [+ = dl uo! ]e leJrocenrlrsod lcelJod

c laued

(o=d)uo!]elorroc oN

(L- = d) uot le lo l locanrleOeu lco+rad

selqelJen uoo^^laquorlenosse 1o sedr{1

L't lunDll

'selgeuel o r1 uee qeq uo4elcosseJo seddl luereglp eeJIil sele4snlp 1'g em8tg 'uoIPIeJJoc

luiJrjeoc eql JoJ 1oqur,{s eq} s€ pesn s d regel leerg eq1 'selqeuerr I€clJoIImu olq JoJ €tr€p uop

cld qil^\ Eqleap ueql\ 'ouII fqEle.qs 3 qllv\ pepeuuoc eq ppoc slutod eql 1e '1o1d rog€cs 3 uI

:',i a;arn slurod eqlgr leqtr sueeru pato4 'aolleptroc e,ultsod lceped e rog I+ ol uoq€lerloo eAIlfuu naped € JoJ I- urog e3uer uorleloJJocJo luarcrJeoc oqlJo senls oql'solq€IJ? Iscuarrmu

:eJ.!qeq drqsuoqeler J€elrl ?Jo qfue4s ellleloJ eIil seJns€eu UoPBIaIJoJ Jo lueIJIJJooJ eql

ffiffi$&wff@r*s#* #ffi &ffi#$s$ffie*- *q&

'uo4€IerrocJo luerclJleoc eql elnduroc ol poou nof'dtqsuorleleJ oqlJo q13ue4s'rr eql eurruJelep Jopeq oJ 'dtqsuorleler {e3 \ € ro dlqsuotleler ?uor1s 3 Jo uoll€3lpul

_LLEg'g anl€A oqtr rer{leq^\ ile} louu€c nof 'sprom reqlo uI 'drqsuo4eyer eqt go ql8uels

Ier oql eurruJelep ol olq?tm ere no.,( 'snlerr fue e^€q u?c ecIIBLI€Aoc eql esrucog 'selquu€A

lumu o q uoelqeq dlqsuorpler Jseu{ oq}Jo ems€elu € s€ 1rteg rofeur e s€q ecueueloc eqJ

LLL[8'9 =

I-oI - ( [y)noct66Es'19

tLLLEg'g sI ocIIBIr€Aoc eql l€ql pur; nod'dpcelp (91'g) uoaunbg Sqsn,tqro'973 IIec ruorJiqelmlec roll€ursJo les e o1q (91'g) uo4enbg uY\op $l?erq 9'g em8rglo eer€ suoqslncluJ eqJ

ssorD JoJ ecrr?IJBôc eql selslncl?c leqt leeq$lJo1( Iecxg gosoJcll tr ? sul€luoc 9'[ emtsIc

S[.3 I Sl]*frsr*:rplsrfr$*

& " {filsr.w}*xn s*,{* r,,w: tg} :t s VH #r\S*knw#*$wT*As*

{*n**,* - st#} * t*$*#$ " silS*{e**p* " #:[ff] " t**fr*$ " [,tS*{g**m* - $$s} " *g,[$}* " t-[S*{gr*p* - sf-&} " {*n-$## - s$s}*

fu*gm**##"{pl$n*"sS*frssss$- ffi} "{,*r.$}$" M*ftssp*-t#*tgr.$#$"aYl*ksss**ffi*tgt#3$"ffit*fu,$**s"M"{st$p$"sv}*ftg$#$- f#"{s|-$p#- v}*

#**r$,Fffi#ryrrsf*s

' $"to

ssrru*H

#**sf*stffi*ffi,#

frtfittridt$i$tFt

'$fffFs*S ffi#S itrs"f

i$Sffit.{f iS.#[ ,Str"S

trffisfftr ,.F*"#tr $f.fifiS$S$r# 'ffi 'Sd.$

j.,SSSS'#.$ iffi.ffd ,S#.$

*[flE$t*fiI:ffiffi.t{,ffisf

sA"#**,ruffitrIH1, *,&trftt[ , _*ffi,*m, *,p*flry i**#*m4.***xtr1g

xt,

wffiffi

sol l l3 0L ul s lo) l l ]ornorrj o^^] Jto ]so)

pue leaur te6rnquleqpooj-lsej e +o ]sof,

uoeMloq ofuel lenotreLll lo+ ]oaLls)roM

lotrxf uosolf,llA

9'g SUnDll

v loued

sernseery enrldrrcse( IeorrerunN ES-UHI UEIdVHJ gZI

3.5: The Covariance and the Coefficient of Correlation 129

In Panel A of Figure 3.7, there is a perfect negative linear relationship between X and Y.Thus, the coefficient ofcorrelation, p, equals -1, and whenXincreases, fdecreases in a per-fectly predictable manner. Panel B shows a situation in which there is no relationship betweenX and Y. In this case, the coefficient ofcorrelation, p, equals 0, and as Xincreases, there is notendency for Ito increase or decrease. Panel C illustrates a perfect positive relationship wherep equals +1. In this case, Iincreases in a perfectly predictable manner whenXincreases.

When you have sample data, the sample coefficient of correlation, r, is calculated. Whenusing sample data, you are unlikely to have a sample coefficient of exactly +1, 0, or -1. Figure 3.8presents scatter plots along with their respective sample coefficients of correlation, r, for six datasets. each of which contains 100 values of X and Y.

.300 "t4t "?0$

Panel B (r : -0.6)

+

.} t

t

30s {00 $m

Panel D (r :0.3)

.{0 .2S U * dO 6S _

* t00 120 140 t60 i80A

Panel F(r :0.9)

+

t

+a

l+

+

It

o

0lr

- J, i

&8

r- ,: r, is created from Microsoft Excel and their sample coeff ic ients of correlat ion, r

T,( 4 * !i)

ilr'fi

= '{S

- x,s,

; G,'y)troi

T?.

, (X -t*rT

(j - I.r)(x -l::li:,,:.lr ::::: i , !: I

!#l

t't')lsxs

qt--" t

NOrrvllUUOf JO i*= r lgaoe

sJerlm

,,,.,,,,... ir,,,,,,1,,,,,,, i ,,, l

!t'tdtAtvs IHI

'esn slr sele4r 1 1 'g eydurexg pue '"r 'uopulo-rroJ Jo luoIJIJJaor eldures oqt seurJep (1 1

.g) uoaunbg'uorlesn€c fldwr

saop euol" uorl?leJJoc 1nq 'uorleyerroc serlduu uorlesnec legl ,{es uec no,{ ,erogereq;uEIaJJoc eql pecnpoJd ,(11en1ce suorl€nlrs eeJgl esoql Jo qcIrIA eurruJelap o1 srsfleuerurpp€ urrogred ot peeu plno \ noa 'drqsuolleleJ loeJJe-pu?-asnec e ,(q ro ouorlelerroc

-ro uorl"Insl?c eql uI poJeprsuoc 10u elqsrJe^ pJlql ? Jo iceJJe eql ,tq 'ecueqc ,(q ,{ldrurspord eq uec uorleleJroc Euorls v 'olqerJ€^ Jerllo eql ur e8ueqc eqr pasnDJ elqerJgl euo

JnleA eql ul e8ueqc eql let{} 'sI leql-lceJJo uorlesnsr e sr oJeql leql e,rord }ouu€o euol€rcleJJoJ'esodrnd uo pesn sen Burpron suqT's\calla puy sasnDc se lou pue sanuapualleqrrcsep ,,(1e1ereqr1ep ere,1. sdrqsuorl€lor erll 'g'€ ernSrg Jo uorssncslp eql uI

7 Jo senL aEJsl r{}r^\ pelercoss" eq o} puelxJo senlsl e3re1 eqt pue

"{Jo

sonl"A Ilerus qlr^\ perred'Jr puel,YJo sanle^ IIsIus osn€coq uorleleJJoc Jo sluercrJJeoc e,rrllsod e^Bq l€ql sles €lepbp g q8norql c sleued 7 Jo senle re8rel eqr qrrrvr perred eq orxJo senl?A lleurs eql roJnpuel lqEqs e fpo sr eJoqr pue '€'0- : "r'4eem,{re,rr sr,J puexuoe \leq dqsuor1e1", rl"ur1'J Ieu"d uI 'v leu"d ul leqt se e,n1e3eu s? lou sr g leued ur uorlsleJJoc Jo luercrJJeoc eql

q.[ v leu?d ur leql se 3uo4s se rou sr g Ieued ar f, pt:r- x uee,/rueq drqsuorlelsr Jeeurl eqJitr SOftl?A eErel qlr,l. perred eq otr puelxJo senl€A llslus eql pue .g.0- o1 lenbe uo4elerro":uorcrJJeoc e 0A€rI g Ioued lu:'erep eq7'Tcattad se peqrJcsep eq touu€J

^ prnxuee geq

r?rJosse eql os 'eury lq8rerls e uo IIeJ IIe lou op etr?p eqJ Z Jo senl€A II?us qlr^\ perredol puetrxJo senle^ e8rel eqt 'esum4r1 'eEre1 eq o11 JoJ fcuepuel 8uor1s .(rerr e sr ereql ,aienl€A llerus JoJ leql oes uec no^ '6'0- sl ',t 'uotlelettoc Jo luarcrJJeoc ar{l ,y

1aue4 u1

I= l

sernseel4 errrldrrssec IecrrerunN aEuHI uiIJdvHJ 0E I

3.5: The Covariance and the Coefficient of Correlation 131

. . l l i

i , , i i i ' r l l l i r l

) : l - l l i

" " r l l l l l l l l l l r

Itl l lt l ltr

,r t |{ l l rrrrJ&"- f 3 "17

{r fi,&,v

COMPUTING THE SAMPLE COEFFICIENT OF CORRELATION

Consider the cost of a fast-food hamburger meal and the cost of two movie tickets in 10 citiesaround the world (see Table 3.8 on page 127). From Figure 3.9 and Equation (3.17), computethe sample coefficient of correlation.

SOLUTION

cov(x,Y)at, -

SxSy

6.83 777

llunnbxrg*rlil*sl

$.$$.

(r.2e2sx6.337)- 0.8348

*s-{r i s"$F{s: &s.sy.{a,?s s.*$.44 s,st${

tg-trt , n*rwi s#**s*s" n rl *ry1

ts,$, fi$wri s.ffil{$, s.3*s.s, t,s&rit$, ,tr-s,$ , {$,ffi*-

1*,SSl {I"{?ffii {.ffif3 s-:?sdfrffis33$S,$' $-$ffitr *S{,ffi'F

suw;Js*S$,

S*luul*,clor *XSmr r ,,i[.SfS

f , ffi"tPs#"1 .S

S#nrfsl*rs . g"*gytrf$x i !*S*$

s'3$r*r i $-sS{S

*AtffiS*ffS$n$t$*fiVSffifrSSffirS*tI*S$t$ffiYtfitr*Atil$*Sl* d,$tF*$SRY{f;-t*

'r #tf}*$*ftY{*t* I Ht$**Sffiffi#|"{&tue1i, ffi${t}

F,S?.s.F$l4,4S,4-S$.S;*Si4.SS's,f i

4"$?,3.$S;

Kt*ffi$,'W*S-*.***#s,ss3ds,ts3fir#ws"3tr$.s

The cost of a fast-food hamburger meal and the cost of two movie tickets are positivelycorrelated. Those cities with the lowest cost of a fast-food hamburger meal tend to be associ-ated with the lowest cost of two movie tickets. Those cities with the highest cost of a fast-foodhamburger meal tend to be associated with the highest cost of two movie tickets. This relation-ship is fairly strong, as indicated by a coefficient of correlation, r: 0.8348.

You cannot assume that having a low cost of a fast-food hamburger meal caused thelow cost of two movie tickets. You can only say that this is what tended to happen in thesample.

In summary the coefficient of correlation indicates the linear relationship, or association,between two numerical variables. When the coefficient of correlation gets closer to *1 or -1,the linear relationship between the two variables is stronger. When the coefficient of correla-tion is near 0, little or no linear relationship exists. The sign of the coefficient of correlationindicates whether the data are positively correlated (i.e., the larger values of X ne typicallypaired with the larger values of I) or negatively correlated (i.e., the larger values ofXare typi-cally paired with the smaller values of I). The existence of a strong correlation does not implya causation effect. It only indicates the tendencies present in the data.

M,, i , , f i I i l r , i l r r ) r { i l i l i l . . i i l l l l l

rilri|;;:rr-TT spJ€pII?]S IUOUruIOAOS ]UOJJnC pU? JOU^/KO Uee goq

i|Lli; ,f i: :leleJ er{} }no qE q.cBeJ no.( ue} suolsnlcuoc }€r{1V\ 'p'ur?ldxf, euollslerroc

-,: r[J[30c oq] Jo e3u€IJeôc eq]-aEeeFul spJepue]s

iiri -,. , -J-\OE lUeJJnC pUU J3UA\O UeO1KlOq drqsuoll€leJ

iilr " ::sserdxe q elq€nl€A ororu sl {uF{} nof op qcll{71\'uorlslerJoc Jo luorcrJJooc eq] olndulo3

'ecuerJeôc oq] elndulo3

'gt 'd'900e 'y y ["mnun; "lepoJ

VSn ,,'pa'tallYaqrir ',,, :,xtDlnt)p3 [u,touoJg lanl,, 'tapa11 'p u,to,{paputlxfl :a),mos

:s>lcnqrelspu? slnuocl 6ur>lunc le s>lulrp eeJJoc poclecuno-g I Jo 'surerS ul ']€J pu€ serJol€c oql lues-erder ollJ eql ur etep eql O?'t

' (e) 8 € ' €urelqord Jo esoq] ol (e) Jo sllnser eql eredulo3 'q

LsluelulsoAuI

Jo sedr$ reqlo e^lJ eseql Jo qcee pue spuog 'S'n Joluerulselul uo uJnloJ er{} uee.&ueq drqsuo}}eleJ eq} Joql8uer1s ot{} lnoqe o>leru nor( vel suolsnlcuoc }el{7y1 'E

'0I'0 se..l\ lqep sle>lreru SurErerue pu€ spuoq 'S'n pue

'02'0- sBA s>lcols sle>lr€ru EurErerue pu€ spuoq 'S'n '8?'0,s3.&\ spuoq l€uol]surolul pu€ spuoq'S'n'8I'0- s€/!\ s>lcols

dec II€us Ieuolleurelul pu€ spuoq 'S'O 'tl'g- s€.,K s>lcols

dec e8rel Ieuorl€uralul pue spuoq 'S'n Jo luetulsonuluo uJnleJ er{} uee,&ueq uor}€leJJoc Jo }uolclileoc eq} wqlpelels spuoq u8rerog ur ]uerulselur pqssncslp wqt ( t C'd 6

nyz 'gZ JegruenoN 'pu,tnoy pa"tts na/ll aqJ ,.'spunguEre;og ur or loJuod {co}S r leql Jo %08 o} dn }ndplnoqs srolsenul {q4,, 'slueulolJ 'f) elcl}r€ uV 6g'g

' (e) $turolqord Jo esoql ol (e) Jo sllnser er{} ereduro3 'q

eslueIulseûl

Jo sedfi rer{}o e^lJ esoqt Jo r{c€e pue s>lco}s 'S 'n Joluorulsonul uo uJnleJ eq] uao.ttleq dlqsuoll€leJ er{} Joql8uer1s eql lnoqe e>leru nof uec suolsnlouoc 13q16 'E

'89'0 seln lqep sle>lreru Eur8rerue pu€ s>lcols 'S'n pue

'IL'0 s€1K s>lcols sle>lreru Sur8reruo pu€ s>lcols 'S'n '80'0s€1y\ spuoq I€uolleuJolul pu€ s>lJols 'S'n 'Eg' 0 s€zlr s>lcols

dec lleurs Iauortaurelul pu€ s>lcols 'S'n '08'0 sen{ s>lco}s

dec e8rel Ieuolleurelul pue s>lcols 'S'n Jo luetulsonuluo uJnloJ eI{} uee./Kleq uol}sleJJoc Jo }uelclJJeoc eq} Wqlpel€ls s>lcols u8rerog ur luetulseûl pessncslp wql (tC'd 6

n1Z 'gZ Jeqruolo5l 'lnutnoy paus Ua/U aqJ ..'spunguSrerog ur orloJuod {co}S rlaql Jo %08 o} dn }ndplnoqs srolsalul f,{16,, 'slueulel3 'f) elclue uV 8t't

sldaeuo3 aql 6u;{ddy

'ureldxE

iJ pue X uee,,vrleq drqsuolleler eql sr Suorls zlroH 'J'uonelerroc Jo luercrJJeoc eql olnduro3 'q

'ocuerJeloc eql olndulo3 'B

V9 SV LZ U 9E OE 8I 6 VZ 9I IZ T

8I 9I 6 V ZI OI 9 E 8 9 L X

:sluell I I- u go eldures e ruo4 elep Jo les e sl Eurzlr.olloJ erll Lg'e

sflseg alll 6ulu.lee-l

.J

'q.E

" , 99 ELE:Eg; 8Z L'IT,J"8 r 8 '91'r"Lv 8'8t;"F[ 6'LZ:"97, 8'LZ!"1, r 0'9 I$ '9r E Vr

snlJd e1o.(o; S00Zelloro3 e1o,(o1 E00Z,furue3 e1o.(o; g00Zreroldxg proC 7,002

plrq,(H cl^lc spuoH v00zclÎ3 upuoH 7,002

xT proscY BpuoH 7,002opsro^lls ]elorôqc 9002

091-c prod 9002

aao9 Jouaro reJ

:spJspu€ls luaruureno8 lueJJnc fq pue

ul,o,(qpe}3In3I€cse,e3eOIFIeq}Se}€cIpu(E@Huegl uI peuleluoc) elq€l SutznolloJ orII '.(ulouoce

frurlelnclec roJ spoqleur I€renos ete erer{J ,V'e

LIEJ pu? selJolec uee^\}eq

rtBIeJ oql lno qe I4cEe; nor( u€c suolsnlcuoc ]?r{a 'p'ureldxg euollelorroc Jo luelclJJeoc eql

ueôc eql-lal prJe solJolec uoe./Kleq drqsuolleleJ

imsserdxe q elqenl€A eroru sl {u}q1 nor( op qcll{A'uorlelerroc Jo luercrJJeoc eql olndulo3

'ocu€rJeloc eql olnduro3

'6 'd 'h00e aunl"sl;ode6 Jelunsuo3 ,,'s4cnq"rcryw:a slnuoT,ur4unQ 7o r(puo3 so aa.[o3,, uto",ttpapo"\xf, :ac'mos

'c'q.E

'o;7:

0vT,

(ruee"rc peddrqzn) eul?rJ pepuelg

ouccndderg ol€locotlJ slcnqre]S

(uleerc peddlq,ur) eegoc popuolq

ouccndderg oIIII\org el€looot{J $lcnqru}S

(ureerc peddrqm) ee;;oc

pepuolq ouccndderg ur{co6 $lcnqre}S

OSE (ureerc peddrqzn pue {llur eloqzn)osserdxg eqcolN eoJJoJ pocl $lcnqr€1s

9:! (ureerc) epeloof eoJJoJ sln"oO:lJ>IunCI

092 eeJJorpepuolg oulocndderg eeJJoJ $lcnqrets

(ttttur eloqllr)

ell€l lrl,/Ks €qcolN pecl slnuocl 6uDlunc

r{ salroluJ lcnpord

sernseel4l en4drrcsoq l€crrerunN IIEUHJ UEIdVHf, T,EI

imi3.6: Pitfalls in Numerical Descriptive Measures and Ethical Issues 1 3 3

,*,',ilil-l [*tn,[lege basketball is big business, with coaches'rllrrLr$tti,. rrqvenues, and expenses in millions of dollars. The

'riiiiillu* contains the coaches'ir,,rllnrrrtti$i nnC revenue for college basketball at selected;,,,rrr,irlrilllrlillrltilliliii fim & recent year (extracted from R. Adams, "Pay.ttr,rrrllllllluumuirrruffi."' The Wall Street Journal, March I 1-12, 20A6,

iilffi1r irrrrrr , ili'$

,rrr,rn|lnmmrume the cov arLance .the coefficient of correlation.

rnmclusions can you reach about the relationshiplllMunen a coach's salary and revenue?

3.6 PITFALLS IN NUMERICAL DESCRIPTIVE MEASURESAND ETHICAL ISSUESIn this chapter, you have studied how a set ofnumerical data can be characterizedby variousstatistics that measure the properties ofcentral tendency, variation, and shape. Your next step isqnalysis and interpretation ofthe calculated statistics. Your analysis is objective; your interpre-tation is subjective.You must avoid errors that may arise either in the objectivity of your analy-sis or in the subjectivity of your interpretation.

The analysis of the mutual funds is objective and reveals several impartial findings.Objectivity in data analysis means reporting the most appropriate numerical descriptive mea-sures for a given data set. Now that you have read the chapter and have become familiar withvarious numerical descriptive measures and their strengths and weaknesses, how should youproceed with the objective analysis? Because the data distribute in a slightly asymmetricalmanner, shouldn't you report the median in addition to the mean? Doesn't the standard devia-tion provide more information about the property of variation than the range? Should youdescribe the data set as right-skewed?

On the othe* hand, data interpretation is subjective. Different people form different conclu-sions when interpreting the analytical findings. Everyone sees the world from different per-spectives. Thus, because data interpretation is subjective, you must do it in a fair, neutral, andclear manner.

Ethical lssuesEthical issues are vitally important to all data analysis. As a daily consumer of information, youneed to question what you read in newspapers and magazines, what you hear on the radio ortelevision, and what you see while surfing the Internet. Over time, much skepticism has beenexpressed about the purpose, the focus, and the objectivity ofpublished studies. Perhaps nocomment on this topic is more telling than a quip often attributed to the famous,nineteenth-century British statesman Benjamin Disraeli: "There are three kinds of lies: lies, damned lies,and statistics."

Ethical considerations arise when you are deciding what results to include in a report.You should document both good and bad results. In addition, when making oral presenta-tions and presenting written reports, you need to give results in a fair, objective, and neutralmanner. Unethical behavior occurs when you willfully choose an inappropriate summarymeasure (for example, the mean for a very skewed set of data) to distort the facts in order tosupport a particular position. In addition, unethical behavior occurs when you selectivelyfail to report pertinent findings because it would be detrimental to the support of a particularposition.

3.43 College football players trying out for the NFL aregiven the Wonderlic standardtzedintelligence test. The datain the file@ contains the average Wonderlic scoreof football players tryitrg out for the NFL and the gradua-tion rate for football players at selected schools (extractedfrom S. Walker, "The NFI-s Smartest Team," The WallStreet Journal, September 30,2005, pp. Wl, W10).a. Compute the covanance.b. Compute the coefficient of correlation.c. What conclusions can you reach about the relationship

between the averuge Wonderlic score and graduation rate?

:lllll:: ! l l ! :

(s'g)

(v'E)

(t'g)

(z'E)

u,r(uxx" 'x zxxrx)_ "X

l l i t :=

"S/\=S:___lilflllr i

H;frirl

venle^pe>luer@

venle^ pe>lueJ

r+u

UBoIAI clrleruoec

=t0

tO orqlrun| prlrtrI

=w

'O'rqlrun| lsrld

uBIpoHI

uuetrAl aldruug

sornseon enrlduf,soc|etruoLlrnN jo fueuuLuns

uopu1,roq prupuulS eldulug

l -uCl

ZJ

ecuslrul eldulug

t6 - t0 - e8uer eppunbrelul

aEuug elprunbrelul

rsellerusx _ tse8relx _ eEueg

eBuug

,51( 'v + I ) x " 'x (zY + I ) x ( Iu+ I) l =

urnlou Jo alBu u?atrAl rlrlaluoeo

uolloo5) uo|1€lorroc Jo luolclJJeoc'scuBIl€AoJ

ZonlsA pe>lu€r = ueIpeIAI

r+u

l=!

, lx- 'x lK

[ - ev (t 'g)u

17

I . r Ar17'xs

u

(V' E-f' g suolrras) lold re>lsry/!\-pu3-xoq'soJocs y'uo\ternalJo luelclJJeoc'ocu€u€A

'uopelnep pJepu€1s'eEu€J elluenbrelur'eEuerrreeru crJloruoeE'seltgunb'epoul'uelpour'uue141

selqslJuA IsclJerunu o/vuuoo^\loq drqsuorlelor eqt Eurqlrcsoq

slqeue^

I€crJorunu e go edeqs pue 'uo4euen'r(cuepuol l€Jlueo Surqucseql

uluo IsclrolunN

'scrlsrl€]s

reJul Jo lcofgns eq] pue scllsq€1s entldtrcsop yo lcefaql ueolueq de8 eqt eEpuq ol repro ul polueserd ere

ilirrqieqord Jo seldrcuud crseg eql 'rc1der4c lxeu eql uI'reldeqc sq] ur pere

seJnseeru elrldrrcsep leclJerunu er{} Jo }sI e septno;dg elq€I 'uoll€lerroc Jo luelclJJeoc puB 'uollslêp prsp

slsriluuyJo edn; 5'E 318Vr

enrlducsop I€crrerunu Eursn'edeqs pu€ Ktlgqerlel'r(cuep-ue] I€Jluec se qcns 'ecu€urro;red lsed ;o sc|lslJolceJer{cperoldxe no1 'spogleur lecrqdet? rorllo pue 'surerSolsFl'sgeqc eld Jo esn eqt gEnorql uolleulroJul InJosn luese-rdo1 elqs ere/v\ no.( 'elap punJ I€nlnw oql gllzlr 8ur1€ep uer{^&'pelerdJolur pue'paztlleue'pequcsep'pezttetuluns ue{}pu€ 'speqc pue selq€l ut pelueserd oJE elep ,!\oq-scllsll€lsenrlducsop pelpnls nor( 'reldeqc snorne;d eql puu sn{} uI'e8uer 'selrpenb 'uetpeul'u€eur eql se qcns 'seJns€eul

serns€e14 enrldrrcsoq l€orrorunN I1IUHJ UEJdVHJ VtI

Chapter Review Problems 135 /"./*'

rllllllllllllililill

lrillllii: ," r ''

llt irillllllllllll',,

rumrrrliiiiilmrulfi: ,iilililllllillllillu'

ilffill|||lililililruiii

I 'lrnrtrmt of Variation

CV=

X_XZ=

\Iean

\ ariance

o2

l-r8,3X1 9l

ijr ,sKer plot I24:c\- 96

-*,e 120:: correlation I28

-: .,, ariation 1 10: ' -" t - l t ' i

*rlri3 120iililitilllll*tr I I 1

$;;rrunary I23'T',[it;*n 103

Tnrs;j:x rate of return 103lris :inqe 106

Sample Coefficient of Correlation

cov(X,Y)

sxsv(3.17)

sample coefficient of correlation 130sample covariance I27sample mean 97sample standard deviation 107sample variance I0lshape 96skewed II2spread 105standard deviation 106sum of squares (SS) I07symmetrical I12variance 106variation 96Zscore 111

[{)t oo%\x)

,A/

2*,U=

i=l' l /

l/

Lrri - v)2i= l

t/

(3.11)

(3.12)

(3.13)

(3.14)

Population Standard Deviation

o_

Sample Covariance n

I r r , - x)9,-71i= l

n-I

(3.15)

itilililll11|lliillli

cov(X,Y) = (3.16)

- ,1 J

mean 97median 99midspread 106mode 100outlier 1 1 1population mean 1 18population standard deviationpopulation variance 1 19

Q{ first quartile 101

Qz: second quartile 101

Q{ third quartile 101quartiles 101range 105resistant measure 106right-skewed 1 13

t19

3.47 How do you interpret the first quartile, medraf\ andthird quartile?

W 3.48 what is meant by the property of variation?

I Assrsr I

3.49 What does the Z score measure?

3.50 What are the differences among the vari-

ous measures of variation, such as the range,

interquartile range, variance, standard deviation,

Vour Understanding"l$I4ri{llhrun w: :he properties of a set of numerical data?

!.'l[5 \\hat is meant by the property of cenftal',lii*slh*[ffi[" J']-l

3,*416 \\laat are the differences among the mean,

:m#;frtji-i:- and mode, and what are the advantages

gss of each?

iffilil,lllrllll111], I

[1:rt11

ffitr

Hlll, ,ii

60r'8 86t'8 0ZV'8 LZV'\ IIt '8 6EV'8 S07'8 LVV'\ 96€'8

,;,r f l rZE'8 90t'8 0It '8 0ZV'8 ZIV'8 097'8 6ZV'8 VVV'\ 09t'8

rl ' ,f, i l gs?'8 6zv'B 6Lv'8 glt '8 I8t'8 vlv'8 68t'8 Elv'B 9Ev'8

:r1111r {i g6t'8 99V'8 S8€'8 6It'8 VIV'8 €0t'8 V8V'8 Z8E'8 9LV'\

l i r r , [8?'8 ELt '8 I S€'8 0It '8 8t€'8 E8€'8 LlE S EVE S ZIE S

:6V : u Jo eldures 3 roJ 'seqcul q 'sqEnorl eqlJo

uu, a$ sursluotEsEEollJ e'Iep oql'seqrul I9'8 pu€

I E'g uoe.&$eq oq q8no,r1 oquo qlplâ orp leql sorlnber

oJ or{J 'suolleclldde roop}no uI Eur;ooJdrerl}eoln

Ieclluc sI reqlo oql ol luroJ orll Jo epls euo luoJJ

Tp eql 'qEnorl eql a{ulu 01 leo}s }3II eq} uI sruroJ

oA$ Surnnd 'uoqeledo u/v\op-edtrvr € l{}Izlr sseJd

eArssorEord uol-gg 7 e Eursn pecnpord sI ]I 'Uoc Ieols

I € Jo lno epelu sI l€ql qEno4 Ieols € sl Eursnoq

ued lueuoduoc uleur eql 'luerudlnbe lecl4cole roJ

fut*q Ieels secnpord z(ueduroc EuunlceJnu€tu Y 89't

'ursldxg er(€s nof plno l'penlosor luleldruoc e onerl ol llelv\ o] loedxo plnoqs

e Euol 1soq.(ueduloc oqlJo tuoplsord eq] 11o] o]

no{ J} '(c) q8nolqt (e) .+o s}lnser erl} Jo slseq eq} uO 'p

s elep erll erv 'lold re>lslrll!\-pus-x"O rt;?tlrtT"5 'r'uollsl re|Jo luelclJJooc pu€ 'uo4e1ôp prep

'ecu€uerr 'eEuer eqrenbrelut'efiueJ eIP elndulo3 'q'ell}renb

pu€ 'eplrenb IsJIJ 'uerpetu 'u€eul oql olndruoS 'B

'uolleT reLJo luelclJJeoc pu€ 'uoqelôp pJsp-uels 'ecu€ue.,t 'efivel eppenbJelu 'eEuer eql elndulo3 'q

'eppenb

pJIq] pue 'elpenb IsJIJ 'uerpaur 'ueolu eql elnduloJ 'v

LT 91 69 IS 09 T9 Z6 16 LI LI LI 8V SV

8I ZT, 99 Ig 9S 09 06 I€ 8Z 8Z V9 91 6I EL

'@ollJ otil q poul€luoc ele elep oql lpeprooer ere,r 'sf,ep uI

sorurl Eutssecord Ielol SutznolloJ erll pu€ pelcelos sezl\ selc

-1od penordde tZgo eldrues ruopueru 'q]uoru euoJo polred

e Euunq .4ueq erll ot ecllros srql Jo ,Qryqelrgord eql ol I€c-IlFc sI Jouuuul .(1eulg e uI sJetuolsnc ol selc11od penordde

ro^rlep o1 .Qrpqu oql .ftenr1ep roJ {ueq eq} o} }uos pue

pel€reue8 ere seEed fcrlod eql qclqn Eulrnp eEels uorlepd

-ruoc ,(cr1od e prm'sruexe Ieclperu pue uoll€ruJoJul Isclporu

I€uolllppe roJ slsenber elqlssod 51ceqc n€ernq uoll€ruroJul

I€crporu e 'uo11ecr1dde eIIl Jo /KeIAer 3 sopnlcq qclq/K 'Eul

-]rr^\repun Jo slslsuoc ssecord lenordde eql '(tfgS) ecue

-rnsur oJII >lueq sEutzres pe11€o ocu€rnsul oJII Jo ruroJ € IIesol pollrur.red oJe $lueq s8utrres 'el?ls >lJoI ,'!\3N uI 99't

000'09 6EL' 19 | L9'91

000'01 }tl 'I L lzv'lz

000'gZI 0AZ'6 ' 668 reeurEug

,, , . . i . ' . : ,$tenD. ' : : .i , l :

000,ZVT i'.' .000i1,1: ..:,, Egg,1 re3euq41

89

OI

v6I

o(, 9z

vt I

9E ZZ OE 7,9 V LZ 9

97, 9Z 6Z 8Z 67, T,E S9I

V6 9E 19 6Z OII OII 9ZI

ET,I Z Z9I LZ I € LEI SE

uslpotrN uuotrN uollulê( unuTxBIAi tunlulultrAl ozls allllprupuuls elduug

'sJoeulSue ,95unb pu€ sreEeueul Jo seuulss

eqt eredluoJ 'QV-VZ 'dd 'V007, 'requrece6l 'ssa't&ot4

rQ1pnfi .,'s1€uolsseJord ,Qr1en| leuo|llperl roJ r€e1

pooD,, 'uospl€uoq-sdtn1qd elqqeq ruo4 pelce4xE) aoloq

uenr8 ete sollll o/K] eseq] JoJ selJel€s Sururecuoc sclls1l

-els enrlducsoq :eeut8ue ,tlenb pue ;e?evetu eJo \ sol]1l

qol uoruruoc lsoru ollrl er{J 'pellecoJ eJoA\ sesuodseJ pII€A

ZV7'S pue 'srequreru 669'88 ol lues eJo/y\ sll€rue 'r(enrns'Sn erll roC ,fi11enb ur lserelq ueJo erueq] uoruruoc e qllâ

'suorlnlrlsul pel€loJ-oollJes pue Euun]ce;nu€ru Jo See rE lVul >lro^\ sreqlueu OSV 'sroqlueru sll II€ Jo ,(enrns Kmles e

polcnpuoc (bSV) ,911en6 rog ,Qelcos ueclroruv oql 99't

sldaeuo3 eqt 6u!$ddy

ereJJIp uo4el

-erroc Jo luorclJJeos erp pu? o}lrerreôc oql op A\oH ?g'g

6edeqs go ,Qredord eql {q }ueoru sI }et{I& tg't

EJEJIPelnr neqcfqeqJ eqt pue elnr lecutdure eql op ,/KoH Zg'e

eelnqlrlslp pu€ rolsnlc e'wp

IeorrerunuJo les s q sonlen or{} qclq1tr q sz(ezn eq}ureldxe dleq olnr lecmdure eql soop /KoH Lg't

Lqreeyo seE€lueApBSIP PueseEeluerye eql oJe l€rlla pu€ 'uo4ettelJo ]uelclJJooc pu?

serns€ery e,rlldrrcseq lecrrerunN EitUHI UIIIdVHJ 9El

ET

€I

ZI

I I

9V9

9T I€

VL I8

:lurulduroc oql Jo uollnloseJ

pue lqelduoc s;o ldrecor orll uoeAqoq sr(€p Jo roqrunu

1uaserdtt @ ellJ er{} ur €}ep eql 'reer( }uecortTflp polcolos se/y\ uopell€1sut ledmc Suturecuoo slureld

0g Jo eldures V 's/KeJc uon€ll€1sut g I pIrc 'JeJns€eul

rnredns uollsllelsul u€ o1 slt.oJc uoll€llelsul Z wory

xe peq luolupedep Suuoog eql 'relnctlted u1 'srue,(

trsed eql u1 uolsu€dxe roleur e euoEropun pel{ 'ledtec

lcq 'Euuoog pu€ ornlluJng Eulllos eJols luoruuedep-,(1gueg eErel V 'slulelduroc reluolsnc ol spuodser

Ig/y\ qll^\ peeds eql sI uollezrueflro ^(ue {q peptnord

Jo rq1enb eqlJo serns€oru rolew erllJo ouo Lg't

eso>l€1 ssecord lerrordde eqt 3uo1s>ls€ pue ,(cr1od ecueJnsul Jo ed,Q sF{} eseqcrnd

quuq eI{} SJo}ue oq,&\ JeIuo}Snc € ilo} no.( plno,/\t, 1€tl6 'p

eA\oq 'os JI>ls €lep eql ery 'lold re>lslq/!\-pu3-xoq 3 lcrulsuoJ 'J

lllll$rllfltt,ttl

mm'

iriillililllillllllll; ilmr ::le;&lnr riledian, range, and standard devia-lltillil' jffiffi *:,',ith. Interpret these measures of central

uiurffiilllilIlul)$r rumffi -"'aiability.

rlnrltfl illllllllm ilitiiinruits -"1 -lmb er sufirmary.

ililffililillllis ,& ':n: x-end-whisker plot and describe its shape.,"iliillllffiilttttutirumr ,,/:r* conclude about the number of troughs,rffilttrrrffiiiluilMi ulnreff uhe company's requirement of troughsrtilffi;tllll||tmmm rilmn E"3 I and 8.61 inches wide?

irrilMrnr;fecruring company in Problem 3.58 alsou umsulators. If the insulators break when in

-rnm,w;lt ils likely to occur. To test the strength of

ciestmctive testing is carried out to deter-rrmnuush tonce is required to break the insulators.

by observing how many pounds must beilfu mmsmtrator before it breaks. The data from 30

this experiment are contained in the file

trfffii56 il .6 10 1,634 I ,7 84 | ,522 I ,696 I ,592 | ,662

mLru4 N.662 1,734 1,774 1,550 r,756 r,762 1,966

il[/ff i i fr [,688 1,910 1,752 1,690 1,910 1,652 1,736

ffiB mrean, median, range, and standard devia-fforce variable.

ffis measures of central tendency and variabil-

,nihox-and-whisker plot and describe its shape.

Sml conclude about the strength of the insula-

'unqpany requires a force measurement of atporrnds before break age?

with a telephone line that prevent a cus-meiving or making calls are disconcerting to

and the telephone company. The datarffimr 'ffi€ file EEEIE represent samples of 20

to two different offices of a telephoneffiE time to clear these probleffiS, in minutes,

' l ines:

ffice I Time to Clear Problems (minutes)

, f f i ' r-78 2.85 0.52 1.60 4.15 3.97 1.48 3.10

rffit93 1.60 0.80 1.05 6.32 3.93 5.45 0.97

ffice II Time to Clear Problems (minutes)

t f f i i [0 1.10 0.60 0.52 3.30 2.10 0.58 4.A2

m.97 0.60 1.s3 4.23 0.08 1.48 r.65 0.72

two central office locations:first quartile, and thirdffis mean, median,

fu range, interquartile range, variance, stan-

Chapter Review Problems I37

c. Construct side-by-side box-and-whisker plots. Are thedata skewed? If so, how?

d. on the basis of the results of (a) through (c), are thereany differences between the two central offices?Explain.

3.61 In many manufacturing processes, the term work-in-process (often abbreviated wIP) is used. In a bookmanufacturing plant, the WIP represents the time it takesfor sheets from a press to be folded, gathered sewn, tippedon end sheets, and bound. The data contained in the file

U[ilG represent samples of 20 books at each of two pro-duction plants and the processing time (operationallydefined as the time, in days, from when the books cameoff the press to when they were packed in cartons) forthese jobs:

Plant A

5.62 s.2g 16.25 r0.g2 1r.46 2r.62 8.45 8.58 s.4r rr.42

rr.62 7.2g 7.50 7.96 4.42 10.50 7.58 g.2g 7.s4 8.g2

Plant B

9.54 tr.46 t6.62 12.62 25.75 rs.4r 14.29 13.13 13.71 10.04

5.75 12.46 9.r7 t3.21 6.00 2.33 14.25 s.37 6.25 9.7r

For each of the two plants:a. Compute the mean, median, first quartile, and third

quartile.b. Compute the range, interquartile range, varrance, stan-

dard deviation, and coefficient of variation.c. Construct side-by-side box-and-whisker plots. Are the

data skewed? If so, how?d. on the basis of the results of (a) through (c), are there

arly differences between the two plants? Explain.

3.52 The data contained in the fil.@consistof the in-state tuition and fees and the out-of-state tuitionand fees for four-year colleges with the highest percent ageof students graduating within six years.

Source: US. Department of Education, 2006.

For each variable:a. Compute the mean, median, first quartile, and third

quartile.b. Compute the tange, interquartile tange, variance, stan-

dard deviation, and coefficient of variation.c. Construct a box-and-whisbr plot. Are the data skewed?

If so, how?d. Compute the coefficient of correlation between the in-

state tuition and fees and the out-of-state tuition andfees.

e. What conclusions canyou reach concerning the in-statetuition and fees and the out-of-state tuition and fees?iilmion- and coefficient of variation.

l; i l i l l l l l l l l l lt it l

ffimrdffir

, -"u mneasurements made on the company'sil utillilrffitis)lrr;S 3.ffid 140 measurements made on vermont

t:rrltttutrllrl,

illnirllililililtnu *r*, :-:lurnber summary for the Boston shinglesrrruulllliiltlttmrr"rrtffitr -, cnrront shingles.r,irrilltillllMffilllliurrirr,r s,1e-by-side box-and-whisker plots for the

rumnntnlrrllittilrruumlil* :'i shingles and describe the shapes of the,,mnniiiilitttrurutn,fl ,ti )nius

r',,,,,,,,, i,,iuuuillililffimflm[fi, :,r. uhre shingles' ability to achieve a granulei{ffiuiurililt''irrtilr li E:J.rTI or less.

,ffiur,d" im the file lssffi represent the results ofCcrnmunity Survey, a sampling of 700,000

mrums,n m each state during the 2000 U.S. Census.

'rmtili'ffi,e u-ariables average travel-to-work time in

mmnrrulcr3mge of homes with eight or more rooms,

oild income, and percentage of mortgage'

mers whose housing costs exceed 3 |oh of

rnhE rnean, median, first quartile, and third

ffie range, interquartile range, variance, stan-

and coefficient of variation.

# **-and-whisker plot. Are the data skewed?

snons can you reach concerning the mean

r time in minutes, percentage of homes

$'r' more rooms. medran household income,ge of mortgage-paying homeowners whose

luroilsms exceed 30% of income?

cs of baseball has caused a great deal ofw,ith owners arguing that they are losing

aryuingthatowners are making money, andrnrn,i'rno about how expensive it is to attend a

Eames on cable television. In addition toffim $eam statistics for the 2001 season" the file

ins team-by-team statistics on ticket prices;

'rmflrnd€K' regular season gate receipts; local televi-

;md cahle receipts; all other operating revenue;lon and benefits; national and other local

rncorne from baseball operations. For each

ffire mean, median, first quartile, and third

l' -s tra.nge, interquartile range, variance, stan-urm- and coefficient of variation.

;nr hor-and-whisker plot. Are the data skewed?,10)

tffire correlation between the number of wins

clrrmpensation and benefits. How strong is the

rmm hem€en these two vartables?rons can you reach concerning the regular

r@ receipts; local television, radio, and cable

Chapter Review Problems I39

tion and benefits; national and other local expenses; andincome from baseball operations?

3.69 In Section 3.5 on page 131, the correlation coeffi-cient between the cost of a fast-food hamburger meal andthe cost of movie tickets in 10 different cities was com-puted. The datafile@also includes the overallcost index, the monthly rent for a two bedroom apartment,and the costs of a cup of coffee with service, dry cleaningfor a men's blazer, and toothpaste.a. Compute the correlation coefficient between the overall

cost index and the monthly rent for a two-bedroomapartment, the cost of a cup of coffee with service, the

cost of a fast food hamburger meal, the cost of drycleanin g a men's blazer, the cost of toothpaste, and the

cost of movie tickets. (There will be six separate corre-lation coefficients.)

b. What conclusions carr you reach about the relationshipof the overall cost index to each of these six vniables?

3.7O The data in the file EBIIfr contains the character-istics for a sample of 20 chicken sandwiches from fast-foodchains.a. Compute the correlation coefficient between calories

and carbohydrates.b. Compute the correlation coefficient between calories

and sodium.c. Compute the correlation coefficient between calories

and total fat.d. Which variable (total fat, carbohydrates, or sodium)

seems to be most closely related to calories? Explain.

3.71Thedatainthef [email protected] (in $millions) of CEOs of the 100 largest compa-nies, by revenue (extracted from "Special Report:Executive Compensation," USA Tbday, April 10, 2006, pp.

38,4B).a. Compute the mean, median, first quartile, and third

quartile.b. Compute the range, interquartile tange, variance, stan-

dard deviation, and coefficient of variation.c. Construct a box-and-whisker plot. Are the data skewed?

If so, how?d. What conclusions can you draw concerning the total

c9-mpensation (in $millions) of CEOs?

3.72 The data in the file @ is the per capttaspending, in thousands of dollars, for each state in2004.a. Compute the mean, median, first quartile, and third

quartile.b. Compute the tange, interquartile tange, variance, stan-

dard deviation, and coefficient of variation.c. Construct side-by-side box-and-whisker plots. Are the

data skewed? If so, how?d. What conclusions can you reach concerning per caprta

spending, in thousands of dollars, for each state in2004?orffiili omher operating revenue; player compensa-

'lottdsoH ounavJ ru puo 'lotldsoH orcnbas .taqua3 p)rpary p,rotuots ruo"tt paqdopV :n,mos

l i

ru'suor leredo l le +o lsoo eDelene aql are elep prorr .uels

.u o ltre; ed oqceo ro; sa6reqc l le +o %Agolpp!. e, . { } 1o se6erone ere s}soo etonbeg

'Aels Aep-euru e ql tM luoruoce;del d lq e pue Aels Aep-onnl e ql lm qlr lq

e;duts e rol se6;eqc nnol pue q6tq;o e6e.roê oql a le s lsoc oulure3 l : l

W[]

ffiluouroceldat dlp qulq e;durg ssedAq Areuoro3

V/N0:

000'01,

000'02

000'0t

000'0t

000'09

Uo0)U)proluels ffi

ero n bas

ourrJes l:l ff i

'uol tgteduloc lecol uteul s, lo luaC leclpol4 proluels

oJe slelrdso;1 ounr, teJ l l pue etonbeg 'suot leJedo snot len Jo+

eruJo+! leC ur sableqc le l rdsoq gO-OBG;, a6eran e +o uost ledtuoc yslsoc erec qlleeH leqfut

'seJns€oru elqducsep leclJelunu pu€ 'sgeqc

:rpr eleudordde ile eq plnoqs Uodor rno.( 01 pepueddy

rrlo U red (surer8 ul) sol€rpfqoqrec Jo regrunu"seJuno 71 nd serrol€3 Jo Joqrunu 'loqocl€ e8eluocred

p{qBue^ Iscrrerunu eI{} Jo qc€e Jo uol}?nl?Ao entlducsep,Cruoc e uo peseq lroder e elIJ./K o1 sI {se} JnoA

'9002 '[€ qcffiW 'trroJ'00[nag'ituu tuo'ttpapouxg :a),mos

'secuno u red (sulerS

"eterp,(qoqJ€c Jo Jeqrunu pu€ 'secuno Zl red selJolsc Jou '1oqo c1e e?eluecred :pspnlcul eJe selqelJe^ ooJlp JoJ

iE\ eqJ'ffia1J oI{} ur pe}ecol er€ 'S n oq} ulrrlseruop Surllos-lsoq equo 89 Sururecuoc epe g4t

soslrJax3 6u;ry.1M uodeu

6r(1der rnort sI ler{A ,;E;t sl rofeur roJ useru"0s'I sr repueS ro3 u€eru oqt'9L'z sr. xepu ]urod eperS

Ir lrEeru eql 'EZ'gg sr 1q8rer{ roJ u€eru eq} 'ees lSurqtr(reôr uLreql lo8 I-selqerr€A erll Jo eruos JoJ slels ollldrrcsep

;mw la8 l,uec e/K ples lelqqoJ) JosseJoJd fqrn pu€lsrepun'fi,ilHrro | 'oslv '1{3req roJ pue xepq }urod eper8 roJ sueqc,ilmr$ aql pue rofeur roJ pue JopueE rog s1o1d Jo>lslql\-pue4i1c'r4 3r{} e>lrl-pJre \ qool 1nd1no erl} Jo eruos 'st urelqordim_l_ 'selqer rel rno II€ roJ-sperlc eld eq] 's1o1d re>lslqa

-mrm-xoq oql 'suotlellep pJepu€1s e{} 'suerpeur oq} 'sueeultm:-lle I loE eA.L, 'surrelcxe pue lnolurrd eqt gll/tr no.(il{ffiu ;e-\o seruoc uos;ed sFII 'sesodrnd ,(pn1s JoJ Jo}cnJ}sul

eql ,(q peu8rsse selqelJe^ lecrro8 elac pu€ I€clJeunuIeJoAes Surureluoc les ewp e JoJ sper{c pue 'se1qe1 'uot1-BruroJur freununs pepeeu aq] le8 o1 Iacxfl gosorcll tr esno] poroolunlo^ ssq l€npl^lpq slql 'sserdu4 ol ]u€./y\ .(pe1n-ctped no.( woq./KJo euo 'seleulsselc;o dnotB e qll1\ uollsu-rruexo sorlsrlels rno,( rog ,(prus o1 Suruueld ere no1 VL'e

' ' ' {der pue 'qlee;q daep B e>lel 'epurs no1'esuodse.rreq eredord nor{ leqt s}senbor zlrou oqs 'uorutdo req pe>lsgpus sselEuru€eru f11e1o1 sBA ueqc s1p Wqt peuollueru rueql

Jo euo wqt pu€ lq81u ]s€l sOf,J reluec Isclpeul eerv leuor8ergo Surloeu e Jo ued se Surpes dnorE uorssncslp e ur. pelues-erd se./K elcrue eq] }€q} nor( slle] oqs 'sFp ssncsrp ol uI nofsil€c pu€ scrlsllels q esJnoc e Surlel r(lluerrnc ore no,( s/!\oDI

OAJ JnoI'Jeluoc I€clpotu e q Eurqrozn eJe./v\ nof esoddng'(progu€]S pue 'etonbeg 'outure3

IE) suorlnlllsur Eurleduroc oorr{t te (}ueurecelder dtq pue

'I{ulq elduus 'ssedfq ,ftuuoroc) seJnpoco;d Iuclpetu eorq}roJ se8reqc lelrdsoq 0661 o1 686I o8erene oql eredruoco] peprnord s€.&\ A\oleq Neqc erIJ 'sluollsd xeldruoc eJotupue 're>1crs ?leclpery 'erecrpenl 'lue8tpw ]€oJ1 01 suollez-rue8ro Jor{}o uerp fle{}l eJoru se.,lt. JoruJoJ er{} esn€ceq suoq

-nlrlsrn Surleduroc w ueqtreq8rq dn uenlrp ueeq per{ re}ueJ

IecrpelN proJuels 1€ slsoc rcqt pelldurt (OOO I 'I I requeôNl

'uotlceg ssoursng fepung saLuIJ 4Jor{. Mal{ aqJ ..'ecueqo1 lu€qdrlg proJu€ls eqt Eux€oJ,,) uourer; uuelg ,(q e1c

-Um u€ 'scrlsrlels Jo osnspr orll Jo uol]€rlsnlll u€ sv t4e

sernseory en4drrcseq lecrrerunNl EitUHI U;IIdVHJ yVI

{mWlll i

lffil*r111ffi6,EtiitffMH[ii'

m, lililHhi

lllr -trqffiS

L..l|||llllt|l|lllii[[ri|gfficontainsinformationregard-

rntttutu''i'rr{rilur:iinrb"es from a sample of 838 mutual funds:,ulturuumnl,,-T)?e of stocks comprising the mutual fund

rrrrrnnMlllilll urffiffi^, mid cdp,large cap)illllllfuMm-,r-Cbjective of stocks comprising the mutualirrffillllilltqgmrrrrnm,,*-il o r value)lrrillffiililtii---: nnilli ons of dollarsiiitimr'- S'tl,lE: ch&rges (no or yes)lillfihpnmrrrm

-rlm-ratio of expenses

'iirffiffiilllilmH';-of-loss factor of the

"mxgh ITwelve-month return in 2005

return-Annuali zed return, 2003 100 5uerurn-Annuali zed return, 200 | -200 5

rmmgrense ratio in percentage, 2005 return, three-md five-year return,the mean, median, first quartile, and third

rthe range, interquartile tange, vatiance, stan-io'o. and coefficient of variation.

n box-and-whisker plot. Are the data skewed?'?

,rommmclusions can you reach concerning these

Chapter Review Problems L4I

d. What conclusions can you reach about differencesbetween mutual funds that have a growth objective andthose that have a value objective?

3.79 You wish to compare sm aII cap mid e&p, and largecap mutual funds. For each of these three groups, for thevariables expense ratio in percentage , 2005 return, three-year return, and five-year return,a. Compute the mean, median, first quartile, and third

quartile.b. Compute the range, interquartile tange) variance, stan-


If so, how?d. What conclusions can you reach about differences

between small cap, mid edp, and large cap mutual funds?

Student Survey Data Base3.80 Problem I.27 on page 15 describes a survey of 50undergraduate students (see the file ).For these data, for each numerical variablea. Compute the mean, median, first quartile, and third

quartile.b. Compute the range, interquartile range, variance, stan-


If so, how?d. Write a report summafiztngyour conclusions.

3.81 Problem 1.27 on page 15 describes a survey of 50undergraduate students (see the file ).L. Select a sample of 50 undergraduate students at your

school and conduct a similar survey for those students.b. For the data collected in (a), repeat (a) through (d) of

Problem 3.80.c. Compare the results of (b) to those of Problem 3.80.

3.82 Problem I.28 on page 15 describes a survey of 50MBA students (see the file ffi). For these data,for each numerical variable,a. Compute the mean, median, first quartile, and third

quartile.b. Compute the tange, interquartile tange, vatrance, stan-


If so, how?d. Write a report summarrzing your conclusions.

3.83 Problem I.28 on page 15 describes a survey of 50MBA students (see the fileffi).a,, Select a sample of 50 graduate students from your MBA

program and conduct a similar survey for those students.b. For the data collected in (a), repeat (a) through (d) of

Problem 3 .82.c. Compare the results of (b) to those of Problem 3 .82.

to net assets ln per-

mutual fund (low,

,,fllll

w"""""""ush to compare mutual funds that have fees to,ffi, not have fees. For each of these two groups,

expense ratio in percentage, 2005 return,and five-year return,

the mean, median, first quartile, and third

ffie range, interquartile range, variance, stan-ion- and coefficient of variation.

n hox-and-whisker plot. Are the data skewed?

lu@mmmctusions aan you reach about differencesand those that dommunral funds that have fees

fuss?

wrsh to compare mutual funds that have ah"e to those that have a value objective. Forfino groups, for the variables expense ratio in

3005 return, three -yeat return, and five-year

uhe nnean, median, first quartile, and third

ffie range, interquartile range, vatiance, stan-Mon. and coefficient of variation.a box-and-whisker plot. Are the data skewed?l

'( t SO t 'ssel4 .ftnqxnq:gl srsrfJouy otoe {"toqruo1dxE Io SuryndwoJ puos 'sttorlncryddy'urlEeoH 'C 'CI puu 'C A 'ueurelle1

' ( t tA t'r(e1s e16-uos rppv"frurpeeU) slsrQnuy otoe rfuorutoldxE ''1'fe>1n; 't

h,riltr!il:'itsenb leril o1 sosuodserJo Jequnu eql pel5ull eAeI{$rolJeJ ler{,/y\.suor}senb Jor{}o eq} ue{} sesuodser

t seq ^(enrns er{} Jo uorlsenb }sel eq} leqt e}oNI 't

u eseq] ezrrerJrums o] r(puereglp op plno1K no.( 3u1qtprerl] sI reploJ eseS qe1!\ I Iou-cIJ luepnls eql uo

rorq'.{e,rrn S-Ufl oq} ro u4q'fa,rrn S-U fl/a1gnEu1r dSquard' alara ees) fen"rns Jeruolsnc sll Jo sllnseJ

#zueruruns ol pesn unupug spoqleul eq] olsnleÊ 'z

aprocer s6un6pug Jo uopdec"redrreJ€ scrlsll€1s ,fueululns esoql plno a 1KoH eslul€lc

,(pn1s EocroJ {ss} oq} q}l/K pepnlcul;uJ wLIt ]roder e uI sEurpurg rnor( ezrtevnuns 'g

elold re>lslq^\-\og eql ruor; epetu eq }ouue.c wqt }old rcqt IuoU

'(IOOZ'uor1€rodro3

Uosorcll tr :V16 puourpe$ n7Z p)xfl tIoso,tcry'Z'fuaot

'sser4 .(lrsrenlun proJXO :>lro1 A\oI\D 'pe r{}9 '{toaq1uounqqusle : I awnlol 'srttsryo$ Io [,toaqJ pacuo^pvs, l lopuax ?. lO') ' f pue'pen1g 'V ' 'D' IN 6l l€pue) ' I

s.un6pug Uoddns scrlsrl€1s ,fteurruns qcns plno^\ lll.oH

eselqelret Kue JoJ pelnduoc eq seJnseeru err4dposop ueJ 'I

:SurzlrolloJer{} Jo./v\sue uer{} pu? Ercp Surpoddns Jror{} euru€xe3J pueeurll puoces e (reploJ eseC qe6 I IOU-CJ tuepruS eql urellJ urlq'ungpufl oql uedo ro) rulq'ungpufi7ag,r8u1rdg

/ruoJ'IluquoJd'ar,t. tr 1B 'secrnJes 8u4senul ungpug ]lsIA'7 "ta4dnq3

tuo,tt asDJ qaful SumuquoJ slql ur sarnsoaw aa$dunsap

lo)uawnu to asn ,tadotd aql ruoqn a8pafv,owt tno{ tlddy

ruJoJ nor( rre} suorsnlcuos }?q1v[ '11 lcnrlsuoc pue InJ-esn eq tq8pr 1eq1 ,(eldslp I€clqder8 rerlloue ,.(grluepl 'Z

'lold Je>lstq.,l\-puu-xoq e eleJeue8 pue 'sems-?Oru enrlducsep I€crJerunu eleudo"rdde eq1 elnduro3 'I

:4Jrluopr no^ elq?u? orll roc epepesu serns-eoru enrlducsep Iecrrerunu oJe (tt eEed ees) es€c (plo"taHalpa8uuds eq1 3ur8uuu6,,7 nldeqJ erp q elqelJe lerlê roC

'v

sernseory en4drrcseq l€crrerunN EEUHI UEJdyHJ zvl

chapter 3

Documents

eql op jo lelnuts

jo sjoruo

eql op rousputg

jo eopl ou e

sremsue eql ecr

b jo il

uec eql ejnselu uec

values number of values