Top Banner
LECTURE 1 MATH 221 STATISTICS FOR DECISION MAKING Professor Heard Lecturer All materials © BN Heard and can not be copied without permission. Visit the Stat Cave online at www.facebook.com/statcave
27

Statistics for Decision Making Week 1 Lecture

Jul 04, 2015

Download

Education

Brent Heard

Samples, populations, quantitative and qualitative data, random sampling, stratified sampling, systematic sampling, convenience sampling, cluster sampling, frequency distributions, class width, class boundaries, midpoints, histograms, stem and leaf plots, mean, median, mode, quartiles, outliers, etc. Using Excel for statistics and statistical analysis.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Statistics for Decision Making Week 1 Lecture

LECTURE 1

MATH 221

STATISTICS FOR DECISION MAKINGProfessor Heard

Lecturer

All materials © BN Heard and can not be copied without permission. Visit the Stat Cave online at

www.facebook.com/statcave

Page 2: Statistics for Decision Making Week 1 Lecture

35 0 30 35 30 35

35 25 35 35 33 26

34 35 35 35 34 35

25 33 25 35 15 0

20 30 20 25 18 27

The following data represents the Discussion Board assignment

scores for all of my Math 221 students. Would this be a

population or a sample?

(Scores out of 35 points)

Page 3: Statistics for Decision Making Week 1 Lecture

35 0 30 35 30 35

35 25 35 35 33 26

34 35 35 35 34 35

25 33 25 35 15 0

20 30 20 25 18 27

The following data represents the Discussion Board assignment

scores for all of my Math 221 students. Would this be a

population or a sample?

(Scores out of 35 points)

This would be a population since the data is from ALL of my students.

Samples are “subsets” of populations.

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 4: Statistics for Decision Making Week 1 Lecture

0 2 1 5 0 7

3 3 2 2 2 1

2 0 1 2 3 0

0 0 1 2 0 0

1 6 0 1 2 1

The following data represents the number of children each of my

Math 221 students have. Would this data set be qualitative or

quantitative?

(Number of Children)

Page 5: Statistics for Decision Making Week 1 Lecture

0 2 1 5 0 7

3 3 2 2 2 1

2 0 1 2 3 0

0 0 1 2 0 0

1 6 0 1 2 1

The following data represents the number of children each of my

Math 221 students have. Would this data set be qualitative or

quantitative?

(Number of Children)

This would be quantitative data since we are dealing with numbers having

meaning.

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 6: Statistics for Decision Making Week 1 Lecture

1| 0 5 5 5 8

2| 5 5 8 8 9 9

3| 0 0 2 5 5 5 5 5 5

The following data set represents the DB scores of 2 students randomly

chosen from each of a total of 10 Statistics classes. What type of data set

(or sampling) would this represent?

Page 7: Statistics for Decision Making Week 1 Lecture

Know the difference in Sampling Techniques

Random (simply picking where every member has an equal chance – drawing out of a bag – generating random numbers)

Stratified (dividing your population into strata and then picking a certain number from each strata)

Systematic (picking every nth one – for example testing every 20th unit off of an assembly line)

Convenience (just asking who is available or who is listening, not making an effort to get a true sample)

Cluster (dividing the population into clusters and sampling everyone in one or two of the clusters)

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 8: Statistics for Decision Making Week 1 Lecture

1| 0 5 5 5 8

2| 5 5 8 8 9 9

3| 0 0 2 5 5 5 5 5 5

The following data set represents the DB scores of 2 students randomly

chosen from each of a total of 10 Statistics classes. What type of data set

(or sampling) would this represent?

This would be stratified sampling since we divided the population into

Strata (classes) and then randomly selected two from each.

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 9: Statistics for Decision Making Week 1 Lecture

row Class (Weight lbs) frequency

1 4-11 7

2 12-19 4

3 20-27 2

4 28-35 2

5 36-43 1

The following table of data represents the weight of 16 randomly selected

dogs in my neighborhood.

Looking only at row 2, determine the class width, class boundaries and

midpoint.

Page 10: Statistics for Decision Making Week 1 Lecture

row Class (Weight lbs) frequency

1 4-11 7

2 12-19 4

3 20-27 2

4 28-35 2

5 36-43 1

Looking only at row 2, determine the class width, class boundaries and

midpoint.

Class

Width

12-4 = 8

You

could

have

just as

easily

said 20-

12,28-

20 or

36-28

and still

gotten

8.

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 11: Statistics for Decision Making Week 1 Lecture

row Class (Weight lbs) frequency

1 4-11 7

2 12-19 4

3 20-27 2

4 28-35 2

5 36-43 1

Looking only at row 2, determine the class width, class boundaries and

midpoint.

The class

boundaries

for row 2

would be

11.5 and

19.5,

simply

subtract .5

from the

lower and

add .5 to

the upper.

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 12: Statistics for Decision Making Week 1 Lecture

row Class (Weight lbs) frequency

1 4-11 7

2 12-19 4

3 20-27 2

4 28-35 2

5 36-43 1

Looking only at row 2, determine the class width, class boundaries and

midpoint.

The

midpoint for

row 2 would

be 15.5, just

add (12+19)

and divide

by 2.

Similarly the

midpoint for

row 5 would

be 39.5

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 13: Statistics for Decision Making Week 1 Lecture

The histogram below describes the height of 25

students. Looking only at the 64-65 group, determine

the class width, class boundaries and midpoint.

53-54 55-57 58-59 60-61 62-63 64-65 66-67 68-69 70-71 72-73

Page 14: Statistics for Decision Making Week 1 Lecture

The histogram below describes the height of 25

students. Looking only at the 64-65 group, determine

the class width, class boundaries and midpoint.

Class width would be 2, because 64 minus 62 is 2.

Class boundaries would be 63.5 and 65.5

Midpoint would be 64.5 because (64+65)/2 = 64.5

53-54 55-57 58-59 60-61 62-63 64-65 66-67 68-69 70-71 72-73

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 15: Statistics for Decision Making Week 1 Lecture

1| 0 5 5 5 8

2| 5 5 8 8 9 9

3| 0 0 2 5 5 5 5 5 5

The following is a sample of discussion board scores

for a group of students. Find the first quartile, median

and third quartile

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 16: Statistics for Decision Making Week 1 Lecture

1| 0 5 5 5 8

2| 5 5 8 8 9 9

3| 0 0 2 5 5 5 5 5 5

The following is a sample of discussion board scores

for a group of students. Find the first quartile, median

and third quartile. Input data input an Excel column, if

the data is not provided. 10

15

15

15

18

25

25

28

28

29

29

30

30

32

35

35

35

35

35

35

Etc.

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 17: Statistics for Decision Making Week 1 Lecture

10

15

15

15

18

25

25

28

28

29

29

30

30

32

35

35

35

35

35

35

To find the first quartile use the quartile

function in Excel to get 23.25

To find the median use the median

function in Excel to get 29.00. Note this is

also the second quartile

To find the third quartile use the quartile

function in Excel to get 35.00

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 18: Statistics for Decision Making Week 1 Lecture

20 16 12 8

18 18 20 18

20 12 20 20

16 20 20 20

The following is a random sample of

module scores for a group of students.

Find the mean, median and mode.

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 19: Statistics for Decision Making Week 1 Lecture

20 16 12 8

18 18 20 18

20 12 20 20

16 20 20 20

The following is a random sample of module scores

for a group of students. Find the mean, median and

mode. Input data input an Excel column, if the data is

not provided. Ordering is a good idea. Use the Data

Tab, select “Sort.”8

12

12

16

16

18

18

18

20

20

20

20

20

20

20

20

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 20: Statistics for Decision Making Week 1 Lecture

8

12

12

16

16

18

18

18

20

20

20

20

20

20

20

20

To find the mean use the “average”

function in Excel to get 17.375, if rounded

to the nearest whole number this would be

simply 17.

To find the median use the “median”

function in Excel to get 19.

To find the mode VISUALLY INSPECT

THE DATA TO FIND 20 AS THE MODE.

NOTE: WITH MULTIPLE MODES, EXCEL

WILL ONLY RETURN ONE MODE.

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 21: Statistics for Decision Making Week 1 Lecture

20 16 12 8

18 18 20 18

20 12 20 20

16 20 20 20

The following is a random sample of

module scores for a group of students.

Find the standard deviation.

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 22: Statistics for Decision Making Week 1 Lecture

8

12

12

16

16

18

18

18

20

20

20

20

20

20

20

20

To find the standard deviation for a

sample, use the “stdev” function in Excel

to get 3.703602 or 3.7 rounded to one

decimal place.

Note: Use “stdev” for samples and

“stdevp” for populations.

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 23: Statistics for Decision Making Week 1 Lecture

52

58

75

78

82

88

89

92

92

92

96

96

98

98

98

98

99

99

99

100

100

100

100

100

100

The following is a random sample of exam scores for a

group of students. The mean is 91.2 and the standard

deviation is 13.0. Determine if there are any outliers,

defining an outlier as a data point outside of the mean

+/- 2 standard deviations.

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 24: Statistics for Decision Making Week 1 Lecture

52

58

75

78

82

88

89

92

92

92

96

96

98

98

98

98

99

99

99

100

100

100

100

100

100

The following is a random sample of exam scores for a group

of students. The mean is 91.2 and the standard deviation is

13.0. Determine if there are any outliers, defining an outlier

as a data point outside of the mean +/- 2 standard deviations.

91.2 – 2(13) = 65.2 and 91.2 + 2(13) =

117.2.

The only scores outside of these bounds

are 52 and 58, thus they are the only two

outliers.

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

65.2

117.2

Page 25: Statistics for Decision Making Week 1 Lecture

52

58

75

78

82

88

89

92

92

92

96

96

98

98

98

98

99

99

99

100

100

100

100

100

100

The following is a random sample of exam

scores for a group of students. The mean is

91.2 and the standard deviation is 13.0. How

many standard deviations is a score of 88 from

the mean?

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 26: Statistics for Decision Making Week 1 Lecture

52

58

75

78

82

88

89

92

92

92

96

96

98

98

98

98

99

99

99

100

100

100

100

100

100

The following is a random sample of exam

scores for a group of students. The mean is

91.2 and the standard deviation is 13.0. How

many standard deviations is a score of 88 from

the mean?

(88-92.1)/13 = -0.24615 or -0.25 rounded

to two decimal places.

This could also be described as “0.25

below the mean.”

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave

Page 27: Statistics for Decision Making Week 1 Lecture

Visit me on Facebook at

www.facebook.com/statcave

You DO NOT have to have a Facebook account

to view these, but I would like having you “Like”

the Stat Cave if you do.

I post things from time to time, both informative

and fun.

You can download these charts by clicking the

“Download” button above the charts.

(c) B

N H

ea

rd w

ww

.face

bo

ok.c

om

/sta

tcave