Understanding Numerical Data

Understanding Numerical Data. Statistics Statistics is a tool used to answer general questions on the basis of a limited amount of specific data. Statistics.

Dec 30, 2015


Arline Lyons
Transcript
Page 1: Understanding Numerical Data. Statistics Statistics is a tool used to answer general questions on the basis of a limited amount of specific data. Statistics.

Understanding Numerical Data

Page 2:

Statistics

• Statistics is a tool used to answer general questions on the basis of a limited amount of specific data.

• Statistics allows us to make decisions about a population based on a sample of that population rather than on the entire population.

Page 3:

Why do we need Statistics?

• Let’s say that you want to know the lipid content of a typical corn grain.

• You could analyze one grain, but how would you know that you’d picked a “typical” grain?

• You’d get a better estimate of “typical” if you increased your sample size to a few hundred grains, or even to 10,000. Or to 1,000,000.

• Better yet... The only way to be certain of your conclusions would be to measure all of the corn grains in the world.

• Since this is clearly impossible, you must choose grains that represent all of the grains in the world – that is, you must be working with a representative sample.

Page 4:

Statistics Terms

• Mean: The mean is the arithmetic average of a group of measurements: the sum of the measurements divided by the number of measurements.
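Written as a formula, the definition above looks like the sketch below. The symbols are an assumption chosen to match the standard deviation formula used later in these slides: X is each measurement, n is the number of measurements, and X̄ is the mean.

```latex
\bar{X} = \frac{\sum X}{n}
```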

Page 5:

Scientists often base answers to investigative questions on averages

• Thus, in the earlier investigative question about the lipid content of a typical corn grain, if you took a sample of 10,000 corn grains, measured their lipid content, then calculated their average (mean) lipid content, would that average (mean) be an adequate description of the lipid content of all corn in the world?

• Why? Or why not?

Page 6:

Other considerations - - -

Page 7:

Assessment Statement

• 1.1.1 State that error bars are a graphical representation of the variability of data.

• 1.1.4 Explain how the standard deviation is useful for comparing the means and the spread of data between two or more samples.

Page 8:

Looking at these data sets what observations can you make?

  Boys Scores   Girls Scores
  60            98
  62            42
  68            88
  70            92
  63            38
  65            56
  65            95
  58            92
  64            50
  63            89

Page 9:

Based on the data, what do you think the average boys’ score and the average girls’ score are?

  Boys Scores   Girls Scores
  60            98
  62            42
  68            88
  70            92
  63            38
  65            56
  65            95
  58            92
  64            50
  63            89

Page 10:

Average Score

• Boys – 64%

• Girls – 74%

Does this mean that girls did significantly better on the test?

Page 11:

Does the girls’ average of 74% accurately describe how the typical girl did on this test? Why? Or why not?

  Boys Scores Girls Scores

  60 98

  62 42

  68 88

  70 92

  63 38

  65 56

  65 95

  58 92

  64 50

  63 89

Average 63.8 74
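The Average row can be reproduced from the ten scores in each column. The snippet below is a minimal sketch; the variable names `boys`, `girls`, `boys_mean` and `girls_mean` are illustrative and not part of the original slides.

```python
from statistics import fmean

# Scores copied from the Boys Scores / Girls Scores table.
boys = [60, 62, 68, 70, 63, 65, 65, 58, 64, 63]
girls = [98, 42, 88, 92, 38, 56, 95, 92, 50, 89]

# The mean is the sum of the scores divided by how many scores there are.
boys_mean = fmean(boys)
girls_mean = fmean(girls)

print(f"Boys average:  {boys_mean:.1f}")
print(f"Girls average: {girls_mean:.1f}")
```

The two averages summarise each column in one number, but they say nothing yet about how tightly the individual scores cluster around them.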

Page 12:

Does the boys’ average of 63.8% accurately describe how the typical boy did on this test? Why? Or why not?

  Boys Scores Girls Scores

  60 98

  62 42

  68 88

  70 92

  63 38

  65 56

  65 95

  58 92

  64 50

  63 89

Average 63.8 74

Page 13:

Looking at the data, what is the range (lowest score & highest score) of the scores for both boys & girls?

  Boys Scores   Girls Scores
  60            98
  62            42
  68            88
  70            92
  63            38
  65            56
  65            95
  58            92
  64            50
  63            89
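One way to check an answer to the range question is to let Python pick out the lowest and highest score in each column. This is only a sketch; the helper `score_range` is a hypothetical name introduced here for illustration.

```python
# Scores copied from the Boys Scores / Girls Scores table.
boys = [60, 62, 68, 70, 63, 65, 65, 58, 64, 63]
girls = [98, 42, 88, 92, 38, 56, 95, 92, 50, 89]


def score_range(scores):
    """Return (lowest, highest, highest - lowest) for a list of scores."""
    low, high = min(scores), max(scores)
    return low, high, high - low


boys_range = score_range(boys)
girls_range = score_range(girls)

print("Boys  (lowest, highest, difference):", boys_range)
print("Girls (lowest, highest, difference):", girls_range)
```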

Page 14:

Standard Deviation

• Shows, roughly, the average difference each data point has from the mean.

• Indicates how widely the data set is spread: a large standard deviation usually goes with a big range.

Page 15:

The spread of data

• Averages do not tell us everything about a sample.

• Data can be very uniform, meaning all bunched around the mean, or it can be spread out a long way from the mean.

• The statistic that measures this spread is called the standard deviation.

Page 16:

Standard Deviation

• The standard deviation is a measure of the variation of the data.

• For data that is evenly distributed on each side of the mean (a normal distribution), about 68% of the data lies within one standard deviation of the mean.

Page 17:

Formula for Standard Deviation

$$SD = \sqrt{\frac{\sum (X - \bar{X})^2}{n - 1}}$$

where:
√ = square root
Σ = sum (sigma)
X = score for each point in data
X̄ = mean of scores for the variable
n = sample size (number of observations or cases)
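The formula (square root of the sum of squared differences from the mean, divided by n - 1) can be applied step by step to the boys' and girls' test scores used elsewhere in these slides. The sketch below is illustrative only: `sample_sd` is a hypothetical helper written for this example, and the standard library's `statistics.stdev` is used purely as a cross-check.

```python
from math import sqrt
from statistics import stdev

# Scores copied from the Boys Scores / Girls Scores table.
boys = [60, 62, 68, 70, 63, 65, 65, 58, 64, 63]
girls = [98, 42, 88, 92, 38, 56, 95, 92, 50, 89]


def sample_sd(scores):
    """Standard deviation using the n - 1 formula from the slide."""
    n = len(scores)                                       # sample size
    mean = sum(scores) / n                                # mean of the scores
    squared_diffs = sum((x - mean) ** 2 for x in scores)  # sum of (X - mean)^2
    return sqrt(squared_diffs / (n - 1))


boys_sd = sample_sd(boys)
girls_sd = sample_sd(girls)

print(f"Boys SD:  {boys_sd:.1f}")
print(f"Girls SD: {girls_sd:.1f}")

# Cross-check against the standard library, which also divides by n - 1.
print(abs(boys_sd - stdev(boys)) < 1e-9, abs(girls_sd - stdev(girls)) < 1e-9)
```

Dividing by n - 1 rather than n treats the scores as a sample from a larger population, which matches the formula on the slide.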

Page 18:
Page 19:

Standard Deviation

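• About 68% of the data falls within 1 standard deviation of the mean (for a normal distribution).

• About 95% of the data falls within 2 standard deviations of the mean (for a normal distribution).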
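These percentages describe an ideal normal distribution, so a small real sample will only match them approximately. As a rough illustration, the sketch below counts what fraction of the boys' and girls' test scores from these slides lies within 1 and 2 standard deviations of their own mean. The helper `fraction_within` is a hypothetical name used only for this example.

```python
from statistics import fmean, stdev

# Scores copied from the Boys Scores / Girls Scores table.
boys = [60, 62, 68, 70, 63, 65, 65, 58, 64, 63]
girls = [98, 42, 88, 92, 38, 56, 95, 92, 50, 89]


def fraction_within(scores, k):
    """Fraction of scores lying within k standard deviations of the mean."""
    mean = fmean(scores)
    sd = stdev(scores)  # sample standard deviation (n - 1)
    inside = [x for x in scores if abs(x - mean) <= k * sd]
    return len(inside) / len(scores)


for label, scores in (("Boys", boys), ("Girls", girls)):
    print(label,
          "within 1 SD:", fraction_within(scores, 1),
          "within 2 SD:", fraction_within(scores, 2))
```

With only ten scores per group the fractions come out near, but not exactly at, 68% and 95%.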

Page 20:

Based on the range of the data sets, which gender do you think would have a bigger Standard Deviation, boys or girls?

  Boys Scores   Girls Scores
  60            98
  62            42
  68            88
  70            92
  63            38
  65            56
  65            95
  58            92
  64            50
  63            89

Page 21:

Standard Deviation

• Boys – 3.5

• Girls – 24.3

What does this difference in standard deviation mean?

Page 22:

What would be the best way to graph this data in a lab report? What things should your graph include?

  Boys Scores Girls Scores

  60 98

  62 42

  68 88

  70 92

  63 38

  65 56

  65 95

  58 92

  64 50

  63 89

Average 63.8 74

Standard Deviation 3.5 24.3

Page 23:

Based on this graph, what can you conclude about the difference between how boys and girls did on this test?

[Figure: bar chart "Average Score of Boys and Girls on a test". Y-axis: Score on Test (%), 0 to 120. Bars: boys, girls. Error bars represent the standard deviation of the data sets.]

Page 24:

Data Analysis Conclusions: things to think about

BIG vs. SMALL ERROR BARS

• Big error bars mean lots of variation in the data, so the data is less reliable to draw conclusions from.

• Small error bars mean less variation in the data, so the data is more reliable to draw conclusions from.

Page 25:

Big error bars = large standard deviation = BIG Range in data

Small error bars = small standard deviation = small range in data

Page 26:

BIG vs. SMALL Error Bars

[Figure: bar chart "Average Score of Boys and Girls on a test". Y-axis: Score on Test (%), 0 to 120. Bars: boys, girls. Error bars represent the standard deviation of the data sets.]

Page 27:

Data Analysis Conclusions: things to think about

OVERLAPPING ERROR BARS – When the error bars overlap on a graph, it suggests that there is NOT a significant difference between the averages of the data sets.

[Figure: bar chart "Average Score of Boys and Girls on a test". Y-axis: Score on Test (%), 0 to 120. Bars: boys, girls. Error bars represent the standard deviation of the data sets.]
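Whether two error bars overlap can also be checked numerically. In the sketch below each error bar is taken to run from mean minus one standard deviation to mean plus one standard deviation, which is an assumption based on the figure caption. The helpers `error_bar` and `bars_overlap` are hypothetical names introduced for illustration.

```python
from statistics import fmean, stdev

# Scores copied from the Boys Scores / Girls Scores table.
boys = [60, 62, 68, 70, 63, 65, 65, 58, 64, 63]
girls = [98, 42, 88, 92, 38, 56, 95, 92, 50, 89]


def error_bar(scores):
    """Return (bottom, top) of an error bar: mean - SD to mean + SD."""
    mean = fmean(scores)
    sd = stdev(scores)  # sample standard deviation (n - 1)
    return mean - sd, mean + sd


def bars_overlap(bar_a, bar_b):
    """True if the two (bottom, top) intervals share any values."""
    return bar_a[0] <= bar_b[1] and bar_b[0] <= bar_a[1]


boys_bar = error_bar(boys)
girls_bar = error_bar(girls)

print(f"Boys bar:  {boys_bar[0]:.1f} to {boys_bar[1]:.1f}")
print(f"Girls bar: {girls_bar[0]:.1f} to {girls_bar[1]:.1f}")
print("Overlap:", bars_overlap(boys_bar, girls_bar))
```

For these scores the boys' bar sits entirely inside the girls' much wider bar, so the two overlap.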

Page 28:

What overlapping error bars mean with respect to average data between

Page 29:

Overlapping Error Bars

[Figure: bar chart "Average Score of Boys and Girls on a test". Y-axis: Score on Test (%), 0 to 120. Bars: boys, girls. Error bars represent the standard deviation of the data sets.]

Page 30:

Data Analysis Conclusions: things to think about

NON-OVERLAPPING ERROR BARS – When the error bars DO NOT overlap on a graph, it means that there MAY BE a significant difference between the averages of the data sets.

– To show that the difference between the data sets is significant, you must do a t-test.

– t-tests test the differences between means.

[Figure: bar chart "Average Score of Boys and Girls on a test". Y-axis: Score on Test (%), 0 to 120. Bars: boys, girls. Error bars represent the standard deviation of the data sets.]
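The slides do not say which kind of t-test to use. As one possible illustration, the sketch below computes Welch's t statistic (a version of the t-test that does not assume the two groups have equal spread) for the boys' and girls' test scores. The function `welch_t` is a hypothetical helper written for this example, and the choice of Welch's test is an assumption rather than something stated in the slides.

```python
from math import sqrt
from statistics import fmean, variance

# Scores copied from the Boys Scores / Girls Scores table.
boys = [60, 62, 68, 70, 63, 65, 65, 58, 64, 63]
girls = [98, 42, 88, 92, 38, 56, 95, 92, 50, 89]


def welch_t(a, b):
    """Return (t statistic, approximate degrees of freedom) for two samples."""
    va = variance(a) / len(a)  # squared standard error of the mean of a
    vb = variance(b) / len(b)  # squared standard error of the mean of b
    t = (fmean(a) - fmean(b)) / sqrt(va + vb)
    # Welch-Satterthwaite approximation for the degrees of freedom.
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


t, df = welch_t(girls, boys)
print(f"t = {t:.2f}, degrees of freedom = {df:.1f}")
```

The t value is then compared with a critical value from a t-table. For about 9 degrees of freedom the two-tailed 5% critical value is roughly 2.26, so a t value below that would not count as a significant difference between the means.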

Page 31:

NON OVERLAPPING ERROR BARS

[Figure: bar chart "Average Score of Boys and Girls on a test". Y-axis: Score on Test (%), 0 to 120. Bars: boys, girls. Error bars represent the standard deviation of the data sets.]

Page 32:

What non-overlapping error bars mean

Page 33:

YOUR Turn To PRACTICE.

Page 34:

For condition A, is there a significant difference between the control group & experiment group? Why or Why not?

Page 35:

For condition B, is there a significant difference between the control group & experiment group? Why or Why not?

Page 36:

For condition C, is there a significant difference between the control group & experiment group? Why or Why not?

Page 37:

Which data set (type of food) seems to be the most reliable and why?

Page 38:

Between which types of food does there seem to be a significant difference in the growth of fish? Explain why you made that conclusion.

Page 39:

Assessment Statement

• 1.1.1 State that error bars are a graphical representation of the variability of data.

• 1.1.4 Explain how the standard deviation is useful for comparing the means and the spread of data between two or more samples.

Page 40:
Page 41: