Top Banner
Rules of Data Dispersion • By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval about the mean.
18

Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Jan 16, 2016

Download

Documents

Neil Short
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Rules of Data Dispersion

• By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval about the mean.

Page 2: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Rules of Data Dispersion

• Empirical Rule• Chebyshev’s Theorem

(IMPORTANT TERM: AT LEAST)

Page 3: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Empirical Rule

Applicable for a symmetric bell shaped distribution / normal distribution.There are 3 rules:i. 68% of the observations lie in the interval

(mean ±SD)ii. 95% of the observations lie in the interval

(mean ±2SD)iii. 99.7% of the observations lie in the interval

(mean ±3SD)

Page 4: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Empirical Rule

Page 5: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Empirical Rule

• Example: 95% of students at school are between 1.1m and 1.7m tall. Assuming this data is normally distributed can you calculate the mean and standard deviation?

Page 6: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Empirical Rule

Page 7: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Empirical Rule

• The age distribution of a sample of 5000 persons is bell shaped with a mean of 40 yrs and a standard deviation of 12 yrs. Determine the approximate percentage of people who are 16 to 64 yrs old.

Page 8: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.
Page 9: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Chebyshev’s Theorem

Page 10: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Chebyshev’s Theorem

• Applicable for any distribution /not normal distribution

• At least of the observations will be in the range of k standard deviation from mean where k is the positive number exceed 1 or (k>1).

2

1(1 )

k

Page 11: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Chebyshev’s Theorem

• Example Assuming that the weight of students in this

class are not normally distributed, find the percentage of student that falls under 2SD.

Page 12: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Chebyshev’s Theorem

• Consider a distribution of test scores that are badly skewed to the right, with a sample mean of 80 and a sample standard deviation of 5. If k=2, what is the percentage of the data fall in the interval from mean?

Page 13: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Measures of Position

To describe the relative position of a certain data value within the entire set of data.•z scores•Percentiles•Quartiles•Outliers

Page 14: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Quartiles

• Divide data sets into fourths or four equal parts.

Page 15: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Boxplot

1

3

Lower Fence 1.5( )

Upper Fence 1.5( )

Q IQR

Q IQR

3 1IQR Q Q

Page 16: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Boxplot

Page 17: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Outliers

• Extreme observations• Can occur because of the error in

measurement of a variable, during data entry or errors in sampling.

Page 18: Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.

Outliers

Checking for outliers by using Quartiles

Step 1: Determine the first and third quartiles of data.Step 2: Compute the interquartile range (IQR).

Step 3: Determine the fences. Fences serve as cutoff points for determining outliers.

Step 4: If data value is less than the lower fence or greater than the upper fence, considered outlier.

3 1IQR Q Q

1

3

Lower Fence 1.5( )

Upper Fence 1.5( )

Q IQR

Q IQR