! www.clutchprep.com
!
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
CONCEPT: MEAN EVALUATION
The _____________________________ measures how close data results are in relation to the mean or average value.
s =(xi − x)
2
i∑n−1
____ = Individual Measurement ____ = Number of Measurements
____ = Average or Mean ____ = Degrees of Freedom
_________ = variance _________ = Relative Standard Deviation (Coefficient of Variation)
EXAMPLE: Data below gives the volumes obtained by a chemist from the use of a pipet. Determine the standard deviation.
24.9 mL, 25.0 mL, 24.8 mL, 24.6 mL, 24.6 mL, 24.3 mL
Volume
(xi)
Difference from the mean
(xi − x)
Difference from the mean squared
(xi − x)2
(xi − x)2
i∑
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 2
CONCEPT: THE GAUSSIAN DISTRIBUTION
Performing an experiment numerous times with no systematic error results in a smooth curve called the Gaussian Distribution.
σ
µx
f(x)
e = 2.71828
f(x) = 1σ 2π
e−(x−µ)2 /2σ2
In terms of the Gaussian Distribution curve, increasing the number of measurements in the experiment:
• Changes the mean, _____, to mu, _____ to represent the population mean.
• Changes the standard deviation, _____, to sigma, _____, to represent the population standard deviation.
The shape of the Gaussian Distribution curve can occur by:
• Changing _____, which will shift the distribution curve to the left or right.
• Changing _____, which will increase or decrease the broadness of the distribution curve.
Normally each distributed variable has its own mean and standard deviation.
• The standard normal distribution simplifies this by setting the mean at ___ and standard deviation in units of___.
x
f(x)
-3 -2 -1 0 +1 +2 +3
-3 -2 -1 0 +1 +2 +3
0.1% 2.3% 15.9% 50% 84.1% 97.7% 99.9%
______ of the data falls between the -1 to +1 area
______ of the data falls between the -2 to +2 area
______ of the data falls between the -3 to +3 areaStandard Deviation
y = e−z2 /2
2π
σ
Z-ScoreCumulative %
z = Value−meanStandard Deviation
=X−µσ
Standard Normal Distribution Formula Abscissa (Z-Score)
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 3
PRACTICE: THE GAUSSIAN DISTRIBUTION & Z-TABLES
The use of Z-Tables is essential in the determination of probabilities.
-3 z 0 +3
Probability
-3 0 z +3
Probability
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 4
PRACTICE: THE GAUSSIAN DISTRIBUTION & Z-TABLES CALCULATIONS 1
EXAMPLE 1: Suppose there are 100 students in your analytical lecture and at the end of the semester the class average is an 80 with a standard deviation of 5.3, determine the distribution and probability of grades based on your understanding of the Gaussian distribution curve.
x
f(x)
-3 -2 -1 0 +1 +2 +3
Standard Deviation
EXAMPLE 2: From EXAMPLE 1, determine the percentage of final grades that would lie below 71.
x
f(x)
-3 -2 -1 0 +1 +2 +3
Standard Deviation
PRACTICE: From EXAMPLE 1, determine the percentage of final grades that would lie between 88 to 92.
x
f(x)
-3 -2 -1 0 +1 +2 +3
Standard Deviation
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 5
CONCEPT: CONFIDENCE INTERVALS
A confidence interval is a specific interval estimate of a parameter determined by using data obtained from a sample.
• For example a 95% confidence interval means we are 95% confident the mean lies within a given interval.
Confidence int erval = x ± tsn
______ = Student's t
______ = standard deviation
______ = # of measurements
______ = average or mean
A Student’s t is a statistical table used in our understanding of confidence intervals and in the comparative data from different experiments.
EXAMPLE: Construct a 95% confidence interval for an experiment that found the mean temperature for a given city in July as 103.5 oC with a standard deviation of 1.8 from 10 measurements.
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 6
PRACTICE: CONFIDENCE INTERVALS CALCULATIONS 1
EXAMPLE 1: The barium content of a metal ore was analyzed several times by a percent composition process.
0.010 0.011 0.004 0.011
Calculate the mean, median and mode.
EXAMPLE 2: From EXAMPLE 1, calculate the standard deviation.
PRACTICE: From the examples given above, find the 90% confidence interval.
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 7
CONCEPT: T-TEST
The t-test is used to test the _______________ of two populations, one of which could be a standard.
• In order to test the similarities and differences between these two populations you can utilize the t-score. Use the t score formula when we don’t know the population standard deviation and have a sample size less than _______.
t = x−µ0sn
______ = sample average
______ = population average
______ = sample standard deviation
______ = number of samples
t-calculated (for equal variance)
t-calculated (for unequal variance)
t-calculated (paired data)
tCalculated =x1 − x2
spooled⋅
n1n2n1 + n2
spooled =s12 (n1 −1)+ s2
2 (n2 −1)n1 + n2 − 2
t − calculated =x1 − x2
s12
n1+s22
n2
s = Σ (di −d)2
n−1tCalculated =d
s⋅ n
s12
n1+s22
n2
⎞
⎠⎟2⎛
⎝⎜⎜
s12
n1
⎞
⎠⎟2⎛
⎝⎜⎜
n1 −1+
s22
n2
⎞
⎠⎟2⎛
⎝⎜⎜
n2 −1
⎫
⎬
⎪⎪⎪⎪
⎭
⎪⎪⎪⎪
⎧
⎨
⎪⎪⎪⎪
⎩
⎪⎪⎪⎪
Degrees of freedom = n1 + n2 − 2
Degrees of freedom =
• The larger the t-score then the more ________________ the populations.
• The smaller the t-score then the more ________________ the populations.
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 8
PRACTICE: T-TEST CALCULATIONS 1
EXAMPLE: A student wishing to calculate the amount of arsenic in cigarettes decides to run two separate methods in her analysis. The results (shown in ppm) are shown below:
Sample Method 1 Method 2
1 110.5 104.7
2 93.1 95.8
3 63.0 71.2
4 72.3 69.9
5 121.6 118.7
Is there a significant difference between the two analytical methods under a 95% confidence interval?
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 9
PRACTICE: T-TEST CALCULATIONS 2
EXAMPLE: You want to determine if concentrations of hydrocarbons in seawater measured by fluorescence are significantly different than concentrations measured by a second method, specifically based on the use of gas chromatography/flame ionization detection (GC-FID). You measure the concentration of a certified standard reference material (100.0 µM) with both methods seven (n=7) times. Specifically, you first measure each sample by fluorescence, and then measure the same sample by GC-FID. The concentrations determined by the two methods are shown below.
[fluorene (µM)]
Sample Fluorescence GC-FID
1 100.2 101.1 2 100.9 100.5 3 99.9 100.2 4 100.1 100.2 5 100.1 99.8 6 101.1 100.7 7 100.0 99.9
Calculate the appropriate t-statistic to compare the two sets of measurements.
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 10
PRACTICE: T-TEST CALCULATIONS 3
EXAMPLE: A sample of size n = 100 produced the sample mean of 16. Assuming the population deviation is 3, compute a 95% confidence interval for the population mean.
PRACTICE: The average height of the US male is approximately 68 inches. What is the probability of selecting a group of males with average height of 72 inches or greater with a standard deviation of 5 inches?
-3 0 z +3
Probability
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 11
CONCEPT: F-TEST
The f-test is used to test the _______________ of two populations, which recall is equal to the standard deviation2.
• FCalculated represents the quotient of the squares of the standard deviations:
• When calculating the f quotient always set the larger standard deviation as the numerator so that F ≥1
• If FCalculated ________ FTable then the difference will not be significant.
tCalculated =x1 − x2
spooled⋅
n1n2n1 + n2
spooled =s12 (n1 −1)+ s2
2 (n2 −1)n1 + n2 − 2
• If FCalculated ________ FTable then the difference will be significant.
t − calculated =x1 − x2
s12
n1+s22
n2
s12
n1+s22
n2
⎞
⎠⎟2⎛
⎝⎜⎜
s12
n1
⎞
⎠⎟2⎛
⎝⎜⎜
n1 −1+
s22
n2
⎞
⎠⎟2⎛
⎝⎜⎜
n2 −1
⎫
⎬
⎪⎪⎪⎪
⎭
⎪⎪⎪⎪
⎧
⎨
⎪⎪⎪⎪
⎩
⎪⎪⎪⎪
Degrees of Freedom for s1Degrees of Freedom for s2 2 3 4 5 6 7 8 9 10 12 15 20 30 ∞
FCalculated =s12
s22
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 12
PRACTICE: F-TEST CALCULATIONS 1
EXAMPLE 1: In the process of assessing responsibility for an oil spill, two possible suspects are identified. To differentiate between the two samples of oil, the ratio of the concentration for two polyaromatic hydrocarbons is measured using fluorescence spectroscopy. These values are then compared to the sample obtained from the body of water:
Mean Standard Deviation # Samples
Suspect 1 2.31 0.073 4
Suspect 2 2.67 0.092 5
Sample 2.45 0.088 6
From the above results, should there be a concern that any combination of the standard deviation values demonstrates a significant difference?
EXAMPLE 2: Can either (or both) of the suspects be eliminated based on the results of the analysis at the 99% confidence interval?
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 13
PRACTICE: F-TEST CALCULATIONS 2
EXAMPLE 1: You are measuring the effects of a toxic compound on an enzyme. You expose five (test tubes of cells to 100 µL of a 5 ppm aqueous solution of the toxic compound and mark them as treated, and expose five test tubes of cells to an equal volume of only water and mark them as untreated. You then measure the enzyme activity of cells in each test tube; enzyme activity is in units of µmol/minute. The following are the measurements of enzyme activity:
Activity (Treated) Activity (Untreated)
Tube (µmol/min) Tube (µmol/min)
1 3.25 1 5.84
2 3.98 2 6.59
3 3.79 3 5.97
4 4.15 4 6.25
5 4.04 5 6.10
Average: 3.84 Average: 6.15
Standard Standard
Deviation: 0.36 Deviation: 0.29
Is the variance of the measured enzyme activity of cells exposed to the toxic compound equal to that of cells exposed to water alone?
EXAMPLE 2: Is the average enzyme activity measured for cells exposed to the toxic compound significantly different (at 95% confidence level) than that measured for cells exposed to water alone?
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 14
CONCEPT: DETECTION OF GROSS ERRORS
Grubbs test is used to detect a single outlier in a single variable data set that follows some type of normal distribution.
Grubbs Test
GCalculated =Questionable value− x
s
GTable <GCalculated ∴ Disregard Value
GTable >GCalculated ∴ Hold Value
Number GTable or GCritical of Observations (90% Confidence) (95% Confidence) (99% Confidence)
3 1.153 1.154 1.155
4 1.463 1.481 1.496
5 1.671 1.715 1.764
6 1.822 1.887 1.973
7 1.938 2.020 2.139
8 2.032 2.127 2.274
9 2.110 2.215 2.387
10 2.176 2.290 2.482
The Q-Test is another method used in finding outliers in very small, normally distributed, data sets.
• The number of measurements is normally between 3 to 7 values.
Q-Test
Number QTable or QCritical of Observations (90% Confidence) (95% Confidence) (99% Confidence)
3 0.941 0.970 0.994
4 0.765 0.829 0.926
5 0.642 0.710 0.821
6 0.560 0.625 0.740
7 0.507 0.568 0.680
8 0.468 0.526 0.634
9 0.437 0.493 0.598
10 0.412 0.466 0.568
QTable <QCalculated ∴ Disregard Value
QTable >QCalculated ∴ Retain Value
x1 = _____________________xn+1 = _____________________r = range (largest − smallest value in data set)
QCalculated =GapRange
=x1 − xn+1r
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 15
PRACTICE: DETECTION OF GROSS ERRORS CALCULATIONS 1
EXAMPLE 1: Wishing to measure the amount of caffeine in a cup of coffee you pour ten cups. From the data provided perform a Q-test to determine if the outlier can be retained or disregarded.
Caffeine per cup of coffee
Cup of Coffee
1
2
3
4
5
6
7
8
9
10
ppm of coffee
81
83
78
82
72
79
77
81
82
78
EXAMPLE 2: White blood cells are the defending cells of the human immune system and fight against infectious diseases.
Provided below is the “normal” white blood cell counts for a healthy adult woman. Determine if the current white
blood cell count is reasonable by Grubbs test.
"Normal" Days Today
5.1×106 cells /µL5.4×106 cells /µL4.9×106 cells /µL5.2×106 cells /µL5.3×106 cells /µL5.0×106 cells /µL
6.1×106 cells /µL
ANALYTICAL CHEMISTRY - CLUTCH 1E
CH.4 + 5 - STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS
Page 16