IB Biology 2009-2010 Recording Data & Statistical Tests IB Biology 2009-2010.

2009-2010 IB Biology

Recording Data & Statistical Tests

IB Biology 2009-2010

IB Biology

Recording RAW Data Be sure all data tables have

DESCRIPTIVE TITLE Not “the results” or “data” Include some mention of methodology

If handwritten, use ruler to draw gridlines Units and error recorded in heading of table

ALL measurements recorded to the same level of precision ) watch decimal places

Include qualitative data observed during the investigation

IB Biology

“Complete”Table 1: Lengths of two plant tissues, potato (Solanum) and apple

(Malus) after soaking in solutions of sucrose of different concentrations. The initial lengths were 4.0 cm.

It was also noticed before the soaking that the potato tissue floated in the solution from 0.4 to 1.0 mol. The apple tissue, however, only floated in the solutions from 0.6 to 1.0 mol. After soaking the tissues became softer at higher sucrose concentrations but they were quite hard in the lower concentrations.

IB Biology

“Partial”

IB Biology

“Partial”

IB Biology

“Not at all”

IB Biology

Processing Data Processing means mathematical calculation has

occurred Means are most common method for recording data in

Biology Other formulas may be used (rate, change, % change,

stoichiometry, etc) Show sample calculation (except mean) Don’t forget units and error

Mean can’t be more precise than raw data Use standard deviation as SEM when means used

Make NEW data table for processed data for clarity You want DATA TABLE AND GRAPH Graph PROCESSED DATA!!!!!

IB Biology

Standard Deviation The standard deviation is a statistic

that tells you how tightly all the various examples are clustered around the mean in a set of data. 68% fall +/- 1 95% fall +/- 2 99% fall +/- 3

IB Biology

Standard Error The standard error of the mean This statistic gives us an idea of how “good” our sample

is. The narrower the limits are the closer the mean of the sample represents the mean of the population

The standard error of the mean is given by Its size indicates how representative your mean is.The

sample mean ± gives the 95% confidence limits. In other words, we are 95% sure that the mean of the

population lies somewhere between: NB We would only use this formula for n>30. If n<30

and we are fairly sure the population we are sampling from has normal distribution we replace 1.96 by the 95% point from our t-test tables with n-1 degrees of freedom.

IB Biology

Calculating Standard Deviation EXCEL – use built in function

IB Biology

Calculating Standard Deviation TI-83/84

Enter data into List in STAT menu

Use STAT CALC Select “1:1-Var Stats” Type L1 (or list used)

IB Biology

The t-test Used to determine if two sets of data

are significantly different Paired t-test (same subjects receive

both treatments) Unpaired t-test (compares two groups,

but individuals in groups are different)

IB Biology

Null and Alternate Hypotheses Null Hypothesis

There is no significant difference between the results of the two groups The treatment HAD NO effect

Alternate Hypothesis There IS a significant difference

between the results of the two groups The treatment HAD an effect This is usually the hypothesis we stated in

lab

IB Biology

Level of significance (p-value) p-value indicates the probability that

the differences between the two groups is due to chance

If p<.05 then we reject the null hypothesis (test supports the alternate hypothesis)

If p>0.05, we cannot reject the null hypothesis (note wording)

IB Biology

t-test in excel In Excel the t test is performed using

the formula: =TTEST (range1, range2, tails, type) For the examples you'll use in biology,

tails is always 2 (for a "two-tailed" test), and type can be 1 or 2 depending on the circumstances. Type 1 = paired Type 2 = unpaired

IB Biology

Type 2 – unpaired test

Number of WBC in infected vs uninfected patients

IB Biology

Type 1 – paired test

Pulse rate measured before and after eating

IB Biology

Formula – Unpaired t-test

IB Biology

Formula – Paired t-test

IB Biology

Paired T-test on TI-83/84 [STAT] [1]; cursor to L1 column

heading, press [CLEAR] [ENTER], and enter the “Before” numbers.

Cursor to L2 column heading, press [CLEAR] [ENTER], and enter the “After” numbers. You want to get the column of differences, After−Before, That’s the same as L2−L1, which is the formula for L3.

Cursor to L3 column heading. Enter the formula, [2nd 2 makes L2] [−] [2nd 1 makes L1] [ENTER].

NOTE – you need to make print data table with these differences and means.

Pulse Rate (+/- 1 BPM) before and after 100 mg

caffeine consumption

IB Biology

Paired t-test, cont’d Use 1 VAR Stats to get mean and std dev

[STAT] [►] [1] for 1-VarStats, then [2nd 3 makes L3]. Since the sample statistics are automatically pasted

to the t test screen, you don’t need to copy them. Just enter the μ (hypothesized mean/difference) and the “>” from your H1.[STAT] [◄] [2] for T-Test Enter 0 for μo. Skip past xX , s, and n. Enter >μo (since you hypothesize caffeine increases HR), then select CALCULATE or DRAW.

In this case, p< 0.05, so the null hypothesis is rejected… the alt hyp that caffeine increases HR is supported

IB Biology

Showing work using TI-83/84 Write what you entered in each list Record stats for mean and SD State t-test statistic and p-value State what this info “tells” you

IB Biology

UNPAIRED t-test on TI-83/84USE this test when populations samples aren’t

the same individuals Select 2-SampTTest Enter mean, sample size (n) and

SD for each sample Select appropriate alt hypothesis Selected POOLED data Calculate results Note, in this

case p>0.05 Null hypothesis

NOT rejected NO sig difference in

two samples

IB Biology

Problem #1 As a biochemist working for a pharmaceutical company, your

job is to test new drugs for possible side effects, both deterimental and beneficial. The chemistry of agent TFK-05W suggests that it may have the side effect of reducing the tendency toward obesity. The Zucker rat is an established genetic model for both obesity and hypertension. As rats of the obese strain age they gain weight much more rapidly than do the so-called "lean" Zucker strain. You plan to treat a group of animals over a period of one year and compare their average weight with that of a group of untreated animals.

The odds of the drug actually showing this side effect are small and maintenance of rats for a year is expensive, so you limited the scope of the study to twelve animals in each group. The null hypothesis is that treated animals will show an average weight gain over one year that is no different from the average weight gain of untreated animals. What statistical test will you use to compare the two means?

IB Biology

Answer You'll run a t test for independent samples. It

doesn't matter that the number of animals in each data set is the same, nor that they are all the same type of animal. You sampled 12 treated individuals and 12 different untreated individuals. There is no special relationship between a data point from one group and any particular data point from a second. The sampling method was independent.

IB Biology

Problem #2

This study follows from problem #1. Not only did your study suggest that the agent TFK-05W indeed does affect weight gain, but it also proved effective and safe (so far) and it is in clinical trials. Because the drug was designed to treat symptoms that have nothing to do with obesity, the clinical trials do not focus on that problem and won't answer the question of whether or not the agent is a potential weight loss drug. The company, however, has permitted you to test the agent on a group of 12 people with morbid obesity, who have signed the appropriate consent forms.

This time the plan is to treat the 12 obese individuals for a year, having measured their weights on the day treatment was started. The paid participants will be monitoring their weight regularly, taking the drug, and are required to keep a daily log of activity and eating habits so that the experiment can be properly controlled. Nevertheless, the simplest initial test of the hypothesis that obese individuals treated with TFK-05W for a year will show an average weight loss is to compare average weight at the beginning and end of the experiment. Thus, as with problem #1, you will have two sets of 12 data points each to compare. What statistical test will you use to compare the two means?

IB Biology

Answer This study calls for running a paired t test. The same

individuals were sampled (weights measured) at the beginning and at the end of the study. Thus each data point in the first set can be paired with a data point from the same individual in the second set.

Variability among distinct individuals contributes considerable experimental error to many experiments. Such error can mask effects, especially small effects, even if the null hypothesis is indeed false. For example, if the average individual lost 10 pounds but the standard deviation at the beginning of the experiment was 55 pounds, the loss might not show up as a significant difference. By controlling for individual variability the paired t test can focus on the average change in weight.

IB Biology

Problem #3 Embryonic cells (stem cells) from a single human blastula are genetically equivalent. Any of

them has the potential to form any kind of tissue that is normally found in an adult human body. Exploitation of stem cells for therapeutic purposes has the potential to revolutionize medicine and expand the average human lifespan considerably. Stem cells have very complex cultural requirements. So far your stem cell lines require the addition of fetal bovine serum to the culture medium, and the exact composition (and efficacy) of animal sera varies from lot to lot, To exercise the greatest control over your experiments, it would be valuable to be able to culture your cells in a synthetic medium that includes only those components that are essential to support survival and growth of your cultures.

You have developed a synthetic medium that keeps your cells going for several days, but not indefinitely. You think that you can extend the life of your cultures by adding an expensive hormone to the medium. To test the hypothesis that stem cells cultured with medium 2 will survive longer than stem cells cultured with medium 1, you will set up twenty cultures from ten original embryos, growing them in complex medium to the point at which each culture contains about 100 cells. You will then remove the original medium and feed the cultures from now on with synthetic medium. One culture from each original embryo will receive medium 1 and the other medium 2. For your data you will record the time at which each culture declines to the point of having only 50% of its original viable cells remaining. The null hypothesis is that this average "survival time" will be the same for cultures treated with either synthetic medium. Alternative hypotheses, of course, are that feeding with one or the other medium will enhance survival time by comparison.

What statistical test will you run on the two sets of 10 data points each?

IB Biology

Answer This case can be thought of as a set of replicate experiments.

In each experiment one culture from a single source was fed one medium and a second culture from the same source was fed the other medium. The experiment was replicated 10 times, using 10 different sources. Since each replicate experiment consists of a pair of data points linked by the common origin of the respective cultures, you have a set of 10 pairs of data (two sets of paired data).

A paired t test is appropriate for the same reasons it was appropriate for problem #2. The paired method controls for experimental error that might be contributed by the 10 different sources.

Why not conduct all of the replicate experiments on cultures from a single source, eliminating all experimental error that is contributed by individual variability? Then we run the risk that the result won't hold for cultures from other embryos. We want to know if the medium we are testing will work for most or all cultures, not only for cultures from one particular embryo.

IB Biology

Problem #4 You suspect that a cause of decline of your stem cell cultures is a failure to

produce sufficient superoxide dismutase to rid the cells of oxygen free radicals. You have an assay for the enzyme, but to conduct the assay you must destroy the culture. From a single source of stem cells you can prepare about thirty cultures that remain healthy about 10 days in your synthetic medium after growing to a sufficient number of cells to permit you to run your assay. From then on, they decline rapidly.

You prepared thirty cultures from the same source and sampled half of them when alll thirty cultures had reached the point at which the assay was feasible. You then sampled the other half 10 days later. Your null hypothesis is that superoxide dismutase activity will not be significantly different between the two sets of cultures. What statistical test will you run to determine whether or not the difference is significant?

Normally, this would be a rather poor experimental design, because all of the cultures are identical. Why conduct replicate sampling on the exact same culture? The issue is that enzyme assays are notoriously inaccurate. The chances of mulitple comparisons yielding dubious results are much smaller than for a single comparison. The p value that we obtain will give us a fairly accurate estimate of the level of confidence with which we can interpret the result.

IB Biology

Answer This time your samples are all coming from the same

population of cultures, presumably all identical except that half of them were sampled at one time and half at the other time. All of the data points are linked by the fact that they were obtained from cultures from a common source. However, there is no special one to one correspondence between any one data point in one set and a unique data point in the other. There is no basis for a paired t test, so we must run a test for independent samples.

The assay itself is the variable in this example. If the assay was 100% accurate and reliable, we would only have needed one sample at each time. On the other hand, any significant difference should be considered preliminary until the experiment can be repeated on at least one or two more sets of cultures.

IB Biology

Problem #5

Referring to problem #1, one of your rats died of natural causes during the study, leaving 11 animals in one group while you still had 12 animals in the other. Does this turn of events ruin the experiment? If you do conduct the analysis, how will you modify it, if at all?

IB Biology

You lost one animal, but because each data set represents an independent sample it is not necessary that the numbers of data points be equal. You conduct the t test for independent samples, comparing a set of 11 data points with a set of 12.

IB Biology

Problem #6

Referring to problem #2, one of your human subjects died of a massive heart attack halfway through the study. The death was clearly not related to the drug treatments. Please answer the same questions as for problem #5.

IB Biology

This time you lost both data points that were to be contributed by the deceased individual. You now have 11 data points in each set. It shouldn't be a problem unless the others start dropping off as well.

IB Biology

Problem #7

In problem #2 it was stated that Zucker rats treated with the agent TFK-05W were significantly less obese than untreated rats. Does this mean that when the t test was run it returned a probability (p) value of > 0.05 or a p value of < 0.05?

IB Biology

P would be <.05 The p value is the probability that the null hypothesis is

accurate. The higher the p value, the greater is the probability that there is no significant difference between means. A probability of 0.05 (1 in 20 chance) or less is considered sufficient evidence on which to reject the null hypothesis. Rejecting a null hypothesis means we accept an alternative. The result with the Zucker rats was that the treated group weighed less, so we accept the alternative hypothesis that the drug reduced weight gain. The other alternative, which was not supported by the data, would be that the drug caused additional weight gain.

Of course, working with probabilities there is always a chance that the results of an experiment are simply wrong. Realistically, though, experimental results are seldom wrong due to an improbable distribution of samples. They are usually wrong because of a bad experiment, especially when an experiment is not well controlled.

IB Biology

Problem #8

For the third problem the difference between means was 12 hours, that is, the average half life of one culture was 12 hours longer than the average half life of the other one. The t test returned a p value of 0.33. What is your conclusion regarding the original hypothesis?

Would you be correct in stating that the result is significant or insignificant?

IB Biology

p > 0.05 so don’t reject H0 The difference in sample means may have been 12

hours, but apparently there was enough variability among cultures that the difference was not significant. With p > 0.3 it is unlikely that you will get a significant difference even by testing more cultures.

Unless a difference is supported by probability it is not considered significant at all. Think about it. Maybe the range over which cultures lasted was quite wide. Perhaps the difference in sample means would be reduced to zero or even reduced if you just switched two data points.

You would not be correct to say that the result is insignificant. The result is that medium 2 has no apparent effect on longevity of a culture, and that finding is indeed significant. The difference in means was insignificant, not the result itself.

IB Biology

Problem #9

In problem #4, average superoxide dismutase activity was 30% lower after 10 days than it was in the beginning. The t test gave a p value of 0.07. What result do you report? It turns out that this study is very important, and if you indeed find that a decline in superoxide dismutase activity is a primary cause of cell death then good things will happen to your career. How will you proceed from this point?

IB Biology

Repeat, but don’t falsify results With a p value of 0.07 you are so close to finding a significant

difference that it is sorely tempting to drop a data point in favor of your hypothesis, or maybe "round off" to 0.05. Scientific integrity requires that you treat the data as they stand, however. You do not have sufficient evidence with which to reject the null hypothesis.

How to proceed? Because it is so important to you that you come to a conclusion, an appropriate course of action is to repeat the experiment. Analysis will be more complicated because the second experiment will be conducted on a different set of cultures. To keep it simple you could repeat the experiment twice and if the results are consistent, pool all of the data points. By the way, with 45 data points in each set you probably would no longer need a t test. You could base your analysis on the normal distribution and simply look at overlap between standard deviations in order to determine a p value.

IB Biology 2009-2010 Recording Data & Statistical Tests IB Biology 2009-2010.

Documents