Top Banner
Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n from N f = n/N = sampling fraction
65

Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Dec 22, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Probability Sampling

uses random selection N = number of cases in sampling framen = number of cases in the sample

NCn = number of combinations of n from Nf = n/N = sampling fraction

Page 2: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Variations

Simple random sampling based on random number generation

Stratified random sampling divide pop into homogenous subgroups, then simple

random sample w/in

Systematic random sampling select every kth individual (k = N/n)

Cluster (area) random sampling randomly select clusters, sample all units w/in cluster

Multistage sampling combination of methods

Page 3: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Non-probability sampling

accidental, haphazard, convenience sampling ...may or may not represent the population well

Page 4: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Measurement

... topics in measurement that we don’t have time to cover ...

Page 5: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Research Design

Elements: Samples/Groups Measures Treatments/Programs Methods of Assignment Time

Page 6: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Internal validity

the approximate truth about inferences regarding cause-effect (causal) relationships can observed changes be attributed to the program or intervention and NOT to other possible causes (alternative explanations)?

Page 7: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Establishing a Cause-Effect Relationship

Temporal precedenceCovariation of cause and effect if x then y; if not x then not y if more x then more y; if less x then

less y

No plausible alternative explanations

Page 8: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Single Group Example

Single group designs: Administer treatment -> measure outcome X -> O

assumes baseline of “0” Measure baseline -> treat -> measure

outcome0 X -> O

measures change over baseline

Page 9: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Single Group Threats

History threat a historical event occurs to cause the outcome

Maturation threat maturation of individual causes the outcome

Testing threat act of taking the pretest affects the outcome

Instrumentation threat difference in test from pretest to posttest affects the

outcomeMortality threat

do “drop-outs” occur differentially or randomly across the sample?

Regression threat statistical phenomenon, nonrandom sample from

population and two imperfectly correlated measures

Page 10: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Addressing these threats

control group + treatment group both control and treatment groups

would experience same history and maturation threats, have same testing and instrumentation issues, similar rates of mortality and regression to the mean

Page 11: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Multiple-group design

at least two groupstypically: before-after measurement treatment group + control group treatment A group + treatment B

group

Page 12: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Multiple-Group Threats

internal validity issue: degree to which groups are

comparable before the study “selection bias” or “selection threat”

Page 13: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Multiple-Group Threats

Selection-History Threat an event occurs between pretest and posttest that groups

experience differentlySelection-Maturation Threat

results from differential rates of normal growth between pretest and posttest for the groups

Selection-Testing Threat effect of taking pretest differentially affects posttest outcome of

groupsSelection-Instrumentation Threat

test changes differently for the two groupsSelection-Mortality Threat

differential nonrandom dropout between pretest and posttestSelection-Regression Threat

different rates of regression to the mean in the two groups (if one is more extreme on the pretest than the other)

Page 14: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Social Interaction Threats

Problem: social pressures in research context can

lead to posttest differences that are not directly caused by the treatment

Solution: isolate the groups Problem: in many research contexts, hard

to randomly assign and then isolate

Page 15: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Types of Social Interaction Threats

Diffusion or Imitation of Treatment control group learns about/imitates experience of

treatment group, decreasing difference in measured effect

Compensatory Rivalry control group tries to compete w/treatment group, works

harder, decreasing difference in measured effect

Resentful Demoralization control group discouraged or angry, exaggerates

measured effect

Compensatory Equalization of Treatment control group compensated in other ways, decreasing

measured effect

Page 16: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Intro to Design/ Design Notation

Observations or MeasuresTreatments or ProgramsGroupsAssignment to GroupTime

Page 17: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Observations/Measure

Notation: ‘O’ Examples:

Body weight Time to complete Number of correct response

Multiple measures: O1, O2, …

Page 18: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Treatments or Programs

Notation: ‘X’ Use of medication Use of visualization Use of audio feedback Etc.

Sometimes see X+, X-

Page 19: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Groups

Each group is assigned a line in the design notation

Page 20: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Assignment to Group

R = randomN = non-equivalent groupsC = assignment by cutoff

Page 21: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Time

Moves from left to right in diagram

Page 22: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Types of experiments

True experiment – random assignment to groupsQuasi experiment – no random assignment, but has a control group or multiple measuresNon-experiment – no random assignment, no control, no multiple measures

Page 23: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Design Notation ExampleR O1 X O1,2

R O1 O1,2

Pretest-posttest treatment versus

comparison group

randomized experimental design

Page 24: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Design Notation Example

N O X O

N O O

Pretest-posttest

Non-Equivalent Groups

Quasi-experiment

Page 25: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Design Notation ExampleX O

Posttest Only

Non-experiment

Page 26: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Goals of design ..

Goal:to be able to show causalityFirst step: internal validity: If x, then y AND If not X, then not Y

Page 27: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Two-group Designs

Two-group, posttest only, randomized experiment

R X O

R O

Compare by testing for differences between means of groups, using t-test or one-way Analysis of Variance(ANOVA)

Note: 2 groups, post-only measure, two distributions each with mean and variance, statistical (non-chance) difference between groups

Page 28: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

To analyze …

What do we mean by a difference?

Page 29: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Possible Outcomes:

Page 30: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Measuring Differences …

Page 31: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Three ways to estimate effect

Independent t-testOne-way Analysis of Variance (ANOVA)Regression Analysis (most general)

equivalent

Page 32: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Computing the t-value

Page 33: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Computing the variance

Page 34: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Regression Analysis

Solve overdetermined system of equations for β0 and β1, while minimizing sum of e-terms

Page 35: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Regression Analysis

Page 36: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

ANOVA

Compares differences within group to differences between groupsFor 2 populations, 1 treatment, same as t-testStatistic used is F value, same as square of t-value from t-test

Page 37: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Other Experimental Designs

Signal enhancers Factorial designs

Noise reducers Covariance designs Blocking designs

Page 38: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Factorial Designs

Page 39: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Factorial Design

Factor – major independent variable Setting, time_on_task

Level – subdivision of a factor Setting= in_class, pull-out Time_on_task = 1 hour, 4 hours

Page 40: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Factorial Design

Design notation as shown2x2 factorial design (2 levels of one factor X 2 levels of second factor)

Page 41: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Outcomes of Factorial Design Experiments

Null caseMain effectInteraction Effect

Page 42: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

The Null Case

Page 43: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

The Null Case

Page 44: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Main Effect - Time

Page 45: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Main Effect - Setting

Page 46: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Main Effect - Both

Page 47: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Interaction effects

Page 48: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Interaction Effects

Page 49: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Statistical Methods for Factorial Design

Regression AnalysisANOVA

Page 50: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

ANOVA

Analysis of variance – tests hypotheses about differences between two or more meansCould do pairwise comparison using t-tests, but can lead to true hypothesis being rejected (Type I error) (higher probability than with ANOVA)

Page 51: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Between-subjects design

Example: Effect of intensity of background

noise on reading comprehension Group 1: 30 minutes reading, no

background noise Group 2: 30 minutes reading,

moderate level of noise Group 3: 30 minutes reading, loud

background noise

Page 52: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Experimental Design

One factor (noise), three levels(a=3)Null hypothesis: 1 = 2 = 3

Noise None Moderate High

R O O O

Page 53: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Notation

If all sample sizes same, use n, and total N = a * nElse N = n1 + n2 + n3

Page 54: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Assumptions

Normal distributions

Homogeneity of variance Variance is equal in each of the

populations

Random, independent samplingStill works well when assumptions not quite true(“robust” to violations)

Page 55: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

ANOVA

Compares two estimates of variance MSE – Mean Square Error, variances

within samples MSB – Mean Square Between, variance

of the sample means

If null hypothesis is true, then MSE approx = MSB, since

both are estimates of same quantity Is false, the MSB sufficiently > MSE

Page 56: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

MSE

Page 57: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

MSB

Use sample means to calculate sampling distribution of the mean,

= 1

Page 58: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

MSB

Sampling distribution of the mean * nIn example, MSB = (n)(sampling dist) = (4) (1) = 4

Page 59: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Is it significant?

Depends on ratio of MSB to MSEF = MSB/MSEProbability value computed based on F value, F value has sampling distribution based on degrees of freedom numerator (a-1) and degrees of freedom denominator (N-a)Lookup up F-value in table, find p valueFor one degree of freedom, F == t^2

Page 60: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Factorial Between-Subjects ANOVA, Two factors

Three significance tests Main factor 1 Main factor 2 interaction

Page 61: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Example Experiment

Two factors (dosage, task)3 levels of dosage (0, 100, 200 mg)2 levels of task (simple, complex)2x3 factorial design, 8 subjects/group

Page 62: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Summary tableSOURCE df Sum of Squares Mean Square F pTask 1 47125.3333 47125.3333 384.174 0.000 Dosage 2 42.6667 21.3333 0.174 0.841 TD 2 1418.6667 709.3333 5.783 0.006 ERROR 42 5152.0000 122.6667 TOTAL 47 53738.6667

Sources of variation: Task Dosage Interaction Error

Page 63: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Results

Sum of squares (as before)Mean Squares = (sum of squares) / degrees of freedomF ratios = mean square effect / mean square errorP value : Given F value and degrees of freedom, look up p value

Page 64: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Results - example

Mean time to complete task was higher for complex task than for simpleEffect of dosage not significantInteraction exists between dosage and task: increase in dosage decreases performance on complex while increasing performance on simple

Page 65: Probability Sampling uses random selection N = number of cases in sampling frame n = number of cases in the sample N C n = number of combinations of n.

Results