Top Banner
Introduction • Populations and Samples – Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population of measurements as opposed to individuals or units. – Sample - Subset of a population that is observed and measured by investigators.
30

Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Dec 23, 2015

Download

Documents

Pearl Jones
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Introduction

• Populations and Samples– Population - Set of all individuals or units of

interest to investigators. Sometimes we may refer to a population of measurements as opposed to individuals or units.

– Sample - Subset of a population that is observed and measured by investigators.

Page 2: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Quantitative and Qualitative Variables

• Quantitaive variables take on numeric values. They can be further classified as:– Continuous variables can take on values along an interval

(e.g. blood pressure, temperature)

– Discrete variables can take on distinct values with “breaks” (e.g. Woman’s parity, Number of prior cardiac events)

• Qualitative variables take on various categories. They can be classified as:– Nominal variables take on values with no inherent ordering

(e.g. Presence/Absence of parasite, gender, race)

– Ordinal variables take on categories that can be ordered (e.g. Prognosis, Attitude toward a proposal)

Page 3: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Dependent and Independent Variables

• Dependent variables are outcomes of interest to investigators. Also referred to as Responses or Endpoints

• Independent variables are Factors that are often hypothesized to effect the outcomes (levels of dependent variables). Also referred to as Predictor or Explanatory Variables

• Research ??? Does I.V. D.V.

Page 4: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Example - Clinical Trials of Cialis

• Clinical trials conducted worldwide to study efficacy and safety of Cialis (Tadalafil) for ED

• Patients randomized to Placebo, 10mg, and 20mg• Co-Primary outcomes:

– Change from baseline in erectile dysfunction domain if the International Index of Erectile Dysfunction (Numeric)

– Response to: “Were you able to insert your P… into your partner’s V…?” (Nominal: Yes/No)

– Response to: “Did your erection last long enough for you to have succesful intercourse?” (Nominal: Yes/No)

Source: Carson, et al. (2004).

Page 5: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Example - Clinical Trials of Cialis

• Population: All adult males suffering from erectile dysfunction

• Sample: 2102 men with mild-to-severe ED in 11 randomized clinical trials

• Dependent Variable(s): Co-primary outcomes listed on previous slide

• Independent Variable: Cialis Dose: (0, 10, 20 mg)• Research Questions: Does use of Cialis improve

erectile function?

Page 6: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Parameters and Statistics

• Parameters: Numerical descriptive measures for Populations: - Mean (average) of a numeric variable 2 - Variance - Standard deviation of a numeric variable CV - Coefficient of variation of a numeric variable - Proportion of population with a nominal characteristic

Page 7: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Parameters and Statistics

• Statistics: Numerical descriptive measures for Samples– Sample Mean (of a sample of size n):

n

yy

^

– Sample Variance (s2) and standard deviation (s):

22

2

1

)(ss

n

yys

– Sample coefficient of variation (cv): %100

y

scv

– Sample Proportion with a characteristic: ^

Page 8: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Example - Carbonate of Bismuth• Samples of Carbonate of Bismuth from a sample of 6

London manufacturing chemists

• Measurements: Quantity of Teroxide (Theoretically should be 88.30 per 100 parts)

• Measured levels: 89, 88.5, 86.16, 87.66, 87.66, 86

%39.1%1005.87

21.121.147.1

47.116

)5.8786()5.8766.87()5.8766.87()5.8716.86()5.875.88()5.8789(

50.876

8666.8766.8716.865.8889

2222222

cvs

s

y

Source: Umney (1864)

Page 9: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Example - Clinical Trials of Cialis

• Among the 638 patients receiving placebo (dose=0), 198 responded “Yes” to “Did your erection last long enough for you to have succesful intercourse?”

• Of 321 receiving 10mg dose, 186 replied “Yes”• Of 1143 receiving 20mg dose, 777 replied “Yes”

68.1143

77758.0

321

18631.0

638

19820

^

10

^

0

^

Note that proportions are often reported as percentages (number with characteristic per 100 exposed) or as rates per 10,000 such as mortality rates for rare causes

Page 10: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Graphical Techniques

• Pictures are worth a bunch of words and computer packages make graphing easy!– Histograms show the number or percent by

category or within ranges of values– Pie charts show proportionally the number or

percent by category or within ranges of values– Scatterplots plot a dependent variable on the

vertical axis versus an independent variable with each subject being a point on the chart

Page 11: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Histogram of ED Severity Level

• In the Cialis trial, the baseline severity level was reported for 2099 patients on an ordinal scale: 1=Normal, 2=Mild, 3=Moderate, 4=Severe

SEVERITY

4.03.02.01.0

Cases w eighted by PATIENTS

800

600

400

200

0

Std. Dev = .92

Mean = 2.9

N = 2099.00

Page 12: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Pie Chart of ED Severity Level

Cases weighted by PATIENTS

Severe

Moderate

Mild

Normal

Page 13: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Histogram of Disposition by Dose (Count=%)

Placebo

10mg

20mg

dose

Bars show counts

2 4 6

dispose

0

25

50

75

Disposition:

1=Completed

2=Adverse event

3=Lack of Efficacy

4=Lost to follow-up

5=Patient Decision

6=Protocol Violation

7=Others

Page 14: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Scatterplot of Math Score vs LSD Level

• Response - Mean Math score for 7 subjects • Predictor - Mean LSD Concentration

Source: Wagner and Bing (1968)

20

30

40

50

60

70

80

mat

hsco

re

1 2 3 4 5 6 7lsdconc

Bivariate Scattergram

Conc Score1.17 78.932.97 58.203.26 67.474.69 37.475.83 45.656.00 32.926.41 29.97

Page 15: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Basic Probability

• Probability measures the likelihood or chances of particular outcomes (or events) of random experiment or observation

• Let A and B be two events, with probabilities P(A) & P(B):– Intersection - Event that both A and B occur (Notation: AB)

– Union - Event that either A and/or B occur (Notation: AB)

– Complement - Event that the event does not occur (Notation: Ā)

• Probability Rules:

)(1)(

)|()()|()()(

)(

)()|(

)()()()(

APAP

ABPBPABPAPABP

BP

ABPBAP

ABPBPAPBAP

= P(A occurs Given B has occurred)

Page 16: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Example - High Cholesterol By Age and Sex

• WHO MONICA Survey of 50000 Adults

• Proportions by Age, Gender, and Cholesterol:

High Chol Low Chol High Chol Low Chol Total25-29 0.0066 0.0486 0.0046 0.0527 0.112530-34 0.0111 0.0542 0.0056 0.0563 0.127235-39 0.0167 0.0476 0.0079 0.0582 0.130440-44 0.0196 0.0457 0.0100 0.0526 0.127945-49 0.0207 0.0440 0.0172 0.0490 0.130950-54 0.0222 0.0430 0.0256 0.0400 0.130855-59 0.0229 0.0425 0.0303 0.0342 0.129960-64 0.0185 0.0344 0.0304 0.0280 0.1113Total 0.1383 0.3600 0.1316 0.3710 1

Male Female

Source: Gostynski, et al (2004)

Page 17: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Example - High Cholesterol By Age and Sex

• Probability a Randomly Selected Subject is Male:

4983.3600.1383.)&()&()( LCMPHCMPMP

• Probability a Randomly Selected Subject is over 40 years:

6308.1113.1299.1308.1309.1279.

)6460()5955()5450()4945()4440()40(

PPPPPP

• Probability Female given subject has High Cholesterol:

4876.2699.

1316.

)(

)&()|(

2699.1316.1383.)&()&()(

1316.)&(

HCP

HCFPHCFP

HCFPHCMPHCP

HCFP

Page 18: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Independence

• Two events A and B are independent if:

P(A|B) = P(A) or, equivalently P(B|A) = P(B)

• Cholesterol Example:

4876.)|(

5017.4983.1)(1)(

MFP

MPFP

The occurrence of high cholesterol is not independent of gender

Page 19: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Diagnostic Tests

• True state: Disease Present (D+) or Absent (D-) based on a gold standard

• Diagnostic test result: Positive (T+) or Negative (T-) • Subjects can be classified in following table (where

a,b,c, and d are the number of subjects in the 4 cells:

Test Result\True State Positive (D +) Negative (D -) TotalPositive (T +) a b a+bNegative (T -) c d c+dTotal a+c b+d a+b+c+d

Page 20: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Diagnostic Tests

• Sensitivity - The ability for the test to detect that the disease is present: P(T+ | D+)

• Specificity - The ability for the test to detect that the disease is absent: P(T- | D-)

• Positive Predictive Value (PPV) - Proportion of positive test results that actually have the disease

• Negative Predictive Value (NPV)- Proportion of negative test results that do not have the disease

• Overall Accuracy - Proportion of subjects who are correctly diagnosed

Page 21: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Diagnostic Tests Test Result\True State Positive (D +) Negative (D -) TotalPositive (T +) a b a+bNegative (T -) c d c+dTotal a+c b+d a+b+c+d

dcba

daAccuracy

dc

cTDPNPV

ba

aTDPPPV

db

bDTPySpecificit

ca

aDTPySensitivit

:

)|(:

)|(:

)|(:

)|(:

*

*

** Assuming prevalence rates in test subjects is same as in population

Page 22: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Example - Paracheck Test for Plasmodium Falciparum (Pf)

• Goal: Develop an inexpensive test for Pf in asymptomatic children in remote parts of India

• Gold Standard: Microscopy• Diagnostic Test: Paracheck ($0.65/test)

Test Result\True State Positive (D +) Negative (D -) TotalPositive (T +) 119 49 168Negative (T -) 7 398 405Total 126 447 573

Source: Singh, et al (2002)

Page 23: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Example - Paracheck Test for Plasmodium Falciparum (Pf)

Test Result\True State Positive (D +) Negative (D -) TotalPositive (T +) 119 49 168Negative (T -) 7 398 405Total 126 447 573

%)23.90(9023.573

398119

%)27.98(9827.405

398

%)03.78(7803.168

119

%)04.89(8904.447

398

%)44.94(9444.126

119

Accuracy

NPV

PPV

ySpecificit

ySensitivit

Page 24: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Basic Study Designs

• Studies can generally be classified as observational or experimental– Observational - Subjects (or nature) select their

groups (levels of the independent variable)• Studies comparing ethnicities or sexes wrt drug disposition

• Studies of effects of smoking or other behaviors

• Studies comparing effects of patients on different therapies

– Experimental - Researchers assign subjects to treatment groups

• Clinical trials with patients being randomized to active drug or placebo. Typically double-blind (patient/assessor)

Page 25: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Observational Studies

• Case-Control -- Subjects are identified based on presence/absence of the outcome of interest (D.V.). It is then determined whether the subject had been exposed to risk factor (I.V.). Retrospective Studies.

• Cohort -- Subjects are identified by risk factor or treatment (I.V.) and followed over time to observe outcome (D.V.). Prospective Studies.

• Cross-sectional -- Subjects sampled at random from population and levels of both I.V. and D.V. are simultaneously observed. Many studies based on large medical databases are cross-sectional

Page 26: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Example - Case-Control Study

• Purpose: Study Risk Factors of Hepatitis-A in Hispanic Children living in U.S. on Mexican border (San Diego, CA)

• Cases: 132 Children with Hepatitis-A • Controls: 354 Children without Hepatitis-A• Risk Factors:

– Travel outside U.S. (67% of cases, 25% of cases)

– Eating food at taco stand/street vendor on travel

– Eating salad/lettuce on travel

Source: Weinberg, et al (2004)

Page 27: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Example - Cohort Study

• Purpose: Determine whether male adolescents who develop schizophrenia were more likely to smoke prior to onset

• Subjects: Israeli male military recruits, not suffering major psychopathology who complete smoking questionnaire

• Cohorts: 4052 smokers, 10196 non-smokers• Follow-up/outcome: 4-16 year follow-up for onset

of schizophrenia (20 smokers, 24 nonsmokers)

Source: Weiser, et al (2004)

Page 28: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Example - Cross-Sectional Study

• Purpose - Investigate effect of high altitude on maternal hemorrheology

• Subjects - Pregnant and non-pregnant women at high altitude and at sea level

• Measurements - Blood/Plasma viscosities, Hematocrit, total protein, Fibrinogen, Albumin

• Selected Findings - Blood and Plasma viscosities are higher in pregnant and non-pregnant women at higher altitudes

Source: Kametas, et al (2004)

Page 29: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Experimental Studies

• Randomized Clinical Trials - Studies where investigators assign subjects at random to treatments

• Special Cases (more than one may apply):– Parallel Groups - Each subject receives only one treatment

– Crossover - Each subject receives each trt (in random order)

– Placebo Controlled - One group receives only a placebo

– Double Blind - Subject nor assessor are aware of which trt

– Double Dummy - Subjects receive similar regimens wrt appearance, when different drugs look different

– Intention-to-Treat - Analysis is based on all subjects randomized, including those lost to follow-up

– Completed Protocol - Analysis based on only subjects who completed study

Page 30: Introduction Populations and Samples –Population - Set of all individuals or units of interest to investigators. Sometimes we may refer to a population.

Example - Randomized Clinical Trial• Purpose - Three treatments for primary dysmenorrhea in women

• Subjects - 337 women (18-40) suffering dysmenorrhea during past 3 consecutive menstrual cycles

• Treatments (Parallel Groups, double-blind, double-dummy)– Group 1: 1 tablet meloxicam 7.5mg o.a.d.

1 tablet placebo matching meloxicam 15mg o.a.d.

1 tablet placebo matching mefenamic acid 500mg t.i.d.

– Group 2: 1 tablet meloxicam 15mg o.a.d.

1 tablet placebo matching meloxicam 7.5mg o.a.d.

1 tablet placebo matching mefenamic acid 500mg t.i.d.

– Group 3: 1 tablet mefenamic acid 500mg t.i.d.

1 tablet placebo matching meloxicam 7.5mg o.a.d.

1 tablet placebo matching meloxicam 15.0mg o.a.d.

• Outcomes: Ordinal global assessment of safety/tolerability by patients and investigators (Good, Satisfactory, Not satisfactory, Bad)

Source: de Mello, et al (2004)