Top Banner
1 Diagnostic Testing Prof. Wei-Qing Chen MD PhD Department of Biostatistics and Epidemiology School of Public Health 87332199 [email protected]
90

05 diagnostic tests cwq

May 07, 2015

Download

Health & Medicine

Sumit Prajapati
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: 05 diagnostic tests cwq

1

Diagnostic Testing Prof. Wei-Qing Chen MD PhD Department of Biostatistics and Epidemiology School of Public Health 87332199 [email protected]

Page 2: 05 diagnostic tests cwq

2

OBJECTIVES OF LECTUREOBJECTIVES OF LECTURE

understand that making a diagnosis is not black & white

understand subclinical, preclinical disease

understand accuracy (validity) of a diagnostic test

understand the need for a ‘Gold Standard’

understand the indices by which accuracy is assessed

understand reliability of a diagnostic test

Page 3: 05 diagnostic tests cwq

3

Section A

Diagnosis and Diagnostic Testing

Page 4: 05 diagnostic tests cwq

4

Diagnosis

Diagnosis is “the process determining health status and the factors responsible for producing it. Separating a target disease from health/other

diseases Indicating that his outcome will be different

(die earlier, suffer more, develop complications)

Being indicative of treatment Changing the patterns following treatment

Page 5: 05 diagnostic tests cwq

5

Way of diagnosis

Collecting clinical data Symptoms: malaise, memory loss,

fatigue, anxiety, etc. Signs: pale skin, hyperactive, red in

face, etc. Special tests: cell count, x-ray, PB,

cholesterolemia

Page 6: 05 diagnostic tests cwq

6

Way of diagnosis

Assembling a diagnostic entity (category) Diagnosis based on one variable: made

by an “abnormal” value of a physiological function.

Hypertension: SBP160mmHg,or/and DBP 95mmHg,

Anaemia: haemoglobin is less than 12mg% in adult women.

Page 7: 05 diagnostic tests cwq

7

Way of diagnosis

Assembling a diagnostic entity (category) Diagnosis based on several variable:

made by an “abnormal” value of several physiological function.

Metabolism syndrome: Hypertension, overweight, obesity in abdomen, high blood lipid, high blood glucose.

Page 8: 05 diagnostic tests cwq

8

Diagnostic testing

A “diagnostic test” originally is meant a test performed in a laboratory.

In this chapter, including histories of disease, signs, symptoms, physical exams, special tests (x-ray, ECG, CT, cell counts, etc)

Page 9: 05 diagnostic tests cwq

9

Section B

Assessment on Diagnostic Testing

Page 10: 05 diagnostic tests cwq

10

Diagnostic Tests

Suppose a researcher had developed a new test for diagnosing the presence of disease A

The new test is half the price of the current test for the same disease and can be administered during a routine checkup, as opposed to a half day hospital stay

Page 11: 05 diagnostic tests cwq

11

Diagnostic Tests

From a cost benefits perspective, this new test sounds like a winner

However, before it becomes part of standard medical practice, it is important to evaluate the accuracy of this test compared to the existing technology

Page 12: 05 diagnostic tests cwq

12

Is the diagnostic test valid??? Goal: Evaluate the “accuracy” of

the new test

Page 13: 05 diagnostic tests cwq

13Continued

diagnose individuals for “disease” using both Gold standard (perfect) New diagnostic test

The way to evaluate a new diagnostic test

Page 14: 05 diagnostic tests cwq

14

Selection of Gold standard

A gold standard test is currently recognized as the most reliable test. Tissue diagnosis Radiological exam Autopsies Prolonged follow-up

Page 15: 05 diagnostic tests cwq

15

Selection of subjects

Target patients Patients diagnosed by “gold standard

test” Typical and untypical patients Patients in early, middle and later

period Light, middle and serious patients With and without complications

Page 16: 05 diagnostic tests cwq

16

Selection of subjects

Patients without target disease as control group

Healthy persons are not suitable for as control group

Page 17: 05 diagnostic tests cwq

17

Blindly comparing the results between the gold standard test and the new test

Page 18: 05 diagnostic tests cwq

18

GOLD STANDARD(The Truth)

Yes (+) No (–)

Yes (+)

No (–)

 

Creating a 2 x 2 Table

New

Test

True P False P

False N

True N

Page 19: 05 diagnostic tests cwq

19

Dichotomous model

DiseaseYes (D+) No (D-) Total

Positive (T+) a b a+bTest

Negative (T-) c d c+d

Total a+c b+d n

Test true from Dichotomization

Types of true

•True Positives = positive tests that are correct = a

•True Negatives = negative tests that are correct = d

Page 20: 05 diagnostic tests cwq

20

Dichotomous model

DiseaseYes (D+) No (D-) Total

Positive (T+) a b a+bTest

Negative (T-) c d c+d

Total a+c b+d n

Test Errors from Dichotomization

Types of errors

•False Positives = positive tests that are wrong = b

•False Negatives = negative tests that are wrong = c

Page 21: 05 diagnostic tests cwq

21

DIAGNOSTIC ACCURACY OF A TESTDIAGNOSTIC ACCURACY OF A TEST

DefinitionDefinitionThe extent to which the results of a diagnostic test reflect true disease status

TerminologyTerminologyAccuracy / Validity interchangeable

Page 22: 05 diagnostic tests cwq

22

Indicators for assessing diagnostic test

Page 23: 05 diagnostic tests cwq

23

Measures of diagnostic accuracyMeasures of diagnostic accuracy Sensitivity Specificity Predictive values

Measures of reliability / Measures of reliability / reproductivityreproductivity

Percent agreement

Page 24: 05 diagnostic tests cwq

24

The ability of the test to detect the presence of disease (i.e. abnormality)

The proportion of those with the disease who test positive (positive in disease, PID)

True PositivesTrue Positives + False Negatives

a / a+c

Sensitivity

Page 25: 05 diagnostic tests cwq

25

Developmental characteristics: test parameters

Sensitivity = Pr(T+|D+) = a/(a+c)

Sensitivity is PID (Positive In Disease)

DiseaseYes (D+) No (D-) Total

Positive (T+) a b a+bTest

Negative (T-) c d c+d

Total a+c b+d n

Page 26: 05 diagnostic tests cwq

26

The ability of the test to detect freedom from disease (i.e. normality)

The proportion of those without the disease who have a normal test (negative in health, NIH)

True NegativesTrue Negatives + False Positives

d / b+d

Specificity

Page 27: 05 diagnostic tests cwq

27

Developmental characteristics: test parameters

Specificity = Pr(T-|D-) = d/(b+d)

Specificity is NIH (Negative In Health)

DiseaseYes (D+) No (D-) Total

Positive (T+) a b a+bTest

Negative (T-) c d c+d

Total a+c b+d n

Page 28: 05 diagnostic tests cwq

28

Developmental characteristics: test parameters

Pr(T+|D-) = False Positive Rate (FP rate) =

b/(b+d)

DiseaseYes (D+) No (D-) Total

Positive (T+) a b a+bTest

Negative (T-) c d c+d

Total a+c b+d n

Page 29: 05 diagnostic tests cwq

29

Developmental characteristics: test parameters

Pr(T-|D+) = False Negative Rate (FN rate) =

c/(a+c)

DiseaseYes (D+) No (D-) Total

Positive (T+) a b a+bTest

Negative (T-) c d c+d

Total a+c b+d n

Page 30: 05 diagnostic tests cwq

30

Developmental characteristics: test parameters

Sensitivity = Pr(T+|D+) = 1 - FN rate Specificity = Pr(T-|D-) = 1 - FP rate

DiseaseYes (D+) No (D-) Total

Positive (T+) a b a+bTest

Negative (T-) c d c+d

Total a+c b+d n

Page 31: 05 diagnostic tests cwq

31

Example

Accuracy of an exercise test for diagnosing coronary artery disease Screen: A random sample of 1,442

patients with symptoms of coronary artery disease

Gold standard: Angiography New diagnostic test: Exercise tolerance

test (ECG)

Page 32: 05 diagnostic tests cwq

32

Resulting 2 x 2 Table

Coronary Artery Disease(based on angiography) + –

Exercise + 800115 915Tolerance test – 200327 527Test 1000442 1442

Source: Weiner (1979) NEJM

Page 33: 05 diagnostic tests cwq

33

80.1000

800

Continued

Sensitivity and Specificity

Sensitivity Proportion of those with disease who

are positive on the new diagnostic test

Page 34: 05 diagnostic tests cwq

34

74.442

327

Sensitivity and Specificity

Specificity Proportion of those without disease who

are negative on the new diagnostic test

Page 35: 05 diagnostic tests cwq

35

Positive and Negative Predictive Value

Positive Predictive Value : Of all the people who tested positive for a disease, the proportion that actually has it

Negative Predictive Value : Of all the people who tested negative for a disease, the proportion that actually does not have it

In these patients, what you know are their test results, from which you are trying to determine whether they actually have the disease.

Page 36: 05 diagnostic tests cwq

36

Positive Predictive Value

Referring back to the exercise tolerance test: We want to know the chances of having

coronary artery disease for someone who tests positive with the exercise tolerance test

Page 37: 05 diagnostic tests cwq

37

Resulting 2 x 2 Table

Coronary Artery Disease(based on angiography) + –

Exercise + 800115 915Tolerance test – 200327 527Test 1000442 1442

Source: Weiner (1979) NEJM

Page 38: 05 diagnostic tests cwq

38

874.915

800

Continued

Positive Predictive Value

Positive predictive value (PPV) = The proportion of all individuals with a

positive test who actually have the disease

Page 39: 05 diagnostic tests cwq

39Continued

Positive Predictive Value

Positive predictive value (PPV) = “Given that someone has a positive test

result, what are the chances this person has the disease?”

Page 40: 05 diagnostic tests cwq

40

Positive Predictive Value

This is not the same as sensitivity Sensitivity =

“Given that someone has the disease, what are the chances this person gets a positive result?”

Page 41: 05 diagnostic tests cwq

41Continued

Negative Predictive Value

Referring back to the exercise tolerance test: We want to know the chances of not

having coronary artery disease for someone who tests negative with the exercise tolerance test

Page 42: 05 diagnostic tests cwq

42

620.527

327

Negative Predictive Value

Negative predictive value (NPV) = The proportion of all individuals with a

negative test who do not have the disease

Page 43: 05 diagnostic tests cwq

43Continued

Negative Predictive Value

Negative predictive value (NPV) = “Given that someone has a negative

test result, what are the chances this person does not have the disease?”

Page 44: 05 diagnostic tests cwq

44

Negative Predictive Value

This is not the same as specificity Specificity =

“Given that someone does not have the disease, what are the chances this person gets a negative result?”

Page 45: 05 diagnostic tests cwq

45Continued

Notes on Interpretation

The positive predictive value is 88% and the negative predictive value is 62%

The sample was from a population of patients with symptoms of coronary artery disease

Page 46: 05 diagnostic tests cwq

46Continued

Notes on Interpretation

Interpretation If you have symptoms of coronary

disease and you have a positive exercise test, there is an 88% chance you have coronary artery disease

If you have a negative test result, there is a 62% you do not have coronary artery disease

Page 47: 05 diagnostic tests cwq

47

Notes on Interpretation

However, in an asymptomatic population the positive predictive value might be much lower

These estimates only apply to the population tested—the population of individuals with symptoms of coronary artery disease

Page 48: 05 diagnostic tests cwq

48Continued

Summary

Sensitivity and specificity do not depend on prevalence of a disease and can always be estimated in a diagnostic test

PPV and NPV do depend on the population prevalence of disease

Page 49: 05 diagnostic tests cwq

49Continued

Summary

If we start with a completely random sample, we can also estimate PPV and NPV for the population from which we have sampled

If we want to estimate PPV and NPV for a different population we will need more machinery

Page 50: 05 diagnostic tests cwq

50

Summary

If we over sample cases we will need more machinery to estimate PPV and NPV in a population with a given prevalence of disease

Page 51: 05 diagnostic tests cwq

51

Likelihood Ratios and Post-Test Disease Probability

Pre-test probability of disease

Pre-test odds of disease

Likelihood ratio

Post-test odds of disease

Post-test probability of disease

Page 52: 05 diagnostic tests cwq

52

Likelihood Ratio(LR)

An LR is the probability of a particular test result for a person with the disease of interest divided by the probability of that test result for a person without the disease of interest

Page 53: 05 diagnostic tests cwq

53

Clinical Interpretation: likelihood ratios

Likelihood ratio = Pr{test result|disease present}Pr{test result|disease absent}

LR+ = Pr{T+|D+}/Pr{T+|D-} = Sensitivity/(1-Specificity)=0.93/(1-0.92)=11.63

LR- = Pr{T-|D+}/Pr{T-|D-} = (1-Sensitivity)/Specificity=(1-0.93)/0.92=0.08

Page 54: 05 diagnostic tests cwq

54

Pretest probability of disease 0.13

Page 55: 05 diagnostic tests cwq

55

Pretest odds of disease

Pretest odds of disease are defined as the estimate before diagnostic testing of the probability that a patient has the disease of interest divided by the probability that the patient does not have the disease of interest.

Pretest odds=Pretest probability/(1- Pretest probability)

=0.13/(1-0.13=0.13/0.87=0.15

Page 56: 05 diagnostic tests cwq

56

Posttest odds of disease

Posttest odds of disease are defined as the estimate after diagnostic testing of the probability that a patient has the disease of interest divided by the probability that the patient does not have the disease of interest.

Posttest odds=Pretest probability LR+

=0.15 11.63=1.76

Page 57: 05 diagnostic tests cwq

57

Posttest probability

Posttest probability=Posttest odds/(1+ Posttest odds) =1.76/(1+1.76)=1.76/2.76=0.64

Page 58: 05 diagnostic tests cwq

58

Clinical interpretation of post-test probability

Don't treat for disease

Do further diagnostic

testingTreat for disease

Probability of disease:

0 1

Testing threshold

Treatment threshold

Disease ruled out

Disease ruled in

Page 59: 05 diagnostic tests cwq

59

Advantages of LRs

The higher or lower the LR, the higher or lower the post-test disease probability

Which test will result in the highest post-test probability in a given patient?

The test with the largest LR+ Which test will result in the lowest post-

test probability in a given patient? The test with the smallest LR-

Page 60: 05 diagnostic tests cwq

60

Advantages of LRs

Clear separation of test characteristics from disease probability.

Page 61: 05 diagnostic tests cwq

61

Likelihood Ratios - Advantage

Provide a measure of a test’s ability to rule in or rule out disease independent of disease probability

Test A LR+ > Test B LR+ Test A PV+ > Test B PV+ always!

Test A LR- < Test B LR- Test A PV- > Test B PV- always!

Page 62: 05 diagnostic tests cwq

62

Figure 1a: Likelihood Ratio Nomogram

Page 63: 05 diagnostic tests cwq

63

Figure 1b: Likelihood Ratio Nomogram

Page 64: 05 diagnostic tests cwq

64

Developmental characteristics: Cut-points and Receiver Operating Characteristic (ROC)

Healthy

Page 65: 05 diagnostic tests cwq

65

Developmental characteristics: Cut-points and Receiver Operating Characteristic (ROC)

Healthy Sick

Page 66: 05 diagnostic tests cwq

66

Developmental characteristics: Cut-points and Receiver Operating Characteristic (ROC)

Fals pos= 20% True pos=82%

Page 67: 05 diagnostic tests cwq

67

Developmental characteristics: Cut-points and Receiver Operating Characteristic (ROC)

F pos= 100% T pos=100%

Page 68: 05 diagnostic tests cwq

68

Developmental characteristics: Cut-points and Receiver Operating Characteristic (ROC)

Fals pos= 9% True pos=70%

Page 69: 05 diagnostic tests cwq

69

Developmental characteristics: Cut-points and Receiver Operating Characteristic (ROC)

T neg= 100% F neg=100%

Page 70: 05 diagnostic tests cwq

70

Developmental characteristics: Cut-points and Receiver Operating Characteristic (ROC)

“F pos + T pos “ is the highest

Page 71: 05 diagnostic tests cwq

71

Page 72: 05 diagnostic tests cwq

72

Developmental characteristics: Cut-points and Receiver Operating Characteristic (ROC)

Receiver Operating Characteristic (ROC)

Page 73: 05 diagnostic tests cwq

73

Developmental characteristics: Cut-points and Receiver Operating Characteristic (ROC)

Receiver Operating Characteristic (ROC)

Page 74: 05 diagnostic tests cwq

74

Receiver Operating Characteristic (ROC)

ROC Curve allows comparison of different tests for the same condition without (before) specifying a cut-off point.

The test with the largest AUC (Area under the curve) is the best.

Page 75: 05 diagnostic tests cwq

75

Page 76: 05 diagnostic tests cwq

76

Section C

Diagnostic strategies

Page 77: 05 diagnostic tests cwq

77

Combination tests: serial and parallel testing

Combinations of specificity and sensitivity superior to the use of any single test may sometimes be achieved by strategic uses of multiple tests. There are two usual ways of doing this.

Serial testing: Use >1 test in sequence, stopping at the first negative test. Diagnosis requires all tests to be positive.

Parallel testing: Use >1 test simultaneously, diagnosing if any test is positive.

Page 78: 05 diagnostic tests cwq

78

Combination tests: serial testing

Doing the tests sequentially, instead of together with the same decision rule, is a cost saving measure.

This strategy increases specificity above that of any of the individual

tests, but degrades sensitivity below that of any of them singly.

However, the sensitivity of the serial combination may still be higher than would be achievable if the cut-point of any single test were raised to achieve the same specificity as the serial combination.

Page 79: 05 diagnostic tests cwq

79

Combination tests: serial testing Demonstration: Serial Testing with Independent Tests

SeSC = sensitivity of serial combinationSpSC = specificity of serial combination

SeSC = Product of all sensitivities= Se1X Se2X…etc Hence SeSC < all individual Se

1-SpSC = Product of all(1-Sp)

Hence SpSC > all individual Spi

Serial test to rule-in disease

Page 80: 05 diagnostic tests cwq

80

Combination tests: parallel testing

Parallel Testing Usual decision strategy diagnoses if any test positive.

This strategy increases sensitivity above that of any of the individual

tests, but degrades specificity below that of any individual test.

However, the specificity of the combination may be higher than would be achievable if the cut-point of any single test were lowered to achieve the same sensitivity as the parallel combination.

Page 81: 05 diagnostic tests cwq

81

Combination tests: parallel testing Demonstration: Parallel Testing with Independent Tests

SePC = sensitivity of parallel combinationSpPC = specificity of parallel combination

1-SePC = Product of all(1 - Se)

Hence SePC > all individual Se

SpPC = Product of all Sp

Hence SpPC < all individual Spi

Parallel test to rule-out disease

Page 82: 05 diagnostic tests cwq

82

Clinical settings for parallel testing

Parallel testing is used to rule-out serious but treatable conditions (example rule-out MI by CPK, CPK-MB, Troponin, and EKG. Any positive is considered positive)

When a patient has non-specific symptoms, large list of possibilities (differential diagnosis). None of the possibilities has a high pretest probability. Negative test for each possibility is enough to rule it out. Any positive test is considered positive.

Page 83: 05 diagnostic tests cwq

83

Because specificity is low, further testing is now required (serial testing) to make a diagnosis (Sp P In).

Page 84: 05 diagnostic tests cwq

84

Clinical settings for serial testing

When treatment is hazardous (surgery, chemotherapy) we use serial testing to raise specificity.(Blood test followed by more tests, followed by imaging, followed by biopsy).

Page 85: 05 diagnostic tests cwq

85

Calculate sensitivity and specificity of parallel tests

(Serial tests in HIV CDC exercise) 2 tests in parallel 1st test sens = spec = 80% 2nd test sens = spec = 90% 1-Sensitivity of combination =

(1-0.8)X(1-0.9)=0.2X0.1=0.02 Sensitivity= 98% Specificity is 0.8 X 0.9 = 0.72

Page 86: 05 diagnostic tests cwq

86

Increasing the prevalence of disease

Referral process Selected demographic groups Specific of the clinical situation

Page 87: 05 diagnostic tests cwq

87

Effect of prevalence on predictive value(Se=70%, Sp=90%)

SettingPrevalence

(case/100,000)PPV(%)

General population 35 0.4

Men, age 75or greater

500 5.6

Clinical suspicious prostatic nodule

50,000 93.0

Page 88: 05 diagnostic tests cwq

88

Lead Time

1990 1997 2000

death

Diagnosisand

treatment

Biologiconset ofdisease

1990 1994 2000

death

Biologiconset ofdisease

Screening:diagnosis &treatment

Page 89: 05 diagnostic tests cwq

89

Length Bias

1995 2000

death

Biologiconset ofdisease

1989 1994

death

Biologiconset ofdisease

Screening:diagnosis &treatment

2002

1994

Screening:

Page 90: 05 diagnostic tests cwq

90