Ida Sim, MD, PhD Associate Professor of Medicine
Post on 02-Jan-2016
39 Views
Preview:
DESCRIPTION
Transcript
Copyright 2007, Ida Sim
Ida Sim, MD, PhDAssociate Professor of Medicine
Director, Center for Clinical and Translational InformaticsUniversity of California, San Francisco, CA
Supported by The Trial Bank Project R01-LM-06780
National Center for Biomedical Ontology U54 HG004028-01
RCT SchemaThe Trial Bank Project
Copyright 2007, Ida Sim
Outline
• Background• RCT Schema
» modeling approach» the class structure» evaluation
• Relationship to CTO • Summary
Copyright 2007, Ida Sim
Goals of the CTO
• (1) fully and faithfully capture the types of entities and relationships involved in clinical trials
• (2) comprehend terms like: cohort, randomization, placebo, etc., including ... statistical terms and terms for ... meta-analysis;
• (3) organize these terms in a structured way, providing definitions and logical relations designed to enhance retrieval of, reasoning with, and integration of the data annotated in its terms
• ... • (6) draw on and seek maximal alignment with existing clinical
trial ontologies, including:» RCT Schema ontology used by theTrial Bank Project
Copyright 2007, Ida Sim
Major Axes for Aligning Models• Domain of clinical trials
» design: (non-)randomized, crossover, cluster randomized, factorial...» objective: interventional, diagnostic, preventive...» clinical domain: drugs, procedures, organizational change...
• Task» trial design, execution, reporting, analysis, application» for individual trials, sets of clinically related trials
• Purpose (application vs. domain ontology)» to support accomplishment of domain task(s)» to define shared meaning for “integration of the data annotated in its
terms”
Copyright 2007, Ida Sim
Systematic Review
Decision Models
Guidelines Electronic Patient Record
Trial Execution
Trial Interpretation
Trial Application
Regulatory Reporting Trial Conduct Trial Design
Feedback to Trial Design
Trial Bank
Scientific Reporting Trial Registration
Trial Tasks
Copyright 2007, Ida Sim
Trial Bank Definition
• Computable repository of RCT information sufficiently detailed to support scientific analysis for » designing future clinical trials» evidence-based practice and policy making
• Detailed information on » study design» study execution» summary and individual participant-level results
• Trial Bank is NOT for running a trial
Copyright 2007, Ida Sim
Systematic Review
Decision Models
Guidelines Electronic Patient Record
Trial Execution
Trial Interpretation
Trial Application
Regulatory Reporting Trial Conduct Trial Design
Feedback to Trial Design
Trial Bank
Scientific Reporting Trial Registration
Trial Bank Target Uses
Copyright 2007, Ida Sim
Trial Bank Software• RCT Bank built on RCT Schema
» Ocelot frame-based ontology
• Bank-a-Trial» web-based program for trialists to enter trial instances into RCT Bank» clinical descriptions of trial features (slot values) are in UMLS
• RCT Presenter» web-based browser of individual trials
Bank-a-Trial RCT Presenter
RCT Bank
RCT Schema
Copyright 2007, Ida Sim
Major Axes for RCT Schema• Domain of clinical trials
» design: (non-)randomized, crossover, cluster randomized, factorial...» objective: interventional, diagnostic, preventive...» clinical domain: drugs, procedures, organizational change...
• Task» trial design, execution, reporting, analysis, application» for individual trials, sets of clinically related trials
• Purpose» to support accomplishment of domain task(s) [application ontology]
» to define shared meaning for “integration of the data annotated in its terms”
Copyright 2007, Ida Sim
Outline
• Background• RCT Schema
» modeling approach» the class structure» evaluation
• Relationship to CTO • Summary
Copyright 2007, Ida Sim
Entity Specification Problem
• What RCT aspects to model in RCT Schema? What not to model?» multiple users (e.g., trialists, systematic reviewers)» multiple tasks (e.g., analysis, interpretation)
– multiple methods» no one correct RCT ontology
• Need principled, systematic approach» to specifying, documenting, and evaluating
Copyright 2007, Ida Sim
Competency Decomposition Method• To define the entities that must be in a conceptual
model• General approach
» specify a task hierarchy of target tasks and subtasks» specify methods for each task» specify entities required for completing each task
using each method• Generates a specification of required entities
» the information requirements for the competencies (tasks and subtasks) that the model/knowledge base is to support
(Sim, et al, JBI 2004: 37(2):108-119)
Copyright 2007, Ida Sim
Systematic Review
Decision Models
Guidelines Electronic Patient Record
Trial Execution
Trial Interpretation
Trial Application
Regulatory Reporting Trial Conduct Trial Design
Feedback to Trial Design
RCT Bank
Scientific Reporting Trial Registration
Target Task for RCT Bank• Using RCTs for trial design or clinical application requires synthesizing evidence across all trials on a topic
Copyright 2007, Ida Sim
Systematic Reviewing
• Canonical method for synthesizing evidence across trials
• Major steps are» retrieve related RCTs (e.g., 32 trials of metformin for
diabetes)» analyze how comparable the trials are» statistically combine data if appropriate
– combining smaller trials increases statistical power to detect effects
Copyright 2007, Ida Sim
Target Task = Systematic Review
• RCT Bank entities = sys. review information needs» identified all systematic review tasks
– review of literature and personal experience conducting 3 systematic reviews
» identified methods for completing these tasks» organized tasks and methods into a task hierarchy» derived RCT entities necessary and sufficient for each
(sub)task
Copyright 2007, Ida Sim
Top-Level Sys Review Tasks• Trial retrieval• Trial critiquing
» judging internal validity » judging generalizability
• Meta-analysis of quantitative results» analysis of clinical and statistical heterogeneity
• Contextual interpretation» scientific, socio-economic, and ethical
http://rctbank.ucsf.edu/tasks/tasks.html
Copyright 2007, Ida Sim
Judgment of Generalizability
• Were the people enrolled in the trial representative and unbiased?» were eligible patients randomly selected from the source
population?» were enrolled subjects a random subset of those
eligible?
• Are the trial subjects similar to mine? • Do I have the tested intervention available here?
Copyright 2007, Ida Sim
Method-(In)dependent Entity Specification• Were the people enrolled in the trial representative
and unbiased?» were eligible patients randomly selected from the
source population?– method: no computable algorithm available
recruitment method» were enrolled patients a random subset of those
eligible?– method: using standard statistics
number and clinical characteristics of enrolled subjects number and clinical characteristics of eligible but non-
enrolled subjects
Copyright 2007, Ida Sim
Results: Entity Requirements
Methods
High-levelTasks
SubTask I
SubTask II
SubTask III
Critiquing
ExternalValidity
13
4
. . .
. . .
InternalValidity
11
39. . .
. . .
QuantitativeSynthesis
4
7
. . .
. . .
Retrieval
1
1
2
2
ContextualInterpretation
3. . . . .
9. . .
2
1
. . .
22. . .
112 29 30171 Unique Entities Required
(n=35)
(n=74)
Copyright 2007, Ida Sim
Entity Specification Evaluation
• Evaluated match between » the 171 information items» 388 requirements in 18 published trial-critiquing
instruments
• Results» entity specification is comprehensive» entity specification is reasonable
Sim, et al, KR-MED 2004; JBI 2004: 37(2):108-119
Copyright 2007, Ida Sim
Benefits of Approach• Task hierarchy understandable by domain experts
» identifies which information items are required for which tasks
• Provides an evaluation “yardstick”» if an ontology contains all the information requirements for a
task– then is it “competent” for that task
» can evaluate and compare application ontologies
• Documents an (application/domain) ontology» states which tasks an ontology is competent for, and why» cross-indexes tasks and entities in the ontology
Copyright 2007, Ida Sim
Outline
• Background• RCT Schema
» modeling approach» the class structure» evaluation
• Relationship to CTO • Summary
Copyright 2007, Ida Sim
Implementation of 171 Items• Purpose of RCT Bank KB is to support scientific analysis of
trial evidence» needed a “data-schema” or “instance-style” ontology
• RCT Schema implemented the 171 information items in a frame-based ontology
Bank-a-Trial RCT Presenter
RCT Bank
RCT Schema
Copyright 2007, Ida Sim
RCT Schema Ontology
• Ocelot frame-based model» 7 levels deep
• 192 frames, 607 unique slots» avg. 9.8 slots/frame» 3 frames (1.6%) have multiple parents» 193/607 slots (32%) take other frames as values
• Available at http://rctbank.ucsf.edu/
• RCT Schema displayed in GKB Editor» classes are red boxes» instances are blue boxes
• Class hierarchy organizes entities as » trial concepts» trial descriptions (details)
• Not fully compliant with “Werner’s Rules”
Copyright 2007, Ida Sim
Includes IS-A Hierarchies
Copyright 2007, Ida Sim
Instantiating RCT Schema
• Clinical content described by terms from a clinical vocabulary, e.g., » for a breast cancer trial, instance of BASELINE-
CHARACTERISTIC is described by– term “menopause” from UMLS preferred term– the UMLS CUI
» Trial Bank software supports any vocabulary in UMLS (e.g., SNOMED)
• Each trial is a collection of instances of classes
• 518-TRIAL» 518-
BACKGROUND-DETAILS
» 518-ADMIN-DETAILS
» 518-EXECUTED-PROTOCOL– 518-ALL-SUBJECTS– 518-PRIMARY-
OUTCOME-1– etc.
» 518-ERRATUM» 518-CONCLUSION-
DETAILS
• 518-PRIMARY-OUTCOME-1 (e.g., all-cause mortality)» 518-STAT-ANALYSIS-AND-RESULTS-1 (e.g. t-test)
–518-ALL-COMPARISONS-AT-TIME-X-1 (e.g., at 6 months) 518-SINGLE-TIME-X-COMPARISON-1 (e.g., between
PCI and thrombolysis groups)
• 518-SINGLE-TIME-X-COMPARISON-1 » datapoint for PCI group
– numerator (all-cause deaths at 6 months)– denominator (had all-cause death outcome assessed at 6 months)– 518-STUDY-ARM-POPULATION-1 (the PCI group)
» datapoint for thrombolysis group» summary odds ratio under intention-to-treat analysis» summary odds ratio under efficacy analysis
Copyright 2007, Ida Sim
Outline
• Background• RCT Schema
» modeling approach» the class structure» evaluation
• Relationship to CTO• Summary
Copyright 2007, Ida Sim
Characteristic Examples
Clinical domains Cardiology, Radiology, Geriatrics, etc. Intervention Types
Procedures (thrombolysis), Single and Multi-step Drugs (aspirin, warfarin), Counseling, Multiple interventions in one arm
Outcome Types Dichotomous, continuous, univariate, multivariate, survival, regression, scored instruments (e.g., Wechsel Memory Scale)
Result Types Intention-to-treat, efficacy analysis, subgroup analyses
Expressivity Evaluation
• Captured 17 full and 20+ partial trials
Copyright 2007, Ida Sim
Modeling Challenges Met
• Multi-armed, crossover, cluster randomized studies• Many variations of patient drop-out, loss to followup
(e.g., excluded after randomization)• For each outcome, the # of subjects assessed at each
timepoint in each subgroup• Blinding efficacy: did subjects know which arm they
were assigned to?• etc.
Copyright 2007, Ida Sim
Modular, Extensible• Extensions possible with only minimal changes to 192
existing classes
• Extensible to new clinical domains (e.g., genomics) via clinical vocabularies» no clinical terms in RCT Schema
Modeling Extension
New Classes
New Slots
Old Classes Changed
Cluster randomized trials 2 10 3
Complex intervention regimens 4 8 3
Computable eligibilty rules ? ? 3
Copyright 2007, Ida Sim
Limitations of Representation• Modeled, not yet tested
» participant-level data» factorial designs» designs with run-in and washout periods
• In development» computable eligibility rules
• Not yet modeled» genomic data » nested subgroups» secondary studies (e.g,. followup studies)
Copyright 2007, Ida Sim
Trial Bank Publishing
• How to get trials into RCT Bank?• Collaborated with JAMA and Ann Int Med to explore co-
publishing trials as articles and RCT Bank entries» authors submit manuscripts for peer review as usual » trial-bank staff enter accepted trials into trial bank» co-published 14 trials in RCT Presenter
• Evaluation» 83 respondents evaluated a trial using both RCT
Presenter and the Journal Article– mostly trialists and meta-analysts
Copyright 2007, Ida Sim
RCT Presenter Evaluation
85 8165
85
15
73
42 46 42 50
12 1227
12
65
23
3139 50 39
0102030405060708090
100
% R
esp
ondents
Presenter Better Same as Article
• 70% of respondents rated RCT Presenter as good as or better than the Journal Article for all attributes
N=30
Copyright 2007, Ida Sim
Outline
• Background• RCT Schema
» modeling approach» the class structure» evaluation
• Relationship to CTO • Summary
Copyright 2007, Ida Sim
Alignment to CTO
• Have participated in CTO working group from inception (Simona Carini)
• Contributed RCT Schema classes and definitions to list of terms for consideration
• Contributed to draft of high-level concept hierarchy– http://www.bioontology.org/wiki/index.php/High-
level_Concepts_v0.2
• Contributed to creation of the draft CTO presented this morning
Copyright 2007, Ida Sim
How Would Trial Bank Use CTO?• Use CTO as common index into RCT Bank and into
other clinical trial data and information systems • Map RCT Schema class and slot names to terms in
CTO» so that RCT Bank instances can be made available to
machines and humans who wish to – query– reason, or – integrate
» clinical trial information using CTO» e.g., trials co-published with PLoS, etc. into RCT
Bank
Copyright 2007, Ida Sim
Example Use (NCBO)
• Map class names from any trial bank to terms in CTO» e.g., in RCT Bank or “European Trial Bank”
– PRIMARY-OUTCOME to CTO term for this– BASELINE-CHARACTERISTIC to CTO term for this
• CTeXplorer an ontology-driven tool for visualizing complex design differences across trials [MA Storey, et al UVic, Canada]
» given variable HgbA1C from any trial bank» would know how to handle and display if annotated as
– PRIMARY-OUTCOME [CTO] or – BASELINE-CHARACTERISTIC [CTO]
Copyright 2007, Ida Sim
CTO for Integration with Trial Bank Collaborators...• National Center for Biomedical Ontology• Electronic Primary Care Research Network
» Primary Care Research Object Model• Global Trial Bank, with AMIA
» trial-bank publishing» “dedicated to assuring the implementation and maintenance
of an open global infrastructure for computable clinical trial results information”
• European Clinical Trial Data Repository» submitted to Framework Programme 7
• with BRIDG, clinical trial management systems, etc?
Copyright 2007, Ida Sim
Summary• Trial banks are computable repositories of trial information
for analysis, interpretation, and application of trial evidence to research and care
• We specified the entities for RCT Bank using competency decomposition method
• RCT Schema implemented entities specification as an “instance-style” frame-based ontology» expressive, extensible, useful for reporting and interpretation
• Mapping of RCT Schema terms to CTO terms makes RCT Bank instances available for CTO-driven integration and reasoning
top related